§ insights · audit telemetry
What we've learned from auditing the web.
Aggregated AI-readiness signals from 499 unique websites. Refreshed 2026-05-08.
§ score distribution
How do websites score on AI readiness?
Mean Score
74.2
Median Score
74
Std Deviation
±9.1
§ industry breakdown
Which industries score the highest?
§ check compliance
Which checks do websites pass most often?
Pass rate per check across all benchmarked sites. Higher is better.
Fetch, Render, and URL Integrity
Page fetch returns 200
n=494
Response time < 3 s
n=494
HTML size under 2 MB
n=343
Page served over HTTPS
n=343
No excessive redirect chain
n=343
Response is HTML (text/html)
n=343
Page served over HTTPS
n=151
HTML size reasonable (< 512 KB)
n=151
Redirect chain ≤ 1 hop
n=151
Bot Access & Control Plane
ai.txt available and well-formed
n=499
llms.txt available and well-formed
n=499
Sitemap looks like XML (not HTML)
n=499
AI crawlers not blocked
n=499
Sitemap declared in robots.txt
n=499
Sitemap available
n=499
robots.txt available
n=488
Meta robots does not block AI
n=176
Meta robots is not noindex
n=176
Current page found in sitemap
n=36
X-Robots-Tag does not block AI
n=9
X-Robots-Tag is not noindex
n=9
Structured Data
FAQPage schema present
n=499
Organization schema present
n=499
Person schema present
n=499
Article schema present
n=499
Product schema present
n=499
WebPage schema present
n=499
WebSite schema present
n=499
BreadcrumbList schema present
n=499
JSON-LD present
n=499
JSON-LD syntax is valid
n=494
JSON-LD schemas have required properties
n=71
HTML Extractability & Main Content Clarity
Exactly one H1
n=499
OpenGraph basics present
n=499
HTML lang attribute present
n=499
Title tag present
n=499
Favicon link present
n=499
Sufficient on-page text
n=499
Title and H1 aligned
n=499
Viewport meta present
n=499
Canonical URL present
n=499
Meta description present
n=499
Title length reasonable
n=499
Text-to-HTML ratio reasonable
n=499
Meta description length reasonable
n=499
Canonical URL matches site origin
n=499
Images have alt text
n=494
Viewport configured for mobile devices
n=151
Canonical URL matches the current page URL
n=108
Entity Clarity
Trust & Security
About/Company link present
n=499
Press / featured-in signals
n=499
Terms link present
n=499
Contact link present
n=499
Privacy policy link present
n=499
Testimonials / case studies signals
n=499
RSS or Atom feed discoverable
n=494
Internal link density supports entity graph crawling
n=494
Security headers present (HSTS, CSP, X-Frame-Options, X-Content-Type)
n=494
Hreflang tags for multilingual targeting
n=171
SearchAction schema present (sitelinks search box)
n=97
Speakable schema markup present
n=5
Content Freshness & Authority
Last-modified date present (schema or meta)
n=494
Author or byline present (schema, meta, or visible)
n=494
Publication date present (schema or meta)
n=494
Visible <time> element with datetime attribute
n=494
FAQPage schema present for direct answers
n=17
Article or BlogPosting schema present
n=10
Content Answerability
Content includes structured elements (lists, tables)
n=494
Content includes unique data points (numbers, percentages, prices)
n=494
Content sections have sufficient depth (≥ 40 words median)
n=483
Headings use question phrasing (who, what, how…)
n=412
Content uses definition patterns ("X is…", "refers to…")
n=214
FAQ or expandable Q&A section on page
n=107