Web Audit Insights
Data from 420 unique websites, updated 2026-04-04.
Score Distribution
Mean Score: 74.2
Median Score: 75
Std Deviation: ±9.1
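As a rough illustration of how summary figures like these can be derived from per-site scores, here is a minimal sketch. The `summarize` helper and the five example scores are hypothetical, and the source does not say whether the reported deviation is a population or a sample figure; the population form is assumed here.

```python
from statistics import mean, median, pstdev


def summarize(scores):
    """Return (mean, median, population std dev), each rounded to 1 decimal.

    Assumption: population standard deviation; the dashboard does not
    state which variant it reports.
    """
    return (
        round(mean(scores), 1),
        round(median(scores), 1),
        round(pstdev(scores), 1),
    )


# Hypothetical scores for five sites; not taken from the benchmark data.
example_scores = [60, 70, 75, 80, 90]
summary = summarize(example_scores)
print(summary)
```

Swapping `pstdev` for `statistics.stdev` would give the sample standard deviation instead, which is slightly larger for small samples.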
Average Score by Industry
Check Compliance Rates
How often each audit check passes across all benchmarked sites.
Check | Compliance | Sample
Fetch, Render, and URL Integrity
Page fetch returns 200 | 99% | n=413
HTML size under 2 MB | 69.5% | n=413
Response time under 3 seconds | 93.7% | n=413
Page served over HTTPS | 99.8% | n=413
No excessive redirect chain | 84.7% | n=413
Response is HTML (text/html) | 100% | n=413
Bot Access & Control Plane
ai.txt available | 17.9% | n=420
llms.txt available | 32.4% | n=420
Sitemap looks like XML (not HTML) | 77.4% | n=420
AI crawlers not blocked | 78.6% | n=420
robots.txt available | 95.2% | n=420
Sitemap declared in robots.txt | 74% | n=420
Sitemap available | 77.4% | n=420
Meta robots does not block AI | 98.6% | n=145
Meta robots is not noindex | 95.2% | n=145
X-Robots-Tag does not block AI | 87.5% | n=8
X-Robots-Tag is not noindex | 100% | n=8
Structured Data
FAQPage schema present | 3.6% | n=420
Organization schema present | 28.6% | n=420
Person schema present | 0.7% | n=420
Article schema present | 1.9% | n=420
Product schema present | 0.7% | n=420
WebPage schema present | 14.3% | n=420
WebSite schema present | 27.4% | n=420
BreadcrumbList schema present | 6.4% | n=420
JSON-LD present | 50.7% | n=420
JSON-LD syntax is valid | 50.6% | n=413
HTML Extractability & Main Content Clarity
Exactly one H1 | 57.1% | n=420
OpenGraph basics present | 61.7% | n=420
HTML lang attribute present | 91.7% | n=420
Title tag present | 95.7% | n=420
Favicon link present | 88.3% | n=420
Sufficient on-page text | 89.3% | n=420
Title and H1 aligned | 13.3% | n=420
Viewport meta present | 94.5% | n=420
Canonical URL present | 71% | n=420
Meta description present | 78.8% | n=420
Title length reasonable | 65.2% | n=420
Text-to-HTML ratio reasonable | 26.7% | n=420
Meta description length reasonable | 54.8% | n=420
Canonical URL matches site origin | 14.8% | n=420
Images have alt text | 39.5% | n=413
Entity Clarity
Organization or LocalBusiness schema present | 28.8% | n=420
Social profile links present | 66.7% | n=420
Brand/entity name appears in H1 | 16% | n=420
Brand/entity name appears in title | 75.2% | n=420
Trust & Security
About/Company link present | 68.8% | n=420
Press / featured-in signals | 46.4% | n=420
Terms link present | 49.8% | n=420
Contact link present | 54.5% | n=420
Privacy policy link present | 71.7% | n=420
Testimonials / case studies signals | 12.1% | n=420
RSS or Atom feed discoverable | 14.3% | n=413
Internal link density supports entity graph crawling | 66.6% | n=413
Security headers present (HSTS, CSP, X-Frame-Options, X-Content-Type) | 56.2% | n=413
Hreflang tags for multilingual targeting | 100% | n=145
SearchAction schema present (sitelinks search box) | 100% | n=78
Speakable schema markup present | 100% | n=4
Content Freshness & Authority
Last-modified date present (schema or meta) | 9.4% | n=413
Author or byline present (schema, meta, or visible) | 17.7% | n=413
Publication date present (schema or meta) | 9% | n=413
Visible <time> element with datetime attribute | 7.3% | n=413
FAQPage schema present for direct answers | 100% | n=14
Article or BlogPosting schema present | 100% | n=8
Content Answerability
Content includes structured elements (lists, tables) | 75.3% | n=413
Content includes unique data points (numbers, percentages, prices) | 56.9% | n=413
Content sections have sufficient depth (≥ 40 words median) | 92% | n=401
Headings use question phrasing (who, what, how…) | 4.3% | n=349
Content uses definition patterns ("X is…", "refers to…") | 100% | n=183
FAQ or expandable Q&A section on page | 15.9% | n=88
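The compliance figures above read as pass rates over each check's own sample size (n). One plausible reading, which the source does not confirm, is that a smaller n means the check was only evaluated on sites where it applied. The sketch below shows that arithmetic under this assumption; `compliance_rate` and the example results are hypothetical, with `None` standing for "check not applicable".

```python
from typing import Optional, Sequence, Tuple


def compliance_rate(
    results: Sequence[Optional[bool]],
) -> Tuple[Optional[float], int]:
    """Return (pass rate in percent, n) over applicable results only.

    Assumption: None marks a site where the check did not apply, so it
    is excluded from both the numerator and the sample size n.
    """
    applicable = [r for r in results if r is not None]
    n = len(applicable)
    if n == 0:
        return None, 0  # no applicable sites, so no rate can be reported
    passed = sum(1 for r in applicable if r)
    return round(100 * passed / n, 1), n


# Hypothetical per-site results: 7 passes, 1 failure, 2 not applicable.
example = [True] * 7 + [False] + [None] * 2
rate, n = compliance_rate(example)
print(f"{rate}% n={n}")
```

Seven passes out of eight applicable sites gives 87.5%, the same arithmetic as a row reported as 87.5% with n=8. Rates of 100% on very small samples (for example n=4 or n=8) are worth reading with that denominator in mind.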