AI Readiness Report

theguardian.com

How visible theguardian.com is to AI engines like ChatGPT, Perplexity, Gemini, and Claude, based on a technical scan of crawler access, structured data, and citation signals.

Summary

theguardian.com has an AI Readiness score of 40/100, placing it at Maturity Level 0: Invisible. The site was scanned on April 27, 2026 across 7 technical signals that influence whether AI engines like ChatGPT, Perplexity, Gemini, and Claude can discover, parse, and cite content from theguardian.com.

Final URL: https://www.theguardian.com/international
Scanned: 2026-04-27
Tranco rank: #187

Signal breakdown

Content-level signals were checked on: https://www.theguardian.com/news/gallery/2026/apr/24/wild-horses-marathon-runne…

AI crawlers allowed

0/20 pts

Blocks one or more AI crawlers in robots.txt. AI engines cannot index pages they cannot fetch.

Fix: Remove Disallow rules for AI user-agents in robots.txt, or add explicit Allow lines.
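A minimal sketch of how such a check could be reproduced locally, using Python's standard `urllib.robotparser`. The robots.txt text and the list of AI user-agent tokens below are illustrative assumptions, not the real file from theguardian.com and not necessarily the list the scan uses.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content for illustration only.
SAMPLE_ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""

# Assumed set of AI crawler tokens to test; adjust to the crawlers you care about.
AI_USER_AGENTS = ["GPTBot", "ClaudeBot", "PerplexityBot", "Google-Extended"]


def blocked_ai_agents(robots_txt: str, url: str = "https://example.com/") -> list[str]:
    """Return the AI user-agents that the given robots.txt forbids from fetching url."""
    parser = RobotFileParser()
    # parse() accepts an iterable of lines, so no network request is needed.
    parser.parse(robots_txt.splitlines())
    return [ua for ua in AI_USER_AGENTS if not parser.can_fetch(ua, url)]


print(blocked_ai_agents(SAMPLE_ROBOTS_TXT))
```

An empty result would mean none of the listed agents is blocked for that URL; any name in the list points to a Disallow rule worth reviewing.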

llms.txt present

0/10 pts

No /llms.txt file. AI engines have no curated guide to the site's purpose and key pages.

Fix: Publish /llms.txt with a short site description and links to your most important pages.
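As a rough illustration, the sketch below generates a small llms.txt body in the commonly proposed layout (a title heading, a one-line summary as a blockquote, then a list of links). The site name, URLs, and the `build_llms_txt` helper are hypothetical placeholders.

```python
def build_llms_txt(title: str, summary: str, links: list[tuple[str, str, str]]) -> str:
    """Build llms.txt text from a title, a short summary, and (name, url, note) links."""
    lines = [f"# {title}", "", f"> {summary}", "", "## Key pages", ""]
    for name, url, note in links:
        lines.append(f"- [{name}]({url}): {note}")
    return "\n".join(lines) + "\n"


# Hypothetical site details; replace with your own description and pages.
llms_txt = build_llms_txt(
    title="Example News",
    summary="Independent news coverage, analysis, and opinion.",
    links=[
        ("World news", "https://example.com/world", "Latest international reporting"),
        ("About us", "https://example.com/about", "Ownership and editorial standards"),
    ],
)
print(llms_txt)
```

The resulting text would be served as a plain-text file at the /llms.txt path of the site.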

Organization schema

0/15 pts

Homepage lacks JSON-LD Organization markup. AI engines cannot identify the entity behind the site.

Fix: Add JSON-LD Organization with name, url, logo, and sameAs links to social profiles.
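One way to produce this markup is sketched below: a small Python helper that builds the Organization object with the four fields named in the fix and wraps it in a JSON-LD script tag for the homepage. All values are placeholders, and the helper names are hypothetical.

```python
import json


def organization_jsonld(name: str, url: str, logo: str, same_as: list[str]) -> dict:
    """Build a schema.org Organization object with name, url, logo, and sameAs."""
    return {
        "@context": "https://schema.org",
        "@type": "Organization",
        "name": name,
        "url": url,
        "logo": logo,
        "sameAs": list(same_as),
    }


def as_script_tag(data: dict) -> str:
    """Wrap a JSON-LD object in the script tag that goes into the page head."""
    return '<script type="application/ld+json">\n' + json.dumps(data, indent=2) + "\n</script>"


# Placeholder values; substitute the real organization details.
org = organization_jsonld(
    name="Example News",
    url="https://example.com/",
    logo="https://example.com/logo.png",
    same_as=["https://social.example/examplenews"],
)
print(as_script_tag(org))
```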

Article schema

15/15 pts

Article, BlogPosting, or NewsArticle markup is present on content pages.

FAQPage schema

0/15 pts

No FAQPage schema detected on homepage, content pages, or /faq endpoints.

Fix: Add a FAQ section with FAQPage JSON-LD. This is one of the most AI-cited schema types.
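For illustration, FAQPage markup can be generated from a list of question and answer pairs, as in the sketch below. The question text is a placeholder and `faq_jsonld` is a hypothetical helper; the nesting (FAQPage, then Question entries with an acceptedAnswer) follows the schema.org vocabulary.

```python
import json


def faq_jsonld(pairs: list[tuple[str, str]]) -> dict:
    """Build a schema.org FAQPage object from (question, answer) pairs."""
    return {
        "@context": "https://schema.org",
        "@type": "FAQPage",
        "mainEntity": [
            {
                "@type": "Question",
                "name": question,
                "acceptedAnswer": {"@type": "Answer", "text": answer},
            }
            for question, answer in pairs
        ],
    }


# Placeholder FAQ content; the visible page should show the same questions and answers.
faq = faq_jsonld([
    ("How do I contact the newsroom?", "Use the contact form on the About page."),
])
print(json.dumps(faq, indent=2))
```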

Author Person schema

10/10 pts

JSON-LD Person schema with sameAs or url to verifiable identity is present.

Sitemap in robots.txt

15/15 pts

A Sitemap: directive in robots.txt points to the sitemap.xml file.
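A sketch of how this signal could be verified with Python's standard library (3.8 or later), which exposes Sitemap entries through `RobotFileParser.site_maps()`. The robots.txt text here is a made-up example, and `declared_sitemaps` is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content for illustration only.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Allow: /

Sitemap: https://example.com/sitemap.xml
"""


def declared_sitemaps(robots_txt: str) -> list[str]:
    """Return the sitemap URLs declared in robots.txt, or an empty list if none."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    # site_maps() returns None when no Sitemap line is present.
    return parser.site_maps() or []


print(declared_sitemaps(SAMPLE_ROBOTS_TXT))
```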

See if AI engines actually mention theguardian.com

The score above measures technical readiness. The next step is measuring whether ChatGPT, Perplexity, Gemini, and Claude currently mention theguardian.com for the queries that matter. The free AI Visibility Audit runs live queries against the engines and reports back.


Frequently asked questions

What is theguardian.com's AI Readiness score?

theguardian.com scored 40 out of 100 on the Appearly AI Readiness scan, placing it at Maturity Level 0 (Invisible).

Does theguardian.com allow AI crawlers?

No. The site's robots.txt blocks one or more AI crawlers, and AI engines cannot index pages they cannot fetch. To fix this, remove the Disallow rules for AI user-agents in robots.txt, or add explicit Allow lines.

Does theguardian.com have an llms.txt file?

No. There is no /llms.txt file, so AI engines have no curated guide to the site's purpose and key pages. To fix this, publish /llms.txt with a short site description and links to the most important pages.

Does theguardian.com have Organization schema?

No. The homepage lacks JSON-LD Organization markup, so AI engines cannot identify the entity behind the site. To fix this, add JSON-LD Organization markup with name, url, logo, and sameAs links to social profiles.

Does theguardian.com have Article schema?

Yes. Article, BlogPosting, or NewsArticle markup is present on content pages.

Does theguardian.com have FAQPage schema?

No. No FAQPage schema was detected on the homepage, content pages, or /faq endpoints. To fix this, add a FAQ section with FAQPage JSON-LD. This is one of the most AI-cited schema types.

Does theguardian.com have author Person schema?

Yes. JSON-LD Person schema with sameAs or url to verifiable identity is present.

Does theguardian.com list a sitemap in robots.txt?

Yes. A Sitemap: directive in robots.txt points to the sitemap.xml file.

Methodology & sources

Appearly scans up to 8 pages per domain (homepage, robots.txt, /llms.txt, sitemap, and a discovered content page) to evaluate 7 signals weighted from 10 to 20 points each. The maturity model maps total score to one of 5 levels: Invisible (0), Discoverable (1), Indexable (2), Retrievable (3), Cited (4).
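The arithmetic behind the total can be checked with a short sketch that sums the per-signal points reported above. The point values are taken from this report; the data structure is just one convenient way to hold them, and the score thresholds for each maturity level are not stated here, so no level mapping is attempted.

```python
# (points earned, points available) for each signal, as listed in this report.
SIGNALS = {
    "AI crawlers allowed": (0, 20),
    "llms.txt present": (0, 10),
    "Organization schema": (0, 15),
    "Article schema": (15, 15),
    "FAQPage schema": (0, 15),
    "Author Person schema": (10, 10),
    "Sitemap in robots.txt": (15, 15),
}

score = sum(earned for earned, _ in SIGNALS.values())
maximum = sum(available for _, available in SIGNALS.values())

print(f"{score}/{maximum} points across {len(SIGNALS)} signals")
```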

Publisher: Appearly (AI Visibility Platform)