Question 1

What's the difference between llms.txt and llms-full.txt?

Accepted Answer

llms.txt is the navigation file: a brief overview plus sectioned links to key pages. llms-full.txt is the content file: longer-form text intended to give an LLM enough context to answer detailed questions about your product without crawling further. Both are optional. Large sites typically ship both; smaller sites often ship only llms.txt.

Question 2

Is llms.txt actually used? Or is it just hype?

Accepted Answer

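Adoption is growing but still early (<3% of top sites). llms.txt is a community proposal rather than a ratified standard, and claims that the major LLM providers parse it during crawling are hard to verify, so treat them with caution. Where it is clearly useful today is developer tooling: coding assistants such as Cursor, Continue.dev and Claude Code can be pointed at an llms.txt or llms-full.txt file to ground their answers in your documentation. The file is cheap to publish, so the cost of being early is small, but an SEO payoff is not guaranteed.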

Question 3

Will llms.txt replace robots.txt + sitemap.xml?

Accepted Answer

No, they serve different purposes: robots.txt controls crawl access, sitemap.xml lists URLs for indexing, and llms.txt provides structured context for LLMs. You should have all three. Traditional crawlers (Googlebot, Bingbot) keep using robots.txt and sitemap.xml. AI crawlers (GPTBot, ClaudeBot) also follow robots.txt, and AI agents can use llms.txt for context where they support it.

Question 4

What if I don't want AI bots crawling my site?

Accepted Answer

Block them in robots.txt: add the line 'User-agent: GPTBot' followed by 'Disallow: /' on the next line (and the same for the other 7 bots). Note that robots.txt is advisory: well-behaved bots honor it, but it does not enforce anything. And if your business depends on discoverability (SaaS, e-commerce, content sites, agencies), blocking AI bots in 2026 is roughly equivalent to blocking Google in 2010. Customers ask AI for recommendations now.
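Before deploying a rule like this, it can help to confirm that it blocks what you expect. The sketch below uses Python's standard urllib.robotparser to evaluate a robots.txt body offline. The ROBOTS_TXT content and the example.com URL are illustrative assumptions, not a recommended policy.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: block GPTBot everywhere, allow everyone else.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""


def allowed(robots_txt: str, agent: str, url: str) -> bool:
    """Return True if `agent` may fetch `url` under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


print(allowed(ROBOTS_TXT, "GPTBot", "https://example.com/pricing"))     # False
print(allowed(ROBOTS_TXT, "Googlebot", "https://example.com/pricing"))  # True
```

Repeat the User-agent / Disallow pair for each bot you want to exclude, then rerun the check with that bot's name.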

Question 5

I'm using Cloudflare. Is that blocking AI bots?

Accepted Answer

Possibly. Cloudflare's 'Bot Fight Mode' and the newer 'Block AI Bots' toggle may be enabled by default on some plans. Check Dashboard > Security > Bots. Our checker explicitly tests live access as each AI agent, so you find this problem in about 60 seconds rather than wondering why your traffic disappeared from ChatGPT.
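A rough version of this kind of live check can be sketched with the Python standard library: send a request with a bot-like User-Agent header and look at the status code. This is not the checker's actual implementation. The AGENTS values are simplified placeholders (real bots send longer user-agent strings), and a firewall may also rely on signals other than the header, so a result here is only an approximation.

```python
import urllib.error
import urllib.request

# Simplified placeholder user-agent tokens (assumption: real strings are longer).
AGENTS = {"GPTBot": "GPTBot", "ClaudeBot": "ClaudeBot"}


def build_request(url: str, agent: str) -> urllib.request.Request:
    """Build a HEAD request that identifies itself with the given user-agent."""
    return urllib.request.Request(url, headers={"User-Agent": agent}, method="HEAD")


def probe(url: str, agent: str, opener=urllib.request.urlopen) -> int:
    """Return the HTTP status the server gives to `agent` for `url`."""
    try:
        with opener(build_request(url, agent), timeout=10) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses arrive as exceptions; the code is what we want.
        return err.code


def probe_all(url: str, opener=urllib.request.urlopen) -> dict:
    """Probe `url` once per agent and collect the status codes."""
    return {name: probe(url, ua, opener) for name, ua in AGENTS.items()}
```

Calling probe_all("https://yourdomain.com/llms.txt") would return a mapping from agent name to status code. A 403 for one agent alongside a 200 for a normal browser request suggests a bot rule is in the way.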

Question 6

What should be in my llms.txt?

Accepted Answer

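Per the spec: an H1 with your site name, a blockquote with a short (1-3 sentence) summary, then H2 sections of bulleted links, where each link can be followed by a colon and a brief description. Common sections: 'Documentation', 'Examples', 'API Reference', 'Optional'. The 'Optional' section has a special meaning: its links can be skipped when a shorter context is needed, so put less-critical pages there. Keep the total file under 2KB if possible, since LLMs work with limited context.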
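To make the layout concrete, here is a small sample file plus a minimal structural check, as a sketch. The project name, URLs and descriptions in SAMPLE are invented for illustration, and check_llms_txt is a hypothetical helper that covers only the basics (H1, blockquote, H2 sections with links), not the full spec.

```python
import re

# Invented sample content; only the layout matters.
SAMPLE = """\
# Example Project

> Example Project is a hypothetical tool used here only to illustrate the layout.

## Documentation

- [Quick start](https://example.com/docs/quickstart.md): Install and first run

## Optional

- [Changelog](https://example.com/changelog.md): Release history
"""

# "- [title](url)" with an optional ": description" suffix.
LINK = re.compile(r"^- \[([^\]]+)\]\(([^)\s]+)\)(?::\s*(.*))?$")


def check_llms_txt(text: str):
    """Return (problems, sections), where sections maps H2 titles to link URLs."""
    lines = [ln.rstrip() for ln in text.splitlines() if ln.strip()]
    problems = []
    if not lines or not lines[0].startswith("# "):
        problems.append("first non-blank line should be an H1 with the site name")
    if not any(ln.startswith("> ") for ln in lines):
        problems.append("no blockquote summary found")
    sections, current = {}, None
    for ln in lines:
        if ln.startswith("## "):
            current = ln[3:].strip()
            sections[current] = []
        elif current is not None and ln.startswith("- "):
            match = LINK.match(ln)
            if match:
                sections[current].append(match.group(2))
            else:
                problems.append(f"malformed link line: {ln}")
    if not sections:
        problems.append("no H2 sections found")
    return problems, sections


problems, sections = check_llms_txt(SAMPLE)
print(problems)        # []
print(list(sections))  # ['Documentation', 'Optional']
```

A size check such as len(text.encode("utf-8")) < 2048 could be added if you want to enforce the 2KB guideline.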

Question 7

Where do I host llms.txt?

Accepted Answer

At the root by default: yourdomain.com/llms.txt. The spec also permits a subpath, but the root is the conventional location, so use it unless you have a specific reason not to. If you have multiple subdomains, host one file per subdomain. SPAs and static sites: just drop the file in your /public or /static folder.
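For scripts that need to locate the file, the root-level URL can be derived from any page URL on the same host. The helper below is a minimal sketch using the standard urllib.parse module. The name llms_txt_url and the example hosts are assumptions for illustration.

```python
from urllib.parse import urlsplit, urlunsplit


def llms_txt_url(page_url: str, filename: str = "llms.txt") -> str:
    """Return the root-level URL of `filename` for the host serving `page_url`."""
    parts = urlsplit(page_url)
    # Keep scheme and host (including subdomain); drop path, query and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/" + filename, "", ""))


print(llms_txt_url("https://docs.example.com/guide/intro?x=1"))
# https://docs.example.com/llms.txt
print(llms_txt_url("https://example.com/pricing", "llms-full.txt"))
# https://example.com/llms-full.txt
```

Because the host is preserved, each subdomain resolves to its own file, which matches the one-per-subdomain advice.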

Question 8

Does Schema.org actually matter for AI agents?

Accepted Answer

Yes, and increasingly so. LLMs can extract some context from natural-language HTML, but structured JSON-LD states the facts about your business explicitly: your name, location, products, prices, opening hours. Sites with good Schema.org markup are more likely to be cited correctly when AI summarizes them; sites without it risk vague or wrong answers.
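As an illustration of what such markup looks like, the sketch below builds a JSON-LD block for a made-up business and wraps it in the script tag that goes in the page head. The business details are invented. The property names (name, url, address, openingHours) are standard Schema.org vocabulary, but which type and properties fit your site is something to check against Schema.org itself.

```python
import json

# Invented example data for a hypothetical business.
business = {
    "@context": "https://schema.org",
    "@type": "LocalBusiness",
    "name": "Example Bakery",
    "url": "https://example.com",
    "address": {
        "@type": "PostalAddress",
        "addressLocality": "Springfield",
        "addressCountry": "US",
    },
    "openingHours": "Mo-Fr 08:00-17:00",
}


def json_ld_script(data: dict) -> str:
    """Serialize `data` as JSON-LD inside a script tag for the HTML head."""
    payload = json.dumps(data, indent=2, ensure_ascii=False)
    # Escape "</" so a stray "</script>" in a value cannot close the tag early.
    payload = payload.replace("</", "<\\/")
    return f'<script type="application/ld+json">\n{payload}\n</script>'


print(json_ld_script(business))
```

Generating the block from a dictionary rather than hand-writing it keeps the JSON valid as fields are added or changed.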

Free llms.txt validator

What our llms.txt validator tests

/llms.txt presence and HTTP status

Spec compliance — H1, blockquote intro, sections

Section structure — Required vs Optional

Link reachability and link descriptions

/llms-full.txt presence (extended spec)

AI bot user-agent accessibility (live HTTP)

robots.txt rules for AI bots

Schema.org structured data coverage

JavaScript-only rendering detection

OpenGraph completeness
