A new technical SEO checkbox is spreading across the web.
It sits at yoursite.com/llms.txt. Yoast can generate it for WordPress sites. Wix has rolled it out for its users. SEO tools and consultants are already turning it into another thing site owners feel behind on.
The pitch goes like this. Add this file and AI reads your site better. Get cited more in ChatGPT. Show up in AI answers.
The data does not back that up. So before you spend time on this, here is what llms.txt actually does, what it does not do, and the one place it genuinely earns its keep.
This content was originally published on masilat.com
What it is
llms.txt is a plain markdown file that sits at the root of your site. It lists your important pages as clean text, no ads, no navigation, no popups. The idea is to hand an AI a tidy summary of your content so it can read the main thing without parsing your entire page structure.
Jeremy Howard proposed it in 2024. Solid idea on paper. The confusion starts when people decide what it is supposed to be for.
It is not robots.txt. Stop calling it that.
The most common thing you will read is that llms.txt is “robots.txt for AI.” That framing is wrong, and it sends people in the wrong direction.
robots.txt controls bots. It tells crawlers where they can and cannot go. It blocks access.
llms.txt does the opposite. It invites AI in and points it toward your best pages. It includes content, not restricts it.
One blocks. One includes. Google’s John Mueller said the robots.txt comparison does not hold. If you want to control which AI bots crawl your site, that is a robots.txt job. There is a setup for that further down.
Does AI actually read it?
Here is where it falls apart.
A SE Ranking study across around 300,000 domains found about 10% had a llms.txt file. Among the 50 most-cited domains in AI answers, one had it.
OtterlyAI ran an experiment on a live site and tracked over 62,000 AI bot visits across 90 days. Requests to the llms.txt file: 0.1% of total traffic. Limy went bigger, tracking over 500 million AI bot events. GPTBot, ClaudeBot, PerplexityBot, OAI-SearchBot, Google-Extended. All of them skipped the file and crawled HTML directly.
Not one major provider has publicly committed to using llms.txt as a citation or ranking signal. Not OpenAI, not Anthropic, not Google, not Meta, not Mistral.
As an AI search play right now, the file does close to nothing. Anyone selling it as a GEO ranking lever is selling you a guess.
The Google situation
You may have seen people say Google supports llms.txt, then seen others say Google said no. Both camps are right. They are reading two different parts of Google.
Google Search has been clear. At Search Central in July 2025, Gary Illyes confirmed Google does not support llms.txt and has no plans to. John Mueller compared it to the old keywords meta tag, the self-declared signal Google killed years ago because it was too easy to game. Google’s own 2026 AI search guidance names llms.txt as a tactic that does not help.
So where does “but Google added it to their docs” come from?
In December 2025, an llms.txt file briefly appeared on Google’s own developer documentation. The SEO community noticed immediately. Google pulled it the same day. Mueller clarified: the Search team did not add it, did not use it, and does not endorse it. An internal content tool generated it automatically. Nobody caught it in time.
Crawling a file is not the same as using it. Googlebot fetches plenty of things it does not act on.
The second misread: Chrome is not Search. Google’s Chrome team added llms.txt as a check inside Lighthouse’s Agentic Browsing audit. That check is for browser-based agents reading a page, not for search rankings or AI citation frequency. Same company, two completely separate products with different jobs.
Once you see that split, the apparent contradiction disappears.
Where it actually works
Not search. Agents.
AI coding agents fetch llms.txt routinely. Point Cursor, Claude Code, GitHub Copilot, or Windsurf at a documentation site and they look for the file to find the right pages and load fewer tokens. This is why Stripe, Vercel, Cloudflare, and Anthropic all ship a clean one. Their users are building with these agents every day. A well-curated file means the agent writes working integration code. Without it, the agent guesses, hallucinates an endpoint, and the developer loses an hour debugging something that was never real.
The next wave is shopping agents. When an agent buys on behalf of a user, it needs a clean, machine-readable view of your catalog, pricing rules, and stock. Brands that hand agents a structured file win transactions. Brands with cluttered category HTML lose them to a competitor whose site is easier to parse.
The honest rule: if your readers include developers, or you run docs, or you build API products, a good llms.txt does real work today. If you run a service business and want more ChatGPT citations, it does close to nothing.
What actually moves AI visibility
Crawlable pages. Clean structure. Schema markup so machines understand what each page is. Strong, first-hand content. Real authority signals. None of this is new, and all of it works across every AI engine.
If you want to control how AI crawlers treat your site, robots.txt is the tool. OpenAI documents its own bots and tells site owners to manage them there. Here is a setup that blocks training and scraping bots while letting the on-demand citation fetchers through:
# Block training and scraping
User-agent: GPTBot
Disallow: /
User-agent: Google-Extended
Disallow: /
User-agent: ClaudeBot
Disallow: /
User-agent: CCBot
Disallow: /
User-agent: Meta-ExternalAgent
Disallow: /
# Allow on-demand citation fetches
User-agent: ChatGPT-User
Allow: /
User-agent: Claude-User
Allow: /
User-agent: PerplexityBot
Allow: /
Adjust it to fit what you want to allow. This file controls the bots. llms.txt never did, and it was never designed to.
The call
If a plugin generates llms.txt for free, leave it on. Costs nothing. Cheap insurance if adoption shifts later. Do not build anything around it expecting search traffic.
If you build for agents or run documentation, put time into a proper one. It pays off in that context.
Everywhere else, your hours go further on content, schema, authority, and a clean technical setup. That is still the lever.
The file a lot of people are rushing to add is an agent tool wearing an SEO costume.
Sources
- SE Ranking, llms.txt adoption study, 300,000 domains: https://seranking.com/blog/llms-txt/
- OtterlyAI, 90-day llms.txt log experiment: https://otterly.ai/blog/the-llms-txt-experiment/
- Limy, llms.txt in 2026, 500 million bot events analysed: https://limy.ai/blog/llms.txt-in-2026-the-full-guide
- Search Engine Journal, Mueller compares llms.txt to keywords meta tag: https://www.searchenginejournal.com/google-says-llms-txt-comparable-to-keywords-meta-tag/544804/
- Search Engine Land, llms.txt is not the new meta keywords: https://searchengineland.com/no-llms-txt-is-not-the-new-meta-keywords-458199
- TechWyse, Google AI search guide and Chrome Lighthouse audit: https://www.techwyse.com/news/ai-search/google-ai-search-optimization-guide-llms-txt-lighthouse-audit
- llms.txt specification: https://llmstxt.org/