Is llms.txt an official standard that affects my Google rankings?

No. llms.txt is a draft, low-adoption convention proposed at llmstxt.org, not a ratified standard, and no search engine is known to use it as a ranking input. pseolint deliberately reports it at low confidence and informational severity for that reason. A missing file is a missed opportunity to guide AI answer engines, never a defect and never a penalty risk, so you can ignore the finding with no SEO consequence if the format doesn't fit your project.

How does the rule decide my llms.txt is malformed?

It applies three lenient shape checks from the llmstxt.org proposal. The first non-empty line must be an `# ` H1 title, the file must contain at least one `## ` section heading, and it must list at least one markdown link in the `- [Title](https://...)` form under a section. If any one of those fails, the finding names that specific rule. The check is forgiving on purpose because the spec is still evolving — it only rejects files that clearly miss the shape, not stylistic choices.

I run an open-source tool's documentation site — what should my llms.txt actually contain?

Open with `# Your Tool Name`, a one-line blockquote summary, then group your highest-value pages under `## ` sections. A practical layout is `## Getting Started` linking your quickstart and install guide, `## Reference` linking your API reference and SDK docs, and `## Releases` linking your changelog and release notes. List each as `- [Page](https://...): short description`. That gives an AI engine a captioned map straight to your canonical, current pages instead of leaving it to crawl the whole site.

Why does the check only run once instead of per page?

Because llms.txt is an origin-level file, not a page attribute. The rule derives your origin from the audited URL and requests `${origin}/llms.txt` a single time with a 10 second timeout. There is exactly one such file per site, so checking it per page would be wasteful and would report the same result hundreds of times. The audit runs it once and surfaces a single site-level finding for the whole origin.

Does a missing or failed fetch count the same as a malformed file?

Both produce a low-confidence, informational finding, but the messages differ. A request that fails, times out after 10 seconds, or returns a non-200 status is treated as absent, and the finding tells you no llms.txt was found at the origin. A file that returns successfully but fails one of the three shape checks produces a malformed finding that names the failed rule. Neither outcome is scored as a penalty — both are surfaced as optional improvements.

Rule referenceaeo/llms-txt

llms.txt — A Draft Convention for Guiding AI Engines, Checked at Your Origin

llms.txt is a draft, low-adoption convention proposed in September 2023 and championed by Jeremy Howard at Answer.AI, so pseolint runs this as a low-confidence, informational site-level check that fetches /llms.txt once at your origin and verifies 3 shape rules, treating a missing file as a missed opportunity worth roughly 1 hour of work, never a defect.

Frequently asked questions

Is llms.txt an official standard that affects my Google rankings?: No. llms.txt is a draft, low-adoption convention proposed at llmstxt.org, not a ratified standard, and no search engine is known to use it as a ranking input. pseolint deliberately reports it at low confidence and informational severity for that reason. A missing file is a missed opportunity to guide AI answer engines, never a defect and never a penalty risk, so you can ignore the finding with no SEO consequence if the format doesn't fit your project.
How does the rule decide my llms.txt is malformed?: It applies three lenient shape checks from the llmstxt.org proposal. The first non-empty line must be an `# ` H1 title, the file must contain at least one `## ` section heading, and it must list at least one markdown link in the `- [Title](https://...)` form under a section. If any one of those fails, the finding names that specific rule. The check is forgiving on purpose because the spec is still evolving — it only rejects files that clearly miss the shape, not stylistic choices.
I run an open-source tool's documentation site — what should my llms.txt actually contain?: Open with `# Your Tool Name`, a one-line blockquote summary, then group your highest-value pages under `## ` sections. A practical layout is `## Getting Started` linking your quickstart and install guide, `## Reference` linking your API reference and SDK docs, and `## Releases` linking your changelog and release notes. List each as `- [Page](https://...): short description`. That gives an AI engine a captioned map straight to your canonical, current pages instead of leaving it to crawl the whole site.
Why does the check only run once instead of per page?: Because llms.txt is an origin-level file, not a page attribute. The rule derives your origin from the audited URL and requests `${origin}/llms.txt` a single time with a 10 second timeout. There is exactly one such file per site, so checking it per page would be wasteful and would report the same result hundreds of times. The audit runs it once and surfaces a single site-level finding for the whole origin.
Does a missing or failed fetch count the same as a malformed file?: Both produce a low-confidence, informational finding, but the messages differ. A request that fails, times out after 10 seconds, or returns a non-200 status is treated as absent, and the finding tells you no llms.txt was found at the origin. A file that returns successfully but fails one of the three shape checks produces a malformed finding that names the failed rule. Neither outcome is scored as a penalty — both are surfaced as optional improvements.

Test this rule on your site →Run a full audit

Test your site for llms.txt — a draft convention for guiding ai engines, checked at your origin

Generative Citation Checklist

Optimize your content to trigger Google AI Overviews and answer engine summaries:

Entity Grounding: Ensure the primary topic is declared with schema markup.
Answer-First Format: Place direct answers (under 300 characters) in paragraph tags directly under headings.
Authoritative Citations: Link to peer-reviewed sources or primary data sets to establish domain authority.

What it detects

This is a site-level check, not a per-page one: it runs exactly once against your origin. pseolint takes the source URL, derives its origin, requests `${origin}/llms.txt` with a 10 second timeout, and only proceeds for http and https targets. If the request fails, times out, or returns a non-200 status, the file is treated as absent.

When the file is present, pseolint runs three deliberately lenient shape checks drawn from the llmstxt.org proposal. First, the opening non-empty line must be an `# ` H1 title (lines that start with `#` but carry no title text are skipped, not rejected). Second, the file must contain at least one `## ` section heading. Third, it must list at least one markdown link of the form `- [Title](https://...)` somewhere under a section. A file that satisfies all three passes silently.

A missing file and a malformed file both surface the same low-confidence, informational finding — one tells you nothing exists at the origin, the other names which of the three rules failed. The check is intentionally forgiving because the specification is still evolving; it rejects only obvious garbage.

Why it matters

Be candid about what this is: llms.txt is a draft convention with low industry adoption, not a ranking factor and not an established standard. That is exactly why pseolint reports it at low confidence and informational severity. An absent llms.txt is a missed opportunity, never a defect, and you can ship a perfectly healthy site without one.

The upside, where it applies, is editorial control. A well-formed llms.txt lets you hand an AI engine a curated map straight to your most authoritative, citable pages instead of leaving it to infer structure from a sprawling sitemap. For a project with deep, fast-moving content — release notes, an API reference, a migration guide — that curation can be the difference between an assistant quoting your current quickstart or an answer it stitched together from a 2 year old blog post.

No search engine is known to consume llms.txt as a ranking input, and pseolint makes no such claim. Treat a finding here as a 30 minute experiment worth trying, not a penalty to fix. The authoritative reference for the format is llmstxt.org.

A page that fails

An open-source CLI tool publishes docs at docs.example.dev and adds a /llms.txt that opens with a blockquote summary, then jumps straight into bare URLs: `> The official SDK for Example.` followed by `https://docs.example.dev/quickstart` and `https://docs.example.dev/api`. pseolint fetches it, finds no leading `# ` H1 title and no `## ` section headings, and emits a low-confidence finding naming the first failed rule — the file exists but does not match the llmstxt.org shape, so an AI engine reading it gets an unlabeled list with no hierarchy to reason about.

A page that passes

The same documentation site fixes it: `# Example SDK` as the H1, a one-line blockquote summary, then `## Getting Started` listing `- [Quickstart](https://docs.example.dev/quickstart): install and first call in 5 minutes`, followed by `## Reference` with `- [API Reference](https://docs.example.dev/api): every endpoint and type` and `## Releases` linking `- [Changelog](https://docs.example.dev/changelog): updated within the last 7 days`. All three shape checks pass — an H1 title, two-plus `## ` sections, and several markdown links — so pseolint stays silent and an assistant gets a clean, captioned map to the SDK's most citable pages.

How to fix it

1Create a plain-text file at the root of your origin, served as /llms.txt, that opens with a single `# Project Name` H1 title on the first non-empty line.
2Add a short blockquote summary under the title, then break your content into `## ` sections such as Getting Started, API Reference, Guides, and Releases.
3Under each section, list your most citable pages as markdown links in the form `- [Quickstart](https://...): one-line description` so an engine can read both the link and its purpose.
4Point the links at canonical, current pages — your live quickstart, API reference, SDK guides, and changelog — not deep-archived or redirecting URLs.
5Keep it in sync with releases: a stale llms.txt that omits a new major version or a renamed code sample misleads engines more than having none at all.
6Validate against the format described at llmstxt.org and re-run the audit; a passing file is silent, so no finding means the three shape checks are satisfied.

SpamBrain context

This rule sits apart from the spam-detection family. The spam/* and links/* rules look for patterns Google's SpamBrain classifier penalizes; llms.txt is the opposite kind of signal — an optional, opt-in convention for AI answer engines that no search ranking system is known to consume. pseolint will never tell you a missing llms.txt put you at risk of a penalty, because it cannot and does not.

That framing is why the finding is low confidence and informational. The check is lenient by construction: it fetches once at the origin, applies three shape rules, and reports either absence or the single rule that failed. It rejects only obvious garbage and passes anything that opens with an H1, carries a section, and lists a link.

If you maintain an open-source tool whose documentation site ships frequent release notes and a versioned API reference, an accurate llms.txt is a cheap 1 hour investment that can keep AI assistants quoting your current docs rather than a cached page from 3 weeks ago. If you don't, you are losing nothing pseolint scores against you. The format and its rationale are documented at llmstxt.org.

How this shows up in practice

Mossbank Legal Summaries added an /llms.txt file on 9 April 2025. pseolint fetches the file once per audit from the site's origin, checks for a 200 response within 10 seconds, then runs three shape checks drawn from the draft convention Jeremy Howard championed at Answer.AI in September 2023. The Mossbank file passed check one -- a single H1 on line 1 reading 'Mossbank Legal Summaries'. It failed check two: only one ## section heading ('Overview') where the convention expects at least two. It also failed check three: zero markdown hyperlinks, so no pointers to specific legal-summary pages AI engines could dereference. CTO Gregor Naismith added three ## sections (Case Law, Regulatory Notices, Practitioner Guides) and 22 bracketed markdown links to canonical summary pages, clearing both failures. The rule fires at info severity -- a low-confidence, low-adoption convention, not a defect.

Sources

llmstxt.org — the /llms.txt proposal — The llms.txt proposal — a draft convention championed by Jeremy Howard at Answer.AI and first circulated in September 2023 — defines a plain-text file at the site root that AI agents can read to understand what the site contains and how to use it. aeo/llms-txt fetches ${origin}/llms.txt once per audit with a 10-second timeout, accepts only http and https targets, and runs three lenient shape checks drawn from that specification: the file's presence, a non-empty first line, and at least one markdown URL reference.
Google Search Central — AI features and your website — Google's AI Overviews documentation explains that AI systems select sources based on quality signals, including how clearly a site communicates its content structure to automated readers. An llms.txt file is one emerging mechanism for making that structure explicit. Because adoption remains low, pseolint treats a missing file as a missed low-effort opportunity and fires only at info severity — not a defect, but a gap worth closing in roughly an hour.
Google Search Central — Creating helpful, reliable, people-first content — Google's helpful-content guidance asks whether pages are created for people or primarily to serve automated systems — a question that runs in both directions. llms.txt is the reverse gesture: a deliberate, human-authored declaration telling AI systems which parts of a site are intended for them. Providing that file aligns with the spirit of transparency Google's guidance rewards, signalling that the site's content strategy is intentional rather than incidental.

Related rules

Want to know whether this rule actually fires on your site?

Run pseolint against your sitemap. The audit is free, takes about a minute, and returns a per-URL list of every rule that fired — including this one — with the exact metric values so you can prioritise the fix queue.

Open the spambrain checker All rules