Question 1

Is this actually checking SpamBrain or just guessing?

Accepted Answer

We don't have access to Google's SpamBrain classifier — nobody outside Google does. The checker runs against rules we inferred from public Search Central documentation, the March 2024 and May 2024 spam policy updates, leaked Search API documents, and observed before/after patterns on sites that got hit. Treat the score as a structured second opinion, not a verdict. If your score is high, Google probably agrees; if it's low, you've eliminated the obvious failure modes.

Question 2

Will running the audit hurt my site or get me penalized?

Accepted Answer

No. We send standard GET requests with a clearly identified user agent (pseolint/0.7.4 +https://pseolint.dev/bot), respect robots.txt and Crawl-delay, cap concurrency at 5, and stop at 50 pages or 50 MB total. Your analytics won't see the traffic and Search Console won't flag anything. Audits are read-only.

Question 3

How is this different from a generic SEO crawler like Screaming Frog or Sitebulb?

Accepted Answer

Generic crawlers report on technical SEO — broken links, missing alt text, redirect chains. They are also paid: Screaming Frog runs £199/yr, Sitebulb $35/mo, Ahrefs Site Audit $129/mo. pseolint is free and MIT-licensed. The SpamBrain checker only reports on signals that look like they map to spam classification: thin content thresholds (default 300-word floor), near-duplicate templates above 85% SimHash similarity, doorway patterns, AI-generated boilerplate above an 80% ratio, internal link cliques, third-party content abuse. It's a much narrower, more opinionated lens.

Question 4

What if my site has 200,000 pages and you only audit 50?

Accepted Answer

The 50-page sample is weighted to oversample templated URL patterns, so you'll usually see your worst clusters even on huge sites. That said, sampling is lossy — a single bad template that lives in a tiny corner of the sitemap can be missed. If you need full coverage, the Pro plan audits up to 500 pages per run and supports scheduled monitoring.

Question 5

Does this catch sites hit by the March 2024 scaled content abuse update?

Accepted Answer

It catches the structural patterns that update was designed to demote — pages that read like reusable templates with one variable swapped per page, large-scale AI generation without unique research, and content farms that publish more than they could plausibly fact-check. We don't directly query whether a domain is currently demoted (that data isn't public) but the signals overlap heavily with what the update penalized.

Question 6

Is the rule engine open source?

Accepted Answer

Yes. The full rule set lives at github.com/ouranos-labs/pseolint under the MIT license as the @pseolint/core package — you can run it locally with the CLI (`npm i -g pseolint`), audit your CI builds, or fork the rules. The hosted checker on this page is the same engine wrapped in a sampler and a UI.

Free SpamBrain checker for programmatic SEO sites

What it does

Why it matters

How it works

What you get

FAQ

What a scan turns up

Sources

Related tools