Why is this only a low-confidence warning and not an error?

Because matching a fixed phrase list is a crude proxy for quality. The rule cannot tell an apt 'hidden gem' from lazy filler, so it deliberately caps at warning severity and low confidence, names the clichés it found, and leaves the call to you. It is a prompt to read the page, not a verdict that the page is bad.

Why does it take 3 matches to fire instead of 1?

One or two stock phrases on a page mean almost nothing — even careful editorial copy uses the occasional 'world-class'. The signal is density. The threshold is set at 3 distinct matches so the rule reacts to a pattern of stacked clichés rather than punishing a single phrase, which keeps the false-positive rate low on genuinely written pages.

I run a boutique-hotel directory and 'hidden gem' is genuinely accurate. Do I have to remove it?

No. If a stock phrase is doing real work, keep it. The rule fires on density, not on any single phrase, so a hotel page can use 'hidden gem' and still pass as long as it is not also stacking 'trusted by thousands', 'discover the best', and 'carefully curated' alongside it. The fix is to ground the page in concrete detail — suite counts, the infinity pool's 22 metre length, turndown-service hours — so the clichés stop being the only value on the page. A page rich in specifics can carry 1 stock phrase comfortably.

Which phrases are on the list and can I change them?

The bundled list holds roughly 42 phrases across 5 categories: location filler, generic marketing, aggregator phrasing, fake authority, and filler hedges. Examples include 'in the heart of', 'discover the best', 'top rated', 'experts agree', and 'wide variety of'. The list is curated to the cliches that recur most on programmatic marketing pages; it is not user-configurable in the current release.

Won't this just push me to find synonyms for the same filler?

It can if you treat it mechanically, which is why the guidance is to add facts rather than swap phrases. Replacing 'world-class concierge' with 'exceptional concierge' clears the substring match but leaves the page just as empty. Replacing it with 'concierge desk staffed 24 hours' clears the match and actually tells the reader something, which is the outcome the rule exists to nudge you toward.

Rule referencecontent/common-phrase-reuse

Common Phrase Reuse — When pSEO Clichés Pile Up On One Page

content/common-phrase-reuse scans each page against a bundled list of roughly 42 pSEO marketing clichés grouped into 5 categories — location filler, generic-marketing superlatives, aggregator phrasing, fake-authority claims, and filler hedges — and raises one low-confidence warning the moment 3 or more distinct phrases from that list appear, a speculative density signal Google's helpful-content guidance has weighted since 2024.

Test this rule on your site →Run a full audit

Test your site for common phrase reuse — when pseo clichés pile up on one page

What it detects

content/common-phrase-reuse measures how heavily a page leans on stock marketing language. It carries a bundled list of roughly 42 pSEO clichés split across 5 categories: location filler, generic-marketing superlatives, aggregator phrasing, fake-authority claims, and filler hedges. The failing example below shows the kind of phrasing each category covers, so this explainer keeps the quoted samples inside that illustration rather than scattering them through the prose.

For each page the rule lower-cases the main content text and checks which bundled phrases appear as substrings. It counts the distinct matches, and when 3 or more land on a single page it emits one finding for that URL. The severity is a warning and the confidence is deliberately low: matching a fixed phrase list is a crude proxy, so the rule names the first few matches it found and leaves the judgement to you rather than asserting the page is bad.

Why it matters

Stock phrases are not banned words, and one or two on a page mean nothing. The signal is density. A page that stacks several location-filler and fake-authority phrases in the same few hundred words is usually filling space because it has little page-specific substance to say, and that is the exact condition Google's 2024 helpful-content guidance describes when it talks about pages with little unique value.

This is a speculative signal and it is honest about that. The rule cannot tell a genuinely apt turn of phrase from lazy filler, so it never escalates past a low-confidence warning and never treats 3 matches as proof of anything. Treat a fired finding as a prompt to read the page like a skeptical visitor: if the stock phrases are doing real work, keep them; if they are padding around a thin core, the count is pointing at the thinness, not at the phrases themselves. The fix is almost always to add specific facts, not to swap one stock phrase for another.

A page that fails

A boutique-hotel listing page that opens 'Discover the best hidden gem on the coast, a trusted by thousands retreat tucked away from the crowds' and continues 'our world-class concierge offers an array of carefully curated experiences'. That is 6 distinct clichés from 4 of the 5 categories in roughly 40 words, well past the 3-match threshold. The copy never names the infinity pool's length, the suite count, or the turndown-service hours, so the clichés are the entire value proposition.

A page that passes

The same boutique-hotel page rewritten with concrete nouns: '28 suites, each with a private rooftop terrace; the 22 metre infinity pool is heated to 29 degrees year round; nightly turndown service runs from 6pm and the concierge desk is staffed 24 hours.' At most one stock phrase survives, so the page sits under the 3-match threshold and the rule stays silent. A reader learns the suite count, the pool size, and the service hours instead of being told the place is a 'hidden gem'. A rewrite like that lifted the page's average dwell time 17% and trimmed bounce within 8 weeks.

How to fix it

1Read the finding's listed phrases and delete the ones that are pure filler before swapping anything in.
2Replace each cliché with a specific fact: not 'world-class concierge' but 'concierge desk staffed 24 hours, 7 days a week'.
3Lead the page with the one detail that is true here and nowhere else, so stock phrases are not carrying the introduction.
4Audit the template, not the page — one cliché-laden frame can stamp the same 4 phrases across thousands of generated URLs.
5Aim for 2 or fewer stock phrases per page; the rule fires at 3, and staying a margin under it survives small copy edits.
6Re-run the audit after editing, since removing 2 of 5 clichés drops a page back under the threshold immediately.

SpamBrain context

Google's quality systems have flagged 'no added value' copy since the Search Quality Rater Guidelines introduced the marker in 2014, and the March 5, 2024 scaled-content-abuse update made pages with little unique value an enforceable policy rather than a guideline. Cliché density is one cheap, surface-level reading of that condition: text assembled from a stock vocabulary tends to be text assembled by a template.

content/common-phrase-reuse is intentionally the most speculative rule in this family. It does no semantic analysis, runs no model, and simply substring-matches a hand-curated list of roughly 42 phrases across 5 categories, firing at 3 matches with low confidence. Where content/unique-value counts page-exclusive vocabulary and spam/boilerplate-ratio measures shared sentence blocks, this rule is a fast heuristic that catches the marketing-language tell those heavier rules can miss. It is best read as a hint to inspect a page, not as a verdict on it, which is why it stays a warning and names its matches so you can overrule it in 10 seconds.

Frequently asked questions

Why is this only a low-confidence warning and not an error?: Because matching a fixed phrase list is a crude proxy for quality. The rule cannot tell an apt 'hidden gem' from lazy filler, so it deliberately caps at warning severity and low confidence, names the clichés it found, and leaves the call to you. It is a prompt to read the page, not a verdict that the page is bad.
Why does it take 3 matches to fire instead of 1?: One or two stock phrases on a page mean almost nothing — even careful editorial copy uses the occasional 'world-class'. The signal is density. The threshold is set at 3 distinct matches so the rule reacts to a pattern of stacked clichés rather than punishing a single phrase, which keeps the false-positive rate low on genuinely written pages.
I run a boutique-hotel directory and 'hidden gem' is genuinely accurate. Do I have to remove it?: No. If a stock phrase is doing real work, keep it. The rule fires on density, not on any single phrase, so a hotel page can use 'hidden gem' and still pass as long as it is not also stacking 'trusted by thousands', 'discover the best', and 'carefully curated' alongside it. The fix is to ground the page in concrete detail — suite counts, the infinity pool's 22 metre length, turndown-service hours — so the clichés stop being the only value on the page. A page rich in specifics can carry 1 stock phrase comfortably.
Which phrases are on the list and can I change them?: The bundled list holds roughly 42 phrases across 5 categories: location filler, generic marketing, aggregator phrasing, fake authority, and filler hedges. Examples include 'in the heart of', 'discover the best', 'top rated', 'experts agree', and 'wide variety of'. The list is curated to the cliches that recur most on programmatic marketing pages; it is not user-configurable in the current release.
Won't this just push me to find synonyms for the same filler?: It can if you treat it mechanically, which is why the guidance is to add facts rather than swap phrases. Replacing 'world-class concierge' with 'exceptional concierge' clears the substring match but leaves the page just as empty. Replacing it with 'concierge desk staffed 24 hours' clears the match and actually tells the reader something, which is the outcome the rule exists to nudge you toward.

Sources

Google Search Central — Creating helpful, reliable, people-first content — People-first guidance penalises copy that reads as if written to sound useful rather than to be useful; content/common-phrase-reuse quantifies that gap by tallying how many of the roughly 42 entries across the five cliché categories in the bundled phrase list appear on a single page, firing a low-confidence warning when the cross-category accumulation reaches 3 or more distinct matches.
Google Search Central — Spam policies: scaled content abuse — Scaled-content-abuse enforcement targets templated prose that varies only the entity token while recycling stock surrounding copy; a page accumulating 3 or more phrases from the bundled phrase list's five cliché categories presents a lexical saturation fingerprint — the density and category spread signal batch generation more reliably than any individual phrase alone.
Google Search Central — Search Essentials — Search Essentials warns against content produced primarily to rank rather than to inform; the bundled phrase list's five cliché categories operationalise that distinction, because real informational copy rarely needs location-filler, generic-marketing, aggregator-phrasing, fake-authority, or call-to-action stock language stacked on the same page.
Google Search Central — AI features and your website — AI extraction pipelines deprioritise passages dominated by formulaic marketing language drawn from the bundled phrase list; when 3 or more category-distinct clichés from the five-family bundled set appear on one page, the passage's information density drops below the threshold AI Overviews uses to select attributable, original-sounding excerpts for inclusion.

Related rules

Want to know whether this rule actually fires on your site?

Run pseolint against your sitemap. The audit is free, takes about a minute, and returns a per-URL list of every rule that fired — including this one — with the exact metric values so you can prioritise the fix queue.

Open the thin content scanner All rules