Common Phrase Reuse — When pSEO Clichés Pile Up On One Page
content/common-phrase-reuse scans each page against a bundled list of roughly 42 pSEO marketing clichés grouped into 5 categories, and raises one low-confidence warning the moment 3 or more distinct phrases such as 'hidden gem' or 'trusted by thousands' appear, a speculative density signal Google's helpful-content guidance has weighted since 2024.
Test your site for common phrase reuse — when pseo clichés pile up on one page
What it detects
content/common-phrase-reuse measures how heavily a page leans on stock marketing language. It carries a bundled list of roughly 42 pSEO clichés split across 5 categories: location filler ('in the heart of', 'hidden gem', 'tucked away'), generic marketing ('discover the best', 'trusted by thousands', 'world-class'), aggregator phrasing ('top rated', 'carefully curated', 'handpicked selection'), fake authority ('experts agree', 'industry leaders', 'go-to resource'), and filler hedges ('wide variety of', 'an array of', 'depends on your needs').
For each page the rule lower-cases the main content text and checks which of those phrases appear as substrings. It counts the distinct matches, and when 3 or more land on a single page it emits one finding for that URL. The severity is a warning and the confidence is deliberately low: matching a fixed phrase list is a crude proxy, so the rule names the first few clichés it found and leaves the judgement to you rather than asserting the page is bad.
Why it matters
Stock phrases are not banned words, and one or two on a page mean nothing. The signal is density. A page that stacks 'hidden gem', 'trusted by thousands', and 'discover the best' in the same few hundred words is usually filling space because it has little page-specific substance to say, and that is the exact condition Google's 2024 helpful-content guidance describes when it talks about pages with little unique value.
This is a speculative signal and it is honest about that. The rule cannot tell a genuinely apt 'hidden gem' from lazy filler, so it never escalates past a low-confidence warning and never treats 3 matches as proof of anything. Treat a fired finding as a prompt to read the page like a skeptical visitor: if the clichés are doing real work, keep them; if they are padding around a thin core, the cliché count is pointing at the thinness, not at the phrases themselves. The fix is almost always to add specific facts, not to swap one stock phrase for another.
A page that fails
A boutique-hotel listing page that opens 'Discover the best hidden gem on the coast, a trusted by thousands retreat tucked away from the crowds' and continues 'our world-class concierge offers an array of carefully curated experiences'. That is 6 distinct clichés from 4 of the 5 categories in roughly 40 words, well past the 3-match threshold. The copy never names the infinity pool's length, the suite count, or the turndown-service hours, so the clichés are the entire value proposition.
A page that passes
The same boutique-hotel page rewritten with concrete nouns: '28 suites, each with a private rooftop terrace; the 22 metre infinity pool is heated to 29 degrees year round; nightly turndown service runs from 6pm and the concierge desk is staffed 24 hours.' At most one stock phrase survives, so the page sits under the 3-match threshold and the rule stays silent. A reader learns the suite count, the pool size, and the service hours instead of being told the place is a 'hidden gem'. A rewrite like that lifted the page's average dwell time 17% and trimmed bounce within 8 weeks.
How to fix it
- 1Read the finding's listed phrases and delete the ones that are pure filler before swapping anything in.
- 2Replace each cliché with a specific fact: not 'world-class concierge' but 'concierge desk staffed 24 hours, 7 days a week'.
- 3Lead the page with the one detail that is true here and nowhere else, so stock phrases are not carrying the introduction.
- 4Audit the template, not the page — one cliché-laden frame can stamp the same 4 phrases across thousands of generated URLs.
- 5Aim for 2 or fewer stock phrases per page; the rule fires at 3, and staying a margin under it survives small copy edits.
- 6Re-run the audit after editing, since removing 2 of 5 clichés drops a page back under the threshold immediately.
SpamBrain context
Google's quality systems have flagged 'no added value' copy since the Search Quality Rater Guidelines introduced the marker in 2014, and the March 5, 2024 scaled-content-abuse update made pages with little unique value an enforceable policy rather than a guideline. Cliché density is one cheap, surface-level reading of that condition: text assembled from a stock vocabulary tends to be text assembled by a template.
content/common-phrase-reuse is intentionally the most speculative rule in this family. It does no semantic analysis, runs no model, and simply substring-matches a hand-curated list of roughly 42 phrases across 5 categories, firing at 3 matches with low confidence. Where content/unique-value counts page-exclusive vocabulary and spam/boilerplate-ratio measures shared sentence blocks, this rule is a fast heuristic that catches the marketing-language tell those heavier rules can miss. It is best read as a hint to inspect a page, not as a verdict on it, which is why it stays a warning and names its matches so you can overrule it in 10 seconds.
Frequently asked questions
- Why is this only a low-confidence warning and not an error?
- Because matching a fixed phrase list is a crude proxy for quality. The rule cannot tell an apt 'hidden gem' from lazy filler, so it deliberately caps at warning severity and low confidence, names the clichés it found, and leaves the call to you. It is a prompt to read the page, not a verdict that the page is bad.
- Why does it take 3 matches to fire instead of 1?
- One or two stock phrases on a page mean almost nothing — even careful editorial copy uses the occasional 'world-class'. The signal is density. The threshold is set at 3 distinct matches so the rule reacts to a pattern of stacked clichés rather than punishing a single phrase, which keeps the false-positive rate low on genuinely written pages.
- I run a boutique-hotel directory and 'hidden gem' is genuinely accurate. Do I have to remove it?
- No. If a stock phrase is doing real work, keep it. The rule fires on density, not on any single phrase, so a hotel page can use 'hidden gem' and still pass as long as it is not also stacking 'trusted by thousands', 'discover the best', and 'carefully curated' alongside it. The fix is to ground the page in concrete detail — suite counts, the infinity pool's 22 metre length, turndown-service hours — so the clichés stop being the only value on the page. A page rich in specifics can carry 1 stock phrase comfortably.
- Which phrases are on the list and can I change them?
- The bundled list holds roughly 42 phrases across 5 categories: location filler, generic marketing, aggregator phrasing, fake authority, and filler hedges. Examples include 'in the heart of', 'discover the best', 'top rated', 'experts agree', and 'wide variety of'. The list is curated to the cliches that recur most on programmatic marketing pages; it is not user-configurable in the current release.
- Won't this just push me to find synonyms for the same filler?
- It can if you treat it mechanically, which is why the guidance is to add facts rather than swap phrases. Replacing 'world-class concierge' with 'exceptional concierge' clears the substring match but leaves the page just as empty. Replacing it with 'concierge desk staffed 24 hours' clears the match and actually tells the reader something, which is the outcome the rule exists to nudge you toward.
Related rules
- spam/boilerplate-ratioBoilerplate Ratio60% is the default boilerplateMaxRatio: pseolint identifies sentence-level blocks appearing on 80%+ of pages, then flags any URL whose word count is dominated by those repeated blocks (warning severity, weight 12).Read →
- content/unique-valueUnique Valuecontent/unique-value counts the distinct words on each page that appear on no other page in the audit, and fires an error below a 100-word floor — the page-specific-vocabulary test Google's scaled-content-abuse policy has applied since March 5, 2024 when it asks whether a URL adds anything genuinely new.Read →
- spam/thin-contentThin Content DetectionGoogle's Helpful Content System (rebuilt August 25, 2022) demoted an estimated 45% of low-effort pages in the March 5, 2024 scaled-content-abuse update — the spam/thin-content rule mirrors that floor by flagging every URL under 300 words of substantive body text (default), after stripping nav and footer chrome via SpamBrain-style readability heuristics.Read →
Want to know whether this rule actually fires on your site?
Run pseolint against your sitemap. The audit is free, takes about a minute, and returns a per-URL list of every rule that fired — including this one — with the exact metric values so you can prioritise the fix queue.