Many websites do not have too little text. They have too little substance.
An article can be 2,000 words long and still add almost nothing of its own. A category page can be short and still deliver exactly the right answer. is therefore not simply a question of word count.
The RankScan insight combines three signals that together indicate a clear quality problem:
- “Thin content”: The page offers too little unique value or covers the only superficially.
- “Low ”: The content contains too many generic statements and too few specific, verifiable pieces of information.
- “Missing source evidence”: Claims, numbers or specialist information are not backed up sufficiently.
This combination is especially critical because it affects both classic and modern AI visibility. Pages without substance, facts and sources have a lower chance of being perceived as a helpful answer, trustworthy source or citeable section.
The key point is the right interpretation:
is not created through more words, but through better informational value.
- Thin content does not simply mean “short text”; it means missing added value.
- Long content can also be thin if it only repeats familiar statements.
- Fact density describes how many specific, verifiable pieces of information a text contains. It is a RankScan quality criterion, not an official Google metric.
- shows whether important claims are supported by reliable evidence.
- Google recommends helpful, reliable and instead of content created primarily for search engines.
- Automatically or mass-generated content without added value can violate Google’s spam policies.
- For , clear definitions, concrete data, structured sections and robust sources are especially important.
- A good thin-content check does not identify thin content only by word count, but by search intent, fact density, sources, redundancy and performance.
- The best measure is often not “more content”, but consolidation, updating or clear differentiation.
What Is Thin Content? #
Thin content refers to content with little unique user value. A page is thin if it does not sufficiently answer a search intent, provides hardly any concrete information or adds nothing substantial to existing content.
In its spam policies, Google refers to practices such as automatically generated content without added value, copied content, doorway pages and thin affiliate pages. The decisive factor is not the production method alone, but whether the content was created primarily to manipulate rankings instead of helping users.
Source: Google Search Central – Spam Policies for Google Web Search
Thin content can take many forms:
- very short pages without a clear answer,
- long texts with a lot of filler,
- copied manufacturer descriptions,
- automatically generated mass content,
- almost identical location pages,
- empty category or tag pages,
- affiliate pages without original evaluation,
- blog articles without a new perspective,
- guides without sources,
- glossary entries without context,
- AI-written texts without editorial review.
Thin Content Is Not a Word-Count Problem #
A common mistake is the assumption:
few words = thin content
many words = high-quality content
That is not true.
A short answer can be high quality if it is precise. A long article can be thin if it consists mainly of repetition, vague phrases and statements everyone already knows.
Example:
Weak:
Content quality is very important because good content is rated better by Google and convinces users.Stronger:
Content quality emerges when a page fully answers the search intent, provides concrete facts, shows recognizable and supports central claims with reliable sources.
The second sentence is not much longer, but it is much more specific.
Thin content should therefore not be evaluated only by text length, but by:
- search intent,
- content coverage,
- share of factual information,
- source quality,
- uniqueness,
- redundancy,
- performance,
- page type,
- competitor comparison.
What Does “Thin Content” Mean in RankScan? #
The “Thin content” insight means: a page appears too weak in relation to its goal, page type or search intent.
This can mean:
- too little content for a complex topic,
- no clear answer to the ,
- no original perspective,
- many generic statements,
- hardly any examples,
- no data,
- no sources,
- overlap with other pages,
- too little visible expertise,
- very low impressions despite ,
- “ – currently not indexed” in Google Search Console (GSC),
- strong competing pages with significantly higher informational value.
Important: not every short page is problematic. A contact page, privacy page or login page does not need to be a long guide. What matters is whether the page is helpful enough for its purpose.
What Does “Low Fact Density” Mean? #
Fact density is not an official Google metric. In RankScan, it serves as an editorial quality criterion: how many concrete, verifiable and helpful pieces of information does a piece of content contain in relation to general statements?
The “Low fact density” insight means: the content contains too few concrete, verifiable pieces of information relative to the text volume.
Low fact density can be recognized by:
- many empty phrases,
- many generic statements,
- few numbers,
- few definitions,
- few examples,
- hardly any process steps,
- hardly any comparisons,
- no data points,
- no clear claims,
- a lot of marketing language,
- little verifiable substance.
High fact density does not mean overloading every sentence with numbers. It means that the text has more concrete informational power.
Examples of fact-dense elements:
- definitions,
- numbers,
- time periods,
- sources,
- tables,
- checklists,
- concrete examples,
- before-and-after comparisons,
- process steps,
- criteria,
- proprietary data,
- concrete observations from projects.
What Does “Missing Source Evidence” Mean? #
The “Missing source evidence” insight means: the content contains statements that would require sources, but they are not sufficiently documented.
This especially applies to:
- statistics,
- studies,
- legal statements,
- medical statements,
- financial statements,
- technical standards,
- current market data,
- statements about Google,
- statements about AI systems,
- benchmarks,
- comparisons,
- industry trends.
For helpful, reliable and people-first content, Google recommends checking questions around originality, trustworthiness, source quality and expertise, among other factors.
Source: Google Search Central – Creating helpful, reliable, people-first content
Source quality does not mean adding random links. Good sources are:
- primary sources,
- official documentation,
- public authorities,
- research papers,
- reputable studies,
- professional associations,
- proprietary data,
- transparent methodology,
- traceable case studies.
Why High-Quality Content Matters for SEO and AI Visibility #
Search engines and AI systems evaluate content differently, but both benefit from clear, reliable and structured information.
For classic , high-quality content helps because it:
- satisfies search intent better,
- keeps users on the page longer,
- earns more internal and external links,
- enables better ,
- builds stronger topical relevance,
- strengthens trust.
For AI visibility, it helps because AI systems often retrieve, summarize and cite content section by section. Passages are especially suitable when they:
- answer a clear question,
- contain concrete definitions,
- present facts in a structured way,
- name sources,
- contain little filler,
- provide original information.
Google describes and AI Mode as search features in which content from the web may be used, and it advises website owners to continue making content accessible and helpful according to the same principles.
Source: Google Search Central – AI features and your website
Important: high-quality content does not guarantee a mention in AI answers. But it improves the conditions for being considered a clear and trustworthy source.
Increase Fact Density: From Empty Phrases to Citeable Statements #
Fact density is created when statements are specific, verifiable and useful.
| Low fact density | High fact density |
|---|---|
| Our solution saves a lot of time. | In the example process, processing time drops from 12 to 5 minutes per case. |
| 404 errors are bad for SEO. | 404 errors are especially critical when , backlinks, URLs or former ranking pages are affected. |
| Meta descriptions should not be too long. | Meta descriptions are shortened depending on available space; the most important message should therefore appear within the first 100–120 characters. |
| Good content needs sources. | Statistics, studies, legal statements and technical standards should be supported with primary sources or official documentation. |
A simple rule:
Remove every statement that sounds interchangeable — or make it more specific.
Source Quality: Which Sources Are Useful? #
Not every source has the same value.
| Source type | Quality | Example |
|---|---|---|
| Primary source | very high | Google documentation, official documentation of an |
| Public authority / institution | very high | Swiss Federal Statistical Office, EU, Federal Office of Public Health, OECD |
| Research / paper | high | arXiv, universities, academic journals |
| Professional association | high to medium | industry associations, standards bodies |
| Proprietary data | high, if methodology is transparent | own analysis, survey, logfile evaluation |
| Trade publication | medium to high | established industry media |
| Blog without methodology | low to medium | use only as supplementary material |
| AI-generated summary | low | do not use as a source for facts |
For important claims, the following applies:
If the statement needs to be verifiable, it needs a reliable source.
Which Statements Need Sources? #
Sources are especially important for:
- numbers,
- percentages,
- trends,
- studies,
- benchmarks,
- legal statements,
- medical statements,
- financial statements,
- technical standards,
- statements about Google updates,
- statements about AI systems,
- market comparisons,
- historical developments,
- product or tool comparisons.
External sources are usually not necessary for:
- your own recommendations,
- clearly marked experience-based insights,
- internal processes,
- examples,
- checklists,
- simple definitions,
- editorial evaluations.
High-Quality Content by Page Type #
Not every page type needs the same kind of content.
| Page type | What high quality means |
|---|---|
| Guide | fully answer the search intent, examples, sources, clear structure |
| Product page | original product information, images, availability, Frequently Asked Questions (FAQ), usage, reviews |
| Category | explain the selection, filter logic, buying advice, internal linking |
| Service page | concrete service, target audience, process, benefit, cases, contact |
| Glossary | precise definition, context, examples, related terms |
| FAQ | real user questions, clear answers, no artificial questions |
| Landing page | focus, benefit, proof, trust, clear conversion |
| Author page | experience, areas of expertise, publications, profiles |
| Study | methodology, data basis, results, limitations |
| Comparison | criteria, transparent methodology, table, conclusion |
Thin content should always be evaluated in the context of the page type. A good product page needs different substance than a scientific guide.
Prioritization: Which Content Problems Are Critical? #
| Problem | Priority | Why |
|---|---|---|
| “” content (YMYL), meaning topics that can affect money or health, without sources and author | High | high trust risk |
| Important landing page with generic content | High | commercial potential remains unused |
| Many almost identical location pages | High | doorway or redundancy risk |
| Product pages with copied manufacturer descriptions | High | no original added value |
| Guide with many empty phrases and few facts | Medium to high | weak citeability and relevance |
| Blog article with no impressions over a long period | Medium | check performance and search intent |
| Empty tag or archive pages | Medium | often or consolidate |
| Individual short contact page | Low | the purpose may still be fulfilled |
| Privacy page without long text | Low | not SEO target content |
The most important rule:
Prioritize pages with visibility potential, business relevance or trust risk.
Improve Content: Added Value Instead of More Words #
When RankScan reports thin content, there is not just one solution. You should decide based on the cause.
| Diagnosis | Best measure |
|---|---|
| Topic covered only superficially | deepen the content |
| Content outdated | update substantially |
| many similar pages | consolidate |
| no search intent | set noindex or remove |
| copied manufacturer text | add original content |
| missing sources | add primary sources and evidence |
| low fact density | remove empty phrases, add facts |
| missing expertise | add author, reviewer, experience |
| wrong page type | adapt format to the |
| weak internal linking | strengthen clusters and links |
Consolidate Instead of Multiplying #
Many websites try to build visibility by creating more and more pages. This often results in many thin URLs.
Examples:
/seo-consulting-lucerne
/seo-consulting-zug
/seo-consulting-zurich
/seo-consulting-basel
If these pages are almost identical and do not offer local added value, they create a weak pattern.
Better:
- add real local information,
- show cases or references for each location,
- name contact persons or location data,
- or consolidate pages.
When several thin pages serve the same topic, merging them is often stronger than expanding every individual page.
What to Do After a RankScan Finding #
When RankScan reports “Thin content”, “Low fact density” or “Missing source evidence”, you should proceed systematically.
Step 1: Determine the Page Type #
Check whether it is a:
- guide,
- product page,
- category,
- service page,
- glossary,
- FAQ,
- location page,
- archive,
- landing page.
The page type determines how much depth and what kind of substance makes sense.
Step 2: Check Search Intent #
Ask:
- What does the user expect?
- Which page types currently rank?
- Is there a or ?
- Are the top results guides, shops, comparisons, videos or tools?
- Which questions appear in the SERP?
- Which information is missing on our page?
Step 3: Evaluate Performance #
Check:
- impressions,
- clicks,
- , meaning the click rate in search results,
- ranking position,
- indexing status,
- internal links,
- backlinks,
- conversion relevance,
- update date.
Not every weak page is worth improving. Prioritize potential.
Step 4: Evaluate Substance #
Check:
- does the page fully answer the search intent?
- are there concrete examples?
- is there data?
- are there sources?
- is there original experience?
- is there a clear structure?
- is there redundancy?
- is the content better than the top 10?
- is the content citeable?
Step 5: Choose the Measure #
Decide:
Update?
Deepen?
Merge?
Noindex?
Delete?
?
Rebuild as a cluster page?
A page without search potential and without user value should not be artificially inflated.
Step 6: Check Again After the Fix #
After the revision:
- crawl again with RankScan,
- monitor GSC data,
- check ranking development,
- update sources,
- adjust internal links,
- redirect old thin URLs,
- check noindex,
- check the snippet.
What a Good Thin-Content Check Looks At #
A good content-quality check should do more than report “too few words”.
A good check evaluates:
- text volume in relation to page type,
- visible main content,
- ratio of empty phrases to facts,
- source quality,
- internal and external evidence,
- search intent,
- content overlap,
- ,
- indexing status,
- GSC performance,
- date,
- author and trust signals,
- structured headings,
- FAQ/Q&A structure,
- proprietary data or examples,
- citeability of individual sections,
- thin templates,
- empty category, tag or filter pages.
This turns “Thin content”, “Low fact density” and “Missing source evidence” into concrete optimization tasks.
Example: 30 Thin Articles Become 8 Strong Clusters #
Initial Situation #
A B2B software provider has 30 short blog and glossary articles with 200 to 400 words. Many cover similar topics.
RankScan reports:
“Thin content”
“Low fact density”
“Missing source evidence”
Analysis #
- many articles do not rank,
- several URLs compete for similar search intents,
- content contains hardly any examples,
- sources are missing,
- internal linking is weak,
- topics feel fragmented.
Solution #
- Group topics.
- Define eight strong cluster pages.
- Consolidate thin content.
- Add original examples and product data.
- Link sources and studies.
- Improve the H2/H3 structure.
- Redirect old URLs to suitable new pages with 301 redirects.
- Bundle internal links on cluster pages.
- Crawl again after deployment.
Result #
The website has fewer but stronger pages. Relevance, fact density and source quality increase. This does not guarantee better rankings, but it creates significantly better conditions for classic SEO and AI visibility.
Common Mistakes With High-Quality Content #
Mistake 1: Treating Word Count as the Goal #
More words do not automatically mean better content.
Mistake 2: Merely Retelling the Top 10 #
If you only summarize existing rankings, you create no informational gain.
Mistake 3: Publishing AI Texts Without Review #
Automated content without editorial review, original perspective and added value can be problematic.
Mistake 4: Leaving Out Sources #
This weakens credibility, especially for numbers, trends and legal, financial or health-related topics.
Mistake 5: Only Changing the Date on Outdated Content #
A new date without a real update is not a quality improvement.
Mistake 6: Hiding Thin Content With Canonical or Noindex #
Technical control can be useful. But the underlying quality problem should still be solved strategically.
Mistake 7: Creating Too Many Similar Pages #
More URLs can dilute authority if each page offers only little unique value.
Checklist: Review High-Quality Content #
Use this checklist:
- Which search intent should the page fulfill?
- Does the page type match the SERP?
- Does the page answer the topic completely?
- Are there concrete definitions?
- Are there examples?
- Are there numbers or data?
- Are there sources for strong claims?
- Is there original experience or proprietary data?
- Is the content up to date?
- Are author and visible?
- Is there a clear H2/H3 structure?
- Are sections citeable?
- Is there redundancy with other pages?
- Are there internal links to suitable cluster pages?
- Should the page be optimized, consolidated, set to noindex or removed?
In addition, content updates, summaries and key takeaways, Entity SEO and Schema Markup help narrow down the cause and prioritize the next SEO measures.
FAQ About Thin Content and High-Quality Content #
What is thin content?
Thin content is content with too little unique added value. This can include short, superficial, copied, redundant or automatically generated content without real help for users.
Is thin content always short content?
No. Long texts can also be thin content if they contain many empty phrases and provide little concrete information.
How do I identify thin content?
Typical signals include low impressions, no rankings, little main content, missing search intent, missing sources, low fact density and strong overlap with other pages.
What does fact density mean?
Fact density describes how many concrete, verifiable pieces of information a text contains. This includes definitions, numbers, examples, data, criteria, sources and clear statements.
Why are sources important?
Sources strengthen trust and traceability. Reliable sources should be linked especially for statistics, studies, technical standards and sensitive topics.
Should I delete thin pages?
Not always. First check whether they have potential. Possible measures include updating, expanding, consolidating, setting noindex, deleting or merging through redirects.
Does high-quality content help with AI visibility?
It can help because AI systems can process clear, structured, fact-rich and sourced passages more easily. A mention or citation is not guaranteed.
How often should I update content?
That depends on the topic. Technical, legal, medical, financial and trend-dependent content should be reviewed more regularly than timeless foundational articles.
What is citeable content?
Citeable content contains clear definitions, concrete data, reliable sources, traceable statements and well-structured sections.
Conclusion: High-Quality Content Is Documented Substance #
Thin content is not a problem of short texts. It is a problem of missing substance.
The RankScan insights “Thin content”, “Low fact density” and “Missing source evidence” show where content does not help enough, does not provide enough evidence or does not have enough original informational power.
The best approach is:
- clarify page type and search intent,
- check performance and potential,
- replace empty phrases with facts,
- support strong claims with sources,
- add original experience and data,
- consolidate similar pages,
- strengthen internal links,
- check again after the revision.
This creates content that is not just longer, but better: more specific, more trustworthy, more useful and more citeable.