
12th March 2026

Duplicate content is one of the most misunderstood topics in SEO. Many businesses worry about penalties, while others assume a small percentage of repeated content is harmless. In reality, the issue is neither panic nor percentages.
When multiple versions of the same or similar content exist, search engines must decide which URL to rank. If that decision is unclear, authority fragments and performance becomes unstable.
In this guide, we explain what duplicate content actually is and the different forms it takes on real websites. We then clarify whether duplicate content affects SEO and address the common question of how much duplication is too much. Finally, we outline a prioritised approach to fixing duplicate content without wasting time on low impact changes.
Duplicate content is content that appears at more than one URL. That can mean two identical pages on your own website or it can mean the same content appearing across different domains.
Search engines, like Google, do not apply a penalty simply because duplication exists. The issue is that they must decide which version to index and rank. If you create multiple competing versions, you reduce complicate this process.
Duplicate content is usually a structural issue. Most websites create it through templates, filters or inconsistent URL handling. In this section, we outline the main forms.
This is where two URLs show the same content with no meaningful differences. Common causes of having this form of duplicate content are:
In these cases, search engines will generally select one version as canonical. If you do not define that clearly, they decide for you.
Near-duplicates are pages that are largely similar but not identical. Typical examples of near duplicate web pages include:
If two pages target the same (or very similar) intent and query set, search engines may consolidate them. That often results in ranking volatility or one page suppressing the other.
Boilerplate content is repeated text that appears across many pages. Examples of this include:
Some repetition is normal. The issue arises when repeated content dominates the page and the unique section is minimal. In that case, many URLs begin to look interchangeable.
This is one of the most common technical causes, and relates to where two different URLs load the same (or very similar) content. For example:
/category/shoes
/category/shoes?colour=black
/category/shoes?sort=price
If these URLs are indexable, you create multiple versions of essentially the same page. On large ecommerce sites, this can generate thousands of duplicate or near-duplicate URLs. That, in turn, increases crawl load and dilutes internal link equity.
These forms of duplicate content also fit into three main categories:
This is duplicate content on the same website; i.e. it occurs within your own domain. Common causes include:
Internal duplication is controllable. It is usually resolved through redirects, canonical tags or improved URL structure.
This happens when similar or identical content appears across different websites. Typical scenarios include:
Search engines tend to choose one version to rank. This means that, if your content is identical to competitors, you rely entirely on domain strength rather than differentiation.
Technical duplication is driven by infrastructure rather than copy. Examples of this type of duplication include:
This type of duplication often scales quickly, and can be quite common on large ecommerce platforms. Addressing it usually requires a combination of technical SEO changes and website development support.
First things first, there is no such thing as a duplicate content penalty. Search engines try to assign authority, relevance and trust to one URL per intent. When multiple URLs contain the same or very similar content, those signals fragment. This is why duplicate content affects rankings.
Search engines use links, internal structure and content signals to determine which page should rank. If you create multiple versions of the same page, then external (and internal) links may point to different URLs and search engines must choose a canonical version. Due to this, no single URL accumulates the full amount of authority that it could.
The result of duplicate content then is lower average rankings, ranking volatility and the “wrong” page appearing for a query. During SEO audits , we regularly see competing URLs suppress each other until consolidation occurs.
There is no fixed percentage of duplicate content that becomes dangerous. You will often see claims that 20% or 30% duplication is acceptable. Search engines do not work like that.
The question is not how much content is repeated, but whether duplication fragments authority across multiple URLs. That is what affects rankings, not having “too much duplicate content”.
Do not start by rewriting content. Start by identifying whether you have a structural problem or a content problem. This is especially important as structural issues usually have the bigger impact. With this in mind, follow these steps if you are looking to fix duplicate content on your website or across domains:
Not all duplication needs fixing. Instead, you have a meaningful issue if:
If none of these apply, duplication may not be holding you back.
Structural duplication is usually the biggest source of wasted authority. With this in mind, follow these tips:
Once structural issues are resolved, this is when it makes sense to address content-level duplication. The key things to look out for are:
In many cases, the correct move is consolidation, not rewriting; specifically, merging weaker pages into the strongest version. This also has the advantage of strengthening one page instead of maintaining several underperforming ones.
For duplicate content across websites, such as shared manufacturer descriptions, rewriting everything is rarely efficient. Instead, prioritise high margin or high competition products, as these are ultimately the ones that you want to be ranked.
The key action here is to add unique value beyond the base description content. For example, you could introduce original buying guidance or comparison content that is specific to your business.
Duplicate content only becomes a problem when it costs you rankings and revenue. If multiple URLs compete for the same intent, you make it harder for search engines to rank the page that actually drives sales. To fix that, you simply need to consolidate authority to the pages that matter most.
Klaudia Majewska is an SEO Account Manager responsible for planning, executing and reporting on SEO campaigns across a range of clients. Her work focuses on turning strategy into consistent, measurable performance through clear priorities and ongoing optimisation. Klaudia has a strong technical SEO background and works closely with emerging AI-led search formats. She specialises in making sure products and services are structured and presented in ways that perform across both traditional search results and newer AI-driven search experiences.
Clear up SEO issues that stop the right pages from performing.
Duplicate content is not about penalties or percentages. It becomes a problem when search engines struggle to understand which pages should rank, causing visibility, traffic and enquiries to suffer. Our SEO team helps identify and fix duplicate content issues that affect rankings, crawl efficiency and site performance.
Fix Your SEO Issues