Bytestart - The online small business portal
Search over 1500 Articles!


SEO Startup - Create and manage your own professional website and use the Search Engine Optimisation features to get your site noticed by Google. All for just £200 + VAT Click here to find out more.


SEO Duplicate Web Content Penalty Myth Exploded

 print  e-mail 

The "duplicate content penalty" myth is one of the biggest obstacles I face in getting web professionals to embrace reprint content. The myth is that search engines will penalise a site if much of its content is also on other websites.

Clarification: there is a real duplicate content penalty for content that is duplicated with minor or no variation across the pages of a single site. There is also a "mirror" penalty for a site that is more or less substantially duplicating another single site. What I'm talking about here is the reprint of pages of content individually, rather than in a mass, on multiple sites.

Another clarification: "penalty" is a loaded concept in SEO. "Penalty" means that search engines will punish a website for violations of the engine's terms of service. The punishment can mean making it less likely that the site will appear in search results. Punishment can also mean removal from the search engine's index of web pages ("de-indexing" or "delisting").


How have I exploded the "duplicate content penalty" myth?
  • PageRank. Many thousands of high-PageRank sites reprint content and provide content for reprint. The most obvious case is the news wires such as Reuters (PR 8) and the Associated Press (PR 9) that reprint to sites such as www.nytimes.com (PR 10).
  • The proliferation of content reprint sites. There are now hundreds of websites devoted to reprint content because it's a cheap, easy magnet for web traffic, especially search engine traffic.
  • Experience. I've seen significant search engine traffic both from distributing content to be reprinted and from reprinting content on the site.
How I Doubled Search Engine Traffic with Reprint Content

When I first started distributing content for my main site, I was stunned by the highly targeted traffic I got from visitors clicking on the link at the end of the article. Search engine traffic also slowly increased both from the links and from having content on the site.

But I was even more stunned with the search engine traffic I got when I started putting reprint articles on the site in September. I had written quite a number of reprint articles for clients and accumulated a few webmaster "fans" who looked out for my articles to reprint them. I wanted to make it easier for them to find all the reprint articles I had written.

I didn't want to draw too much attention to these articles, which had nothing to do with the main subject of the site, web content. So I secluded the articles in one section of the site.

The articles got a surprising amount of search engine traffic. The traffic was overwhelmingly from Google, and for long multiple-word search strings that just happened to be in the article word for word.


Why was I surprised with all the search engine traffic?
  1. The articles had so little link popularity. The link popularity to the articles came primarily from a single link to the "reprint content" page from the homepage, which linked to category pages, which linked to the articles themselves–three clicks from the homepage. The sitemap was enormous, well over 100 links, so its PageRank contribution was minimal. Since these articles were on the site such a short time I strongly doubt they got any links from other sites.

  2. The articles had so much competition. These articles had been reprinted far more widely than the average reprint article, which is lucky if it makes it into a few dedicated reprint sites. As part of my service I had done most of the legwork of reprinting my clients' articles for them. In fact, I guarantee at least 100 reprints on Google-indexed web pages either for each article or group of articles. So that's up to 100 web pages, sometimes more, that were competing with my web page to appear in search engine results for the search string.

Why Do Reprint Articles Get Search Engine Traffic?

You would think Google would just pick one web page with the article as the authoritative edition and send all the traffic to it.

But that's not how Google works. All the search engines look at factors beyond just the content on the web page. They look at links. Google, at least, claims to look at 100 factors total. Many of these must relate to the content on the page, but not all of them.

The whole experience has given me great insight into what factors Google uses in addition to what we would consider the page itself, and the relative importance of each.

  • Web page titles (the one in the html title tag) are extremely important as tie-breakers between two otherwise equally matched pages. Most reprinters waste the html title, using the article title as the web page title. Set yourself apart by creating unique five-to-ten-word web page titles that include target keywords.

  • Content tweaks. You can also introduce the article with a unique, keyword-laden editor's note, and finish the article off with some keyword-laced comments.

  • Intra-site link popularity and anchor text (that is, for links to the article page from other web pages on the site) are also important. If you can't link to the page from the homepage, keep it as close to the homepage as possible and weed out extraneous links (try putting all your site policies on a single page).

Reprint articles, like the search engine traffic they bring, cost nothing. Don't look a gift horse in the mouth. Forget the "duplicate content penalty." Get in on content reprints and share the search engine wealth.

About the author: Joel Walsh owns UpMarket Content which has Joel's articles available for reprint, and also lets you order the complete website promotion content package of distribution and creation of web content.

Posted December 1, 2005





Latest articles in Google Tips
 
Google Sandbox - new sites taking ages to get hits!
[March 24, 2006] Much has been written about the Google "Sandbox" theory, wherby new sites take many months before they receive search engine referrals. Some thoughts on this theory and links to further reading
 
Google manipulates search results: A boost for small business?
[March 11, 2006] Getting a Top 10 ranking in Google's search results is becoming harder by the day, especially for small businesses with limited budgets. But if a Top 10 ranking for your major keywords has eluded you so far, there may be hope for you.
 
Google Ban - How not to get banned by Google!
[February 6, 2006] Given that Google now provides over 70% of all Internet search traffic, the last possible thing any small business site owner would want is to be banned from the Google index!
 
BMW banned by Google for dubious SEO techniques!
[February 6, 2006] With so many small businesses now reliant on Google for a large chunk of their online traffic, the news that Google appears to have banned the German BMW site from its search results for dubious search engine marketing techniques just goes to show how important it is not to push your luck too far when trying to get high ranking for your keywords.
 
The sparring and spin of the Google dance
[December 21, 2005] The Guardian reports on how unscrupulous firms are manipulating world's leading search engine
 
SEO Duplicate Web Content Penalty Myth Exploded
[December 1, 2005] Discussion of the myth that search engines will penalise a site if much of its content is also on other websites.
 
Under the thumb of Google's Jagger Update
[October 20, 2005] As many of our newsletter subscribers and visitors take a keen interest in search engine ranking changes, I thought I'd write a few words about Google's latest update, nicknamed "Jagger".
 
“Google bombing” and what it can teach us about search engine optimisation
[September 29, 2005] Google bombing involves creating links to a certain page using the target keyword in the link. This is a legitimate method to use to improve search engine rankings, even if it is a method abused by the Google-bombers.
 
What is the Google Dance?
[August 31, 2005] Google is so critical to most site’s search optimisation process that entire sites have been set up purely to analyse how often, and precisely how Google updates its search results each month or so.
 
How long since Google updated Pagerank?
[August 22, 2005] According to one of our favourite web marketing tools (Page Rank Update List History), Google hasn't updates its 'Pagerank' or backlinks for 83 days.
 
 










Blue Egg Ecommerce
Web Design, search engine optimisation service, ecommerce solutions and emarketing specialists.



Free Bytestart News feeds