Twitter has a serious duplicate content problem

by Patrick Altoft on March 23, 2009

Twitter seems to have got things the wrong way round when it comes to SEO.

They are blocking the search pages which Google really wants to index and not blocking the pages that really shouldn’t be indexed at all. Currently the api.twitter.com sub-domain has over 2 million pages indexed and the explore.twitter.com sub-domain has over 8 million pages indexed.
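As a rough illustration of the blocking described above, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt rules, the `example.com` hosts, and the `is_crawlable` helper are all assumptions made up for this example; they are not Twitter's actual files. The sketch only shows how a `Disallow` rule on a search path keeps a crawler out, and how a sub-domain would need its own rules to be kept out of the index.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for a main site that blocks its search pages.
MAIN_SITE_ROBOTS = """\
User-agent: *
Disallow: /search
"""

# Hypothetical robots.txt for a sub-domain that should not be indexed at all.
SUBDOMAIN_ROBOTS = """\
User-agent: *
Disallow: /
"""


def is_crawlable(robots_txt: str, url: str, user_agent: str = "Googlebot") -> bool:
    """Return True if the given robots.txt text allows user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Search pages are blocked, ordinary pages are not.
    print(is_crawlable(MAIN_SITE_ROBOTS, "http://example.com/search?q=seo"))
    print(is_crawlable(MAIN_SITE_ROBOTS, "http://example.com/someuser"))
    # Everything on the hypothetical sub-domain is blocked.
    print(is_crawlable(SUBDOMAIN_ROBOTS, "http://api.example.com/someuser"))
```

Each host serves its own robots.txt, so rules on the main domain do nothing for a sub-domain. Note too that robots.txt controls crawling only; a blocked URL can still appear in an index if other sites link to it.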

It’s not clear how Google is finding these pages. More than 10 million indexed pages can’t be the result of bloggers randomly linking to the wrong things. My guess is that a rogue sitemap or RSS application is going wrong.

Patrick Altoft is Director of Search at Leeds-based digital & SEO agency Branded3. Patrick also runs Blogstorm.

{ 3 comments }

Vincent Abry 23 Mar 2009 at 7:15 pm

Maybe Twitter does this intentionally because they don’t want Google to index their search results. Maybe they want people to come and stay on Twitter?

Durward Sobek 24 Mar 2009 at 1:33 pm

Well, Twitter may be facing this dilemma because of SEO but it may not be the only reason.

Vinay 24 Mar 2009 at 6:21 pm

I read something similar a few weeks back: Twitter Domain Duplicate Content Issues – http://hwork.org/2009/03/06/twitter-domain-duplicate-content-issues/

{ 3 tweetbacks }

4 mparent77772 (Marc Parent) 02/09/2010 at 3:53 pm

Twitter blocking the search pages which Google really wants to index [Fact. 2/3 of mine are GONE!]
http://tinyurl.com/dl8tp6

5 creativetype (Dee B ?) 02/09/2010 at 3:53 pm

Twitter has a serious duplicate content problem http://tinyurl.com/dl8tp6

6 patrickaltoft (Patrick Altoft) 02/09/2010 at 3:53 pm

Twitter has a serious duplicate content problem: Twitter seems to have got things the wrong way round when it co.. http://tinyurl.com/dl8tp6
