Papering over the cracks with rel=canonical

by Patrick Altoft on February 13, 2009

The SEO world is alight this morning with the announcement that a new tag, rel=”canonical” will solve the problem websites and search engines have with duplicate content.

Basically if your URLs have additional parameters creating duplicate content you can now tell the engines which page is the canonical version without resorting to a 301 redirect.

So a page with session ID’s or affiliate ID’s such as:
http://www.website.com/page.php?aff=123

http://www.website.com/page.php?sessid=27956924876

Could include the rel=”canonical” tag to tell the engines the real version of the page:

<link rel="canonical" href="http://www.website.com/page.php">

Most people think this is a great idea but I’m not convinced. To start with nobody really knows how the engines will deal with this, will it work quite as well as a 301 redirect? Until we test it’s impossible to know for sure.

The URL http://starwars.wikia.com/wiki/Nelvana_Limited Google gave as an example of a page which used the tag is still indexed although doesn’t rank in the search results which seems strange to me. It certainly isn’t behaving like a 301 redirect in that example.

Sounds great—can I see a live example?
Yes, wikia.com helped us as a trusted tester. For example, you’ll notice that the source code on the URL http://starwars.wikia.com/wiki/Nelvana_Limited specifies its rel=”canonical” as: http://starwars.wikia.com/wiki/Nelvana.

The two URLs are nearly identical to each other, except that Nelvana_Limited, the first URL, contains a brief message near its heading. It’s a good example of using this feature. With rel=”canonical”, properties of the two URLs are consolidated in our index and search results display wikia.com’s intended version.

The main issue with this command is that any developers who are creating sites and content management systems with duplicate content and session ID’s are very unlikely to use the new tag in the correct manner. If people know that this sort of thing is a problem then they have most probably fixed it already and if they don’t know it’s an issue then they will never use the new command.

Everybody in the SEO world knows how to fix duplicate content issues so unless this actually reaches developers and CMS manufacturers it won’t help very much.

Clearly there are situations where a CMS can’t be fixed and in those circumstance the rel=canonical tag will be useful.

My advice is to keep doing things the right way and force the engines to index the right content, as soon as you give them a choice the potential arises for them to choose the wrong page.

When the search engines came out with sitemaps people thought it was a great way to get content indexed, what they failed to consider was that the content wasn’t being indexed for a reason. If a page has enough link juice to rank then it will be indexed, if it doesn’t have enough links to rank then forcing it to be indexed just means you have a page that’s in the index but doesn’t rank and what use is that?

Patrick Altoft is Director of Search at Leeds based digital & SEO agency Branded3. Patrick also runs Blogstorm.

You can get our blog posts delivered for free by email every day - simply add your email address to the box below or alternatively grab the RSS feed.

Read some similar posts

This website uses IntenseDebate comments, but they are not currently loaded because either your browser doesn't support JavaScript, or they didn't load fast enough.

{ 8 comments… read them below or add one }

Patrick Altoft 13 Feb 2009 at 1:11 pm
Find me on Twitter

Also this might be a big red “I’m an SEO trying to make my affiliate links pass PageRank flag”.

More comments from Patrick Altoft
Will Critchlow 13 Feb 2009 at 1:25 pm

Can you use it across domains? Could be useful for syndication of content in that case…

Patrick Altoft 13 Feb 2009 at 1:34 pm
Find me on Twitter

No, you can’t use it across domains.

More comments from Patrick Altoft
Frank the Tank 13 Feb 2009 at 1:35 pm

I think you’re right. This rel-Tag fixes “self-inflicted” problems from my point of view. Lot’s of duplicate content is produce by bad cms-software.

However: this rel-tag might be useful for session-urls (e.g. for web 1.0 online shops ;) )

Ciaran 13 Feb 2009 at 1:39 pm

Brilliant encapsulated what I’ve been trying (but failing) to express all morning.

Gaver 14 Feb 2009 at 2:22 am

I wonder if perhaps this is a bandaid for those without .htaccess permissions or capability.

Trying to accomplish a redirect on a win platform is not an easy task when your a client on a farm and “one of many”.

And even if you have access to .htaccess on a linux farm … the script to accomplish the redirect is not for the masses.

G3

5ubliminal 26 Mar 2009 at 1:46 pm

Try this plugin: Canonical URL Wordpress Plugin.

Zoya 03 Jul 2009 at 5:13 pm

Hello Patrick,

I have a website which is hosted by blogger, I am facing duplicate content issues. Can yuo please advice of anything that can be done to resolve this..

According to google webmaster tools I have changed the keywords in the label and title which made it seem that they are duplicate but infact they were not.

Can you please advice if using canonical in blogger will be of any help. Many Thanks

X.

{ 2 trackbacks }

Suchmaschinen führen rel=canonical ein | Frank the Tank
02.13.09 at 1:21 pm
Can you use rel = canonical to fix duplicate comment problems caused by comment pagination in wordpress? » malcolm coles
02.15.09 at 11:50 pm

Leave a Comment (registration is optional)

Registration is free, takes about 5 seconds and is worth doing.

You can use these HTML tags and attributes:
<a href=""> <b> <blockquote> <code> <em> <i> <strike> <strong>