Google patent granted: Rank adjustments via click data

by Patrick Altoft on / 11 responses

Google has this week been granted a patent for adjusting search rankings based on click data for related queries.

The patent, entitled Rank-adjusted content items, talks about the same types of query refinements we discussed earlier this month in the satisfaction rates post and in previous brand update analysis.

In plain English the patent explains that Google may be counting the number of people who search for “insurance” and then search for “confused.com”. Google can then analyse the queries and determine whether they are statistically significant enough for confused.com to be displayed in the rankings for “insurance”.

Rank-adjusted content items

Abstract
Click logs and query logs are processed to identify statistical search patterns. A search session is compared to the statistical search patterns. Content items responsive to a query of the search session are identified, and a ranking of the content items is adjusted based on the comparison.

Inventors: Datar; Mayur (Santa Clara, CA), Dhamdhere; Kedar (Sunnyvale, CA), Garg; Ashutosh (Sunnyvale, CA)
Assignee: Google Inc. (Mountain View, CA)
Appl. No.: 11/694,268
Filed: March 30, 2007

Content items, e.g., video and/or audio files, web pages for particular subjects, news articles, etc., can be identified by a search engine in response to a query. The query can include one or more search terms, and the search engine can identify and rank the content items based on the search terms in the query. Typically the content items are displayed according to the rank.

The content items, however, are often identified only in response to a particular query, i.e., the search engine may identify and rank content items to independently for each query. For example, for three different queries, the search, engine may return a particular identification and rank of content items for each particular query, regardless of the other queries. In such implementations, a particular content item that may be highly relevant to a user’s current interests may not be identified and/or highly ranked and presented to the user until the user has conducted multiple searches. Additionally, other users may experience similar challenges when searching for content.

SUMMARY

Disclosed herein are systems and methods of identifying content items. In one implementation, click logs and query logs are processed to identify statistical search patterns based on the click logs and query logs. A search session is compared to the statistical search patterns. Content items responsive to a query of the search session are identified, and a ranking of the content items is adjusted based on the comparison.

In another implementation, query paths and content terminuses associated with query paths are identified. Additionally, a context of a search session is identified and a determination of whether the context is related, to one or more of the query paths is made. Content items responsive to a query of the search session are identified based on the determination.

In another implementation, a system includes a mining engine and an adjusting engine. The mining engine mines click logs and query logs to identify query paths and content terminuses associated with the query paths. The adjusting engine adjusts a ranking of content items responsive to a search session query based on the identified query paths and content terminuses.

In one implementation, identification of a context of a search session facilitates the adjusting of a ranking of one or more content items in response to a search session query. The adjustment can, for example, be based on the likelihood that a current user is searching for the rank-adjusted content items because a statistically significant number of prior users that exhibited a similar behavior to the current user selected the rank-adjusted content items.

Patrick Altoft is Director of Search at Branded3, a Leeds SEO & Digital Agency specialising in SEO, Web Design, Development & Social Media.

Get daily posts direct to your inbox

You can get our blog posts delivered for free by email every day - simply add your email address to the box above, or alternatively you can grab the RSS feed.

Comments

Read the 10 comments below, or add your own!

PaulB
November 1, 2009 at 1:46pm

It is interesting and something ive thought they may have been thinking about for a while. At the same time i really cant see how they can use user generated data on any kind of large scale. If user clicks did have any kind of serious effect on rankings the black hats would be all over it. Why got to a the trouble of link spamming when you can get your bot farm to click on a few google results to achive ranking?

PaulB

Reply

November 1, 2009 at 5:45pm

User data is actually a lot harder to game than link data – think about how hard it is for Google to spot some paid links.

Reply

November 1, 2009 at 2:29pm

Interesting patent. May serve the searches well.

Reply

Dave
November 1, 2009 at 7:56pm

User data would be a lot harder to game – don’t forget google already does a (relatively) good job of stopping click fraud with adwords listings.

Reply

PaulB
November 1, 2009 at 9:11pm

Google is good at detecting click fraud? Not so sure : http://www.theregister.co.uk/2009/10/23/botnet_generated_click_fraud/

Reply

November 2, 2009 at 6:55am

Thanks for the post and for sharing the very resourceful information here.

Reply

November 2, 2009 at 4:49pm

Paul – while I agree with you that click fraud still happens, that report from theregister isn’t solely about Google, its figures come from a range of ad networks.

Reply

PaulB
November 2, 2009 at 6:46pm

I agree that google probably is better than most at detecting click fraud, however there will be one crucial metric missing in serp clicks. When they work out PPC for an add on say adsense they can track if the user who clicked made a sale (if adwords buyer agrees) and this helps them work out visitor quality.

However with serp CTR they have no idea what happens after the click so would seem to me very hard to measure the quality/likelyhood that the request is a bot.

PaulB

Reply

November 2, 2009 at 8:12pm

Hi, here is a Report how Google deals with Click-Fraud.
http://googleblog.blogspot.com/pdf/Tuzhilin_Report.pdf
There are 3-steps to prevent Ffraud and over 1.000 People at Google works on this topic. The rate of invalid clicks is described there as < 0.02%.
I think this paper is worth reading… ;-)
Mario

Reply

December 11, 2009 at 6:52pm

You are all missing the main point. Google already has access to everybodies search behaviour and history. The data mining methods mentioned have already determined, using the law of large numbers and wisdom of crowds, a data set that can be trusted. It’s not something anyone can game on a scale not to be noticed!
Google can then use this trusted dataset with it’s other trusted datasets such as link popularity to produce a ranking score.

Reply

1 trackbacks

Leave a comment

Your email address will not be published. Fields marked with an asterisk are required.
 

  *

  *

You can use one of the following tags:
<a href=""><blockquote><code><em><strike><strong>