Receive the weekly sampler of posts and "Resource of the Week".
Subscribe »

Enter your
email address:

My Account »


Bookmark and Share

Testimonial?
If you find ResourceShelf useful, please supply a testimonial »








Home > ResourceBlog > Article

« All ResourceBlog Articles

 

Bookmark and Share   \"Feed\"

Wednesday, 24th February 2010

Stephen Levy on Google’s Algorithm

Barry Schwartz at Search Engine Land points out a new "exclusive" article by noted tech writer, Stephen Levy, published in Wired. It tells the story of Google's algorithm (what the company will make public, of course) "rules the web."

Access the Complete Article

Here are a Few Sections of the Article (It's Fairly Long) We Found Most Interesting

This year, Google will introduce 550 or so improvements to its algorithm

Even the Bingers [those who work at Bing] confess that, when it comes to the simple task of taking a search term and returning relevant results, Google is still miles ahead. But they also think that if they can come up with a few areas where Bing excels, people will get used to tapping a different search engine for some kinds of queries.

The search engine currently uses more than 200 signals to help rank its results.

Note: Directly above this sentence in the actual article it mentions how Google exploited hyperlinks and sometimes you would come across a page where your search terms were not found. This could be the reason. This is still an issue. Just today we came across several pages that did not have our search terms.

The most recent major change, codenamed Caffeine, revamped the entire indexing system to make it even easier for engineers to add signals.

The Article Includes a Chart of "Some of the Most Significant Additions and Adaptations Since the Dawn of PageRank."

1. BackRub, 1997
This is the product that became Google. Here's an archived version of the home page.

2. August, 2001: "New Algorithm"

3. February, 2003: "Local connectivity analysis"

4. Summer, 2003: "Fritz"

This initiative allows Google to update its index constantly, instead of in big batches.

5. June, 2005: "Personalized Results"

6. December, 2005: "Big Daddy"

Engine update allows for more-comprehensive Web crawling.

7. May, 2007: Universal Search

8. December 2009:

Displays results from Twitter and blogs as they are published.

NOTE: Want to learn more about Google and PageRank? The original research paper written by Larry Page and Sergey Brin, "The Anatomy of a Large-Scale Hypertext Search Engine (PDF) is still an engaging, interesting, and for the most part, not to technical of a read. Remember, the paper is more than 10 years old and things have changed.

Two things missing in our view from Levy's article are an example or two of things that did not work and why. Of course, Google's success has been nothing short of amazing and some might even say unique. But, that doesn't mean they haven't had a problem or two. It would be interesting to learn about them.

Also, it would have been worth a paragraph or two to discuss search algorithms that preceded PageRank. We're specifically thinking about the work of Jon Kleinberg and his hubs and authorities approach. Several articles in a section titled, "Web Analysis and Search: Hubs and Authorities" on this page are worth looking at. This paper from Scientific American (1999) is excellent and actually does some comparing of core concepts between Google and Clever. Kleinberg was a member of IBM's Clever team. The engine itself was never publicly released but their web site is still online. You have to wonder if the world would be a different place if Clever was released before Google (which is possible). Finally, one of the papers on Kleinberg's page, "Authoritative Sources in a Hyperlinked Environment," is cited by Page and Brin in their "Anatomy" paper.

Source: Wired


Category:

Views: 502



blog comments powered by Disqus

« All ResourceBlog Articles

 

Read about the FreePint FamilyThe FreePint Family is a family of resources to help information workers be more effective, raise the value of information in their organisations and contribute to success.

'FreePint... provides most of my professional development because it won't come through work and [other resources] just don't cut it.'

Read about the FreePint Family »


Visit the FreePint ShopFreePint Shop: FreePint sells reports, resources and subscription products to support your information work and information-related decisions.

Latest: FreePint Volume: Critical Insight on Social Media 2012 (01 Feb 2012) | FUMSI Report: Folio on Conferences and Continuing Professional Development (26 Jan 2012) | FreePint Research Report: Information Governance Policies and Priorities (25 Jan 2012) | Docuticker Report: DocuTips on Health Literacy (19 Jan 2012) | VIP Magazine: 98 (18 Jan 2012)

Browse the FreePint Shop »


FUMSI ForumFUMSI Forum: Do you have a research question? Post it to the FUMSI Forum, where professionals share Q&A and useful tips on how to Find, Use, Manage and Share Information. It's free.

Latest FUMSI Forum postings: Most Shared Content on Finding Information (09 Feb 2012) | Times are changing - a FUMSI Editorial (09 Feb 2012) | [TIPPLE] eBook resources - Share (07 Feb 2012) | Most Shared Content on Sharing Information (01 Feb 2012) | Our own worst enemy? - a FUMSI Editorial (01 Feb 2012)

Visit the FUMSI Forum and post »


VIP LiveWireVIP LiveWire: Offers commentary on emerging news stories of interest to premium content users, vendors and industry insiders.

Latest VIP LiveWire postings: Compliance - it's not just financial (10 Feb 2012) | Social media and BRIC - new report (08 Feb 2012) | Reuters takes the social media pulse (08 Feb 2012) | How to deal with the tech-savvy customer? (08 Feb 2012) | More ways for employers to poke around (01 Feb 2012)

Visit the VIP LiveWire »






Subscribe

Subscribe to the ResourceShelf Newsletter and receive the weekly sampler of posts and Resource of the Week.

Find out more »

ResourceShelf sponsored by:

Article Categories

All Article Categories »

Archive

All Archives »