Receive the weekly sampler of posts and "Resource of the Week".
Subscribe »

Enter your
email address:

My Account »


Bookmark and Share

Testimonial?
If you find ResourceShelf useful, please supply a testimonial »








Home > ResourceBlog > Article

« All ResourceBlog Articles

 

Bookmark and Share   \"Feed\"

Monday, 23rd July 2007

New Essay: Adversarial Information Retrieval: The Manipulation of Web Content

New: Adversarial Information Retrieval: The Manipulation of Web Content
by Dennis Fetterly, Research Software Development Engineer, Microsoft Research

From the abstract:

Focuses on how search engine results can be manipulated by content providers. By using methods such as adding unrelated content to meta tags, duplicating information, and cloaking content so it is indexed differently, Web pages can improve their result rankings. While the economic incentive for achieving high rankings is significant, manipulated results undermine the trust of millions of users. Fetterly suggests research into link-based spam detection and the identification of spam blogs, and advocates the development of a clear set of rules for search engines.

Note to info pros:
For most of you, this article will not be anything you haven't heard about before. Perhaps for some in a bit greater detail.

Like we've said MANY times, it's important for info pros to understand not only how search engines work (technically) but also the business of commercial web search. Places to keep current not only include our blog but also Search Engine Land, Sphinn, Search Engine Roundtable, SEOMoz, Searchblog, and many others.

Of course, many in the SEO business (search engine optimization) do what it takes to properly reverse engineer an engine to get their clients' content to the top of the results. But, just like any other business, others do "whatever it takes" to get the job done.

It's not worth arguing whether it's a good or bad thing because, at least for now, that's the way it works.

This is why info pros need to understand a small degree about how business operates, know about a variety of search tools (or as Danny Sullivan calls them, "voices"), use the best tool for each job (just like selecting the best reference book), build collections of specialty or vertical engines (very important), and take advantage of the advanced search options that for the most part go unused but can help create a more precise result set. Of course, all of this is also worth teaching/training your patrons about. We're talking "drivers ed" of search and IR. Btw, it's also likely patrons will want to know more about the business of search.

Source: ACM (Association for Computing Machinery)

See Also: Make Sure to Review Dennis Fetterly's Web Page for Several Full Text Papers that Might Be of Interest
+ Detecting Spam Web Pages Through Content Analysis
Alexandros Ntoulas, Marc Najork, Mark Manasse, and Dennis Fetterly.
15th International World Wide Web Conference (May 2006). [PDF]

+ Dennis Fetterly, Mark Manasse, Marc Najork, and Janet Wiener. A Large-Scale Study of the Evolution of Web Pages. Software: Practice & Experience, 34(2):213-237, February 2004. [draft]


Category:

Views: 653



blog comments powered by Disqus

« All ResourceBlog Articles

 

Read about the FreePint FamilyThe FreePint Family is a family of resources to help information workers be more effective, raise the value of information in their organisations and contribute to success.

'FreePint... provides most of my professional development because it won't come through work and [other resources] just don't cut it.'

Read about the FreePint Family »


Visit the FreePint ShopFreePint Shop: FreePint sells reports, resources and subscription products to support your information work and information-related decisions.

Latest: FreePint Volume: Critical Insight on Social Media 2012 (01 Feb 2012) | FUMSI Report: Folio on Conferences and Continuing Professional Development (26 Jan 2012) | FreePint Research Report: Information Governance Policies and Priorities (25 Jan 2012) | Docuticker Report: DocuTips on Health Literacy (19 Jan 2012) | VIP Magazine: 98 (18 Jan 2012)

Browse the FreePint Shop »


FUMSI ForumFUMSI Forum: Do you have a research question? Post it to the FUMSI Forum, where professionals share Q&A and useful tips on how to Find, Use, Manage and Share Information. It's free.

Latest FUMSI Forum postings: Most Shared Content on Finding Information (09 Feb 2012) | Times are changing - a FUMSI Editorial (09 Feb 2012) | [TIPPLE] eBook resources - Share (07 Feb 2012) | Most Shared Content on Sharing Information (01 Feb 2012) | Our own worst enemy? - a FUMSI Editorial (01 Feb 2012)

Visit the FUMSI Forum and post »


VIP LiveWireVIP LiveWire: Offers commentary on emerging news stories of interest to premium content users, vendors and industry insiders.

Latest VIP LiveWire postings: Compliance - it's not just financial (10 Feb 2012) | Social media and BRIC - new report (08 Feb 2012) | Reuters takes the social media pulse (08 Feb 2012) | How to deal with the tech-savvy customer? (08 Feb 2012) | More ways for employers to poke around (01 Feb 2012)

Visit the VIP LiveWire »






Subscribe

Subscribe to the ResourceShelf Newsletter and receive the weekly sampler of posts and Resource of the Week.

Find out more »

ResourceShelf sponsored by:

Article Categories

All Article Categories »

Archive

All Archives »