Receive the weekly sampler of posts and "Resource of the Week".
Subscribe »

Enter your
email address:

My Account »


Bookmark and Share

Testimonial?
If you find ResourceShelf useful, please supply a testimonial »








Home > ResourceBlog > Article

« All ResourceBlog Articles

 

Bookmark and Share   Feed

Tuesday, 4th July 2006

More Googler Research Papers Compiled Online; Object-Level Verticals and Phlat "Personal" Searching (Demo) from MS

First, this article/"pseudo press release" from MS Research Asia discusses "object-level vertical search."

From the "story,"

Object-Level Vertical Search takes a refined approach that is a significant advance from traditional Web search. The latter paradigm is based on a page-level relevance ranking approach, in which pages that receive links from many other pages are adjudged to have more value by the very fact that they are popular. If more people link to a given page, it must have something to offer—that is the presumption.

In reality, we all know what happens. A search query returns a list of Web pages, some of which may have more relevance to what we are seeking, some with less. It’s up to us, then, to start clicking on likely candidates and scanning the pages for the information we want. It works, to a degree. We’re in the neighborhood, but we’re still looking for the right house. Object-Level Vertical Search is designed to put us on the doorstep. “In Object-Level Vertical Search,” Wen says, “we want to extract and integrate information from the Web about specific objects. “For example, in academic search for a researcher, his information may be distributed on different Web sites. We need to collect, extract, and integrate all of this information. On one Web site, we may find the e-mail address of this person. On another Web site, we can find his telephone number and his publications. “We collect all this information and integrate it. Then, after extraction and integration, the results will be a virtual page containing all the related information about this person.”

It goes on to say:

There are a series of steps involved in the Object-Level Vertical Search process:

* Web Crawling: to collect relevant information on the Web efficiently.
* Classification: Does a page contain information on products, papers, people, or some other desired category?
* Extraction: pulling specific information about the search query from the relevant Web pages. For a product, for instance, that could mean product name, brand, image, description, and price.

In many ways, this sounds like:
+ What companies like ZoomInfo are doing. Disparate web resources, one-stop results collected on to a single page.

+ Taking a federated search tool and then, behind the scenes, helping the user select the databases they should query and then building a clean set of results. It also sounds similar to what well qualified info pros have been doing for years and years with tools like Dialog. Yes, it's technically possible to search every Dialog file simultaneously but the great researcher users the tools available and lots of skill to create this object-level vertical process and then a final report for the customer. They also have the advantage of using the structure that Dialog provides.

Next,
a paper from the Text Mining Search and Navigation Research team at MS on personal search and organization:
Fast, flexible filtering with Phlat - Personal search and organization made easy.
by Edward Cutrell, Daniel Robbins, Susan Dumais, Raman Sarin.
Presented at CHI '06: Proceedings of the SIGCHI conference on Human Factors in computing systems
January 2006 New York, NY, USA

From the abstract:

Systems for fast search of personal information are rapidly becoming ubiquitous. Such systems promise to dramatically improve personal information management, yet most are modeled on Web search in which users know very little about the content that they are searching. We describe the design and deployment of a system called Phlat tha optimizes search for personal information with an intuitive interface that merges search and browsing through a variet of associative and contextual cues. In addition, Phlat supports a unified tagging (labeling) scheme for organizing personal content across storage systems (files, email, etc.). The system has been deployed to hundreds of employees within our organization. We report on both quantitative and qualitative aspects of system use.

Want to learn more and demo Phlat? Go to http://research.microsoft.com/adapt/phlat/. It works with Windows Desktop Search. A guide to using Phlat and screen caps are here.

Finally, the Google Labs site has been updated with a large helping of new (2006) papers by Googler's. Here are just a few that we noticed. Some are available on the web while others are not:

+ A Large Scale Study of Wireless Search Behavior: Google Mobile Search

+ Names and Similarities on the Web: Fact Extraction in the Fast Lane

+ Organizing and Searching the World Wide Web of Facts - Step One: the One-Million Fact Extraction Challenge

+ Sender Reputation in a Large Webmail Service

+ Limits to Anti Phishing

+ Retroactive Answering of Search Queries

+ Social- and Interactive-Television Applications Based on Real-Time Ambient-Audio Identification

+ An Experimental Study of the Skype Peer-to-Peer VoIP System

+ Large Scale Image-Based Adult-Content Filtering

+ Using Encyclopedic Knowledge for Named Entity Disambiguation

++ Many more papers available here.

Views: 788




blog comments powered by Disqus

« All ResourceBlog Articles

 

Read about the FreePint FamilyFreePint Family

A family of resources to help information workers be more effective, raise the value of information in their organisations and contribute to success. Read more »


FeedLatest Family Articles:


Click to view the article Quilting big data threads
Thursday, 24th May 2012

Recently I have found myself cooing over visualisation maps (and heat maps) of health and well being resources. The content rich data is overlayed with mapping technologies, and some interesting themes and patterns are emerging.


Click to view the article The fallacy of information overload
Wednesday, 23rd May 2012

A lot of the talk around social media in the last year has been around information overload. Social media has provided us with new and exciting ways to create content. But it has also meant learning new ways to manage and engage with social media tools. Are we teetering on the edge of an information overload precipice?


Click to view the article Information overload: fact, fantasy or filter failure?
Wednesday, 23rd May 2012

Information overload is a figment of your imagination. Or a failure of your filter. Or a symptom of your technological submissiveness. Depends on who you ask.


Click to view the article Newsdesk: tracking millions of pieces of information a day
Tuesday, 22nd May 2012

What if you had to sort through 3.5 million articles and social media posts a day and try to pull out the most relevant items for your organisation? What if you then had to cobble it all together into something readable for your top groups and executives in your organisation?


Click to view the article Alacra Compliance adds managerial oversight
Tuesday, 22nd May 2012

Alacra Compliance saves time by aggregating information from both free and fee-based sources and enabling users to conduct an accurate federated search across these sources (coined “simultaneous search” by Alacra).


All Family Articles »
Family Articles by Category »


Tell us what you're working on,
and we'll talk to you about how FreePint can help »


FreePint Family Testimonials

"Fabulous resource to learn of unique tools and insights. Very useful." Manager, Futures and Forecasting, Virginia, USA

More testimonials »






Subscribe

Subscribe to the ResourceShelf Newsletter and receive the weekly sampler of posts and Resource of the Week.

Find out more »

ResourceShelf sponsored by:

Article Categories

All Article Categories »

Archive

All Archives »