
Friday, 10th August 2007

Enhancing Search and Browse Using Automated Clustering of Subject Metadata

From the abstract:

The Web puzzle of online information resources often hinders end-users from effective and efficient access to these resources. Clustering resources into appropriate subject-based groupings may help alleviate these difficulties, but will it work with heterogeneous material? The University of Michigan and the University of California Irvine joined forces to test automatically enhancing metadata records using the Topic Modeling algorithm on the varied OAIster corpus.

Clustering not only helps searchers find what they are (or think they are) looking for; we think it is just as valuable for surfacing ideas, concepts, names, trends, etc. that a searcher might miss when browsing one record at a time.

Notes 1: We have posted about OAIster many times on ResourceShelf. This database aggregates over 12 million records from more than 854 contributors. It is a very useful and important resource.

Notes 2: Of course, no conversation about clustering can take place without mentioning Raul Valdes-Perez, Co-Founder and CEO of Vivisimo. Vivisimo powers Clusty and offers dynamic clustering for enterprise search. While white papers can only say so much, this paper from Vivisimo (PDF) about information overload and what they call "selective ignorance" is worthy of your attention. You can learn more by browsing the Vivisimo site, reading the company blog (with commentary from Valdes-Perez), and using some of their projects beyond Clusty.
Examples:

+ Clustermed.info
Clusters PubMed results. Note the many ways to cluster, since this search uses controlled fields.

+ The USA government portal search at USASearch.gov/

+ New Zealand Government Search

+ Shakespeare Searched

+ BioMetaCluster

+ Medical Info Search

+ 9/11 Commission Report

+++ and some often missed Clusty features:
+ Cluster Jobs (via Indeed.com)
+ Cluster Blog Posts

and the Cool Clusty Clouds from the Clusty Labs.

Notes 3: Since Ask.com provides "Zoom Related Info" that offers terms to narrow and/or expand a search (based on concepts) and, in some cases, shows related names (again, based on concepts), I'll keep my comments to this example and some of what has been written elsewhere. 1 ||| 2 ||| 3.
Btw, Zoom Related Search is also available with Ask Image search.

Here's an example using a search for Paul McCartney. Note the Zoom Related Results in the left pane.

Thanks to Pete Weiss for the news tip.

