Receive the weekly sampler of posts and "Resource of the Week".
Subscribe »

Enter your
email address:

My Account »


Bookmark and Share

Testimonial?
If you find ResourceShelf useful, please supply a testimonial »








Home > ResourceBlog > Article

« All ResourceBlog Articles

 

Bookmark and Share   \"Feed\"

Saturday, 26th June 2010

New York Times Releases First Collection of Subject Descriptors as Linked Open Data

If you're not a cataloger, a metadata pro, semantic web expert, etc. the following post from the NY Times Open Blog is still worthy of your attention. It's an interesting and educational read (perhaps a refresher)? It's obviously written for developers and subject experts but we think everyone will take away a few or more facts after reading it. It also contains a very interesting 2001 NY Times article on the origins of the NY Times index.

If you're a metadata expert or developer it contains info about more open data that the New York Times continues to make available on a regular basis and how to use it.

From the Blog Post:

Today we’re releasing our latest batch of subject headings to the Linked Data Cloud. We’re especially excited about this release because it’s the first to contain subject descriptors. Our previous releases contained only subject headings for people, places and organizations. While these subject headings are really useful, they only allow you to explore our archive in terms of “who” and “where.” Our descriptors allow you to query the archive in terms of “what” — and that allows you to find articles based on the subject they discuss. Examples of descriptors include Law and Legislation, Dark Matter (Astronomy) and Ponzi Schemes.

The post continues with brief but useful explanations of scope notes, broad versus narrow tags (with an illustration), and term combinations.

As we move the terms from print to digital, this format translates to term combinations:

+ Accidents and Safety + Airlines and Airplanes = Plane Crash
+ Accidents and Safety + Boats and Boating = Boating Accident
+ Accidents and Safety + Railroads = Train Crash
+ Accidents and Safety + Roads and Traffic = Car Crash

The scope notes include indications about which terms frequently appear in term combinations.

Numbers

+ 498, the number of the most frequently used descriptors now available

+ NY Times plans on releasing all 3500 in the "coming months."

+ Appox. 10,500 tags (of all types are currently available)

To Review All Tags/Descriptors/Subject Headings released to this point, visit this page.

Experts can download and view the SKOS Files (Simple Knowledge Organization System)

The rest of us can browse the vocabulary by letter. In the "R" section you'll see that Recession and Depression is a descriptor. If you click on the URI you'll see the first time the term was used, the last time the term was used, a scope note, and links to the term "in action" on DBPedia and Freebase.

Here's a subject heading for location. Specifically, Red Square in Moscow. Clicking the URI leads you here.

Much of the same as you saw above. Two things we didn't see with the descriptor are:

+ Where the first column reads: owl:same as and has a URL that includes geonames.org (another database you might want to know about. Click and your taken directly to a Google aerial image of Red Square.

+ Further down the page you'll spot:prefLabel - en. This is the preferred label of the location. In this case it's Red Square (Moscow).

Views: 1443



blog comments powered by Disqus

« All ResourceBlog Articles

 

Read about the FreePint FamilyThe FreePint Family is a family of resources to help information workers be more effective, raise the value of information in their organisations and contribute to success.

'FreePint... provides most of my professional development because it won't come through work and [other resources] just don't cut it.'

Read about the FreePint Family »


Visit the FreePint ShopFreePint Shop: FreePint sells reports, resources and subscription products to support your information work and information-related decisions.

Latest: FreePint Volume: Critical Insight on Social Media 2012 (01 Feb 2012) | FUMSI Report: Folio on Conferences and Continuing Professional Development (26 Jan 2012) | FreePint Research Report: Information Governance Policies and Priorities (25 Jan 2012) | Docuticker Report: DocuTips on Health Literacy (19 Jan 2012) | VIP Magazine: 98 (18 Jan 2012)

Browse the FreePint Shop »


FUMSI ForumFUMSI Forum: Do you have a research question? Post it to the FUMSI Forum, where professionals share Q&A and useful tips on how to Find, Use, Manage and Share Information. It's free.

Latest FUMSI Forum postings: Most Shared Content on Finding Information (09 Feb 2012) | Times are changing - a FUMSI Editorial (09 Feb 2012) | [TIPPLE] eBook resources - Share (07 Feb 2012) | Most Shared Content on Sharing Information (01 Feb 2012) | Our own worst enemy? - a FUMSI Editorial (01 Feb 2012)

Visit the FUMSI Forum and post »


VIP LiveWireVIP LiveWire: Offers commentary on emerging news stories of interest to premium content users, vendors and industry insiders.

Latest VIP LiveWire postings: Compliance - it's not just financial (10 Feb 2012) | Social media and BRIC - new report (08 Feb 2012) | Reuters takes the social media pulse (08 Feb 2012) | How to deal with the tech-savvy customer? (08 Feb 2012) | More ways for employers to poke around (01 Feb 2012)

Visit the VIP LiveWire »






Subscribe

Subscribe to the ResourceShelf Newsletter and receive the weekly sampler of posts and Resource of the Week.

Find out more »

ResourceShelf sponsored by:

Article Categories

All Article Categories »

Archive

All Archives »