The Microsoft Web N-gram service has been in beta since April and researchers, worldwide, have been developing amazing applications using this service. Here is an example featuring Multiword tag clouds developed by Dr. Li Ding from RPI (Rensselaer Polytechnic Institute).
The Web N-gram services provide you access to:
+ Content types: Document Body, Document Title, Anchor Texts
+ Model types: Smoothed models
+ N-gram availability: unigram, bigram, trigram, N-gram with N=4, 5
+ Training size (Body): All documents indexed by Bing in the en-us market
+ Access: Hosted Services by Microsoft
+ Updates: Periodical updates
Late last year, we introduced a private beta testing of the Web N-gram Services. We are now expanding access in the Public Beta Web N-gram Services to include professors, students, and researchers from around the world.
We sat down with Kuansan Wang, Principle Researcher from the Microsoft research team to discuss Web N-gram as well as some of the new announcements that are coming from SIGIR.
The FreePint Family is a family of resources to help information workers be more effective, raise the value of information in their organisations and contribute to success.
'FreePint... provides most of my professional development because it won't come through work and [other resources] just don't cut it.'
FUMSI Forum: Do you have a research question? Post it to the FUMSI Forum, where professionals share Q&A and useful tips on how to Find, Use, Manage and Share Information. It's free.