New Demos from Next Generation Search Group at University of Helsinki
New Demos from The Next Generation Search Group at University of Helsinki
From the NGIR web site:
The Next Generation Information Retrieval group looks at search and information retrieval in a world impacted by Linux and Google where open source and open standards are becoming a dominant paradigm for internet services, and information retrieval is viewed as a key function in productive internet use. The group uses probabilistic and information-theoretic methods to model information retrieval, and is committed to open source software development. The group also believes distributed, semantic-based and multilingual methods will have a central role in the future of information retrieval.
The project will conduct research in the design, use and interoperability of topic-specific search engines with the goal of developing an open source prototype of a distributed, semantic-based search engine. Existing search engines provide poor foundation for semantic web operations...Alvis is funded by EU's Sixth Framework Programme for Research and Technological Development.
A collection of news articles collected from a predefined set of Internet Search related news sites at regular intervals. The database covers the latest articles as well as archives from different popular Web news and blogs resources.
+ SMART
Topic specific search for the recent EU project Statistical Multilingual Analysis for Retrieval and Translation.
Overview of SMART search available here. Uses ALVIS technology.
The crawl uses the ALVIS focussed crawler that is guided by keywords. The key phrases relevant to the crawl are one of the following:
relevance to statistical machine translation with key phrases: cross-?lingual information access, smt systems?, statistical machine translation, textual information access, statistical translation models?, cross-?lingual information retrieval, information extraction,
or both of :
relevance to machine learning with key phrases: machine learning, statistical learning, kernel methods?, string kernels?, rational kernels?, online learning, support vector machines?, SVM, principal component analysis, independent component analysis, PCA, ICA, discriminative language models?, canonical correlation analysis, margin-?based translation models?, statistical language, latent dirichlet, automatic processing,
+ relevance to machine translation with key phrases: machine translation, information retrieval, language models?, translation models?, computational linguistics, lexicon extraction, comprehension aids?, multilingual lexicon, user trials, user evaluation, parallel corpora, language modelling, computer aided translation, comprehension aids, multilingual lexicons?, multilingual corpora, cross-?language information retrieval, natural language processing, multilingual lexicon extraction, human language technology, machine translation technology, machine translation systems?, cross-?lingual information retrieval, linguistic resources.
A family of resources to help information workers be more effective, raise the value of information in their organisations and contribute to success. Read more »
Recently I have found myself cooing over visualisation maps (and heat maps) of health and well being resources. The content rich data is overlayed with mapping technologies, and some interesting themes and patterns are emerging.
A lot of the talk around social media in the last year has been around information overload. Social media has provided us with new and exciting ways to create content. But it has also meant learning new ways to manage and engage with social media tools. Are we teetering on the edge of an information overload precipice?
Information overload is a figment of your imagination. Or a failure of your filter. Or a symptom of your technological submissiveness. Depends on who you ask.
What if you had to sort through 3.5 million articles and social media posts a day and try to pull out the most relevant items for your organisation? What if you then had to cobble it all together into something readable for your top groups and executives in your organisation?
Alacra Compliance saves time by aggregating information from both free and fee-based sources and enabling users to conduct an accurate federated search across these sources (coined “simultaneous search” by Alacra).