The news release does confirm something we mention further along in this post.
While the Twitter archive will not be posted online, the Library envisions posting selected content around topics or themes, similar to existing VHP (Veterans History Project) presentations.
The news release also contains info about the (National Digital Information Infrastructure and Preservation Program (NDIIPP) where we frequently link to digital preservation news, accessible projects, profiles, and presentations.
It also contains several quotes. Here's one from the Dr. James Bilington, the Librarian of Congress:
"The Twitter digital archive has extraordinary potential for research into our contemporary way of life," said Librarian of Congress James H. Billington. "This information provides detailed evidence about how technology based social networks form and evolve over time. The collection also documents a remarkable range of social trends. Anyone who wants to understand how an ever-broadening public is using social media to engage in an ongoing debate regarding social and cultural issues will have need of this material."
Billington added: "The Library looks at this as an opportunity to add new kinds of information without subtracting from our responsibility to manage our overall collection. Working with the Twitter archive will also help the Library extend its capability to provide stewardship for very large sets of born-digital materials."
Just two posts down the page from this one is news about Google's latest release, a searchable "timeline" archive of Twitter tweets (aka Google Replay) that will eventually date back to the beginning of the service. Twitter archive search from Google is rolling out over the next few days but we've include a link to use it now.
At the conclusion of that post, we wondered if others would also get access to the archive and build their own services. We were specifically thinking of Bing and services like Collecta,Scoopler, and others.
Expect to see an emphasis on the scholarly and research implications of the acquisition.
One thing we would like to know is if the Twitter archive (via the Library of Congress) will be searchable online by the general public or only accessible to researchers who visit LC in DC.
From what we have pieced together, the LC/Twitter archive database will not be accessible to the public via the web or at LC itself.
Btw, Matt's blog post includes some well-known (you might want to also consider using the word "historic") tweets. Nice work.
A blog post begins with info about the LC preservation project and concludes with info about Google's searchable archive (aka Google Replay). We also get a statistic.
Today, fifty-five million tweets a day are sent to Twitter and that number is climbing sharply. A tiny percentage of accounts are protected but most of these tweets are created with the intent that they will be publicly available.
We also learn the following:
It should be noted that there are some specifics regarding this arrangement. Only after a six-month delay can the Tweets will be used for internal library use, for non-commercial research, for public display by the library (yet another reason to visit LC when your in DC), and for preservation.
An LC announcement is do out soon. We will update this post with it. (Update: 11/15, Noon, Still Waiting).
We also remain interested in knowing if Bing/Microsoft and other Twitter engines will be able to offer the public a searchable archive?
A family of resources to help information workers be more effective, raise the value of information in their organisations and contribute to success. Read more »
Recently I have found myself cooing over visualisation maps (and heat maps) of health and well being resources. The content rich data is overlayed with mapping technologies, and some interesting themes and patterns are emerging.
A lot of the talk around social media in the last year has been around information overload. Social media has provided us with new and exciting ways to create content. But it has also meant learning new ways to manage and engage with social media tools. Are we teetering on the edge of an information overload precipice?
Information overload is a figment of your imagination. Or a failure of your filter. Or a symptom of your technological submissiveness. Depends on who you ask.
What if you had to sort through 3.5 million articles and social media posts a day and try to pull out the most relevant items for your organisation? What if you then had to cobble it all together into something readable for your top groups and executives in your organisation?
Alacra Compliance saves time by aggregating information from both free and fee-based sources and enabling users to conduct an accurate federated search across these sources (coined “simultaneous search” by Alacra).