
Thursday, 22nd June 2006

Internet Archive's Brewster Kahle Profiled in a New Article

While digitizing content gets tons of press these days, archiving the web is a very important issue for info pros that deserves lots of attention and work. Although many web archiving projects exist around the globe, the Internet Archive is perhaps the best known.

The IA is home to The Wayback Machine (over 55 million pages archived back to 1996), along with several special collections.

The IA's founder, leader, Internet legend (and ResourceShelf reader (-; ), Brewster Kahle, is profiled in this new News.com report by Elinor Mills.

We were happy to see that Elinor's article talks not only about The Wayback Machine but also about other IA archiving initiatives, including the Open Content Alliance, its live music archive, its moving image archive (including the Prelinger Collection and Vintage Cartoon Collection), and more. Of course, legal issues are also discussed in the article.

The only thing the article is missing is a look at the important work that the IA is doing with its Archive-It program, which allows institutions to create their own web archives. Examples include:

  • NEW Archive: National Government Statistical Websites
    The websites of statistical agencies of countries may contain data, reports, statistical yearbooks, press releases, methodological guides, and other information of continuing interest to social scientists and historians. 75 websites in Roman alphabets from Sub-Saharan Africa, Central Eurasia, East Asia, Latin America, the Near East, Russia and Eastern Europe, and South Asia are included.

It's also worth pointing out that the Internet Archive is working with the National Archives and Records Administration (NARA). In January, NARA, with the help of the IA, released the “2004 Presidential Term Web Harvest” containing over 75 million pages. In March 2006, this archive became keyword searchable using Nutch technology.

The IA's crawler, Heritrix, is open source.

"Let's have a library system that is in the great traditions of Thomas Jefferson, Andrew Carnegie, and the Library of Alexandria," he [Kahle] says while showing a reporter around the Internet Archive's offices in San Francisco's Presidio. "If we are able to build that library again with the vision of the Greeks but the technology of the modern era, that's something to be proud of."

See Also: Archived April 2006 Webinar About Archive-It Program (via Educause)
