Thursday, 27th December 2007
Web Archives: Archive-It Adds Advanced Search Interface
Archive-It Adds Advanced Search Option
We regularly post about new permanent web archives being added to the Archive-It collection.
Archive-It is a service from the Internet Archive that works with various groups ("partners") to archive their websites and other important pages, often related to news events. At the moment, Archive-It has 469 collections, encompassing over 255 million URLs. Those totals grow almost daily.
Unlike pages in another Internet Archive project, the Wayback Machine, Archive-It pages can be searched by keyword. The service uses the open source Nutch search software.
Now we can report that Archive-It has recently added an advanced search interface that follows a common design. Options include:
+ Search with all terms (AND), any one of the search terms (OR), or none of the terms (NOT)
+ Limit by host URL
+ Limit by total documents per host
+ Limit by file format (9 file formats, including MS Word documents, PDF files, and QuickTime files)
+ Limit to a specific partner and then (if available) a specific collection
Queries can also be turned into RSS feeds with a single click.
See Also: Selected Recent Collections Added to Archive-It