Web Archiving Service developers Erik Hetzner and Scott Fisher will present at this year’s Code4Lib conference on the tools they used to re-index over 600 million files in the web archives: “Indexing Big Data with Tika, Solr & MapReduce”. The session takes place on Wednesday, February 8th, 1:00-1:20 Pacific, and live streaming of Code4Lib will be available.
UPDATE: This was a great presentation, particularly for those of you curious to know more about the technical details of web archiving. Slides and video will be linked from here when available. 2/13/12