Recent Enhancements, News, and Activities

  • WAS 2.0 Release re-scheduled for 2013. One of the key, new features of WAS 2.0 will be the Solr index. It will bring faster and more accurate search results, advanced search features, and a more robust indexing system. In order to implement the Solr index, the entire collection, which is now over 50 Terabytes, has to be re-indexed. This is a very complex and machine intensive process. Due to some complications encountered during the re-indexing, the WAS 2.0 release has been postponed to early 2013. All the development on the UI enhancements and new curator tools has been completed, but since these developments are inseparable from the new Solr indexing infrastructure, their release has also been postponed to 2013.
  • WAS public site redesign moving forward. The graphic design is completed and the project has now moved to the CDL Web Production team for implementation.
  • New WAS logo and tagline. In order to align the WAS logo with the other services in the UC3 portfolio, a new logo was created (see below).  In addition, a new descriptive tagline now provides a clear overview of the WAS service.

 

  • WAS presentation at UC3 workshops.  Two 2-day workshops Nov 8-9 in Oakland and Nov 13-14 in Irvine introduced library staff to basic data curation concepts, presented the challenges and solutions involved in delivering data curation services, provided information about UC3’s tools and services, and shared campus solutions and approaches to data curation services.  100 participants attended the workshops.  Rosalie Lack presented an overview of WAS (see presentation http://bit.ly/U196aC [pdf]) and discussed how institutions are currently using it and the future challenges of web archiving.
  • IIPC training. Rosalie Lack attended a 5-day International Internet Preservation Consortium-sponsored training, “How to Fit In? Integrating a web archiving program in your organization”. The workshop was hosted and organized by the French National Library and was attended by staff from libraries across Europe, Egypt, and the US. At the workshop, a well-rounded picture of how web archiving could be integrated into existing library systems was presented, new developments in web archiving showcased, and challenges were discussed.
  • Digital Library Federation (DLF) and NGTS activities. Erik Hetzner, WAS Tech Lead, attended DLF in early November.  As a member of a UC Next-Generation Technical Services (NGTS) lightening team,  along with other members — Todd Grappone (UCLA), Declan Fleming (UCSD), Brian Tingle (CDL) and Susan Perry (UCSC) – organized a birds of a feather session at which representatives from Hydra and Islandora discussed their services. Their lightening team (1.C) is charged with developing a model for a UC systemwide DAMS. Learn more: http://ucngts.tumblr.com/
  • End of Term (EOT) Archive crawls. CDL is working in collaboration with the Harvard University Library, Internet Archive, Library of Congress, University of North Texas Libraries, and the U.S. Government Printing Office on a continued partnership to archive 2012-2013 End of Term (EOT) sites.  Internet Archive’s final crawl for 2012 was completed mid November; there will be additional crawls run in early January and continue through inauguration day.  Background information available here: http://eotarchive.cdlib.org/2012.html

 

New public content: UCLA Library goes public with Occupy Web Archive

The UCLA Library Occupy Web Archive is now available for searching and browsing on the WAS site. The collection documents local Occupy movements and events on the west coast of the United States, Mexico, and Brazil.  The collection provides a geographic, topic browse that makes it easy to navigate through over 30 cities, states and countries.

Available here:  http://webarchives.cdlib.org/a/occupy

WAS Activity, November/December 2012

  • 4353 archives actively collected
  • 2773 sites collected
  • 3.5 TB of data collected

WAS Service Description

The Web Archiving Service (WAS) enables librarians, archivists and researchers to capture, curate and preserve websites and web‐published materials.   WAS makes it easy to build web archives, with scheduling and other tools to help manage your archive.  You control public access to your archives and can configure the appearance and navigation of each archive.  We also provide collection development consultation and help desk support for web archiving questions.

WAS Service Manager
Rosalie Lack rosalie.lack@ucop.edu or washelp@ucop.edu

WAS Training Materials, Guides, FAQs and Webinars
WAS training materials and guides available here: http://webarchives.cdlib.org/p/curators

Service Monitoring and Availability
Check system status page http://www.cdlib.org/contact/system.html