
California Recall Election Project

Through its libraries' preservation program, the University of California (UC) seeks to preserve California's cultural artifacts and provide enduring access to them. The California Digital Library (CDL) recognizes the historical significance of the 2003 California Recall Election and expects that material related to it will have lasting value to UC researchers. We also recognize that much of this material is on the web and therefore extremely volatile. The California Recall Election Project is an undertaking to capture a collection of web sites from the 2003 California gubernatorial recall election and make it available for non-commercial, educational, and scholarly research purposes.

In cooperation with the Stanford Computer Science Department and the San Diego Supercomputer Center, the CDL has crawled and saved web sites associated with this election. Our selection criteria are modeled on the Library of Congress' MINERVA Project.

The next step for the project is to explore options for presenting these materials and providing access to them. The UCLA Online Campaign Literature Archive has also captured web sites related to the election and has begun making them available.

Resources

Bibliographies

Selected Articles/Reports

Technical Reports

  • End user ARC problems: Suggests a new header format to solve upgrade problems. October 28, 2004 [PDF]
  • Open source OAI metadata harvesting tools: Surveys open source OAI harvesting tools for ease of use, installation difficulty, and robustness. Revised December 6, 2004 [PDF]
  • Plagiarex for digesting web site text: Web archivists need a method to detect significant changes in a document, both to reduce redundancy in the archive and to record interesting changes. The plagiarex MD5 is created by calculating the MD5 digest of a list of the five longest lower-case words in a document. December 6, 2004 [PDF]
  • Web crawler requirements: Lists criteria for the next-generation web crawler. November 5, 2004 [PDF]
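
The plagiarex MD5 described in the list above can be illustrated with a minimal Python sketch. This is not the code from the technical report. The summary on this page does not say how words are tokenized, how ties between equally long words are broken, or how the word list is serialized before hashing, so the choices below (lower-casing the text first, treating runs of the letters a to z as words, dropping duplicates, breaking ties alphabetically, joining with single spaces) are all assumptions, and the function names are hypothetical.

```python
import hashlib
import re


def longest_words(text, count=5):
    """Return the `count` longest distinct words in `text`.

    Assumptions (not taken from the report): the text is lower-cased
    first, a word is a run of the letters a-z, duplicates are dropped,
    and ties in length are broken alphabetically so the result is
    deterministic.
    """
    words = set(re.findall(r"[a-z]+", text.lower()))
    return sorted(words, key=lambda w: (-len(w), w))[:count]


def plagiarex_md5(text, count=5):
    """MD5 hex digest of the longest words, joined by single spaces.

    The space-joined serialization is an assumption of this sketch.
    """
    joined = " ".join(longest_words(text, count))
    return hashlib.md5(joined.encode("utf-8")).hexdigest()


original = "The recall election was held in California on October 7, 2003."
minor_edit = "The recall election is held in California on October 7, 2003!"
major_edit = "The recall referendum was held in California on October 7, 2003."

# A change to short words or punctuation leaves the digest unchanged,
# while replacing one of the longest words produces a new digest.
same = plagiarex_md5(original) == plagiarex_md5(minor_edit)
changed = plagiarex_md5(original) != plagiarex_md5(major_edit)
```

Under these assumptions, two captures of a page that differ only in short words or punctuation share a digest and could be treated as redundant, while a capture in which one of the longest words changes gets a different digest.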
Selected Projects
Contact the CDL
  • Questions or comments about recall materials: Email the recall project team at recall2003@cdlib.org