<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>California Digital Library &#187; Web Archiving</title>
	<atom:link href="http://www.cdlib.org/cdlinfo/category/digital-preservation/web-archiving/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.cdlib.org/cdlinfo</link>
	<description>The Official CDL Blog</description>
	<lastBuildDate>Mon, 20 May 2013 21:54:33 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>Five things I learned at IIPC</title>
		<link>http://www.cdlib.org/cdlinfo/2013/05/16/five-things-i-learned-at-iipc/</link>
		<comments>http://www.cdlib.org/cdlinfo/2013/05/16/five-things-i-learned-at-iipc/#comments</comments>
		<pubDate>Thu, 16 May 2013 13:54:12 +0000</pubDate>
		<dc:creator>Rosalie Lack</dc:creator>
				<category><![CDATA[Digital Preservation (UC3)]]></category>
		<category><![CDATA[Newsletter]]></category>
		<category><![CDATA[Web Archiving]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=13607</guid>
		<description><![CDATA[I recently attended the International Internet Preservation Consortium (IIPC) General Assembly.  The IIPC is a consortium of libraries, academic institutions and other organization engaged in web archiving.  The IIPC’s General Assembly included  ... <a href="http://www.cdlib.org/cdlinfo/2013/05/16/five-things-i-learned-at-iipc/">More</a>...]]></description>
				<content:encoded><![CDATA[<p><a href="http://netpreserve.org/"><img class="alignleft" style="border: 0px;margin-left: 8px;margin-right: 8px" title="IIPC logo" alt="iipclogo" src="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/05/iipclogo-300x93.png" width="192" height="59" /></a>I recently attended the <a href="http://netpreserve.org/general-assembly/2013/program">International Internet Preservation Consortium (IIPC) General Assembly</a>. <strong> </strong>The IIPC is a consortium of libraries, academic institutions and other organization engaged in web archiving.  The<strong> </strong>IIPC’s General Assembly included three days of member meetings and two days of meetings open to the public. The theme of the public conference day was: Scholarly Access to Web Archives: Progress, Requirements, and Challenges.</p>
<p>Ahmed Alsum from the Web Science and Digital Libraries Research Group at Old Dominion University posted a comprehensive <a href="http://ws-dl.blogspot.com/2013/05/2013-04-22-iipc-ga-2013.html">summary of the GA</a>. As you can see from his summary, there were many great presentations and discussions. It was very hard to choose just five things to share, but here they are:</p>
<h3>1. Dark(ish) Archives</h3>
<p>Because of copyright and privacy issues, many of the national libraries in Europe cannot provide online, public access to their web archives. They can only allow access in the library and many do not even allow printing in the library. So, how do you raise the awareness of web archiving when no one can see the archives?! There was much discussion about creating site lists/registries for the sites in these archives – some felt this would only lead to disappointment when the user finds out that they have to travel to the archive to see the materials. Sound familiar? Yes, finding aids. And YES, they are extremely useful!</p>
<div id="attachment_13625" class="wp-caption alignright" style="width: 203px"><a href="http://commons.wikimedia.org/wiki/File:TequilaToolsMuseum.JPG"><img class=" wp-image-13625          " style="border: 40px" title="Harvesting tools" alt="harvesting tools" src="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/05/harvesting-tools.jpg" width="193" height="252" /></a><p class="wp-caption-text">Harvesting tools. <br />Source: WikiMedia Commons</p></div>
<h3>2. Common tools support is critical</h3>
<p>Most IIPC members are using the same suite of OS tools –Heritrix and Open Source Wayback. There was a lot of concern about the development path for these tools. At member breakout sessions where future paths were discussed and prioritized, there was a clear message that tools are important. The IIPC steering committee quickly responded by supporting tool management as a top priority for the organization. Be on the lookout for updates and concrete plans shortly.</p>
<h3>3. National and University Library of Slovenia is innovative</h3>
<p>Besides being wonderful conference hosts, the National and University Library of Slovenia is doing some innovative work when it comes to web archiving. They demo’d a prototype (see <a href="http://netpreserve.org/sites/default/files/resources/Predstavitev_07.pdf">screenshots</a>) of a tool for end users to engage and interact with web archives. It includes features such as: gather and save sites; annotate sites; tag sites; and crowd sourcing of metadata. The next generation of web archives is here!</p>
<h3>4. Researchers use of web archives</h3>
<p>There were several informative presentations by researchers about how they are using web archives. <a href="http://telemme.mmsh.univ-aix.fr/membres/Sophie_Gebeil">Sophie Geibeil</a>, an historian from Aix-Marseille-Université, uses the archives to study the untold story of North African immigration. <a href="http://www.luc.edu/soc/academics_facultystaff_doughteryM.shtml">Megan Dougherty</a>, a social scientist from Loyola University, was not as interested in site content as much as she was in taking an anthropological point of view of the sites; that is, studying the social aspects of sites – how people share sites, interact with sites, etc. Niels Brügger, from <a href="http://www.netlab.dk/">Netlab</a> at Aarhus University, discussed their various research projects in the areas of digital humanities and internet studies, including <a href="http://www.netlab.dk/projects/resaw/">RESAW</a> and <a href="http://www.netlab.dk/projects/p6-fundamental-tools-for-web-archive-research-futarc/">FUTARC</a>. Helen Hockx-Yu, <a href="http://www.webarchive.org.uk/ukwa/">UK Web Archive</a> at the British Library, presented <a href="http://britishlibrary.typepad.co.uk/webarchive/2012/07/uk-web-archive-in-the-eyes-of-scholars.html">UK Web Archives in the eyes of scholars</a>.  She made the case for thinking of the archives not as documents but rather as large datasets for data mining and analysis.</p>
<h3>5. Dancing into the future?</h3>
<p>David Rosenthal provided historical context for the early days of web crawling and also provided some future challenges. Not surprising, the web as mainly HTML links is rapidly becoming a thing of the past. Turns out all the current, problematic areas (including rich media, database driven features, dynamically generated URIs, etc.) remain challenging, and now add to that the fact that the new web is more and more JavaScript. Is there no rest for the weary? David did leave us with a light at the end of tunnel. He talked about recent work by the Institute National de L’Audiovisuel (INA) in Paris. The team there created a live archive proxy that shows great promise to enable the capture of some of the more problematic content. Also, there is Memento, which provides an aggregation of web archives each collected in slightly different ways by institutions so moving toward covering all the bases.</p>
<div id="attachment_13631" class="wp-caption alignleft" style="width: 206px"><a href="http://commons.wikimedia.org/wiki/File:AdeleFred1921.jpg"><img class=" wp-image-13631            " style="border: 30px" title="Fred Astaire" alt="AdeleFred1921" src="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/05/AdeleFred1921.jpg" width="196" height="163" /></a><p class="wp-caption-text">Fred Astaire. <br />Source: WikiMedia Commons</p></div>
<p>David also had a great analogy for one of the challenges of preserving the web today; he says it is like “preserving theatre or dance” because as we view the web it changes to become an individual experience based on who we are, displaying customized ads and other personalized content.   As he put it: “Every performance is a unique and unrepeatable interaction between the performers, in this case a vast collection of dynamically changing databases, and the audience. Actually, it is even worse. Preserving the Web is like preserving a dance performed billions of times, each time for an audience of one, who is also the director of their individual performance.” (Source: <a href="http://blog.dshr.org/2013/04/talk-on-harvesting-future-web-at.html">http://blog.dshr.org/2013/04/talk-on-harvesting-future-web-at.html</a>)</p>
<p>Overall, it was an excellent, thought-provoking conference. Clearly there are lots of challenges ahead for web archiving, but so many more opportunities.</p>
<p>&nbsp;</p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=13607" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2013/05/16/five-things-i-learned-at-iipc/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>WAS Service Update: April 2013</title>
		<link>http://www.cdlib.org/cdlinfo/2013/05/10/was-service-update-april-2013/</link>
		<comments>http://www.cdlib.org/cdlinfo/2013/05/10/was-service-update-april-2013/#comments</comments>
		<pubDate>Fri, 10 May 2013 19:29:55 +0000</pubDate>
		<dc:creator>Rosalie Lack</dc:creator>
				<category><![CDATA[Digital Preservation (UC3)]]></category>
		<category><![CDATA[Newsletter]]></category>
		<category><![CDATA[Web Archiving]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=13544</guid>
		<description><![CDATA[ Recent Enhancements, News, and Activities  WAS at Society of California Archivists (SCA), Berkeley, CA. April 11 – 13. The WAS team attended and staffed a booth. Rosalie Lack  ... <a href="http://www.cdlib.org/cdlinfo/2013/05/10/was-service-update-april-2013/">More</a>...]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/05/was_logo.jpg"><img src="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/05/was_logo.jpg" alt="was_logo" width="324" height="62" class="aligncenter size-full wp-image-13545" /></a></p>
<p><strong>Recent Enhancements, News, and Activities</strong></p>
<ul>
<li><strong>WAS at Society of California Archivists (SCA), Berkeley, CA. April 11 – 13.</strong> The WAS team attended and staffed a booth. Rosalie Lack also gave a short presentation to the Online Archive of California (OAC)’s Users Group meeting.  <a href="http://www.slideshare.net/rosalielack/society-of-california-archivists-sca">http://www.slideshare.net/rosalielack/society-of-california-archivists-sca</a></li>
<li><strong>International Internet Preservation Consortium General Assembly (IIPC), Ljubljana, Slovenia. April 22‐ 26.</strong> The IIPC is a consortium of libraries, academic institutions and other organization engaged in web archiving. IIPC actively promotes the development of open source tools, standards and best practices in web archiving. Rosalie Lack participated as an IIPC member as well as a member of the IIPC’s Access Working Group member. The <a href="http://netpreserve.org/general-assembly/2013/program">IIPC’s General Assembly</a> includes 3 days of member meetings and 2 days open to the public. The theme of the public conference day was: Scholarly Access to Web Archives: Progress, Requirements, and Challenges.</li>
<li><strong>Bug fixes.</strong> The WAS technical team (Erik Hetzner and Scott Fisher) fixed several ongoing, but intermittent bugs that were causing display issues in both the public and curator interfaces. You should see a marked improvement in display. A reminder to please report any problems that you encounter by sending an email to: <a href="mailto:washelp@ucop.edu">washelp@ucop.edu</a>.</li>
</ul>
<p><strong>WAS Activity, April 2013</strong></p>
<ul>
<li>113 archives actively collected</li>
<li>1340 sites collected</li>
<li>1.5 TB of data collected</li>
</ul>
<p><strong>WAS Service Description</strong><br />
The Web Archiving Service (WAS) enables librarians, archivists and researchers to capture, curate and preserve websites and web‐published materials. WAS makes it easy to build web archives, with scheduling and other tools to help manage your archive. You control public access to your archives and can configure the appearance and navigation of each archive. We also provide collection development consultation and help desk support for web archiving questions.</p>
<p>
<strong>WAS Service Manager</strong><br />
Rosalie Lack, <a href="mailto:rosalie.lack@ucop.edu">rosalie.lack@ucop.edu</a> or <a href="mailto:washelp@ucop.edu">washelp@ucop.edu</a>.</p>
<p>
<strong>WAS Training Materials, Guides, FAQs and Webinars</strong><br />
WAS training materials and guides are available here:  <a href="http://webarchives.cdlib.org/p/curators">http://webarchives.cdlib.org/p/curators</a>.</p>
<p>
<strong>Service Monitoring and Availability</strong><br />
Check CDL&#8217;s system status page at <a href="http://www.cdlib.org/contact/system.html">http://www.cdlib.org/contact/system.html</a>.</p>
<p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=13544" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2013/05/10/was-service-update-april-2013/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>WAS Service Update: March 2013</title>
		<link>http://www.cdlib.org/cdlinfo/2013/04/18/was-service-update-march-2013/</link>
		<comments>http://www.cdlib.org/cdlinfo/2013/04/18/was-service-update-march-2013/#comments</comments>
		<pubDate>Thu, 18 Apr 2013 21:44:16 +0000</pubDate>
		<dc:creator>Rosalie Lack</dc:creator>
				<category><![CDATA[Digital Preservation (UC3)]]></category>
		<category><![CDATA[Newsletter]]></category>
		<category><![CDATA[Web Archiving]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=13364</guid>
		<description><![CDATA[ Recent Enhancements, News, and Activities  Upgrade to Wayback 1.7. We upgraded the version of Wayback (the tool that is used to replay web crawls). This has resulted in  ... <a href="http://www.cdlib.org/cdlinfo/2013/04/18/was-service-update-march-2013/">More</a>...]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/04/was_logo.jpg"><img class="aligncenter size-full wp-image-13365" alt="was_logo" src="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/04/was_logo.jpg" width="324" height="62" /></a></p>
<p><strong>Recent Enhancements, News, and Activities</strong></p>
<ul>
<li><strong>Upgrade to Wayback 1.7.</strong> We upgraded the version of Wayback (the tool that is used to replay web crawls). This has resulted in increased general reliability, as well as improved display.</li>
<li><strong>End of Term (EOT) Archive crawls.</strong> The final crawl for the EOT 2012 was completed by Internet Archive. Planning has started to transfer content to the Library of Congress and other partners. The CDL will be working later in the year with Internet Archive to merge the 2012 records with the 2008 site (<a href="http://eotarchive.cdlib.org/index.html">http://eotarchive.cdlib.org/index.html</a>). Background information available here:<br />
<a href="http://eotarchive.cdlib.org/2012.html">http://eotarchive.cdlib.org/2012.html</a>.</li>
</ul>
<p><strong>WAS Activity, March 2013</strong></p>
<ul>
<li>95 archives actively collected</li>
<li>1508 sites collected</li>
<li>1.4 TB of data collected</li>
</ul>
<p><strong>WAS Service Description</strong><br />
The Web Archiving Service (WAS) enables librarians, archivists and researchers to capture, curate and preserve websites and web‐published materials. WAS makes it easy to build web archives, with scheduling and other tools to help manage your archive. You control public access to your archives and can configure the appearance and navigation of each archive. We also provide collection development consultation and help desk support for web archiving questions.</p>
<p><strong>WAS Service Manager</strong><br />
Rosalie Lack, <a href="mailto:rosalie.lack@ucop.edu">rosalie.lack@ucop.edu</a> or <a href="mailto:washelp@ucop.edu">washelp@ucop.edu</a>.</p>
<p><strong>WAS Training Materials, Guides, FAQs and Webinars</strong><br />
WAS training materials and guides are available here: <a href="http://webarchives.cdlib.org/p/curators">http://webarchives.cdlib.org/p/curators</a>.</p>
<p><strong>Service Monitoring and Availability</strong><br />
Check CDL&#8217;s system status page at <a href="http://www.cdlib.org/contact/system.html">http://www.cdlib.org/contact/system.html</a>.</p>
<p>&nbsp;</p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=13364" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2013/04/18/was-service-update-march-2013/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>WAS Service Update: February 2013</title>
		<link>http://www.cdlib.org/cdlinfo/2013/03/19/was-service-update-february-2013/</link>
		<comments>http://www.cdlib.org/cdlinfo/2013/03/19/was-service-update-february-2013/#comments</comments>
		<pubDate>Tue, 19 Mar 2013 23:00:18 +0000</pubDate>
		<dc:creator>Rosalie Lack</dc:creator>
				<category><![CDATA[Digital Preservation (UC3)]]></category>
		<category><![CDATA[Newsletter]]></category>
		<category><![CDATA[Web Archiving]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=13136</guid>
		<description><![CDATA[ Recent Enhancements, News, and Activities  WAS 2.0 Release. Work continues on the release. SJSU Intern. Caitlin O&#8217;Neal López has joined our team, virtually, as an intern from San  ... <a href="http://www.cdlib.org/cdlinfo/2013/03/19/was-service-update-february-2013/">More</a>...]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/03/was_logo.jpg"><img class="aligncenter size-full wp-image-13137" title="was_logo" src="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/03/was_logo.jpg" alt="" width="324" height="62" /></a></p>
<p><strong>Recent Enhancements, News, and Activities</strong></p>
<ul>
<li><strong>WAS 2.0 Release. </strong>Work continues on the release.</li>
<li><strong>SJSU Intern.</strong> Caitlin O&#8217;Neal López has joined our team, virtually, as an intern from San Jose State University School of Library and Information Science. (Caitlin currently lives in Seoul, South Korea.) Caitlin will be working on developing new WAS tutorial videos. To help us prioritize which videos to do first, Caitlin sent a survey to current WAS subscribers. The top 5 subject areas indentified were: Analyze Capture Results; Export XML; Duplicate Reduction; Archive your domain or a large website, and Rights Management. We have many other topics on our list and will take them on as we can.</li>
<li><strong>WAS Outreach Materials.</strong> We created a new 4&#215;6 postcard (see image). If you would like us to mail some to you for an event or meeting, please contact Rosalie Lack at <a href="mailto:rosalie.lack@ucop.edu">rosalie.lack@ucop.edu</a>.</li>
<p><a href="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/03/WAS_postcard.jpg"><img class="alignright size-full wp-image-13146" title="WAS_postcard" src="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/03/WAS_postcard.jpg" alt="WAS postcard" width="199" height="291" /></a></p>
<li><strong>New WAS Institution.</strong> Read the announcement of our newest<br />
WAS subscriber &#8211; Mount Holyoke College.<br />
<a href="http://www.cdlib.org/cdlinfo/2013/03/03/mount-holyoke-college-signs-up-for-was/">http://www.cdlib.org/cdlinfo/2013/03/03/mount-holyoke-college-signs-up-for-was/</a></li>
<li><strong>Network Outage.</strong> On Thursday February 28 at 1:29 PM (PT), there was a general UCOP (UC Office of the President) network outage which impacted connectivity for CDL operations and services including the DMPTool. At 2:33 PM, we received word from the UCOP systems office that the UCOP network was coming back online. A second interruption occurred around 3 PM, although that outage was resolved within a few minutes. This outage did not have any effect on existing crawls. Since the crawler, Heritrix, is quite robust at dealing with sites that are unavailable, it would have paused at the time of the outage and picked back up once connectivity was reestablished. Please contact us at <a href="mailto:washelp@ucop.edu">washelp@ucop.edu</a> with any questions or concerns about this network outage.</li>
</ul>
<p><strong>New public content: University of California Office of the President (UCOP) site</strong><br />
<a href="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/03/UCOP_Archive.jpg"><img src="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/03/UCOP_Archive.jpg" alt="" title="UCOP_Archive" width="650" height="40" class="aligncenter size-full wp-image-13160" /></a></p>
<p>UCOP archived the ucop.edu domain prior to redesigning the site; new site launched November 2012. Available here: <a href="http://webarchives.cdlib.org/a/ucoppre2012">http://webarchives.cdlib.org/a/ucoppre2012</a>.</p>
<p>
<strong>WAS Activity, February 2013</strong></p>
<ul>
<li>100 archives actively collected</li>
<li>1753 sites collected</li>
<li>1.4 TB of data collected</li>
</ul>
<p><strong>WAS Service Description</strong><br />
The Web Archiving Service (WAS) enables librarians, archivists and researchers to capture, curate and preserve websites and web‐published materials. WAS makes it easy to build web archives, with scheduling and other tools to help manage your archive. You control public access to your archives and can configure the appearance and navigation of each archive. We also provide collection development consultation and help desk support for web archiving questions.</p>
<p>
<strong>WAS Service Manager</strong><br />
Rosalie Lack, <a href="mailto:rosalie.lack@ucop.edu">rosalie.lack@ucop.edu</a> or <a href="mailto:washelp@ucop.edu">washelp@ucop.edu</a>.</p>
<p>
<strong>WAS Training Materials, Guides, FAQs and Webinars</strong><br />
WAS training materials and guides are available here: <a href="http://webarchives.cdlib.org/p/curators">http://webarchives.cdlib.org/p/curators</a>.</p>
<p>
<strong>Service Monitoring and Availability</strong><br />
Check CDL&#8217;s system status page at <a href="http://www.cdlib.org/contact/system.html">http://www.cdlib.org/contact/system.html</a>.</p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=13136" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2013/03/19/was-service-update-february-2013/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Mount Holyoke College Signs up for WAS</title>
		<link>http://www.cdlib.org/cdlinfo/2013/03/03/mount-holyoke-college-signs-up-for-was/</link>
		<comments>http://www.cdlib.org/cdlinfo/2013/03/03/mount-holyoke-college-signs-up-for-was/#comments</comments>
		<pubDate>Sun, 03 Mar 2013 17:52:16 +0000</pubDate>
		<dc:creator>Rosalie Lack</dc:creator>
				<category><![CDATA[Digital Preservation (UC3)]]></category>
		<category><![CDATA[Newsletter]]></category>
		<category><![CDATA[Web Archiving]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=13039</guid>
		<description><![CDATA[The CDL Web Archiving Service (WAS) is extremely pleased to welcome Mount Holyoke College as its newest subscriber. Leslie Fields, Head of Archives &#38; Special Collections at Mount Holyoke, explains  ... <a href="http://www.cdlib.org/cdlinfo/2013/03/03/mount-holyoke-college-signs-up-for-was/">More</a>...]]></description>
				<content:encoded><![CDATA[<p>The CDL <a title="WAS" href="http://was.cdlib.org">Web Archiving Service (WAS)</a> is extremely pleased to welcome Mount Holyoke College as its newest subscriber. Leslie Fields, Head of Archives &amp; Special Collections at Mount Holyoke, explains “after testing several options we determined that WAS was the best choice for us. We felt the interface was straightforward and easy to use; we liked the option to make collections public or to keep them private; we received very good customer service when we asked our many questions; and we liked the consortium pricing.”</p>
<div id="attachment_13044" class="wp-caption alignright" style="width: 210px"><br />
<img class="size-full wp-image-13044 " src="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/02/dwightfront.jpg" alt="Dwight Hall Mount Holyoke College" width="200" height="162" /><p class="wp-caption-text">From Mount Holyoke College website</p></div>
<p>Mount Holyoke’s web archiving activities are the continuation of a <a href="https://www.mtholyoke.edu/archives/exhibits/nhprcgrantproject">NHPRC electronic records grant project</a> that they completed in 2011. The goal of the grant work was to establish procedures for ingest, processing, preservation, and providing access to campus electronic records of enduring value. During the period they worked on the grant staff realized that the News &amp; Events portion of the project was really all about web archiving.</p>
<p>The result was an investigation into possible web archiving options, including creating an in-house service. They quickly realized that with their current level of staffing an in-house solution was not a viable option. After reviewing other services and consulting with a variety of library teams, they selected WAS.</p>
<p>Now that they are signed up for WAS, staff will first focus on administrative and academic sites at Mount Holyoke, as well as News and Events pages. Then they will address faculty and course sites, as well as sites that document student life, such as student organization pages.</p>
<p>Welcome Mount Holyoke College!</p>
<p>For further information about the Web Archiving Service, see <a href="http://was.cdlib.org/">http://was.cdlib.org</a>, or email <a href="mailto:washelp@ucop.edu">washelp@ucop.edu</a>.</p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=13039" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2013/03/03/mount-holyoke-college-signs-up-for-was/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>WAS Service Update: January 2013</title>
		<link>http://www.cdlib.org/cdlinfo/2013/02/21/was-service-update-january-2013-2/</link>
		<comments>http://www.cdlib.org/cdlinfo/2013/02/21/was-service-update-january-2013-2/#comments</comments>
		<pubDate>Thu, 21 Feb 2013 23:06:56 +0000</pubDate>
		<dc:creator>Rosalie Lack</dc:creator>
				<category><![CDATA[Digital Preservation (UC3)]]></category>
		<category><![CDATA[Newsletter]]></category>
		<category><![CDATA[Web Archiving]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=13023</guid>
		<description><![CDATA[ Recent Enhancements, News, and Activities  WAS 2.0 Release Work on the new release continues. Significant progress was made in January; testing will begin in early February. Based on  ... <a href="http://www.cdlib.org/cdlinfo/2013/02/21/was-service-update-january-2013-2/">More</a>...]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/02/was_logo.jpg"><img class="aligncenter size-full wp-image-13024" title="was_logo" src="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/02/was_logo.jpg" alt="" width="324" height="62" /></a><br />
<strong>Recent Enhancements, News, and Activities</strong></p>
<ul>
<li><strong>WAS 2.0 Release Work</strong> on the new release continues. Significant progress was made in January; testing will begin in early February. Based on the results of testing, we will have a release date.</li>
<li><strong>VM Migration.</strong> We started the planning process for a very large, infrastructure project to migrate from Solaris physical hosts to Linux virtual machines. The migration will also include moving WAS storage to San Diego SuperComputer Center (SDSC). The migration will begin after the WAS 2.0 release.</li>
<li><strong>New WAS institution.</strong> We are very happy to welcome Mount Holyoke College in Massachusetts as our newest WAS subscriber!</li>
</ul>
<p><strong>WAS Activity, January 2013</strong></p>
<ul>
<li>103 archives actively collected</li>
<li>1971 sites collected</li>
<li>1.8 TB of data collected</li>
</ul>
<p><strong>WAS Service Description</strong><br />
The Web Archiving Service (WAS) enables librarians, archivists and researchers to capture, curate and preserve websites and web‐published materials. WAS makes it easy to build web archives, with scheduling and other tools to help manage your archive. You control public access to your archives and can configure the appearance and navigation of each archive. We also provide collection development consultation and help desk support for web archiving questions.</p>
<p>
<strong>WAS Service Manager</strong><br />
Rosalie Lack, <a href="mailto:rosalie.lack@ucop.edu">rosalie.lack@ucop.edu</a> or <a href="mailto:washelp@ucop.edu">washelp@ucop.edu</a>.</p>
<p>
<strong>WAS Training Materials, Guides, FAQs and Webinars</strong><br />
WAS training materials and guides are available here: <a href="http://webarchives.cdlib.org/p/curators">http://webarchives.cdlib.org/p/curators</a>.</p>
<p>
<strong>Service Monitoring and Availability</strong><br />
Check CDL&#8217;s system status page at <a href="http://www.cdlib.org/contact/system.html">http://www.cdlib.org/contact/system.html</a>.</p>
<p>&nbsp;</p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=13023" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2013/02/21/was-service-update-january-2013-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Temporary Changing of the Guard at UC3</title>
		<link>http://www.cdlib.org/cdlinfo/2013/01/31/temporary-changing-of-the-guard-at-uc3/</link>
		<comments>http://www.cdlib.org/cdlinfo/2013/01/31/temporary-changing-of-the-guard-at-uc3/#comments</comments>
		<pubDate>Thu, 31 Jan 2013 18:50:51 +0000</pubDate>
		<dc:creator>Jayne Dickson</dc:creator>
				<category><![CDATA[Digital Preservation (UC3)]]></category>
		<category><![CDATA[Newsletter]]></category>
		<category><![CDATA[Web Archiving]]></category>
		<category><![CDATA[DataUp]]></category>
		<category><![CDATA[EZID]]></category>
		<category><![CDATA[Merritt]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=12911</guid>
		<description><![CDATA[ Patricia Cruse, CDL’s Director of Digital Preservation (UC3), will be reducing her duties at the CDL between January 14 and June 30, 2013.  Trisha will be working remotely from  ... <a href="http://www.cdlib.org/cdlinfo/2013/01/31/temporary-changing-of-the-guard-at-uc3/">More</a>...]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/01/generic_UC3_logo1.jpg"><img class="aligncenter size-full wp-image-12912" title="generic_UC3_logo" src="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/01/generic_UC3_logo1.jpg" alt="" width="114" height="41" /></a></p>
<p>Patricia Cruse, CDL’s Director of Digital Preservation (UC3), will be reducing her duties at the CDL between January 14 and June 30, 2013.  Trisha will be working remotely from Italy and Spain while she accompanies her husband on his sabbatical from the University of San Francisco (USF).  During this time Trisha will continue working on various initiatives on a limited basis and will be available via email at <a href="mailto:patricia.cruse@ucop.edu">patricia.cruse@ucop.edu</a>.   Please don’t hesitate to contact Trisha about work or great places to see or eat in Italy or Spain!</p>
<p>Rosalie Lack, who has returned to CDL to assume the duties of the Web Archiving Service (WAS) manager, will take on many of the responsibilities for the day-to-day running of UC3.  These include administrative and systemwide activities as well as budgetary, communication, outreach, marketing and UC3 team management.   Rosalie can be reached at <a href="mailto:rosalie.lack@ucop.edu">rosalie.lack@ucop.edu</a>.</p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=12911" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2013/01/31/temporary-changing-of-the-guard-at-uc3/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Progress of Preservation</title>
		<link>http://www.cdlib.org/cdlinfo/2013/01/16/the-progress-of-preservation/</link>
		<comments>http://www.cdlib.org/cdlinfo/2013/01/16/the-progress-of-preservation/#comments</comments>
		<pubDate>Wed, 16 Jan 2013 23:17:01 +0000</pubDate>
		<dc:creator>Laine Farley</dc:creator>
				<category><![CDATA[Digital Preservation (UC3)]]></category>
		<category><![CDATA[Opinion and Commentary]]></category>
		<category><![CDATA[Web Archiving]]></category>
		<category><![CDATA[DataCite]]></category>
		<category><![CDATA[DataUp]]></category>
		<category><![CDATA[DMPTool]]></category>
		<category><![CDATA[EZID]]></category>
		<category><![CDATA[Message from the Executive Director]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=12808</guid>
		<description><![CDATA[I was very pleased by the nice recognition from Library of Congress’s “Top Ten Digital Preservation Developments of 2012” of three projects CDL has been involved in: The DataUp Project.  ... <a href="http://www.cdlib.org/cdlinfo/2013/01/16/the-progress-of-preservation/">More</a>...]]></description>
				<content:encoded><![CDATA[<p>I was very pleased by the nice recognition from Library of Congress’s “<a href="http://blogs.loc.gov/digitalpreservation/2013/01/top-10-digital-preservation-developments-of-2012/">Top Ten Digital Preservation Developments of 2012</a>” of three projects CDL has been involved in:</p>
<blockquote><p><strong>The DataUp Project</strong>. The University of California Curation Center at the California Digital Library continued to produce useful tools and services in support of digital preservation with <a href="http://dataup.cdlib.org/about_project.html">DataUp</a>, “an open source tool helping researchers document, manage, and archive their tabular data… within the scientist’s workflow.”</p>
<p><strong>End of Term Web Archive</strong>. The <a href="http://eotarchive.cdlib.org/2012.html">End of Term 2012 project</a> got underway to capture U.S. Government websites between the first and second administration of President Barack Obama. Project partners include the California Digital Library, Internet Archive, Library of Congress, University of North Texas Libraries and the U.S. Government Printing Office.</p></blockquote>
<p>We have also been participating through meetings and briefings in the development of another project on the list, the Digital Preservation Network.</p>
<p>The rest of this “Top Ten” list is equally impressive and it is heartening to realize that so much has been accomplished in the area of digital preservation in our community. Yet I can’t help but note that it can still be a hard sell to administrators to justify new or increased expenditures for something that seems abstract, unpredictable and never ending. This challenge was magnified for me recently when I presented a paper in December, 2012 at the 3rd conference on <a href="http://www.rinascimento-digitale.it/conference2012-introduction.phtml">Cultural Heritage online &#8211; Trusted Digital Repositories &amp; Trusted Professionals</a> in Florence, Italy.</p>
<p>The conference began with a parade of local officials and cultural heritage ministers from the city and region, all extolling the importance of preserving cultural heritage in the digital age. Indeed, the commitment to preservation is everywhere in this city that was the heart of the Renaissance, its museums and public places overflowing with the riches of the past. I was particularly struck by the exhibits at the <a href="http://www.museogalileo.it/en/index.html?%2Fbdviewer%40selid=1978491">Galileo Museum</a></p>
<div id="attachment_12818" class="wp-caption alignright" style="width: 458px"><a href="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/01/Florence-Galileo-Museum1.jpg"><img class="wp-image-12818     " src="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/01/Florence-Galileo-Museum1.jpg" alt="" width="448" height="336" /></a><p class="wp-caption-text">Sundials, Galileo Museum, Florence</p></div>
<p>where the artifacts and experimental documentation of not only Galileo but also his contemporaries were showcased in spectacular fashion. Even with all of the support given to cultural heritage, the Italians at the conference still feel they need to shore up digital preservation, but they were focusing on standards and compliance more than on basic funding.</p>
<p>My presentation, on the other hand, was about how we have shifted from talking about preservation as an ultimate and costly activity to curation as part of the ongoing process of creating and managing digital content. We must be aware of incentives—such as mandates from funders and publishers rather than government initiatives&#8211;for our researchers to practice good stewardship of their research output. Thus we have invested in tools such as the DataUP mentioned by LC as well as the Data Management Planning Tool (DMPTool which made the 2011 Top Ten list), EZID (which won the DataCite 2012 Gold Award for assigning more than 250,000 DOIs (digital object identifiers) in one year), and the Web Archiving Service to make it digital content easier to manage when it comes time to preserve it for the future. After all, today’s research can become tomorrow’s cultural heritage and think what we would be missing if we weren’t able to see what Galileo was up to.</p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=12808" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2013/01/16/the-progress-of-preservation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>DataUp, End of Term Web Archive Cited Among Top Ten Digital Preservation Developments of 2012</title>
		<link>http://www.cdlib.org/cdlinfo/2013/01/16/dataup-end-of-term-web-archive-cited-among-top-ten-digital-preservation-developments-of-2012/</link>
		<comments>http://www.cdlib.org/cdlinfo/2013/01/16/dataup-end-of-term-web-archive-cited-among-top-ten-digital-preservation-developments-of-2012/#comments</comments>
		<pubDate>Wed, 16 Jan 2013 21:08:01 +0000</pubDate>
		<dc:creator>Ellen Meltzer</dc:creator>
				<category><![CDATA[Curation Micro-services]]></category>
		<category><![CDATA[Digital Preservation (UC3)]]></category>
		<category><![CDATA[Newsletter]]></category>
		<category><![CDATA[Web Archiving]]></category>
		<category><![CDATA[DataUp]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=12800</guid>
		<description><![CDATA[Two CDL projects are among a list of top ten digital preservation developments of 2012, according to Bill LeFurgy, Digital Initiatives Manager at the Library of Congress, DataUp and the  ... <a href="http://www.cdlib.org/cdlinfo/2013/01/16/dataup-end-of-term-web-archive-cited-among-top-ten-digital-preservation-developments-of-2012/">More</a>...]]></description>
				<content:encoded><![CDATA[<p>Two CDL projects are among a list of top ten digital preservation developments of 2012, according to Bill LeFurgy, Digital Initiatives Manager at the Library of Congress, <a href="http://dataup.cdlib.org/">DataUp</a> and the <a href="http://eotarchive.cdlib.org/2012.html">End of Term Web Archive</a> are included in this list.  Stated LeFurgy, “I cast a wide net and mustered my objectivity in picking activities with the potential for broad, collaborative impact in the world-wide effort to keep digital material available and accessible over time. The resulting list covers an assortment of practical, hands-on information, as well as tools for helping with outreach, program assessment and research data management.” </p>
<p>See the full article at <a href="http://blogs.loc.gov/digitalpreservation/2013/01/top-10-digital-preservation-developments-of-2012/">http://blogs.loc.gov/digitalpreservation/2013/01/top-10-digital-preservation-developments-of-2012/</a></p>
<p>Congratulations to our UC3 colleagues at CDL, along with our partners in these endeavors including the Gordon and Betty Moore Foundation, Microsoft Research Connections and DataONE for DataUp; and Harvard University Library, Internet Archive, Library of Congress, University of North Texas Libraries and the U.S. Government Printing Office for the End of Term Web Archive.</p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=12800" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2013/01/16/dataup-end-of-term-web-archive-cited-among-top-ten-digital-preservation-developments-of-2012/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>WAS Service Update – November/December 2012</title>
		<link>http://www.cdlib.org/cdlinfo/2013/01/14/was-service-update-novemberdecember-2012/</link>
		<comments>http://www.cdlib.org/cdlinfo/2013/01/14/was-service-update-novemberdecember-2012/#comments</comments>
		<pubDate>Mon, 14 Jan 2013 22:23:22 +0000</pubDate>
		<dc:creator>Rosalie Lack</dc:creator>
				<category><![CDATA[Digital Preservation (UC3)]]></category>
		<category><![CDATA[Newsletter]]></category>
		<category><![CDATA[Web Archiving]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=12763</guid>
		<description><![CDATA[Recent Enhancements, News, and Activities  WAS 2.0 Release re-scheduled for 2013. One of the key, new features of WAS 2.0 will be the Solr index. It will bring faster  ... <a href="http://www.cdlib.org/cdlinfo/2013/01/14/was-service-update-novemberdecember-2012/">More</a>...]]></description>
				<content:encoded><![CDATA[<p><strong>Recent Enhancements, News, and Activities</strong></p>
<ul>
<li><strong>WAS 2.0 Release re-scheduled for 2013</strong>. One of the key, new features of WAS 2.0 will be the Solr index. It will bring faster and more accurate search results, advanced search features, and a more robust indexing system. In order to implement the Solr index, the entire collection, which is now over 50 Terabytes, has to be re-indexed. This is a very complex and machine intensive process. Due to some complications encountered during the re-indexing, the WAS 2.0 release has been postponed to early 2013. All the development on the UI enhancements and new curator tools has been completed, but since these developments are inseparable from the new Solr indexing infrastructure, their release has also been postponed to 2013.</li>
<li><strong>WAS public site redesign moving forward</strong>. The graphic design is completed and the project has now moved to the CDL Web Production team for implementation.</li>
<li><strong>New WAS logo and tagline</strong>. In order to align the WAS logo with the other services in the UC3 portfolio, a new logo was created (see below).  In addition, a new descriptive tagline now provides a clear overview of the WAS service.</li>
</ul>
<p style="padding-left: 60px"><a href="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/01/WAS_logo.png"><img class="size-medium wp-image-12764 alignleft" style="border: 0px" src="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/01/WAS_logo-300x54.png" alt="" width="300" height="54" /></a>&#8216;</p>
<p>&nbsp;</p>
<ul>
<li><strong>WAS presentation at UC3 workshops</strong>.  Two 2-day workshops Nov 8-9 in Oakland and Nov 13-14 in Irvine introduced library staff to basic data curation concepts, presented the challenges and solutions involved in delivering data curation services, provided information about UC3’s tools and services, and shared campus solutions and approaches to data curation services.  100 participants attended the workshops.  Rosalie Lack presented an overview of WAS (see presentation <a href="http://bit.ly/U196aC">http://bit.ly/U196aC</a> [pdf]) and discussed how institutions are currently using it and the future challenges of web archiving.</li>
<li><strong>IIPC training</strong>. Rosalie Lack attended a 5-day International Internet Preservation Consortium-sponsored training, “How to Fit In? Integrating a web archiving program in your organization”. The workshop was hosted and organized by the French National Library and was attended by staff from libraries across Europe, Egypt, and the US. At the workshop, a well-rounded picture of how web archiving could be integrated into existing library systems was presented, new developments in web archiving showcased, and challenges were discussed.</li>
<li><strong>Digital Library Federation (DLF) and NGTS activities</strong>. Erik Hetzner, WAS Tech Lead, attended DLF in early November.  As a member of a UC Next-Generation Technical Services (NGTS) lightening team,  along with other members &#8212; Todd Grappone (UCLA), Declan Fleming (UCSD), Brian Tingle (CDL) and Susan Perry (UCSC) – organized a birds of a feather session at which representatives from Hydra and Islandora discussed their services. Their lightening team (1.C) is charged with developing a model for a UC systemwide DAMS. Learn more: <a href="http://ucngts.tumblr.com/">http://ucngts.tumblr.com/</a></li>
<li><strong>End of Term (EOT) Archive crawls</strong>. CDL is working in collaboration with the Harvard University Library, Internet Archive, Library of Congress, University of North Texas Libraries, and the U.S. Government Printing Office on a continued partnership to archive 2012-2013 End of Term (EOT) sites.  Internet Archive’s final crawl for 2012 was completed mid November; there will be additional crawls run in early January and continue through inauguration day.  Background information available here: <a href="http://eotarchive.cdlib.org/2012.html">http://eotarchive.cdlib.org/2012.html</a></li>
</ul>
<p>&nbsp;</p>
<p><strong>New public content: UCLA Library goes public with Occupy Web Archive</strong></p>
<p><a href="http://webarchives.cdlib.org/a/occupy"><img class="alignnone size-medium wp-image-12765" src="http://www.cdlib.org/cdlinfo/wp-content/uploads/2013/01/BannerLogoOcupaMinor2-300x110.jpg" alt="" width="300" height="110" /></a></p>
<p>The UCLA Library Occupy Web Archive is now available for searching and browsing on the WAS site. The collection documents local Occupy movements and events on the west coast of the United States, Mexico, and Brazil.  The collection provides a geographic, topic browse that makes it easy to navigate through over 30 cities, states and countries.</p>
<p>Available here:  <a href="http://webarchives.cdlib.org/a/occupy">http://webarchives.cdlib.org/a/occupy</a></p>
<p><strong>WAS Activity, November/December 2012</strong></p>
<ul>
<li>4353 archives actively collected</li>
<li>2773 sites collected</li>
<li>3.5 TB of data collected</li>
</ul>
<p><strong>WAS Service Description</strong></p>
<p>The Web Archiving Service (WAS) enables librarians, archivists and researchers to capture, curate and preserve websites and web‐published materials.   WAS makes it easy to build web archives, with scheduling and other tools to help manage your archive.  You control public access to your archives and can configure the appearance and navigation of each archive.  We also provide collection development consultation and help desk support for web archiving questions.</p>
<p><strong>WAS Service Manager<br />
</strong>Rosalie Lack rosalie.lack@ucop.edu or washelp@ucop.edu</p>
<p><strong>WAS Training Materials, Guides, FAQs and Webinars<br />
</strong>WAS training materials and guides available here: <a href="http://webarchives.cdlib.org/p/curators">http://webarchives.cdlib.org/p/curators</a></p>
<p><strong>Service Monitoring and Availability<br />
</strong>Check system status page <a href="http://www.cdlib.org/contact/system.html">http://www.cdlib.org/contact/system.html</a></p>
<p>&nbsp;</p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=12763" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2013/01/14/was-service-update-novemberdecember-2012/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
