<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>California Digital Library &#187; digitization</title>
	<atom:link href="http://www.cdlib.org/cdlinfo/tag/digitization/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.cdlib.org/cdlinfo</link>
	<description>The Official CDL Blog</description>
	<lastBuildDate>Mon, 20 May 2013 21:54:33 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>Merritt Service Update: April &#8211; May 2012 http://merritt.cdlib.org</title>
		<link>http://www.cdlib.org/cdlinfo/2012/06/05/merritt-service-update-april-may-2012-httpmerritt-cdlib-org/</link>
		<comments>http://www.cdlib.org/cdlinfo/2012/06/05/merritt-service-update-april-may-2012-httpmerritt-cdlib-org/#comments</comments>
		<pubDate>Tue, 05 Jun 2012 15:43:30 +0000</pubDate>
		<dc:creator>Perry Willett</dc:creator>
				<category><![CDATA[Digital Preservation (UC3)]]></category>
		<category><![CDATA[Digital Preservation Repository]]></category>
		<category><![CDATA[Newsletter]]></category>
		<category><![CDATA[digitization]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=11750</guid>
		<description><![CDATA[Merritt Service Description Merritt is a production level service that provides the UC community with an easy to use tool to manage, archive, and share their content. Content can be  ... <a href="http://www.cdlib.org/cdlinfo/2012/06/05/merritt-service-update-april-may-2012-httpmerritt-cdlib-org/">More</a>...]]></description>
				<content:encoded><![CDATA[<p><strong>Merritt Service Description </strong>Merritt is a production level service that provides the UC community with an easy to use tool to manage, archive, and share their content. Content can be deposited and managed via a user-interface or an API.</p>
<p><strong>Merritt Service Managers</strong><strong> </strong>Perry Willett <a href="mailto:perry.willett@ucop.edu">perry.willett@ucop.edu</a> and Adrian Turner <a href="mailto:adrian.turner@ucop.edu">adrian.turner@ucop.edu</a> or <a href="mailto:uc3@ucop.edu">uc3@ucop.edu</a>.</p>
<p><strong>Merritt Training Materials, Guides, FAQs and Webinars</strong> More information about Merritt is available at <a href="http://www.cdlib.org/services/uc3/merritt">http://www.cdlib.org/services/uc3/merritt</a> or by sending an inquiry to <a href="mailto:uc3@ucop.edu">uc3@ucop.edu</a> </p>
<p>See also Merritt webinars: <a href="http://www.cdlib.org/services/uc3/uc3webinars.html">http://www.cdlib.org/services/uc3/uc3webinars.html</a></p>
<p><strong>Recent Enhancements, News, and Activities </strong></p>
<p>• We’ve posted a summary of Merritt development activities and target timeframes on the UC3 Curation wiki: <a href="https://confluence.ucop.edu/display/Curation/Home">https://confluence.ucop.edu/display/Curation/Home</a> and will continue to use the wiki as a place to update the community on activities.</p>
<p>• In coordination with the UCLA Library and DiscoveryGarden, we are planning to move forward on work to integrate Islandora with Merritt.  UCLA will be conducting the development work using a forthcoming Islandora API, with consulting, testing, and project support from DiscoveryGarden and CDL.  Islandora is an open source digital asset management system currently under evaluation for implementation by the UCLA Library.  We are seeking to deploy Merritt as the preservation storage layer under Islandora’s Drupal-based system, in place of Fedora, which usually fills that role.   We will prepare documentation, to assist other Islandora implementers with this process, once the project is completed.</p>
<p>• Work is in-progress on enhancements to the Merritt user interface, to support public access to Merritt collections.  This has been identified as the top priority for Merritt development.  The designation of collections and/or objects to be exposed publicly is performed by providers, based on local policy decisions.  Merritt curators will be able to designate their collections publicly accessible, and users will have direct access to materials stored in Merritt.  We will hold a webinar to demo this new functionality, once it is available. </p>
<p>• We are documenting how UC campus libraries are utilizing or planning to integrate Merritt within local workflows.  Brief case studies &#8212; including recent profiles of UC Santa Barbara, UC San Francisco, and UC Santa Cruz’s use of Omeka with other systems &#8212; are featured on our UC3 Curation wiki.  • The UC Irvine Libraries are now submitting the content from their DSpace repository, called UCISpace http://ucispace.lib.uci.edu/. These UCISpace collections include a number of resources from the Libraries’ special collections and archives.  Content will be submitted via the Merritt API.  Special thanks to the UC Irvine Libraries Digital Scholarship Service team, and to Matthew McKinley for his work to connect these two systems.</p>
<p>• We are in the process of contracting with the San Diego Supercomputer Center to utilize their cloud storage service. This will allow for further cost-savings and will extend the replication of content stored in Merritt. </p>
<p>• We are continuing work on our self-audit of the Merritt repository, based upon the Trustworthy Repository Audit Certification (TRAC) checklist.  Information about policies and practices is being posted on the TRAC pages on our UC3 Curation wiki &lt;<a href="https://confluence.ucop.edu/display/Curation/TRAC">https://confluence.ucop.edu/display/Curation/TRAC</a>&gt; and we encourage feedback and comments from the community. </p>
<p>• We implemented a number of upgrades to our Ruby on Rails web application framework, which underlies a number of Merritt features and functions, and also added patches to our indexing system.</p>
<p>• DCXL project, sponsored by Microsoft Research and the Gordon and Betty Moore Foundation, will enable the preservation and sharing of research data via Microsoft Excel. Merritt will be a storage node for this project, allowing researchers to save, share and preserve their data in Merritt.  We have been working with developers at Microsoft to permit the submission of Excel spreadsheets to Merritt.  You can read more about the DCXL project at http://dcxl.cdlib.org/?p=692</p>
<p>• We have staged collections that were formerly in the Digital Preservation Repository (DPR), for migration to Merritt.  This is in preparation of decommissioning the legacy DPR system.  We have contacted clients with collections in the DPR, to confirm whether or not they would like us to migrate their collections forward. Please contact us with any questions about this migration.</p>
<p><strong>Service Monitoring and Availability </strong>Check Merritt’s system status page <a href="http://www.cdlib.org/contact/system.html">http://www.cdlib.org/contact/system.html</a></p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=11750" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2012/06/05/merritt-service-update-april-may-2012-httpmerritt-cdlib-org/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Reading 5 Million Books at Once: Google N-Grams at TEDxBoston</title>
		<link>http://www.cdlib.org/cdlinfo/2011/09/22/reading-5-million-books-at-once-google-n-grams-at-tedxboston/</link>
		<comments>http://www.cdlib.org/cdlinfo/2011/09/22/reading-5-million-books-at-once-google-n-grams-at-tedxboston/#comments</comments>
		<pubDate>Thu, 22 Sep 2011 23:40:02 +0000</pubDate>
		<dc:creator>jcolman</dc:creator>
				<category><![CDATA[Mass Digitization]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[cool stuff]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[digital humanities]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[google]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=10658</guid>
		<description><![CDATA[Google has digitized millions of books from libraries across the world, including the UC Libraries. While our digitized books are great for traditional reading and research, the corpus also offers  ... <a href="http://www.cdlib.org/cdlinfo/2011/09/22/reading-5-million-books-at-once-google-n-grams-at-tedxboston/">More</a>...]]></description>
				<content:encoded><![CDATA[<p>Google has digitized millions of books from libraries across the world, including the UC Libraries. While our digitized books are great for traditional reading and research, the corpus also offers a unique opportunity for new kinds of inquiry. For instance: what can we learn about the evolution of culture by analyzing the written record over time on a massive scale?  How can we quantify the change of languages over time?</p>
<p>In <a href="http://www.ted.com/talks/what_we_learned_from_5_million_books.html">this great TEDx talk</a>, two Harvard researchers (Jean-Baptiste Michel and Erez Lieberman Aiden) discuss the insights they&#8217;ve gleaned from the Google N-gram Viewer. From the creation of a metric for censorship to the orthography of frustration, the presentation introduces ideas that could spark great research in the digital humanities and elsewhere.  It&#8217;s fun to watch, too.</p>
<p>You can also <a href="http://ngrams.googlelabs.com/">try out your own N-grams</a>!</p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=10658" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2011/09/22/reading-5-million-books-at-once-google-n-grams-at-tedxboston/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Scripps Institution of Oceanography Library Finishes Digitization</title>
		<link>http://www.cdlib.org/cdlinfo/2010/05/20/5239/</link>
		<comments>http://www.cdlib.org/cdlinfo/2010/05/20/5239/#comments</comments>
		<pubDate>Thu, 20 May 2010 21:28:09 +0000</pubDate>
		<dc:creator>jcolman</dc:creator>
				<category><![CDATA[Collection Development]]></category>
		<category><![CDATA[Mass Digitization]]></category>
		<category><![CDATA[Newsletter]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[HathiTrust]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=5239</guid>
		<description><![CDATA[Working together with CDL and Google, UCSD has finished digitizing nearly 100,000 volumes from the collections of the Scripps Institution of Oceanography Library – the world&#8217;s largest oceanographic library. Included are  ... <a href="http://www.cdlib.org/cdlinfo/2010/05/20/5239/">More</a>...]]></description>
				<content:encoded><![CDATA[<p>Working together with CDL and Google, UCSD has <a href="http://ucsdnews.ucsd.edu/newsrel/general/05-20OceanographyLibrary.asp" target="_self">finished digitizing</a> nearly 100,000 volumes from the collections of the <a href="http://libraries.ucsd.edu/locations/sio/">Scripps Institution of Oceanography Library</a> – the world&#8217;s largest oceanographic library. Included are works on <a href="http://hdl.handle.net/2027/uc1.31822008810988">marine biology</a>, <a href="http://hdl.handle.net/2027/uc1.31822035214493">climate science</a>, <a href="http://hdl.handle.net/2027/uc1.31822033837253">ecology</a>, <a href="http://hdl.handle.net/2027/uc1.31822032513848">geology</a>, and many <a href="http://hdl.handle.net/2027/uc1.31822012246120">other subjects</a>. The collection includes scientific expedition reports from as far back as the 18th century, many of which were previously unavailable in digital form. We&#8217;re very excited to have these wonderful materials join our other UC collections in HathiTrust.</p>
<p>The Scripps Library is one of many UC collections that have been digitized recently. Among others, thousands of books from the <a href="http://www.library.ucla.edu/libraries/eastasian/">East Asian library</a> at UCLA, the <a href="http://library.ucsc.edu/science">Science and Engineering library</a> at UCSC, and the great variety of materials at <a href="http://www.lib.berkeley.edu/NRLF/">NRLF</a> are going online every month. The Mass Digitization group at CDL continues to coordinate and facilitate this work, and we are constantly surprised by its depth and breadth. You&#8217;ll find more information at <a href="http://booksearch.blogspot.com/2010/05/surfacing-treasures-of-deep-with.html">Google</a> and <a href="http://ucsdnews.ucsd.edu/newsrel/general/05-20OceanographyLibrary.asp">UCSD</a>, and keep an eye out further updates from CDL on the next UC collections going digital!</p>
<div class="mceTemp mceIEcenter">
<dt><a href="http://www.cdlib.org/cdlinfo/wp-content/uploads/2010/05/sealion2.jpg"><img class="size-full wp-image-5255  " src="http://www.cdlib.org/cdlinfo/wp-content/uploads/2010/05/sealion2.jpg" alt="The California Sea Lion" width="306" height="382" /></a></dt>
<p><strong>The California Sea Lion (Image from UCSD Libraries)</strong></p>
<p style="text-align: left">
</div>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=5239" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2010/05/20/5239/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
