<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>California Digital Library &#187; Technology</title>
	<atom:link href="http://www.cdlib.org/cdlinfo/category/technology/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.cdlib.org/cdlinfo</link>
	<description>The Official CDL Blog</description>
	<lastBuildDate>Thu, 16 May 2013 14:49:20 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>California Digital Library Announces Release of XTF Version 3.1</title>
		<link>http://www.cdlib.org/cdlinfo/2012/08/08/california-digital-library-announces-release-of-xtf-version-3-1/</link>
		<comments>http://www.cdlib.org/cdlinfo/2012/08/08/california-digital-library-announces-release-of-xtf-version-3-1/#comments</comments>
		<pubDate>Wed, 08 Aug 2012 20:23:52 +0000</pubDate>
		<dc:creator>Lisa Schiff</dc:creator>
				<category><![CDATA[Newsletter]]></category>
		<category><![CDATA[Publishing]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=12020</guid>
		<description><![CDATA[The California Digital Library (CDL) is pleased to announce the release of version 3.1 of XTF (http://xtf.cdlib.org/), an open source, highly flexible software application that supports the search, browse and  ... <a href="http://www.cdlib.org/cdlinfo/2012/08/08/california-digital-library-announces-release-of-xtf-version-3-1/">More</a>...]]></description>
				<content:encoded><![CDATA[<p>The California Digital Library (CDL) is pleased to announce the release of version 3.1 of <a href="http://xtf.cdlib.org/">XTF</a> (<a href="http://xtf.cdlib.org/">http://xtf.cdlib.org/</a>), an open source, highly flexible software application that supports the search, browse and display of heterogeneous digital content.  XTF provides efficient and practical methods for creating customized end-user interfaces for distinct digital content collections and is used by institutions worldwide. </p>
<p>Major features in the 3.1 release include:</p>
<ul>
<li>Improved schema handling for EAD finding aids.  In addition to EAD 2002 DTD,  XTF now provides support for search and display of:
<ul>
<li>EAD 2002 schema and EAD 2002 RelaxNG finding aids</li>
<li>Output from Archivists’ Toolkit and Archon</li>
</ul>
</li>
<li>Better OAI 2.0 conformance</li>
<li>Dynamic site maps to support optimal search engine indexing</li>
</ul>
<p>See the <a href="http://xtf.cdlib.org/documentation/changelog/#3.1">3.1 change log</a> (<a href="http://xtf.cdlib.org/documentation/changelog/#3.1">http://xtf.cdlib.org/documentation/changelog/#3.1</a>) for further details.</p>
<p>XTF is a combination of Java and XSLT 2.0 that indexes, queries, and displays digital objects and is based on open source software (e.g. Lucene and Saxon).  XTF can be downloaded from the <a href="http://xtf.cdlib.org/download/">XTF website</a> (<a href="http://xtf.cdlib.org/download/">http://xtf.cdlib.org/download/</a>) or from the <a href="http://sourceforge.net/projects/xtf/">XTF Project page on SourceForge</a> (<a href="http://sourceforge.net/projects/xtf/">http://sourceforge.net/projects/xtf/</a>), where the source code can also be found. </p>
<p>The XTF website also provides a <a href="http://xtf.cdlib.org/download/">self-guided tutorial</a> and a sample of the <a href="http://xtf.cdlib.org:8080/xtf/search">default installation</a> (<a href="http://xtf.cdlib.org:8080/xtf/search">http://xtf.cdlib.org:8080/xtf/search</a>), demonstrating the capabilities of the tool out-of-the-box. Both of these resources provide a quick view of the capabilities of XTF prior to download.</p>
<p>Offering a suite of customizable features that support diverse intellectual access to content, XTF interfaces can be designed to support the distinct tools and presentations that are useful and meaningful to specific audiences.  In addition, XTF offers the following core features:</p>
<ul>
<li><strong>Easy to deploy: </strong>Drops directly in to a Java application server such as Tomcat or Resin; has been tested on Solaris, Mac, Linux, and Windows operating systems.</li>
<li><strong>Easy to configure: </strong>Can create indexes on any XML element or attribute; entire presentation layer is customizable via XSLT.</li>
<li><strong>Robust:</strong><strong> </strong>Optimized to perform well on large documents (e.g., a single text that exceeds 10MB of encoded text); scales to perform well on collections of millions of documents; provides full Unicode support.</li>
<li><strong>Extensible:</strong></li>
<ul>
<li>Works well with a variety of authentication systems (e.g., IP address lists, LDAP, Shibboleth).</li>
<li>Provides an interface for external data lookups to support thesaurus-based term expansion, recommender systems, etc.</li>
<li>Can power other digital library services (e.g., XTF contains an OAI-PMH data provider that allows others to harvest metadata, and an SRU interface that exposes searches to federated search engines).</li>
<li>Can be deployed as separate, modular pieces of a third-party system (e.g., the module that displays snippets of matching text).</li>
</ul>
<li><strong>Powerful for the enduser: </strong><strong></strong></li>
<ul>
<li>Spell checking of queries<strong></strong></li>
<li>Faceted displays for browsing<strong></strong></li>
<li>Dynamically updated browse lists<strong></strong></li>
<li>Session-based bookbags</li>
</ul>
</ul>
<p>These basic features can be tuned and modified.  For instance, the same bookbag feature that allows users to store links to entire books can also store links to citable elements of an object, such as a note or other reference.</p>
<p>Examples of XTF-based applications both within and outside of the CDL include:</p>
<ul>
<li><a href="http://www.escholarship.org/">eScholarship</a> (<a href="http://www.escholarship.org/">http://www.escholarship.org</a>), the University of California’s open access scholarly publishing and research platform.</li>
<li><a href="http://www.marktwainproject.org/">Mark Twain Project Online</a> (<a href="http://www.marktwainproject.org/">http://www.marktwainproject.org</a>), developed by the Mark Twain Papers Project, the CDL and the University of California Press.</li>
<li><a href="http://calisphere.universityofcalifornia.edu/">Calisphere</a> (<a href="http://calisphere.universityofcalifornia.edu/">http://calisphere.universityofcalifornia.edu/</a>), a curated collection of primary sources keyed to the curriculum standards of California’s K-12 community, developed by the CDL.</li>
<li><a href="http://socialarchive.iath.virginia.edu/xtf/search">SNAC: The Social Networks and Archival Context Project (prototype)</a> (<a href="http://socialarchive.iath.virginia.edu/xtf/search">http://socialarchive.iath.virginia.edu/xtf/search</a>), linking together descriptions of people from finding aids using the new standard Encoded Archival Context-Corporate Bodies, Persons, and Families (EAC-CPF), d<em>eveloped by </em><a href="http://socialarchive.iath.virginia.edu/staff.html#iath" target="_blank">IATH, University of Virginia</a><em> (<a href="http://socialarchive.iath.virginia.edu/staff.html#iath">http://socialarchive.iath.virginia.edu/staff.html#iath</a> ), the CDL and the </em><a href="http://socialarchive.iath.virginia.edu/staff.html#ucbsi" target="_blank">UC Berkeley School of Information</a><em> (<a href="http://socialarchive.iath.virginia.edu/staff.html#ucbsi">http://socialarchive.iath.virginia.edu/staff.html#ucbsi</a>).</em></li>
<li>Various collections at the University of Sydney, Australia, including: <a href="http://frontiers.library.usyd.edu.au/">Frontiers of Science</a>, University of Sydney Library (<a href="http://frontiers.library.usyd.edu.au/">http://frontiers.library.usyd.edu.au/</a>); the <a href="http://va.library.usyd.edu.au/">Sydney College of the Arts Archive</a> (<a href="http://va.library.usyd.edu.au/">http://va.library.usyd.edu.au</a>).</li>
<li><a href="http://www.encyclopedia.chicagohistory.org/">The Encyclopedia of Chicago</a> (<a href="http://www.encyclopedia.chicagohistory.org/">http://www.encyclopedia.chicagohistory.org/</a>), developed by the Chicago History Museum, The Newberry Library, and Northwestern University.</li>
<li><a href="http://webapp1.dlib.indiana.edu/newton/">The Chymistry of Isaac Newton</a> (<a href="http://webapp1.dlib.indiana.edu/newton/">http://webapp1.dlib.indiana.edu/newton/</a>) and <a href="http://webapp1.dlib.indiana.edu/swinburne/">The Swinburne Project</a> (<a href="http://webapp1.dlib.indiana.edu/swinburne/">http://webapp1.dlib.indiana.edu/swinburne/</a>), Indiana University.</li>
</ul>
<p>&nbsp;</p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=12020" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2012/08/08/california-digital-library-announces-release-of-xtf-version-3-1/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CDL Departures May, June 2012</title>
		<link>http://www.cdlib.org/cdlinfo/2012/05/18/cdl-departures-may-june-2012/</link>
		<comments>http://www.cdlib.org/cdlinfo/2012/05/18/cdl-departures-may-june-2012/#comments</comments>
		<pubDate>Fri, 18 May 2012 16:13:51 +0000</pubDate>
		<dc:creator>Ellen Meltzer</dc:creator>
				<category><![CDATA[Digital Preservation (UC3)]]></category>
		<category><![CDATA[Discovery & Delivery]]></category>
		<category><![CDATA[Newsletter]]></category>
		<category><![CDATA[Staff News]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=11695</guid>
		<description><![CDATA[CDL reports with mixed feelings the upcoming retirements of three of our valued colleagues: John Ober, Margaret Low, and Claudia Woo.  John Ober began his stint as CDL Assistant Director  ... <a href="http://www.cdlib.org/cdlinfo/2012/05/18/cdl-departures-may-june-2012/">More</a>...]]></description>
				<content:encoded><![CDATA[<p>CDL reports with mixed feelings the upcoming retirements of three of our valued colleagues: John Ober, Margaret Low, and Claudia Woo. </p>
<p>John Ober began his stint as CDL Assistant Director for Education and Communications in 1998 and leaves us as Manager of Infrastructure and Applications Support, having held several other positions in between including Director of Scholarly Communications.  Prior to coming to CDL, he served as Development Librarian for Electronic Resources for the Center for Science, Technology and Information Resources/Library Learning Complex, California State University(CSU), Monterey Bay; and Acting Director, Library Systems, at UC Berkeley. As an Instructor at CSU Monterey and an Assistant Professor at UC Berkeley, John taught courses on database management, networks and networked information, and the management of information technology</p>
<p>We will miss John’s intelligence, high standards, excellent judgment, writing skills, passion for scholarship, level-headedness, fairness, tact, moral compass and his wry sense of humor.  Many of his memos included final sections for “the insatiably curious” with tie-ins to perfectly matched and ironically titled country-western songs.  (A recent example had several with the words “share” and “point” in their titles following a memo about UCOP’s move to SharePoint.) </p>
<p>John leaves CDL May 24.  He and his wife Barbara will be moving to Chico where they plan to expand their gardening space, raise chickens, become involved in the civic life of Chico, and spend time with family.  In the spirit of the C-W song, “When You Leave Walk Out Backwards, So I&#8217;ll Think You&#8217;re Walking In”.   We will miss you, John, and wish you all the best!</p>
<p>Margaret Low will be retiring from the CDL on June 28 after 30 years of service to UC.  Margaret has been at CDL and its predecessor organization DLA (Division of Library Automation) since the early days and has made significant contributions throughout her valued years of service.  For the past several years Margaret has been a key member of the University of California Curation Center (UC3) team, most recently holding responsibilities for infrastructure, deployment, and operational support for all UC3 service offerings, as well as being the primary point of contact with CDL’s Infrastructure and Applications Support (IAS) and the ITS Unix group.  She has always brought to these activities great professionalism, collegiality, and unflagging good humor. </p>
<p>After CDL, Margaret plans to use her technical skills to host a personal website for her knitting, quilting and embroidery projects as well becoming more involved with the local craft communities.  She will be spending more time with her family in Southern California and planning long road trips. Since one of her many contributions to CDL was her organization of the coffee club, crucial to those early morning coffee drinkers, we wish you lots of good, strong coffee, Margaret!</p>
<p>Discovery &amp; Delivery’s stalwart Claudia Woo is retiring on June 27.  Claudia started her career in library automation in 1981 working on the budget program.  In the late 1980s Claudia was responsible for the generation of the California Academic Library List of Serials (CALLS) on microfiche. After serials were added to the Melvyl database, and as Abstracting &amp; Indexing (A&amp;I) databases were added throughout the 1990s Claudia worked on her all time favorite program – SGGP, a generic table-driven program that could be configured to process a wide variety of bibliographic metadata records. The trick was to call generic routines in just the right order to accomplish almost any bibliographic metadata normalization.  With the retirement of the mainframe and the locally hosted Melvyl A&amp;I databases Claudia played a leading role in sustaining Aleph Melvyl for nearly 10 years, as well as working on early metasearch programs like Searchlight and on VDX.</p>
<p>Claudia has also provided backup for many Discovery &amp; Delivery systems. There is a growing folder on CDL’s shared server that speaks to Claudia’s contributions.</p>
<p>Claudia has been a wonderful colleague to us all and we admire the quiet good work she does in any community she enters.</p>
<p>Congratulations Claudia – we will miss you!</p>
<p>(Stephan Abrams, Lynne Cameron and Ellen Meltzer contributed to this piece.)</p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=11695" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2012/05/18/cdl-departures-may-june-2012/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CDL Position: Manager, Infrastructure and Applications Support</title>
		<link>http://www.cdlib.org/cdlinfo/2012/03/12/cdl-position-manager-infrastructure-and-applications-support/</link>
		<comments>http://www.cdlib.org/cdlinfo/2012/03/12/cdl-position-manager-infrastructure-and-applications-support/#comments</comments>
		<pubDate>Mon, 12 Mar 2012 16:19:10 +0000</pubDate>
		<dc:creator>Ellen Meltzer</dc:creator>
				<category><![CDATA[Infrastructure Services]]></category>
		<category><![CDATA[Newsletter]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=11481</guid>
		<description><![CDATA[Are you excited about supporting the building and creative use of the world&#8217;s scholarship and knowledge?  Are you a strong IT technical manager who can meet the challenges of managing  ... <a href="http://www.cdlib.org/cdlinfo/2012/03/12/cdl-position-manager-infrastructure-and-applications-support/">More</a>...]]></description>
				<content:encoded><![CDATA[<p>Are you excited about supporting the building and creative use of the world&#8217;s scholarship and knowledge?  Are you a strong IT technical manager who can meet the challenges of managing complex relationships, keeping technologies up-to-date and working with developers to support multiple development, staging and production environments? </p>
<p><strong>About us</strong></p>
<p>We do more than put information online. At the California Digital Library (CDL), we provide technology and expertise to help the University of California collect, publish, access, and preserve its full range of information resources.  CDL takes a comprehensive approach to digital library development that balances experimentation with reliable production services strategy.  We pursue simple, economical solutions to big challenges.</p>
<p>We manage and continuously improve server, storage, and network infrastructure for application development, staging, and production activities through partnerships with state-of-the-art data centers, Amazon Web Services, and others.</p>
<p>Our open-access publishing platforms help the brightest minds easily disseminate their findings and ideas, putting technology to work for the producers and consumers of new knowledge.</p>
<p>We make interactive library systems work well together, so our users find and access the information they need as quickly as possible.</p>
<p>We are actively building services to manage and preserve digital information of all kinds—from images to raw data to the deepest websites. Our curatorial approach to digital information helps scholars build on past discoveries to create new knowledge.</p>
<p>Learn more at CDL’s <a href="http://www.cdlib.org/">website</a>.</p>
<p> <strong>Requirements</strong></p>
<ul>
<li>BS in computer science, information management or a related field and 10+ years working in production environments with UNIX-based operating systems &amp; software development that includes a minimum of five years in a supervisory capacity. Master&#8217;s degree preferred</li>
<li>5+ years information technology resource management experience including management of a team of high-level technical employees</li>
<li>Proven experience managing &amp; motivating skilled &amp; knowledgeable staff in a technical environment</li>
<li>Demonstrated experience negotiating arrangements leading to productive &amp; sustainable IT &#8220;partner relationships&#8221; with data centers, vendors, and external IT infrastructure providers</li>
<li>Demonstrated experience with technical project management</li>
</ul>
<p><strong>Preferred</strong></p>
<ul>
<li>Knowledge and technical competence in the digital library and higher education domain<strong></strong></li>
</ul>
<p> More information</p>
<p><a href="https://jobs.ucop.edu/applicants/Central?quickFind=55094">https://jobs.ucop.edu/applicants/Central?quickFind=55094</a></p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=11481" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2012/03/12/cdl-position-manager-infrastructure-and-applications-support/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Meet Virtual Martin</title>
		<link>http://www.cdlib.org/cdlinfo/2012/02/13/meet-virtual-martin/</link>
		<comments>http://www.cdlib.org/cdlinfo/2012/02/13/meet-virtual-martin/#comments</comments>
		<pubDate>Mon, 13 Feb 2012 18:57:24 +0000</pubDate>
		<dc:creator>Rachael Hu</dc:creator>
				<category><![CDATA[Newsletter]]></category>
		<category><![CDATA[Publishing]]></category>
		<category><![CDATA[Staff News]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=11021</guid>
		<description><![CDATA[ We here at the California Digital Library (CDL) are very  serious when it comes to all things digital. Some of our colleagues work  virtually so we’ve explored  ... <a href="http://www.cdlib.org/cdlinfo/2012/02/13/meet-virtual-martin/">More</a>...]]></description>
				<content:encoded><![CDATA[<p>
<img class="alignright size-medium" src="http://www.cdlib.org/images/misc/mar-tin-robot_300x300b.jpg" alt="Robot Mar-Tin" width="300" height="300" />We here at the California Digital Library (CDL) are very  serious when it comes to all things digital. Some of our colleagues work  virtually so we’ve explored the gamut of long-distance communication solutions.  The latest solution, a robot, has been so much fun for us all to experience  that we wanted to share it with folks outside our office. </p>
<p>Our colleague, Martin Haye, a Senior Developer who is the  mastermind behind some of CDL’s most important open source contributions, such  as the <a href="http://xtf.cdlib.org/">XTF search and display framework</a> and  the <a href="http://www.cdlib.org/services/publishing/escholarship.html">eScholarship  back-end systems</a> , has engineered a user-friendly robot that pretty much  puts him in our offices without physically having to be here.</p>
<p>Here’s what he told us…</p>
<p><strong>Q: How long have you been working virtually?</strong></p>
<p><strong>A:</strong> About four years now. I’ve worked at a distance from the  beginning of my time at CDL but at first it was just by email and phone.</p>
<p><strong>Q: You’ve been through a couple of different iterations that  I’ve seen- could you tell me how the idea for a virtual you got started?</strong></p>
<p><strong>A:</strong> Well, it started out because we moved to an office space that  had a really nice counter space and I wanted to take advantage of it. …I wanted  to put myself using Skype on a laptop on the counter, plus be portable enough  for people to carry me to meetings. But no one actually ever did that because  there were too many cords to deal with.</p>
<p><strong>Q: Have you noticed a difference in how people are reacting  to this iteration of virtual Martin over any of the other iterations?</strong></p>
<p><strong>A:</strong> Lots and lots of people have been talking to me about it.  I really like that I’ve been able to achieve the level of personal interaction  that I have dreamed of. Before people had to walk in to [my office space] to  see my video self or have a specific question to ask me. But now I have a  chance for hallway encounters. For example, I encountered a colleague in the  hallway and had an ad-hoc conversation that led to a 15-20 minute meeting. But  during that encounter laughs were exchanged and a relationship was renewed.</p>
<p><strong>Q: Do you have an official name for your new virtual self?</strong></p>
<p><strong>A:</strong> Well, I’m actually just going to say that I’m Martin and  if you want to call my medium something then you could say that it’s Martin’s  robot. I got the librarian cataloguing stamp of approval for this naming  convention. A colleague of ours offered to look up the proper naming convention  for such a setup in the AACR2 (Anglo-American Cataloguing Rules, Second  Edition) and we came away with the rule about spirits that channel through a  medium. Since I am the spirit of this virtual setup, the main name or entry  should be my own name and if someone wants to put a name to the robot medium it  would just be a secondary name or entry.</p>
<p><strong>Q: Can you tell me a bit about how you built this robot?</strong></p>
<p><strong>A:</strong> Well, it all started at Christmas a year ago when I got a  robot kit as a Christmas gift. It had an aluminum base with a motor and wheels.  But it took me over a year to design and implement the setup that you now see.  I wanted the robot to be tall enough so that I could talk with people at a  conference table and I wanted it to have a pretty substantial base so that it  wouldn’t tip over.  To make it tall  enough I created a shape like an oil derrick so that’s what you’d see if you  took the jacket off.  The power system  was the most challenging component and I had to experiment with a couple of  different solutions.  I wanted something  that would allow me to be mobile, reliable, and safe. One system design would  have been too much of an electrocution hazard. I finally settled on a hefty uninterruptible  power supply (ups). Next, I included a plain old monitor on the setup, and a  Mac Mini to run the video software that allows you to see me. Finally, I have  two cameras attached in front and back. The front camera has a fish eye lens  and can see 180 degrees both horizontally and vertically. The back camera is  pointed down and helps me when I want to back into my work space where I  connect the plug to the electrical outlet for charging in between my little  forays around the office.</p>
<p>Also, the video conferencing system is something that I’ve  programmed myself based on open source software.</p>
<p><strong>Q: What’s been the most challenging part of the robot? Have  you had any accidents?</strong></p>
<p><strong>A:</strong> Maneuvering is definitely a challenge. Sometimes I can’t  see very well and I bump into a lot of walls. One time I brushed a postcard off  of someone’s wall when they weren’t there and I couldn’t pick it up. I drove  back later to apologize and that was fun.</p>
<p>But for the most part it’s been an enormous success and a  really great way to interact with my colleagues.</p>
<p>Next time you’re at the CDL offices, be sure to stop by to  say hello to Martin.</p>
<p><strong>Photo credit: </strong><em>Craig Thompson</em></p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=11021" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2012/02/13/meet-virtual-martin/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Web Archiving Service Tech News &#8211; Code4Lib</title>
		<link>http://www.cdlib.org/cdlinfo/2012/02/06/web-archiving-service-tech-news-code4lib/</link>
		<comments>http://www.cdlib.org/cdlinfo/2012/02/06/web-archiving-service-tech-news-code4lib/#comments</comments>
		<pubDate>Tue, 07 Feb 2012 00:18:08 +0000</pubDate>
		<dc:creator>Tracy Seneca</dc:creator>
				<category><![CDATA[Technology]]></category>
		<category><![CDATA[Web Archiving]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=11126</guid>
		<description><![CDATA[Web Archiving Service developers Erik Hetzner and Scott Fisher will present at this year’s Code4Lib conference on the tools they used to re-index over 600 million files in the web  ... <a href="http://www.cdlib.org/cdlinfo/2012/02/06/web-archiving-service-tech-news-code4lib/">More</a>...]]></description>
				<content:encoded><![CDATA[<p>Web Archiving Service developers Erik Hetzner and Scott Fisher will present at this year’s Code4Lib conference on the tools they used to re-index over 600 million files in the web archives: “Indexing Big Data with Tika, Solr &amp; MapReduce”.   The session takes place on Wednesday, February 8th, 1:00-1:20 Pacific, and live streaming of Code4Lib will be available. </p>
<p>UPDATE: This was a great presentation, particularly for those of you curious to know more about the technical details of web archiving.  Slides and video will be linked from here when available. 2/13/12</p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=11126" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2012/02/06/web-archiving-service-tech-news-code4lib/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>CDL Security guidelines adopted and posted</title>
		<link>http://www.cdlib.org/cdlinfo/2011/10/27/cdl-security-guidelines-adopted-and-posted/</link>
		<comments>http://www.cdlib.org/cdlinfo/2011/10/27/cdl-security-guidelines-adopted-and-posted/#comments</comments>
		<pubDate>Thu, 27 Oct 2011 16:54:16 +0000</pubDate>
		<dc:creator>John Ober</dc:creator>
				<category><![CDATA[Newsletter]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=10776</guid>
		<description><![CDATA[By John Ober, Manager, Infrastructure and Applications Support The CDL has adopted and published a set of Information Technology Key Security Guidelines &#38; Baseline Supporting Practices, along with a companion  ... <a href="http://www.cdlib.org/cdlinfo/2011/10/27/cdl-security-guidelines-adopted-and-posted/">More</a>...]]></description>
				<content:encoded><![CDATA[<p>By John Ober, Manager, Infrastructure and Applications Support</p>
<p>The CDL has adopted and published a set of <a href="http://www.cdlib.org/about/it_sec_guidelines.html">Information Technology Key Security Guidelines &amp; Baseline Supporting Practices</a>, along with a companion <a href="http://www.cdlib.org/about/sysacc_acct_mgmt.html">System Access and Account Management Policy</a>. The guidelines are available at <a href="http://www.cdlib.org/about/policies.html">http://www.cdlib.org/about/policies.html</a>. Thanks are due to the CDL Tech Council and Director&#8217;s Cabinet, and the html markup of Eric Satzman’s web production group (notably Craig Thompson).</p>
<p>These security guidelines complement the March adoption of a privacy policy. Enumerating &#8220;departmental&#8221; level policies, guidelines, and baseline supporting practices for both privacy and IT security that complement existing Federal, State, and University policies allows the CDL to act consistently and confidently in these areas and to respond to related inquiries from the CDL&#8217;s constituencies and clients.</p>
<p>Continuing the trend, CDL Executive Director Laine Farley has charged a small group – John Ober, Lisa Schiff, Joan Starr, and Lena Zentall &#8211; to develop a departmental Business/Service Continuity and Disaster Recovery framework and implementation plan whose initial results should be available in February, 2012. As with the others, that effort will interpret and extend existing UC and UCOP policies and procedures (see <a href="http://www.ucop.edu/riskmgt/emergprep/">http://www.ucop.edu/riskmgt/emergprep/</a>).</p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=10776" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2011/10/27/cdl-security-guidelines-adopted-and-posted/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Reading 5 Million Books at Once: Google N-Grams at TEDxBoston</title>
		<link>http://www.cdlib.org/cdlinfo/2011/09/22/reading-5-million-books-at-once-google-n-grams-at-tedxboston/</link>
		<comments>http://www.cdlib.org/cdlinfo/2011/09/22/reading-5-million-books-at-once-google-n-grams-at-tedxboston/#comments</comments>
		<pubDate>Thu, 22 Sep 2011 23:40:02 +0000</pubDate>
		<dc:creator>jcolman</dc:creator>
				<category><![CDATA[Mass Digitization]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[cool stuff]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[digital humanities]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[google]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=10658</guid>
		<description><![CDATA[Google has digitized millions of books from libraries across the world, including the UC Libraries. While our digitized books are great for traditional reading and research, the corpus also offers  ... <a href="http://www.cdlib.org/cdlinfo/2011/09/22/reading-5-million-books-at-once-google-n-grams-at-tedxboston/">More</a>...]]></description>
				<content:encoded><![CDATA[<p>Google has digitized millions of books from libraries across the world, including the UC Libraries. While our digitized books are great for traditional reading and research, the corpus also offers a unique opportunity for new kinds of inquiry. For instance: what can we learn about the evolution of culture by analyzing the written record over time on a massive scale?  How can we quantify the change of languages over time?</p>
<p>In <a href="http://www.ted.com/talks/what_we_learned_from_5_million_books.html">this great TEDx talk</a>, two Harvard researchers (Jean-Baptiste Michel and Erez Lieberman Aiden) discuss the insights they&#8217;ve gleaned from the Google N-gram Viewer. From the creation of a metric for censorship to the orthography of frustration, the presentation introduces ideas that could spark great research in the digital humanities and elsewhere.  It&#8217;s fun to watch, too.</p>
<p>You can also <a href="http://ngrams.googlelabs.com/">try out your own N-grams</a>!</p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=10658" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2011/09/22/reading-5-million-books-at-once-google-n-grams-at-tedxboston/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>California Digital Library Announces Release of XTF Version 3.0</title>
		<link>http://www.cdlib.org/cdlinfo/2011/04/06/california-digital-library-announces-release-of-xtf-version-3-0/</link>
		<comments>http://www.cdlib.org/cdlinfo/2011/04/06/california-digital-library-announces-release-of-xtf-version-3-0/#comments</comments>
		<pubDate>Wed, 06 Apr 2011 21:37:47 +0000</pubDate>
		<dc:creator>Lisa Schiff</dc:creator>
				<category><![CDATA[Newsletter]]></category>
		<category><![CDATA[Publishing]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=9987</guid>
		<description><![CDATA[The California Digital Library (CDL) is pleased to announce the release of version 3.0 of XTF (http://xtf.cdlib.org/), an open source, highly flexible software application that supports the search, browse and  ... <a href="http://www.cdlib.org/cdlinfo/2011/04/06/california-digital-library-announces-release-of-xtf-version-3-0/">More</a>...]]></description>
				<content:encoded><![CDATA[<p>The California Digital Library (CDL) is pleased to announce the release of version 3.0 of <a href="http://xtf.cdlib.org/">XTF</a> (<a href="http://xtf.cdlib.org/">http://xtf.cdlib.org/</a>), an open source, highly flexible software application that supports the search, browse and display of heterogeneous digital content.  XTF provides efficient and practical methods for creating customized end-user interfaces for distinct digital content collections and is used by institutions worldwide. </p>
<p> Highlights from the 3.0 release include:</p>
<ul>
<li>Scanned book display support in default UI</li>
<li>Stability improvements to index rotation support</li>
<li>Globalization and RSS support</li>
<li>Further Unicode improvements</li>
<li>Many bug fixes</li>
</ul>
<p>See the full <a href="http://xtf.cdlib.org/documentation/changelog/">change log</a> (<a href="http://xtf.cdlib.org/documentation/changelog/">http://xtf.cdlib.org/documentation/changelog/</a>) for further details.</p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=9987" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2011/04/06/california-digital-library-announces-release-of-xtf-version-3-0/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>eXtensible Text Framework (XTF) Website Launched</title>
		<link>http://www.cdlib.org/cdlinfo/2010/09/29/extensible-text-framework-xtf-website-launched/</link>
		<comments>http://www.cdlib.org/cdlinfo/2010/09/29/extensible-text-framework-xtf-website-launched/#comments</comments>
		<pubDate>Wed, 29 Sep 2010 22:43:05 +0000</pubDate>
		<dc:creator>Elise Proulx</dc:creator>
				<category><![CDATA[eScholarship]]></category>
		<category><![CDATA[Newsletter]]></category>
		<category><![CDATA[Publishing]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[eXtensible Text Framework]]></category>
		<category><![CDATA[XTF]]></category>

		<guid isPermaLink="false">http://www.cdlib.org/cdlinfo/?p=8011</guid>
		<description><![CDATA[By Elise Proulx, Publishing Group Marketing &#38; Outreach Coordinator The Publishing Group of the California Digital Library (CDL) announces the launch of the eXtensible Text Framework (XTF) website (http://xtf.cdlib.org/), supporting  ... <a href="http://www.cdlib.org/cdlinfo/2010/09/29/extensible-text-framework-xtf-website-launched/">More</a>...]]></description>
				<content:encoded><![CDATA[<p>By Elise Proulx, Publishing Group Marketing &amp; Outreach Coordinator</p>
<p>The Publishing Group of the California Digital Library (CDL) announces the launch of the<strong> eXtensible Text Framework (XTF) website </strong><a title="XTF" href="http://xtf.cdlib.org/" target="_self">(http://xtf.cdlib.org/</a>), supporting a robust open-source application for providing access to digital content. Developed and maintained by the CDL, XTF functions as the primary access technology for the CDL’s digital collections and similar projects worldwide.</p>
<p>XTF excels in supporting rapid, customized application development and deployment. Its high degree of extensibility and performance (even for large documents and large collections) frees implementers to focus on building sophisticated presentations for their digital object collections.</p>
<p>“It&#8217;s all about balancing flexibility and ease of use: putting infinite customization ability in the hands of curators and scholars with a driving need to provide deep access to their special collections,” says XTF lead developer Martin Haye.</p>
<p>XTF-based applications range from primary source image collections to publishing platforms and archival finding aid repositories at the University of California and many other institutions, including Northwestern University, the University of Sydney (Australia), Indiana University, Visual Arkiv (Sweden), Durham University (UK), and the University of Virginia.</p>
<p>Implementations include:</p>
<p>•	CDL’s <strong>eScholarship </strong>(<a title="eScholarship" href="http://www.escholarship.org" target="_blank">http://www.escholarship.org</a>/), UC’s open access scholarly publishing platform, which publishes recent research from across the 10 campuses as well as nearly 40 UC-based scholarly journals<br />
•	CDL’s <strong>Online Archive of California</strong> (<a title="Online Archive of California" href="http://www.oac.cdlib.org/" target="_blank">http://www.oac.cdlib.org/</a>), a collection of more than 20,000 archival finding aids and 200,000 digital primary sources (images and texts) from more than 150 archives, libraries, and other institutions in the state of California<br />
•	<strong>The Encyclopedia of Chicago</strong> (<a title="The Encyclopedia of Chicago" href="http://encyclopedia.chicagohistory.org/" target="_blank">http://encyclopedia.chicagohistory.org/</a>), a collaboration between the Chicago  Historical Society, Northwestern University, and the Newberry Library<br />
•	<strong>Indiana University’s The Chymistry of Isaac Newton</strong> (<a title="The Chymistry of Isaac Newton" href="http://webapp1.dlib.indiana.edu/newton/" target="_blank">http://webapp1.dlib.indiana.edu/newton/</a>), a digital repository of transcriptions of Newton’s alchemical manuscripts</p>
<p>The new site serves as an expanded resource for programmers, librarians, and the general public to explore and implement the Java and XSLT 2.0-based framework. Features include:</p>
<p>•	<strong>XTF application download </strong>– full release or core updates to maintain customizations (<a title="XTF Download" href="http://xtf.cdlib.org/download/" target="_blank">http://xtf.cdlib.org/download/</a>)<br />
•	<strong>Documentation</strong>, including downloadable deployment guide, programming guide, and tag reference (<a title="XTF Documentation" href="http://xtf.cdlib.org/documentation" target="_blank">http://xtf.cdlib.org/documentation</a>)<br />
•	<strong>Video tutorials</strong> focusing on basic setup and customization of XTF (<a title="XTF Tutorials" href="http://xtf.cdlib.org/getting-started-tutorials" target="_blank">http://xtf.cdlib.org/getting-started-tutorials</a>)<br />
•	<strong>Example XTF implementations </strong>highlighting customized features. (<a title="XTF Implementations" href="http://xtf.cdlib.org/xtf" target="_blank">http://xtf.cdlib.org/xtf</a>)</p>
<p>For a full list of XTF’s features and benefits, as well as a technical overview, please visit <a title="About XTF" href="http://xtf.cdlib.org/about" target="_blank">http://xtf.cdlib.org/about</a> or address queries to Martin Haye at <a href="mailto:martin.haye@ucop.edu">martin.haye@ucop.edu</a>.</p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=8011" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2010/09/29/extensible-text-framework-xtf-website-launched/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>California Digital Library Announces Self-Guided Tutorial for the eXtensible Text Framework (XTF)</title>
		<link>http://www.cdlib.org/cdlinfo/2009/03/17/california-digital-library-announces-self-guided-tutorial-for-the-extensible-text-framework-xtf/</link>
		<comments>http://www.cdlib.org/cdlinfo/2009/03/17/california-digital-library-announces-self-guided-tutorial-for-the-extensible-text-framework-xtf/#comments</comments>
		<pubDate>Tue, 17 Mar 2009 17:09:57 +0000</pubDate>
		<dc:creator>Lisa Schiff</dc:creator>
				<category><![CDATA[Newsletter]]></category>
		<category><![CDATA[Publishing]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://cdlinfo.cdlib.org/blog/2009/03/17/california-digital-library-announces-self-guided-tutorial-for-the-extensible-text-framework-xtf/</guid>
		<description><![CDATA[<p>The California Digital Library (CDL) is pleased to announce the availability of an extensive self-guided tutorial for its eXtensible Text Framework (XTF) application.</p>]]></description>
				<content:encoded><![CDATA[<p><P>By Lisa Schiff, eScholarship Publishing  Program Technical Lead</P></p>
<p>The California Digital Library (CDL) is pleased to announce the  availability of an extensive self-guided tutorial for its <a href="http://xtf.cdlib.org/">eXtensible Text Framework (XTF)</a> application.&nbsp; XTF is an open source, highly customizable piece of software supporting the search,  browse, and display of heterogeneous digital content and offering efficient and practical methods  for creating customized end-user interfaces for distinct digital collections.&nbsp; The tutorial provides guidance for implementing and customizing  XTF, from core functionality to overall look and feel.<a href="http://sourceforge.net/project/showfiles.php?group_id=119724">Downloads for the Mac and Windows</a> operating systems are available from the <a href="http://sourceforge.net/projects/xtf/"> XTF Project page on SourceForge</a>, along with  the complete distribution and documentation.</p>
<p>The tutorial comes with a complete XTF package that is ready to run when uncompressed; no other installation is required.&nbsp; It contains nine modules spanning the most powerful and popular features, including how to: </p>
<ul>
<li>Add  new content</li>
<li>Change metadata</li>
<li>Change logo and colors</li>
<li>Increase significance of titles in ranking hits</li>
<li>Customize and enable default status of advanced search </li>
<li>Change fields displayed in search results</li>
<li>Enable structural searching</li>
<li>Create a hierarchical facet</li>
<li>Change footnote behavior</li>
</ul>
<p><strong>XTF Background and Overview</strong><br />
Since first developing and deploying this indexing and display technology in 2005, the CDL has worked to build and maintain XTF as a highly customizable application built upon tested components already in use by the digital library and search  communities &#8211; in particular the  Lucene text search engine, Java, XML, and XSLT.&nbsp; By coordinating these pieces in a single platform that can be used to create multiple unique applications, the CDL has succeeded in dramatically reducing the investment in infrastructure, staff training, and development for new digital content projects. </p>
<p>XTF offers the  following core features out of the  box:</p>
<ul>
<li><strong>Easy to deploy: </strong>Drops directly in to a Java application server such as  Tomcat or Resin; has been tested on Solaris, Mac, Linux, and Windows operating systems</li>
<li><strong>Easy to configure: </strong>Can create indexes on any XML element or attribute; entire  presentation layer is customizable via XSLT</li>
<li><strong>Robust: </strong>Optimized to perform  well on large documents (e.g., text that exceeds 10MB of encoded text); scales to perform well on collections of millions of documents; provides full Unicode  support</li>
<li><strong>Extensible:</strong></li>
<ul>
<li>Works well with a variety of authentication systems (e.g., IP  address lists, LDAP, Shibboleth)</li>
<li>Provides an interface for external data lookups to support  thesaurus-based term expansion, recommender systems, etc.</li>
<li>Can power other digital library services (e.g., XTF contains an  OAI-PMH data provider that allows others to harvest metadata, and an SRU  interface that exposes searches to federated search engines)</li>
<li>Can be deployed as separate, modular pieces of a third-party  system </li>
</ul>
<li><strong>Powerful for the end user: </strong><strong> </strong></li>
<ul>
<li>Spell checking of queries<strong></strong></li>
<li>Faceted displays for browsing<strong></strong></li>
<li>Dynamically updated browse lists<strong></strong></li>
<li>Session-based bookbags<strong></strong></li>
</ul>
</ul>
<p>A sampling of XTF-based applications  include:</p>
<ul>
<li><a href="http://www.marktwainproject.org/">Mark Twain Project Online</a> (<a href="http://www.marktwainproject.org/">http://www.marktwainproject.org</a>),  developed by the Mark Twain Papers  Project, the CDL and the University  of California Press.</li>
<li><a href="http://www.calisphere.org/">Calisphere</a> (<a href="http://www.calisphere.org/">http://www.calisphere.org/</a>), developed by  the CDL. </li>
<li><a href="http://www.encyclopedia.chicagohistory.org/">The Encyclopedia of Chicago</a> (<a href="http://www.encyclopedia.chicagohistory.org/">http://www.encyclopedia.chicagohistory.org/</a>)</li>
<li><a href="http://webapp1.dlib.indiana.edu/newton/">The Chymistry of Isaac Newton</a> (<a href="http://webapp1.dlib.indiana.edu/newton/">http://webapp1.dlib.indiana.edu/newton/</a>)  and <a href="http://webapp1.dlib.indiana.edu/swinburne/www/swinburne/">The Swinburne  Project</a> (<a href="http://webapp1.dlib.indiana.edu/swinburne/www/swinburne/">http://webapp1.dlib.indiana.edu/swinburne/www/swinburne/</a>)</li>
<li><a href="http://labs.nypl.org/2007/10/30/extensible-text-framework-xtf/">Finding  Aids at the New York Public Library</a> (<a href="http://labs.nypl.org/2007/10/30/extensible-text-framework-xtf/">http://labs.nypl.org/2007/10/30/extensible-text-framework-xtf/</a>)</li>
<li><a href="http://sunsite2.berkeley.edu:8088/xtf/servlet/org.cdlib.xtf.crossQuery.CrossQuery?rmode=btr">EECS  Technical Reports</a> (<a href="http://sunsite2.berkeley.edu:8088/xtf/servlet/org.cdlib.xtf.crossQuery.CrossQuery?rmode=btr">http://sunsite2.berkeley.edu:8088/xtf/servlet/org.cdlib.xtf.crossQuery.CrossQuery?rmode=btr</a>)</li>
</ul>
<p>For more  information, visit <a href="http://xtf.cdlib.org/ ">http://xtf.cdlib.org/ </a>.</p>
 <img src="http://www.cdlib.org/cdlinfo/?feed-stats-post-id=212" width="1" height="1" style="display: none;" />]]></content:encoded>
			<wfw:commentRss>http://www.cdlib.org/cdlinfo/2009/03/17/california-digital-library-announces-self-guided-tutorial-for-the-extensible-text-framework-xtf/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
