digitization – California Digital Libraryhttp://www.cdlib.org/cdlinfo
The Official CDL BlogThu, 08 Dec 2016 22:53:27 +0000en-UShourly1https://wordpress.org/?v=4.7Merritt Service Update: April – May 2012 http://merritt.cdlib.orghttp://www.cdlib.org/cdlinfo/2012/06/05/merritt-service-update-april-may-2012-httpmerritt-cdlib-org/
Tue, 05 Jun 2012 15:43:30 +0000http://www.cdlib.org/cdlinfo/?p=11750More...]]>Merritt Service Description Merritt is a production level service that provides the UC community with an easy to use tool to manage, archive, and share their content. Content can be deposited and managed via a user-interface or an API.

• In coordination with the UCLA Library and DiscoveryGarden, we are planning to move forward on work to integrate Islandora with Merritt. UCLA will be conducting the development work using a forthcoming Islandora API, with consulting, testing, and project support from DiscoveryGarden and CDL. Islandora is an open source digital asset management system currently under evaluation for implementation by the UCLA Library. We are seeking to deploy Merritt as the preservation storage layer under Islandora’s Drupal-based system, in place of Fedora, which usually fills that role. We will prepare documentation, to assist other Islandora implementers with this process, once the project is completed.

• Work is in-progress on enhancements to the Merritt user interface, to support public access to Merritt collections. This has been identified as the top priority for Merritt development. The designation of collections and/or objects to be exposed publicly is performed by providers, based on local policy decisions. Merritt curators will be able to designate their collections publicly accessible, and users will have direct access to materials stored in Merritt. We will hold a webinar to demo this new functionality, once it is available.

• We are documenting how UC campus libraries are utilizing or planning to integrate Merritt within local workflows. Brief case studies — including recent profiles of UC Santa Barbara, UC San Francisco, and UC Santa Cruz’s use of Omeka with other systems — are featured on our UC3 Curation wiki. • The UC Irvine Libraries are now submitting the content from their DSpace repository, called UCISpace http://ucispace.lib.uci.edu/. These UCISpace collections include a number of resources from the Libraries’ special collections and archives. Content will be submitted via the Merritt API. Special thanks to the UC Irvine Libraries Digital Scholarship Service team, and to Matthew McKinley for his work to connect these two systems.

• We are in the process of contracting with the San Diego Supercomputer Center to utilize their cloud storage service. This will allow for further cost-savings and will extend the replication of content stored in Merritt.

• We are continuing work on our self-audit of the Merritt repository, based upon the Trustworthy Repository Audit Certification (TRAC) checklist. Information about policies and practices is being posted on the TRAC pages on our UC3 Curation wiki <https://confluence.ucop.edu/display/Curation/TRAC> and we encourage feedback and comments from the community.

• We implemented a number of upgrades to our Ruby on Rails web application framework, which underlies a number of Merritt features and functions, and also added patches to our indexing system.

• DCXL project, sponsored by Microsoft Research and the Gordon and Betty Moore Foundation, will enable the preservation and sharing of research data via Microsoft Excel. Merritt will be a storage node for this project, allowing researchers to save, share and preserve their data in Merritt. We have been working with developers at Microsoft to permit the submission of Excel spreadsheets to Merritt. You can read more about the DCXL project at http://dcxl.cdlib.org/?p=692

• We have staged collections that were formerly in the Digital Preservation Repository (DPR), for migration to Merritt. This is in preparation of decommissioning the legacy DPR system. We have contacted clients with collections in the DPR, to confirm whether or not they would like us to migrate their collections forward. Please contact us with any questions about this migration.

]]>Reading 5 Million Books at Once: Google N-Grams at TEDxBostonhttp://www.cdlib.org/cdlinfo/2011/09/22/reading-5-million-books-at-once-google-n-grams-at-tedxboston/
Thu, 22 Sep 2011 23:40:02 +0000http://www.cdlib.org/cdlinfo/?p=10658More...]]>Google has digitized millions of books from libraries across the world, including the UC Libraries. While our digitized books are great for traditional reading and research, the corpus also offers a unique opportunity for new kinds of inquiry. For instance: what can we learn about the evolution of culture by analyzing the written record over time on a massive scale? How can we quantify the change of languages over time?

In this great TEDx talk, two Harvard researchers (Jean-Baptiste Michel and Erez Lieberman Aiden) discuss the insights they’ve gleaned from the Google N-gram Viewer. From the creation of a metric for censorship to the orthography of frustration, the presentation introduces ideas that could spark great research in the digital humanities and elsewhere. It’s fun to watch, too.

]]>Scripps Institution of Oceanography Library Finishes Digitizationhttp://www.cdlib.org/cdlinfo/2010/05/20/5239/
Thu, 20 May 2010 21:28:09 +0000http://www.cdlib.org/cdlinfo/?p=5239More...]]>Working together with CDL and Google, UCSD has finished digitizing nearly 100,000 volumes from the collections of the Scripps Institution of Oceanography Library – the world’s largest oceanographic library. Included are works on marine biology, climate science, ecology, geology, and many other subjects. The collection includes scientific expedition reports from as far back as the 18th century, many of which were previously unavailable in digital form. We’re very excited to have these wonderful materials join our other UC collections in HathiTrust.

The Scripps Library is one of many UC collections that have been digitized recently. Among others, thousands of books from the East Asian library at UCLA, the Science and Engineering library at UCSC, and the great variety of materials at NRLF are going online every month. The Mass Digitization group at CDL continues to coordinate and facilitate this work, and we are constantly surprised by its depth and breadth. You’ll find more information at Google and UCSD, and keep an eye out further updates from CDL on the next UC collections going digital!