7.x to CLAW Migration Sprint - Complete!

Submitted by dlamb on Tue, 09/04/2018 - 15:49

The Islandora community has just wrapped up a very successful sprint dedicated to migrating from 7.x to Islandora CLAW. We at the Islandora Foundation want to give a big thanks to everyone who put in time during this sprint, as well as the organizations who lent us their talent on the company dime. We also want to give a special shout out to the Metadata Interest Group, who collectively put in a ton of time and tackled some intense questions for those who want to use a migration to Islandora CLAW as a chance to do metadata cleanup. During the course of two weeks, we managed to accomplish a lot. As of right now you can:

Migrate over objects based on content type

Migrate ALL the datastreams (except AUDIT, which is a special case)

Extract metadata from any XML datastream and make it a Drupal field

Model authorities such as people, organizations, and subjects

Convert MODS to CSV using Cara Key's (LSU) XML2CSV tool

There's still some work left to do, though. On the horizon for the near term, be on the look out for:

Migrating the AUDIT datastream

Modeling more/different types of authorities

Examples of extracting authorities from FOXML

A workflow for those who want to use OpenRefine to reconcile linked data authorities during the migration process

Moving forward, this is an excellent chance for people to try out the tools we're developing and point them at their existing repositories. Our migration tool, originally developed by Jared Whiklo (University of Manitoba), is available on Github. And if you want to give modeling authorities a go, check out our new controlled_access_terms module, which was made by Seth Shaw (University of Nevada Las Vegas). If anyone has feedback/issues/questions, please feel free to create an issue or post a message on the mailing list.
Here's a full list of all the people and organizations who helped make this once-considered-impossible feat a reality:

Benjamin Rosner - Barnard Collge, CU

Pat Dunlavey - Born-Digital

Andrija Sagic - Library "Milutin Bojic"

Ann McShane - Library Company of Philadelphia

Cara Key - Louisiana State University

Jason Peak - Louisiana State University

Jonathan Green - LYRASIS

Rachel Leach - Mount Holyoke College

Mark Jordan - Simon Fraser University

Adam Soroka - Smithsonian Institution

Rachel Tillay - Tulane University

Pete Clarke - University College Dublin

Jared Whiklo - University of Manitoba

Mike Bolam - University of Pittsburgh

Seth Shaw - University of Nevada Las Vegas

Paul Pound - University of Prince Edward Island

Rosie Le Faive - University of Prince Edward Island

Nat Kanthan - University of Toronto Scarborough

Marcus Barnes - University of Toronto Scarborough

Carolyn Moritz - Vassar College

Thanks to everyone involved! And if you missed out on this sprint, don't fret. We'll be holding another Islandora CLAW community sprint later this year after Islandora 7.x-1.12 is released.