Center for Informatics Research in Science and Scholarshiphttp://hdl.handle.net/2142/8980
Research and publications from the Center for Informatics Research in Science and ScholarshipTue, 20 Mar 2018 02:22:52 GMT2018-03-20T02:22:52ZCenter for Informatics Research in Science and Scholarshiphttps://www.ideals.illinois.edu:443/bitstream/id/194657/http://hdl.handle.net/2142/8980
Disambiguating Descriptions: Mapping Digital Special Collections Metadata into Linked Open Data Formatshttp://hdl.handle.net/2142/96215
Disambiguating Descriptions: Mapping Digital Special Collections Metadata into Linked Open Data Formats
Jett, Jacob; Cole, Timothy W.; Han, Myung-Ja K.
In this poster we describe the Linked Open Data (LOD) for Digital Special Collections project at the University of Illinois at Urbana-Champaign and describe some of the particular challenges that legacy metadata poses for representation in LOD formats. LOD formats are primarily based on the World Wide Web Consortium’s Resource Description Framework standard which demands both that entities be named by opaque universal identifiers whenever possible but also that metadata descriptions for entities be as unambiguous as possible. The challenges for disambiguating those descriptions are illustrated through examples drawn from digital special collections based at four different digital libraries
Linked Open Data; special collections; digital special collections; metadata
Fri, 01 Jan 2016 00:00:00 GMThttp://hdl.handle.net/2142/962152016-01-01T00:00:00ZJett, JacobCole, Timothy W.Han, Myung-Ja K.Therkildsen Field Station at Emiquon: Data Management Beginning in 2013http://hdl.handle.net/2142/94951
Therkildsen Field Station at Emiquon: Data Management Beginning in 2013
Baker, Karen S.
Attention to data management at Therkildsen Field Station at Emiquon (TFSE) began in 2013. The University of Illinois Springfield built this research field station by the Illinois River between Lewistown and Havana on land owned by The Nature Conservancy called Emiquon Preserve. TFSE became a member of a partnership of university, government, and non-profit organizations that supports research activities including floodplain restoration. An ethnographic investigation studied data practices and the introduction of data management at the station as well as at Emiquon Preserve. Site history and data-related topics based on interviews of participants are presented. Themes relating to data activities are presented along with observations of existing digital infrastructure arrangements. This case study provides an example of data management planning by scientific research participants for a site-based, multi-partner community.
Emiquon; data management; database; field station; ecology; The Nature Conservancy
Thu, 01 Dec 2016 00:00:00 GMThttp://hdl.handle.net/2142/949512016-12-01T00:00:00ZBaker, Karen S.Taxonomic Work as Information Work: Design for Semantic Refactoringhttp://hdl.handle.net/2142/88917
Taxonomic Work as Information Work: Design for Semantic Refactoring
Thomer, Andrea K.; Twidale, Michael B.; Guo, Jinlong; Yoder, Matthew J.
Taxonomy is the branch of science concerned with classi- fying organisms: drawing the line between cats and dogs, fish and fowl, animals and vegetables. Modern taxonomic work is built on a hundreds-year-old tradition of qualitative research and description. There are aspects of this work that illustrate the pervasiveness and difficulty of a particular kind of qualitative data wrangling, which we call semantic refactoring: the review, normalization, and re-engineering of semantic structures. Because taxonomic work is con- ducted over long time spans, the processes underlying se- mantic refactoring become more visible. An examination of taxonomic data practices may inform our understanding of how (and if) collections of qualitative data scale, particularly when collaboratively created.
Scientific workflows; qualitative data; taxonomy; human- information interaction; biodiversity informatics; qualitative data; ontologies; classification
Sun, 28 Feb 2016 00:00:00 GMThttp://hdl.handle.net/2142/889172016-02-28T00:00:00ZThomer, Andrea K.Twidale, Michael B.Guo, JinlongYoder, Matthew J.Workset Creation for Scholarly Analysis: Recommendations and Prototyping Project Reportshttp://hdl.handle.net/2142/88864
Workset Creation for Scholarly Analysis: Recommendations and Prototyping Project Reports
Downie, J. Stephen; Cole, Timothy; Senseney, Megan
This document assembles and describes the outcomes of the four prototyping projects undertaken as part of the Workset Creation for Scholarly Analysis (WCSA) research project (2013 – 2015). Each prototyping project team provided its own final report. These reports are assembled together and included in this document. Based on the totality of results reported, the WCSA project team also provide a set of overarching recommendations for HTRC implementation and adoption of research conducted by the Prototyping Project teams. The work described here was made possible through the generous support of The Andrew W. Mellon Foundation (Grant Ref # 21300666).
HathiTrust Research Center; worksets
Tue, 22 Dec 2015 00:00:00 GMThttp://hdl.handle.net/2142/888642015-12-22T00:00:00ZDownie, J. StephenCole, TimothySenseney, MeganPace of Change: A Preliminary YesWorkflow Case Studyhttp://hdl.handle.net/2142/88856
Pace of Change: A Preliminary YesWorkflow Case Study
Senseney, Megan
YesWorkflow (YW) is a grassroots initiative that “aims to provide a number of the benefits of using a scientific workflow management system without having to rewrite scripts and other scientific software.” YW represents an improvement to workflow modeling by allowing scientists and scholars to bypass traditional workflow engines by utilizing a small suite of modeling annotations that can be integrated as comments directly into an existing script. To date, YesWorkflow is still in its prototyping and development phase, but a handful of early adopters have already begun piloting the tool in a variety of scientific domains. To test the potential of utilizing YesWorkflow in a digital humanities context, this case study explores Professor Ted Underwood’s research outlined in the 2015 article “How Quickly do Literary Standards Change”, co-authored with Jordan Sellers and posted to FigShare with a link to the project’s GitHub repository.
provenance; digital humanities; case study
Wed, 27 Jan 2016 00:00:00 GMThttp://hdl.handle.net/2142/888562016-01-27T00:00:00ZSenseney, MeganBibliographic Ontologies Comparative Features Datasethttp://hdl.handle.net/2142/88356
Bibliographic Ontologies Comparative Features Dataset
Nurmikko-Fuller, Terhi; Fallaw, Colleen; Jett, Jacob; Page, Kevin R.; Cole, Timothy W.; Maden, Christopher; Willcox, Pip; Senseney, Megan; Downie, J. Stephen
This spreadsheet provides a column-wise comparison of four leading RDF-based bibliographic vocabularies: MODS/MADS RDF, BIBFRAME, Schema.org and FRBRoo. The data demonstrates areas of alignment among the ontologies with regards to equivalent entities, properties, or groupings thereof.
MADS/MODS; FRBRoo; BIBFRAME; Schema.org; Ontology; Bibliographic Descriptions; Datasets
Thu, 29 Oct 2015 00:00:00 GMThttp://hdl.handle.net/2142/883562015-10-29T00:00:00ZNurmikko-Fuller, TerhiFallaw, ColleenJett, JacobPage, Kevin R.Cole, Timothy W.Maden, ChristopherWillcox, PipSenseney, MeganDownie, J. StephenCapisco: Semantic Analysis of Documents from the HathiTrust Corpushttp://hdl.handle.net/2142/79023
Capisco: Semantic Analysis of Documents from the HathiTrust Corpus
Hinze, Annika; Taube-Schock, Craig; Cunningham, Sally Jo; Bainbridge, David
The Capisco project developed a suite of tools that analyze documents by the semantics of their content and metadata. Clustering documents by semantic similarity opens a wealth of opportunities for scholarly research.The project was designed in close collaboration with two humanities scholars, from the areas of Maori & Pacific Studies and Historical Anthropology, who provided ongoing input and feedback during the development process. This report was submitted to the Workset Creation for Scholarly Analysis: Prototyping Project, funded by the Andrew W. Mellon Foundation.
semantic analysis; HathiTrust; HathiTrust Research Center
Mon, 15 Jun 2015 00:00:00 GMThttp://hdl.handle.net/2142/790232015-06-15T00:00:00ZHinze, AnnikaTaube-Schock, CraigCunningham, Sally JoBainbridge, DavidDistributed Metadata Correction and Annotationhttp://hdl.handle.net/2142/79021
Distributed Metadata Correction and Annotation
Muñoz, Trevor
The Distributed Metadata Correction and Annotation project, led by the Maryland Institute for Technology in the Humanities (MITH), proposed to develop a set of services and interfaces that would allow scholarly research teams to pull metadata records from the HathiTrust APIs, correct and annotate these records using standardized vocabularies, gather corrections and annotations from other application instances, and export them in formats suitable for publication as linked data. MITH also proposed to produce a demonstration of an index service that would allow research groups to register their data publications in order to make them available to other groups through a discovery interface. This report was submitted to the Workset Creation for Scholarly Analysis: Prototyping Project, funded by the Andrew W. Mellon Foundation.
metadata; annotation; HathiTrust; HathiTrust Research Center; Foreign Literatures in America
Mon, 15 Jun 2015 00:00:00 GMThttp://hdl.handle.net/2142/790212015-06-15T00:00:00ZMuñoz, TrevorVisualPage: Workset Creation through Image Analysis of Document Pageshttp://hdl.handle.net/2142/79020
VisualPage: Workset Creation through Image Analysis of Document Pages
Biggers, Keith; Audenaert, Neal; Houston, Natalie M.
The VisualPage project has developed a software application that uses the visual characteristics of digitized printed pages to identify documents that contain three types of visually distinctive materials of interest to humanities researchers: poetry, music, and illustrations. The prototype demonstrates the value of using visual analysis of document images in conjunction with more traditional textual analysis to enable scholars to ask more refined questions about texts and their physical manifestations. This report was submitted to the Workset Creation for Scholarly Analysis: Prototyping Project, funded by the Andrew W. Mellon Foundation.
visual page features; HathiTrust; HathiTrust Research Center
Mon, 15 Jun 2015 00:00:00 GMThttp://hdl.handle.net/2142/790202015-06-15T00:00:00ZBiggers, KeithAudenaert, NealHouston, Natalie M.ElEPHãT: Early English Print in the HathiTrust, a Linked Semantic Worksets Prototypehttp://hdl.handle.net/2142/79017
ElEPHãT: Early English Print in the HathiTrust, a Linked Semantic Worksets Prototype
Page, Kevin; Willcox, Pip
The ElEPHãT project – Early English Print in HathiTrust, a Linked Semantic Worksets Prototype – demonstrates the use of Linked Data for combining, through worksets, information from independent collections into a coherent view which can be studied and analyzed to facilitate and improve academic investigation of the constituents. This report was submitted to the Workset Creation for Scholarly Analysis: Prototyping Project, funded by the Andrew W. Mellon Foundation.
linked data; Semantic Web; EEBO-TCP; HathiTrust; HathiTrust Research Center
Mon, 15 Jun 2015 00:00:00 GMThttp://hdl.handle.net/2142/790172015-06-15T00:00:00ZPage, KevinWillcox, Pip