How to make ImmPort data fit for secondary use Barry Smith

Similar presentations

Presentation on theme: "How to make ImmPort data fit for secondary use Barry Smith"— Presentation transcript:

1
How to make ImmPort data fit for secondary use Barry Smith http://ontology.buffalo.edu/smith

2
Goals of ImmPort Accelerate a more collaborative and coordinated research environment Create an integrated database that broadens the usefulness of scientific data Advance the pace and quality of scientific discovery Integrate relevant data sets from participating laboratories, public and government databases, and private data sources Promote rapid availability of important findings Provide analysis tools to advance immunological research

3
Improve immunology research through enhanced Collaboration Coordination Discoverability Integration Analyzability Hypothesis: all of these ends will be promoted by describing ImmPort data using terms from shared high quality ontologies

4
ImmPort data is already being tagged with ontology terms For example where data is prepared to meet FDA requirements where data is published to meet NIH mandates for reusability in the post-submission phase, where data is analyzed by third parties But this tagging is partial uncoordinated uses ontologies and analysis tools of varying quality

39
Two kinds of definitions human readable definitions support consistency of data entry logical definitions – allow logical analysis of data – support aggregation of data – allow automatic validation of consistent data entry Definitions can often be taken over from already existing public domain ontologies such as GO use of ready-made definitions supports discoverability, and creates automatic linkage to huge bodies of public domain data

55
NIAID Sample Data Sharing Plan (Last Reviewed February 12, 2013) Sharing of data generated by this project is an essential part of our proposed activities and will be carried out in several different ways. Presentations at national scientific meetings. … it is expected that approximately four presentations at national meetings would be appropriate. … Annual lectureship. A lectureship has brought to the University distinguished scientists and clinicians … Newsletter. The [disease interest group] publishes a newsletter … Web site of the Interest Group. The [interest group] currently maintains a Web site where information [about the disease] is posted … Annual [Disease] Awareness week…. SAGE Library Data. It is our explicit intention that these [Serial analysis of gene expression] data will be placed in a readily accessible public database. …Serial analysis of gene expression

56
NIAID Sample Data Sharing Plan SAGE Library Data. It is our explicit intention that these [Serial analysis of gene expression] data will be placed in a readily accessible public database. …Serial analysis of gene expression – but how will these data be described?

57
Proposal All data sharing plans for NIAID-funded research should require: paper abstracts and SDY summaries be tagged with ontology terms tables and figures in papers be tagged with ontology terms