Slideshare uses cookies to improve functionality and performance, and to provide you with relevant advertising. If you continue browsing the site, you agree to the use of cookies on this website. See our User Agreement and Privacy Policy.

Slideshare uses cookies to improve functionality and performance, and to provide you with relevant advertising. If you continue browsing the site, you agree to the use of cookies on this website. See our Privacy Policy and User Agreement for details.

2.
System requirements
• Insertion and retrieval of data has to be done quickly and easily
• Should be possible to export the data so it can be analyzed with other informatics
systems
• Should support statistical assessments
• Have user-friendly visualization capabilities
• Controlled access to data, based on user roles, accounting for data privacy issues
• Easy dissemination of related studies and results
• Always online (web-based)
• Help finding additional information about the microorganisms present in the biological
samples

3.
Overview of the workflow of field and lab work
PROTOFILWWPROTOFILWW

13.
Major Text Mining technologies used
• Lucene is a high-performance text search engine
library.
• Solr is a standalone enterprise search server with a
REST-like API
• UIMA is a powerful infrastructure for the storage,
transport, and retrieval of document and annotation
knowledge accumulated in NLP pipeline systems
• LINNAEUS is a popular organism name identification
system for biomedical literature that is capable of
normalizing to unambiguous NCBI taxonomy identifiers

20.
Preventive Medicine
 Alert the user to the risk of Type 2 Diabetes.
 How?
1. We know the user has a gene mutation associated with Type 2
Diabetes, because he gave us is genome!
2. We know what he has eaten, because he told us!
3. We know what exercise he’s been doing, because he told us!
4. Genehome connects the dots!