Scientists to assemble 'knowledgebase' on plants, microbes, to aid US biofuel, environment efforts

Jul 15, 2011

Combining information about plants, microbes, and the complex biomolecular interactions that take place inside these organisms into a single, integrated “knowledgebase” will greatly enhance scientists’ ability to access and share data, and use it to improve the production of biofuels and other useful products.

In the decade that has passed since the completion of the first draft sequence of the human genome, biologists have grown increasingly aware of a problem ironically generated by the success of their work. Biological experiments in the age of genomics -- including DNA sequencing, gene expression profiles, studies of cell-signaling pathways, protein binding, and other information-rich inquiries -- generate quantities of raw data so immense that they threaten to overwhelm researchers' ability to make sense of them.

Two Cold Spring Harbor Laboratory (CSHL) investigators are among the leaders of a multi-institutional effort announced this week by the U.S. Department of Energy (DOE) to address the problem in one particular area of research involving plant and microbial life. The team has been awarded funding to create out of many separate streams of biological information a single, integrated cyber-"knowledgebase" (called Kbase, for short) focused specifically on these two fundamentally important forms of life.

A knowledgebase is an essential tool of systems biology  an approach to the study of life that depends on integrating multiple information types and bringing them into meaningful relation, providing a basis to measure and model biological activity within an organism or across groups of organisms. A particularly exciting aspect of the project is that it will enable scientists to discover currently unknown relationships that exist between species and between groups of species and the surrounding environment  interrelated and interdependent communities of microbes and plants, in this case.

"In contrast to a conventional database, a knowledgebase is really an entire body of knowledge," explains Doreen Ware, Ph.D., of the U.S. Department of Agriculture and a CSHL Adjunct Associate Professor. "In Kbase we will focus on a specific assortment of plants and microbes that the Energy Department hopes to exploit to produce biofuels, to sequester carbon in the ecosystem, and to clean up environmental pollution." Ware has been named principal investigator of the portion of Kbase devoted to plant life.

Quantitative biologist Michael Schatz, Ph.D., a CSHL Assistant Professor, is a co-investigator on Kbase whose work explains a key dimension of the project. "It's not as if we have been asked to go out and grow or collect plants and microbes," he says. "What we've really been challenged to do by the Department of Energy is to find ways of integrating different kinds of data and different kinds of tools that can be used to analyze those data."

Schatz offers the analogy of Google, which enables anyone with internet access "to tap into all of human activity, all of human knowledge," to the extent it has been recorded in digital form. Today, he notes, there is no portal like Google for scientists who work with plants and microbes. "There are many different 'silos' of information that have been painstakingly collected; and there are a number of existing tools that bring some strands of data into relation. But there is no overarching tool that can be used across silos," Schatz says.

"We think by creating such a collection of tools and data sources, we're going to be able to facilitate question-asking about huge datasets. It is our hope that this will help us make progress on improved ways to generate biofuels or on how to get the maximum yield out of plants even when the climate is very hot, dry, or wet. All of this knowledge is extractable from data that has already or is now being generated. The challenge is how, in a sense, to liberate it, so it can be put to use."

Thanks to the power of cloud computing, scientists across institutions will be able to query Kbase in a highly flexible fashion, and on a democratized basis, since Kbase will be accessible to scientists everywhere. This will eliminate the need for science teams to separately gather and store essentially similar data sets, as a condition for conducting experiments.

The entire Kbase effort, spanning plants, microbes, and metacommunities (microbes in the context of the vast communities in which they live, both in the environment and within other living things) will be led by Adam Arkin of Lawrence Berkeley National Laboratory. Co-principal investigators include Rick Stevens of Argonne National Laboratory, Robert Cottingham of Oak Ridge National Laboratory, and Sergei Maslov of Long Island's Brookhaven National Laboratory, who, in concert with CSHL's Ware, will be deeply involved in the plant section of Kbase.

Related Stories

Scientists may gain a new insight into the relationship between viruses and their environments thanks to a new computational technology developed by researchers at the U.S. Department of Energy's Argonne National Laboratory. ...

Today sees the launch of Ensembl Plants - a freely available web resource for plant genomics research - by the European Molecular Biology Laboratory's European Bioinformatics Institute (EMBL-EBI), in partnership with the ...

(PhysOrg.com) -- A four-year, multi-institutional effort co-led by three Cold Spring Harbor Laboratory scientists culminated today in publication of a landmark series of papers in the journal Science reveal ...

The emerging field of metagenomics, where the DNA of entire communities of microbes is studied simultaneously, presents the greatest opportunity -- perhaps since the invention of the microscope -- to revolutionize understanding ...

Scientists have sequenced and compared the genomes of planktonic microbes living throughout the water column in the Pacific Ocean. The pioneering study yielded insight into the specialization of microbial communities ...

(PhysOrg.com) -- Anyone cracking open a cold beer is probably not considering the wastewater left over after the beer was brewed. But for Cornell researchers, that vinegary effluent is a scientific playground ...

Recommended for you

(Phys.org)—There is no mistaking the first action potential you ever fired. It was the one that blocked all the other sperm from stealing your egg. After that, your spikes only got more interesting. Waves ...

Male reproductive organ development in maize involves a complex array of ribonucleic acid molecules (RNAs) with potentially diverse activities in gene regulation, demonstrated by new research from the University ...

Once fat cells form, they might shrink during weight loss, but they do not disappear, a fact that has derailed many a diet. Yale researchers in the March 2 issue of the journal Nature Cell Biology descri ...

It has generally been believed that microRNAs control biological processes by simultaneously, though modestly, repressing a large number of genes. But in a study published in Developmental Cell, a group ...

Modern biology has attained deep knowledge of how cells work, but the mechanisms by which cellular structures assemble and grow to the right size largely remain a mystery. Now, Princeton University researchers ...