As of December 2013 this is no longer the public-facing home page for the Broad GDAC Firehose pipeline. Please visit gdac.broadinstitue.org for the Broad GDAC Firehose home page.

Born of the desire to systematize analyses from The Cancer Genome Atlas pilot and scale their execution to the dozens of remaining diseases to be studied, Firehose now sits atop ~40 terabytes of TCGA data and reliably executes more than 6000 pipelines per month.

The Broad Institute TCGA GDAC Firehose Provides

Version-stamped, standardized datasets

Precursor to automated analyses: aggregates all available sample batches into a single, uniformly-formatted bolus (one per disease X datatype), which can be immediately fed to algorithmic codes without further data preparation