Summary: MEGAHIT is an NGS de novo assembler for assembling large and complex metagenomics data in a time- and cost-efficient manner. It finished assembling a soil metagenomics dataset with 252 Gbps in 44.1 and 99.6 h on a single computing node with and without a graphics processing unit, respectively. MEGAHIT assembles the data as a whole, i.e. no pre-processing like partitioning and normalization was needed. When compared with previous methods on assembling the soil data, MEGAHIT generated an assembly three times larger, with longer contig N50 and average contig length; furthermore, 55.8% of the reads were aligned to the assembly, giving a fourfold improvement.

-

dc.language

eng

-

dc.relation.ispartof

Bioinformatics

-

dc.rights

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

-

dc.rights

This is a pre-copy-editing, author-produced PDF of an article accepted for publication in [Bioinformatics] following peer review. The definitive publisher-authenticated version [2015, v. 31 n. 10, p. 1674-1676] is available online at: http://dx.doi.org/10.1093/bioinformatics/btv033

-

dc.title

MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph