Table 2. Assembly statistics at various steps during processing.
Contig grouping | No. of contigs | N50a | Total sequence (bp) |
---|---|---|---|
Megahit assemblies 200–499 bp | 24,999,285 | n.d. | 9,293,098,676 |
Megahit assemblies 500–1,999 bp | 16,103,221 | n.d. | 13,382,057,993 |
Megahit assemblies ≥2 kb | 1,517,360 | 4,658 | 6,691,877,664 |
Megahit assemblies ≥2 kb (post-CD-HIT-EST) | 1,126,975 | 4,520 | 4,894,479,496 |
Minimus2 contigs | 158,414 | 15,394 | 1,727,079,865 |
Minimus2 + unincorporated Megahit contigs ≥2 kb (data-rich-contigs) | 660,937 | 5,466 | 3,612,405,904 |
Minimus2 + unincorporated Megahit contigs ≥7.5 kb (binned-contigs) | 95,506 | 20,556 | 1,725,063,313 |
Notes.
N50—length of DNA sequence above which 50% of the total is contained.