Table 2.
Assembler | Sequencing data | No. contigs | Total length | No. contigs (≥1 kbp) | Total length (≥1 kbp) | Largest contig | GC (%) | N50 |
---|---|---|---|---|---|---|---|---|
CA | Illumina MiSeq | 3,785 | 86,045,090 | 3,670 | 85,991,832 | 362,075 | 30.7 | 49,810 |
MaSurCA | Illumina MiSeq | 27,180 | 96,959,781 | 16,645 | 90,159,179 | 129,528 | 30.8 | 10,719 |
CLCBio | Illumina MiSeq | 72,775 | 104,010,594 | 12,719 | 78,215,812 | 115,234 | 29.3 | 7,816 |
CA | Illumina-corrected PacBio reads | 4,436 | 94,016,717 | 4,436 | 94,016,717 | 901,957 | 31.0 | 61,260 |
CA | Illumina- and 454-corrected PacBio reads | 4,222 | 100,293,095 | 4,222 | 100,293,095 | 905,793 | 30.8 | 89,633 |
HGAP2 | PacBio reads | 2,601 | 102,405,157 | 2,601 | 102,405,157 | 1,570,895 | 30.7 | 163,655 |
HGAP2 with trimming | PacBio reads | 2,250 | 96,410,008 | 2,183 | 96,369,305 | 1,570,872 | 30.8 | 180,288 |
This table provides the statistics about the contigs generated by various combinations of assemblers and data sets. The statistics reported include the number of contigs, total length of sequence, number of contigs ≥ 1 kbp, the total length of contigs ≥ 1 kbp, the size of the largest contig, the percent GC, and the N50.