Table 2.
Contiguity of the genome assembly tracked across versions using four common metrics: Scaffold N50, the length of the shortest scaffold in the smallest set of largest scaffolds that together make up 50% of the assembly; Scaffold L50, the number of scaffolds in that set; Scaffolds, the total number of scaffolds in the full assembly; Size, the approximate number of base pairs in the assembly. Completeness is reported as BUSCO, the percentage of Core Vertebrate Genes (CVG) found complete.
Assembly | Step | Scaffold N50 (bp) | Scaffold L50 | Scaffolds | Size | BUSCO |
---|---|---|---|---|---|---|
v1.1 | SuperNova | 12,629,056 | 37 | 58,149 | 2.0 Gb | 85.5% |
v1.2 | Tigmint | 6,460,730 | 69 | 59,469 | 2.0 Gb | 85.5% |
v1.3 | ARCS | 7,457,274 | 57 | 58,603 | 2.0 Gb | 85.5% |
v1.4 | TGS-GapCloser | 7,468,733 | 57 | 58,603 | 2.0 Gb | 88.0% |
v1.5 | NextPolish | 7,605,248 | 57 | 58,603 | 2.0 Gb | 88.8% |
v1.6 | 3D-DNA | 126,215,344 | 7 | 56,114 | 2.0 Gb | 88.9% |
v1.7 | Redundancy-filter | 134,006,883 | 6 | 32,127 | 1.9 Gb | 88.7% |
v1.8-v2.1 | +10kb cutoff | 134,006,883 | 6 | 1,823 | 1.8 Gb | 88.3% |
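
The Scaffold N50 and L50 definitions in the caption can be illustrated with a short sketch. This is a minimal example, not the tool used to produce the table: the function name `n50_l50` and the toy scaffold lengths are hypothetical, and it assumes the convention that the 50% threshold counts as reached when the running total is greater than or equal to half the assembly size.

```python
from typing import Iterable, Tuple


def n50_l50(lengths: Iterable[int]) -> Tuple[int, int]:
    """Return (N50, L50) for a collection of scaffold lengths in bp.

    Hypothetical helper for illustration only. Scaffolds are sorted from
    longest to shortest and accumulated until at least half of the total
    assembly size is covered.
    """
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("at least one scaffold length is required")

    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half_total:
            # length: shortest scaffold in the covering set (N50)
            # count: number of scaffolds in the covering set (L50)
            return length, count

    # Unreachable for non-empty input: the full sum always covers half.
    raise RuntimeError("threshold not reached")


if __name__ == "__main__":
    # Toy lengths, not taken from the assembly in Table 2.
    toy_lengths = [80, 70, 50, 40, 30, 20, 10]
    n50, l50 = n50_l50(toy_lengths)
    print(f"N50 = {n50} bp, L50 = {l50}")
```

For the toy lengths above the total is 300 bp, so the two longest scaffolds (80 + 70 = 150 bp) already cover half of it, giving an N50 of 70 bp and an L50 of 2. A higher N50 with a lower L50 indicates a more contiguous assembly, which is the pattern seen in the table after the 3D-DNA step.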