2025 Aug 12;122(33):e2507237122. doi: 10.1073/pnas.2507237122

Table 1.

Epithemia genome assembly statistics

Statistic                    E. clementina   E. pelagica
Genome size (bp)             418,007,894     60,195,788
GC                           44.3%           48.19%
QV                           38.52           –
Contig/chromosome #          642             15
N50 (bp)                     1,108,441       –
L90                          412             –
Gene #                       26,453          20,203
Repeat %                     80%             27.36%
BUSCOgenome                  100%            100%
BUSCOprotein                 99%             94%
Diazoplast genome size (bp)  3,072,807       2,483,960
Diazoplast gene #            1,910           1,679

Summary of assembly statistics for E. clementina and, where applicable, E. pelagica. Quality value (QV) is a log-scaled estimate of base accuracy across the genome; a QV of 40 corresponds to 99.99% accuracy. N50 and L90 measure genome contiguity: N50 is the contig length (bp) such that 50% of the genome is contained in contigs ≥N50, and L90 is the minimum number of contigs required to contain 90% of the genome. BUSCO (Benchmarking Universal Single-Copy Orthologs) estimates the completeness of the genome (BUSCOgenome) and proteome (BUSCOprotein) of E. clementina and E. pelagica. “–” indicates no statistic.
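
The three definitions in the caption can be illustrated with a minimal Python sketch. It assumes QV follows the usual Phred convention (QV = -10 log10 of the per-base error probability), which is consistent with QV 40 corresponding to 99.99% accuracy. The function names and the toy contig lengths are hypothetical, chosen only for illustration; they are not the assembly data or the tools behind Table 1.

```python
def qv_to_accuracy(qv):
    """Per-base accuracy (as a fraction) for a Phred-scaled quality value.

    Assumes QV = -10 * log10(error probability).
    """
    return 1.0 - 10.0 ** (-qv / 10.0)


def n50(lengths):
    """Contig length L such that contigs of length >= L hold >= 50% of the assembly."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:  # integer form of running >= 0.5 * total
            return length
    raise ValueError("no contigs given")


def l90(lengths):
    """Minimum number of contigs, longest first, needed to hold >= 90% of the assembly."""
    total = sum(lengths)
    running = 0
    for count, length in enumerate(sorted(lengths, reverse=True), start=1):
        running += length
        if 10 * running >= 9 * total:  # integer form of running >= 0.9 * total
            return count
    raise ValueError("no contigs given")


# Hypothetical toy assembly of five contigs (150 bp in total).
toy_lengths = [50, 40, 30, 20, 10]
toy_n50 = n50(toy_lengths)   # 50 + 40 = 90 bp first reaches half of 150 bp
toy_l90 = l90(toy_lengths)   # four contigs (140 bp) first reach 135 bp
acc_qv40 = qv_to_accuracy(40)
```

Under the same Phred assumption, the reported QV of 38.52 for E. clementina would correspond to a per-base accuracy of roughly 99.986%.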