Table 1.
Epithemia genome assembly statistics
| E. clementina | E. pelagica | |
|---|---|---|
| Genome size (bp) | 418,007,894 | 60,195,788 |
| GC | 44.3% | 48.19% |
| QV | 38.52 | – |
| Contig/chromosome # | 642 | 15 |
| N50 | 1,108,441 | – |
| L90 | 412 | – |
| Gene # | 26,453 | 20,203 |
| Repeat % | 80% | 27.36% |
| BUSCOgenome | 100% | 100% |
| BUSCOprotein | 99% | 94% |
| Diazoplast genome size (bp) | 3,072,807 | 2,483,960 |
| Diazoplast gene # | 1,910 | 1,679 |
Summary of assembly statistics for E. clementina and, where applicable, E. pelagica. Quality value (QV) represents a log-scaled estimate of the base accuracy across the genome, where a QV of 40 is 99.99% accurate. N50 and L90 are measures of genome contiguity. N50 represents the contig length (bp) such that 50% of the genome is contained in contigs ≥N50. L90 represents the minimum number of contigs required to contain 90% of the genome. Finally, BUSCO (Benchmarking of Single Copy Orthologues) is an estimate of completeness of the genome (BUSCOgenome) and proteome (BUSCOprotein) of E. clementina and E. pelagica. “–” indicates no statistic.