2023 Aug 31;24:327. doi: 10.1186/s12859-023-05449-z

Table 6.

Genomes annotated de novo with GALBA, using the reference protein sets listed in Additional file 1: Table S1, as use cases that demonstrate the applicability of GALBA

| Species | Assembly | Size (Gbp) | nSeqs | N50 (nt) | BUSCO C (%) | RM (%) |
|---|---|---|---|---|---|---|
| Vespula vulgaris | GCA_014466185.1 | 0.18 | 35 | 8,304,510 | 94.9 | 19.5 |
| Vespula germanica | GCA_014466195.1 | 0.18 | 133 | 8,396,154 | 93.6 | 19.9 |
| Vespula pensylvanica | GCA_014466175.1 | 0.18 | 225 | 8,532,720 | 96.2 | 19.4 |
| Polistes dominula | GCA_001465965.1 | 0.21 | 1,483 | 1,625,592 | 95.7 | 48.1 |
| Balaenoptera bonaerensis | GCA_000978805.1 | 2.23 | 421,444 | 20,082 | 54.1 | 34.0 |
| Eubalaena japonica | GCA_004363455.1 | 2.69 | 1,353,963 | 39,813 | 74.9 | 43.3 |
| Inia geoffrensis | GCA_004363515.1 | 2.60 | 1,213,610 | 26,707 | 67.7 | 43.8 |
| Kogia breviceps | GCA_004363705.1 | 2.76 | 1,252,072 | 28,812 | 66.1 | 41.3 |
| Phocoena phocoena | GCA_004363495.1 | 2.70 | 1,331,158 | 115,969 | 85.9 | 44.7 |
| Platanista gangetica | GCA_004363435.1 | 2.67 | 1,098,790 | 23,933 | 59.1 | 44.7 |
| Ziphius cavirostris | GCA_004364475.1 | 3.15 | 3,758,276 | 3,608 | 39.9 | 45.1 |
| Coix aquatica | GCA_009725075.1 | 1.62 | 2,012 | 148,397,812 | 97.8 | 83.3 |

nSeqs: number of sequences in the assembly; BUSCO C: percentage of BUSCOs detected as complete; RM: percentage of repeat-masked nucleotides in the assembly
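
The legend does not define the N50 column. Under the usual convention, N50 is the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total assembly length. The following is a minimal Python sketch under that assumption; the function name `n50` and the toy lengths are illustrative and are not taken from the paper or from GALBA.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths (in nt).

    Assumes the common definition: sort lengths from longest to shortest
    and report the length at which the running sum first reaches at
    least half of the total assembly length.
    """
    if not lengths:
        raise ValueError("need at least one sequence length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical toy assembly: total length 300 nt, so half is 150 nt.
toy_lengths = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_lengths))  # 80 + 70 = 150 reaches half, so this prints 70
```

Read this way, the table's contrast between a small nSeqs with a large N50 (for example Coix aquatica) and a very large nSeqs with a small N50 (for example Ziphius cavirostris) reflects how contiguous or fragmented each assembly is.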