Skip to main content
. 2020 Sep 11;10(11):3897–3906. doi: 10.1534/g3.120.401711

Table 2. The S. glanis genome assembly and annotation statistics.

Genome assemblya
Contig statistics
Number of contigs 105,816
Total contig size (bp) 712,999,588
Contig N50 size (bp) 13,869
Largest contig (bp) 140,841
Scaffold statistics
Number of scaffolds 25,703
Total scaffold size (bp) 793,358,859
Scaffold N50 size (bp) 3,169,562
Largest scaffold (bp) 13,715,129
GC content (%) 39.2
Unknown base (%) 10.1
BUSCO genome completeness
Complete 3,859 (84.2%)
Complete and single copy 3,717 (81.1%)
Complete and duplicated 142 (3.1%)
Fragmented 312 (6.8%)
Missing 413 (9.0%)
Annotation
Number of protein-coding genes 21,316
with partial EST support 10,260
with > 90% EST support 4,989
with full length EST support 3,795
with > 100 RNAseq reads aligned 17,330
with > 10 RNAseq reads aligned 19,855
Number of functionally-annotated proteins 20,532
Mean protein length (interquartile range, aa) 501 (218-617)
Longest protein (aa) 27,306 (titin-like)
Average number of exons per gene (mean length, interquartile range) 9 (212, 89-194 bp)
Average number of introns per gene (length, interquartile range) 8 (1,208, 133-1,274 bp)
BUSCO completeness of the predicted gene models
Complete 3,427 (74.8%)
Complete and single copy 3,248 (70.9%)
Complete and duplicated 179 (3.9%)
Fragmented 403 (8.8%)
Missing 754 (16.4%)
a

Minimum scaffold length: 1 Kb.