Table 2. Genome assembly and annotation statistics for S. glanis.
| Genome assembly<sup>a</sup> | |
|---|---|
| Contig statistics | |
| Number of contigs | 105,816 |
| Total contig size (bp) | 712,999,588 |
| Contig N50 size (bp) | 13,869 |
| Largest contig (bp) | 140,841 |
| Scaffold statistics | |
| Number of scaffolds | 25,703 |
| Total scaffold size (bp) | 793,358,859 |
| Scaffold N50 size (bp) | 3,169,562 |
| Largest scaffold (bp) | 13,715,129 |
| GC content (%) | 39.2 |
| Unknown base (%) | 10.1 |
| BUSCO genome completeness | |
| Complete | 3,859 (84.2%) |
| Complete and single copy | 3,717 (81.1%) |
| Complete and duplicated | 142 (3.1%) |
| Fragmented | 312 (6.8%) |
| Missing | 413 (9.0%) |
| Annotation | |
| Number of protein-coding genes | 21,316 |
| with partial EST support | 10,260 |
| with >90% EST support | 4,989 |
| with full-length EST support | 3,795 |
| with >100 RNA-seq reads aligned | 17,330 |
| with >10 RNA-seq reads aligned | 19,855 |
| Number of functionally annotated proteins | 20,532 |
| Mean protein length (interquartile range, aa) | 501 (218-617) |
| Longest protein (aa) | 27,306 (titin-like) |
| Average number of exons per gene (mean length, interquartile range) | 9 (212, 89-194 bp) |
| Average number of introns per gene (mean length, interquartile range) | 8 (1,208, 133-1,274 bp) |
| BUSCO completeness of the predicted gene models | |
| Complete | 3,427 (74.8%) |
| Complete and single copy | 3,248 (70.9%) |
| Complete and duplicated | 179 (3.9%) |
| Fragmented | 403 (8.8%) |
| Missing | 754 (16.4%) |
<sup>a</sup> Minimum scaffold length: 1 kb.
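
Several percentages in the table can be re-derived from its raw counts, which is a useful sanity check when reusing these figures. The sketch below is illustrative only and rests on two assumptions not stated in the table: that the unknown bases correspond to the difference between total scaffold size and total contig size, and that the BUSCO set size equals the sum of the complete, fragmented, and missing counts. The helper name `busco_percentages` is hypothetical.

```python
# Values copied from Table 2.
total_contig_bp = 712_999_588
total_scaffold_bp = 793_358_859

# Assumption: unknown bases (N) are the scaffold bases not covered by contigs.
unknown_pct = 100 * (total_scaffold_bp - total_contig_bp) / total_scaffold_bp


def busco_percentages(complete, fragmented, missing):
    """Return the inferred BUSCO set size and the three percentages (1 decimal)."""
    # Assumption: the three categories partition the full BUSCO set.
    total = complete + fragmented + missing
    pcts = tuple(round(100 * n / total, 1) for n in (complete, fragmented, missing))
    return total, pcts


# Genome-level and gene-model-level BUSCO counts from the table.
genome_total, genome_pct = busco_percentages(3_859, 312, 413)
genes_total, genes_pct = busco_percentages(3_427, 403, 754)

print(f"Unknown bases: {unknown_pct:.1f}%")
print(f"Genome BUSCO (n={genome_total}): {genome_pct}")
print(f"Gene-model BUSCO (n={genes_total}): {genes_pct}")
```

Under these assumptions the recomputed values agree with the table: 10.1% unknown bases, and both BUSCO assessments sum to the same set size of 4,584.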