Table 2. Features of the gene and protein sets for S. haematobium V3, V2 and other key schistosome species.
Feature | S. haematobium V3 | S. haematobium V2 | S. mansonic | S. bovisc | S. japonicumc |
---|---|---|---|---|---|
Number of genes/mRNA | 9431/14,700 | 9314/9314 | 10,172/14,499 | 11,576/11,576 | 10,089/16,936 |
Gene lengtha | 23,252 ± 25,748 | 18,333 ± 20,681 | 21,682 ± 24,112 | 12,618 ± 16,045 | 18,366 ± 21,336 |
mRNA length | 3892 ± 3651 | 2195 ± 1978 | 2794 ± 2266 | 1458 ± 1501 | 2578 ± 2068 |
Coding domain length | 1600 ± 1659 | 2004 ± 1881 | 1775 ± 1895 | 1458 ± 1501 | 1537 ± 1498 |
Exon length | 487 ± 1118 | 263 ± 343 | 320 ± 468 | 259 ± 314 | 333 ± 540 |
Protein length | 532 ± 553 | 666 ± 625 | 591 ± 632 | 485 ± 500 | 512 ± 499 |
Number of 5’ UTRs | 12,563 | 3097 | 14,157 | n/ad | 12,421 |
Number of 3’ UTRs | 12,888 | 2935 | 14,171 | n/a | 12,503 |
Complete BUSCOsb | 736 (77.1%) | 639 (67.0%) | 752 (78.8%) | 577 (60.5%) | 688 (72.1%) |
Complete and single-copy BUSCOs | 582 (61.0%) | 628 (65.8%) | 607 (63.6%) | 548 (57.4%) | 386 (40.5%) |
Complete and duplicated BUSCOs | 154 (16.1%) | 11 (1.2%) | 145 (15.2%) | 29 (3.0%) | 302 (31.7%) |
Fragmented BUSCOs | 26 (2.7%) | 53 (5.6%) | 24 (2.5%) | 114 (11.9%) | 43 (4.5%) |
Missing BUSCOs | 192 (20.1%) | 262 (27.5%) | 178 (18.7%) | 263 (27.6%) | 223 (23.4%) |
a Lengths presented as mean ± standard deviation.
b Number of Benchmarking Universal Single-Copy Orthologs (BUSCOs) identified (protein mode), and percentage of the 954 genes for the Metazoa data set.
c NCBI accession numbers: PRJEA36577, PRJNA520774 and PRJNA451066. Data sets were obtained from WormBase Parasite (release WBPS15).
d not available.