2022 Feb 15;18(2):e1010288. doi: 10.1371/journal.ppat.1010288

Table 2. Features of the gene and protein sets for S. haematobium V3, V2 and other key schistosome species.

| Feature | S. haematobium V3 | S. haematobium V2 | S. mansoniᶜ | S. bovisᶜ | S. japonicumᶜ |
|---|---|---|---|---|---|
| Number of genes/mRNA | 9431/14,700 | 9314/9314 | 10,172/14,499 | 11,576/11,576 | 10,089/16,936 |
| Gene lengthᵃ | 23,252 ± 25,748 | 18,333 ± 20,681 | 21,682 ± 24,112 | 12,618 ± 16,045 | 18,366 ± 21,336 |
| mRNA length | 3892 ± 3651 | 2195 ± 1978 | 2794 ± 2266 | 1458 ± 1501 | 2578 ± 2068 |
| Coding domain length | 1600 ± 1659 | 2004 ± 1881 | 1775 ± 1895 | 1458 ± 1501 | 1537 ± 1498 |
| Exon length | 487 ± 1118 | 263 ± 343 | 320 ± 468 | 259 ± 314 | 333 ± 540 |
| Protein length | 532 ± 553 | 666 ± 625 | 591 ± 632 | 485 ± 500 | 512 ± 499 |
| Number of 5’ UTRs | 12,563 | 3097 | 14,157 | n/aᵈ | 12,421 |
| Number of 3’ UTRs | 12,888 | 2935 | 14,171 | n/a | 12,503 |
| Complete BUSCOsᵇ | 736 (77.1%) | 639 (67.0%) | 752 (78.8%) | 577 (60.5%) | 688 (72.1%) |
| Complete and single-copy BUSCOs | 582 (61.0%) | 628 (65.8%) | 607 (63.6%) | 548 (57.4%) | 386 (40.5%) |
| Complete and duplicated BUSCOs | 154 (16.1%) | 11 (1.2%) | 145 (15.2%) | 29 (3.0%) | 302 (31.7%) |
| Fragmented BUSCOs | 26 (2.7%) | 53 (5.6%) | 24 (2.5%) | 114 (11.9%) | 43 (4.5%) |
| Missing BUSCOs | 192 (20.1%) | 262 (27.5%) | 178 (18.7%) | 263 (27.6%) | 223 (23.4%) |

a Lengths presented as mean ± standard deviation.

b Number of Benchmarking Universal Single-Copy Orthologs (BUSCOs) identified (protein mode), and percentage of the 954 genes for the Metazoa data set.

c NCBI accession numbers: PRJEA36577, PRJNA520774 and PRJNA451066. Data sets were obtained from WormBase Parasite (release WBPS15).

d n/a: not available.
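The BUSCO rows follow simple arithmetic: per footnote b, each percentage is a count divided by the 954 genes of the Metazoa data set, and the complete count is the sum of the single-copy and duplicated counts. The sketch below illustrates that check using the counts from Table 2. It is not part of the original analysis, and the names `busco_counts` and `busco_summary` are hypothetical, introduced here for illustration only.

```python
# Size of the Metazoa BUSCO data set, as stated in footnote b.
TOTAL_BUSCOS = 954

# Counts copied from Table 2, in the order:
# (single-copy, duplicated, fragmented, missing).
busco_counts = {
    "S. haematobium V3": (582, 154, 26, 192),
    "S. haematobium V2": (628, 11, 53, 262),
    "S. mansoni": (607, 145, 24, 178),
    "S. bovis": (548, 29, 114, 263),
    "S. japonicum": (386, 302, 43, 223),
}


def busco_summary(single, duplicated, fragmented, missing, total=TOTAL_BUSCOS):
    """Return the complete count and each category as a percentage of total."""
    # The four categories partition the data set, so they must sum to total.
    if single + duplicated + fragmented + missing != total:
        raise ValueError("category counts do not sum to the data set size")

    def pct(n):
        # Percentage of the data set, rounded to one decimal as in the table.
        return round(100 * n / total, 1)

    complete = single + duplicated
    return {
        "complete": complete,
        "complete_pct": pct(complete),
        "single_pct": pct(single),
        "duplicated_pct": pct(duplicated),
        "fragmented_pct": pct(fragmented),
        "missing_pct": pct(missing),
    }


if __name__ == "__main__":
    for species, counts in busco_counts.items():
        print(species, busco_summary(*counts))
```

Running the function on each column gives a quick consistency check when transcribing or comparing the table against other annotation releases.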