Abstract
Infection with Schistosoma parasitic flatworms ( Schistosoma haematobium, Schistosoma mansoni and Schistosoma japonicum ) causes the neglected tropical disease schistosomiasis. There is a need to identify new chemotherapies to treat these parasites, and G-protein coupled receptors (GPCRs) are a logical druggable targets to explore given they control key aspects of schistosome biology such as neuromuscular function and reproduction. Updated chromosome level genome assemblies for each of the three major species have recently been released. However, studies on these GPCRs require accurate, updated genome annotations. Here, we have re-annotated the GPCRs present in each of the three major schistosome species.
Figure 1. GPCRs of all three major schistosome species.
A. Number of GPCRs by class and subclass for all three major schistosome species. For S. mansoni , annotation of GPCRs identified in prior analysis of the v5 genome assembly (Hahnel et al., 2018) is compared to the annotation of the v9 genome in this study. B. Phylogenetic tree of schistosome Class A GPCRs. Sequences were aligned with MUSCLE and TrimAI was used to remove poorly aligned sequences and degap the alignment. IQ-TREE was then used to generate a maximum likelihood phylogeny (1000 bootstrap replicates). ★ = GPCR has been experimentally deorphanized.
Description
Schistosomes are parasitic blood flukes that cause the neglected tropical disease schistosomiasis. Approximately 140 million people are estimated to be infected each year with the three major species causing human disease ( S. haematobium , S. mansoni and S. japonicum )(GBD 2019 Diseases and Injuries Collaborators, 2020). Expanded administration of praziquantel chemotherapy over the past decade has shown promising reductions in the prevalence of infections (Kokaliaris et al., 2022). However, since praziquantel monotherapy is the only treatment currently on the market, there is a need to identify additional antischistosomal targets and chemotherapies.
G-protein coupled receptors (GPCRs) are membrane receptors that signal through various second messengers or accessory proteins (ex. Gβγ or β-arrestin) to control a wide range of cellular processes. These receptors are involved in the mechanism of action for over 1/3rd of all FDA approved drugs (Hauser et al., 2017) and play a key role in numerous aspects of flatworm biology that could conceivably be subverted by anthelmintics. For example, GPCRs are involved in parasite neuromuscular function (reviewed in Ribeiro et al., 2012), reproduction (reviewed in Hahnel et al., 2018), and chemosensation (reviewed in Wheeler et al., 2022).
Several prior studies annotated the complement of GPCRs in different schistosomes as draft genomes for these species were released (Berriman et al., 2009; Campos et al., 2014; Hahnel et al., 2018; Zamanian et al., 2011). Recent advances in long read sequencing technologies have greatly improved genome assemblies from the initial versions published over a decade ago. Chromosome level assemblies have now been released for all three major species ( S. mansoni (Buddenborg et al., 2021), S. haematobium (Stroehlein et al., 2022) and S. japonicum (Luo et al., 2022)).
Updated annotations are necessary as these genome assemblies have become more complete and the gene models have improved. S. mansoni is the most commonly studied schistosome species in the laboratory and its genome assembly (PRJEA36577) has received the most efforts in iterative improvements over the years. The initial assembly reported 92 GPCRs (Berriman et al., 2009), and the most recent annotation effort counted 115 members of the family (Hahnel et al., 2018). However, that effort was working off of version 5 of the genome, and the current release of the S. mansoni chromosome level assembly is version 9 (Buddenborg et al., 2021). Similarly, annotation of GPCRs in the S. heamatobium genome utilized the initial draft genome published a decade ago (Campos et al., 2014; Young et al., 2012). Prior annotations have been valuable in facilitating experimental work on this gene family, but they likely represent incomplete estimates of the GPCRs present in these species.
We set out to generate a new phylogenetic analysis of GPCRs using the updated genome assemblies for S. mansoni (bioproject PRJEA36577, (Buddenborg et al., 2021)), S. japonicum (bioproject PRJNA739049, (Luo et al., 2022)) and S. haematobium (bioproject PRJNA78265, (Stroehlein et al., 2022)). Prior GPCR annotation studies have been published, and of these reference (Hahnel et al., 2018) is the most up to date and so protein sequences for this dataset (termed ‘ S. mansoni v5 GPCRs’) were used to search proteomes for all three species. Two approaches were taken to perform this initial search. First, S. mansoni v5 GPCRs were used as a query to perform a BLASTp search against the predicted proteomes for each of the new chromosome-level assemblies using a liberal cut off (E value < 10). Second, HMMER3 (v3.3.2) was used to search each of these proteomes using profiles built from S. mansoni v5 GPCRs annotated in (Hahnel et al., 2018).
These candidates were then triaged based on expected number of transmembrane domains (TMD). Topology prediction was performed with DeepTMHMM (Hallgren et al., 2022). While GPCRs have 7 TMD, sequences with between 4-10 TMD were retained to allow flexibility with potential errors in gene models. This yielded 183, 168 and 174 sequences for S. mansoni, S. japonicum and S. haematobium , respectively.
Finally, GPCR sequences were retained if they met one of the following two criteria. First, a BLASTp search was performed using sequences with 4-10 TMD as a query against the old S. mansoni v5 proteome. If the top hit for this backblast was annotated as a GPCR in (Hahnel et al., 2018) it was retained. Second, a conserved domain search was performed on the sequences with 4-10 TMD. Those sequences containing domains that correspond to GPCRs (for example, PFAM GPCR clan CL0192) were retained. The final count of S. mansoni GPCRs was 126, up from 115 reported in (Hahnel et al., 2018). However, the actual number of new S. mansoni GPCR sequences identified is 17. The reason for this is that several gene IDs in the S. mansoni v5 assembly used in (Hahnel et al., 2018) have been removed from the current S. mansoni assembly. S. haematobium contained 122 GPCRs, up from 79 reported in (Campos et al., 2014). S. japonicum contained 61 GPCRs, which is the first comprehensive GPCR annotation effort for this species.
These GPCR sequences were then classified as shown in Figure 1A. BLASTp search was performed to identify class B, C and F GPCRs, using the annotated S. mansoni v5 GPCRs as a query. This left the class A GPCRs, which were then used to produce the phylogenetic tree shown in Figure 1B. Briefly, amino acid sequences for class A GPCRs of all three species were aligned with MUSCLE, TrimAl (Capella-Gutiérrez et al., 2009) was used to identify poorly aligned sequences which were removed from the dataset, and IQ-TREE (Minh et al., 2020) was then used to generate a maximum likelihood phylogeny (Figure 1). This tree broadly reproduced the topology reported in (Hahnel et al., 2018). When the subclasses of GPCRs previously annotated for S. mansoni in (Hahnel et al., 2018) are considered (opsin, orphan amine, biogenic amine, peptide NPY-F-FF-like, peptide FLPR-like and PROF), we find that new GPCRs were added to each category (see the comparison of old v5 and current v9 GPCR count shown in Table 1). GPCRs that have been experimentally deorphanized are shown with stars on the phylogenetic tree. These are mainly restricted to the biogenic amines, where several receptors have been cloned and recombinantly expressed to validate ligand pairing. This includes a 5-HT7-like serotonin receptor (Chan et al., 2018, 2016; Patocka et al., 2014), a dopamine D2-like receptor (Taman and Ribeiro, 2009), a histamine-responsive receptor (El-Shehabi and Ribeiro, 2010) and a muscarinic acetylcholine receptor (MacDonald et al., 2015). Several Platyhelminth neuropeptide receptors have been functionally expressed ( (Omar et al., 2007; Saberi et al., 2016), , including two from S. mansoni (Weth et al. 2020), supporting the feasibility of flatworm peptidergic GPCR expression. The present dataset will hopefully facilitate further work in this area.
Differing GPCR numbers between species may be due to differences in methodology for genome sequencing, assembly and annotation, or they may reflect real biological differences between African and Asian schistosomes. The complement of S. mansoni and S. haematobium GPCRs are similar (both are African species infecting humans), while the number and distribution of GPCRs varies in S. japonicum (an Asian zoonotic species).
This dataset is intended as a resource to aid investigators studying schistosome GPCRs. However, a limitation is that these predictions are only as reliable as the annotated gene models. For genes that have very low expression in the biological samples being sequenced, there may be scarce transcriptomic data to guild in silico predictions. Conceivably, a GPCR could be important in a limited period during a life cycle stage underrepresented in sequencing data, and these gene models may not be predicted as accurately. It is possible that these annotations are still an incomplete count if there are partial gene models - although efforts were made to retain these sequences with a liberal cutoff of 4 TMD. Experiments on individual receptors will need to verify sequences (for example, by 5’/3’ RACE) to enable functional expression and receptor deorphanization. This resource is a starting point to enable those studies on GPCRs as potential anthelmintic drug targets.
Methods
Predicted proteomes for the chomorosome level genome assemblies of the three species were retrieved as follows. S. mansoni protein sequences were downloaded from wormbase parasite (WBPS17), assembly SM_V9. For genes with multiple predicted transcripts, the longest representative was chosen and others were excluded from the dataset. S. japonicum protein sequences were downloaded from NCBI, bioproject accession number PRJNA739049, isolate jaV3_Hu on July, 2022. S. haematobium protein sequences for bioproject accession number PRJNA78265, Shae.V3 were downloaded from NCBI on July, 2022. Previously annotated S. mansoni v5 GPCR protein sequences were retrieved from the supplemental data in (Hahnel et al., 2018). BLASTp and HMMER (version 3.3.2) queries of these sequences against the predicted proteomes were performed locally. DeepTMHMM (version 1.0.18) was used for prediction of transmembrane topology, accessed online through https://dtu.biolib.com/DeepTMHMM/. Conserved domain search of candidate GPCRs was performed using Batch CD-Search (Lu et al., 2020) to search PFAM domains with an E value cuttoff of 0.01. Candidate GPCRs were considered to contain an annotated GPCR domains if they were found to have one of the following PFAM accession numbers or clan IDs; pfam00001, pfam00002, pfam13853, pfam10292, pfam10316, pfam10317, pfam10318, pfam10327, pfam10320, pfam10321, pfam10322, pfam10323, pfam10324, pfam10325, pfam10328, pfam01534, pfam11710, pfam11970, pfam02101, pfam03383, pfam03125, pfam02118, pfam02076, pfam05296, pfam03402, cl37946, cl28897, pfam01392, pfam00003, pfam01825. For phylogenetic analysis, multiple sequence alignment of the GPCR sequences was performed using MUSCLE and alignments were trimmed using TrimAl (Capella-Gutiérrez et al., 2009). Sequences that did not meet the criteria of 0.7 residue overlap and 70% sequence overlap were removed from the alignment, which was then realigned and trimmed in TrimAl with a 40% gap threshold. The resulting multiple-sequence alignment was then inputted into IQ-TREE (version 2.2.0; Minh et al., 2020), which constructed the maximum likelihood tree using standard model selection and 1000 bootstrap replicates. The phylogenetic tree was visualized using iTOL v6 (Letunic and Bork, 2021).
Extended Data
Description: FASTA file of GPCR sequences shown in Figure 1. Resource Type: Dataset. DOI: 10.22002/19xvm-8d568
Acknowledgments
Funding
This work was supported by funding from NIH-NIAID R21AI146540 and R21AI153545
References
- Berriman M, Haas BJ, LoVerde PT, Wilson RA, Dillon GP, Cerqueira GC, Mashiyama ST, Al-Lazikani B, Andrade LF, Ashton PD, Aslett MA, Bartholomeu DC, Blandin G, Caffrey CR, Coghlan A, Coulson R, Day TA, Delcher A, DeMarco R, Djikeng A, Eyre T, Gamble JA, Ghedin E, Gu Y, Hertz-Fowler C, Hirai H, Hirai Y, Houston R, Ivens A, Johnston DA, Lacerda D, Macedo CD, McVeigh P, Ning Z, Oliveira G, Overington JP, Parkhill J, Pertea M, Pierce RJ, Protasio AV, Quail MA, Rajandream MA, Rogers J, Sajid M, Salzberg SL, Stanke M, Tivey AR, White O, Williams DL, Wortman J, Wu W, Zamanian M, Zerlotini A, Fraser-Liggett CM, Barrell BG, El-Sayed NM. The genome of the blood fluke Schistosoma mansoni. Nature. 2009 Jul 16;460(7253):352–358. doi: 10.1038/nature08160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Buddenborg Sarah K, Tracey Alan, Berger Duncan J, Lu Zhigang, Doyle Stephen R, Fu Beiyuan, Yang Fengtang, Reid Adam J, Rodgers Faye H, Rinaldi Gabriel, Sankaranarayanan Geetha, Böhme Ulrike, Holroyd Nancy, Berriman Matthew. 2021 Aug 13; doi: 10.1101/2021.08.13.456314. [DOI] [Google Scholar]
- Campos TD, Young ND, Korhonen PK, Hall RS, Mangiola S, Lonie A, Gasser RB. Identification of G protein-coupled receptors in Schistosoma haematobium and S. mansoni by comparative genomics. Parasit Vectors. 2014 May 27;7:242–242. doi: 10.1186/1756-3305-7-242. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009 Jun 8;25(15):1972–1973. doi: 10.1093/bioinformatics/btp348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chan JD, Day TA, Marchant JS. Coalescing beneficial host and deleterious antiparasitic actions as an antischistosomal strategy. Elife. 2018 Jul 30;7 doi: 10.7554/eLife.35755. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chan JD, McCorvy JD, Acharya S, Johns ME, Day TA, Roth BL, Marchant JS. A Miniaturized Screen of a Schistosoma mansoni Serotonergic G Protein-Coupled Receptor Identifies Novel Classes of Parasite-Selective Inhibitors. PLoS Pathog. 2016 May 17;12(5):e1005651–e1005651. doi: 10.1371/journal.ppat.1005651. [DOI] [PMC free article] [PubMed] [Google Scholar]
- El-Shehabi F, Ribeiro P. Histamine signalling in Schistosoma mansoni: immunolocalisation and characterisation of a new histamine-responsive receptor (SmGPR-2). Int J Parasitol. 2010 Apr 27;40(12):1395–1406. doi: 10.1016/j.ijpara.2010.04.006. [DOI] [PubMed] [Google Scholar]
- GBD 2019 Diseases and Injuries Collaborators Global burden of 369 diseases and injuries in 204 countries and territories, 1990-2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet. 2020 Oct 17;396(10258):1204–1222. doi: 10.1016/S0140-6736(20)30925-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hahnel S, Wheeler N, Lu Z, Wangwiwatsin A, McVeigh P, Maule A, Berriman M, Day T, Ribeiro P, Grevelding CG. Tissue-specific transcriptome analyses provide new insights into GPCR signalling in adult Schistosoma mansoni. PLoS Pathog. 2018 Jan 18;14(1):e1006718–e1006718. doi: 10.1371/journal.ppat.1006718. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hallgren Jeppe, Tsirigos Konstantinos D., Pedersen Mads Damgaard, Almagro Armenteros José Juan, Marcatili Paolo, Nielsen Henrik, Krogh Anders, Winther Ole. 2022 Apr 10; doi: 10.1101/2022.04.08.487609. [DOI] [Google Scholar]
- Hauser AS, Attwood MM, Rask-Andersen M, Schiöth HB, Gloriam DE. Trends in GPCR drug discovery: new agents, targets and indications. Nat Rev Drug Discov. 2017 Oct 27;16(12):829–842. doi: 10.1038/nrd.2017.178. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kokaliaris C, Garba A, Matuska M, Bronzan RN, Colley DG, Dorkenoo AM, Ekpo UF, Fleming FM, French MD, Kabore A, Mbonigaba JB, Midzi N, Mwinzi PNM, N'Goran EK, Polo MR, Sacko M, Tchuem Tchuenté LA, Tukahebwa EM, Uvon PA, Yang G, Wiesner L, Zhang Y, Utzinger J, Vounatsou P. Effect of preventive chemotherapy with praziquantel on schistosomiasis among school-aged children in sub-Saharan Africa: a spatiotemporal modelling study. Lancet Infect Dis. 2021 Dec 2;22(1):136–149. doi: 10.1016/S1473-3099(21)00090-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Letunic I, Bork P. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 2021 Jul 2;49(W1):W293–W296. doi: 10.1093/nar/gkab301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luo F, Yang W, Yin M, Mo X, Pang Y, Sun C, Zhu B, Zhang W, Yi C, Li Z, Wang J, Xu B, Feng Z, Huang Y, Lu Y, Hu W. A chromosome-level genome of the human blood fluke Schistosoma japonicum identifies the genomic basis of host-switching. Cell Rep. 2022 Apr 5;39(1):110638–110638. doi: 10.1016/j.celrep.2022.110638. [DOI] [PubMed] [Google Scholar]
- Lu S, Wang J, Chitsaz F, Derbyshire MK, Geer RC, Gonzales NR, Gwadz M, Hurwitz DI, Marchler GH, Song JS, Thanki N, Yamashita RA, Yang M, Zhang D, Zheng C, Lanczycki CJ, Marchler-Bauer A. CDD/SPARCLE: the conserved domain database in 2020. Nucleic Acids Res. 2020 Jan 8;48(D1):D265–D268. doi: 10.1093/nar/gkz991. [DOI] [PMC free article] [PubMed] [Google Scholar]
- MacDonald K, Kimber MJ, Day TA, Ribeiro P. A constitutively active G protein-coupled acetylcholine receptor regulates motility of larval Schistosoma mansoni. Mol Biochem Parasitol. 2015 Sep 10;202(1):29–37. doi: 10.1016/j.molbiopara.2015.09.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, von Haeseler A, Lanfear R. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol Biol Evol. 2020 May 1;37(5):1530–1534. doi: 10.1093/molbev/msaa015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Omar HH, Humphries JE, Larsen MJ, Kubiak TM, Geary TG, Maule AG, Kimber MJ, Day TA. Identification of a platyhelminth neuropeptide receptor. Int J Parasitol. 2007 Jan 26;37(7):725–733. doi: 10.1016/j.ijpara.2006.12.019. [DOI] [PubMed] [Google Scholar]
- Patocka N, Sharma N, Rashid M, Ribeiro P. Serotonin signaling in Schistosoma mansoni: a serotonin-activated G protein-coupled receptor controls parasite movement. PLoS Pathog. 2014 Jan 16;10(1):e1003878–e1003878. doi: 10.1371/journal.ppat.1003878. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ribeiro P, Gupta V, El-Sakkary N. Biogenic amines and the control of neuromuscular signaling in schistosomes. Invert Neurosci. 2012 Apr 18;12(1):13–28. doi: 10.1007/s10158-012-0132-y. [DOI] [PubMed] [Google Scholar]
- Saberi A, Jamal A, Beets I, Schoofs L, Newmark PA. GPCRs Direct Germline Development and Somatic Gonad Function in Planarians. PLoS Biol. 2016 May 10;14(5):e1002457–e1002457. doi: 10.1371/journal.pbio.1002457. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schistosomiasis (bilharzia). https://www.who.int/health-topics/schistosomiasis
- Stroehlein AJ, Korhonen PK, Lee VV, Ralph SA, Mentink-Kane M, You H, McManus DP, Tchuenté LT, Stothard JR, Kaur P, Dudchenko O, Aiden EL, Yang B, Yang H, Emery AM, Webster BL, Brindley PJ, Rollinson D, Chang BCH, Gasser RB, Young ND. Chromosome-level genome of Schistosoma haematobium underpins genome-wide explorations of molecular variation. PLoS Pathog. 2022 Feb 15;18(2):e1010288–e1010288. doi: 10.1371/journal.ppat.1010288. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Taman A, Ribeiro P. Investigation of a dopamine receptor in Schistosoma mansoni: functional studies and immunolocalization. Mol Biochem Parasitol. 2009 Jun 21;168(1):24–33. doi: 10.1016/j.molbiopara.2009.06.003. [DOI] [PubMed] [Google Scholar]
- Weth O, Haeberlein S, Haimann M, Zhang Y, Grevelding CG. Towards deorphanizing G protein-coupled receptors of Schistosoma mansoni using the MALAR yeast two-hybrid system. Parasitology. 2019 Dec 16;147(8):865–872. doi: 10.1017/S0031182019001756. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wheeler NJ, Hallem EA, Zamanian M. Making sense of sensory behaviors in vector-borne helminths. Trends Parasitol. 2022 Aug 2;38(10):841–853. doi: 10.1016/j.pt.2022.07.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Young ND, Jex AR, Li B, Liu S, Yang L, Xiong Z, Li Y, Cantacessi C, Hall RS, Xu X, Chen F, Wu X, Zerlotini A, Oliveira G, Hofmann A, Zhang G, Fang X, Kang Y, Campbell BE, Loukas A, Ranganathan S, Rollinson D, Rinaldi G, Brindley PJ, Yang H, Wang J, Wang J, Gasser RB. Whole-genome sequence of Schistosoma haematobium. Nat Genet. 2012 Jan 15;44(2):221–225. doi: 10.1038/ng.1065. [DOI] [PubMed] [Google Scholar]
- Zamanian M, Kimber MJ, McVeigh P, Carlson SA, Maule AG, Day TA. The repertoire of G protein-coupled receptors in the human parasite Schistosoma mansoni and the model organism Schmidtea mediterranea. BMC Genomics. 2011 Dec 6;12:596–596. doi: 10.1186/1471-2164-12-596. [DOI] [PMC free article] [PubMed] [Google Scholar]

