Abstract
Schistosomiasis, which infects more than 230 million people, is vectored by freshwater snails. We identified viral sequences in the transcriptomes of Biomphalaria pfeifferi (BP) and Bulinus globosus (BuG), two of the world’s most important schistosomiasis vectors in Africa. Sequences from 26 snails generated using Illumina Hi-Seq or 454 sequencing were assembled using Trinity and CAP3 and putative virus sequences were identified using a bioinformatics pipeline. Phylogenetic analyses were performed using viral RNA-dependent RNA polymerase and coat protein sequences to establish relatedness between virus sequences identified and those of known viruses. Viral sequences were identified from the entire snail holobiont, including symbionts, ingested material and organisms passively associated with the snails. Sequences derived from more than 17 different viruses were found including five near full-length genomes, most of which were small RNA viruses with positive sense RNA genomes (i.e., picorna-like viruses) and some of which are likely derived from adherent or ingested diatoms. Based on phylogenetic analysis, five of these viruses (including BPV2 and BuGV2) along with four Biomphalaria glabrata viruses reported previously, cluster with known invertebrate viruses and are putative viruses of snails. The presence of RNA sequences derived from four of these novel viruses in samples was confirmed. Identification of the genome sequences of candidate snail viruses provides a first step toward characterization of additional gastropod viruses, including from species of biomedical significance.
Keywords: Snail virus, Schistosomiasis vector, Virus discovery, Biomphalaria pfeifferi, Bulinus globosus, Kenya, RNA virus, Diatom/marine virus, Iflavirus, Dicistrovirus
Introduction
Several species of freshwater snail play an integral role in transmission of schistosomiasis, a tropical disease that infects over 230 million people with a further 780 million at risk, primarily in sub-Saharan Africa (Colley et al., 2014). In Africa, snails of the genera Biomphalaria and Bulinus transmit human intestinal and urinary schistosomiasis, respectively. Snails serve as the obligatory hosts of the asexually-reproducing larval stages of schistosomes that can produce hundreds to thousands of human-infective cercariae per day. Some infected snails can continue to release cercariae into the water for several months (Mutuku et al., 2014), posing a significant hurdle for those hoping to prevent new schistosome infections in people.
Biological control provides an alternative approach to target the snail hosts of schistosomiasis, and both snail predators (prawns, Macrobrachium) (Sokolow et al., 2015), and snail pathogens (bacteria, Paenibacillus) (Duval et al., 2015a; Duval et al., 2015b) have potential in this regard. As viruses also have potential for use in the biological control of snails similar to other pestiferous invertebrates (Sosa-Gómez et al., 2020), we sought to characterize the virome of two snail vector species.
In the search for snail viruses, the presence of contaminating viruses from the surrounding environment must be taken into consideration. Viral sequences discovered from seawater (Culley, Lang & Suttle, 2003; Culley, Lang & Suttle, 2006; Culley, Lang & Suttle, 2007; Culley et al., 2014; Culley & Steward, 2007; Lopez-Bueno et al., 2015), freshwater (Bolduc et al., 2012; Djikeng et al., 2009; Mohiuddin & Schellhorn, 2015) and aquatic unicellular organisms (Kimura & Tomaru, 2015; Lang, Culley & Suttle, 2004; Nagasaki et al., 2004; Nagasaki et al., 2005; Takao et al., 2006) have revealed that the vast majority of viruses in aquatic environments are small RNA viruses with positive sense, single stranded RNA genomes (SRV+) similar to the eukaryotic virome (Culley, Lang & Suttle, 2003; Suttle, 2005; Suttle, 2007). These RNA viruses show genomic, structural and protein sequence similarities to insect viruses, including dicistroviruses and iflaviruses (Suttle, 2007). However, the majority of the SRV+ discovered recently from aquatic environments are not well characterized with only a few SRV+ that infect unicellular organisms having been studied (Kimura & Tomaru, 2015; Lang, Culley & Suttle, 2004; Nagasaki et al., 2004; Nagasaki et al., 2005). SRV+ have also been isolated from shrimp (Mari et al., 2002), crabs (Boros et al., 2011), bivalves (Rosani & Gerdol, 2017; Rosani et al., 2019) and partial sequences of virus-derived genes identified from the crop of an invasive snail (Cardoso et al., 2012). Several novel SRV+ from Biomphalaria glabrata were isolated by analyzing available RNA sequence data for this Neotropical snail (Adema et al., 2017; Galinier et al., 2017), and analysis of the transcriptome of B. pfeifferi revealed nearly 1,300 transcripts putatively derived from viruses (Buddenborg et al., 2017).
Here we report use of our bioinformatics pipeline (Liu, Vijayendran & Bonning, 2011), to identify viral sequences in large sequence databases derived from two of the most important schistosomiasis snail vectors in Africa (Biomphalaria pfeifferi and Bulinus globosus) and examined the RNA sequences for evidence of the presence of viruses. We identified sequences derived from multiple viruses that based on phylogenetic analysis, are predicted to infect and replicate in the snail. We also identified sequences from numerous other viruses including four near full length genomes, that are likely infectious to diatoms and other aquatic microorganisms associated with the snail.
Materials and Methods
Collection of snails
Individual snails of two species, Biomphalaria pfeifferi (BP) and Bulinus globosus (BuG), were collected from Kenya. BP snails (1–21, Table 1) were collected from Kasabong Stream, West Kenya (00°9″6.84″S, 34°20′7.80″E) in 2013 and treated as described previously to test snail responses to schistosome infection (Buddenborg et al., 2017). The sample set includes control snails (uninfected with schistosomes) and snails representing various infection treatments. BuG samples (22–24, Table 1) were collected from Mombasa, East Kenya (00°3′54.83″S, 39°31.72″E) in 2013. Two additional BP snails (25–26, Table 1) were collected from Asao Stream, West Kenya (00°19′5.50″S, 35°0′24.99″E) in 2012. Snails were placed individually into 1.5 ml tubes with one ml of TRIzol (Invitrogen, Carlsbad, CA, USA) and stored at −80 °C until RNA extraction. This project was undertaken with approval of Kenya’s National Commission for Science, Technology and Innovation (permit number NACOSTI/P/15/9609/4270), Kenya Wildlife Service (KWS-0045-03-21 and export permit 0004754), and National Environment Management Authority (NEMA/AGR/46/2014 and NEMA/AGR/149/2021).
Table 1. Snail samples from which virus sequences were derived.
Species | Sample | Abbreviation (Buddenborg et al., 2017) | Sequencing method | No. virus-derived contigs |
---|---|---|---|---|
Biomphalaria pfeifferi (Kasabong) |
1–3 | Control-R1 to R3 | Illumina RNA-Seq | 50 |
4–6 | Shedding R1-R3 | 43 | ||
7–9 | 1d-R1 to R3 | 8 | ||
10–12 | 3d-R1 to R3 | 21 | ||
13–15 | BP control × molluscicide | 30 | ||
16–18 | BP infected × molluscicide | 45 | ||
19–21 | – | 9 | ||
Bulinus globosus (Mombasa) |
22–24 | – | 9 | |
B. pfeifferi (Asao) |
25 | – | 454 GS FLX | 0 |
26 | – | 17 |
Extraction of RNA, library preparation and sequencing
Methods for RNA extraction from snail samples (Table 1) and high-through put sequencing were as described previously (Buddenborg et al., 2017). Briefly, snails were homogenized individually in Trizol using plastic pestles (USA Scientific, Ocala, FL, USA), followed by RNA extraction according to the manufacturer’s instructions (Invitrogen, Carlsbad, CA, USA). Genomic DNA contamination was removed with RNase-free DNase I (New England BioLabs, Ipswich, MA, USA) at 37 °C for 10 min. Samples were further purified using the PureLink RNA Mini Kit (Thermo Scientific, Waltham, MA, USA). RNA quality and quantity were evaluated on a Bioanalyzer 2100 (Agilent Technologies, Santa Clara, CA, USA) and Nanodrop ND-1000 (Thermo Scientific, Waltham, MA, USA). Complementary DNA was prepared and Illumina Hi-Seq sequencing conducted at the National Center for Genomic Research in Santa Fe, New Mexico (http://www.ncgr.org). Complementary DNA libraries were paired-end sequenced (2 × 50 base reads) on a HiSeq2000 instrument (Illumina, Carlsbad, CA, USA). A total of 20 to 30 million reads of 50 nucleotides were generated from each individual snail with three sequencing replicates per sample (Buddenborg et al., 2017). Sequence data are available at NCBI under BioProject ID PRJNA383396. RNA-Seq data were also acquired for BP samples 25 and 26 (Table 1) on the 454 GS FLX platform (Roche, Basel Switzerland) at the University of New Mexico Biology Molecular Biology Facility using standard procedures (http://ceti.unm.edu).
The quality of sequencing reads was examined using FastQC v0.11.3 (Andrews http://www.bioinformatics.babraham.ac.uk/projects/fastqc/). When necessary, bases with low quality scores (quality score threshold, q = 30) were trimmed using Fastx-toolkit v0.0.13 (http://hannonlab.cshl.edu/fastx_toolkit/). Viral genome-derived sequences were either assembled from sequence data from individual snails, or from the combined sequence data from three replicate snails (see sample lane, Table 1). The Illumina sequencing reads and 454 sequencing data were assembled using the Trinity assembler (v2.0.6) (Haas et al., 2013). The contigs obtained from the Trinity assembly were then further assembled by using the CAP3 assembler (Huang & Madan, 1999) to merge redundant sequences. The merged contigs (≥300 nt) were used for annotation to identify viral sequences. Virus-derived contigs that encoded >800 aa and represented sequences derived from novel viruses were selected for analysis of viral sequence abundance. To determine relative abundance of the sequencing reads derived from viral sequences, MAQ v0.5.0 (http://maq.sourceforge.net/maq-man.shtml; Command maq map using default settings) was used for mapping of reads to target assembled sequences.
Identification and annotation of viral sequences
To identify putative viral sequences, the assembled contigs were annotated by BLAST search v.2.10.0 (Camacho et al., 2009) against a local viral protein database of viral proteins from RefSeq (Release 93). Contigs that hit viral proteins with a cutoff E-Value of 0.01 were extracted and further annotated by in-house BLAST against the NCBI nr database. The contigs with viral hits were extracted and sequences derived from putative viruses further analyzed: Contigs that hit the same viral group were manually checked using BioEdit v7.2.5 for potential generation of longer sequence fragments.
Confirmation of viral sequences
The presence of assembled viral sequences was confirmed for BPV2, BPV3, BPV4 and BuGV1 by RT-PCR. Complementary DNA was generated from the snail-derived RNA used for sequencing, by using the SuperScript III First-Strand Synthesis System for RT-PCR (Invitrogen, Waltham, MA, USA) according to the manufacturer’s directions. Primer design was based on the assembled viral sequences (Table S1). PCR was conducted using GoTaq DNA polymerase (Promega, Madison, WI, USA). For short read (<1.5 kb) amplification, PCR was conducted with an initial 94 °C for 5 min, followed by 94 °C, 1 min, 62 °C, 1 min, 72 °C, 5 min for 30 cycles, and 72 °C for 10 min. For long read (>1.5 kb) amplification, PCR was conducted with an initial 95 °C for 5 min, followed by 95 °C, 45 s, 56 °C for 45 s, 72 °C 2 min for 35 cycles, and 72 °C for 8 min.
Multiple sequence alignment, conserved domain identification and phylogenetic analysis
Conserved domains of the viral sequences identified were used for phylogenetic analysis. The conserved domains of RNA-dependent RNA polymerase (RdRP) and capsid proteins or putative capsid sequences were identified by BLAST domain search against the NCBI nr database (Marchler-Bauer et al., 2017). Internal ribosome entry sites (IRES) of dicistro-like viruses were predicted by IRESite (http://iresite.org/). Sequences of conserved domains were extracted and used for phylogenetic analysis. The domain sequence was aligned using MAFFT (with MAFFT-auto and default parameters) (Katoh & Standley, 2013). Phylogenetic trees were constructed and generated with IQ-TREE using maximum-likelihood (Nguyen et al., 2015) with 10,000 bootstraps (Hoang et al., 2018). The best partitioning scheme and models were selected by ModelFinder (Kalyaanamoorthy et al., 2017). The phylogenetic trees were also drawn using FigTree (v1.4.2) (Notredame, Higgins & Heringa, 2000). General sequence manipulations (editing and alignment) were performed using BioEdit v7.2.5 (http://www.mbio.ncsu.edu/BioEdit/page2.html).
Results
Viral sequences in the snail transcriptome
The number of contigs derived from putative viral sequences from the snail samples is shown in Table 1. As our primary targets were sequences derived from actual viruses rather than viral sequences integrated into the genome of the snail (primarily retrotransposon sequences), putative endogenous viral element (EVE) sequences identified by BLAST were not included in this analysis. Redundant sequences were removed from putative viral contigs. Five near full-length SRV+ genome sequences (four from BP and one from BuG; Table 2) were assembled from the remaining contigs. In addition, 54 and three non-redundant partial viral genome sequences were also identified from BP and BuG, respectively (Table S2). The vast majority of the viral contigs were derived from SRV+ with most top hits to viruses from aquatic environments, particularly to unicellular photoautotrophs. Other sequences hit insect SRV+, demonstrating considerable diversity in the populations of viruses associated with the snails.
Table 2. Distribution of virus-derived sequences among snail samples.
Sp. | Sample | BPV1a, b | BPV2a | BPV3a | BPV4a | BPVS1 | BPVS6 | BPVS16 | BPVS17 | BPVS33 | BPVS42 | BuGV1a | BuGV2 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
BP | 1–3 | 0 | 44,740/263c | 5,781/35 | 1,495/9 | 900 | 863 | 278 | 903 | 42 | 817 | 0 | 0 |
4–6 | 0 | 12,305/72 | 2,713/16 | 1,670/10 | 463 | 1,282 | 1,666 | 642 | 62 | 123 | 0 | 0 | |
7–9 | 0 | 5,468/32 | 926/6 | 2,544/15 | 223 | 74 | 216 | 265 | 5 | 39 | 0 | 0 | |
10–12 | 0 | 2,107/12 | 571/3 | 175/1 | 189 | 72 | 30 | 203 | 793 | 19 | 0 | 0 | |
13–15 | 0 | 28,951/170 | 2,709/16 | 9,222/56 | 1,202 | 744 | 426 | 1,181 | 113 | 1,452 | 0 | 0 | |
16–18 | 0 | 19,084/112 | 2,008/12 | 5,792/35 | 607 | 2,725 | 740 | 781 | 210 | 209 | 0 | 0 | |
19–21 | 0 | 492/6 (#21) | 218/3 | 0 | 0 | 5,150 | 0 | 0 | 667 | 55 | 0 | 0 | |
BuG | 22–24 | 0 | 0 | 0 | 0 | 0 | 3 | 0 | 0 | 0 | 0 | 24,622/298 | 134/15 |
Notes:
Sample designations are as described in Table 1.
The distribution of virus-derived sequences is shown for five near-complete and seven partial viral genomes. Sequencing reads were mapped to assembled viral sequences using the MAQ assembly program.
Near full-length genome sequence identified.
Isolated from BP454 sequencing data only (not shown here).
Number of reads mapped/fold-coverage of the assembled genome.
Viral sequences identified from B. pfeifferi
A novel small RNA virus discovered from 454 sequencing data
For the two BP snails from Asao, Kenya, 17 contigs derived from putative viral sequences were identified from approximately 3,000 contigs (>300 nt) assembled by Trinity. One near complete small RNA viral genome (named Biomphalaria pfeifferi-associated virus 1, BPV1: GenBank accession number MT108488; Text S1) was recovered. BPV1 appeared to be a novel dicistro-like virus of at least 8,367 nt (Fig. 1). The predicted ORF1 polyprotein of BPV1 showed 30% protein sequence identity with the counterpart predicted polyprotein of Marine RNA virus JP-A (MRNAVJPA) (Culley, Lang & Suttle, 2007), a Marnavirus of Marnaviridae in the order Picornavirales (http://ictvonline.org/virusTaxonomy.asp). The second closest hit was Rhizosolenia setigera RNA virus 01 (RhsRNAV01) (Nagasaki et al., 2004), with 27% protein sequence identify. RhsRNAV01 is the type member of a recently established genus, Bacillarnavirus of Picornavirales. Two conserved domains, RNA-helicase and RdRP (pfam00910, cd01699 and pfam00680) of dicistroviruses were identified in ORF1 (Fig. 1). ORF2 encodes putative coat proteins (CP). Four conserved CP domains of SRV+ were identified: two Rhv-like CP domains (CP, cd00205, pfm00073), dicistro_VP4 (pfam11492) and CrPV_capsid (pfam08762). E-values were consistently very high for these predicted domains. BPV1 sequences were only assembled from sample 25 (Table 1).
Near full-length small RNA virus sequences assembled from BP Illumina sequencing samples
From the 21 BP samples collected from Kasabong Stream, Kenya in 2013, three near complete small RNA viral genome sequences were assembled. The viruses were named Biomphalaria pfeifferi-associated virus 2 (BPV2: GenBank Accession No. MT108489), Biomphalaria pfeifferi-associated virus 3 (BPV3: GenBank Accession No. MT108490) and Biomphalaria pfeifferi-associated virus 4 (BPV4: GenBank Accession No. MT108491), with genomes of at least 8,491, 8,255 and 8,253 nt, respectively (Fig. 1; Text S1). BPV2 and BPV3 are structurally similar to dicistrovirus-like BPV1, while BPV4 encodes a single ORF, in which non-structural and structural proteins are expressed as a single polypeptide prior to proteolytic cleavage into functional viral proteins. BPV4 is similar in genome structure to Marnaviruses (Fig. 2).
The top BLASTp hits of BPV2 ORF1 and ORF2 predicted polyproteins were the ORF1 and ORF2 polyproteins of Antarctic picorna-like virus 1 (APLV1), which was discovered from Antarctic lakes, but for which the host is unknown (Lopez-Bueno et al., 2015). Protein sequence identities of approximately 31% and 34% were shown between ORF1 and ORF2 predicted polyproteins of BPV2 and APLV1, respectively. Conserved domains identified from the ORF1 polyprotein were RNA-helicase, peptidase_C3 (pfam00548) and RdRP (Fig. 1). In the ORF2 polyprotein, two Rh_v CP domains were identified, with one domain overlapping a Calicivirus CP domain (pfam00916) (Fig 1). The VP4 and a conserved C-terminal CP domain (CrPV_capsid) were not found in the ORF2-encoded protein of BPV2.
The top hits for the ORF1 and ORF2 polyproteins of BPV3 were both from Marine RNA virus-JP B (MRNAVJPB) (Culley, Lang & Suttle, 2007), which is classified as a member of Marnavirus. The sequence identities were ~35% between the corresponding ORF polyproteins of BPV3 and MRNAVJPB. ORF2 of BPV3 encodes 942 aa, which is the longest ORF2 of the dicistrovirus-like sequences discovered from snails. Three CP domains were identified in the ORF2 polyprotein of BPV3, Rhv_like CP domain, VP4, and the CrPV_capsid domain, which also contained conserved sequences of an Rhv_like domain. For all three dicistrovirus-like viruses, translation of BPV2 ORF2 is expected to be mediated by an internal ribosomal entry site (IRES) (Bonning & Miller, 2010).
BPV4 is the only SRV+ identified from BP that encodes a single ORF encoding a polyprotein of 2,685 aa. The genomic structure of this virus is similar to the type virus of genus Marnavirus, Heterosigma akashiwo RNA virus (HaRNAV) (Fig. 2) (Lang, Culley & Suttle, 2004). The N-terminus of the ORF encodes non-structural proteins, while the C-terminus encodes structural proteins. The non-structural proteins of BPV4 have homology to the ORF1 polyprotein of Marine RNA virus SF2, an unclassified dicistro-like virus, and the structural proteins hit the ORF2 polyprotein of Chaetoceros sp. RNA virus 2, an unclassified marine planktonic diatom discistro-like virus (Tomaru et al., 2013a; Tomaru et al., 2013b) with 27% and 29% sequence identities, respectively. The single ORF of BPV4 encodes helicase, RdRP and CP domains (Fig. 1).
Partial SRV+ sequences identified from the BP transcriptome
In addition to the four near complete full-length SRV+ genome sequences described above from BP, 44 putative SRV+ contigs were identified, ranging from 203–3,624 nt (Table S2). Among these partial viral sequences, 23 (52%) were derived from regions encoding non-structural polyproteins, while 21 (48%) contigs hit structural polyprotein sequences. These partial sequences were derived from multiple viruses. The regions from which the sequences were derived were determined by BLAST annotation and are illustrated in Fig. 3. Based on overlap of the structural protein sequences (BPVS 6, 16, 17, 18, 33, 42 43 and 44), the 44 contigs represent at least eight novel SRV+. These eight contigs encode 374–900 aa of the putative partial CP sequences, covering more than 30% of the capsid protein sequences. While 23 contigs hit SRV+ non-structural protein coding regions, only one contig (BPVS1, 1,097 aa) covered 60% of the non-structural polyprotein coding sequence (Fig. 3). Although the protein sequences encoded by the virus-derived RNA fragments hit various SRV+, the majority of hits were from genes encoded by marine RNA viruses (Table S2), similar to the other BPV species described above.
Other viral sequences
Nine contigs of 206–2,969 nt had homology to several viruses (Table S2), including BPVS52 with homology to a DNA virus infecting the diatom Chaetoceros salsugineum, relatively short sequences with homology to the SRV+ Picalivirus D (Greninger & DeRisi, 2015a) (BPVS51), Puurmala virus (a handavirus of Bunyaviridae; BPVS53) and a plant virus, Currant latent virus (BPVS54). In addition, there were three contigs (BPVS 48, 49 and 50) that hit hypoviruses. While the top hit of BPVS45 is Rosellinia necatrix fusarivirus 1, this contig also hit hypovirus sequences. Hypoviruses are double-stranded RNA (dsRNA) viruses, which are only known to infect Fungi (http://ictvonline.org/virusTaxonomy.asp). Protein sequence identities of the four putative hypoviral sequences to known hypoviruses were 26–50%.
Viral sequences discovered from the B. globosus transcriptome
Fewer viral sequences were assembled from the three sets of BuG sequence data. Only nine contigs derived from viral sequences were identified by BLASTx. From the nine virus-related contigs, one near complete SRV+ genome sequence, named Bulinus globosus-associated virus 1 (BuGV1: GenBank Accession No. MT108492) and a Nora virus-like virus (Bulinus globosus-associated virus 2, BuGV2: Fragment 1 GenBank Accession No. MT108493; Fragment 2, MT108494) were identified. The other putative viral contigs are listed in Table S2.
Similar to BPV1, BPV2 and BPV3, BuGV1 is dicistrovirus-like with 8,528 nt of assembled sequence (Fig. 1). The non-structural ORF1 polyprotein of BuGV1 has homology to MRNAVJPB with 32% sequence identity, while the top hit of the structural ORF2 polyprotein was the ORF2 polyprotein of Marine RNA virus SF-1 (MRNAVSF1) (Greninger & DeRisi, 2015b) with 34% sequence identity. The conserved helicase and RdRP domains were identified from the ORF1 polyprotein, and two Rhv-like and VP4 domains were found in the ORF2 polyprotein (Fig. 1).
In addition to BuGV1, two contigs of 5,278 and 1,529 nt hit an insect Nora virus (VP2, replication related protein), suggesting these two contigs may be derived from the same virus (BuGV2). Nora virus is an SRV+ of Drosophila that encodes four potential ORFs (Habayeb, Ekengren & Hultmark, 2006) (Fig. 4). BLAST annotation showed that the contig of 1,529 nt hit a helicase domain within the Nora virus non-structural proteins, while the contig of 5,278 nt contains two ORFs. The 5′-end of this contig encodes partial sequences of a non-structural protein, while the 3′-half of the contig encodes an ORF of 893 aa (Fig. 4). While this ORF polyprotein did not hit any viral genes, two rhv-like CP domains were identified suggesting that it encodes a putative CP gene. Based on the ORF arrangement, the viral sequences encoded by these two contigs were not derived from a Nora-like virus, but likely from a new type of virus with similarity to dicistroviruses (Fig. 2). BuGV2 currently lacks more than 1 kb of sequence in ORF1.
Confirmation of viral sequences
As BPV2-derived sequence was present in all but one of the sequencing datasets used for virus sequence identification (Table 2), we first designed primers (Table S1) to amplify a short fragment (339 nt) to confirm virus sequence presence by RT-PCR. Only two of the 21 BP samples (19 and 20) along with the three BuG samples (22–24), tested negative for the presence of BPV2 (Fig. 5). The absence of BPV2 sequence from BuG as determined by RT-PCR was consistent with the absence of BPV2 sequences in these transcriptomes, and the absence of BPV2 from samples 19 and 20 is consistent with the relatively low number of reads mapped for sequence data based on snails 19 to 21 (Table 2). We then designed primers to generate longer fragments of BPV2, BPV3, BPV4 and BuGV1 (Table S1). BPV2, BPV3 and BPV4 sequences were only amplified from BP snail samples, while BuGV1 sequences were only found in BuG samples (Fig. 5; Fig. S1).
Distribution of viral sequences among snail samples
To determine the distribution of virus-derived sequences among the snail samples, we examined the viral sequences by mapping the Illumina sequencing reads (50–100 nt) to the five near complete (Fig. 1) and to seven of the partial virus genome sequences with fragments encoding >800 aa (Table 2). BPV1, which was isolated from a snail collected from Asao, Kenya and sequenced with the 454 platform, was not detected from any of the BP isolated from Kasabong, Kenya. The viral sequences discovered from BuG samples (collected from eastern Kenya) were not detected from the BP samples (collected from western Kenya). Similarly, no reads derived from BuG samples mapped to the viral sequences assembled from the BP samples, except for BPVS6. Only three BuG reads matched the BPVS6 sequence presumably derived from sequence fragments conserved in SRV+. These results suggest that snail-associated virus sequences identified in the current study were specific to both snail species and geography. Among the BP samples collected from Kasabong, the viral sequences were widely distributed across all BP snails. The highest sequence coverage for BP viruses was for BPV2, with 6–263 fold coverage of different regions of the genome. The highest viral sequence coverage was observed from BuG samples, with up to 298-fold sequence coverage for BuGV1 (Table 2).
Phylogenetic analysis of SRV+ identified in snails and related RNA viruses from other organisms
To examine phylogenetic relationships among the viral sequences identified, we used peptide sequences derived from the RdRP domains and CP for analysis. The RdRP sequence is highly conserved among RNA viruses (Koonin, Gorbalenya & Chumakov, 1989; Zanotto et al., 1996), while the viral capsid protein that functions in virus host range is more divergent. Four snail viral genome sequences (Biomphalaria glabrata virus 1, BGV1; Biomphalaria glabrata virus 2, BGV2; Biomphalaria glabrata virus 3, BGV3; and Biomphalaria virus 4, BiV4) identified from RNA sequence data for B. glabrata (Adema et al., 2017; Galinier et al., 2017) were included in the phylogenetic analyses. The virus names from Adema et al., were used in our analysis (based on RdRP sequences: BGV1 is BiV2 from Galinier et al. (2017), BGV2 is BiV1, and BGV3 is BiV3).
Annotation of the snail-derived SRV+ sequences revealed that these viruses are closely related to Marnaviruses (with algal hosts), Bacillarnaviruses (with marine diatom hosts) and insect small RNA viruses (dicistrovirus-like, iflavirus-like and others). There was no similarity to SRV+ infecting marine mammals, e.g., Seal picornalike virus 1 (Kapoor et al., 2008). The RdRP domain is highly conserved in RNA viruses (Koonin et al., 2008). To analyze phylogenetic relationships between the snail-derived SRV+ sequences and similar viruses identified from aquatic environments and insects, the conserved RdRP sequences of selected viruses (based on BLAST annotation results with conserved domain cd01699) were extracted, aligned and used for construction of a phylogenetic tree. This RdRP-based phylogenetic analysis clustered the snail viruses into three groups (Fig. 6): First, BPV1, BPV3, BPV4 and BuGV1 grouped with viruses isolated from diatoms and aquatic environments, and hence these four viruses are unlikely to infect snails. This analysis suggests that the RdRP of BPV3 and BuGV1, and BPV1 and BPV4 are closely related, although the genome of BPV4 encodes a single ORF while that of BPV1 encodes two. RdRP sequences from the other snail-derived viruses separated into two major clusters primarily comprised of insect viruses. For the second group, BPV2 and BPVS1 RdRP sequences grouped with distro-like viruses, along with viruses of marine invertebrates (MCDV, MRTV and TSV) and two marine RNA viruses (APLV1 and LaVUC1) with unknown hosts. Third, based on RdRP sequences, four snail viruses (BGV1, BGV2, BGV3 and BuGV2) together with two unclassified insect viruses (AGV1, APV) and one insect Nora virus (NoV) form a unique clade. It is notable that although closely related with 62.3% RdRP sequence identity, BGV2 and BuGV2 were isolated from different snail genera collected from different geographic locations (BG from South America and BuG from Africa). Although BLAST analysis indicated similarity of some snail virus genome sequences with iflaviruses, none grouped with the iflavirus clade on RdRP analysis. BiV4 did not group into any of the three clusters. The RdRP of this virus is distant from those of the insect viruses or marine viruses used in this analysis.
In addition to eight complete small RNA viral CP, five sequence fragments (BPVS6, 16, 17, 33 and 42) encode near full-length putative CP (>90%; Fig. 3) and were included in the phylogenetic analysis based on CP sequences. As with RdRP domains, annotation of the CP sequences also showed protein sequence similarity to several marine viruses and insect RNA viruses (Fig. 7). For phylogenetic trees based on structural proteins, we used the CP or putative CP sequences of selected viruses isolated from aquatic unicellular organisms, marine samples, insect SRV+, and SRV+ discovered from snails (Fig. 7). BPV3, BuGV1, BPVS33 and BPVS42 grouped with marnaviruses, bacillarnaviruses and other unclassified SRV+ isolated from seawater. BPV1, BPV4 and BPVS16 group adjacent to the marine-derived virus groups and separate from BGV2 and BuGV2. Similar to the phylogenetic tree based on RdRP domains (Fig. 6), the CP of BGV2 and BuGV2 group together and based on CP sequences appear to be distant from other viruses isolated from the snail transcriptomes (Fig. 7). BPV2 and BPVS17 grouped with other marine-derived viruses adjacent to the dicistro-like viruses, while BGV1, BGV3, BPVS6 and BiV4 grouped adjacent to the iflavirus group. Interestingly, BPV2 and BPVS17 grouped with Antarctic picorna-like virus 1 (Lopez-Bueno et al., 2015). The RdRP K B2009con15 (Culley et al., 2014) isolated from seawater also grouped with insect related SRV+, suggesting that these viral sequences may be derived from viruses infecting invertebrates. The notable CP sequence variation among the SRV+ isolated from aquatic environments could reflect the complex evolution of SRV+ (Koonin & Dolja, 1993) or recent divergence (Soltis & Soltis, 2003).
Taken together, both RdRP and CP sequences suggest that BPV1, BPV3, BPV4 and BuGV1 are aquatic viruses, while BPV2 is dicistro-virus like. While the CP of BGV1 and BGV3 place these viruses as iflavirus-like, this association is unclear from analysis of RdRP sequences.
Discussion
We identified more than 17 diverse novel viral sequences by analyzing transcriptome data for two important snail vectors of schistosomiasis, B. pfeifferi and B. globosus. In so doing, we have characterized one aspect of the complex microbiome encountered by schistosome parasites in their snail vectors (Dheilly et al., 2019), with the potential use of such viruses for biological control for suppression of snail populations. Multiple sequences derived from putative viruses with (+)ssRNA, (−)ssRNA, dsRNA and DNA genomes were identified based on protein sequence annotations (Table S2). However, most viral sequences were derived from novel viruses with (+)ssRNA with more than 12 viruses isolated from BP (four near full-length genomes, and partial sequences from at least eight viruses) and four from BuG (one near full length genome, and three partial sequences). All snails appeared to be associated with multiple SRV+ with five viruses from the present study (BPV2, BPVS1, BPVS6, BVPS17 of BP, BuGV2 of BuG) predicted to replicate in the snail vector based on phylogenetic relatedness to arthropod viruses.
Diverse genome structures evident in snail-derived viruses
The order Picornavirales includes highly divergent viral groups including Caliciviridae, Dicistroviridae, Iflaviridae, Comoviridae, Sequiviridae, Potyviridae and Marnaviridae. This “picornavirus-like superfamily” was classified into six clades based on genomic RNA structures (Koonin et al., 2008). SRV+ discovered from aquatic environments were either dicistro-like (Bacillarnavirus and Labyrnavirus, belonging to Clade 1), or encoded a single polypeptide (Marnavirus, Clade 1), in which the nonstructural proteins are located at the N-terminus, and the structural proteins at the C-terminus (Fig. 2). This type of genome structure differs from those of polioviruses, sequiviruses or insect iflaviruses (genome structure-based Clade 6), which also encode a single ORF, but with the structural and non-structural proteins located at the N- and C-terminus respectively (Fig. 2).
The majority of viruses identified from the two species of snails reported here, along with viruses identified previously from B. glabrata (Adema et al., 2017; Galinier et al., 2017) are similar to dicistroviruses and Marnaviruses (Fig. 2). However, a second type of dicistro-like virus (BGV2 (Adema et al., 2017), Fig. 2) was also identified. The CP of these two closely related viruses has Rh_v but no CrPV domains and show no similarity to other viruses by BLAST analysis, while the ORF encoding nonstructural proteins shows sequence homology to Nora virus that infects Drosophila (Fig. 4). The genome structure of BGV3 (Fig. 2) differs from those of all other viruses found from snails or aquatic environments. The genome structure of BGV3 resembles that of an insect iflavirus. The CP of BGV3 shows partial homology to an insect virus (highest BLAST score with Aphis glycines virus 1; GenBank KF360262.1) and other insect or marine RNA viruses.
SRV+ of insects, snails and unicellular eukaryotes are similar in genome and protein sequence suggesting a common ancestor
The viruses with positive sense, single strand RNA genomes from snails, along with viruses from diatoms and aquatic environments are similar to insect SRV+. This similarity is apparent from the viral genome structure with a single ORF or two ORFs, and from protein sequence similarities. Expression of dicistrovirus ORF2 is mediated by an IRES. Potential IRES sequences were predicted in the intergenic regions (IGR) of BPV1, BPV2, BPV3 and BuGV1 by IRESite (http://www.iresite.org/). Similar to insect SRV+, SRV+ from aquatic environments usually have four capsid subunits (VP1–VP4) that are generated on proteolytic cleavage of the structural polyprotein. The subunit arrangement of the viral structural proteins of snail and diatom infecting SRV+ is similar to that observed in insect infecting dicistroviruses, in that the smallest VP4 is not the first subunit in the structural polyprotein (Fig. 1) (Bonning & Miller, 2010; Liljas et al., 2002). In contrast, in the SRV+ infecting mammals, VP4 is the first CP component (Liljas et al., 2002). Comparison of SRV+ from insects, snails, diatoms and other viruses from aquatic environments suggests that these viruses have a common ancestor.
Not all snail-derived viruses are snail viruses
Virus sequences identified from snail samples reflect virus populations associated with the snail holobiont, i.e., the snail and all associated organisms. The majority of snail-derived virus sequences had similarity to the sequences of unicellular eukaryote-infecting RNA viruses (Marnavirus, Bacillarnavirus, Labyrnaviruses and other unclassified marine RNA viruses), insect SRV+ (dicistroviruses, iflaviruses and others, e.g., Nora virus,) and SRV+ isolated from aquatic arthropods (shrimp and crab). Few BLAST hits were from vertebrate SRV+, e.g., Picalivirus and Calicivirus, commonly found in sewage-derived RNA (Ng et al., 2012). These results suggest that the SRV+ sequences from snails are diverse and likely derived from multiple sources, i.e., from food material, snail parasites or symbionts, or microorganisms passively associated either peripherally or internally with the snail. There was no correlation between schistosome infection or treatment and the presence of any of the viruses identified (Table 2; Buddenborg et al., 2019).
In the past decade, metagenomic analysis of viral sequences from seawater and freshwater revealed previously unknown SRV+ populations in aquatic environments (Culley, Lang & Suttle, 2003; Culley, Lang & Suttle, 2007; Culley et al., 2014; Djikeng et al., 2009; Lopez-Bueno et al., 2015; Mohiuddin & Schellhorn, 2015; Yoshida et al., 2013). However, the majority of the aquatic environment-derived viral sequences were partial, short sequences, and the hosts of these putative viruses are largely unknown. Only a few SRV+ infecting diatoms and other unicellular organisms have been investigated, and three novel (+)ssRNA viral genera, e.g., Marnaviridae, Bacillarnavirus and Labyranvirus in the order Picornavirales have been classified (Lang, Culley & Suttle, 2004; Viruses ICoTo, 2015). The SRV+ sequences discovered from freshwater snails were either close to sequences of diatom-infecting viruses, or of insect-infecting SRV+ and SRV+ with unknown hosts isolated from aquatic environments (Figs. 6, 7: Table S2), suggesting that the SRV+ from freshwater snails may include viruses that infect both unicellular organisms and aquatic invertebrates. Indeed, phylogenetic trees based on RdRP and CP, consistently grouped BPV1, BPV3, BPV4 and BuGV1 with diatom and marine RNA viruses, and BPV2 with invertebrate/dicistrovirus-like viruses. The CP (but not RdRP sequences) grouped BGV1 and BGV3 with iflaviruses (Figs. 6, 7).
Although empirical tests are required to establish which viruses infect snails, based on sequence similarity and phylogenetic relationships, we conclude that BuGV2 and BPV2 along with viruses from which genome fragments BPVS1, BPVS6 and BPVS17 were derived, are likely to be snail-infecting viruses. This work also supports the likelihood that BGV1, BGV2, BGV3 and BiV4 are snail-infecting viruses. Further investigation is needed for confirmation of a subset of these viruses, identification of their respective hosts, and assessment of chronic versus acute pathology for those that replicate in the snail. The Bge cell line derived from embryos of the schistosome-transmitting snail Biomphalaria glabrata (Rinaldi et al., 2015) will provide an invaluable tool for further study of molluscan virology, potentially allowing for in vitro propagation and analysis of the putative snail viruses described herein.
Supplemental Information
Acknowledgments
The authors thank Dr. John VanDyk for IT support at Iowa State University, and Martin Mutuku from KEMRI for assistance with collection of snails. Technical assistance was provided by the University of New Mexico Molecular Biology Facility. The content for this paper is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Funding Statement
This work was supported by the National Institute of General Medical Sciences of the National Institutes of Health under Award Number P30GM110907 and the National Institute of Health (NIH) grant R37AI101438. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Additional Information and Declarations
Competing Interests
The authors declare that they have no competing interests.
Author Contributions
Sijun Liu conceived and designed the experiments, performed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the paper, and approved the final draft.
Si-Ming Zhang performed the experiments, prepared figures and/or tables, authored or reviewed drafts of the paper, and approved the final draft.
Sarah K. Buddenborg performed the experiments, authored or reviewed drafts of the paper, and approved the final draft.
Eric S. Loker conceived and designed the experiments, authored or reviewed drafts of the paper, coordinated with appropriate authorities to allow for this work to be done, and approved the final draft.
Bryony C. Bonning conceived and designed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the paper, and approved the final draft.
Field Study Permissions
The following information was supplied relating to field study approvals (i.e., approving body and any reference numbers):
This project was undertaken with approval of Kenya’s National Commission for Science, Technology and Innovation (permit number NACOSTI/P/15/9609/4270), Kenya Wildlife Service (KWS-0045-03-21 and export permit 0004754), and National Environment Management Authority (NEMA/AGR/46/2014 and NEMA/AGR/149/2021).
DNA Deposition
Data Availability
The following information was supplied regarding data availability:
Data are available at NCBI BioProject under PRJNA383396 and PRJNA451210.
References
- Adema et al. (2017).Adema CM, Hillier LW, Jones CS, Loker ES, Knight M, Minx P, Oliveira G, Raghavan N, Shedlock A, do Amaral LR, Arican-Goktas HD, Assis JG, Baba EH, Baron OL, Bayne CJ, Bickham-Wright U, Biggar KK, Blouin M, Bonning BC, Botka C, Bridger JM, Buckley KM, Buddenborg SK, Lima Caldeira R, Carleton J, Carvalho OS, Castillo MG, Chalmers IW, Christensens M, Clifton S, Cosseau C, Coustau C, Cripps RM, Cuesta-Astroz Y, Cummins SF, di Stephano L, Dinguirard N, Duval D, Emrich S, Feschotte C, Feyereisen R, FitzGerald P, Fronick C, Fulton L, Galinier R, Gava SG, Geusz M, Geyer KK, Giraldo-Calderon GI, de Souza Gomes M, Gordy MA, Gourbal B, Grunau C, Hanington PC, Hoffmann KF, Hughes D, Humphries J, Jackson DJ, Jannotti-Passos LK, de Jesus Jeremias W, Jobling S, Kamel B, Kapusta A, Kaur S, Koene JM, Kohn AB, Lawson D, Lawton SP, Liang D, Limpanont Y, Liu S, Lockyer AE, Lovato TL, Ludolf F, Magrini V, McManus DP, Medina M, Misra M, Mitta G, Mkoji GM, Montague MJ, Montelongo C, Moroz LL, Munoz-Torres MC, Niazi U, Noble LR, Oliveira FS, Pais FS, Papenfuss AT, Peace R, Pena JJ, Pila EA, Quelais T, Raney BJ, Rast JP, Rollinson D, Rosse IC, Rotgans B, Routledge EJ, Ryan KM, Scholte LLS, Storey KB, Swain M, Tennessen JA, Tomlinson C, Trujillo DL, Volpi EV, Walker AJ, Wang T, Wannaporn I, Warren WC, Wu XJ, Yoshino TP, Yusuf M, Zhang SM, Zhao M, Wilson RK. Whole genome analysis of a schistosomiasis-transmitting freshwater snail. Nature Communications. 2017;8(1):15451. doi: 10.1038/ncomms15451. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bolduc et al. (2012).Bolduc B, Shaughnessy DP, Wolf YI, Koonin EV, Roberto FF, Young M. Identification of novel positive-strand RNA viruses by metagenomic analysis of archaea-dominated Yellowstone hot springs. Journal of Virology. 2012;86(10):5562–5573. doi: 10.1128/JVI.07196-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bonning & Miller (2010).Bonning BC, Miller WA. Dicistroviruses. Annual Review of Entomology. 2010;55(1):129–150. doi: 10.1146/annurev-ento-112408-085457. [DOI] [PubMed] [Google Scholar]
- Boros et al. (2011).Boros A, Pankovics P, Simmonds P, Reuter G. Novel positive-sense, single-stranded RNA (+ssRNA) virus with di-cistronic genome from intestinal content of freshwater carp (Cyprinus carpio) PLOS ONE. 2011;6(12):e29145. doi: 10.1371/journal.pone.0029145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Buddenborg et al. (2017).Buddenborg SK, Bu L, Zhang SM, Schilkey FD, Mkoji GM, Loker ES. Transcriptomic responses of Biomphalaria pfeifferi to Schistosoma mansoni: investigation of a neglected African snail that supports more S. mansoni transmission than any other snail species. PLOS Neglected Tropical Diseases. 2017;11(10):e0005984. doi: 10.1371/journal.pntd.0005984. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Buddenborg et al. (2019).Buddenborg SK, Kamel B, Hanelt B, Bu L, Zhang SM, Mkoji GM. The in vivo transcriptome of Schistosoma mansoni in the prominent vector species Biomphalaria pfeifferi with supporting observations from Biomphalaria glabrata. PLOS Neglected Tropical Diseases. 2019;13(9):e0007013. doi: 10.1371/journal.pntd.0007013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Camacho et al. (2009).Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. BLAST+: architecture and applications. BMC Bioinformatics. 2009;10(1):421. doi: 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cardoso et al. (2012).Cardoso AM, Cavalcante JJ, Cantao ME, Thompson CE, Flatschart RB, Glogauer A, Scapin SM, Sade YB, Beltrao PJ, Gerber AL, Martins OB, Garcia ES, de Souza W, Vasconcelos AT. Metagenomic analysis of the microbiota from the crop of an invasive snail reveals a rich reservoir of novel genes. PLOS ONE. 2012;7(11):e48505. doi: 10.1371/journal.pone.0048505. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Colley et al. (2014).Colley DG, Bustinduy AL, Secor WE, King CH. Human schistosomiasis. Lancet. 2014;383(9936):2253–2264. doi: 10.1016/S0140-6736(13)61949-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Culley, Lang & Suttle (2003).Culley AI, Lang AS, Suttle CA. High diversity of unknown picorna-like viruses in the sea. Nature. 2003;424:1054–1057. doi: 10.1038/nature01886nature01886. [DOI] [PubMed] [Google Scholar]
- Culley, Lang & Suttle (2006).Culley AI, Lang AS, Suttle CA. Metagenomic analysis of coastal RNA virus communities. Science. 2006;312:1795–1798. doi: 10.1126/science.1127404. [DOI] [PubMed] [Google Scholar]
- Culley, Lang & Suttle (2007).Culley AI, Lang AS, Suttle CA. The complete genomes of three viruses assembled from shotgun libraries of marine RNA virus communities. Virology Journal. 2007;4:69. doi: 10.1186/1743-422X-4-69. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Culley et al. (2014).Culley AI, Mueller JA, Belcaid M, Wood-Charlson EM, Poisson G, Steward GF. The characterization of RNA viruses in tropical seawater using targeted PCR and metagenomics. mBio. 2014;5(3):e01210. doi: 10.1128/mBio.01210-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Culley & Steward (2007).Culley AI, Steward GF. New genera of RNA viruses in subtropical seawater, inferred from polymerase gene sequences. Applied and Environmental Microbiology. 2007;73:5937–5944. doi: 10.1128/AEM.01065-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dheilly et al. (2019).Dheilly NM, Martínez Martínez J, Rosario K, Brindley PJ, Fichorova RN, Kaye JZ, Kohl KD, Knoll LJ, Lukeš J, Perkins SL, Poulin R, Schriml L, Thompson LR. Parasite microbiome project: grand challenges. PLOS Pathogens. 2019;15(10):e1008028. doi: 10.1371/journal.ppat.1008028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Djikeng et al. (2009).Djikeng A, Kuzmickas R, Anderson NG, Spiro DJ. Metagenomic analysis of RNA viruses in a fresh water lake. PLOS ONE. 2009;4(9):e7264. doi: 10.1371/journal.pone.0007264. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Duval et al. (2015b).Duval D, Galinier R, Mouahid G, Toulza E, Allienne JF, Portela J, Calvayrac C, Rognon A, Arancibia N, Mitta G, Theron A, Gourbal B. A novel bacterial pathogen of Biomphalaria glabrata: a potential weapon for schistosomiasis control? PLOS Neglected Tropical Diseases. 2015b;9(2):e0003489. doi: 10.1371/journal.pntd.0003489. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Duval et al. (2015a).Duval D, Galinier R, Mouahid G, Toulza E, Allienne JF, Portela J, Calvayrac C, Rognon A, Arancibia N, Mitta G, Theron A, Gourbal B. Correction: a novel bacterial pathogen of Biomphalaria glabrata: a potential weapon for schistosomiasis control? PLOS Neglected Tropical Diseases. 2015a;9(6):e0003815. doi: 10.1371/journal.pntd.0003815. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Galinier et al. (2017).Galinier R, Tetreau G, Portet A, Pinaud S, Duval D, Gourbal B. First characterization of viruses from freshwater snails of the genus Biomphalaria, the intermediate host of the parasite Schistosoma mansoni. Acta Tropica. 2017;167(Suppl):196–203. doi: 10.1016/j.actatropica.2016.12.021. [DOI] [PubMed] [Google Scholar]
- Greninger & DeRisi (2015a).Greninger AL, DeRisi JL. Draft genome sequence of Picalivirus D recovered from San Francisco wastewater. Genome Announcements. 2015a;3:e00654 15. doi: 10.1128/genomeA.00654-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Greninger & DeRisi (2015b).Greninger AL, DeRisi JL. Draft genome sequences of marine RNA viruses SF-1, SF-2, and SF-3 recovered from San Francisco wastewater. Genome Announcements. 2015b;3:e00653 15. doi: 10.1128/genomeA.00653-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haas et al. (2013).Haas BJ, Papanicolaou A, Yassour M, Grabherr M, Blood PD, Bowden J, Couger MB, Eccles D, Li B, Lieber M, Macmanes MD, Ott M, Orvis J, Pochet N, Strozzi F, Weeks N, Westerman R, William T, Dewey CN, Henschel R, Leduc RD, Friedman N, Regev A. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nature Protocols. 2013;8(8):1494–1512. doi: 10.1038/nprot.2013.084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Habayeb, Ekengren & Hultmark (2006).Habayeb MS, Ekengren SK, Hultmark D. Nora virus, a persistent virus in Drosophila, defines a new picorna-like virus family. Journal of General Virology. 2006;87(10):3045–3051. doi: 10.1099/vir.0.81997-0. [DOI] [PubMed] [Google Scholar]
- Hoang et al. (2018).Hoang DT, Chernomor O, von Haeseler A, Minh BQ, Vinh LS. UFBoot2: improving the ultrafast bootstrap approximation. Molecular Biology and Evolution. 2018;35(2):518–522. doi: 10.1093/molbev/msx281. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang & Madan (1999).Huang X, Madan A. CAP3: a DNA sequence assembly program. Genome Research. 1999;9(9):868–877. doi: 10.1101/gr.9.9.868. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kalyaanamoorthy et al. (2017).Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A, Jermiin LS. ModelFinder: fast model selection for accurate phylogenetic estimates. Nature Methods. 2017;14(6):587–589. doi: 10.1038/nmeth.4285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kapoor et al. (2008).Kapoor A, Victoria J, Simmonds P, Wang C, Shafer RW, Nims R, Nielsen O, Delwart E. A highly divergent picornavirus in a marine mammal. Journal of Virology. 2008;82(1):311–320. doi: 10.1128/JVI.01240-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katoh & Standley (2013).Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Molecular Biology and Evolution. 2013;30(4):772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kimura & Tomaru (2015).Kimura K, Tomaru Y. Discovery of two novel viruses expands the diversity of single-stranded DNA and single-stranded RNA viruses infecting a cosmopolitan marine diatom. Applied and Environmental Microbiology. 2015;81(3):1120–1131. doi: 10.1128/AEM.02380-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Koonin & Dolja (1993).Koonin EV, Dolja VV. Evolution and taxonomy of positive-strand RNA viruses: implications of comparative analysis of amino acid sequences. Critical Reviews in Biochemistry and Molecular Biology. 1993;28(5):375–430. doi: 10.3109/10409239309078440. [DOI] [PubMed] [Google Scholar]
- Koonin, Gorbalenya & Chumakov (1989).Koonin EV, Gorbalenya AE, Chumakov KM. Tentative identification of RNA-dependent RNA polymerases of dsRNA viruses and their relationship to positive strand RNA viral polymerases. FEBS Letters. 1989;252(1–2):42–46. doi: 10.1016/0014-5793(89)80886-5. [DOI] [PubMed] [Google Scholar]
- Koonin et al. (2008).Koonin EV, Wolf YI, Nagasaki K, Dolja VV. The big bang of picorna-like virus evolution antedates the radiation of eukaryotic supergroups. Nature Reviews Microbiology. 2008;6:925–939. doi: 10.1038/nrmicro2030. [DOI] [PubMed] [Google Scholar]
- Lang, Culley & Suttle (2004).Lang AS, Culley AI, Suttle CA. Genome sequence and characterization of a virus (HaRNAV) related to picorna-like viruses that infects the marine toxic bloom-forming alga Heterosigma akashiwo. Virology. 2004;320:206–217. doi: 10.1016/j.virol.2003. [DOI] [PubMed] [Google Scholar]
- Liljas et al. (2002).Liljas L, Tate J, Lin T, Christian P, Johnson JE. Evolutionary and taxonomic implications of conserved structural motifs between picornaviruses and insect picorna-like viruses. Archives of Virology. 2002;147:59–84. doi: 10.1007/s705-002-8303-1. [DOI] [PubMed] [Google Scholar]
- Liu, Vijayendran & Bonning (2011).Liu S, Vijayendran D, Bonning BC. Next generation sequencing technologies for insect virus discovery. Viruses. 2011;3(10):1849–1869. doi: 10.3390/v3101849. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lopez-Bueno et al. (2015).Lopez-Bueno A, Rastrojo A, Peiro R, Arenas M, Alcami A. Ecological connectivity shapes quasispecies structure of RNA viruses in an Antarctic lake. Molecular Ecology. 2015;24(19):4812–4825. doi: 10.1111/mec.13321. [DOI] [PubMed] [Google Scholar]
- Marchler-Bauer et al. (2017).Marchler-Bauer A, Bo Y, Han L, He J, Lanczycki CJ, Lu S, Chitsaz F, Derbyshire MK, Geer RC, Gonzales NR, Gwadz M, Hurwitz DI, Lu F, Marchler GH, Song JS, Thanki N, Wang Z, Yamashita RA, Zhang D, Zheng C, Geer LY, Bryant SH. CDD/SPARCLE: functional classification of proteins via subfamily domain architectures. Nucleic Acids Research. 2017;45(D1):D200–D203. doi: 10.1093/nar/gkw1129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mari et al. (2002).Mari J, Poulos BT, Lightner DV, Bonami J-R. Shrimp Taura syndrome virus: genomic characterization and similarity with members of the genus Cricket paralysis-like viruses. Journal of General Virology. 2002;83:915–926. doi: 10.1099/0022-1317-83-4-915. [DOI] [PubMed] [Google Scholar]
- Mohiuddin & Schellhorn (2015).Mohiuddin M, Schellhorn HE. Spatial and temporal dynamics of virus occurrence in two freshwater lakes captured through metagenomic analysis. Frontiers in Microbiology. 2015;6:960. doi: 10.3389/fmicb.2015.00960. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mutuku et al. (2014).Mutuku MW, Dweni CK, Mwangi M, Kinuthia JM, Mwangi IN, Maina GM, Agola LE, Zhang SM, Maranga R, Loker ES, Mkoji GM. Field-derived Schistosoma mansoni and Biomphalaria pfeifferi in Kenya: a compatible association characterized by lack of strong local adaptation, and presence of some snails able to persistently produce cercariae for over a year. Parasit Vectors. 2014;7(1):533. doi: 10.1128/AEM.70.2.704-711.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nagasaki et al. (2004).Nagasaki K, Tomaru Y, Katanozaka N, Shirai Y, Nishida K, Itakura S, Yamaguchi M. Isolation and characterization of a novel single-stranded RNA virus infecting the bloom-forming diatom Rhizosolenia setigera. Applied and Environmental Microbiology. 2004;70:704–711. doi: 10.1128/AEM.70.2.704-711.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nagasaki et al. (2005).Nagasaki K, Tomaru Y, Takao Y, Nishida K, Shirai Y, Suzuki H, Nagumo T. Previously unknown virus infects marine diatom. Applied and Environmental Microbiology. 2005;71(7):3528–3535. doi: 10.1128/AEM.71.7.3528-3535.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ng et al. (2012).Ng TF, Marine R, Wang C, Simmonds P, Kapusinszky B, Bodhidatta L, Oderinde BS, Wommack KE, Delwart E. High variety of known and new RNA and DNA viruses of diverse origins in untreated sewage. Journal of Virology. 2012;86(22):12161–12175. doi: 10.1128/JVI.00869-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nguyen et al. (2015).Nguyen LT, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Molecular Biology and Evolution. 2015;32(1):268–274. doi: 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Notredame, Higgins & Heringa (2000).Notredame C, Higgins DG, Heringa J. T-Coffee: a novel method for fast and accurate multiple sequence alignment. Journal of Molecular Biology. 2000;302(1):205–217. doi: 10.1006/jmbi.2000.4042. [DOI] [PubMed] [Google Scholar]
- Rinaldi et al. (2015).Rinaldi G, Yan H, Nacif-Pimenta R, Matchimakul P, Bridger J, Mann VH, Smout MJ, Brindley PJ, Knight M. Cytometric analysis, genetic manipulation and antibiotic selection of the snail embryonic cell line Bge from Biomphalaria glabrata, the intermediate host of Schistosoma mansoni. International Journal for Parasitology. 2015;45(8):527–535. doi: 10.1016/j.ijpara.2015.02.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rosani & Gerdol (2017).Rosani U, Gerdol M. A bioinformatics approach reveals seven nearly-complete RNA-virus genomes in bivalve RNA-seq data. Virus Research. 2017;239(10):33–42. doi: 10.1016/j.virusres.2016.10.009. [DOI] [PubMed] [Google Scholar]
- Rosani et al. (2019).Rosani U, Shapiro M, Venier P, Allam B. A needle in a haystack: tracing bivalve-associated viruses in high-throughput transcriptomic data. Viruses. 2019;11(3):205. doi: 10.3390/v11030205. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sokolow et al. (2015).Sokolow SH, Huttinger E, Jouanard N, Hsieh MH, Lafferty KD, Kuris AM, Riveau G, Senghor S, Thiam C, N’Diaye A, Faye DS, De Leo GA. Reduced transmission of human schistosomiasis after restoration of a native river prawn that preys on the snail intermediate host. Proceedings of the National Academy of Sciences USA. 2015;112(31):9650–9655. doi: 10.1073/pnas.1502651112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Soltis & Soltis (2003).Soltis PS, Soltis DE. Applying the Bootstrap in phylogeny reconstruction. Statistical Science. 2003;18(2):256–267. doi: 10.1214/ss/1063994980. [DOI] [Google Scholar]
- Sosa-Gómez et al. (2020).Sosa-Gómez DR, Morgado FS, Corrêa RFT, Silva LA, Ardisson-Araújo DMP, Rodrigues BMP, Oliveira EE, Aguiar RWS, Ribeiro BM. Entomopathogenic viruses in the neotropics: current status and recently discovered species. Neotropical Entomology. 2020;49(3):315–331. doi: 10.1007/s13744-020-00770-1. [DOI] [PubMed] [Google Scholar]
- Suttle (2005).Suttle CA. Viruses in the sea. Nature. 2005;437(7057):356–361. doi: 10.1038/nature04160. [DOI] [PubMed] [Google Scholar]
- Suttle (2007).Suttle CA. Marine viruses–major players in the global ecosystem. Nature Reviews Microbiology. 2007;5(10):801–812. doi: 10.1038/nrmicro1750. [DOI] [PubMed] [Google Scholar]
- Takao et al. (2006).Takao Y, Mise K, Nagasaki K, Okuno T, Honda D. Complete nucleotide sequence and genome organization of a single-stranded RNA virus infecting the marine fungoid protist Schizochytrium sp. Journal of General Virology. 2006;87:723. doi: 10.1099/vir.0.81204-0. [DOI] [PubMed] [Google Scholar]
- Tomaru et al. (2013a).Tomaru Y, Toyoda K, Kimura K, Takao Y, Sakurada K, Nakayama N, Nagasaki K. Isolation and characterization of a single-stranded RNA virus that infects the marine planktonic diatom Chaetoceros sp. (SS08-C03) Phycological Research. 2013a;61(1):27–36. doi: 10.1111/j.1440-1835.2012.00670.x. [DOI] [Google Scholar]
- Tomaru et al. (2013b).Tomaru Y, Toyoda K, Suzuki H, Nagumo T, Kimura K, Takao Y. New single-stranded DNA virus with a unique genomic structure that infects marine diatom Chaetoceros setoensis. Scientific Reports. 2013b;3(1):3337. doi: 10.1038/srep03337. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Viruses ICoTo (2015).Viruses ICoTo Virus taxonomy: 2015 release. 2015. http://www.ictvonline.org/virusTaxonomy.asp?msl_id=30 http://www.ictvonline.org/virusTaxonomy.asp?msl_id=30
- Yoshida et al. (2013).Yoshida M, Takaki Y, Eitoku M, Nunoura T, Takai K. Metagenomic analysis of viral communities in (hado) pelagic sediments. PLOS ONE. 2013;8(2):e57271. doi: 10.1371/journal.pone.0057271. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zanotto et al. (1996).Zanotto PM, Gibbs MJ, Gould EA, Holmes EC. A reevaluation of the higher taxonomy of viruses based on RNA polymerases. Journal of Virology. 1996;70:6083–6096. doi: 10.1128/JVI.70.9.6083-6096.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The following information was supplied regarding data availability:
Data are available at NCBI BioProject under PRJNA383396 and PRJNA451210.