The enormous diversity of RNA viruses in insects is continuously validated. Parasitoid wasps, as biocontrol insects which are widely used against insect pests in agroecosystems, may also carry many “good” RNA viruses.
KEYWORDS: RNA viruses, parasitoid wasp, rice weevil, Picornavirales, Mononegavirales, RdRp
ABSTRACT
In this study, many virus-like fragments were obtained from transcriptomes of three wasp species, including Anisopteromalus calandrae (8), Lariophagus distinguendus (3), and Theocolax elegans (18), which can parasitize and control rice weevil Sitophilus oryzae, a serious insect pest of farm-stored grains. By further bioinformatic analysis and sequencing, we identified six novel RNA viruses with complete genomes and named them WWPSRV-1, WWPSRV-2, AcPSRV-1, AcNSRV-1, AcNSRV-2, and LdNSRV-1. PCR-based detection revealed that WWPSRV-1 and WWPSRV-2 had the possibility of interspecies virus transmission, especially WWPSRV-2, which was also present in the rice weevil adults. Phylogenetically, three out of these six viruses appeared to be members of order Picornavirales: WWPSRV-1 belonged to unassigned virus families of this order, whereas WWPSRV-2 and AcPSRV-1 belonged to families Iflaviridae and Dicistroviridae, respectively. The conserved picornavirus-typical domains helicase, protease, and RNA-dependent RNA polymerase could be found in the nonstructural protein encoded by the three viruses, whose genomes consisted of the different numbers of open reading frames (ORFs). The other three RNA viruses could be classified to order Mononegavirales: AcNSRV-1 and AcNSRV-2 belonged to family Lispiviridae, whereas LdNSRV-1 belonged to a big family Rhabdoviridae. The genomes of the three viruses contained at least five ORFs, encoding deduced proteins in the following order: 3′-N-P-M-G-L-5′. All the ORFs were separated by conserved intergenic sequences which likely regulated the transcription termination and initiation. Our findings enhance the understanding of RNA viruses in weevil wasps and set the foundation for the future study of the association among weevils, weevil wasps, and RNA viruses.
IMPORTANCE The enormous diversity of RNA viruses in insects is continuously validated. Parasitoid wasps, as biocontrol insects which are widely used against insect pests in agroecosystems, may also carry many “good” RNA viruses. Some RNA viruses in parasitoid wasps have been reported to affect the host wasps or the wasps’ host. Here, six novel RNA viruses with complete genomes were identified in three parasitoid wasps of the rice weevil. One of these viruses was also detected in the rice weevil adults. Phylogenetically, WWPSRV-1 was the first unambiguous detection of Nora-like virus in insect parasitoids. WWPSRV-2 and AcPSRV-1 belong to families Iflaviridae and Dicistroviridae, some viruses of which can result in lethal infections in silkworms and honeybees. The other three RNA viruses belong to order Mononegavirales, which comprises many well-known insect-associated viruses.
INTRODUCTION
RNA viruses are increasingly discovered in invertebrate species which display enormous diversity and can be generally divided into positive-, negative-, or double-stranded RNA viruses (1–3). Species abundance and diversity of RNA viruses also exist in insects, the largest group in invertebrates (4–6). Nowadays, the application of metagenomics accelerates the studies of viral identification, persistence, spread, and interaction with both their insect hosts and other microbes. For example, using metagenomic analysis, seven novel positive-strand RNA viruses, belonging to the families Dicistroviridae, Parvoviridae, and Circoviridae, are identified in an important stingless bee Melipona quadrifasciata which have the largest copy numbers in unhealthy bees and may be pathogenic (7). Eight novel RNA viruses, including four positive-strand RNA viruses, two negative-strand RNA viruses, and two double-stranded RNA, are discovered in the high-throughput sequencing data of an important invasive agricultural insect pest, the oriental fruit fly Bactrocera dorsalis (8). Through a metatranscriptomics analysis, many new RNA virus species are discovered in different mosquito populations, including many that affect human health. This indicates that the diversity of RNA viruses is affected by the host mosquito taxon and genetic background (9–11). With understanding of mosquito viromes, the prevention and control of pathogenic arboviruses will be more easily manipulated.
Parasitoid wasps (Hymenoptera: Apocrita) are widely used as biocontrol agents against insect (Arthropoda: Hexapoda: Insecta) pests in agroecosystems. They are frequently associated with viruses or virion-like particles (VLPs). In the past decade, RNA viruses have been reported in some parasitoid wasps, as well as in virus-host interactions. Several viruses have been found to affect the host wasps or the wasps’ host. For example, Pteromalus puparum negative-strand RNA virus 1, an artovirus found in Pteromalus puparum wasp which preys on butterflies of several species, increases the longevity of the wasp while it reduces female offspring numbers (12).
The rice weevil, Sitophilus oryzae (Coleoptera: Curculionidae), a serious insect pest of farm-stored grains, is prevalent mainly in the south and the center of China (13). It could cause significant losses in the grain weight and reduce nutritive value (14). Three hymenopteran Pteromalidae parasitoids, including Anisopteromalus calandrae, Lariophagus distinguendus, and Theocolax (Choetospila) elegans, which can parasitize the rice weevil larvae in nature, have great potential in biological control of S. oryzae (15, 16). To better understand the relationship between these parasitoid wasps and their hosts, we sequenced the transcriptomes of the three wasps. Six novel viruses were discovered in wasps A. calandrae, L. distinguendus, and T. elegans, named weevil wasp positive-strand RNA virus 1 (WWPSRV-1), weevil wasp positive-strand RNA virus 2 (WWPSRV-2), Anisopteromalus calandrae positive-strand RNA virus 1 (AcPSRV-1), Anisopteromalus calandrae negative-strand RNA virus 1 (AcNSRV-1), Anisopteromalus calandrae negative-strand RNA virus 2 (AcNSRV-2), and Lariophagus distinguendus negative-strand RNA virus 1 (LdNSRV-1). Moreover, we analyzed the genome organization and phylogeny of these viruses. Our results revealed that WWPSRV-1, WWPSRV-2, and AcPSRV-1 were related to picornaviruses and AcNSRV-1, AcNSRV-2, and LdNSRV-1 were related to mononegaviruses.
RESULTS
Virus-like contigs obtained from the transcriptomes of three parasitoid wasps.
Through transcriptome analysis, 29 virus-like fragments were obtained in three weevil wasp species, including A. calandrae (8), L. distinguendus (3), and T. elegans (18) (Table S3). The majority of small contigs were in T. elegans. Based on BLASTX results, these fragments were classified into one of two main groups: positive-strand RNA virus (11) or negative-strand RNA virus (18) (Table S3). All three wasps had these two types of virus-like sequences (Table 1 and Table S3).
TABLE 1.
The distribution of the viruses in the three different wasps’ transcriptomesa
| Virus | Abbreviation | Length (bp) | Anisopteromalus calandrae | Lariophagus distinguendus | Theocolax elegans |
|---|---|---|---|---|---|
| Weevil wasp positive-strand RNA virus 1 | WWPSRV-1 | 12,332 | + | + | + |
| Weevil wasp positive-strand RNA virus 2 | WWPSRV-2 | 9,409 | + | + | + |
| Anisopteromalus calandrae positive-strand RNA virus 1 | AcPSRV-1 | 7,558 | + | − | − |
| Anisopteromalus calandrae negative-strand RNA virus 1 | AcNSRV-1 | 12,702 | + | − | − |
| Anisopteromalus calandrae negative-strand RNA virus 2 | AcNSRV-2 | 12,118 | + | − | − |
| Lariophagus distinguendus negative-strand RNA virus 1 | LdNSRV-1 | 10,832 | − | + | − |
+, exists; −, does not exist.
Virus-like contigs identified in three different wasps’ transcriptomes. AC, Anisopteromalus calandrae; LD, Lariophagus distinguendus; and TE, Theocolax elegans. Download Table S3, XLSX file, 0.01 MB (12.4KB, xlsx) .
Copyright © 2021 Wang et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
WWPSRV-1, WWPSRV-2, and AcPSRV-1 were related to positive-strand RNA virus.
(i) WWPSRV-1. Three large contigs obtained from the wasps’ transcriptomes (AC_Contig_1, LD_Contig_1, and TE_Contig_1) (Text S1) were highly related to Nasonia vitripennis virus and could be assembled into one big virus-like contig (12,325 bp), named WWPSRV-1 (Table S3 and Fig. S1A). By sequencing the assembled viral genome, including the 5′ and 3′ genome termini, the complete genome of WWPSRV-1 was confirmed to be 12,332 nucleotides (nt) in length, excluding poly-A (Fig. 1A and 2A). G+C pairs comprised 39.94% of the nucleotides. The WWPSRV-1 genome contained four ORFs in 5′to 3′ orientation, located at nt positions 1,003 to 8,739, 8,711 to 9,385, 9,367 to 10,947, and 10,940 to 12,007, respectively (Table 2). The four deduced ORFs of WWPSRV-1 were not in the same frame. ORF1 and ORF3 were in the same frame, while ORF2 and ORF4 were in the other frame. ORF1 to 4 were overlapping by a fragment (range from 2 to 23 bp, excluding the start and stop codons), respectively (Table 2 and Fig. 2A). The leader and trailer regions of the WWPSRV-1 genome were 1,002 and 325 nt in length, respectively (Table 2). A palindromic structure existed in the 5′ leader regions (Fig. 2A). The trailer regions ended with poly-A. Transcriptome sequencing (RNA-seq) reads of the three wasps mapped to the WWPSRV-1 genome exhibited similar fluctuating distribution on the viral genomic RNA, with a lower coverage close to 5′ ends of the genome (Fig. 2A).
FIG 1.
PCR confirmation of WWPSRV-1 (A), WWPSRV-2 (B), AcNSRV-1 (C), and LdNSRV-1 (D) in the parasitoid wasps Anisopteromalus calandrae, Lariophagus distinguendus, and Theocolax elegans and their host Sitophilus oryzae adults. The primers used to confirm viral genome sequence are shown in the Table S1.
FIG 2.
Schematic presentation of the genome of RNA viruses discovered in the Anisopteromalus calandrae (AC), Lariophagus distinguendus (LD), and Theocolax elegans (TE) transcriptomes. (A) Schematic presentation of the genomic organization of positive-strand RNA viruses (WWPSRV-1, WWPSRV-2, and AcPSRV-1). Boxes indicate the position and length of each ORF arranged in the 5′–3′ orientation. In the boxes, the positions of the putative structural proteins (capsid protein, CP 1 to 3) and the nonstructural proteins (helicase, Hel; protease, Pro; RNA-dependent RNA polymerase, RdRp) are shown. There is a zinc finger (Znf) domain in the WWPSRV-2 genome. (B) Schematic presentation of the genomic organization of negative-strand RNA viruses (AcNSRV-1, AcNSRV-2, and LdNSRV-1). Boxes indicate the position and length of each ORF (N, P, M, G, and L genes) arranged in the 3′–5′ negative sense. P6 is found only in the AcNSRV-2 genome. In the L genes, the positions of the putative RdRp, mRNA-capping (Cap), and methyltransferase (MT) domains are shown. RNA-seq mapping for AC (red), LD (green), and TE (blue) showed fluctuating read distributions on viral genomic RNA.
TABLE 2.
Features of the ORFs encoded by (+)ssRNA viruses discovered in the transcriptomes of weevil waspsa
| Virus | ORF | ORF genomic location (nt) | Length (nt | aa) | Protein mass (kDa) | Isoelectric point | Signal peptide | Top BLASTX match | E value | Identity | Accession number |
|---|---|---|---|---|---|---|---|---|---|---|
| WWPSRV-1 | ORF 1 | 1,003 to 8,739 | 7,737 | 2,578 | 295.59 | 8.05 | − | RNA-dependent RNA polymerase (Nasonia vitripennis virus) | 3E−143 | 78.15% | ACN94444.1 |
| ORF 2 | 8,711 to 9,385 | 675 | 224 | 25.73 | 4.79 | − | ORF3 (Nasonia vitripennis virus) | 3E−44 | 47.59% | ACN94446.1 | |
| ORF 3 | 9,367 to 10,947 | 1,581 | 526 | 58.01 | 9.04 | − | ORF4 (Nasonia vitripennis virus) | 0 | 78.60% | ACN94448.1 | |
| ORF 4 | 10,940 to 12,007 | 1,068 | 355 | 38.20 | 4.48 | − | ORF5 (Nasonia vitripennis virus) | 0 | 75.64% | ACN94449.1 | |
| WWPSRV-2 | ORF | 34 to 9,228 | 9,195 | 3,064 | 347.01 | 6.19 | − | Polyprotein (Diabrotica virgifera virgifera virus 1) | 0 | 40.81% | APF29088.1 |
| AcPSRV−1 | ORF 1 | 183 to 4,343 | 4,161 | 1,386 | 159.61 | 8.04 | − | Nonstructural polyprotein (Dicistroviridae sp.) | 2E−162 | 36.21% | QJI52218.1 |
| ORF 2 | 4,660 to 7,302 | 2,643 | 880 | 99.19 | 6.59 | − | Structural polyprotein (black queen cell virus) | 7E−136 | 36.23% | ASS83231.1 |
−, does not exist.
Assemble strategy view of the contigs from the different wasps’ transcriptomes. Download FIG S1, TIF file, 0.4 MB (410.2KB, tif) .
Copyright © 2021 Wang et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Contigs from these three wasps. AC, Anisopteromalus calandrae; LD, Lariophagus distinguendus; and TE, Theocolax elegans. Download Text S1, TXT file, 0.09 MB (89.5KB, txt) .
Copyright © 2021 Wang et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Primers used in this study. Download Table S1, XLSX file, 0.02 MB (25.6KB, xlsx) .
Copyright © 2021 Wang et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
(ii) WWPSRV-2. Four contigs (AC_Contig_3, LD_Contig_2, and TE_Contig_2 and 3) (Text S1) were assembled into a big virus-like contig (4,899 bp), which had a high similarity with Diabrotica virgifera virgifera virus 1 (DvvV-1) polyprotein (coverage range from amino acid [aa] 1,440 to aa 2,995) (Table S3 and Fig. S1B). Moreover, AC_Contig_2 (3,308 bp) (Text S1) was also aligned to DvvV-1 polyprotein (coverage range from aa 210 to aa 1,002) (Table S3). By sequencing these contigs, the five contigs were verified to belong to the same virus, named WWPSRV-2 (Fig. 1B). The complete genome of WWPSRV-2 was 9,409 nt in length, excluding poly-A. G+C pairs comprised 40.95% of the nucleotides. Only one large ORF was predicted in WWPSRV-2 genome, located at nt 34 to 9,228 in 5′ to 3′ direction. The 5′ and 3′ untranslated regions (UTR) of WWPSRV-2 genome were 33 and 181 nt in length, respectively (Fig. 2A and Table 2). The fluctuating distributions of the three read maps were similar, with the higher coverage area near 3′ ends of the viral genome. However, a small number RNA-seq reads of L. distinguendus could map to the WWPSRV-2 genome (Fig. 1A). Of note, PCR detection results revealed that this virus could exist in the adults of S. oryzae (Fig. 1B and Fig. S2).
PCR detection of the six viruses in the Sitophilus oryzae adults. In lanes 1 to 7 were the detection results from using specific primers from WWPSRV-1, WWPSRV-2, AcPSRV-1, AcNSRV-1, AcNSRV-2, LdNSRV-1, and actin (weevil), respectively. The primers used to detect viral genome sequence are shown in Table S1. Download FIG S2, TIF file, 0.3 MB (324.8KB, tif) .
Copyright © 2021 Wang et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
(iii) AcPSRV-1. AC_Contig_4 (Text S1) was 7,558 bp in length, including poly-A, and contained regions similar with viral RNA-dependent RNA polymerase (RdRp) of Dicistroviridae sp. (Table S3). It contained two large ORFs in the 5′ to 3′ direction, located at nt positions 183 to 4,343 and 4,660 to 7,302, respectively. The contig was provisionally named AcPSRV-1, and G+C pairs comprised 34.10% of the nucleotides. The 5′UTR, intergenic region (IGR), and 3′UTR of AcPSRV-1 genome were 182, 242, and 247 nt in length, respectively (Fig. 2A and Table 2). However, this contig failed to be confirmed by PCR. Only the RNA-seq reads of A. calandrae could map to AcPSRV-1 genome. The higher coverage area also appeared close to 3′ ends of the viral genome (Fig. 2A).
AcNSRV-1, AcNSRV-2, and LdNSRV-1 were related to negative-strand RNA virus.
(i) AcNSRV-1. Four contigs related to negative-strand RNA virus were discovered in A. calandrae. The two long contigs (AC_Contig_5, 12,699 bp; AC_Contig_6, 12,118 bp) (Text S1), which both had high identities with Linepithema humile rhabdo-like virus 1, were provisionally designated AcNSRV-1 and AcNSRV-2, respectively (Table S3). By sequencing, the confirmed complete genome of AcNSRV-1 was determined to be 12,702 nucleotides in length (Fig. 1C and 2B). G+C pairs comprised 37.85% of the nucleotides. The genome contained five large ORFs in the 3′ to 5′ direction, located at nt positions 209 to 1,498, 1,584 to 2,957, 3,081 to 3,479, 3,920 to 5,536, and 6,089 to 12,265, respectively (Table 3 and Fig. 2B). The fluctuating distribution of RNA-seq reads of A. calandrae exhibited that reads in the five ORF regions were much more numerous than those in the nearby untranslated region. The position of the peak corresponded exactly to the position of each ORF (Fig. 2B). The leader and trailer regions of the AcNSRV-1 genome were 208 and 437 nt in length, respectively. Their terminal nucleotides were complementary (Table 3 and Fig. 2B) and could form a putative panhandle structure which may be involved in genome replication (17).
TABLE 3.
Features of the ORFs encoded by (−)ssRNA viruses discovered in the transcriptomes of weevil waspsa
| Virus | ORF | Protein | ORF genomic location (nt) | Length (nt | aa) | Protein mass (kDa) | Isoelectric point | Signal peptide | No. of glycosylation sites |
No. of phosphorylation sites |
Top BLASTX match | E value | Identity | Accession number | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| O-linked | N-linked | Ser | Thr | Tyr | ||||||||||||
| AcNSRV-1 | 1 | N | 209 to 1,498 | 1,290 | 429 | 48.27 | 5.75 | − | 8 | 0 | 20 | 10 | 5 | Putative capsid (Linepithema humile rhabdo-like virus 1) | 6E−40 | 27.29% | AXA52563.1 |
| 2 | P | 1,584 to 2,957 | 1,374 | 457 | 50.98 | 5.90 | − | 20 | 0 | 24 | 19 | 4 | ND | ||||
| 3 | M | 3,081 to 3,479 | 399 | 132 | 14.87 | 6.89 | − | 1 | 0 | 8 | 4 | 3 | ND | ||||
| 4 | G | 3,920 to 5,536 | 1,617 | 538 | 60.24 | 6.71 | + | 1 | 2 | 32 | 14 | 6 | Putative glycoprotein (Hubei rhabdo-like virus 3) | 2E−19 | 25.50% | YP_009336887.1 | |
| 5 | L | 6,089 to 12,265 | 6,177 | 2,058 | 236.10 | 7.78 | − | 16 | 0 | 104 | 67 | 32 | Putative RdRp-complex (Linepithema humile rhabdo-like virus 1) | 0 | 40.27% | AXA52562.1 | |
| AcNSRV-2 | 1 | N | 29 to 1,330 | 1,302 | 433 | 47.77 | 6.43 | − | 0 | 0 | 13 | 11 | 6 | Putative capsid (Linepithema humile rhabdo-like virus 1) | 7.00E−53 | 30.75% | AXA52563.1 |
| 2 | P | 1,518 to 2,624 | 1,107 | 368 | 40.35 | 5.32 | − | 21 | 0 | 30 | 17 | 5 | ND | ||||
| 3 | M | 2,803 to 3,183 | 381 | 126 | 14.23 | 7.75 | − | 0 | 0 | 6 | 9 | 5 | ND | ||||
| 4 | G | 3,453 to 5,075 | 1,623 | 540 | 60.08 | 8.36 | + | 7 | 5 | 33 | 19 | 9 | Glycoprotein (Blattodean arli-related virus OKIAV101) | 3.00E−25 | 25.12% | QMP82174.1 | |
| 5 | P6 | 5,214 to 5,429 | 216 | 71 | 8.23 | 7.76 | − | 0 | 0 | 2 | 1 | 0 | ND | ||||
| 6 | L | 5,572 to 11,886 | 6,315 | 2,104 | 239.37 | 6.84 | − | 14 | 0 | 91 | 65 | 38 | Putative RdRp-complex (Linepithema humile rhabdo-like virus 1) | 0 | 40.19% | AXA52562.1 | |
| LdNSRV-1 | 1 | N | 44 to 1,408 | 1,365 | 454 | 49.41 | 6.13 | − | 5 | 0 | 18 | 20 | 4 | Nucleocapsid protein (hymenopteran rhabdo-related virus OKIAV109) | 3.00E−99 | 39.81% | QMP82140.1 |
| 2 | P | 1,499 to 2,587 | 1,089 | 362 | 39.51 | 6.98 | − | 21 | 0 | 25 | 12 | 3 | ND | ||||
| 3 | M | 2,611 to 3,342 | 732 | 243 | 27.11 | 8.94 | − | 3 | 0 | 13 | 6 | 2 | Hypothetical protein (hymenopteran rhabdo-related virus OKIAV109) | 2.00E−18 | 33.51% | QMP82142.1 | |
| 4 | G | 3,542 to 5,080 | 1,539 | 512 | 57.70 | 7.66 | + | 2 | 2 | 32 | 21 | 2 | Glycoprotein (hymenopteran rhabdo-related virus OKIAV109) | 3.00E−48 | 28.07% | QMP82143.1 | |
| 5 | L | 5,169 to 11,519 | 6,351 | 2,116 | 238.53 | 8.42 | − | 28 | 0 | 133 | 61 | 23 | RdRp (hymenopteran rhabdo-related virus OKIAV109) | 0 | 48.73% | QMP82144.1 | |
+, exists; −, does not exist; ND, no significant homology was detected.
(ii) AcNSRV-2. The complete genome of AcNSRV-2 failed to be confirmed by PCR. The known length was 12,118 nucleotides and G+C pairs comprised 37.85% of the nucleotides. The genome contained six ORFs in the 3′ to 5′ direction, located at nt positions 29 to 1,330, 1,518 to 2,624, 2,803 to 3,183, 3,453 to 5,075, 5,214 to 5,429, and 5,572 to 11,886, respectively (Table 3 and Fig. 2B). The six ORFs were also confirmed by the RNA-seq reads fluctuating distribution (Fig. 2B). The leader and trailer regions of the AcNSRV-2 genome were 28 and 232 nt in length, respectively. Their terminal nucleotides were not complementary.
(iii) LdNSRV-1. LD_Contig_3 (10,832 bp) in L. distinguendus, designated LdNSRV-1, showed a certain sequence similarity with a rhabdovirus, Wuhan ant virus. The complete genome of LdNSRV-1 verified by sequencing was 11,633 nucleotides in length with G+C pairs comprising 45.64% of the nucleotides (Fig. 1D and 2B). It contained five large ORFs in the 3′ to 5′ direction, located at nt positions 44 to 1,408, 1,499 to 2,587, 2,611 to 3,342, 3,542 to 5,080, and 5,169 to 11,519, respectively (Table 3 and Fig. 2B). The fluctuating distribution of RNA-seq reads of L. distinguendus also indicated the existence of the five ORFs (Fig. 2B). The leader and trailer regions of the LdNSRV-1 genome were 43 and 114 nt in length, respectively. Their terminal nucleotides were not complementary.
In T. elegans, the contigs related to negative-strand RNA virus were too small to get the complete virus genome (Table S3).
WWPSRV-1 represented a novel virus related to Nora virus.
General properties of all ORFs in WWPSRV-1 genome were listed in Table 2. The predicted translation products of WWPSRV-1 ORF1 to 4 were found to have amino acid sequences similar to those of the Nasonia vitripennis virus RdRp and ORF3, 4, and 5, respectively. ORF1 protein was predicted to be a large transmembrane polyprotein (2,578 aa and isoelectric point [pI] of 8.05) with three noncytoplasmic domains, two cytoplasmic domains, and four transmembrane regions (Fig. S3A). A specified putative helicase domain (aa 824 to 1,005), a trypsin-like serine protease domain (aa 1,773 to 1,995), and a RdRp domain (aa 2,157 to 2,530) were successively existent in the noncytoplasmic domain of ORF1 protein (Fig. 2A and Fig. S4).
Graphical representation of the polyproteins encoded by the (+)ssRNA viruses identified in the wasps’ transcriptomes. (A) The polyprotein encoded by WWPSRV-1 ORF1. (B) The polyprotein encoded by WWPSRV-2 ORF. (C) The polyprotein encoded by AcPSRV-1 ORF1. Numbers represent the amino acid positions from the N terminus to C terminus. Download FIG S3, TIF file, 0.2 MB (217.6KB, tif) .
Copyright © 2021 Wang et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Structure of helicase, protease, and RdRp domains of WWPSRV-1, WWPSRV-2, and AcPSRV-1. (A) Partial amino acids containing helicase motifs A to C are shown with their secondary structures (red cylinders, α-helices; green arrows, β-strands). Sequence logos for helicase motifs A to C of 12 positive-strand RNA viruses belonging to the families Dicistroviridae (ABPV, CrPV, TrV, and BQCV), Iflaviridae (DWV, SBV, SBPV, and ApIV), and unclassified Nora virus-related viruses (HaNV, AiNV, ANV, and DiNV) are shown with the corresponding sequences of WWPSRV-1, WWPSRV-2, and AcPSRV-1. (B) The three-dimensional structural models of the helicase domains of these three viruses predicted by the Phyre2 web. The positions of motifs A to C are marked. Motif A, magenta; motif B, red; motif C, blue. Confidence in the models of WWPSRV-1, WWPSRV-2, and AcPSRV-1: 172 residues (95%) modeled at >90% accuracy, 166 residues (97%) modeled at >90% accuracy, and 178 residues (98%) modeled at >90% accuracy, respectively. (C) Multiple sequence alignment of the putative protease domains of these 3 viruses and 12 other picorna-like viruses. Identical residues are shaded in dark blue and similar residues are in light blue. (D) Partial amino acid containing RdRp motifs A to H are shown with their secondary structures (red cylinders, α-helices; green arrows, β-strands). Sequence logos for RdRp motifs A to H of 12 positive-strand RNA viruses are shown with the corresponding sequences of these 3 viruses. (E) The three-dimensional structural models of the RdRp domains of these three viruses predicted by the Phyre2 web. The positions of motifs A to H are marked. Motif A, magenta; motif B, orange; motif C, yellow; motif D, cyan; motif E, wheat; motif F, blue; motif G, red; motif H, white. Confidence in the models of WWPSRV-1, WWPSRV-2, and AcPSRV-1: 374 residues (100%) modeled at >90% accuracy, 442 residues (100%) modeled at >90% accuracy and 388 residues (100%) modeled at >90% accuracy, respectively. Full names and references of these viruses are shown in Table S2. Download FIG S4, TIF file, 1.4 MB (1.4MB, tif) .
Copyright © 2021 Wang et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
ORF2 protein (244 aa and pI = 4.79) contained three coils, which was similar with tropomyosin, according to InterProScan. VIRALpro suggested that it was capsid protein (distance = 0.4141) and had the tail sequence (distance = 0.1172). ORF3 (526 aa and pI = 9.04) and ORF4 (355 aa and pI = 4.48) proteins were also suggested to encode capsid proteins (distance = 1.0846 and 0.8483, respectively).
To determine the relationship of WWPSRV-1 with other picorna-like viruses, we conducted a maximum likelihood phylogenetic analysis based on the amino acid core sequences of RdRp. Notably, WWPSRV-1 clustered as a distinct lineage in Nora virus-related clade, indicating the need for a novel virus genus or family (Fig. 3).
FIG 3.
Phylogenetic analysis of positive-strand RNA viruses discovered in the parasitoid wasps’ transcriptomes. Maximum likelihood phylogenetic tree was constructed based on the deduced RdRp domain. Viruses are from the order Picornavirales and the GenBank accession numbers are shown in Table S2. Purple circles on the branches of each tree relate to 50 to 100% bootstrap support calculated from 1,000 replicates.
Viral sequences used for phylogenetic analysis, sequence alignments, and protein prediction in this study. Download Table S2, XLSX file, 0.01 MB (15.1KB, xlsx) .
Copyright © 2021 Wang et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
WWPSRV-2 was a novel iflavirus.
Only a big ORF was predicted in WWPSRV-2 genome, which encoded a large polyprotein. The putative domains in the polyprotein were arranged successively in the order CP1, CP2, CP3, helicase, protease, and RdRp, corresponding to that in iflaviruses (Fig. 2A).
The polyprotein (3,064 aa and pI = 6.19) was a large transmembrane polyprotein that consisted of four noncytoplasmic domains, three cytoplasmic domains, and six transmembrane regions. All three predicted picornavirus-like capsid protein domains appeared in the first noncytoplasmic domain. A typical helicase domain (aa 1,440 to 1,611) was present in the third noncytoplasmic domain. Moreover, a relatively small motif (aa 1,799 to 1,842) predicted as a zinc finger domain was also in that noncytoplasmic domain. The putative protease (aa 2,212 to 2,417) and RdRp (aa 2,490 to 2,931) domains were in the last noncytoplasmic domain (Table 2 and Fig. S3B and S4).
To determine the phylogenetic relationship of WWPSRV-2 in other iflaviruses, phylogenetic analysis was performed using the highly conserved RdRp region. WWPSRV-2 could form a clade with the known members of iflaviruses, such as sacbrood virus (SBV) and Lygus lineolaris virus 1 (LyLV-1) (Fig. 3).
AcPSRV-1 was a novel dicistrovirus.
Two big ORFs were present in the AcPSRV-1 genome, which encoded nonstructural and structural proteins, corresponding to that in dicistroviruses. The nonstructural protein encoded by ORF1 was a large transmembrane polyprotein (1,386 aa and pI = 8.04) that consisted of two noncytoplasmic domains, one cytoplasmic domain, and two transmembrane regions. The helicase (aa 67 to 248), protease (aa 595 to 831), and RdRp (aa 963 to 1,350) domains were in the different two noncytoplasmic domains (Fig. S3C and S4). The structural protein (880 aa and pI = 6.59) encoded by ORF2 was a polyprotein in which three picornavirus-like capsid protein domains exist (Fig. 2A and Table 2).
Phylogenetic analysis of the highly conserved RdRp region was performed to confirm the relationship of AcPSRV-1 in other dicistroviruses. AcPSRV-1 could be clustered with Himetobi P virus (HiPV), black queen cell virus (BQCV), and Triatoma virus (TrV), which belong to genus Triatovirus (Fig. 3).
Gene junction analysis of the three negative-strand RNA viruses.
By comparing the 5′ and 3′ untranslated regions and intergenic regions of the AcNSRV-1, AcNSRV-2, and LdNSRV-1, we pinpointed for each ORF the putative termination signals (TS), intergenic spacer (IS), and transcription initiation (TI). A highly conserved motif was identified in the 5′ and 3′ untranslated regions and intergenic regions of the AcNSRV-1, AcNSRV-2, or LdNSRV-1, respectively.
In AcNSRV-1 genome, the conserved transcription initiation motif of 3′-UUUCA(C/U/G)-5′ was identified upstream of each putative ORF. The transcription termination motif of 3′-AUA(A/U)AUCUUUUU-5′ was detected downstream of every ORF. The conserved intergenic spacer 3′-C(.)A-5′ was found in each intergenic region (Fig. 4A).
FIG 4.
Comparison of putative gene junction regions between ORFs in AcNSRV-1 (A), AcNSRV-2 (B), and LdNSRV-1 (C). Alignments of the putative gene junction sequences are shown in a 3′ to 5′ orientation in the negative strand. TS, termination signals; IS, intergenic sequence; TI, transcription initiation. The consensus motif is shown below the sequences for each viral ORF. Numbers at the beginning of the sequences represent the nucleotide positions from in a 3′ to 5′ orientation.
In AcNSRV-2 genome, the conserved transcription initiation motif of 3′-UUCA(G/C)-5′ was identified upstream of each putative ORF, and the transcription termination motif of 3′-UAAAUCUUUUU(U)-5′ was detected downstream of every ORF. The conserved intergenic spacer 3′-GAG-5′ was found in the intergenic region among N/P, P/M, and M/L (Fig. 4B).
In LdNSRV-1 genome, each ORF was separated by a highly conserved gene junction with the consensus 3′-UAAU(U)CUUUUU(U)(G/A)GUUCC(U/G)U-5′. The conserved transcription initiation motif of 3′-UUCC(U/G)U-5′ was identified upstream of each putative ORF. The transcription termination motif of 3′-UA(A)U(U)(C/A)UUUUU(U)(U)-5′ was detected downstream of every ORF. The conserved intergenic spacer 3′-(G/A)(G)-5′ was found in each intergenic region (Fig. 4C).
AcNSRV-1 and AcNSRV-2 were novel viruses belonging to family Lispiviridae.
The general properties of the tentative proteins encoded by the AcNSRV-1 and AcNSRV-2 genome were shown in Table 3. Both the ORF1 proteins encoded by the two viruses had close to 30% identities with the putative capsid of Linepithema humile rhabdo-like virus 1. Through the VIRALpro analysis, the ORF1 proteins were suggested to have the capsid sequences (distance was 0.5556 in AcNSRV-1 and distance was 1.2285 in AcNSRV-2). Accordingly, the ORF1 proteins of the two viruses were supposed to be the putative nucleocapsid proteins (N), which had similarity in length (1,290 aa and 1,302 aa, respectively) and isoelectric point (5.75 and 6.43, respectively).
Only ORF4 proteins encoded by the two viruses were predicted to be transmembrane polyproteins and to possess a cleavable signal peptide. Compared to the three transmembrane regions present in the AcNSRV-1 ORF4 protein, only one was in the AcNSRV-2 ORF4 protein (Fig. S5A and B). The AcNSRV-1 ORF4 protein had 25.5% identity with the putative glycoprotein of Hubei rhabdo-like virus 3. In addition, it was classified to the fusion glycoprotein F0, Paramyxoviridae (E value = 1.00E−08), within the InterProScan. The AcNSRV-2 ORF4 protein showed 25.12% identity with the putative glycoprotein of Blattodean arli-related virus OKIAV101. Moreover, only the ORF4 proteins of the two viruses were recommended to have the potential N-linked sites. All of these results indicated that ORF4 encoded the viral glycoprotein (G).
Graphical representation of the G proteins encoded by the (−)ssRNA viruses identified in the wasps’ transcriptomes. (A) The polyprotein encoded by AcNSRV-1 ORF4. (B) The polyprotein encoded by AcNSRV-1 ORF4. (C) The polyprotein encoded by LdNSRV-1 ORF4. Numbers represent the amino acid positions from the N terminus to C terminus. Download FIG S5, TIF file, 0.2 MB (185.6KB, tif) .
Copyright © 2021 Wang et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
The AcNSRV-1 ORF5 and the AcNSRV-1 ORF6 were supposed to encode the viral large polymerase proteins (L). Both L proteins exhibited more than 40% similarity with the putative RdRp-complex of Linepithema humile rhabdo-like virus 1. Through the InterProScan, the putative RdRp, mRNA-capping enzyme (Cap, or named polyribonucleotidyltransferase, PRNTase), and methyltransferase (MT) domains were successively arranged in the L protein (Fig. 2B and Fig. S6).
Structure of the RdRp, PRNTase, and methyltransferase domains of AcNSRV-1, AcNSRV-2, and LdNSRV-1. (A) Partial amino acid containing RdRp motifs A to F and GHP (Gly-His-Pro) are shown with their secondary structures (red cylinders, α-helices; green arrows, β-strands). Sequence logos for RdRp motifs A to G and GHP of 12 negative-strand RNA viruses belonging to the families Lispiviridae (HbRLV-3, TcTV-6, LHRLV-1, BerV, IARV-OKIAV103, and BARV-OKIAV102) and Rhabdoviridae (WHFV-2, SyFV-3, LNV-2, HyRRV-OKIAV38, SFRV, and LeRRV-OKIAV34) are shown with the corresponding sequences of AcNSRV-1, AcNSRV-2, and LdNSRV-1. (B) The three-dimensional structural models of the RdRp domains of these three viruses predicted by the Phyre2 web. The positions of motifs A to F and GHP are marked. Motif GHP, red; motif A, yellow; motif B, magenta; motif C, cyan; motif D, orange; motif E, white; motif F, blue. Confidence in the models of AcNSRV-1, AcNSRV-2, and LdNSRV-1: 828 residues (100%) modeled at >90% accuracy, 851 residues (99%) modeled at >90% accuracy, and 877 residues (100%) modeled at >90% accuracy, respectively. (C) Partial amino acids containing PRNTase motifs A to E are shown with their secondary structures (red cylinders, α-helices; green arrows, β-strands). Sequence logos for PRNTase motifs A to E of 12 negative-strand RNA viruses are shown with the corresponding sequences of these three viruses. (D) The three-dimensional structural models of PRNTase domains of these three viruses predicted by the Phyre2 web. The positions of motifs A to E are marked with orange. Confidence in the models of AcNSRV-1, AcNSRV-2, and LdNSRV-1: 235 residues (100%) modeled at >90% accuracy, 235 residues (100%) modeled at >90% accuracy, and 237 residues (100%) modeled at >90% accuracy, respectively. (E) Partial amino acid containing methyltransferase motifs (I to IV and VI to X) are shown with their secondary structures (red cylinders, α-helices; green arrows, β-strands). Sequence logos for methyltransferase motifs of 12 negative-strand RNA viruses are shown with the corresponding sequences of these three viruses. (F) The three-dimensional structural models of the methyltransferase domains of these three viruses predicted by the Phyre2 web. KDKE observed in motif X, motif IV, motif VI, and motif VIII, are marked with red, magenta, cyan, and orange, respectively. Motif I is marked with blue. Confidence in the models of AcNSRV-1, AcNSRV-2, and LdNSRV-1: 195 residues (100%) modeled at >90% accuracy, 197 residues (100%) modeled at >90% accuracy, and 194 residues (100%) modeled at >90% accuracy, respectively. Full names and references of these viruses are shown in Table S2. Download FIG S6, TIF file, 2.2 MB (2.2MB, tif) .
Copyright © 2021 Wang et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Based on the conserved genomic orientation of mononegaviruses (3′-N-P-M-G-L-5′), ORF2 protein of AcNSRV-1 (pI = 5.90) or AcNSRV-2 (pI = 5.32), containing 47 or 52 potential phosphorylation sites, was most likely the phosphoprotein (P). ORF3 protein of AcNSRV-1 (pI = 6.89) or AcNSRV-2 (pI = 7.75), containing 15 or 20 phosphorylation sites, may be a phosphorylated matrix protein (M). In addition, the AcNSRV-2 had an additional small protein P6 (71 aa and pI = 7.76) lying between the G and L proteins. This was also confirmed by the RNA-seq reads’ fluctuating distribution (Fig. 2B).
Phylogenetic analysis of the highly conserved region of L protein was performed to confirm the relationship of AcNSRV-1 and AcNSRV-2 in other lispiviruses. AcNSRV-1 and AcNSRV-2 could be clustered with Berant virus and Linepithema humile rhabdo-like virus 1, which belong to the family Lispiviridae (Fig. 5).
FIG 5.
Phylogenetic analysis of negative-strand RNA viruses discovered in the parasitoid wasps’ transcriptomes. Maximum likelihood phylogenetic tree was constructed based on core motif in L protein. Viruses are from the order Mononegavirales and the GenBank accession numbers are shown in Table S2. Purple circles on the branches of each tree relate to 50 to 100% bootstrap support calculated from 1,000 replicates.
LdNSRV-1 was a novel rhabdovirus.
The general properties of the tentative proteins encoded by the LdNSRV-1 genome were shown in Table 3. The LdNSRV-1 ORF1 protein had a close-to-39.81% identity with nucleocapsid protein of hymenopteran rhabdo-related virus OKIAV109 (HyRRV-OKIAV109). It was suggested to have the capsid sequence (distance = 0.3497) with VIRALpro. Accordingly, the ORF1 protein was supposed to be the N protein. ORF4 protein was predicted to be a transmembrane polyprotein which only contained one transmembrane region and to possess a cleavable signal peptide (Fig. S5C). It has 28.07% identity with HyRRV-OKIAV109 glycoprotein. Moreover, ORF4 protein was recommended to have two potential N-linked sites. All of these indicated that ORF4 encoded the viral G protein. The LdNSRV-1 ORF5 was supposed to encode the viral L protein which also contained the putative RdRp, PRNTase, and MT domains (Fig. 2B and Fig. S6). It exhibited more than 40% similarity with HyRRV-OKIAV109 RdRp-complex. The LdNSRV-1 ORF2 protein (pI = 6.98), containing 40 putative phosphorylation sites, was most likely a P protein. ORF3 protein (pI = 8.94), containing 21 putative phosphorylation sites, may be an M protein.
Phylogenetic analysis of the highly conserved region of L protein was performed to confirm the relationship of LdNSRV-1 in other rhabdoviruses. LdNSRV-1 could be clustered with Lasius neglectus virus 2 and hymenopteran rhabdo-related virus OKIAV38, which belong to the genus Alphahymrhavirus (Fig. 5).
DISCUSSION
More and more RNA viruses have been discovered with the increasing transcriptome data of insects. Also, whole-transcriptome analysis has become a very important method in virus identification and surveillance in insects relevant to economics and public health. For example, 42 putative novel picornaviruses are discovered in varroa-free Australian Apis mellifera (18), three novel RNA viruses are identified in Aedes vexans nipponii collected at a densely populated district of Seoul, and many insect-specific viruses, belonging to Baculoviridae, Nimaviridae, and Iflaviridae, are found in Culex quinquefasciatus and Culex tritaeniorhynchus gathered in China and Kenya (11, 19). In this study, we identified six novel RNA viruses from three species of hymenopteran Pteromalidae parasitoids of the rice weevil S. oryzae. We characterized organization of viral genomes, their phylogenetic locations, and structures of viral nonstructural protein. These results indicate that the positive-strand and negative-strand RNA viruses could be simultaneously present in these three wasps.
Picornaviruses identified here are WWPSRV-1, WWPSRV-2, and AcPSRV-1. The organization of these viral genomes varies greatly. WWPSRV-1 possesses four ORFs in the genome, whereas WWPSRV-2 and AcPSRV-1 possess monocistronic and dicistronic genomes, respectively. In addition, these three viruses have the similar fluctuating distribution of sequencing reads, with the higher coverage area near 3′ ends of the viral genome (Fig. 2A). The 3′ bias of the viral genome coverage is likely to be attributed to the expression of subgenomic RNAs (sgRNAs) (20). However, it also cannot be ignored that RNA degradation or shearing during sample handling and RNA extraction may also lead to a controllable 3′ bias in short-read RNA-seq data (21). The analysis of PCR results and transcriptome data (Fig. 1A and B and Fig. 2A) confirms that WWPSRV-1 and WWPSRV-2 can be detected in these three wasps. The observations demonstrate that these two viruses may have the possibility of interspecies virus transmission. Just like Drosophila C virus (DCV), they can infect a much broader range of Drosophila species through horizontal transmission (22). Significantly, WWPSRV-2, which belong to family Iflaviridae of which some viruses can result in lethal infections in silkworms (23) and honeybees (24), can also be detected in the wasp host rice weevil. Moreover, it has been reported that an iflavirus Dinocampus coccinellae paralysis virus, found in Dinocampus coccinellae wasps which parasitize the lady beetle Coleomegilla maculata, can replicate in lady beetle cerebral ganglia and thus induce changes in lady beetle behavior such as tremors, gait disturbance, and limitations in movement (25). Accordingly, WWPSRV-2 may also play a role in the virus-wasp-weevil interaction. WWPSRV-1 shows high sequence similarity with Nasonia vitripennis virus, for which the incomplete genome was identified from cDNA libraries of parasitoid wasps Nasonia vitripennis (26). In the phylogenetic tree, WWPSRV-1 could cluster as a distinct lineage in Nora virus-related clade. In this clade, the most studied virus is Nora virus, which is a persistent nonpathogenic virus that infects various Drosophila species (27, 28). However, Nora virus-infected D. melanogaster exhibits an increase in immune-related gene expression over time (29). AcPSRV-1 has a dicistronic genome with gene organization similar to that of dicistroviruses. Phylogenetic analyses of the RdRp domain revealed that, clustering with Himetobi P virus (detected in the small brown planthopper Laodelphax striatellus) as closest relative, AcPSRV-1 belongs to genus Triatovirus (30). In this genus, black queen cell virus, which also infects honeybees (hymenopteran insect), causes broad health problems with risk of colony sustainability (31).
Mononegaviruses identified here are AcNSRV-1, AcNSRV-2, and LdNSRV-1. AcNSRV-1 and AcNSRV-2 belong to family Lispiviridae of the order Mononegavirales, whereas LdNSRV-1 belongs to a big family Rhabdoviridae. Sixteen viral genome sequences which belong to family Lispiviridae (3, found in hymenopteran orussidae, cynipidae, or aphelinidae insects, respectively) and Rhabdoviridae (13, found in hymenopteran eulophidae, sphecidae, chrysididae, pompilidae, sapygidae, or mymaridae insects, respectively) have been identified in the hymenopteran insects (32–34). The impact of these viruses on the host wasps has rarely been determined. However, some insects, such as flies, mosquitos, and aphids, when infected by the rhabdoviral sigmaviruses can become paralyzed or die after exposure to high concentrations of CO2, whereas uninfected hosts can recover (27, 35, 36).
The arrangement of the genomes of AcNSRV-1 and LdNSRV-1 follows the typical basic five-ORF pattern (3′-N-P-M-G-L-5′) of mononegaviral genomes (37). The mRNA of each gene can be synthesized directly from the viral genome (38). The high-coverage area of sequencing reads of AcNSRV-1 and LdNSRV-1 reveals that the N, P, M, and G may have a high expression level. Moreover, AcNSRV-2 contains an additional P6 protein between G and L proteins (Fig. 2B). The small additional proteins interposed between the typical basic protein were often identified in the plant rhabdoviruses (39). Some of these proteins’ functions have been made clear, e.g., P3 acts as a movement protein in rice transitory yellowing virus (RTYV) and rice yellow stunt rhabdovirus (RYSV) and P6 acts as a systemic RNA silencing suppressor in RYSV (40–42). In addition, a highly conserved motif in the intergenic junction which could be grouped into TS, IS, and TI is predicted to be present in all three viruses. Moreover, the 3′ and 5′ end sequences of AcNSRV-1 are complementary and can form a putative panhandle structure thought to be involved in genome replication (17).
In conclusion, we have expanded the diversity of RNA viruses in the parasitoid wasps, and these RNA viruses may play an important role in the biocontrol of rice weevils. For better application of this information, much work should be accomplished. Just like PpNSRV-1 (12), it also should be confirmed whether these viruses can affect the parasitism rate and the life span of parasitoid wasps. Moreover, the virus transmission among wasps and their host weevil should also be investigated.
MATERIALS AND METHODS
Insect rearing.
Rice weevils (S. oryzae) were initially collected from a cereal storehouse in Hangzhou, China in 2017. They were successively reared on wheat seeds in the laboratory at 30 ± 1°C, 80% relative humidity, and 12-h light/12-h dark photoperiod. The mated adult beetles were provided with the fresh wheat seeds to lay eggs for a week (the seeds were replaced weekly). After 10 to 15 days, eggs developed into elder larvae used as hosts for the wasps. The colonies of A. calandrae, L. distinguendus, and T. elegans were initially collected from the parasitized rice weevil hosts of the same sites. The adults of the three wasp species were independently reared on 10% (vol/vol) honey at 25 ± 1°C, 50% relative humidity, 12-h light/12-h dark photoperiod. For parasitism, the mated adults of the three wasp species were placed together with the wheat seeds that contained the host larvae. These parasitized beetle larvae were reared in the same conditions as were the wasp adults. The adults of A. calandrae or L. distinguendus emerged after 15 days. The adults of T. elegans emerged after 20 days. Each wasp colony was independently reared in the laboratory at least three generations before the experiments.
Next-generation sequencing, sequence assembly, and virus-related sequence discovery.
Total RNA from 20 female adults of A. calandrae, L. distinguendus, or T. elegans was extracted using TRIzol (Invitrogen, CA, USA). The RNA quality was assessed using NanoDrop 2000 (ThermoFisher Scientific, Waltham, MA) and an Agilent 2100 bioanalyzer (ThermoFisher Scientific, CA, USA). RNA concentration was measured using an Invitrogen Qubit 2.0 Fluorometer (ThermoFisher Scientific, CA, USA). RNA samples (RNA integrity number > 8) were sent to Novogene company (Beijing, China) for the subsequent library construction. Sequencing libraries were generated using NEBNext Ultra RNA Library Prep kit for Illumina (NEB, USA) following the manufacturer’s recommendations and index codes were added to attribute sequences to each sample. The clustering of the index-coded samples was performed on a cBot Cluster Generation System using TruSeq PE cluster kit v3-cBot-HS (Illumia) according to the manufacturer’s instructions. After cluster generation, the library preparations were sequenced on an Illumina Hiseq platform and paired-end reads were generated. The low-quality, adaptor-polluted, and high content of unknown base (N) reads were removed. The clean reads were performed with de novo assembly using Trinity program (43). After assembly, the remaining contigs were annotated with BLASTX (E values ≤ 1e−5) in the NCBI nonredundant protein sequence databases to identify the virus-related sequences.
Viral genome sequencing and detection.
Primers for viral genome sequencing and detection were designed based on the virus genome-like contigs (Table S1). Total RNA from adult female wasps or adult weevils was extracted using TRIzol. Single-stranded cDNA was synthesized from the RNA using the TransScript one-step gDNA removal and cDNA synthesis SuperMix kit (TransGen Biotech, Beijing, China). cDNA was used as a template for PCR. The terminal sequences of the viral genome were confirmed by 5′ or 3′ RACE according to the instructions of the SMART RACE cDNA amplification kit (Clontech, CA, USA). All amplified PCR products were cloned into pGEM-T Easy vectors (Promega, Beijing Branch, China) and sequenced.
Bioinformatic and phylogenetic analysis.
Nucleotide sequence analysis and assembly were performed using DNAStar Lasergene software version 7.0 (Madison, WI, USA). Viral ORFs were predicted using open-source NCBI ORF finder (https://www.ncbi.nlm.nih.gov/orffinder/). Molecular weight and isoelectric point (pI) of the predicted ORFs were calculated via open-source ProtParam (https://web.expasy.org/protparam/). The viral proteins were queried through InterPro (http://www.ebi.ac.uk/interpro/) and Phyre2 for protein modeling, prediction, and analysis (44). ViralPro was used to determine whether the viral proteins were possibly capsid or tail proteins (45). Potential phosphorylation sites were determined by open-source NetPhos 3.1 (46). Glycosylation sites were determined by NetOGlyc 4.0 (47) and NetNGlyc 1.0 (https://services.healthtech.dtu.dk/service.php?NetNGlyc-1.0).
To identify putative conserved transcription termination and initiation sequences of (−)ssRNA viruses (negative-sense single-stranded RNA), noncoding viral genome regions were analyzed using the open-source MEME suite 5.3.0 for the motifs analysis (48).
To identify the viral genomic RNA distribution in these three wasps’ transcriptomes, clean RNA-seq reads from all three wasps were mapped to the complete genome of the six novel RNA viruses using the CLC Genomics Workbench 12.
Multiple alignments of amino acid sequences were performed using Clustal Omega webserver (49) and edited by open-source Jalview software version 2.11.1.3 (50). Amino acid sequence logos for the conserved motifs were generated using the WebLogo program (51). Phylogenetically informative sites were selected from the MUSCLE alignment result by using Gblocks in PhyloSuite v1.2.2 (49, 52). The phylogenetic tree was constructed using IQ-TREE in PhyloSuite with the maximum likelihood method of 1,000-fold bootstrap (52). Virus abbreviations and accession numbers are listed in Table S2.
Data availability.
The GenBank accession numbers of WWPSRV-1, WWPSRV-2, AcPSRV-1, AcNSRV-1, AcNSRV-2, and LdNSRV-1 are MW864600, MW864601, MW864599, MW864602, MW864603, and MW864604, respectively. BioProject accession for the RNA-seq of three parasitoid wasps of the rice weevil Sitophilus oryzae is PRJNA719021.
ACKNOWLEDGMENTS
This work was supported by Key Program of National Natural Science Foundation of China (31830074) and Major International (Regional) Joint Research Project of NSFC (31620103915).
REFERENCES
- 1.Shi M, Lin XD, Vasilakis N, Tian JH, Li CX, Chen LJ, Eastwood G, Diao XN, Chen MH, Chen X, Qin XC, Widen SG, Wood TG, Tesh RB, Xu J, Holmes EC, Zhang YZ. 2016. Divergent viruses discovered in arthropods and vertebrates revise the evolutionary history of the flaviviridae and related viruses. J Virol 90:659–669. doi: 10.1128/JVI.02036-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Shi M, Lin XD, Tian JH, Chen LJ, Chen X, Li CX, Qin XC, Li J, Cao JP, Eden JS, Buchmann J, Wang W, Xu J, Holmes EC, Zhang YZ. 2016. Redefining the invertebrate RNA virosphere. Nature 540:539–543. doi: 10.1038/nature20167. [DOI] [PubMed] [Google Scholar]
- 3.Wolf YI, Kazlauskas D, Iranzo J, Lucia-Sanz A, Kuhn JH, Krupovic M, Dolja VV, Koonin EV. 2018. Origins and Evolution of the Global RNA Virome. mBio 9. doi: 10.1128/mBio.02329-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Liu S, Chen Y, Bonning BC. 2015. RNA virus discovery in insects. Curr Opin Insect Sci 8:54–61. doi: 10.1016/j.cois.2014.12.005. [DOI] [PubMed] [Google Scholar]
- 5.Possee RD, King LA. 2014. Insect Viruses. eLS. [Google Scholar]
- 6.Wu H, Pang R, Cheng T, Xue L, Zeng H, Lei T, Chen M, Wu S, Ding Y, Zhang J, Shi M, Wu Q. 2020. Abundant and Diverse RNA viruses in insects revealed by RNA-seq analysis: ecological and evolutionary implications. mSystems 5. doi: 10.1128/mSystems.00039-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Caesar L, Cibulski SP, Canal CW, Blochtein B, Sattler A, Haag KL. 2019. The virome of an endangered stingless bee suffering from annual mortality in southern Brazil. J Gen Virol 100:1153–1164. doi: 10.1099/jgv.0.001273. [DOI] [PubMed] [Google Scholar]
- 8.Zhang W, Gu Q, Niu J, Wang JJ. 2020. The RNA virome and its dynamics in an invasive fruit fly, bactrocera dorsalis, imply interactions between host and viruses. Microb Ecol 80:423–434. doi: 10.1007/s00248-020-01506-9. [DOI] [PubMed] [Google Scholar]
- 9.Pettersson JH, Shi M, Eden JS, Holmes EC, Hesson JC. 2019. Meta-transcriptomic comparison of the RNA viromes of the mosquito vectors Culex pipiens and Culex torrentium in Northern Europe. Viruses 11:1033. doi: 10.3390/v11111033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Shi M, Neville P, Nicholson J, Eden JS, Imrie A, Holmes EC. 2017. High-resolution metatranscriptomics reveals the ecological dynamics of mosquito-associated RNA viruses in Western Australia. J Virol 91doi: 10.1128/JVI.00680-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Atoni E, Wang Y, Karungu S, Waruhiu C, Zohaib A, Obanda V, Agwanda B, Mutua M, Xia H, Yuan Z. 2018. Metagenomic virome analysis of culex mosquitoes from Kenya and China. Viruses 10:30. doi: 10.3390/v10010030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Wang F, Fang Q, Wang B, Yan Z, Hong J, Bao Y, Kuhn JH, Werren JH, Song Q, Ye G. 2017. A novel negative-stranded RNA virus mediates sex ratio in its parasitoid host. PLoS Pathog 13:e1006201. doi: 10.1371/journal.ppat.1006201. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Wu F, Yan XP. 2018. Distribution of the related weevil species Sitophilus oryzae and S. zeamais (Coleoptera: Curculionidae) in farmer stored grains of China. J Econ Entomol 111:1461–1468. doi: 10.1093/jee/toy061. [DOI] [PubMed] [Google Scholar]
- 14.Padín S, Dal Bello G, Fabrizio M. 2002. Grain loss caused by Tribolium castaneum, Sitophilus oryzae and Acanthoscelides obtectus in stored durum wheat and beans treated with Beauveria bassiana. J Stored Products Res 38:69–74. doi: 10.1016/S0022-474X(00)00046-1. [DOI] [Google Scholar]
- 15.Lucas E, Riudavets J. 2002. Biological and mechanical control of Sitophilus oryzae (Coleoptera: Curculionidae) in rice. J Stored Products Res 38:293–304. doi: 10.1016/S0022-474X(01)00033-9. [DOI] [Google Scholar]
- 16.Adarkwah C, Obeng-Ofori D, Opuni-Frimpong E, Ulrichs C, Scholler M. 2019. Predator-parasitoid-host interaction: biological control of Rhyzopertha dominica and Sitophilus oryzae by a combination of Xylocoris flavipes and Theocolax elegans in stored cereals. Entomol Exp Appl 167:118–128. doi: 10.1111/eea.12760. [DOI] [Google Scholar]
- 17.Jackson AO, Dietzgen RG, Goodin MM, Bragg JN, Deng M. 2005. Biology of plant rhabdoviruses. Annu Rev Phytopathol 43:623–660. doi: 10.1146/annurev.phyto.43.011205.141136. [DOI] [PubMed] [Google Scholar]
- 18.Roberts JMK, Anderson DL, Durr PA. 2018. Metagenomic analysis of Varroa-free Australian honey bees (Apis mellifera) shows a diverse Picornavirales virome. J Gen Virol 99:818–826. doi: 10.1099/jgv.0.001073. [DOI] [PubMed] [Google Scholar]
- 19.Sanborn MA, Klein TA, Kim HC, Fung CK, Figueroa KL, Yang Y, Asafo-Adjei EA, Jarman RG, Hang J. 2019. Metagenomic analysis reveals three novel and prevalent mosquito viruses from a single pool of Aedes vexans nipponii collected in the Republic of Korea. Viruses 11:222. doi: 10.3390/v11030222. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Visser M, Bester R, Burger JT, Maree HJ. 2016. Next-generation sequencing for virus detection: covering all the bases. Virol J 13:85. doi: 10.1186/s12985-016-0539-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Stark R, Grzelak M, Hadfield J. 2019. RNA sequencing: the teenage years. Nat Rev Genet 20:631–656. doi: 10.1038/s41576-019-0150-2. [DOI] [PubMed] [Google Scholar]
- 22.Kapun M, Nolte V, Flatt T, Schlotterer C. 2010. Host range and specificity of the Drosophila C virus. PLoS One 5:e12421. doi: 10.1371/journal.pone.0012421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Himeno M, Onodera K, Tanami Y. 1974. Properties of flacherie virus of the silkworm, Bombyx mori. J Invertebrate Pathology 23:164–171. doi: 10.1016/0022-2011(74)90180-3. [DOI] [PubMed] [Google Scholar]
- 24.Ribiere M, Olivier V, Blanchard P. 2010. Chronic bee paralysis: a disease and a virus like no other? J Invertebrate Pathology 103:S120–S131. doi: 10.1016/j.jip.2009.06.013. [DOI] [PubMed] [Google Scholar]
- 25.Dheilly NM, Maure F, Ravallec M, Galinier R, Doyon J, Duval D, Leger L, Volkoff AN, Misse D, Nidelet S, Demolombe V, Brodeur J, Gourbal B, Thomas F, Mitta G. 2015. Who is the puppet master? Replication of a parasitic wasp-associated virus correlates with host behaviour manipulation. Proc Biol Sci 282:20142773. doi: 10.1098/rspb.2014.2773. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Oliveira DCSG, Hunter WB, Ng J, Desjardins CA, Dang PM, Werren JH. 2010. Data mining cDNAs reveals three new single stranded RNA viruses in Nasonia (Hymenoptera: Pteromalidae). Insect Mol Biol 19 Suppl 1:99–107. doi: 10.1111/j.1365-2583.2009.00934.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Webster CL, Waldron FM, Robertson S, Crowson D, Ferrari G, Quintana JF, Brouqui JM, Bayne EH, Longdon B, Buck AH, Lazzaro BP, Akorli J, Haddrill PR, Obbard DJ. 2015. The discovery, distribution, and evolution of viruses associated with Drosophila melanogaster. PLoS Biol 13:e1002210. doi: 10.1371/journal.pbio.1002210. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Habayeb MS, Ekengren SK, Hultmark D. 2006. Nora virus, a persistent virus in Drosophila, defines a new picorna-like virus family. J Gen Virol 87:3045–3051. doi: 10.1099/vir.0.81997-0. [DOI] [PubMed] [Google Scholar]
- 29.Lopez W, Page AM, Carlson DJ, Ericson BL, Cserhati MF, Guda C, Carlson KA. 2018. Analysis of immune-related genes during Nora virus infection of Drosophila melanogaster using next generation sequencing. AIMS Microbiol 4:123–139. doi: 10.3934/microbiol.2018.1.123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Xu Y, Huang L, Wang Z, Fu S, Che J, Qian Y, Zhou X. 2014. Identification of Himetobi P virus in the small brown planthopper by deep sequencing and assembly of virus-derived small interfering RNAs. Virus Res 179:235–240. doi: 10.1016/j.virusres.2013.11.004. [DOI] [PubMed] [Google Scholar]
- 31.Dainat B, Evans JD, Chen YP, Gauthier L, Neumann P. 2012. Predictive markers of honey bee colony collapse. PLoS One 7:e32151. doi: 10.1371/journal.pone.0032151. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Meng E, Tang B, Sanchez-Garcia FJ, Qiao T, Fu L, Wang Y, Hou YM, Wu JL, Chen ZM. 2019. The first complete genome sequence of a novel Tetrastichus brontispae RNA virus-1 (TbRV-1). Viruses 11:257. doi: 10.3390/v11030257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Kafer S, Paraskevopoulou S, Zirkel F, Wieseke N, Donath A, Petersen M, Jones TC, Liu S, Zhou X, Middendorf M, Junglen S, Misof B, Drosten C. 2019. Re-assessing the diversity of negative strand RNA viruses in insects. PLoS Pathog 15:e1008224. doi: 10.1371/journal.ppat.1008224. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Simmonds TJ, Carrillo D, Burke GR. 2016. Characterization of a venom gland-associated rhabdovirus in the parasitoid wasp Diachasmimorpha longicaudata. J Insect Physiol 91–92:48–55. doi: 10.1016/j.jinsphys.2016.06.009. [DOI] [PubMed] [Google Scholar]
- 35.Longdon B, Obbard DJ, Jiggins FM. 2010. Sigma viruses from three species of Drosophila form a major new clade in the rhabdovirus phylogeny. Proc Biol Sci 277:35–44. doi: 10.1098/rspb.2009.1472. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Sylvester ES, Richardson J. 1992. Aphid-borne rhabdoviruses-relationships with their vectors, p 313–341. In Harris KF (ed), Advances in disease vector research. Springer New York, New York, NY. doi: 10.1007/978-1-4612-2910-0_10. [DOI] [Google Scholar]
- 37.Pringle CR, Easton AJ. 1997. Monopartite negative strand RNA genomes. Semin Virol 8:49–57. doi: 10.1006/smvy.1997.0105. [DOI] [Google Scholar]
- 38.Whelan SP, Barr JN, Wertz GW. 2004. Transcription and replication of nonsegmented negative-strand RNA viruses. Curr Top Microbiol Immunol 283:61–119. doi: 10.1007/978-3-662-06099-5_3. [DOI] [PubMed] [Google Scholar]
- 39.Liu Y, Du Z, Wang H, Zhang S, Cao M, Wang X. 2018. Identification and characterization of wheat yellow striate virus, a novel leafhopper-transmitted nucleorhabdovirus infecting wheat. Front Microbiol 9:468. doi: 10.3389/fmicb.2018.00468. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Guo H, Song X, Xie C, Huo Y, Zhang F, Chen X, Geng Y, Fang R. 2013. Rice yellow stunt rhabdovirus protein 6 suppresses systemic RNA silencing by blocking RDR6-mediated secondary siRNA synthesis. Mol Plant Microbe Interact 26:927–936. doi: 10.1094/MPMI-02-13-0040-R. [DOI] [PubMed] [Google Scholar]
- 41.Huang YW, Geng YF, Ying XB, Chen XY, Fang RX. 2005. Identification of a movement protein of rice yellow stunt rhabdovirus. J Virol 79:2108–2114. doi: 10.1128/JVI.79.4.2108-2114.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Hiraguri A, Hibino H, Hayashi T, Netsu O, Shimizu T, Uehara-Ichiki T, Omura T, Sasaki N, Nyunoya H, Sasaya T. 2012. The movement protein encoded by gene 3 of rice transitory yellowing virus is associated with virus particles. J Gen Virol 93:2290–2298. doi: 10.1099/vir.0.044420-0. [DOI] [PubMed] [Google Scholar]
- 43.Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L, Raychowdhury R, Zeng QD, Chen ZH, Mauceli E, Hacohen N, Gnirke A, Rhind N, di Palma F, Birren BW, Nusbaum C, Lindblad-Toh K, Friedman N, Regev A. 2011. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol 29:644–652. doi: 10.1038/nbt.1883. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Kelley LA, Mezulis S, Yates CM, Wass MN, Sternberg MJE. 2015. The Phyre2 web portal for protein modeling, prediction and analysis. Nat Protoc 10:845–858. doi: 10.1038/nprot.2015.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Galiez C, Magnan CN, Coste F, Baldi P. 2016. VIRALpro: a tool to identify viral capsid and tail sequences. Bioinformatics 32:1405–1407. doi: 10.1093/bioinformatics/btv727. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Blom N, Gammeltoft S, Brunak S. 1999. Sequence and structure-based prediction of eukaryotic protein phosphorylation sites. J Mol Biol 294:1351–1362. doi: 10.1006/jmbi.1999.3310. [DOI] [PubMed] [Google Scholar]
- 47.Steentoft C, Vakhrushev SY, Joshi HJ, Kong Y, Vester-Christensen MB, Schjoldager KT, Lavrsen K, Dabelsteen S, Pedersen NB, Marcos-Silva L, Gupta R, Bennett EP, Mandel U, Brunak S, Wandall HH, Levery SB, Clausen H. 2013. Precision mapping of the human O-GalNAc glycoproteome through SimpleCell technology. EMBO J 32:1478–1488. doi: 10.1038/emboj.2013.79. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, Ren J, Li WW, Noble WS. 2009. MEME SUITE: tools for motif discovery and searching. Nucleic Acids Res 37:W202–W208. doi: 10.1093/nar/gkp335. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Madeira F, Park YM, Lee J, Buso N, Gur T, Madhusoodanan N, Basutkar P, Tivey ARN, Potter SC, Finn RD, Lopez R. 2019. The EMBL-EBI search and sequence analysis tools APIs in 2019. Nucleic Acids Res 47:W636–W641. doi: 10.1093/nar/gkz268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Waterhouse AM, Procter JB, Martin DMA, Clamp M, Barton GJ. 2009. Jalview version 2-a multiple sequence alignment editor and analysis workbench. Bioinformatics 25:1189–1191. doi: 10.1093/bioinformatics/btp033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Crooks GE, Hon G, Chandonia JM, Brenner SE. 2004. WebLogo: a sequence logo generator. Genome Res 14:1188–1190. doi: 10.1101/gr.849004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Zhang D, Gao F, Jakovlic I, Zou H, Zhang J, Li WX, Wang GT. 2020. PhyloSuite: an integrated and scalable desktop platform for streamlined molecular sequence data management and evolutionary phylogenetics studies. Mol Ecol Resour 20:348–355. doi: 10.1111/1755-0998.13096. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Virus-like contigs identified in three different wasps’ transcriptomes. AC, Anisopteromalus calandrae; LD, Lariophagus distinguendus; and TE, Theocolax elegans. Download Table S3, XLSX file, 0.01 MB (12.4KB, xlsx) .
Copyright © 2021 Wang et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Assemble strategy view of the contigs from the different wasps’ transcriptomes. Download FIG S1, TIF file, 0.4 MB (410.2KB, tif) .
Copyright © 2021 Wang et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Contigs from these three wasps. AC, Anisopteromalus calandrae; LD, Lariophagus distinguendus; and TE, Theocolax elegans. Download Text S1, TXT file, 0.09 MB (89.5KB, txt) .
Copyright © 2021 Wang et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Primers used in this study. Download Table S1, XLSX file, 0.02 MB (25.6KB, xlsx) .
Copyright © 2021 Wang et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
PCR detection of the six viruses in the Sitophilus oryzae adults. In lanes 1 to 7 were the detection results from using specific primers from WWPSRV-1, WWPSRV-2, AcPSRV-1, AcNSRV-1, AcNSRV-2, LdNSRV-1, and actin (weevil), respectively. The primers used to detect viral genome sequence are shown in Table S1. Download FIG S2, TIF file, 0.3 MB (324.8KB, tif) .
Copyright © 2021 Wang et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Graphical representation of the polyproteins encoded by the (+)ssRNA viruses identified in the wasps’ transcriptomes. (A) The polyprotein encoded by WWPSRV-1 ORF1. (B) The polyprotein encoded by WWPSRV-2 ORF. (C) The polyprotein encoded by AcPSRV-1 ORF1. Numbers represent the amino acid positions from the N terminus to C terminus. Download FIG S3, TIF file, 0.2 MB (217.6KB, tif) .
Copyright © 2021 Wang et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Structure of helicase, protease, and RdRp domains of WWPSRV-1, WWPSRV-2, and AcPSRV-1. (A) Partial amino acids containing helicase motifs A to C are shown with their secondary structures (red cylinders, α-helices; green arrows, β-strands). Sequence logos for helicase motifs A to C of 12 positive-strand RNA viruses belonging to the families Dicistroviridae (ABPV, CrPV, TrV, and BQCV), Iflaviridae (DWV, SBV, SBPV, and ApIV), and unclassified Nora virus-related viruses (HaNV, AiNV, ANV, and DiNV) are shown with the corresponding sequences of WWPSRV-1, WWPSRV-2, and AcPSRV-1. (B) The three-dimensional structural models of the helicase domains of these three viruses predicted by the Phyre2 web. The positions of motifs A to C are marked. Motif A, magenta; motif B, red; motif C, blue. Confidence in the models of WWPSRV-1, WWPSRV-2, and AcPSRV-1: 172 residues (95%) modeled at >90% accuracy, 166 residues (97%) modeled at >90% accuracy, and 178 residues (98%) modeled at >90% accuracy, respectively. (C) Multiple sequence alignment of the putative protease domains of these 3 viruses and 12 other picorna-like viruses. Identical residues are shaded in dark blue and similar residues are in light blue. (D) Partial amino acid containing RdRp motifs A to H are shown with their secondary structures (red cylinders, α-helices; green arrows, β-strands). Sequence logos for RdRp motifs A to H of 12 positive-strand RNA viruses are shown with the corresponding sequences of these 3 viruses. (E) The three-dimensional structural models of the RdRp domains of these three viruses predicted by the Phyre2 web. The positions of motifs A to H are marked. Motif A, magenta; motif B, orange; motif C, yellow; motif D, cyan; motif E, wheat; motif F, blue; motif G, red; motif H, white. Confidence in the models of WWPSRV-1, WWPSRV-2, and AcPSRV-1: 374 residues (100%) modeled at >90% accuracy, 442 residues (100%) modeled at >90% accuracy and 388 residues (100%) modeled at >90% accuracy, respectively. Full names and references of these viruses are shown in Table S2. Download FIG S4, TIF file, 1.4 MB (1.4MB, tif) .
Copyright © 2021 Wang et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Viral sequences used for phylogenetic analysis, sequence alignments, and protein prediction in this study. Download Table S2, XLSX file, 0.01 MB (15.1KB, xlsx) .
Copyright © 2021 Wang et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Graphical representation of the G proteins encoded by the (−)ssRNA viruses identified in the wasps’ transcriptomes. (A) The polyprotein encoded by AcNSRV-1 ORF4. (B) The polyprotein encoded by AcNSRV-1 ORF4. (C) The polyprotein encoded by LdNSRV-1 ORF4. Numbers represent the amino acid positions from the N terminus to C terminus. Download FIG S5, TIF file, 0.2 MB (185.6KB, tif) .
Copyright © 2021 Wang et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Structure of the RdRp, PRNTase, and methyltransferase domains of AcNSRV-1, AcNSRV-2, and LdNSRV-1. (A) Partial amino acid containing RdRp motifs A to F and GHP (Gly-His-Pro) are shown with their secondary structures (red cylinders, α-helices; green arrows, β-strands). Sequence logos for RdRp motifs A to G and GHP of 12 negative-strand RNA viruses belonging to the families Lispiviridae (HbRLV-3, TcTV-6, LHRLV-1, BerV, IARV-OKIAV103, and BARV-OKIAV102) and Rhabdoviridae (WHFV-2, SyFV-3, LNV-2, HyRRV-OKIAV38, SFRV, and LeRRV-OKIAV34) are shown with the corresponding sequences of AcNSRV-1, AcNSRV-2, and LdNSRV-1. (B) The three-dimensional structural models of the RdRp domains of these three viruses predicted by the Phyre2 web. The positions of motifs A to F and GHP are marked. Motif GHP, red; motif A, yellow; motif B, magenta; motif C, cyan; motif D, orange; motif E, white; motif F, blue. Confidence in the models of AcNSRV-1, AcNSRV-2, and LdNSRV-1: 828 residues (100%) modeled at >90% accuracy, 851 residues (99%) modeled at >90% accuracy, and 877 residues (100%) modeled at >90% accuracy, respectively. (C) Partial amino acids containing PRNTase motifs A to E are shown with their secondary structures (red cylinders, α-helices; green arrows, β-strands). Sequence logos for PRNTase motifs A to E of 12 negative-strand RNA viruses are shown with the corresponding sequences of these three viruses. (D) The three-dimensional structural models of PRNTase domains of these three viruses predicted by the Phyre2 web. The positions of motifs A to E are marked with orange. Confidence in the models of AcNSRV-1, AcNSRV-2, and LdNSRV-1: 235 residues (100%) modeled at >90% accuracy, 235 residues (100%) modeled at >90% accuracy, and 237 residues (100%) modeled at >90% accuracy, respectively. (E) Partial amino acid containing methyltransferase motifs (I to IV and VI to X) are shown with their secondary structures (red cylinders, α-helices; green arrows, β-strands). Sequence logos for methyltransferase motifs of 12 negative-strand RNA viruses are shown with the corresponding sequences of these three viruses. (F) The three-dimensional structural models of the methyltransferase domains of these three viruses predicted by the Phyre2 web. KDKE observed in motif X, motif IV, motif VI, and motif VIII, are marked with red, magenta, cyan, and orange, respectively. Motif I is marked with blue. Confidence in the models of AcNSRV-1, AcNSRV-2, and LdNSRV-1: 195 residues (100%) modeled at >90% accuracy, 197 residues (100%) modeled at >90% accuracy, and 194 residues (100%) modeled at >90% accuracy, respectively. Full names and references of these viruses are shown in Table S2. Download FIG S6, TIF file, 2.2 MB (2.2MB, tif) .
Copyright © 2021 Wang et al.
This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Data Availability Statement
The GenBank accession numbers of WWPSRV-1, WWPSRV-2, AcPSRV-1, AcNSRV-1, AcNSRV-2, and LdNSRV-1 are MW864600, MW864601, MW864599, MW864602, MW864603, and MW864604, respectively. BioProject accession for the RNA-seq of three parasitoid wasps of the rice weevil Sitophilus oryzae is PRJNA719021.





