ABSTRACT
Mosquitoes and biting Culicoides species are arbovirus vectors. Effective virome profile surveillance is essential for the prevention and control of insect-borne diseases. From June to September 2021, we collected eight species of female mosquito and Culicoides on Zhoushan Island, China, and used meta-viromic sequencing to analyze their virome compositions and characteristics. The classified virus reads were distributed in 191 genera in 66 families. The virus sequences in mosquitoes with the largest proportions were Iflaviridae (30.03%), Phasmaviridae (23.09%), Xinmoviridae (21.82%), Flaviviridae (13.44%), and Rhabdoviridae (8.40%). Single-strand RNA+ viruses formed the largest proportions of viruses in all samples. Blood meals indicated that blood-sucking mosquito hosts were mainly chicken, duck, pig, and human, broadly consistent with the habitats where the mosquitoes were collected. Novel viruses of the Orthobunyavirus, Narnavirus, and Iflavirus genera were found in Culicoides by de-novo assembly. The viruses with vertebrate hosts carried by mosquitoes and Culicoides also varied widely. The analysis of unclassified viruses and deep-learning analysis of the “dark matter” in the meta-viromic sequencing data revealed the presence of a large number of unknown viruses.
IMPORTANCE The monitoring of the viromes of mosquitoes and Culicoides, widely distributed arbovirus transmission vectors, is crucial to evaluate the risk of infectious disease transmission. In this study, the compositions of the viromes of mosquitoes and Culicoides on Zhoushan Island varied widely and were related mainly to the host species, with different host species having different core viromes. and many unknown sequences in the Culicoides viromes remain to be annotated, suggesting the presence of a large number of unknown viruses.
KEYWORDS: core virome, Culicoides, meta-virome, mosquito, virome characteristics
INTRODUCTION
Mosquitoes and biting Culicoides species are common vectors of arboviruses that cause a variety of significant diseases, including dengue virus, Zika virus, Getah virus, West Nile virus, Japanese encephalitis virus, bluetongue virus, and Banna virus (1–12). Apart from arboviruses, many insect-specific viruses (ISVs) carried by mosquitoes and Culicoides remain to be discovered and characterized. Some ISVs have shown the potential to modulate arbovirus replication and the vector competence of mosquitoes; for example, the Palm Creek virus, Nhumirim virus, Eilat virus, and Culex flavivirus may inhibit the West Nile, dengue, and chikungunya viruses (13–23).
Viromes carried by mosquitoes have been shown to be species specific (24, 25), and the core virome is related closely to vertical transmission and the environment (26–28). The viruses of the core viromes of various organisms have been characterized and shown to be significantly similar among individuals of the same mosquito species and diverse among species (26, 29). Consistent with vertical virus transmission, the core virome was found to remain stable in laboratory-grown and field-collected Aedes albopictus across developmental stages (26). Comparative analysis of the most abundant viruses has enabled the characterization of the viromes of different species, and the identification of species likely to transmit the same virus (28). Culicoides are abundant and can actively or mechanically carry a variety of pathogens (30). Most of the viruses transmitted by Culicoides are members of the Reoviridae, Rhabdoviridae, and Perbunyaviridae families (31). However, few studies have been performed to examine Culicoides viromes. Meta-viromic approaches have enabled the characterization of viromes carried by Culicoides in some regions and promoted the discovery of new viruses. Analysis of the RNA viromes of 10 Culicoides species in Thrace, southeastern Europe, led to the identification of novel RNA viruses with genomic signatures and revealed that each Culicoides species carried a specific virus set (32). The viromes of Culicoides impunctatus populations in Scotland (33), Culicoides imicola in Senegal (34), Culicoides arakawae in Japan (35), and mixed Culicoides populations (6, 36) have been examined.
Next-generation sequencing has greatly facilitated the study of the viromes carried by mosquitoes and Culicoides, and accelerated the discovery of new pathogens. The identification of many new RNA and DNA viruses by next-generation sequencing is of great significance for the dynamic monitoring of pathogens and the response to emerging and recurrent infectious diseases (37, 38). In a typical viral group study, however, much of the sequence data, including a significant number of unannotated sequences, is unknown and is referred to as “dark matter.” The advent of computational tools for the identification of viral sequences in metagenomics has improved our ability to identify known and novel viruses. In recent years, deep-learning methods have been used to analyze the presence of dark matter in sequences, which adds understanding of the viral components (33–36) and enables the functionally relevant representation of unknown and unannotated sequences (39). In this study, we attempted to use deep-learning methods to characterize potential viral sequences in mosquitoes and midges.
In the present study, we identified species of mosquitoes and Culicoides in five habitats related closely to human activities on Zhoushan Island through molecular biology and morphological analysis. Using meta-viromic sequencing, bioinformatics analysis, and deep learning, we explored the characteristics and compositions of the core viromes of mosquitoes and Culicoides on the island. Our findings improve the understanding of the mosquito and Culicoides viral spectra and aid the evaluation of the risk of local mosquito- and midge-borne infectious disease occurrence.
RESULTS
Samples.
In total, approximately 12,300 mosquitoes of at least eight species from four genera (Culex, Anopheles, Aedes, and Armigeres) were collected at the sampling sites. The dominant mosquito species differed among sites (Table S1). About 5,000 Culicoides were collected from the sites; based on their morphological characteristics, three dominant species (C. arakawae, C. lungchiensis, and C. punctatus) were identified (Fig. 1A). Mosquito and midge group and sequence data are provided in Table 1.
FIG 1.
Maps from Ali Cloud data visualization platform. The sampling, sequencing, and comparative viromic analysis of mosquitoes and Culicoides. (A) Sampling sites in the main island of Zhoushan city. The map shows the distribution of samples’ location and habitat of five sampling sites in Dinghai district, Zhoushan city, Zhejiang Province, China. (B) Meta-virome sequencing and bioinformatic analysis pipeline of mosquitoes and Culicoides samples. (C) Comparative viromic analysis processes of Culicoides and mosquito samples.
TABLE 1.
Pooling information of mosquitoes and Culicoides samples
| Pool name | Species | Sample location | Sample counta | Visible blood engorgement |
|---|---|---|---|---|
| Midge_1 | Culicoides | Farmland | 30 | Unknown |
| Midge_2 | Culicoides | Residential area | 30 | Unknown |
| Midge_3 | Culicoides | Pond | 20 | Unknown |
| Midge_4 | Culicoides | Residential area | 50 | Unknown |
| Midge_5 | Culicoides | Residential area | 50 | Unknown |
| Ae.a_1 | Aedes albopictus | Residential area | 8 | No |
| Ae.a_2 | Aedes albopictus | Village | 4 | Yes |
| Ae.v | Aedes vexans | Village | 2 | No |
| An.s_1 | Anopheles sinensis | Pig farm | 10 | No |
| An.s_2 | Anopheles sinensis | Pond | 10 | No |
| An.s_3 | Anopheles sinensis | Pond | 10 | No |
| Ar.s_1 | Armigeres subalbatus | Residential area | 1 | No |
| Ar.s_2 | Armigeres subalbatus | Village | 2 | Yes |
| C.p.p_5 | Culex pipiens pallens | Residential area | 1 | Yes |
| C.p.p_6 | Culex pipiens pallens | Residential area | 2 | No |
| C.p_1 | Culex pipiens | Farmland | 5 | No |
| C.p_2 | Culex pipiens | Pig farm | 2 | No |
| C.p_3 | Culex pipiens | Pond | 1 | No |
| C.p_4 | Culex pipiens | Residential area | 7 | No |
| C.q | Culex quinquefasciatus | Residential area | 8 | No |
| C.t_1 | Culex tritaeniorhynchus | Farmland | 10 | No |
| C.t_2 | Culex tritaeniorhynchus | Village | 10 | No |
| C.t_3 | Culex tritaeniorhynchus | Village | 5 | No |
| C.t_4 | Culex tritaeniorhynchus | Farmland | 1 | Yes |
| C.t_5 | Culex tritaeniorhynchus | Pig farm | 10 | No |
| C.t_6 | Culex tritaeniorhynchus | Pig farm | 8 | Yes |
| C.t_7 | Culex tritaeniorhynchus | Pond | 10 | No |
| C.t_8 | Culex tritaeniorhynchus | Residential area | 8 | No |
The number of samples in each pool is related to the number of samples collected from each site.
Virome compositions.
Meta-viromic sequencing of 28 vector pools yielded an average of 9,634,984 (range = 360,702 to 55,161,908) reads per pool for virus discovery and characterization. Reads assigned to viruses represented 0.007% to 97% of all reads in each pool (Fig. 2A). The classified viral reads were distributed in 191 genera of 66 families; 99.67% of them belonged to six families. The largest proportion (93%) belonged to the Polycipiviridae, found mainly in Culicoides samples (99.99%). The viral families commonly identified in the mosquito viromes were Iflaviridae (30.03%), Phasmaviridae (23.09%), Xinmoviridae (21.82%), Flaviviridae (13.44%), and Rhabdoviridae (8.40%), found mainly in Culex pipiens pallens, Anopheles sinensis, Culex tritaeniorhynchus, and Ae. albopictus, respectively.
FIG 2.
Composition of sequencing reads annotated with Kraken2. (A) The proportion of the sequencing reads annotated to eukaryotes, bacteria, archaea and fungi. (B) The composition of the top 10 classified viral families in each sample pool.
Host species are important drivers of virome composition.
Our data suggested that the virome composition is species related and that different species have different core virome groups, especially Culicoides (Polycipiviridae), Ae. albopictus (Rhabdoviridae), and A. sinensis (Phasmaviridae, Xinmoviridae; Fig. 2B and 3A). The virome composition had little to do with the habitat environment (stress = 0.12; Fig. 3B). Baculoviridae, Mimiviridae, Poxviridae, Polydnaviridae, and Siphoviridae were identified in all C. pipiens samples. One C. p. pallens sample (C.p.p_5) contained a large number of Iflaviridae reads and was obviously an outlier, probably because the mosquitoes were full of blood (Fig. 2B and 3). Thirty-eight viral families were found in C. tritaeniorhynchus, and 19 of them were common to all habitats.
FIG 3.
The relationship between virome, host species, and habitat environment. (A) Heatmap of sample clustering based on RPM. The heatmap shows the normalized reads counts by meta-viromic Seq on the log2 scale. The hierarchical clustering is based on the Euclidean distance matrix calculated from the normalized reads count. The viral family shown in the heatmap are from the taxonomic annotation by Kraken2 and Bracken. (B) Nonmetric multidimensional scaling (NMDS) of virus on family level. Using Bray-Curtis dissimilarities calculated from the abundance of virome, NMDS was then applied to the ordination analysis for the beta diversity. STRESS = 0.1234, PERMANOVA test on species: P = 0.001 and R2 = 0.6129.
Virome characterization for important arboviruses and ISVs from C. tritaeniorhynchus, Ae. Albopictus, and Culicoides.
Among all 10 viral taxonomy, single-strand (ss)RNA+ were annotated with the longest viral contigs, followed by ssRNA–, double-strand (ds)DNA, and dsRNA. The major viral genome types varied among the sampled speices. For example, the major type in Culicoides and Ae. albopictus was ssRNA+, whereas that in A. sinensis was ssRNA–. Among all samples, we annotated 14 viruses with vertebrate hosts (Table S3).
Previous studies have shown that arboviruses are concentrated in six viral families: Flaridae, Rhabdoviridae, Peribunyaviridae, Togavividae, Reoviridae, and Phenuivividae (40). We analyzed contig data from these six families. West Nile virus, dengue virus 3, High Island virus (Reoviridae), Guapiacu virus (Flaviviridae), and Piry virus (Rhabdoviridae) sequences were found in Ae. albopictus. Some contigs were found to have 80% to 100% aa identity with the NS5 region of dengue virus 3 (GenBank: MH051731.1; Fig. S1) and 80% identity with that of Aedes aegypti, suspected to reflect endogenous viral elements or part of the Aedes genome. No viral gene was found in the sequences annotated as dengue virus and West Nile virus, and the corresponding PCR results were negative. In Culicoides, Quang Binh virus (Flaviviridae), Wuhan mosquito virus 9 and Culex tritaeniorhynchus rhabdovirus (Rhabdoviridae), and a novel Thimiri orthobunyavirus (Bunyaviridae) were identified. A small amount of Culex flavivirus was found in Culicoides and C. tritaeniorhynchus. Wuhan mosquito virus 9 was found in midges and mosquitoes, and its sequence included open reading frames (ORFs) 1 to 3, glycoprotein, and RNA-dependent RNA polymerase (RdRp). The conserved domains include Mononeg_RNApol and paramyx_RNAcap. Phylogenetic analysis based on the RdRp showed the closest affinity to Wuhan mosquito virus 9 (GenBank: YP009305112.1; Fig. 4B). The whole Xincheng mosquito virus genome, including Mononeg_mRNAcap and paramyx_RNAcap, was obtained from A. sinensis by de-novo assembly (Fig. 4A). A long (9,751 bp) contig with 81.69% nucleotide identity with Culex densovirus was found in Armigeres subalbatus. Only one viral protein, Aedes aegypti Thai densovirus nonstructural protein 1, was found, and no conserved domain was detected. Although the RdRp fragment could not be found and whether it was a new virus could not be confirmed, the ability of some densovirus to reduce mosquito populations deserves attention (41). We also found a 3,071-bp contig annotated as Aedes pseudoscutellaris reovirus with 70.43% nucleotide identity and 59.02% aa identity in the VP5 region in C. pipiens.
FIG 4.
Phylogenetic tree of virus in mosquitoes and Culicoides samples. (A) Maximum likelihood phylogeny of Wuhan mosquito virus 9 based on the amino acid sequence of RdRp gene. (B) Maximum likelihood phylogeny of Xincheng mosquito virus based on the amino acid sequence of RdRp gene.
Source of blood meal from mosquitoes.
We analyzed blood meals in five pooled mosquito samples from four sampling sites. We compiled their corresponding Chordata contigs and chose animals that mosquitoes and Culicoides might feed on. The blood meal sources fell into four broad categories of Aves and Mammalia (chicken, duck, pig, and human) that were broadly consistent with the habitats where the mosquitoes were collected. Blood meals from A. subalbatus samples from the village and Ae. albopictus samples from the residential area were mainly human, and that of C. p. pallens from the residential area was mainly duck. Those of C. tritaeniorhynchus collected from the farmland and pig farm were chicken and pig, respectively. Rodent sequences were found in various samples, potentially representing a link in the mosquito-borne virus transmission chain. The viruses carried by the blood-sucking mosquitoes can be carried by rats, chickens, pigs, and humans, concordant with the blood meal calls.
Potential discovery of novel viruses and dark matter.
De-novo read assembly revealed the presence of a novel narnavirus (Narnaviridae) in two Culicoides samples (Midge_3 and Midge_4), which was named Culicoides narna-like virus 1 (CNV1). The longest ORF in the 2,485-bp genome of CNV1 encodes an RdRp related closely to Sanya ochthera mantis narnavirus 2 (36.36% aa similarity; Table S4a). The narnavirus hosts phylogenetically closest to CNV1 are arthropods, including Ochthera mantis and Drosophila (Fig. 5A). Many Simbu serogroup virus (Orthobunyavirus, Bunyaviridae) sequences were found in the Midge_1 and Midge_3 samples. Parts of the L segment (4,186 bp, ~81.27% nucleotide identity with MH484336.1) and S segment (982 bp, ~85.35% nucleotide identity with MH484338.1) were obtained by de-novo assembly, and the L segment contained a conserved domain annotated as the Bunya RdRp. Phylogenetic analysis showed that it was closest to Thimiri orthobunyavirus (Fig. 5C). We also assembled a contig (10,785 bp) annotated as Hubei picorna-like virus 29 (73.10% nucleotide identity), with 44.12% identity of the hypothetical protein. Three conserved domains were identified using the NCBI CD-Search software with an E-value of e−5: ps-ssRNAv-Picornavirales (catalytic core domain of RdRp), rhv_like (Picornavirus capsid protein domain_like), and RNA_helicase (Table S5). A phylogenetic tree constructed based on a polyprotein (3,156 aa) suggested that the contig represented a novel Iflavirus, a picorna-like virus that infects arthropods (Fig. 5B). Another contig was annotated as Hubei tombus-like virus 20 and was considered to be a new viral sequence because it showed <80% nucleotide identity. A 3,323-bp novel viral sequence of Zhejiang mosquito virus 3 was found in C. pipiens and showed 93.01% nucleotide identity with the reference genome (KX883461.1 3226 bp). Phylogenetic analysis showed that the RdRp gene was related most closely to Zhejiang mosquito virus 3 (ASA47305.1; Fig. 5D), followed by some narnaviruses, such as Aedes japonicus narnavirus 1 (QIP67846.2) and Xanthi narna-like virus (QRD99904.1).
FIG 5.
Phylogenetic tree of potential novel virus in mosquitoes and Culicoides samples. (A) Maximum likelihood phylogeny of unclassified Narnavirus based on the amino acid sequence of part RdRp gene. (B) Maximum likelihood phylogeny of unclassified iflavirus based on the amino acid sequence of polyprotein. (C) Maximum likelihood phylogeny of unclassified Orthobunyavirus based on the amino acid sequence of part segment L. (D) Maximum likelihood phylogeny of Zhejiang mosquito virus 3 based on the amino acid sequence of RdRp gene.
Nearly 50% (n = 2,704,391) of the contigs were unannotated; 2,180,762 were annotated (Fig. 6A). The proportions of suspected viral sequences predicted by DeepVirFinder (DVF) varied widely among samples and ranged from 5.46% (for C.p_2) to 25.27% (for Midge_5). The proportion of unannotated contigs predicted to be viral sequences was larger in Culicoides than in mosquito samples (Fig. 6B). The median length of the sequences identified by BLAST was 421 to 794 bp and that of the sequences predicted by DVF was 578 to 887 bp (Table S6). The GC content of sequences identified by BLAST ranged from 35% to 49% (mean, 42.7%) and that of sequences predicted by DVF ranged from 36% to 47% (mean, 43.5%), with no significant difference between methods (P = 0.18, n = 28; Fig. S2).
FIG 6.
Comparison of the proportion of viral unblast contigs in mosquitoes and Culicoides. (A) The proportion of the annotated contigs. (B) The comparison of annotated contigs between mosquitoes and Culicoides. Welch’s t test was used for comparison between mosquitoes and Culicoides.
DISCUSSION
This study is the first in which high-throughput sequencing was used to characterize the viruses carried by mosquitoes and midges on Zhoushan Island, contributing to the knowledge of arthropod viromics in the coastal areas of southeastern China. Similar to previous findings, we found that C. tritaeniorhynchus and Ae. albopictus were the dominant mosquito species and had preferred habitats (42), and C. arakawae is a common local Culicoides species (43).
Viruses of the Rhabdoviridae, Polycipiviridae, Phasmaviridae, Phenuiviridae, and other virus families were identified in this study, as in other studies, but in markedly different proportions (32). In this study, Polycipiviridae viruses (e.g., Hubei picorna-like virus 82, a ssRNA+ insect-related virus found in spiders) were very abundant in Culicoides viromes, which has not been reported previously. Meta-viromic sequencing of C. impunctatus viromes revealed viruses of the Polycipiviridae family, but in only one group (33). We also made the novel identification of Thimiri orthobunyavirus (genus Orthobunyavirus), originally isolated in birds (44) and known to be hosted by Culicoides histrio (45), in Culicoides collected from the farm and pond sites, suggesting that this virus exists in different environments and organisms in the study region. To our knowledge, this is the first description of Simbu serogroup viruses in Culicoides from Zhoushan Island. Orthobunyaviruses of the Simbu serogroup are transmitted by insects (primarily biting midges) and infect mammals and birds (46). Recent research suggests that Culicoides are associated with potential Simbu serogroup virus transmission and continued prevalence in livestock (46–49). Thus, further epidemiological investigation of Simbu serogroup viral infection in the study region is needed. Our research has expanded the group of known Culicoides-associated viruses and demonstrated that much about these viruses remains unknown.
Different mosquito species may carry unique virus groups (24, 29), which will help to strengthen the monitoring and research on important mosquito species that carry insect-borne viruses. The core viromes identified in this study were very diverse among mosquito species and different than those reported previously (28, 50); the eukaryotic viromes of mosquitoes and Culicoides collected on Zhoushan Island are stable and diverse (27–29). More Hubei mosquito virus 2 was found in C. tritaeniorhynchus. Wuhan mosquito virus 1 (Phasmaviridae) and Xincheng mosquito virus (Xinmoviridae) were found in A. sinensis and a relatively complete Xincheng mosquito virus was assembled. Similar results were obtained in previous mosquito virome studies conducted in Hubei Province, China (25, 51), but not in studies of the same mosquito species conducted on other continents (28). Species-specific mosquito viromes may be stable within regional ecosystems, as core virome genomes are usually found in relatively small areas (e.g., Guadeloupe, Zhoushan) or large geographic areas (e.g., Australia, Northern Europe, and parts of China) are conservative and stable (28). The Xincheng mosquito virus that we found is not completely consistent with that found in A. sinensis in Hubei Province (~96.7% nucleotide similarity), and other viruses have similar findings, reflecting small differences in the core viromes of mosquito species at this geographic scale.
In this study, we extended the scope of molecular approaches to identify blood meal sources using meta-viromic sequencing data (52). The host identification was broadly consistent with the habitats where the samples were collected, which enables xenosurveillance. A long contig of Avian leukosis virus was found in a group of apparently blood-sucking C. tritaeniorhynchus mosquitoes, for which the blood meal source was identified as chickens on a farm. This fragment accounts for 70% of the full length of the virus (GenBank no. MK951780.1). As viral proteins were found in it, we consider it to be an exogenous viral sequence. When the virus is incorporated into the mosquito's blood meal during feeding, the meta-viromic of satiated mosquitoes can reflect the viral diversity of many mammals, birds, and humans by means of xenosurveillance (53–55). We believe that xenosurveillance for viruses with vertebrate hosts in the viromes of mosquitoes and midges could play an important role in the monitoring of local vertebrate diseases.
Genome sequencing has enhanced our understanding of emerging viromes by providing blueprints of the evolutionary and functional diversity of viruses, but sequences always contain dark matter that cannot be identified or matched. As not all viruses can be isolated and cultured in the laboratory, numerous algorithms, databases, and pipelines have been developed to process virome sequencing data and support dark matter exploration (56–58). Using DVF and CheckV software, we performed a preliminary exploration of a large number of unannotated sequences in the study samples, looking for possibly viral sequences in the dark matter. The dark matter contained a certain proportion of viral sequences, with a larger proportion of unannotated sequences predicted to be viruses in midge than in mosquito samples. A fraction of the ORFs was annotated as viral proteins with <50% similarity. We speculate that midges carry a large number of unknown viruses. Further exploration is needed to confirm this speculation and examine virus evolution in these species.
This study has some limitations. Due to the COVID-19 pandemic, we were able to collect samples at only a small number of sites, and not continuously in all cases. Moreover, although meta-viromic sequencing aids the detection of less abundant viruses, it does not enable the distinction of endogenous viral elements; thus, further analysis and validation are needed to enhance the robustness of the data. We will continue to monitor the viromes of mosquitoes and biting midges on Zhoushan Island and other islands off China’s southeastern coast.
MATERIALS AND METHODS
Sample collection and species identification.
Zhoushan Island is located in northeast Zhejiang Province, China. It has a subtropical monsoon climate with multiple typhoons occurring in summer, which facilitates the efficient reproduction of mosquitoes, Culicoides, and the viruses they carry. Imported cases of dengue fever on Zhoushan Island have been reported, and outbreaks of dengue fever have occurred in nearby Daishan County (59). From June 2021 to September 2021, we collected samples of mosquitoes and Culicoides at five locations on Zhoushan Island: a pig farm, farmland, a pond, a village, and an urban residential area (Fig. 1A; Table S1). The samples were collected using UV traps. Captured mosquitoes and Culicoides were stored immediately at −20°C, and then identified by experienced field biologists based on their morphological characteristics under an anatomical lens while on ice. The samples were then transported to the laboratory in a freezer, and the mosquito species were confirmed by sequencing of the cytochrome c oxidase subunit I gene (60). All samples were then stored at −80°C until further analysis.
RNA extraction.
One to 10 female mosquitoes or 20 to 50 Culicoides were pooled based on species, sampling location, and visible blood engorgement of the mosquitoes. When more than 10 mosquitoes or 50 midges had been collected from a sampling site, 10 or 50 were selected randomly. Each pooled sample was washed once with sterile phosphate-buffered saline solution and then twice in diethyl pyrocarbonate (DEPC)-treated water to remove potential surface microorganisms. RNA was dissolved in 30 μL non-RNase DEPC and frozen at −80°C.
cDNA library preparation, amplification, and sequencing.
DNA library construction and sequencing were performed at the Beijing Genomics Institute. A nucleic acid microbes purification kit (BGI PathoGenesis Pharmaceutical Technology, BGI-Shenzhen, Shenzhen, China) was used to enrich the viruses in all samples. The remaining RNA was fragmented to about 280 bp and then reverse transcribed into cDNA, followed by second-strand synthesis. Using the synthetic double-stranded DNA, a DNA library was constructed through end repair, unique dual-index adaptor ligation, and PCR amplification. The library was qualified with a Qubit 4.0 fluorometer (Thermo Fisher Scientific, Foster City, CA, USA) and the fragment size was analyzed with a Qsep100 system (Hangzhou Houze Biotechnology Co., Ltd.). After circularization and DNA nanoball generation, the resulting libraries were sequenced with an MGI high-throughput sequencing set (PE150) on MGISEQ-2000/T7 platforms (MGI, Shenzhen, China; Fig. 1B).
Bioinformatics analysis.
After the removal of adapter, low-quality, and low-complexity reads, high-quality genome sequencing data were generated with the fastp software (v. 0.20.0) (61). Clean reads were delivered for the identification of microorganisms through meta-viromic next-generation sequencing analysis. The reads were first mapped against the host sequence database to remove host genome sequences using hisat2 (v. 2.1.0) (62) (Table S2). The MEGAHIT software (v. 1.1.2) (63) was used to assemble reads into a scaffold sequence. A nonredundant set was compiled by clustering nucleic acids with ≥90% identity using CD-HIT (v. 4.8.1, cd-hit-est mode) (64).
Viral sequence identification.
Host-removal filtering reads were used to identify microorganisms with Kraken2 (v. 2.1.2) (62) and the Refseq and NR databases. We used Bracken (v. 2.6) (65) to acquire a highly accurate statistical abundance of species. The BWA software (v. 0.7.17) (63) was used to map the reads into a genome. To understand the distribution of species within the microbial cargo of the mosquitoes and Culicoides, we first examined the overall proportions of nonhost reads that could be assigned to viral, bacterial, eukaryotic, and archaeal organisms. BLAST (v. 2.5.0+, default parameters and E-value threshold of 10−5) was used to align the scaffold sequence with the NT database. Then, the sequences annotated as viruses were extracted and evaluated using CheckV software (v. 1.0.1) (66). To identify endogenous viral sequences, we used NCBI BlASTp searches for eukaryotes, bacteria, and archaea (NCBI taxonomy IDs 2759, 2, 2157) with the default parameters and E-value threshold of 10−5. We also performed online BLASTn alignment for retroviral sequences. Relationships between identified viruses and their hosts were determined based on NCBI taxonomy ID pairs in the Virus-Host database published on March 15, 2022 (https://www.genome.jp/virushostdb/) (67). Viral proteins were predicted using the “meta” pattern in Prodigal (v. 2.6.3) (68).
Viral structure, conserved domain, and phylogenetic analyses.
All proteins were annotated by the NCBI nr database using BLASTp (E-value threshold of 1E-10). The threshold of identity for new sequences was set with amino acid (aa) sequences < 90% or nucleotide sequences < 85%. Conserved domains were searched for using the NCBI CD-Search tool (http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) (69). Maximum-likelihood phylogenetic trees were produced using MEGA 7 (70) with 1,000 bootstrap replicates.
Exploration of viral sequences in dark matter.
To annotate potential viral sequences among unclassified sequences, DeepVirFinder (DVF, v. 1.0) (56) was run with the Python script, and contigs (>500 bp) with scores > 0.9 and P < 0.05 were considered to be viral (Fig. 1C). Sequences annotated as viruses were extracted and evaluated using CheckV (66). Proteins were predicted using the “meta” pattern in Prodigal (v. 2.6.3).
PCR verification.
To verify the presence of dengue virus and West Nile virus (71) in the samples, conventional PCR was performed using the FastKing One-Step RT-PCR kit (TIANGEN Biotech, China). The Baltimore virus classification system was used with BLAST based on the ICGT database of the International Committee on Taxonomy of Viruses (ICTV, https://talk.ictvonline.org/).
Data availability.
The raw sequencing data sets for the current study are available in the NCBI Sequence Read Archive repository, under the Bioproject with accession code PRJNA847438 (NCBI BioProject database, www.ncbi.nlm.nih.gov/bioproject/847438).
ACKNOWLEDGMENTS
We thank Guoping Liu from the Northern Theater Center for Disease Control and Prevention for his support in the identification of midge species in this study.
The opinions expressed by authors contributing to this paper do not necessarily reflect the opinions of the Chinese PLA Center for Disease Control and Prevention or the institutions with which the authors are affiliated.
We declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
X.Y., S.Q., X.L., H.S., C.W., and Y.C. designed the study. S.Q. and H.S. collected samples; X.Y., S.Q., N.Z., J.C., M.J., and F.L. performed the laboratory work on the identification, RNA extraction, and PCR verification; X.Y., S.Q., and X.L. supervised the molecular analyses; X.Y. and S.Q. drafted and revised the manuscript; L.X., Y.C., C.W., H.S., Y.W., and J.G. reviewed and revised the manuscript.
The study was supported by a grant from National Key Research and Development Program of China (2019YFC1200500, 2019YFC1200501), National Natural Science Foundation of China (12171295, 32171508, 31900151).
Footnotes
Supplemental material is available online only.
Contributor Information
Hua Shi, Email: placdc@139.com.
Changjun Wang, Email: science2008@hotmail.com.
Yong Chen, Email: chenyonger@126.com.
Blaire Steven, Connecticut Agricultural Experiment Station.
REFERENCES
- 1.Ferreira-de-Lima VH, Lima-Camara TN. 2018. Natural vertical transmission of dengue virus in Aedes aegypti and Aedes albopictus: a systematic review. Parasit Vectors 11:77. doi: 10.1186/s13071-018-2643-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Yang L, Chen Y, Yan H, Zhang P, Xu X, Tang B, Zhao P, Ren R. 2015. A survey of the 2014 dengue fever epidemic in Guangzhou, China. Emerg Microbes Infect 4:e57. doi: 10.1038/emi.2015.57. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Sukhralia S, Verma M, Gopirajan S, Dhanaraj PS, Lal R, Mehla N, Kant CR. 2019. From dengue to Zika: the wide spread of mosquito-borne arboviruses. Eur J Clin Microbiol Infect Dis 38:3–14. doi: 10.1007/s10096-018-3375-7. [DOI] [PubMed] [Google Scholar]
- 4.Sam SS, Mohamed-Romai-Noor NA, Teoh BT, Hamim ZR, Ng HY, Abd-Jamil J, Khor CS, Hassan SS, Ahmad H, AbuBakar S. 2022. Group IV Getah virus in culex mosquitoes, Malaysia. Emerg Infect Dis 28:475–477. doi: 10.3201/eid2802.204887. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Liu H, Zhang X, Li LX, Shi N, Sun XT, Liu Q, Jin NY, Si XK. 2019. First isolation and characterization of Getah virus from cattle in northeastern China. BMC Vet Res 15:320. doi: 10.1186/s12917-019-2061-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Zhang Y, Lei W, Wang Y, Sui H, Liu B, Li F, He Y, Li Z, Fu S, Wang L, Xu L, Mahe M, Gao Z, Mamutijiang T, Lv Z, Xiang N, Zhou L, Ni D, Liang G, Li Q, Wang H, Feng Z. 2021. Surveillance of West Nile virus infection in Kashgar Region, Xinjiang, China, 2013–2016. Sci Rep 11:14010. doi: 10.1038/s41598-021-93309-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Ciota AT. 2017. West Nile virus and its vectors. Curr Opin Insect Sci 22:28–36. doi: 10.1016/j.cois.2017.05.002. [DOI] [PubMed] [Google Scholar]
- 8.Faizah AN, Kobayashi D, Maekawa Y, Amoa-Bosompem M, Fauziyah S, Mulyatno KC, Subekti S, Rohmah EA, Lusida MI, Mori Y, Miura K, Hirayama K, Isawa H, Sawabe K. 2021. Identification and isolation of Japanese encephalitis virus genotype IV from Culex vishnui collected in Bali, Indonesia in 2019. Am J Trop Med Hyg 105:813–817. doi: 10.4269/ajtmh.20-1554. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Song S, Fu SH, Zhou XX, Zhang JK, Li W, Liu LJ, Li JS, Wang J, Lin Y, Li XL, He Y, Lei WW, Wang HY, Wang B, Lu XQ, Liang GD. 2018. Mosquitoes, midges and related arboviruses in southeast Sichuan province. Zhonghua Liu Xing Bing Xue Za Zhi 39:1381–1386. [DOI] [PubMed] [Google Scholar]
- 10.Duan YL, Li L, Bellis G, Yang ZX, Li HC. 2021. Detection of bluetongue virus in Culicoides spp. in southern Yunnan Province, China. Parasit Vectors 14:68. doi: 10.1186/s13071-020-04518-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Song S, Li Y, Fu S, Lei W, Guo X, Feng Y, Gao X, Li X, Yang Z, Xu Z, He Y, Wang H, Zhou H, Wang B, Lu X, Liang G. 2017. Genome sequencing and phylogenetic analysis of Banna virus (genus Seadornavirus, family Reoviridae) isolated from Culicoides. Sci China Life Sci 60:1372–1382. doi: 10.1007/s11427-017-9190-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Wang HY, Takasaki T, Fu SH, Sun XH, Zhang HL, Wang ZX, Hao ZY, Zhang JK, Tang Q, Kotaki A, Tajima S, Liang XF, Yang WZ, Kurane I, Liang GD. 2007. Molecular epidemiological analysis of Japanese encephalitis virus in China. J Gen Virol 88:885–894. doi: 10.1099/vir.0.82185-0. [DOI] [PubMed] [Google Scholar]
- 13.Nasar F, Erasmus JH, Haddow AD, Tesh RB, Weaver SC. 2015. Eilat virus induces both homologous and heterologous interference. Virology 484:51–58. doi: 10.1016/j.virol.2015.05.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Bolling BG, Olea-Popelka FJ, Eisen L, Moore CG, Blair CD. 2012. Transmission dynamics of an insect-specific flavivirus in a naturally infected Culex pipiens laboratory colony and effects of co-infection on vector competence for West Nile virus. Virology 427:90–97. doi: 10.1016/j.virol.2012.02.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Kent RJ, Crabtree MB, Miller BR. 2010. Transmission of West Nile virus by Culex quinquefasciatus say infected with Culex Flavivirus Izabal. PLoS Negl Trop Dis 4:e671. doi: 10.1371/journal.pntd.0000671. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Kuwata R, Isawa H, Hoshino K, Sasaki T, Kobayashi M, Maeda K, Sawabe K. 2015. Analysis of mosquito-borne Flavivirus superinfection in Culex tritaeniorhynchus (Diptera: Culicidae) cells persistently infected with Culex Flavivirus (Flaviviridae). J Med Entomol 52:222–229. doi: 10.1093/jme/tju059. [DOI] [PubMed] [Google Scholar]
- 17.Talavera S, Birnberg L, Nuñez AI, Muñoz-Muñoz F, Vázquez A, Busquets N. 2018. Culex flavivirus infection in a Culex pipiens mosquito colony and its effects on vector competence for Rift Valley fever phlebovirus. Parasit Vectors 11:310. doi: 10.1186/s13071-018-2887-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Hobson-Peters J, Yam AW, Lu JW, Setoh YX, May FJ, Kurucz N, Walsh S, Prow NA, Davis SS, Weir R, Melville L, Hunt N, Webb RI, Blitvich BJ, Whelan P, Hall RA. 2013. A new insect-specific flavivirus from northern Australia suppresses replication of West Nile virus and Murray Valley encephalitis virus in co-infected mosquito cells. PLoS One 8:e56534. doi: 10.1371/journal.pone.0056534. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Hall-Mendelin S, McLean BJ, Bielefeldt-Ohmann H, Hobson-Peters J, Hall RA, van den Hurk AF. 2016. The insect-specific Palm Creek virus modulates West Nile virus infection in and transmission by Australian mosquitoes. Parasit Vectors 9:414. doi: 10.1186/s13071-016-1683-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Kenney JL, Solberg OD, Langevin SA, Brault AC. 2014. Characterization of a novel insect-specific flavivirus from Brazil: potential for inhibition of infection of arthropod cells with medically important flaviviruses. J Gen Virol 95:2796–2808. doi: 10.1099/vir.0.068031-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Goenaga S, Kenney JL, Duggal NK, Delorey M, Ebel GD, Zhang B, Levis SC, Enria DA, Brault AC. 2015. Potential for co-infection of a mosquito-specific Flavivirus, Nhumirim virus, to block West Nile virus transmission in mosquitoes. Viruses 7:5801–5812. doi: 10.3390/v7112911. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Schultz MJ, Frydman HM, Connor JH. 2018. Dual insect specific virus infection limits Arbovirus replication in Aedes mosquito cells. Virology 518:406–413. doi: 10.1016/j.virol.2018.03.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Öhlund P, Lundén H, Blomström AL. 2019. Insect-specific virus evolution and potential effects on vector competence. Virus Genes 55:127–137. doi: 10.1007/s11262-018-01629-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Thongsripong P, Chandler JA, Kittayapong P, Wilcox BA, Kapan DD, Bennett SN. 2021. Metagenomic shotgun sequencing reveals host species as an important driver of virome composition in mosquitoes. Sci Rep 11:8448. doi: 10.1038/s41598-021-87122-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Xia H, Wang Y, Shi C, Atoni E, Zhao L, Yuan Z. 2018. Comparative metagenomic profiling of viromes associated with four common mosquito species in China. Virol Sin 33:59–66. doi: 10.1007/s12250-018-0015-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Shi C, Zhao L, Atoni E, Zeng W, Hu X, Matthijnssens J, Yuan Z, Xia H. 2020. Stability of the Virome in lab- and field-collected aedes albopictus mosquitoes across different developmental stages and possible core viruses in the publicly available Virome data of Aedes Mosquitoes. mSystems 5. doi: 10.1128/mSystems.00640-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Coatsworth H, Bozic J, Carrillo J, Buckner EA, Rivers AR, Dinglasan RR, Mathias DK. 2022. Intrinsic variation in the vertically transmitted core virome of the mosquito Aedes aegypti. Mol Ecol 31:2545–2561. doi: 10.1111/mec.16412. [DOI] [PubMed] [Google Scholar]
- 28.Konstantinidis K, Dovrolis N, Kouvela A, Kassela K, Rosa Freitas MG, Nearchou A, de Courcy Williams M, Veletza S, Karakasiliotis I. 2022. Defining virus-carrier networks that shape the composition of the mosquito core virome of a local ecosystem. Virus Evol 8:veac036. doi: 10.1093/ve/veac036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Shi C, Beller L, Deboutte W, Yinda KC, Delang L, Vega-Rúa A, Failloux AB, Matthijnssens J. 2019. Stable distinct core eukaryotic viromes in different mosquito species from Guadeloupe, using single mosquito viral metagenomics. Microbiome 7:121. doi: 10.1186/s40168-019-0734-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Žiegytė R, Platonova E, Kinderis E, Mukhin A, Palinauskas V, Bernotienė R. 2021. Culicoides biting midges involved in transmission of haemoproteids. Parasit Vectors 14:27. doi: 10.1186/s13071-020-04516-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Elbers AR, Koenraadt CJ, Meiswinkel R. 2015. Mosquitoes and Culicoides biting midges: vector range and the influence of climate change. Rev Sci Tech 34:123–137. doi: 10.20506/rst.34.1.2349. [DOI] [PubMed] [Google Scholar]
- 32.Konstantinidis K, Bampali M, de Courcy Williams M, Dovrolis N, Gatzidou E, Papazilakis P, Nearchou A, Veletza S, Karakasiliotis I. 2022. Dissecting the species-specific virome in Culicoides of Thrace. Front Microbiol 13:802577. doi: 10.3389/fmicb.2022.802577. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Modha S, Hughes J, Bianco G, Ferguson HM, Helm B, Tong L, Wilkie GS, Kohl A, Schnettler E. 2019. Metaviromics reveals unknown viral diversity in the biting midge Culicoides impunctatus. Viruses 11:865. doi: 10.3390/v11090865. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Temmam S, Monteil-Bouchard S, Robert C, Baudoin JP, Sambou M, Aubadie-Ladrix M, Labas N, Raoult D, Mediannikov O, Desnues C. 2016. Characterization of viral communities of biting midges and identification of novel Thogotovirus species and Rhabdovirus genus. Viruses 8:77. doi: 10.3390/v8030077. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Kobayashi D, Murota K, Faizah AN, Amoa-Bosompem M, Higa Y, Hayashi T, Tsuda Y, Sawabe K, Isawa H. 2020. RNA virome analysis of hematophagous Chironomoidea flies (Diptera: Ceratopogonidae and Simuliidae) collected in Tokyo, Japan. Med Entomol Zool 71:225–243. doi: 10.7601/mez.71.225. [DOI] [Google Scholar]
- 36.Temmam S, Monteil-Bouchard S, Sambou M, Aubadie-Ladrix M, Azza S, Decloquement P, Khalil JY, Baudoin JP, Jardot P, Robert C, La Scola B, Mediannikov OY, Raoult D, Desnues C. 2015. Faustovirus-like asfarvirus in hematophagous biting midges and their vertebrate hosts. Front Microbiol 6:1406. doi: 10.3389/fmicb.2015.01406. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Öhlund P, Hayer J, Lundén H, Hesson JC, Blomström AL. 2019. Viromics reveal a number of novel RNA viruses in Swedish mosquitoes. Viruses 11:1027. doi: 10.3390/v11111027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Atoni E, Wang Y, Karungu S, Waruhiu C, Zohaib A, Obanda V, Agwanda B, Mutua M, Xia H, Yuan Z. 2018. Metagenomic virome analysis of Culex mosquitoes from Kenya and China. Viruses 10:30. doi: 10.3390/v10010030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Hoarfrost A, Aptekmann A, Farfañuk G, Bromberg Y. 2022. Deep learning of a bacterial and archaeal universal language of life enables transfer learning and illuminates microbial dark matter. Nat Commun 13:2606. doi: 10.1038/s41467-022-30070-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.de Almeida JP, Aguiar ER, Armache JN, Olmo RP, Marques JT. 2021. The virome of vector mosquitoes. Curr Opin Virol 49:7–12. doi: 10.1016/j.coviro.2021.04.002. [DOI] [PubMed] [Google Scholar]
- 41.Johnson RM, Rasgon JL. 2018. Densonucleosis viruses ('densoviruses') for mosquito and pathogen control. Curr Opin Insect Sci 28:90–97. doi: 10.1016/j.cois.2018.05.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Jin Y, Yi J. 2016. Study on distribution of mosquito vectors and associated communicable disease pathogens in islands of Zhoushan. China 27:168–171. [Google Scholar]
- 43.Ruo-Xue Z, Yong-Qiang H, Hong-Fei M, Chen-Chen J, Jiang-Bing S. 2018. Investigation on the populations of Culicoides midges in Jintang district of Zhoushan in Zhejiang province. Chin J Vector Biol & Control 29:476–478. [Google Scholar]
- 44.Kapuscinski ML, Bergren NA, Russell BJ, Lee JS, Borland EM, Hartman DA, King DC, Hughes HR, Burkhalter KL, Kading RC, Stenglein MD. 2021. Genomic characterization of 99 viruses from the bunyavirus families Nairoviridae, Peribunyaviridae, and Phenuiviridae, including 35 previously unsequenced viruses. PLoS Pathog 17:e1009315. doi: 10.1371/journal.ppat.1009315. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Huang B, Allcock R, Warrilow D. 2016. Newly characterized arboviruses of northern Australia. Virology Rep 6:11–17. doi: 10.1016/j.virep.2016.01.001. [DOI] [Google Scholar]
- 46.Behar A, Rot A, Lavon Y, Izhaki O, Gur N, Brenner J. 2020. Seasonal and spatial variation in Culicoides community structure and their potential role in transmitting Simbu serogroup viruses in Israel. Transbound Emerg Dis 67:1222–1230. doi: 10.1111/tbed.13457. [DOI] [PubMed] [Google Scholar]
- 47.Brenner J, Tsuda T, Yadin H, Chai D, Stram Y, Kato T. 2004. Serological and clinical evidence of a teratogenic Simbu serogroup virus infection of cattle in Israel, 2001–2003. Vet Ital 40:119–123. [PubMed] [Google Scholar]
- 48.Möhlmann TWR, Oymans J, Wichgers Schreur PJ, Koenraadt CJM, Kortekaas J, Vogels CBF. 2019. Vector competence of biting midges and mosquitoes for Shuni virus. PLoS Negl Trop Dis 13:e0006609. doi: 10.1371/journal.pntd.0006609. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
- 49.Oluwayelu D, Wernike K, Adebiyi A, Cadmus S, Beer M. 2018. Neutralizing antibodies against Simbu serogroup viruses in cattle and sheep, Nigeria, 2012–2014. BMC Vet Res 14:277. doi: 10.1186/s12917-018-1605-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Atoni E, Zhao L, Hu C, Ren N, Wang X, Liang M, Mwaliko C, Yuan Z, Xia H. 2020. A dataset of distribution and diversity of mosquito-associated viruses and their mosquito vectors in China. Sci Data 7:342. doi: 10.1038/s41597-020-00687-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Li CX, Shi M, Tian JH, Lin XD, Kang YJ, Chen LJ, Qin XC, Xu J, Holmes EC, Zhang YZ. 2015. Unprecedented genomic diversity of RNA viruses in arthropods reveals the ancestry of negative-sense RNA viruses. Elife 4. doi: 10.7554/eLife.05378. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Batson J, Dudas G, Haas-Stapleton E, Kistler AL, Li LM, Logan P, Ratnasiri K, Retallack H. 2021. Single mosquito metatranscriptomics identifies vectors, emerging pathogens and reservoirs in one assay. Elife 10. doi: 10.7554/eLife.68353. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Fauver JR, Gendernalik A, Weger-Lucarelli J, Grubaugh ND, Brackney DE, Foy BD, Ebel GD. 2017. The use of xenosurveillance to detect human bacteria, parasites, and viruses in mosquito bloodmeals. Am J Trop Med Hyg 97:324–329. doi: 10.4269/ajtmh.17-0063. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Grubaugh ND, Sharma S, Krajacich BJ, Fakoli LS, III, Bolay FK, Diclaro JW, II, Johnson WE, Ebel GD, Foy BD, Brackney DE. 2015. Xenosurveillance: a novel mosquito-based approach for examining the human-pathogen landscape. PLoS Negl Trop Dis 9:e0003628. doi: 10.1371/journal.pntd.0003628. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Brinkmann A, Nitsche A, Kohl C. 2016. Viral metagenomics on blood-feeding arthropods as a tool for human disease surveillance. Int J Mol Sci 17. doi: 10.3390/ijms17101743. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Ren J, Song K, Deng C, Ahlgren NA, Fuhrman JA, Li Y, Xie X, Poplin R, Sun F. 2020. Identifying viruses from metagenomic data using deep learning. Quant Biol 8:64–77. doi: 10.1007/s40484-019-0187-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Liang G, Bushman FD. 2021. The human virome: assembly, composition and host interactions. Nat Rev Microbiol 19:514–527. doi: 10.1038/s41579-021-00536-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Miao Y, Liu F, Hou T, Liu Y. 2022. Virtifier: a deep learning-based identifier for viral sequences from metagenomes. Bioinformatics 38:1216–1222. doi: 10.1093/bioinformatics/btab845. [DOI] [PubMed] [Google Scholar]
- 59.Ye L, Tan Q, Yi B, Yu J, Guan L, Zhang Z, Li S, Wang Q, Ren Y, Shu J. 2020. Epidemiological survey of the first outbreak of dengue fever in island area, Zhejiang Daishan county. Dis Surveillance 35:307–310. [Google Scholar]
- 60.Delgado-Serra S, Viader M, Ruiz-Arrondo I, Miranda M, Barceló C, Bueno-Marí R, Hernández-Triana LM, Miquel M, Lester K, Jurado-Rivera JA, Paredes-Esquivel C. 2021. Molecular characterization of mosquito diversity in the Balearic Islands. J Med Entomol 58:608–615. doi: 10.1093/jme/tjaa217. [DOI] [PubMed] [Google Scholar]
- 61.Chen S, Zhou Y, Chen Y, Gu J. 2018. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34:i884–i890. doi: 10.1093/bioinformatics/bty560. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Kim D, Paggi JM, Park C, Bennett C, Salzberg SL. 2019. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat Biotechnol 37:907–915. doi: 10.1038/s41587-019-0201-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Li D, Liu CM, Luo R, Sadakane K, Lam TW. 2015. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics 31:1674–1676. doi: 10.1093/bioinformatics/btv033. [DOI] [PubMed] [Google Scholar]
- 64.Fu L, Niu B, Zhu Z, Wu S, Li W. 2012. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics 28:3150–3152. doi: 10.1093/bioinformatics/bts565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Lu J, Breitwieser FP, Thielen PM, Salzberg SL. 2016. Bracken: estimating species abundance in metagenomics data. bioRxiv.
- 66.Nayfach S, Camargo AP, Schulz F, Eloe-Fadrosh E, Roux S, Kyrpides NC. 2021. CheckV assesses the quality and completeness of metagenome-assembled viral genomes. Nat Biotechnol 39:578–585. doi: 10.1038/s41587-020-00774-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Mihara T, Nishimura Y, Shimizu Y, Nishiyama H, Yoshikawa G, Uehara H, Hingamp P, Goto S, Ogata H. 2016. Linking virus genomes with host taxonomy. Viruses 8:66. doi: 10.3390/v8030066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Hyatt D, Chen GL, Locascio PF, Land ML, Larimer FW, Hauser LJ. 2010. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11:119. doi: 10.1186/1471-2105-11-119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Lu S, Wang J, Chitsaz F, Derbyshire MK, Geer RC, Gonzales NR, Gwadz M, Hurwitz DI, Marchler GH, Song JS, Thanki N, Yamashita RA, Yang M, Zhang D, Zheng C, Lanczycki CJ, Marchler-Bauer A. 2020. CDD/SPARCLE: the conserved domain database in 2020. Nucleic Acids Res 48:D265–D268. doi: 10.1093/nar/gkz991. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Kumar S, Stecher G, Tamura K. 2016. MEGA7: molecular evolutionary genetics analysis Version 7.0 for bigger datasets. Mol Biol Evol 33:1870–1874. doi: 10.1093/molbev/msw054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Prajapati S, Napit R, Bastola A, Rauniyar R, Shrestha S, Lamsal M, Adhikari A, Bhandari P, Yadav SR, Manandhar KD. 2020. Molecular phylogeny and distribution of dengue virus serotypes circulating in Nepal in 2017. PLoS One 15:e0234929. doi: 10.1371/journal.pone.0234929. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Table S1. Download spectrum.02688-22-s0001.xlsx, XLSX file, 0.01 MB (9.3KB, xlsx)
Table S2. Download spectrum.02688-22-s0002.xlsx, XLSX file, 0.01 MB (9.6KB, xlsx)
Table S3. Download spectrum.02688-22-s0003.xlsx, XLSX file, 0.6 MB (581.8KB, xlsx)
Table S4a. Download spectrum.02688-22-s0004.xlsx, XLSX file, 0.3 MB (268.6KB, xlsx)
Table S4b. Download spectrum.02688-22-s0005.xlsx, XLSX file, 0.2 MB (262.6KB, xlsx)
Table S5. Download spectrum.02688-22-s0006.xlsx, XLSX file, 0.2 MB (188KB, xlsx)
Table S6. Download spectrum.02688-22-s0007.xlsx, XLSX file, 0.01 MB (12.2KB, xlsx)
Fig. S1 and S2. Download spectrum.02688-22-s0008.pdf, PDF file, 0.1 MB (140.7KB, pdf)
Data Availability Statement
The raw sequencing data sets for the current study are available in the NCBI Sequence Read Archive repository, under the Bioproject with accession code PRJNA847438 (NCBI BioProject database, www.ncbi.nlm.nih.gov/bioproject/847438).






