ABSTRACT
We present a draft genome of a novel rhabdovirus, called Grenada mosquito rhabdovirus 1 (GMRV1), with homology to Wuhan mosquito virus 9 (WMV9) (NCBI reference sequence NC_031303), isolated from Deinocerites mosquitoes. The genome has a length of 14,420 nucleotides and encodes five open reading frames.
GENOME ANNOUNCEMENT
Members of the Rhabdoviridae family are enveloped negative-sense single-stranded RNA viruses. They are known to infect a broad range of hosts, including plants, mammals, and insects (1). Rhabdovirus genomes contain five key structural proteins, a nucleocapsid protein N, phosphoprotein P, matrix protein M, glycoprotein G, and polymerase L, as well as 3′ leader and 5′ trailer sequences. The sizes of the genomes range from 11 kb to 15 kb.
Grenada mosquito rhabdovirus 1 (GMRV1) was recovered from a pool of 29 female Deinocerites sp. mosquitoes captured near Black Bay Beach in St. John Parish on the island of Grenada (12.12°N, 61.75°W) in March 2015. RNA was extracted from the pool of mosquitoes with TRIzol LS reagent (Ambion) following the manufacturer's instructions. rRNA (human-mouse-rat) was depleted from the sample, and a next-generation sequencing (NGS) library was prepared with the IntegenX RNA-Seq directional kit for Illumina. The library was sequenced at the Harvard Medical School BioPolymers Facility using HiSeq rapid sequencing to generate 100-bp paired-end reads.
We used Pickaxe, an in-house virus discovery software, on the set of 287 million sequence reads (2). After subtraction of sequences with an alignment to Culicidae and human sequences in GenBank, 90 million nonhost reads remained. Using CLC Assembly Cell (Qiagen), Pickaxe assembled the 78 million nonhost reads into 34,956 contigs that were subsequently annotated using a BLAST+/RAPSearch pipeline (3, 4).
A 14,420-nucleotide (nt) contig was initially identified as a rhabdovirus from a RAPSearch alignment with 46% identity to the RNA-dependent RNA polymerase (RdRp) of Wuhan mosquito virus 9 (WMV9) (NCBI reference sequence NC_031303). GMRV1 contains five open reading frames (ORFs), a 151-nt 3′ leader, and a 356-nt 5′ trailer. The 3′ leader and 5′ trailer show no complementarity. ORFs 1 to 5 are located at nt 152 to 1699, nt 1847 to 3256, nt 3375 to 4274, nt 4454 to 6397, and nt 7192 to 14064, respectively. We predict these ORFs to encode the N, P, M, G, and L proteins based on BLASTP searches and the genomic layout of Rhabdoviridae. The nucleoprotein N shares 37% identity with ORF1 in WMV9, the matrix protein M shares 22% identity with ORF3 in WMV9, the glycoprotein G shares 30% identity with the glycoprotein in WMV9, and the polymerase protein L shares 47% identity with the RdRp in WMV9. A BLASTP search of the predicted phosphoprotein P did not produce any homologous virus proteins.
RNA virus phylogeny was recently expanded through deep sequencing of metagenomes (5, 6). Given the large phylogenetic distance of GMRV1 to WMV9 and the limited number of other viral matches to its ORFs, this finding expands the known diversity of negative-sense single-stranded RNA [ssRNA(−)] viruses.
Accession number(s).
The genome sequence of GMRV1 was deposited in GenBank under the accession number MG385079.
ACKNOWLEDGMENTS
This research was supported by funds from Microsoft Research.
We thank the following institutions for their support: St. George's University School of Medicine Department of Microbiology, School of Arts & Sciences, School of Veterinary Medicine, WINDREF, Ministry of Health Grenada, and Ministry of Agriculture Grenada. We also thank those involved in site identification, sample collection and processing, Grant Lambert, Ravindra Naraine, Andrea Easter-Pilcher, Brian Pilcher, Gary Brown, Trevor Noel, Randall Waechter, Calum Macpherson, Makeda Matthew, Avi Bahadoor-Yetman, Shanice McKain, and Karla Farmer.
Footnotes
Citation Xu CL, Cantalupo PG, Sáenz-Robles MT, Baldwin A, Fitzpatrick D, Norris DE, Jackson E, Pipas JM. 2018. Draft genome sequence of a novel rhabdovirus isolated from Deinocerites mosquitoes. Genome Announc 6:e01438-17. https://doi.org/10.1128/genomeA.01438-17.
REFERENCES
- 1.Dietzgen RG, Calisher C, Kurath HG, Kuzmin IV, Rodriguez LL, Stone DM, Tesh R, Tordo BN, Walker P, Wetzel JT, Whitfield AE. 2011. Family Rhabdoviridae, p 686–714. In King AMQ, Adams MJ, Carstens EB, Lefkowitz EJ (ed), Virus taxonomy, ninth report of the international committee on taxonomy of viruses. Elsevier, Oxford, United Kingdom. [Google Scholar]
- 2.Cantalupo PG, Katz JP, Pipas JM. 2018. Viral sequences in human cancer. Virology 513:208–216. doi: 10.1016/j.virol.2017.10.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. 2009. BLAST+: architecture and applications. BMC Bioinformatics 10:421. doi: 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Zhao Y, Tang H, Ye Y. 2012. RAPSearch2: a fast and memory-efficient protein similarity search tool for next-generation sequencing data. Bioinformatics 28:125–126. doi: 10.1093/bioinformatics/btr595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Shi M, Lin XD, Tian JH, Chen LJ, Chen X, Li CX, Qin XC, Li J, Cao JP, Eden JS, Buchmann J, Wang W, Xu J, Holmes EC, Zhang YZ. 2016. Redefining the invertebrate RNA virosphere. Nature 540:539–543. doi: 10.1038/nature20167. [DOI] [PubMed] [Google Scholar]
- 6.Li CX, Shi M, Tian JH, Lin XD, Kang YJ, Chen LJ, Qin XC, Xu J, Holmes EC, Zhang YZ. 2015. Unprecedented genomic diversity of RNA viruses in arthropods reveals the ancestry of negative-sense RNA viruses. eLife 4:e05378. doi: 10.7554/eLife.05378. [DOI] [PMC free article] [PubMed] [Google Scholar]
