Abstract
Zika virus is an emerging human pathogen of great concern due to putative links to microcephaly and Guillain-Barre syndrome. Here, we report the complete genomes, including the 5′ and 3′ untranslated regions, of five Zika virus isolates, one from the Asian lineage and four from the African lineage.
GENOME ANNOUNCEMENT
Zika virus (ZIKV) is an emerging human pathogen of great concern due to putative links to severe birth defects (microcephaly [1]) and a rare autoimmune syndrome (Guillain-Barre [2]). The virus was first detected in a sentinel rhesus macaque in 1947 while monitoring for enzootic yellow fever virus circulation in the Zika Forest, Uganda (3). Since then, ZIKV has been detected throughout much of Sub-Saharan Africa and has spread across the Pacific Ocean to Central and South America via Southeast Asia (4, 5).
ZIKV is an arbovirus believed to be vectored mainly by mosquitoes in the genus Aedes. ZIKV belongs to the Spondweni serocomplex within the genus Flavivirus and the family Flaviviridae. Viruses in the genus Flavivirus have single-stranded positive-sense RNA genomes of ~11 kb in size (6). These genomes encode a single polyprotein flanked by 5′ and 3′ untranslated regions (UTRs).
Here, we report five complete ZIKV genomes, including the full polyprotein and the 5′ and 3′ UTRs. Four of these viruses were isolated in Africa (Senegal and Uganda) and belong to the African lineage (4). One virus (FSS13025) was isolated in Cambodia and belongs to the Asian lineage (4). All five isolates were passaged multiple times in cell culture (Table 1). Several distinctly passaged versions of strain MR-766 have been previously published, including two complete genomes (GenBank: AY632535, LC002520). A coding-complete version (i.e., missing the 5′ and 3′ UTRs [7]) of the genome was published for an earlier passage of FSS13025 (GenBank: JN860885).
TABLE 1 .
Isolate | Source | Location | Date | Passage historya | Genome/ORFb size (nucleotides) | Accession no. |
---|---|---|---|---|---|---|
FSS13025 | Homo sapiens | Cambodia | 8/8/2010 | Vero ×4 | 10,807/10,272 | KU955593 |
DAK AR D 41671 | Aedes taylori | Senegal | 12/14/1984 | AP61 ×1, C6/36 ×1, Vero ×3 | 10,806/10,272 | KU955595 |
DAK AR D 41662 | Aedes taylori | Senegal | 12/6/1984 | AP61 ×1, C6/36 ×1, Vero ×3 | 10,806/10,272 | KU955592 |
DAK AR D 41525 | Aedes africanus | Senegal | 11/20/1984 | AP61 ×1, C6/36 ×1, Vero ×3 | 10,806/10,272 | KU955591 |
MR-766 | Macaca mulatta | Uganda | 4/1947 | SM ×150, Vero ×2 | 10,795/10,260 | KU955594 |
AP61, Aedes pseudoscutellaris cells; C6/36, Aedes albopictus cells; SM, suckling mouse; Vero, African green monkey kidney cells.
ORF, open reading frame.
Isolates were sequenced at the U.S. Army Medical Research Institute of Infectious Diseases and the University of Texas Medical Branch using Illumina NextSeq/MiSeq. RNA was extracted using the Direct-zol RNA extraction kit (Zymo), converted to cDNA using SuperScript III (Invitrogen) and amplified using sequence-independent single-primer amplification (8) combined with primers for rapid amplification of cDNA ends (9). Adaptors and primers were clipped using Cutadapt version 1.21 (10), and low-quality reads/bases were filtered using Prinseq-lite version 0.20.4 (11). Reads were aligned to a reference genome (chosen separately for each isolate based on nucleotide-level similarity between preliminary de novo contigs and publically available ZIKV genomes) using Bowtie2 (12), duplicates were removed with Picard (http://broadinstitute.github.io/picard), and a new consensus was generated using a combination of SAMtools version 0.1.18 (13) and custom scripts. When necessary, the 5′ and 3′ UTRs were then built out beyond the reference genome using custom scripts that iterate the reference assembly process with the addition of ambiguous characters (N’s) onto the 5′ and 3′ ends of the reference.
The assembled, complete genomes (7) ranged in size from 10,795 to 10,807 nucleotides; the 5′ UTRs were 106 to 107 nucleotides and the 3′ UTRs were 428 to 429 nucleotides, respectively. The MR-766 stock we sequenced contained a 12-nucleotide in-frame deletion in the polyprotein, which it shared with AY632535, but is not present in LC002520. However, our sequence does not share the five additional single nucleotide insertions/deletions present in AY632535. All sequenced isolates contained the conserved, dinucleotide complimentary terminal sequences that are characteristic of the Flavivirus genus (5′ – AG … CU – 3′ [6]).
Nucleotide sequence accession numbers.
Genome accession numbers to public databases are listed in Table 1.
ACKNOWLEDGMENTS
We thank Bradley Pfeffer for his help submitting the genomes to GenBank. Opinions, interpretations, conclusions, and recommendations are those of the authors and are not necessarily endorsed by the U.S. Army. The views expressed in this article are those of the authors and do not necessarily reflect the official policy or position of the Department of the Navy, Department of Defense, or the U.S. Government. Some authors of this work are military service members or employees of the U.S. Government. This work was prepared as part of their official duties.
The work at USAMRIID was funded by DTRA project CB10246. Some of the genomes referenced in this publication were also sequenced through funding provided by DHS S&T through contract no. HSHQDC-13-C-B0009 “Capturing Global Biodiversity of Pathogens by Whole Genome Sequencing.”
Footnotes
Citation Ladner JT, Wiley MR, Prieto K, Yasuda CY, Nagle E, Kasper MR, Reyes D, Vasilakis N, Heang V, Weaver SC, Haddow A, Tesh RB, Sovann L, Palacios G. 2016. Complete genome sequences of five Zika virus isolates. Genome Announc 4(3):e00377-16. doi:10.1128/genomeA.00377-16.
REFERENCES
- 1.Mlakar J, Korva M, Tul N, Popović M, Poljšak-Prijatelj M, Mraz J, Kolenc M, Resman Rus K, Vesnaver Vipotnik T, Fabjan Vodušek V, Vizjak A, Pižem J, Petrovec M, Avšič Županc T. Zika virus associated with microcephaly. N Engl J Med 374:951–958. doi: 10.1056/NEJMoa1600651. [DOI] [PubMed] [Google Scholar]
- 2.Oehler E, Watrin L, Larre P, Leparc-Goffart I, Lastere S, Valour F, Baudouin L, Mallet H, Musso D, Ghawche F. 2014. Zika virus infection complicated by Guillain-Barre syndrome—case report, French Polynesia, December 2013. Euro Surveill 19:20720. doi: 10.2807/1560-7917.ES2014.19.9.20720. [DOI] [PubMed] [Google Scholar]
- 3.Dick GWA, Kitchen SF, Haddow AJ. 1952. Zika virus (I). Isolations and serological specificity. Transactions R Soc Trop Med Hyg 46:509–520. doi: 10.1016/0035-9203(52)90042-4. [DOI] [PubMed] [Google Scholar]
- 4.Haddow AD, Schuh AJ, Yasuda CY, Kasper MR, Heang V, Huy R, Guzman H, Tesh RB, Weaver SC. 2012. Genetic characterization of Zika virus strains: geographic expansion of the Asian lineage. PLoS Negl Trop Dis 6:e1477. doi: 10.1371/journal.pntd.0001477. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Ioos S, Mallet H-P, Leparc Goffart I, Gauthier V, Cardoso T, Herida M. 2014. Current Zika virus epidemiology and recent epidemics. Med Mal Infectieuses 44:302–307. doi: 10.1016/j.medmal.2014.04.008. [DOI] [PubMed] [Google Scholar]
- 6.King AM, Adams MJ, Lefkowitz EJ. 2011. Virus taxonomy: classification and nomenclature of viruses: ninth report of the International Committee on Taxonomy of Viruses, vol 9 Academic Press, London. [Google Scholar]
- 7.Ladner JT, Beitzel B, Chain PS, Davenport MG, Donaldson EF, Frieman M, Kugelman JR, Kuhn JH, O’Rear J, Sabeti PC, Wentworth DE, Wiley MR, Yu GY, Threat Characterization Consortium, Sozhamannan S, Bradburne C, Palacios G. 2014. Standards for sequencing viral genomes in the era of high-throughput sequencing. mBio 5:e01360-01314. doi: 10.1128/mBio.01360-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Djikeng A, Halpin R, Kuzmickas R, Depasse J, Feldblyum J, Sengamalay N, Afonso C, Zhang X, Anderson NG, Ghedin E, Spiro DJ. 2008. Viral genome sequencing by random priming methods. BMC Genomics 9:5. doi: 10.1186/1471-2164-9-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Leguia M, Loyola S, Rios J, Juarez D, Guevara C, Silva M, Prieto K, Wiley M, Kasper MR, Palacios G, Bausch DG. 2015. Full genomic characterization of a Saffold virus isolated in Peru. Pathogens 4:816–825. doi: 10.3390/pathogens4040816. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Martin M. 2011. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J 17:10–12. doi: 10.14806/ej.17.1.200. [DOI] [Google Scholar]
- 11.Schmieder R, Edwards R. 2011. Quality control and preprocessing of metagenomic datasets. BioInformatics 27:863–864. doi: 10.1093/bioinformatics/btr026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Langmead B, Salzberg SL. 2012. Fast gapped-read alignment with Bowtie 2. Nat Methods 9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, Genome Project Data Processing S . 2009. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]