Skip to main content
Brazilian Journal of Microbiology logoLink to Brazilian Journal of Microbiology
. 2020 Aug 4;51(4):1601–1605. doi: 10.1007/s42770-020-00349-3

Insights on Zika virus envelope gene conservation in American outbreaks

Mateus V Thomazella 1,, Quirino A Lima Neto 1, Francisco F Duarte Junior 1, Fernando A Jorge 1, Déborah de Castro Moreira 1, Amauri D Leal Junior 1, Mayra M Presibella 2, Irina N Riediger 2, Rafael A da Silva 3,4, Isabel Maria V G de Carvalho 3,4, Dennis A Bertolini 1
PMCID: PMC7688768  PMID: 32749573

Abstract

Phylogenetic studies with Zika virus (ZIKV) have been conducted in Brazil. In this study, we sequenced 8 new sequences of the ZIKV envelope (E) gene from strains of cases from the Paraná and Mato Grosso do Sul states in 2016. A low phylogenetic signal was observed, with more than 40% of unresolved quartets, and the Maximum Likelihood Tree grouped all sequences in the Brazilian branches within the Asian genotype. In addition, a Shannon entropy analysis was conducted, showing a high stability in the E protein through the ZIKV polyprotein. Taken together, these results suggest a high degree of conservation in the ZIKV E gene from the recent American outbreaks.

Electronic supplementary material

The online version of this article (10.1007/s42770-020-00349-3) contains supplementary material, which is available to authorized users.

Keywords: Zika virus, Phylogenetic study, Entropy, Brazil


Between the years of 2015 and 2016, Brazil was surprised by an outbreak of an exanthematous disease with dengue-like symptoms [13]. Clinical investigations led to the finding that the causative agent was Zika virus (ZIKV), and over the course of the outbreak, the association with serious sequelae (Congenital Zika Syndrome (CZS) in children whose mothers were infected during pregnancy, and Guillain-Barre) was established [4]. Since then, the number of new ZIKV cases has decreased substantially, and by the end of 2019, the incidence of confirmed cases of ZIKV infection was only 5.1 cases per 100,000 inhabitants in Brazil [5].

Phylogenetic analyses have shown the occurrence of two main genotypes of ZIKV: African and Asian [6, 7]. Since 2015, ZIKV sequences obtained from outbreaks that occurred in different regions of Brazil were clustered within the Asian genotype [2, 3]. Here, using new ZIKV sequences from strains that occurred in the Paraná and Mato Grosso do Sul states, we presented a phylogenetic analysis based on the E gene.

Serum samples from patients were provided by Laboratório Central do Estado do Paraná (LACEN-PR). The diagnosis of ZIKV infection was defined by the qRT-PCR method, using a set of primers specific for E gene, as previously described [7]. A cut-off was established in a cycle threshold (CT) of 40. Total RNA was extracted from 140 μL of serum using the QIAmp® Viral RNA Mini kit (Qiagen, Courtaboeuf, France), following the manufacturer’s instructions, being immediately used for cDNA synthesis (Superscript III, Invitrogen, Carlsbad, CA) according to the manufacturer’s protocol. The use of human samples was approved by the Permanent Research Ethics Committee of the State University of Maringá (Protocol no. 1.751.311).

In order to sequence the entire ZIKV E gene, two pairs of primers were designed: ZVF1/ZVF1.2 and ZVF2/ZVR2.1 (Online resource – Table S1). Two PCR reactions were conducted: The first reaction was prepared to 20 μL, with 1X Buffer, 0.2 mM of each dNTP (10 mM dNTP mix), 1.5-mM MgCl2, 1 U of Taq DNA polymerase (Invitrogen, Carlsbad, CA), primers ZVF1 and ZVR2.1 (0.5 μM each), and 4 μL of ZIKV cDNA. The conditions were initial denaturation at 94 °C for 5 min, 30 cycles of 94 °C for 1 min, 60 °C for 30 s, 72 °C for 2 min, followed by final extension at 72 °C for 10 min. For the second reaction, two semi-nested PCR were performed. Both were prepared to 20 μL, with 1X Buffer, 0.062 mM of each dNTP (10 mM dNTP mix), and 1.5-mM MgCl2; primers ZVF1 and ZVR1.2 (0.5 μM each) and primers ZVF2 and ZVR2.1 (0.5 μM each) were used for the two fragments, respectively. Further, 1 U of Taq DNA polymerase (Invitrogen, Carlsbad, CA) and 2 μL of the product obtained in the first PCR were used. The PCR conditions were initial denaturation at 94 °C for 5 min, 35 cycles of 94 °C for 1 min, 58 °C for 30 s, 72 °C for 1 min and 30 s, followed by final extension at 72 °C for 10 min. The PCR products were sequenced by ACTGene Análises Moleculares Ltda. (Centro de Biotecnologia, UFRGS, Porto Alegre, RS, Brazil) using the ABI Prism® BigDye™ Terminator v3.1 (Applied Biosystems, Foster City, CA) and the AB 3500 Genetic Analyzer (Applied Biosystems, Foster City, CA). All ZIKV E gene sequences were assembled by the Phred/Phrap program and contigs were formed for each sequence [8, 9].

A data set with the ZIKV E gene sequences obtained in this study and 167 complete ZIKV sequences recovered from Genbank, representing different regions and genotypes, was created (Online resource – Table S2). The softwares MUSCLE [10] and jModelTest 2.1 [11] were used to align and determine the best-fit model of nucleotide substitution, respectively, and sequences were trimmed in order to keep the E gene sequence for the whole dataset. The phylogenetic signal of the ZIKV E gene was determined using the Tree-Puzzle 5.3 software [12, 13] and a phylogenetic tree by the Maximum Likelihood Method was constructed using the PhyML package as implemented in SEAVIEW software [14], with 1000 bootstrap pseudo-replicates [15]. All sequences were deposited in GenBank under the accession numbers MH379777 to MH379784 (Envelope gene).

Of the 33 samples provided by LACEN-PR, only 8 were sequenced, 4 from the state of Mato Grosso do Sul and 4 from the state of Paraná, Brazil. One of possible reasons for our poor recovery of samples that could be sequenced was the low viral load of ZIKV in the peripheral blood of patients (Online resource – Table S3). All sequences obtained in this study were grouped in the Brazilian branches within the Asian genotype (Fig. 1).

Fig. 1.

Fig. 1

Maximum Likelihood Phylogenetic Tree using aligned ZIKV E gene sequences and obtained after 1000 pseudo bootstrap replicates. The GTR model was used with a variation of four Gamma distribution categories. The bootstrap value is above the clades that separate ZIKV genotypes. Clades highlighted with red color represent the East African genotype, clades highlighted in green represent the West African genotype, and clades highlighted in white the Asian/Americas genotype. The sequences obtained in this study, corresponding to the states of Paraná and Mato Grosso do Sul, are highlighted in red and yellow oblongs, respectively

The quartets analysis showed a low phylogenetic signal, with more than 40% of unresolved quartets (Online resource – Fig. S1). An informational entropy analysis of Shannon, based on the entire ZIKV polyprotein, was performed in BioEdit 7.2.0 software [16] using the same sequences of the phylogenetic analysis (Online resource - Table S2) and excluding sequences of E gene obtained in this study. Entropy analysis was used in order to evaluate the variability and complexity in an amino acid position, being calculated in bits using the Shannon formula: H(l) = − Σf(b,l)log(base 2)f(b,l), where H is the entropy at the position l and f(b,l) is the frequency with which residue b is found at the position l. As a result, entropy in E protein sequence was low, with few peaks in some amino acids positions, with higher entropy values in nonstructural regions (Fig. 2).

Fig. 2.

Fig. 2

Informational entropy analysis of Shannon based on ZIKV polyprotein implemented on BioEdit 7.2.0. The entropy analysis was conducted based on the position of each amino acid in complete coding region of ZIKV polyprotein. The NS5 region shows the higher entropy values and E gene shows a low entropy

Different from our results, Faye et al. (2014), in a study with 43 ZIKV sequences sampled from 1947 to 2007 in Africa and Asia, observed a high phylogenetic signal in the ZIKV E gene [17]. However, it is important to emphasize that in our report, the sequences analyzed were from 2013 to 2016, which is a short period in comparison with the time of ZIKV occurrence in Africa and Asia. Therefore, the possibility of cumulating nucleotide substitutions in our strains is smaller than in the African strains analyzed by Faye et al. (2014) [17]. Besides, it is important to mention that ZIKV has reached the Americas recently and has affected an immunologically naïve population [18, 19].

In addition, it is known that several sites in the ZIKV E gene are under strong negative selection and possible nonsynonymous substitutions could not be fixed in the population [17, 20, 21]. In fact, the dual-host cycling of arboviruses may impact in the occurrence of nonsynonymous mutations in important genes, which means that the majority of those mutations are deleterious and removed by purifying selection [22]. Also, the short time of ZIKV occurrence in the Americas may reflect in a lack of positive selection in the E protein [17]. Finally, taking into account all discussion above and the establishment of a single lineage in Brazil, the potential role for founder effect should also be considered a reason for the low genetic diversity of ZIKV E gene observed in our study [23, 24]. Moreover, despite the short period of time analyzed in this study, it is possible that American ZIKV strains were not under a strong natural pressure, which led to a conservative E gene.

In conclusion, the sequences obtained in this study were grouped in the Brazilian branches, within the American clade and the Asian genotype. Altogether, our results showed a high degree of conservation on ZIKV E gene in the recent American outbreak strains, giving a good perspective to the development of new vaccines, treatments, and diagnostic methods.

Electronic supplementary material

ESM 1 (80.4KB, docx)

(DOCX 80 kb)

ESM 2 (245.6KB, jpg)

(JPG 245 kb)

Acknowledgments

We thank the Laboratório Central do Estado do Paraná, Brazil, for supplying the samples used in this study.

Funding information

This work was supported by Fundação Araucária de Apoio ao Desenvolvimento Científico e Tecnológico do Estado do Paraná, Brazil (Grant. no. 040/2016).

Compliance with ethical standards

Conflict of interest

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Ethical approval

The use of human samples was approved by the Permanent Research Ethics Committee of the State University of Maringá (Protocol no. 1.751.311).

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Campos GS, Bandeira AC, Sardi SI. Zika virus outbreak, Bahia, Brazil. Emerg Infect Dis. 2015;21:1885–1886. doi: 10.3201/eid2110.150847. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Pessoa R, Patriota JV, Lourdes de Souza M, et al. Investigation into an outbreak of dengue-like illness in Pernambuco, Brazil, revealed a cocirculation of Zika, chikungunya, and dengue virus type 1. Medicine (Baltimore) 2016;95:e3201. doi: 10.1097/MD.0000000000003201. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Zanluca C, de Melo VCA, Mosimann ALP, et al. First report of autochthonous transmission of Zika virus in Brazil. Mem Inst Oswaldo Cruz. 2015;110:569–572. doi: 10.1590/0074-02760150192. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Cao-Lormeau V-M, Blake A, Mons S, Lastère S, Roche C, Vanhomwegen J, Dub T, Baudouin L, Teissier A, Larre P, Vial AL, Decam C, Choumet V, Halstead SK, Willison HJ, Musset L, Manuguerra JC, Despres P, Fournier E, Mallet HP, Musso D, Fontanet A, Neil J, Ghawché F. Guillain-Barré syndrome outbreak associated with Zika virus infection in French Polynesia: a case-control study. Lancet. 2016;387:1531–1539. doi: 10.1016/S0140-6736(16)00562-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Brasil. Ministério da Saúde. Secretaria de Vigilância em Saúde. Monitoramento dos casos de arboviroses urbanas transmitidas pelo Aedes (dengue, chikungunya e Zika), Semanas Epidemiológicas 01 a 52. Brasília, v51, n.2, Jan. 2020
  • 6.Haddow AD, Schuh AJ, Yasuda CY, Kasper MR, Heang V, Huy R, Guzman H, Tesh RB, Weaver SC. Genetic characterization of Zika virus strains: geographic expansion of the Asian lineage. PLoS Negl Trop Dis. 2012;6:e1477. doi: 10.1371/journal.pntd.0001477. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Lanciotti RS, Kosoy OL, Laven JJ, Velez JO, Lambert AJ, Johnson AJ, Stanfield SM, Duffy MR. Genetic and serologic properties of Zika virus associated with an epidemic, Yap State, Micronesia, 2007. Emerg Infect Dis. 2008;14:1232–1239. doi: 10.3201/eid1408.080287. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Ewing B, Hillier L, Wendl MC, Green P. Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res. 1998;8:175–185. doi: 10.1101/gr.8.3.175. [DOI] [PubMed] [Google Scholar]
  • 9.Ewing B, Green P. Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res. 1998;8:186–194. doi: 10.1101/gr.8.3.186. [DOI] [PubMed] [Google Scholar]
  • 10.Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Darriba D, Taboada GL, Doallo R, Posada D. jModelTest 2: more models, new heuristics and parallel computing. Nat Methods. 2012;9:772. doi: 10.1038/nmeth.2109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Schmidt HA, Strimmer K, Vingron M, von Haeseler A. TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics. 2002;18:502–504. doi: 10.1093/bioinformatics/18.3.502. [DOI] [PubMed] [Google Scholar]
  • 13.Strimmer K, von Haeseler A. Likelihood-mapping: a simple method to visualize phylogenetic content of a sequence alignment. Proc Natl Acad Sci U S A. 1997;94:6815–6819. doi: 10.1073/pnas.94.13.6815. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Galtier N, Gouy M, Gautier C. SEAVIEW and PHYLO_WIN: two graphic tools for sequence alignment and molecular phylogeny. Comput Appl Biosci. 1996;12:543–548. doi: 10.1093/bioinformatics/12.6.543. [DOI] [PubMed] [Google Scholar]
  • 15.Efron B, Halloran E, Holmes S. Bootstrap confidence levels for phylogenetic trees. Proc Natl Acad Sci U S A. 1996;93:13429–13434. doi: 10.1073/pnas.93.23.13429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Hall TA. Nucleic acids symposium series. London: Information Retrieval Ltd.; 1999. BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT; pp. 95–98. [Google Scholar]
  • 17.Faye O, Freire CCM, Iamarino A, Faye O, de Oliveira JVC, Diallo M, Zanotto PMA, Sall AA. Molecular evolution of Zika virus during its emergence in the 20(th) century. PLoS Negl Trop Dis. 2014;8:e2636. doi: 10.1371/journal.pntd.0002636. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Faria NR, Quick J, Claro IM, Thézé J, de Jesus JG, Giovanetti M, Kraemer MUG, Hill SC, Black A, da Costa AC, Franco LC, Silva SP, Wu CH, Raghwani J, Cauchemez S, du Plessis L, Verotti MP, de Oliveira WK, Carmo EH, Coelho GE, Santelli ACFS, Vinhal LC, Henriques CM, Simpson JT, Loose M, Andersen KG, Grubaugh ND, Somasekar S, Chiu CY, Muñoz-Medina JE, Gonzalez-Bonilla CR, Arias CF, Lewis-Ximenez LL, Baylis SA, Chieppe AO, Aguiar SF, Fernandes CA, Lemos PS, Nascimento BLS, Monteiro HAO, Siqueira IC, de Queiroz MG, de Souza TR, Bezerra JF, Lemos MR, Pereira GF, Loudal D, Moura LC, Dhalia R, França RF, Magalhães T, Marques ET, Jr, Jaenisch T, Wallau GL, de Lima MC, Nascimento V, de Cerqueira EM, de Lima MM, Mascarenhas DL, Neto JPM, Levin AS, Tozetto-Mendoza TR, Fonseca SN, Mendes-Correa MC, Milagres FP, Segurado A, Holmes EC, Rambaut A, Bedford T, Nunes MRT, Sabino EC, Alcantara LCJ, Loman NJ, Pybus OG. Establishment and cryptic transmission of Zika virus in Brazil and the Americas. Nature. 2017;546:406–410. doi: 10.1038/nature22401. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Metsky HC, Matranga CB, Wohl S, Schaffner SF, Freije CA, Winnicki SM, West K, Qu J, Baniecki ML, Gladden-Young A, Lin AE, Tomkins-Tinch CH, Ye SH, Park DJ, Luo CY, Barnes KG, Shah RR, Chak B, Barbosa-Lima G, Delatorre E, Vieira YR, Paul LM, Tan AL, Barcellona CM, Porcelli MC, Vasquez C, Cannons AC, Cone MR, Hogan KN, Kopp EW, Anzinger JJ, Garcia KF, Parham LA, Ramírez RMG, Montoya MCM, Rojas DP, Brown CM, Hennigan S, Sabina B, Scotland S, Gangavarapu K, Grubaugh ND, Oliveira G, Robles-Sikisaka R, Rambaut A, Gehrke L, Smole S, Halloran ME, Villar L, Mattar S, Lorenzana I, Cerbino-Neto J, Valim C, Degrave W, Bozza PT, Gnirke A, Andersen KG, Isern S, Michael SF, Bozza FA, Souza TML, Bosch I, Yozwiak NL, MacInnis BL, Sabeti PC. Zika virus evolution and spread in the Americas. Nature. 2017;546:411–415. doi: 10.1038/nature22402. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Angeletti S, Lo Presti A, Giovanetti M, et al. Nonstructural proteins are preferential positive selection targets in Zika virus and related flaviviruses. Pathog Glob Health. 2016;10:269–274. doi: 10.1371/journal.pntd.0004978. [DOI] [Google Scholar]
  • 21.Liang D, Leung RKK, Lee SS, Kam KM. Insights into intercontinental spread of Zika virus. PLoS One. 2017;12:e0176710. doi: 10.1371/journal.pone.0176710. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Holmes EC. Patterns of intra- and Interhost nonsynonymous variation reveal strong purifying selection in dengue virus. J Virol. 2003;77:11296 LP–11211298. doi: 10.1128/JVI.77.20.11296-11298.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Kivisild T (2013) Founder effect. Brenner's encyclopedia of genetics, second edn. Academic press, San Diego, pp 100–101
  • 24.Rai MA, Nerurkar VR, Khoja S, Khan S, Yanagihara R, Rehman A, Kazmi SU, Ali SH. Evidence for a “founder effect” among HIV-infected injection drug users (IDUs) in Pakistan. BMC Infect Dis. 2010;10:7. doi: 10.1186/1471-2334-10-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

ESM 1 (80.4KB, docx)

(DOCX 80 kb)

ESM 2 (245.6KB, jpg)

(JPG 245 kb)


Articles from Brazilian Journal of Microbiology are provided here courtesy of Brazilian Society of Microbiology

RESOURCES