Skip to main content
Nature Communications logoLink to Nature Communications
. 2023 Jul 21;14:4413. doi: 10.1038/s41467-023-40099-y

Increased interregional virus exchange and nucleotide diversity outline the expansion of chikungunya virus in Brazil

Joilson Xavier 1,2,#, Luiz Carlos Junior Alcantara 1,2,✉,#, Vagner Fonseca 3,#, Mauricio Lima 1,4, Emerson Castro 1,4, Hegger Fritsch 1,2, Carla Oliveira 5, Natalia Guimarães 4, Talita Adelino 4, Mariane Evaristo 6, Evandra S Rodrigues 6, Elaine Vieira Santos 6, Debora de La-Roque 6, Laise de Moraes 7, Stephane Tosta 1,2, Adelino Neto 8, Alexander Rosewell 3, Ana Flavia Mendonça 9, Anderson Leite 10, Andreza Vasconcelos 11, Arabela L Silva de Mello 12, Bergson Vasconcelos 13, Camila A Montalbano 14, Camila Zanluca 15, Carla Freitas 16, Carlos F C de Albuquerque 3, Claudia Nunes Duarte dos Santos 15, Cleiton S Santos 7, Cliomar Alves dos Santos 17, Crhistinne C Maymone Gonçalves 18, Dalane Teixeira 13, Daniel F L Neto 16, Diego Cabral 11, Elaine C de Oliveira 19, Ethel L Noia Maciel 20, Felicidade Mota Pereira 12, Felipe Iani 4, Fernanda P de Carvalho 11, Gabriela Andrade 12, Gabriela Bezerra 17, Gislene G de Castro Lichs 21, Glauco Carvalho Pereira 4, Haline Barroso 13, Helena Cristina Ferreira Franz 16, Hivylla Ferreira 22, Iago Gomes 23, Irina N Riediger 24, Isabela Rodrigues 17, Isadora C de Siqueira 7, Jacilane Silva 11, Jairo Mendez Rico 25, Jaqueline Lima 12, Jayra Abrantes 23, Jean Phellipe M do Nascimento 10, Judith N Wasserheit 26, Julia Pastor 11, Jurandy J F de Magalhães 11,27, Kleber Giovanni Luz 28, Lidio G Lima Neto 22, Livia C V Frutuoso 29, Luana Barbosa da Silva 19, Ludmila Sena 17, Luis Arthur F de Sousa 22, Luiz Augusto Pereira 9, Luiz Demarchi 21, Magaly C B Câmara 23, Marcela G Astete 12, Maria Almiron 25, Maricelia Lima 30, Marina C S Umaki Zardin 21, Mayra M Presibella 24, Melissa B Falcão 31, Michael Gale Jr 32, Naishe Freire 11, Nelson Marques 24, Noely F O de Moura 29, Pedro E Almeida Da Silva 20, Peter Rabinowitz 33, Rivaldo V da Cunha 34, Karen S Trinta 34, Rodrigo F do Carmo Said 3, Rodrigo Kato 16, Rodrigo Stabeli 3, Ronaldo de Jesus 16, Roselene Hans Santos 11, Simone Kashima 6, Svetoslav N Slavov 6,35, Tamires Andrade 13, Themis Rocha 23, Thiago Carneiro 13, Vanessa Nardy 12, Vinicius da Silva 9, Walterlene G Carvalho 8, Wesley C Van Voorhis 36, Wildo N Araujo 25, Ana M B de Filippis 5,, Marta Giovanetti 1,2,37,
PMCID: PMC10362057  PMID: 37479700

Abstract

The emergence and reemergence of mosquito-borne diseases in Brazil such as yellow fever, zika, chikungunya, and dengue have had serious impacts on public health. Concerns have been raised due to the rapid dissemination of the chikungunya virus across the country since its first detection in 2014 in Northeast Brazil. In this work, we carried out on-site training activities in genomic surveillance in partnership with the National Network of Public Health Laboratories that have led to the generation of 422 chikungunya virus genomes from 12 Brazilian states over the past two years (2021–2022), a period that has seen more than 312 thousand chikungunya fever cases reported in the country. These genomes increased the amount of available data and allowed a more comprehensive characterization of the dispersal dynamics of the chikungunya virus East-Central-South-African lineage in Brazil. Tree branching patterns revealed the emergence and expansion of two distinct subclades. Phylogeographic analysis indicated that the northeast region has been the leading hub of virus spread towards other regions. Increased frequency of C > T transitions among the new genomes suggested that host restriction factors from the immune system such as ADAR and AID/APOBEC deaminases might be driving the genetic diversity of the chikungunya virus in Brazil.

Subject terms: Phylogenetics, Next-generation sequencing, Phylogeny, Viral epidemiology, Epidemiology


Chikungunya virus is endemic in Brazil and cases have been rapidly increasing in recent years. Here, the authors describe the expansion of a genomic surveillance program across the country allowing them to characterise the emergence and dispersal of two distinct subclades mainly seeded from the north eastern region.

Introduction

Mosquito-borne viral diseases have impacted the lives of millions of people across several populations in the tropics and subtropics1,2. This scenario prompted the World Health Organization (WHO) to issue a guideline in 2017 with strategies for global vector control response aiming to reduce the burden and threat of vector-borne diseases, such as Dengue, Zika, and chikungunya fever, by 20303. Five years later, considering the public health implications brought by the coronavirus disease (COVID-19) pandemic, the WHO outlined ten proposals to strengthen health emergency preparedness, response, and resilience4. One of the proposals calls for the development and establishment of a collaborative surveillance system with improved laboratory capacity for pathogen and genomic surveillance that would guide the public health response.

Several genomic surveillance initiatives have been carried out in the last few years to build knowledge regarding the genetic diversity and transmission dynamics of arboviruses in Brazil, such as chikungunya virus (CHIKV)512. CHIKV infection can cause long-lasting effects such as debilitating arthritis and arthralgia, and there are no effective treatments available13. Vaccine candidates in development have reached phase 2 and 3 clinical trials using attenuated virus derived from the Indian Ocean lineage (IOL) or virus-like particle containing recombinant structural proteins derived from a Senegalese viral strain14,15. Genomic sequencing has revealed that the first case of chikungunya fever (CHIKF) reported in Brazil was an infection by the Asian lineage introduced in a northern state in 2014, while another case reported seven days later in a north-eastern state represented the first known introduction of the East-Central-South-African (ECSA) lineage in the country12. The establishment of the CHIKV in the Brazilian territory was followed by several outbreaks reported across the country, accounting for more than 200 thousand confirmed cases in only the last two years16. Viral genomic data from Brazilian cases has revealed that the ECSA lineage is widespread throughout the country17 and has been linked to fatal cases observed in both risk and non-risk groups (young adults and people without comorbidities)18.

Viral genomic surveillance activities have been driven by the rapid development of DNA sequencing technology and bioinformatics tools for genomic data analysis19. Such tools have allowed the characterization of the genome and dispersal patterns of emerging and reemerging pathogens6,9,20. The use of such tools during the COVID-19 pandemic allowed, for example, the rapid identification of emerging mutations likely associated with increased transmissibility and immune escape21. Despite technological advances and the high number of CHIKF cases reported in recent years in Brazil, the amount of genomic data available in public databases has consisted of genomes from localized outbreaks that could be limited in terms of representativeness across different states and outbreak events. By generating CHIKV sequence data, extensive genomic surveillance efforts in Brazil can provide updated information on the genetic pool of the viral population circulating in the country, fostering new studies on virus evolution and disease treatment.

Recurring outbreaks demonstrate that CHIKV is currently endemic in Brazil. The existence of abundant vectors, together with adequate climatic conditions for vector survival in areas of high population density, create conditions that can modify the adaptive landscape, allowing the continued expansion and evolutionary adaptation of CHIKV2224. Faced with a scenario of limited availability of genetic information on potential strains causing a rapid increase in the number of CHIKF cases over the past two years in Brazil, in this work, we carried out on-site training activities in genomic surveillance in 12 Brazilian states covering four geographic regions to increase the number of available viral genomic sequences. This has allowed comprehensive monitoring of the expansion of the ECSA lineage and its variants circulating in different states, in addition to the characterization of the most up-to-date structured phylogeny of the CHIKV-ECSA lineage in the country.

Results

East, Central, and South African (ECSA) lineage monitoring through countrywide genomic surveillance

Nanopore sequencing was performed on selected CHIKV-positive samples provided by state public health laboratories from 12 states across 4 geographic regions (Northeast, Midwest, Southeast, and South) of Brazil (Fig. 1a) during the years 2021–2022, which saw a significant increase in the number of CHIKF cases reported across different Brazilian regions, with a peak incidence rate of more than 20 cases per 100,000 population, and a total of more than 312,000 suspected cases reported nationally in this 2-year period (Fig. 1b). Due to the portability and easy setup of the nanopore sequencing protocol, which allows data generation in less than 24 h, the collaborative work with the public health laboratories was able to not only generate genomic data but also promote on-site genomic surveillance training activities for the local laboratory staff. This approach combining wet lab and data analysis training allowed local teams to understand how genomic data can be linked to demographic information in order to produce comprehensive and relevant inferences regarding the epidemiology and evolution of CHIKV circulating in Brazil.

Fig. 1. Spatiotemporal distribution of chikungunya fever (CHIKF) in Brazil.

Fig. 1

a Map of Brazil displays states colored according to the location where samples were collected and sequenced. Bar plots indicate the number of isolates obtained from each state. States abbreviations: AL = Alagoas, BA = Bahia, MA = Maranhão, PB = Paraíba, PE = Pernambuco, PI = Piauí, RN = Rio Grande do Norte, SE = Sergipe, MG = Minas Gerais, PR = Paraná, GO = Goiás, MS = Mato Grosso do Sul. b Time series of monthly reported CHIKF suspected cases normalized per 100 K individuals in five Brazilian macroregions over 2014–2022 (until epidemiological week 28). Epidemic curves are colored according to geographical macroregion: N = North, NE = Northeast, MW = Midwest, SE = Southeast, S = South. The shaded rectangle indicates the period in which samples were collected for this study. See “Data availability” for Source data.

A total of 425 CHIKV-positive samples were subjected to nanopore whole-genome sequencing, with 84.94% (n = 361) of these samples originating in the Northeast region, consisting largely of samples collected from the state of Bahia (n = 102) (Fig. 1a, Table 1, and Supplementary Fig. 1). These samples presented a mean RT-qPCR cycle threshold (Ct) value of 24.04 (ranging from 11 to 35.90) (Table 2). Patients’ mean age upon sample collection was similar for both females and males (39 years of age), with 57.88% (n = 246) of the participants identified as female (Table 2). The clinical status of patients at the time of sample collection, and travel history data were not available for these samples.

Table 1.

Number of chikungunya virus-positive samples sequenced during the genomic surveillance in Brazil, 2021–2022, by geographical origin

Region and State # Samples (total n = 425)
Northeast (n = 361)
Alagoas (AL) 3 (0.71%)
Bahia (BA) 102 (24.00%)
Maranhão (MA) 38 (8.94%)
Paraíba (PB) 23 (5.41%)
Pernambuco (PE) 16 (3.77%)
Piauí (PI) 90 (21.18%)
Rio Grande do Norte (RN) 45 (10.59%)
Sergipe (SE) 44 (10.35%)
Midwest (n = 42)
Goiás (GO) 40 (9.41%)
Mato Grosso do Sul (MS) 2 (0.47%)
Southeast
Minas Gerais (MG) 8 (1.88%)
South
Paraná (PR) 14 (3.29%)

Table 2.

Demographic and laboratory characteristics of chikungunya virus-infected patients

Patients (n = 425)
Sex
   Female 246 (57.88%)
   Male 179 (42.12%)
Mean age on sample collection (SD)
   Female 39.10 ± 18.83
   Male 39.37 ± 22.03
Median days of symptoms prior to sampling (SD) 3.47 ± 8.52
RT-qPCR mean Ct (cycle threshold) value (SD) 24.04 ± 4.15

SD standard deviation.

Multiplex PCR-tiling amplicon sequencing on MinION allowed the recovery of 425 genomic sequences from CHIKV with a mean genome coverage and sequencing depth of 90.98% (range 31.80 to 96.19%) and 3290x (range 58 to 129,706), respectively (Supplementary Data 1). Of the 425 sequences, 14 were recovered from old CHIKV-PCR samples collected in Bahia state during July–August 2015 and stored since. These isolates had a mean genome coverage of 70% (range 31.80 to 92.9%). The remaining genomes have an associated collection date ranging from April 2019 to June 2022. To better capture the phylogenetic signal, only the sequences with genome coverage over 60% (n = 422) were considered for further analysis (discarded sequences are listed in the methods).

All the newly recovered genomes were assembled using Genome Detective software which also classified all of them as belonging to the East, Central, and South African (ECSA) lineage. To investigate the phylogenetic relationship of the new sequences with other Brazilian and non-Brazilian sequences available in public databases, we built a global dataset (n = 1987) composed of 1565 CHIKV genomes retrieved from GenBank NCBI in addition to 422 sequences from this study. It is noteworthy that the two years of genomic surveillance activities of this study contributed to a more than doubling of the number of CHIKV genomes from Brazil available in the NCBI (by then there were 332 complete sequences) since the virus emerged in the country in 2014 (Fig. 2c).

Fig. 2. Time-measured phylogeny of chikungunya virus ECSA lineage in Brazil.

Fig. 2

a Maximum Clade Credibility tree reconstructed using 706 sequences from Brazil (in addition to 5 sequences from Paraguay and 2 from Haiti) and a molecular clock approach. Numbers in black show clade posterior probabilities of main nodes. Some posterior probability values were omitted for clarity. Tip colors represent the sampling location. b Root-to-tip genetic distance regression in a maximum likelihood phylogeny of the CHIKV ECSA lineage (n = 713). New sequences are colored in black. The black line represents the medium values of the linear regression while the light gray bands around the line represent the standard error. R2 indicates the coefficient of determination. c Number of genomes generated in this study (with >60% genome coverage) compared to the number of Brazilian CHIKV ECSA sequences available on the GenBank up to 27th Jan. 2023. d The spatial spread of CHIKV in Brazil estimated, using SPREAD4 software, under a discrete diffusion model employed in the Bayesian Phylogeographic approach using a dataset with 471 sequences. Size of colored circles was scaled by location posterior support. See Data Availability for Source data.

Updated CHIKV phylogeny reveals two distinct emerging subclades

A preliminary Maximum Likelihood (ML) tree was reconstructed using the global dataset that showed all Brazilian sequences grouped in the ECSA clade (Supplementary Fig. 2). It can also be observed from the ML tree that most of the new sequences formed two well-distinct derived clades, where it can be noticed that sequences collected in different geographical regions were closely related (see ML tree in Supplementary Data 2). To investigate in more detail the phylogenetic features of these clades within a time-aware evolutionary framework, which can benefit from the increased amount of genomic data obtained in this study, we performed a Bayesian phylogenetic analysis using a downsampled dataset (n = 713) mostly composed of Brazilian sequences.

Root-to-tip genetic distance regression indicated that the downsampled dataset presented sufficient temporal signal (R2 = 0.70 and correlation coefficient = 0.83) to infer a time-measured phylogeny (Fig. 2b). Consistent with the ML tree, the inferred Maximum Clade Credibility (MCC) tree also revealed two distinct more derived clades (henceforth clade I and II) formed mainly by 2021–2022 sequences (Fig. 2a). The Bayesian evolutionary analysis estimated the time of the most recent common ancestor (tMRCA) of clade I to be late January 2018 (95% highest posterior density (HPD): December 2017 and March 2018), while clade II presented a slightly late tMRCA estimated to be early February 2018 (95% HPD: January 2018 and March 2018).

Some composition differences can be noticed between these clades. Clade I comprises sequences from 14 distinct states mostly collected from 2021 to 2022 (n = 304) and from northeastern Brazil (62.38%, n = 204), with sequences from Sergipe (13.5%, Northeast), Minas Gerais (1.8%), São Paulo (16.2%, Southeast), Goiás (11.6%), Mato Grosso do Sul (0.6%, Midwest), and Paraná (4.3%, South) states uniquely present in this clade (Fig. 2a). Meanwhile, clade II is mostly composed of sequences collected in 2022 from northeastern states (87.5%, n = 147), with a total of 8 states sampled and Piaui state being the most represented (46.42%, n = 78) (Fig. 2a).

A closer look at sequence distribution inside these clades reveals recurrent virus movement between states and regions, with midwestern isolates closely related to isolates from the Northeast and from the state of Minas Gerais (Southeast) in clade I. It can also be noticed in clade I that several distinct CHIKV introductions occurred in the state of Goiás (Midwest) and in northeastern states (Bahia, Rio Grande do Norte, Sergipe, Paraiba, Pernambuco, and Piaui) (Fig. 2a). Contrarily, the clade I sequence distribution reveals that apparently, only one viral introduction event has happened in the southern state of Paraná, sharing a most recent common ancestor (dated from Jun. 2019 to Jun. 2020, 95% HPD, posterior probability = 1.0) with isolates from the state of São Paulo (Southeast cluster). Similarly, clade II displays viral exchange between states, especially from the Northeast, as indicated by a single well-supported subclade (posterior = 1.0), dated from Jan. 2020 to Dec. 2020 (95% HPD) and dominated by sequences from northeastern states (Piauí, Bahia, Rio Grande do Norte, Paraíba, Pernambuco, Maranhão, and Alagoas) (Fig. 2a).

CHIKV dispersal in Brazil has been mainly seeded by the Northeast

In face of the recurrent virus movement observed across the country, as indicated by our MCC phylogeny, we employed a Bayesian phylogeographic approach to reconstruct the spatial dispersal dynamic of CHIKV in Brazil (closely related sequences from Haiti and Paraguay were included) and to estimate the ancestral locations of clades I and II. The resulting phylogeny kept the topology from the MCC tree shown in Fig. 2a and revealed the Northeast as a leading source of CHIKV transmission in Brazil, seeding the network of frequent virus exchange among states mainly from the Northeast, Southeast, and Midwest, as indicated by the location probability of the 5 early branching events inferred by the discrete phylogeography (Fig. 2d and Supplementary Fig. 3). Moreover, the ECSA lineage circulating in Brazil extended its transmission network by reaching other countries in the region such as Paraguay and Haiti, likely via the Midwest and Northeast of Brazil, respectively. An alternative approach, using a transmission network generated from transition states summarized from the Bayesian phylogeography and centrality metrics, also indicated the Northeast as a source (Source Hub Ratio of 0.66) in the network for the CHIKV spread in the country, where intense interactions are displayed between the Northeast and Southeast (Supplementary Fig. 4 and Supplementary Table 1). Moreover, the discrete state ancestral reconstruction employed in our Bayesian analysis estimated that both clades I and II might have emerged in the Southeast region with a location probability of 1.0 and with estimated divergence times (clade I: 95% HPD Nov. 2017–Feb. 2018; clade II: Dec. 2017–Mar. 2018) (Supplementary Fig. 3) comparable to those obtained in the MCC tree from Fig. 2a inferred using a comprehensive dataset.

These estimates place the divergence time of clades I and II in the period that marks the return of CHIKV-increased transmissions after two main epidemic seasons registered from 2016 to 2017 when a total of more than 565 thousand disease cases were notified in the country (Fig. 1b). From the time series graph of CHIKF cases, we can see an increase in the incidence rate around early 2018 for the Midwest and North regions. In that same period, clades I and II were estimated to emerge in the Southeast region, which also presented an increased incidence rate. Since then, a seasonal epidemic pattern has been observed in the CHIKV transmission dynamic with incidence peaks being displayed in the first months of the following years (Fig. 1b). Although the Northeast displayed a slightly later peak in its incidence curve in 2020 and 2021, but not in 2022, when compared to the other regions. These changes in the incidence curve patterns could be explained by differences in local climatic conditions within regions associated with differences in population susceptibility and immunity to the virus, leading to the differential spread of the virus in other parts of Brazil. Alternatively, it has been suggested that spatial heterogeneity associated to virus spread could determine CHIKV epidemics in urban environments25. Moreover, surveillance and reporting biases cannot be ruled out as the 2020–2021 period comprises the time when the states were heavily impacted by rapid increase in the number of SARS-CoV-2 infections, which subsided in 2022.

Consistent with the phylogeography and the transmission network analyses, the CHIKF case time-series plot presents the Northeast region as a major source of virus transmission in the country, as shown by the consecutive incidence peaks registered for that region in the last three years.

Transition changes outline the evolutionary expansion of the Brazilian ECSA lineage

Despite having close divergence times, clades I and II have different sequence compositions that likely explain their apparent branching distance. While sequences from clades I and II were obtained from patients with similar median age, a significant difference (p < 0.05) was observed between the median RT-qPCR Ct value of the samples of each clade (Fig. 3a). This difference in the median Ct values, however, might have been caused by an imbalance present in the dataset used for comparison since clade I has almost twice as many sequences compared to clade II. Alternatively, the between-clades inconsistency observed in the time between symptoms onset and sample collection could have affected sample Ct values.

Fig. 3. Genetic and demographic aspects of the newly generated chikungunya virus (CHIKV) sequences.

Fig. 3

a Violin plot and boxplot of the patient’s age and samples’ cycle threshold (Ct) value distributions by clades I (n = 255 biologically independent samples) and II (n = 147 biologically independent samples). Mann–Whitney U (Wilcoxon) two-sided test with a significance level alpha = 0.05. b Violin plots and boxplots of the frequency distributions for each class of single nucleotide variation (SNV) identified across genome positions of sequences from clades I (n = 327 biologically independent samples) and II (n = 168 biologically independent samples). Mann–Whitney U (Wilcoxon) two-sided test with a significance level alpha = 0.05. a, b Data visualized as violin plots (outer shape representing the data kernel density) and box-and-whisker plots (box represents the median value and 25 and 75% quartiles, the whiskers display the minimum and maximum values, 1.5x interquartile range; points, outliers). c Genomic positions and frequency (represented by plotted bars) of the non-synonymous substitutions uniquely identified in the sequences from clades I and II. Numbers represent the genomic position of each substitution. d Bar plot of the absolute number of CHIKV genome mutated positions for each class of single nucleotide variation (SNV) identified in the alignment of the new CHIKV sequences (n = 422 biologically independent samples). e Scatterplot of the frequency distributions for each class of single nucleotide variation (SNV) identified in the alignment of the new CHIKV sequences (n = 422 biologically independent samples). Each dot represents the frequency of a distinct mutated genome position. b, d, e Letters in the x-axis represent nucleotides: A—adenine, G—guanine, C—cytosine, T—thymine. See “Data availability” for Source data.

We also assessed these clades as to which selective regime they are likely to be subject to. The results from the BUSTED analysis provided evidence that the envelope polyprotein coding region experienced positive diversifying selection in both clades I (p = 1.774e−8) and II (p = 1.840e−11) (Supplementary Table 2). For comparison, we employed a second method, MEME, that identified 12 sites under positive selection for clade I, whereas 13 sites were identified in clade II. Of these sites, 7 are shared by both clades, while exclusive positive selected sites were reported for each clade (Supplementary Table 3). These results suggest the active status of these clades and consequently provide evidence of the continuous evolutionary expansion of the Brazilian ECSA lineage.

Separate sequence alignments representative of each clade revealed a significant difference between the median frequency of 3 classes of single nucleotide variation (SNV) identified in the clades. Clade II presented a higher median frequency of SNV of type C > T, A > G, and T > C transitions (Fig. 3b). By comparing the mutational profile of the two clades, we identified 27 SNVs exclusively present and shared by sequences from clade II, with three of them being non-synonymous substitutions leading putatively to amino acid change such as a T288I substitution in the E1 protein (Table 3 and Fig. 3c). The other two non-synonymous substitutions were identified in two nonstructural proteins (nsP2-P352A and nsP4-A43V).

Table 3.

List of single nucleotide variations (SNV) uniquely identified for clades I and II

Genomic positiona Reference base Altered base Substitution type aab change Clade Genomic region aac position Frequencyd
3208 C T S D > D 1 nsP2 509 1.00
3217 T C S Y > Y 1 nsP2 512 1.00
3277 C T S S > S 1 nsP2 532 1.00
3295 G A S P > P 1 nsP2 538 1.00
3314 G T NS A > S 1 nsP2 544 1.00
7328 G A NS V > I 1 nsP4 555 1.00
7787 A G NS Q > R 1 capsid 74 1.00
7788 G A NS Q > R 1 capsid 74 1.00
8641 C T S L > L 1 E2 34 0.68
9283 C T NS L > F 1 E2 248 0.22
9285 C T S L > L 1 E2 248 0.73
9936 T C S C > C 1 6K 42 1.00
9970 A G NS I > V 1 6K 54 1.00
1754 T C S L > L 2 NSP2 25 1.00
1759 T A S V > V 2 NSP2 26 1.00
1783 T G S R > R 2 NSP2 34 1.00
1789 A G S Q > Q 2 NSP2 36 1.00
1795 T C S L > L 2 NSP2 38 1.00
2735 C G NS P > A 2 NSP2 352 1.00
2767 A G S G > G 2 NSP2 362 1.00
2887 T C S L > L 2 NSP2 402 1.00
3403 T C S F > F 2 NSP2 574 1.00
3901 A G S V > V 2 NSP2 740 1.00
3916 T C S F > F 2 NSP2 745 1.00
4711 T C S D > D 2 NSP3 212 1.00
4837 A G S S > S 2 NSP3 254 1.00
4843 C T S P > P 2 NSP3 256 1.00
4939 T C S S > S 2 NSP3 288 1.00
5783 C T S L > L 2 NSP4 40 1.00
5793 C T NS A > V 2 NSP4 43 1.00
5807 C T S L > L 2 NSP4 48 1.00
5857 T C S Y > Y 2 NSP4 64 1.00
5921 T C S L > L 2 NSP4 86 1.00
5956 C T S T > T 2 NSP4 97 1.00
6994 T C S N > N 2 NSP4 443 1.00
7213 A G S T > T 2 NSP4 516 1.00
9153 T C S G > G 2 E2 204 0.89
9642 T A S T > T 2 E2 367 0.80
10856 C T NS T > I 2 E1 288 0.86
11241 T C S G > G 2 E1 416 0.81

Non-synonymous substitutions are highlighted in bold.

aGenomic position according to the reference sequence NC_004162.2.

bAmino acid changes.

cAmino acid position in the respective peptide.

dFrequency of the substitution in the dataset from the respective clade.

Sequences from Clade I, on the other hand, presented 13 exclusive SNVs, of which six are non-synonymous mutations present in two nonstructural proteins (nsP2 and nsP4) and three structural proteins (capsid, 6k and E2) (Table 3 and Fig. 3c). The capsid sequence from this clade presented two contiguous transition substitutions in the same codon (7787-7788) resulting in a Q74R change. Moreover, both clades contain genomes harboring the E1-T98A mutation (121/327 sequences in clade I, and 31/167 sequences in clade II), while only clade II reported 113 sequences (67%) presenting another mutation, the V264A in the E2. These two mutations have been associated with increased virus infectivity in Aedes ssp. when present in distinct genetic backgrounds (E1-226V with E1-T98A or E1-226A with E2-V264A)26,27. However, the E1-226V mutation, as well as other E1 (K211E) and E2 (D60G, R198Q, L210Q, I211T, K233E, K252Q) adaptative mutations have not been detected in the Brazilian isolates.

In an alignment analysis of all 422 new sequences, we found that among the total of CHIKV genome mutated positions the majority consisted of type C > T transitions (29.5%, n = 368) followed by T > C substitutions (22.9%, n = 285) (Fig. 3d). These transitions along with A > G and G > A transitions displayed a comparable frequency distribution across the sequence dataset, where mutations at several positions were identified in all new sequences (Fig. 3e). Due to the possibility of these substitutions resulting from sequencing errors, we performed the same comparative analysis on a different dataset containing sequences that form a subclade (from the South region) in clade I and that were generated by a different sequencing technology (Illumina) and a different research group (data in Supplementary files). We observed the same pattern of increased frequency (26.8%) of C > T transitions among those southern sequences, which suggested the observed increased transition changes in clades I and II were likely derived from mutation rather than sequencing artifacts.

Discussion

Advances in sequencing technology and bioinformatics tools have contributed to an increased understanding of the genetic diversity and transmission dynamics of emerging viruses causing epidemics28. In this study, we increased the number of CHIKV genomes deposited in the NCBI by generating 422 genomes that cover 12 Brazilian states over the years 2021–2022. By combining genomic and demographic data this effort has provided a comprehensive overview of CHIKV-ECSA phylogeny in Brazil which summarizes and updates previous phylogenies from other studies of localized outbreaks10,12,17,29,30. The updated and metadata-enriched dataset allowed us not only to describe the genetic diversity of circulating CHIKV variants but also to describe branching patterns observed in the updated phylogeny and infer the divergence time and likely location of emerging distinct subclades from the Brazilian lineage.

Following the arrival of ECSA lineage in the Northeast region, a number of other studies have predicted and reported the establishment and spread of this lineage to the rest of the country1012,17,20,2932. The Bayesian phylogeographic approach employed in this study has revealed the main routes of dispersion of CHIKV in the country. After being introduced in the Northeast region, the ECSA lineage dispersed towards several states from all five regions in Brazil. In line with previous findings10,17,29,3335, both the Bayesian phylogeographic and transmission network approaches indicated the Northeast region has been acting as the leading hub of virus spread towards other regions in the country, including by forming an intense virus exchange network with the Southeast region. These interactions are likely driven by human mobility between these areas, as both Southeast and Northeast regions are densely populated, housing transportation infrastructure and major urban centers that attract a substantial number of visitors. From the Southeast, the virus dispersed to the southern state of Paraná (posterior probability of 0.96) with the time of the most recent common ancestor shared with samples from São Paulo estimated to be January 2020. In addition to spreading in Brazil, CHIKV has also extended its circulation to other countries such as Paraguay and Haiti. CHIKV circulation in Paraguay and Haiti has been previously associated with transmission events originating from Brazil through international viral exchange likely mediated by human movement3638. These results evidence the lineage’s potential expansion across Latin America, despite limited information available about this lineage in the region as evidenced by the lack of CHIKV-ECSA sequences from other Latin American countries.

Our time-measured phylogeny corroborated previous studies and revealed emerging branching patterns, mainly represented by two well-supported distinct subclades named clade I and II. Both these clades were estimated to emerge in the Southeast region around the first months of 2018. At the end of that year, the Southeast region accounted for more than 65 thousand reported cases. Moreover, differences in sequence composition were observed between these clades, with clade I being more diverse as it comprises sequences from four different regions (Northeast, Midwest, Southeast, and South) collected in the years 2021–2022, while clade II contains mostly sequences from northeastern states collected in 2022. This difference in the clades sequence composition profile might change as the lineage continues to expand into the country. The observed differences in geographic diversity between clades might reflect distinct transmission networks underlying the divergence and expansion of these subclades.

Different viral lineages might be under distinct evolutionary pressure that together with the emergence and selection of mutations can drive viral adaptation to a particular environment39,40. It has been argued that two different CHIKV lineages, IOL and Asian, have undergone different evolutionary trajectories leading to different vector adaptative potentials41. Here, we used the ratio of non-synonymous (dN) to synonymous (dS) nucleotide substitutions in the CHIKV envelope gene to assess the selective regime to which clades I and II might be subjected. Our analysis revealed that both clades have experienced positive diversifying selection. The CHIKV envelope gene encodes three viral glycoproteins associated with membrane fusion and receptor binding during infection, and these proteins are a target of neutralizing antibodies42,43. This gene constitutes a variable region in the virus genome where several adaptive mutations have been identified44, for instance, mutations in the envelope proteins such as E1-226V and E2-L210Q have been implicated in the increased adaptation and transmission of the CHIKV IOL lineage in Aedes albopictus22,45, leading to the epidemics reported in the Indian Ocean region between 2004 and 200746,47.

A higher ratio of non-synonymous substitutions is observed under a positive selection regime promoted by virus-host interactions48. Since SNVs continue to arise in RNA virus populations mainly driven by errors made by the virus replication complex that lead to genetic diversity, we compared the mutational profiles of clades I and II49. We identified in both clades isolates harboring mutations (E1-T98A and E2-V264A) previously associated with increased fitness in mosquitoes Aedes ssp. when these mutations are present in the background with either E1-226A26 or E1-226V27 mutations, although the latter was not observed in the genomes from this study. We identified several SNVs across nonstructural and structural protein genes that were exclusive to each clade. Clade II presented more SNVs (n = 27) than clade I (n = 13), of which three are non-synonymous substitutions (E1-T288I; nsP2-P352A, and nsP4-A43V). Literature research revealed that E1-T288I change was previously identified in a 2017 sequence from Iran and also in CHIKV sequences collected in 2016 from infected cancer patients in Rio de Janeiro, Brazil50,51. The nsP2-P352A substitution was also present in sequences collected between 2016 and 2017 in Rio de Janeiro17,52. In turn, clade I sequences contain six non-synonymous mutations across nonstructural (nsP2 and nsP4) and structural protein sequences (capsid, 6k, and E2). The nsP4-V555I change was previously detected in sequences from Thailand in 2008–200953. The E2-L248F from clade I has been reported in Asian lineage sequences from Colombia (2014–2015) and Philippines (2012)54,55. Isolates from Thailand, Indonesia, Lao PDR, Cameroon and India also presented the 6K-I54V mutation observed in clade I53,5659. Despite the detection of these mutations in different countries (indicating homoplasy) by other previous studies, there is no information about the functional impact of such substitutions on CHIKV fitness, thus warranting further experimental studies to elucidate the potential effects of SNVs on lineage-specific evolutionary adaptation. It has been argued that not only non-synonymous mutations have the potential to promote adaptive changes but also synonymous mutations, including deletions and insertions in the 3′untranslated region60,61, can lead to changes in the viral RNA that can drive differential viral gene expression and host adaptation6062.

Mutational analysis of the 422 sequences generated in this study revealed a higher amount of C > T and T > C transitions followed by A > G and G > A substitutions in the CHIKV genome and several genome positions presenting these transitions with higher frequency across all new sequences. Moreover, Clade II has a significantly higher median frequency of C > T, A > G, and T > C transitions compared to clade I. Although this study cannot experimentally establish the significance of these transitions for CHIKV evolutionary adaptation, other studies have associated this mutational pattern with the action of host antiviral immune response mediated by AID/APOBEC and ADAR families of deaminases63,64. These enzymes are part of the interferon-stimulated innate immune response and promote viral genome transitions mutations by catalyzing the deamination of adenosine to inosine to cause A > G/T > C (by ADAR) substitutions or deamination of cytosine to uracil that leads to C > T/G > A (by AID/APOBEC) mutations65. This RNA editing process has been experimentally observed targeting specific viral sequences of SARS-CoV-2 to produce C > T transitions and increasing viral replication in Caco-2 cells, thus promoting viral increased fitness and adaptative evolution66. However, specific information about the effect of these RNA editing processes on the CHIKV genome remains elusive, although the APOBEC3A gene has been observed up-regulated in the expression profile of CHIKV-infected patients67. Despite arguments that these transitions happen mainly in phylogenetically uninformative sites, the available evidence indicates that RNA editing processes might act as a significant driver of viral sequence diversity and evolutionary adaptation through the introduction of nucleotide changes64,68.

This study recognizes that some limitations should be noted. Although this study presented the results of a Bayesian phylogeographic and mutational profile analysis performed on 422 new CHIKV sequences collected from 12 Brazilian states, not all states were evenly represented in the dataset used, which might limit the estimates relative to divergence time and ancestral location reconstructions, prompting careful interpretation of the results presented here. Ongoing sequencing efforts across the country could reduce this disparity in the future. Moreover, although single nucleotide substitutions identified among the new sequences offer insights into the evolutionary dynamics of CHIKV in Brazil, further functional studies need to be undertaken to elucidate the actual adaptive effect of these mutations.

Recurring CHIKF epidemics, as indicated by the seasonal peak patterns displayed in the incidence time-series plot, are evidence that the virus is endemic in Brazil. Human mobility, population immunity, vector suitability, vegetation coverage, site socioeconomic status, and viral sequence variation are factors considered to mediate the dispersal of CHIKV in Brazil29,6971. Although we did not find the E1:226V Aedes albopictus-adaptive mutation in the Brazilian sequences, the high abundance in the region of widely spread competent vectors, such as Ae. aegypti and Ae. albopictus, together with favorable climatic and social conditions in large urban centers create conditions that modify the adaptive landscape of CHIKV, which in turn can allow the continued expansion of the ECSA lineage in the country with a resultant increased impact on public health23,72,73. Therefore, public health measures should be undertaken to ensure continuous genomic surveillance of circulating CHIKV variants which can help to identify viral transmission routes where focused vector control strategies could be employed to reduce the risk of recurring CHIKF epidemics.

Methods

Ethical statement

The project was approved by the Pan American World Health Organization (PAHO) and the Ministry of Health of Brazil (MoH) as part of the arboviral genomic surveillance efforts within the terms of Resolution 510/2016 of CONEP (Comissão Nacional de Ética em Pesquisa, Ministério da Saúde; National Ethical Committee for Research, Ministry of Health). This authorizes the use of clinical samples collected in the Brazilian Central Public Health Laboratories to accelerate knowledge building and contribute to surveillance and outbreak response. The study protocol, including collection and publication of individual-level data, was reviewed and approved by the Research Ethics Committee of the Universidade Federal de Minas Gerais with approval No. 32912820.6.1001.5149. Personally identifiable information was de-identified in the datasets and tables in a way that minimizes the risk of unintended disclosure of identity of individuals and information about them.

Sample collection

Residual samples (serum or plasma) were obtained from the epidemiological surveillance routine of the Brazilian Central Public Health Laboratories (LACEN) from different states (Alagoas, Bahia, Goiás, Paraíba, Paraná, Pernambuco, Piauí, Maranhão, Minas Gerais, Mato Grosso do Sul, Rio Grande do Norte, and Sergipe). These samples, which were taken from patients that spontaneously seek medical care during epidemic season, were submitted to nucleic acid purification using the MagMax Viral RNA Mini kit (Thermo Fisher Scientific), following the manufacturer’s recommendations, and were previously screened by each LACEN. CHIKV RT-qPCR positive samples were selected for sequencing based on the cycle threshold (Ct) value ≤ 30 and the availability of demographic metadata such as sex, age, and municipality of residency. These demographic patient data were provided by LACENs and were collected through a questionnaire filled out by patients and/or health professionals at local health care services.

cDNA synthesis and whole-genome sequencing

Extracted RNA from positive CHIKV samples were provided by collaborating LACENs and submitted to cDNA synthesis and PCR, using a sequencing protocol (primers sequences in Supplementary Data 1) based on multiplex PCR-tiling amplicon approach design for MinION nanopore sequencing74. All reactions were performed at biosafety level 2 facilities and using no template controls. PCR products were purified using 1x AMpure beads Beckman Coulter, UK) and quantified using Qubit 3.0 instrument (Life Technologies) and the Qubit dsDNA High Sensitivity assay. DNA library preparation was performed on all amplified samples using the Ligation Sequencing Kit (Oxford Nanopore Technologies). Individual samples were barcoded using the Native Barcoding Kit (NBD104, Oxford Nanopore Technologies, Oxford, UK). Sequencing library was loaded onto a R9.4 flow cell and data were collected for up to 48 sequencing hours.

Generation of consensus sequences

Basecalling of raw FAST5 files and demultiplexing of barcodes were performed using the software Guppy v6.0.1 (https://github.com/nanoporetech). Consensus sequences were generated by a hybrid assembling approach implemented on Genome Detective (https://www.genomedetective.com/)75.

Phylogenetic reconstruction

We used MAFFT v.7 to align 422 new sequences (with coverage over >60% according to Thézé et al.76, samples 736.22_RED, FS0116, and FS0132 were discarded) in addition to 1565 CHIKV whole-genome sequences publicly available in NCBI up to August 2022, forming a global dataset (n = 1987) that includes CHIKV all lineages. We used NCBI Entrez Utilities to retrieve worldwide CHIKV genomes according to the following inclusion criteria: chikungunya virus[title] AND 8000[SLEN]: 13000[SLEN] for minimum (60% genome coverage) and maximum sequence length. Alignment of the global dataset can be found on the repository 10.6084/m9.figshare.22335331.v2.

This global dataset was used to infer a Maximum Likelihood (ML) phylogeny using the IQ-TREE 2.1.1 software77,78. Statistical support for tree nodes was estimated using the ultrafast bootstrap (UFBoot) feature implemented in IQ-TREE with 1000 replicates. We then used the ML tree from the global dataset to extract the Brazilian ECSA clade and use it to form a second dataset (total n = 713, 706 Brazilian sequences, 2 from Haiti, and 5 from Paraguay; sequences with genome coverage >60% according to Thézé et al.76) which was used to infer a time-scaled phylogeny using BEAST v1.10.4. First, we investigated the temporal signal regressing root-to-tip genetic distances from this ML tree against sample collection dates using TempEst v.1.5.179. Secondly, we employed a stringent model selection analysis using both path-sampling (PS) and stepping-stone (SS) procedures to estimate the most appropriate molecular clock model for the Bayesian phylogenetic analysis80. For the Bayesian analysis, the uncorrelated relaxed molecular clock was chosen as indicated by estimating marginal likelihoods, also employing the HKY + G4 nucleotide substitution model, and the nonparametric Bayesian Skyline coalescent model. We combined two independent runs of 200 million states each81. The convergence of MCMC chains was checked using Tracer82. Maximum clade credibility (MCC) trees were summarized using TreeAnnotator v1.10.4 after discarding 10% as burn-in.

CHIKV ECSA lineage movements across Brazil were investigated using the Bayesian phylogeographic approach with a discrete trait phylogenetic model. A trait file was used to discretize sequences sampling location by five Brazilian regions (North, Northeast, Southeast, South, and Midwest). For this analysis, we downsampled our Brazilian ECSA clade to a dataset containing 471 sequences to maximize the temporal signal in the dataset. The final downsampled dataset was assembled by removing closely related sequences from clades with repetitive collection dates and locations, thus avoiding bias in the analysis due to those oversampled clades.

MCMC analyses were performed in BEAST v1.10.4, running in duplicate for 200 million interactions and sampling every 20,000 steps in the chain. Convergence for each run was assessed in Tracer. MCC trees for each run were summarized using TreeAnnotator after discarding the initial 10% as burn-in. Finally, we used the SPREAD 4 tool to map spatiotemporal information embedded in the MCC trees83.

Transmission network analysis

A transmission network was reconstructed, using the StrainHub tool v1.1.2, from transition states summarized from the Bayesian phylogeography84. Centrality metrics on the tree nodes were also estimated for the network.

Comparative mutational analysis

For comparative mutational analysis, we assembled separate alignment datasets for each subclade (clade I = 327; clade II = 168) and for all new sequences (n = 422). The sequence datasets were compared against the NCBI reference strain NC_004162.2 using MALVIRUS v1.0.285. We filtered and selected substitutions with an occurrence frequency of 100% across the whole dataset and substitutions with a frequency above 60% across the envelope genes E1 and E2. Results were summarized in boxplots using Rstudio 2022.12.0.

Selection pressure analysis

Since the ratio of non-synonymous (dN) to synonymous (dS) nucleotide substitutions can be used to study selection pressure on genomic sequences, we used the HYPHY software package that employs statistical methods that estimate the dN/dS to detect diversifying selection86. For that we performed an alignment of the envelope gene sequences of 720 Brazilian isolates and used BUSTED (restricting the analysis to each subclade I or II), an alignment-wide method implemented in HYPHY 2.5.42, that aims to detect evidence of episodic diversifying selection87. For comparison, we also used a different method, called MEME, a site-level approach, also implemented in HYPHY 2.5.42, that aims to detect evidence of both pervasive and episodic diversifying selection at individual sites (also restricting the analysis to each subclade I or II)88.

Epidemic curves from chikungunya fever cases reported in Brazil

Data of weekly notified suspected cases of chikungunya fever in Brazil from 2014 to 2022 (up to epidemiological week 28 of the year 2022) were supplied by the Ministry of Health of Brazil (MoH) (Ministério da Saúde, 2022). These data were used to calculate incidence and to plot time series charts using Rstudio 2022.12.0. The MoH defines a suspected case as all human cases presenting symptoms compatible with CHIKV infection (sudden onset of fever or intense arthritis not explained by other conditions), a case that resides or has traveled to endemic or epidemic areas up to 14 days prior to symptom onset, or that has an epidemiological link to an imported confirmed case. The MoH also informs that during epidemic season most cases are confirmed by clinical criteria only.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Supplementary information

Supplementary Information (673.7KB, pdf)
Peer Review File (424.9KB, pdf)
41467_2023_40099_MOESM3_ESM.pdf (90.6KB, pdf)

Description of Additional Supplementary Files

Supplemantary Data 1 (123.7KB, xlsx)
Supplemantary Data 2 (43.4KB, zip)
Reporting Summary (72.4KB, pdf)

Acknowledgements

This work was supported by The Pan American Health Organization (PAHO/WHO), the Brazilian Ministry of Health grant SCON2021-00180 (Coordenação Geral de Laboratório de Saúde Pública-CGLAB and Coordenação Geral de Vigilância de Arboviroses-CGARB), the National Institutes of Health USA grant U01 AI151698 for the United World Arbovirus Research Network (UWARN). We thank the State Health Secretariats and the epidemiological surveillance services of Brazilian states and municipalities for all the support provided. J.X. was supported by Coordenação de Aperfeiçoamento de Pessoal de Nível Superior-Brasil (CAPES)-Finance Code 001. M.G. is funded by PON “Ricerca e Innovazione” 2014–2020. M.G. is supported in part by the CRP- ICGEB RESEARCH GRANT 2020 Project CRP/BRA20-03, Contract CRP/20/03. The authors would like to acknowledge the Global Consortium to Identify and Control Epidemics – CLIMADE (T.O., L.C.J.A., E.C.H., J.L., and M.G.) (https://climade.health/).

Author contributions

Conception and design: L.C.J.A., M.G., V.F., and J.X.; investigations: J.X., M.G., L.C.J.A., V.F., A.M.B.D.F., Mau.L., E.C., Heg.F., C.O., N.G., Tal.A., M.E., E.S.R, E.V.S, D.D.L.R., L.D.M., S.T., A.N., A.R., A.F.M., A.L., A.V., A.L.S.D.M., B.V., C.A.M., C.Z., C.F., C.F.C.D.A, C.N.D.D.S, C.S.S., C.A.D.S., C.C.M.G., D.T., D.F.L.N., D.C., E.C.D.O., E.L.N.M., F.M.P., F.I., F.P.D.C., G.A., G.B., G.G.D.C.L., G.C.P., H.B., H.C.F.F., Hivy.F., I.G., I.N.R., I.R., I.C.D.S., J.S., J.M.R., J.L., J.A., J.P.M.D.N., J.N.W., J.P., J.J.F.D.M., K.G.L., L.G.L.N., L.C.V.F., L.B.D.S., L.S., L.A.F.D.S., L.A.P., L.D., M.C.B.C., M.G.A., M.A., Mar.L., M.C.S.U.Z., M.M.P., M.B.F., M.G.J., N.F., N.M., N.F.O.D.M., P.E.A.D.S., P.R., R.V.D.C., K.S.T., R.F.D.C.S., R.K., R.S., R.D.J., R.H.S., S.K., S.N.S., Tam.A., T.R., T.C., V.N., V.D.S., W.G.C., W.C.V.V., and W.N.A. Data curation: J.X., M.G., V.F., L.C.J.A; formal analysis: J.X., M.G., V.F., and L.C.J.A.; writing—original draft preparation: J.X., M.G., and L.C.J.A.; revision: J.X., M.G., V.F., and L.C.J.A.; resources: J.M.R., M.A., W.N.A., A.R., R.F.D.C.S., C.F.C.D.A, W.C.V.V., P.R., M.G.J., J.N.W., R.S., E.L.N.M., P.E.A.D.S., H.C.F.F., C.F., N.F.O.D.M., L.C.V.F., A.M.B.D.F., and L.C.J.A.

Peer review

Peer review information

Nature Communications thanks Rhys Inward and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Data availability

The new sequences generated in this study have been deposited in NCBI GenBank under accession numbers OQ759652- OQ760076 listed in Supplementary Data 1. Input data used for the phylogenetic and mutational profile analyses are provided on the repository 10.6084/m9.figshare.22335331.v2.

Code availability

R codes used to create the charts are available on 10.6084/m9.figshare.22335331.v2.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Joilson Xavier, Luiz Carlos Junior Alcantara, Vagner Fonseca.

Contributor Information

Luiz Carlos Junior Alcantara, Email: luiz.alcantara@ioc.fiocruz.br.

Ana M. B. de Filippis, Email: ana.bispo@ioc.fiocruz.br

Marta Giovanetti, Email: giovanetti.marta@gmail.com.

Supplementary information

The online version contains supplementary material available at 10.1038/s41467-023-40099-y.

References

  • 1.Mourad O, Makhani L, Chen LH. Chikungunya: an emerging public health concern. Curr. Infect. Dis. Rep. 2022;24:217–228. doi: 10.1007/s11908-022-00789-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Chala B, Hamde F. Emerging and re-emerging vector-borne infectious diseases and the challenges for control: a review. Front. Public Health. 2021;9:715759. doi: 10.3389/fpubh.2021.715759. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.World Health Organization. Global Vector Control Response 2017–2030 (World Health Organization, 2017).
  • 4.World Health Organization. 10 Proposals to Build A Safer World Together – Strengthening the Global Architecture for HealthEmergency Preparedness, Response and Resilience (WHO Health Emergencies Programme, 2022).
  • 5.Adelino TÉR, et al. Field and classroom initiatives for portable sequence-based monitoring of dengue virus in Brazil. Nat. Commun. 2021;12:2296. doi: 10.1038/s41467-021-22607-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Faria NR, et al. Establishment and cryptic transmission of Zika virus in Brazil and the Americas. Nature. 2017;546:406–410. doi: 10.1038/nature22401. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Costa ÉA, et al. West Nile virus in Brazil. Pathogens. 2021;10:896. doi: 10.3390/pathogens10070896. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Faria NR, et al. Genomic and epidemiological monitoring of yellow fever virus transmission potential. Science. 2018;361:894–899. doi: 10.1126/science.aat7115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Giovanetti M, et al. A retrospective overview of Zika virus evolution in the midwest of Brazil. Microbiol. Spectr. 2022;10:e0015522. doi: 10.1128/spectrum.00155-22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Naveca FG, et al. Genomic, epidemiological and digital surveillance of Chikungunya virus in the Brazilian Amazon. PLoS Negl. Trop. Dis. 2019;13:e0007065. doi: 10.1371/journal.pntd.0007065. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Xavier J, et al. Circulation of chikungunya virus East/Central/South African lineage in Rio de Janeiro, Brazil. PLoS ONE. 2019;14:e0217871. doi: 10.1371/journal.pone.0217871. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Nunes MRT, et al. Emergence and potential for spread of Chikungunya virus in Brazil. BMC Med. 2015;13:102. doi: 10.1186/s12916-015-0348-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Powers, A. M. Vaccine and therapeutic options to control chikungunya virus. Clin. Microbiol. Rev.31 e00104-16 (2018). [DOI] [PMC free article] [PubMed]
  • 14.ClinicalTrials.gov. Pivotal study to evaluate safety and immunogenicity of a live-attenuated chikungunya virus vaccine candidate in Adults-NCT04546724. https://clinicaltrials.gov/study/NCT04546724 (2022).
  • 15.Bennett, S. R. et al. Safety and immunogenicity of PXVX0317 an aluminium hydroxide-adjuvanted chikungunya virus-like particle vaccine: a randomised double-blind parallel-group phase 2 trial. Lancet Infect. Dis.22, 1343–1355 (2022). [DOI] [PubMed]
  • 16.Pan American Health Organization. Cases of chikungunya virus disease. PAHO/WHO Data—Weekly Report. https://www3.paho.org/data/index.php/en/mnu-topics/chikv-en/550-chikv-weekly-en.html (2023).
  • 17.Souza TML, et al. Emergence of the East-Central-South-African genotype of Chikungunya virus in Brazil and the city of Rio de Janeiro may have occurred years before surveillance detection. Sci. Rep. 2019;9:2760. doi: 10.1038/s41598-019-39406-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.de Lima STS, et al. Fatal outcome of chikungunya virus infection in Brazil. Clin. Infect. Dis. 2021;73:e2436–e2443. doi: 10.1093/cid/ciaa1038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Robishaw JD, et al. Genomic surveillance to combat COVID-19: challenges and opportunities. Lancet Microbe. 2021;2:e481–e484. doi: 10.1016/S2666-5247(21)00121-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Xavier J, et al. Chikungunya virus ECSA lineage reintroduction in the northeasternmost region of Brazil. Int. J. Infect. Dis. 2021;105:120–123. doi: 10.1016/j.ijid.2021.01.026. [DOI] [PubMed] [Google Scholar]
  • 21.Garcia-Beltran WF, et al. Multiple SARS-CoV-2 variants escape neutralization by vaccine-induced humoral immunity. Cell. 2021;184:2372–2383.e9. doi: 10.1016/j.cell.2021.03.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Tsetsarkin KA, et al. Multi-peaked adaptive landscape for chikungunya virus evolution predicts continued fitness optimization in Aedes albopictus mosquitoes. Nat. Commun. 2014;5:4084. doi: 10.1038/ncomms5084. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Pancetti FGM, Honório NA, Urbinatti PR, Lima-Camara TN. Twenty-eight years of Aedes albopictus in Brazil: a rationale to maintain active entomological and epidemiological surveillance. Rev. Soc. Bras. Med. Trop. 2015;48:87–89. doi: 10.1590/0037-8682-0155-2014. [DOI] [PubMed] [Google Scholar]
  • 24.Carvalho RG, Lourenço-de-Oliveira R, Braga IA. Updating the geographical distribution and frequency of Aedes albopictus in Brazil with remarks regarding its range in the Americas. Mem. Inst. Oswaldo Cruz. 2014;109:787–796. doi: 10.1590/0074-0276140304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.de Souza WM, et al. Spatiotemporal dynamics and recurrence of chikungunya virus in Brazil: an epidemiological study. Lancet Microbe. 2023;4:e319–e329. doi: 10.1016/S2666-5247(23)00033-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Agarwal, A., Sharma, A. K., Sukumaran, D., Parida, M. & Dash, P. K. Two novel epistatic mutations (E1:K211E and E2:V264A) in structural proteins of chikungunya virus enhance fitness in Aedes aegypti. Virology497, 59–68 (2016). [DOI] [PubMed]
  • 27.Tsetsarkin KA, et al. Chikungunya virus emergence is constrained in Asia by lineage-specific adaptive landscapes. Proc. Natl Acad. Sci. USA. 2011;108:7872–7877. doi: 10.1073/pnas.1018344108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Grubaugh ND, et al. Tracking virus outbreaks in the twenty-first century. Nat. Microbiol. 2019;4:10–19. doi: 10.1038/s41564-018-0296-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Charlys da Costa A, et al. Spread of chikungunya virus east/central/south African genotype in northeast Brazil. Emerg. Infect. Dis. 2017;23:1742–1744. doi: 10.3201/eid2310.170307. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Maeda AY, et al. Circulation of Chikungunya virus East-Central-South African genotype during the 2020–21 outbreak in São Paulo State, Brazil. J. Clin. Virol. 2022;2:100070. [Google Scholar]
  • 31.de Castro Moreira D, et al. Genetic characterization of Chikungunya virus circulating in individuals from Paraná, Brazil. Braz. J. Microbiol. 2022;53:641–645. doi: 10.1007/s42770-022-00680-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Teixeira MG, et al. East/Central/South African genotype Chikungunya virus, Brazil, 2014. Emerg. Infect. Dis. 2015;21:906–907. doi: 10.3201/eid2105.141727. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Souza, T. M. A. et al. First report of the East-Central South African genotype of chikungunya virus in Rio de Janeiro, Brazil. PLoS Curr.9, 01–01 (2017). [DOI] [PMC free article] [PubMed]
  • 34.Pereira Gusmão Maia Z, et al. Return of the founder Chikungunya virus to its place of introduction into Brazil is revealed by genomic characterization of exanthematic disease cases. Emerg. Microbes Infect. 2020;9:53–57. doi: 10.1080/22221751.2019.1701954. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Tanabe EL, et al. Report of East-Central South African Chikungunya virus genotype during the 2016 outbreak in the Alagoas State, Brazil. Rev. Inst. Med. Trop. Sao Paulo. 2018;60:e19. doi: 10.1590/s1678-9946201860019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.White SK, et al. A new “American” subgroup of African-lineage Chikungunya virus detected in and isolated from mosquitoes collected in Haiti, 2016. PLoS ONE. 2018;13:e0196857. doi: 10.1371/journal.pone.0196857. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Gräf T, et al. Epidemiologic history and genetic diversity origins of chikungunya and dengue viruses, Paraguay. Emerg. Infect. Dis. 2021;27:1393–1404. doi: 10.3201/eid2705.204244. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.de Oliveira EC, et al. Short report: Introduction of chikungunya virus ECSA genotype into the Brazilian Midwest and its dispersion through the Americas. PLoS Negl. Trop. Dis. 2021;15:e0009290. doi: 10.1371/journal.pntd.0009290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Spielman SJ, et al. Evolution of viral genomes: interplay between selection, recombination, and other forces. Methods Mol. Biol. 2019;1910:427–468. doi: 10.1007/978-1-4939-9074-0_14. [DOI] [PubMed] [Google Scholar]
  • 40.Yu X, Cheng G. Adaptive evolution as a driving force of the emergence and re-emergence of mosquito-borne viral diseases. Viruses. 2022;14:435. doi: 10.3390/v14020435. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Chen R, et al. Lineage divergence and vector-specific adaptation have driven Chikungunya virus onto multiple adaptive landscapes. MBio. 2021;12:e0273821. doi: 10.1128/mBio.02738-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Quiroz JA, et al. Human monoclonal antibodies against Chikungunya virus target multiple distinct epitopes in the E1 and E2 glycoproteins. PLoS Pathog. 2019;15:e1008061. doi: 10.1371/journal.ppat.1008061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Jin J, et al. Neutralizing monoclonal antibodies block chikungunya virus entry and release by targeting an epitope critical to viral pathogenesis. Cell Rep. 2015;13:2553–2564. doi: 10.1016/j.celrep.2015.11.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Bartholomeeusen K, et al. Chikungunya fever. Nat. Rev. Dis. Prim. 2023;9:17. doi: 10.1038/s41572-023-00429-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Tsetsarkin KA, Vanlandingham DL, McGee CE, Higgs S. A single mutation in chikungunya virus affects vector specificity and epidemic potential. PLoS Pathog. 2007;3:e201. doi: 10.1371/journal.ppat.0030201. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Tsetsarkin KA, Chen R, Sherman MB, Weaver SC. Chikungunya virus: evolution and genetic determinants of emergence. Curr. Opin. Virol. 2011;1:310–317. doi: 10.1016/j.coviro.2011.07.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Mavalankar D, Shastri P, Bandyopadhyay T, Parmar J, Ramani KV. Increased mortality rate associated with chikungunya epidemic, Ahmedabad, India. Emerg. Infect. Dis. 2008;14:412–415. doi: 10.3201/eid1403.070720. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Duggal NK, Emerman M. Evolutionary conflicts between viruses and restriction factors shape immunity. Nat. Rev. Immunol. 2012;12:687–695. doi: 10.1038/nri3295. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Collins ND, et al. Structural and nonstructural genes contribute to the genetic diversity of RNA viruses. MBio. 2018;9:e01871–18. doi: 10.1128/mBio.01871-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Pouriayevali MH, et al. Imported cases of Chikungunya virus in Iran. BMC Infect. Dis. 2019;19:1004. doi: 10.1186/s12879-019-4637-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Familiar-Macedo D, et al. Molecular aspects of Chikungunya virus infections in cancer patients. Mem. Inst. Oswaldo Cruz. 2022;117:e210383. doi: 10.1590/0074-02760210383. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Cunha MS, et al. Autochthonous transmission of east/central/south African genotype chikungunya virus, brazil. Emerg. Infect. Dis. 2017;23:1737–1739. doi: 10.3201/eid2310.161855. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Pongsiri P, et al. Entire genome characterization of Chikungunya virus from the 2008-2009 outbreaks in Thailand. Trop. Biomed. 2010;27:167–176. [PubMed] [Google Scholar]
  • 54.Rodas JD, et al. Genetic characterization of Northwestern Colombian Chikungunya virus strains from the 2014-2015 epidemic. Am. J. Trop. Med. Hyg. 2016;95:639–646. doi: 10.4269/ajtmh.16-0091. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Tan K-K, et al. Independent emergence of the cosmopolitan Asian chikungunya virus, Philippines 2012. Sci. Rep. 2015;5:12279. doi: 10.1038/srep12279. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Hakim, M. S., Annisa, L. & Aman, A. T. The evolution of chikungunya virus circulating in Indonesia: sequence analysis of the orf2 gene encoding the viral structural proteins. Int. Microbiol. 10.1007/s10123-023-00337-1 (2023). [DOI] [PubMed]
  • 57.Khan N, et al. Epidemiology and molecular characterization of chikungunya virus from human cases in North India, 2016. Microbiol. Immunol. 2021;65:290–301. doi: 10.1111/1348-0421.12869. [DOI] [PubMed] [Google Scholar]
  • 58.Calvez E, et al. Multiple chikungunya virus introductions in Lao PDR from 2014 to 2020. PLoS ONE. 2022;17:e0271439. doi: 10.1371/journal.pone.0271439. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Agbodzi B, et al. Chikungunya viruses containing the A226V mutation detected retrospectively in Cameroon form a new geographical subclade. Int. J. Infect. Dis. 2021;113:65–73. doi: 10.1016/j.ijid.2021.09.058. [DOI] [PubMed] [Google Scholar]
  • 60.Chen R, Wang E, Tsetsarkin KA, Weaver SC. Chikungunya virus 3’ untranslated region: adaptation to mosquitoes and a population bottleneck as major evolutionary forces. PLoS Pathog. 2013;9:e1003591. doi: 10.1371/journal.ppat.1003591. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Morley VJ, et al. Chikungunya virus evolution following a large 3’UTR deletion results in host-specific molecular changes in protein-coding regions. Virus Evol. 2018;4:vey012. doi: 10.1093/ve/vey012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Faure G, Ogurtsov AY, Shabalina SA, Koonin EV. Adaptation of mRNA structure to control protein folding. RNA Biol. 2017;14:1649–1654. doi: 10.1080/15476286.2017.1349047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Chaw S-M, et al. The origin and underlying driving forces of the SARS-CoV-2 outbreak. J. Biomed. Sci. 2020;27:73. doi: 10.1186/s12929-020-00665-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Simmonds P, Ansari MA. Extensive C->U transition biases in the genomes of a wide range of mammalian RNA viruses; potential associations with transcriptional mutations, damage- or host-mediated editing of viral RNA. PLoS Pathog. 2021;17:e1009596. doi: 10.1371/journal.ppat.1009596. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Pecori R, Di Giorgio S, Paulo Lorenzo J, Nina Papavasiliou F. Functions and consequences of AID/APOBEC-mediated DNA and RNA deamination. Nat. Rev. Genet. 2022;23:505–518. doi: 10.1038/s41576-022-00459-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Kim K, et al. The roles of APOBEC-mediated RNA editing in SARS-CoV-2 mutations, replication and fitness. Sci. Rep. 2022;12:14972. doi: 10.1038/s41598-022-19067-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Soares-Schanoski A, et al. Systems analysis of subjects acutely infected with the Chikungunya virus. PLoS Pathog. 2019;15:e1007880. doi: 10.1371/journal.ppat.1007880. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Piontkivska H, Wales-McGrath B, Miyamoto M, Wayne ML. ADAR editing in viruses: an evolutionary force to reckon with. Genome Biol. Evol. 2021;13:evab240. doi: 10.1093/gbe/evab240. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Churakov M, Villabona-Arenas CJ, Kraemer MUG, Salje H, Cauchemez S. Spatio-temporal dynamics of dengue in Brazil: Seasonal travelling waves and determinants of regional synchrony. PLoS Negl. Trop. Dis. 2019;13:e0007012. doi: 10.1371/journal.pntd.0007012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Freitas LP, Schmidt AM, Cossich W, Cruz OG, Carvalho MS. Spatio-temporal modelling of the first Chikungunya epidemic in an intra-urban setting: the role of socioeconomic status, environment and temperature. PLoS Negl. Trop. Dis. 2021;15:e0009537. doi: 10.1371/journal.pntd.0009537. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Power GM, et al. Socioeconomic risk markers of arthropod-borne virus (arbovirus) infections: a systematic literature review and meta-analysis. BMJ Glob. Health. 2022;7:e007735. doi: 10.1136/bmjgh-2021-007735. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.de Melo Ximenes M, et al. Arbovirus expansion: new species of culicids infected by the Chikungunya virus in an urban park of Brazil. Acta Trop. 2020;209:105538. doi: 10.1016/j.actatropica.2020.105538. [DOI] [PubMed] [Google Scholar]
  • 73.Costa-da-Silva AL, et al. First report of naturally infected Aedes aegypti with chikungunya virus genotype ECSA in the Americas. PLoS Negl. Trop. Dis. 2017;11:e0005630. doi: 10.1371/journal.pntd.0005630. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Quick J, et al. Multiplex PCR method for MinION and Illumina sequencing of Zika and other virus genomes directly from clinical samples. Nat. Protoc. 2017;12:1261–1276. doi: 10.1038/nprot.2017.066. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Vilsker M, et al. Genome Detective: an automated system for virus identification from high-throughput sequencing data. Bioinformatics. 2019;35:871–873. doi: 10.1093/bioinformatics/bty695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Thézé J, et al. Genomic epidemiology reconstructs the introduction and spread of Zika virus in Central America and Mexico. Cell Host Microbe. 2018;23:855–864.e7. doi: 10.1016/j.chom.2018.04.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Nguyen L-T, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 2015;32:268–274. doi: 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Katoh K, Kuma K, Toh H, Miyata T. MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Res. 2005;33:511–518. doi: 10.1093/nar/gki198. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Rambaut A, Lam TT, Max Carvalho L, Pybus OG. Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen) Virus Evol. 2016;2:vew007. doi: 10.1093/ve/vew007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Baele G, et al. Improving the accuracy of demographic and molecular clock model comparison while accommodating phylogenetic uncertainty. Mol. Biol. Evol. 2012;29:2157–2167. doi: 10.1093/molbev/mss084. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Suchard MA, et al. Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evol. 2018;4:vey016. doi: 10.1093/ve/vey016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Rambaut A, Drummond AJ, Xie D, Baele G, Suchard MA. Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Syst. Biol. 2018;67:901–904. doi: 10.1093/sysbio/syy032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Nahata KD, et al. SPREAD 4: online visualisation of pathogen phylogeographic reconstructions. Virus Evol. 2022;8:veac088. doi: 10.1093/ve/veac088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.de Bernardi Schneider A, et al. StrainHub: a phylogenetic tool to construct pathogen transmission networks. Bioinformatics. 2020;36:945–947. doi: 10.1093/bioinformatics/btz646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Ciccolella S, et al. MALVIRUS: an integrated application for viral variant analysis. BMC Bioinforma. 2022;22:625. doi: 10.1186/s12859-022-04668-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Kosakovsky Pond SL, et al. HyPhy 2.5—a customizable platform for evolutionary hypothesis testing using phylogenies. Mol. Biol. Evol. 2020;37:295–299. doi: 10.1093/molbev/msz197. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Murrell B, et al. Gene-wide identification of episodic selection. Mol. Biol. Evol. 2015;32:1365–1371. doi: 10.1093/molbev/msv035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Murrell B, et al. Detecting individual sites subject to episodic diversifying selection. PLoS Genet. 2012;8:e1002764. doi: 10.1371/journal.pgen.1002764. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information (673.7KB, pdf)
Peer Review File (424.9KB, pdf)
41467_2023_40099_MOESM3_ESM.pdf (90.6KB, pdf)

Description of Additional Supplementary Files

Supplemantary Data 1 (123.7KB, xlsx)
Supplemantary Data 2 (43.4KB, zip)
Reporting Summary (72.4KB, pdf)

Data Availability Statement

The new sequences generated in this study have been deposited in NCBI GenBank under accession numbers OQ759652- OQ760076 listed in Supplementary Data 1. Input data used for the phylogenetic and mutational profile analyses are provided on the repository 10.6084/m9.figshare.22335331.v2.

R codes used to create the charts are available on 10.6084/m9.figshare.22335331.v2.


Articles from Nature Communications are provided here courtesy of Nature Publishing Group

RESOURCES