Abstract
The emergence and reemergence of mosquito-borne diseases in Brazil such as Yellow Fever, Zika, Chikungunya, and Dengue have had serious impacts on public health. Concerns have been raised due to the rapid dissemination of the chikungunya virus (CHIKV) across the country since its first detection in 2014 in Northeast Brazil. Faced with this scenario, on-site training activities in genomic surveillance carried out in partnership with the National Network of Public Health Laboratories have led to the generation of 422 CHIKV genomes from 12 Brazilian states over the past two years (2021–2022), a period that has seen more than 312 thousand chikungunya fever cases reported in the country. These new genomes increased the amount of available data and allowed a more comprehensive characterization of the dispersion dynamics of the CHIKV East-Central-South-African (ECSA) lineage in Brazil. Tree branching patterns revealed the emergence and expansion of two distinct subclades. Phylogeographic analysis indicated that the northeast region has been the leading hub of virus spread towards other regions. Increased frequency of C>T transitions among the new genomes suggested that host restriction factors from the immune system such as ADAR and AID/APOBEC deaminases might be driving CHIKV ECSA lineage genetic diversity in Brazil.
Keywords: chikungunya virus, East-Central-South-African lineage, phylogenetics, nanopore sequencing, genomic surveillance, genetic diversity
Introduction
Mosquito-borne viral diseases have impacted the lives of millions of people across several populations in the tropics and subtropics1,2. This scenario prompted the World Health Organization (WHO) to issue a guideline in 2017 with strategies for global vector control response aiming to reduce the burden and threat of vector-borne diseases, such as Dengue, Zika and Chikungunya fever, by 20303. Five years later, considering the public health implications brought by the coronavirus disease (COVID-19) pandemic, the WHO outlined ten proposals to strengthen health emergency preparedness, response, and resilience4. One of the proposals calls for the development and establishment of a collaborative surveillance system with improved laboratory capacity for pathogen and genomic surveillance that would guide the public health response.
Several genomic surveillance initiatives have been carried out in the last few years to build knowledge regarding the genetic diversity and transmission dynamics of arboviruses in Brazil5–11. Genomic sequencing has revealed that the first case of chikungunya fever (CHIKF) reported in Brazil was an infection by the Asian lineage introduced in a northern state in 2014, while another case reported seven days later in a north-eastern state represented the first known introduction of the East-Central-South-African (ECSA) lineage in the country12. The establishment of the chikungunya virus (CHIKV) in the Brazilian territory was followed by several outbreaks reported across the country, accounting for more than 200 thousand confirmed cases in only the last two years13. Viral genomic data from Brazilian cases has revealed that the ECSA lineage is widespread throughout the country and has been linked to fatal cases observed in both risk and non-risk groups (young adults and no commodities)14,15.
Viral genomic surveillance activities have been driven by the rapid development of DNA sequencing technology and bioinformatics tools for genomic data analysis16. Such tools have allowed the characterization of the genome and dispersal patterns of emerging and reemerging pathogens6,9,17. The use of such tools during the COVID-19 pandemic allowed, for example, the rapid identification of emerging mutations likely associated with increased transmissibility and immune escape18.
Despite technological advances and the high number of CHIKF cases reported in recent years in Brazil, the amount of genomic data available in public databases has consisted of genomes from localized outbreaks that could be limited in terms of representativeness across different states and outbreak events. CHIKF can cause long last effects such as debilitating arthritis and arthralgia, and there are no effective treatments available19. Vaccine candidates in development have reached phase 2 and 3 clinical trials using attenuated virus derived from the Indian Ocean lineage (IOL) or virus-like particle containing recombinant structural proteins derived from a Senegalese viral strain20,21. Since available vaccine candidates are based on non-Brazilian variants, increasing the availability of genomic data to characterize the genetic pool of the viral population circulating in Brazil might facilitate the future development of an efficient and more representative CHIKV vaccine.
Recurring outbreaks demonstrate that CHIKV is currently endemic in Brazil. The existence of abundant vectors, together with adequate climatic conditions for vector survival in areas of high population density, create conditions that can modify the adaptive landscape, allowing the continued expansion and evolutionary adaptation of CHIKV22–24. Faced with a scenario of limited availability of genetic information on potential strains causing a rapid increase in the number of CHIKF cases over the past two years in Brazil, we carried out on-site training activities in genomic surveillance in 12 Brazilian states covering four geographic regions to increase the number of available viral genomic sequences. This has allowed comprehensive monitoring of the expansion of the ECSA lineage and its variants circulating in different states, in addition to the characterization of the most up-to-date structured phylogeny of the CHIKV ECSA lineage in the country.
Results
East, Central, and South African (ECSA) lineage monitoring through countrywide genomic surveillance
Nanopore sequencing was performed on selected CHIKV-positive samples provided by state public health laboratories from 12 states across 4 geographic regions (Northeast, Midwest, Southeast, and South) of Brazil (Figure 1A) during the years 2021–2022, which saw a significant increase in the number of CHIKF cases reported across different Brazilian regions, with a peak incidence rate of more than 20 cases per 100,000 population, and a total of more than 312,000 reported cases nationally in this two-year period (Figure 1B). Due to the portability and easy setup of the nanopore sequencing protocol, which allows data generation in less than 24h, the collaborative work with the public health laboratories was able to not only generate genomic data but also promote on-site genomic surveillance training activities for the local laboratory staff. This approach combining wet lab and basic data analysis training allowed local teams to understand how genomic data can be linked to demographic information in order to produce comprehensive and relevant inferences regarding the epidemiology and evolution of CHIKV circulating in Brazil.
A total of 425 CHIKV-positive samples were subjected to nanopore whole-genome sequencing, with 84.94% (n=361) of these samples originating in the Northeast region, consisting largely of samples collected from the state of Bahia (n=102) (Figure 1A and Table 1). These samples presented a mean RT-qPCR cycle threshold value of 24.04 (ranging from 11 to 35.90) (Table 2). Patients’ mean age upon sample collection was similar for both females and males (39 years of age), with 57.88% (n=246) of the participants identified as female (Table 2). The clinical status of patients at the time of sample collection, and travel history data were not available for these samples.
Table 1:
Region and State | # Samples (total n= 425) |
---|---|
Northeast | (n=361) |
Alagoas (AL) | 3 (0,71%) |
Bahia (BA) | 102 (24,00%) |
Maranhão (MA) | 38 (8,94%) |
Paraiba (PB) | 23 (5,41%) |
Pernambuco (PE) | 16 (3,77%) |
Piauí (PI) | 90 (21,18%) |
Rio Grande do Norte (RN) | 45 (10,59%) |
Sergipe (SE) | 44 (10,35%) |
Midwest | (n=42) |
Goiás (GO) | 40 (9,41%) |
Mato Grosso do Sul (MS) | 2 (0,47%) |
Southeast | |
Minas Gerais (MG) | 8 (1,88%) |
South | |
Paraná (PR) | 14 (3,29%) |
Table 2:
Patients (n= 425) | |
---|---|
Sex | |
Female | 246 (57.88 %) |
Male | 179 (42.12 %) |
Mean age on sample collection (SD) | |
Female | 39.10 ± 18.83 |
Male | 39.37 ± 22.03 |
Median days of symptoms prior to sampling (SD) | 3.47 ± 8.52 |
RT-qPCR mean Ct (cycle threshold) value (SD) | 24.04 ± 4.15 |
SD = Standard Deviation
Multiplex PCR-tiling amplicon sequencing on MinION allowed the recovery of 425 genomic sequences from CHIKV with a mean genome coverage of 90.98% (range 31.80 to 96.19%) (Table S1). Of the 425 sequences, 14 were recovered from old CHIKV-PCR samples collected in Bahia state during July-August 2015 and stored since. These isolates had a mean genome coverage of 70% (range 31.80 to 92.9 %). The remaining genomes have an associated collection date ranging from April 2019 to June 2022. To better capture the phylogenetic signal, only the sequences with genome coverage over 60% (n=422) were considered for further analysis (discarded sequences are listed in the methods).
All the newly recovered genomes were assembled using Genome Detective software which also classified all of them as belonging to the East, Central, and South African (ECSA) lineage. To investigate the phylogenetic relationship of the new sequences with other Brazilian and non-Brazilian sequences available in public databases, we built a global dataset (n=1,987) composed of 1,565 CHIKV genomes retrieved from GenBank NCBI in addition to 422 sequences from this study. It is noteworthy that the two years of genomic surveillance activities of this study contributed to a more than doubling of the number of CHIKV genomes from Brazil available in the NCBI (by then there were 332 complete sequences) since the virus emerged in the country in 2014 (Figure 2C).
Updated CHIKV phylogeny reveals two distinct emerging subclades
A preliminary Maximum Likelihood (ML) tree was reconstructed using the global dataset that showed all Brazilian sequences grouped in the ECSA clade (Figure S1). It can also be observed from the ML tree that most of the new sequences formed two well-distinct derived clades, where it can be noticed that sequences collected in different geographical regions were closely related (see ML tree in Supplemental files). To investigate in more detail the phylogenetic features of these clades within a time-aware evolutionary framework, which can benefit from the increased amount of genomic data obtained in this study, we performed a Bayesian phylogenetic analysis using a down-sampled dataset (n=713) mostly composed of Brazilian sequences.
Root-to-tip genetic distance regression indicated that the down-sampled dataset presented sufficient temporal signal (R2=0.70 and correlation coefficient = 0.83) to infer a time-measured phylogeny (Figure 2B). Consistent with the ML tree, the inferred Maximum Clade Credibility (MCC) tree also revealed two distinct more derived clades (henceforth clade I and II) formed mainly by 2021–2022 sequences (Figure 2A). The Bayesian evolutionary analysis estimated the time of the most recent common ancestor (tMRCA) of clade I to be late January 2018 (95% highest posterior density (HPD): December 2017 and March 2018), while clade II presented a slightly late tMRCA estimated to be early February 2018 (95% HPD: January 2018 and March 2018).
Some composition differences can be noticed between these clades. Clade I comprises sequences from 14 distinct states mostly collected from 2021 to 2022 (n=304) and from northeastern Brazil (62.38%, n=204), with sequences from Sergipe (13.5%, Northeast), Minas Gerais (1.8%), São Paulo (16.2%, Southeast), Goiás (11.6%), Mato Grosso do Sul (0.6%, Midwest), and Paraná (4.3%, South) states uniquely present in this clade (Figure 2A). Meanwhile, clade II is mostly composed of sequences collected in 2022 from northeastern states (87.5%, n=147), with a total of 8 states sampled and Piaui state being the most represented (46.42%, n=78) (Figure 2A).
A closer look at sequence distribution inside these clades reveals recurrent virus movement between states and regions, with midwestern isolates closely related to isolates from the Northeast and from the state of Minas Gerais (Southeast) in clade I. It can also be noticed in clade I that several distinct CHIKV introductions occurred into the state of Goiás (Midwest) and into northeastern states (Bahia, Rio Grande do Norte, Sergipe, Paraiba, Pernambuco, and Piaui) (Figure 2A). Contrarily, the clade I sequence distribution reveals that apparently, only one viral introduction event has happened in the southern state of Paraná, sharing a most recent common ancestor (dated from Jun. 2019 to Jun. 2020, 95% HPD, posterior probability = 1.0) with isolates from the state of São Paulo (Southeast cluster). Similarly, clade II displays viral exchange between states, especially from the Northeast, as indicated by a single well-supported subclade (posterior = 1.0), dated from Jan. 2020 to Dec. 2020 (95% HPD) and dominated by sequences from northeastern states (Piauí, Bahia, Rio Grande do Norte, Paraíba, Pernambuco, Maranhão, and Alagoas) (Figure 2A).
CHIKV dispersal in Brazil has been mainly seeded by the Northeast
In face of the recurrent virus movement observed across the country, as indicated by our MCC phylogeny, we employed a Bayesian phylogeographic approach to reconstruct the spatial dispersal dynamic of CHIKV in Brazil (closely related sequences from Haiti and Paraguay were included) and to estimate the ancestral locations of clades I and II. The resulting phylogeny kept the topology from the MCC tree shown in figure 2A and revealed the Northeast as a leading source of CHIKV transmission in Brazil, seeding the network of frequent virus exchange among states mainly from the Northeast, Southeast and Midwest, as indicated by the location probability of the 5 early branching events inferred by the discrete phylogeography (Figure 2D and Figure S2). Moreover, the ECSA lineage circulating in Brazil extended its transmission network by reaching other countries in the region such as Paraguay and Haiti likely via the Midwest and Northeast of Brazil, respectively. An alternative approach, using a transmission network generated from transition states summarized from the Bayesian phylogeography and centrality metrics, also indicated the Northeast as a source (Source Hub Ratio of 0.66) in the network for the CHIKV spread in the country, where intense interactions are displayed between the Northeast and Southeast (Figure S3 and Table S2). Moreover, the discrete state ancestral reconstruction employed in our Bayesian analysis estimated that both clades I and II might have emerged in the Southeast region with a location probability of 1.0 and with estimated divergence times (clade I: 95% HPD Nov. 2017 – Feb. 2018; clade II: Dec. 2017 – Mar. 2018) comparable to those obtained in the MCC tree inferred using a comprehensive dataset displayed in Figure 2A (Figure S2).
These estimates place the divergence time of clades I and II in the period that marks the return of CHIKV-increased transmissions after two main epidemic seasons registered from 2016 to 2017 when a total of more than 565 thousand disease cases were notified in the country (Figure 1B). From the time series graph of CHIKF cases, we can see an increase in the incidence rate around early 2018 for the Midwest and North regions. In that same period, clades I and II were estimated to emerge in the Southeast region, which also presented an increased incidence rate. Since then, a seasonal epidemic pattern has been observed in the CHIKV transmission dynamic with incidence peaks being displayed in the first semester of the following years (Figure 1B). Consistent with the phylogeography and the transmission network analyses, the CHIKF case time-series plot presents the Northeast region as a major source of virus transmission in the country, as shown by the consecutive incidence peaks registered for that region in the last three years.
Transition changes outline the evolutionary expansion of the Brazilian ECSA lineage
Despite having close divergence times, clades I and II have different sequence compositions that likely explain their apparent branching distance. While sequences from clades I and II were obtained from patients with similar median age, a significant difference (p<0.05) was observed between the median RT-qPCR cycle threshold value of the samples of each clade (Figure 3A). Such a difference, however, might have been affected by an imbalance present in the dataset used for comparison since clade I has almost twice as many sequences compared to clade II.
We also assessed these clades as to which selective regime they are likely to be subject to. The results from the BUSTED analysis provided evidence that the envelope gene experienced positive diversifying selection in both clades I (p=1.774e-8) and II (p=1.840e-11) (Table S3). For comparison, we employed a second method, MEME, that identified 12 sites under positive selection for clade I, whereas 13 sites were identified in clade II. Of these sites, 7 are shared by both clades, while exclusive positive selected sites were reported for each clade (Table S4). These results suggest the active status of these clades and consequently provide evidence of the continuous evolutionary expansion of the Brazilian ECSA lineage.
Separate sequence alignments representative of each clade revealed a significant difference between the median frequency of 3 classes of single nucleotide variation (SNV) identified in the clades. Clade II presented a higher median frequency of SNV of type C>T, A>G, and T>C transitions (Figure 3B). By comparing the mutational profile of the two clades, we identified 27 SNVs exclusively present and shared by sequences from clade II, with three of them being non-synonymous substitutions leading putatively to amino acid change such as a T288I substitution in the E1 protein gene (Table 3 and Figure 3C). The other two non-synonymous substitutions were identified in two nonstructural proteins (nsP2-P352A and nsP4-A43V). Sequences from Clade I, on the other hand, presented 13 exclusive SNVs, of which six are non-synonymous mutations present in two nonstructural proteins (nsP2 and nsP4) and three structural proteins genes (capsid, 6k and E2) (Table 3 and Figure 3C). The capsid gene sequence from this clade presented two contiguous transition substitutions in the same codon (7787–7788) resulting in a Q74R change. In an alignment analysis of all 422 new sequences, we found that among the total of CHIKV genome mutated positions the majority consisted of type C>T transitions (29.5%, n= 368) followed by T>C substitutions (22.9%, n= 285) (Figure 3D). These transitions along with A>G and G>A transitions displayed a comparable frequency distribution across the sequence dataset, where mutations at several positions were identified in all new sequences (Figure 3E). Due to the possibility of these substitutions resulting from sequencing errors, we performed the same comparative analysis on a different dataset containing sequences that form a subclade (from the South region) in clade I and that were generated by a different sequencing technology (Illumina) and a different research group (data in Supplemental files). We observed the same pattern of increased frequency (26.8%) of C>T transitions among those southern sequences, which suggested the observed increased transition changes in clades I and II were likely derived from mutation rather than sequencing artifacts.
Table 3.
Genomic Positiona | Reference base | Altered base | Substitution type | aab change | Clade | Genomic region | aac position | Frequencyd |
---|---|---|---|---|---|---|---|---|
3208 | C | T | S | D>D | 1 | nsP2 | 509 | 1.00 |
3217 | T | C | S | Y>Y | 1 | nsP2 | 512 | 1.00 |
3277 | C | T | S | S>S | 1 | nsP2 | 532 | 1.00 |
3295 | G | A | S | P>P | 1 | nsP2 | 538 | 1.00 |
3314 | G | T | NS | A>S | 1 | nsP2 | 544 | 1.00 |
7328 | G | A | NS | V>I | 1 | nsP4 | 555 | 1.00 |
7787 | A | G | NS | Q>R | 1 | capsid | 74 | 1.00 |
7788 | G | A | NS | Q>R | 1 | capsid | 74 | 1.00 |
8641 | C | T | S | L>L | 1 | E2 | 34 | 0.68 |
9283 | C | T | NS | L>F | 1 | E2 | 248 | 0.22 |
9285 | C | T | S | L>L | 1 | E2 | 248 | 0.73 |
9936 | T | C | S | C>C | 1 | 6K | 42 | 1.00 |
9970 | A | G | NS | I>V | 1 | 6K | 54 | 1.00 |
1754 | T | C | S | L>L | 2 | NSP2 | 25 | 1.00 |
1759 | T | A | S | V>V | 2 | NSP2 | 26 | 1.00 |
1783 | T | G | S | R>R | 2 | NSP2 | 34 | 1.00 |
1789 | A | G | S | Q>Q | 2 | NSP2 | 36 | 1.00 |
1795 | T | C | S | L>L | 2 | NSP2 | 38 | 1.00 |
2735 | C | G | NS | P>A | 2 | NSP2 | 352 | 1.00 |
2767 | A | G | S | G>G | 2 | NSP2 | 362 | 1.00 |
2887 | T | C | S | L>L | 2 | NSP2 | 402 | 1.00 |
3403 | T | C | S | F>F | 2 | NSP2 | 574 | 1.00 |
3901 | A | G | S | V>V | 2 | NSP2 | 740 | 1.00 |
3916 | T | C | S | F>F | 2 | NSP2 | 745 | 1.00 |
4711 | T | C | S | D>D | 2 | NSP3 | 212 | 1.00 |
4837 | A | G | S | S>S | 2 | NSP3 | 254 | 1.00 |
4843 | C | T | S | P>P | 2 | NSP3 | 256 | 1.00 |
4939 | T | C | S | S>S | 2 | NSP3 | 288 | 1.00 |
5783 | C | T | S | L>L | 2 | NSP4 | 40 | 1.00 |
5793 | C | T | NS | A>V | 2 | NSP4 | 43 | 1.00 |
5807 | C | T | S | L>L | 2 | NSP4 | 48 | 1.00 |
5857 | T | C | S | Y>Y | 2 | NSP4 | 64 | 1.00 |
5921 | T | C | S | L>L | 2 | NSP4 | 86 | 1.00 |
5956 | C | T | S | T>T | 2 | NSP4 | 97 | 1.00 |
6994 | T | C | S | N>N | 2 | NSP4 | 443 | 1.00 |
7213 | A | G | S | T>T | 2 | NSP4 | 516 | 1.00 |
9153 | T | C | S | G>G | 2 | E2 | 204 | 0.89 |
9642 | T | A | S | T>T | 2 | E2 | 367 | 0.80 |
10856 | C | T | NS | T>I | 2 | E1 | 288 | 0.86 |
11241 | T | C | S | G>G | 2 | E1 | 416 | 0.81 |
Genomic position according to the reference sequence NC_004162.2.
Amino acid change.
Amino acid position in the respective peptide.
Frequency of the substitution in the dataset from the respective clade. Non-synonymous substitutions are highlighted in bold.
Discussion
Advances in sequencing technology and bioinformatics tools have contributed to an increased understanding of the genetic diversity and transmission dynamics of emerging viruses causing epidemics25. In this study, we increased the number of CHIKV genomes deposited in the NCBI by generating 422 genomes that cover 12 Brazilian states over the years 2021–2022. By combining genomic and demographic data this effort has provided a comprehensive overview of CHIKV ECSA phylogeny in Brazil which summarizes and updates previous phylogenies from other studies of localized outbreaks10,12,14,26,27. The updated and metadata-enriched dataset allowed us not only to describe the genetic diversity of circulating CHIKV variants but also to describe branching patterns observed in the updated phylogeny and infer the divergence time and likely location of emerging distinct subclades from the Brazilian lineage.
Following the arrival of ECSA lineage in the Northeast region, a number of other studies have predicted and reported the establishment and spread of this lineage to the rest of the country10–12,14,17,26–29. The Bayesian phylogeographic approach employed in this study has revealed the main routes of dispersion of CHIKV in the country. After being introduced in the Northeast region, the ECSA lineage dispersed towards several states from all five regions in Brazil. Both the Bayesian phylogeographic and transmission network approaches indicated the Northeast region has been acting as the leading hub of virus spread towards other regions in the country, including by forming an intense virus exchange network with the Southeast region. From the Southeast, the virus dispersed to the southern state of Paraná (posterior probability of 0.96) with the time of the most recent common ancestor shared with samples from São Paulo estimated to be January 2020. In addition to spread in Brazil, CHIKV has also extended its circulation to other countries such as Paraguay and Haiti. CHIKV circulation in Paraguay and Haiti has been previously associated with transmission events originating from Brazil through international viral exchange likely mediated by human movement30–32. These results evidence the lineage’s potential expansion across Latin America, despite limited information available about this lineage in the region as evidenced by the lack of CHIKV ECSA sequences from other Latin American countries.
Our time-measured phylogeny corroborated previous studies and revealed emerging branching patterns, mainly represented by two well-supported distinct subclades named clade I and II. Both these clades were estimated to emerge in the Southeast region around the first months of 2018. At the end of that year, the Southeast region accounted for more than 65 thousand reported cases. Moreover, differences in sequence composition were observed between these clades, with clade I being more diverse as it comprises sequences from four different regions (Northeast, Midwest, Southeast and South) collected in the years 2021—2022, while clade II contains mostly sequences from northeastern states collected in 2022. This difference in the clades sequence composition profile might change as the lineage continue to expand into the country. The observed differences in geographic diversity between clades might reflect distinct transmission networks underlying the divergence and expansion of these subclades.
Different viral lineages might be under distinct evolutionary pressure that together with the emergence and selection of mutations can drive viral adaption to a particular environment33,34. It has been argued that two different CHIKV lineages, IOL and Asian, have undergone different evolutionary trajectories leading to different vector adaptative potentials35. Here, we used the ratio of non-synonymous (dN) to synonymous (dS) nucleotide substitutions in the CHIKV envelope gene to assess the selective regime to which clades I and II might be subjected. Our analysis revealed that both clades have experienced positive diversifying selection. Such evolutionary pressure might be driven by the host antiviral immune response, as the viral envelope protein is a target for neutralizing antibodies36,37. Mutations in the envelope proteins have been implicated in the increased adaptation and transmission of the CHIKV IOL lineage in Aedes albopictus mosquitoes22,38.
A higher ratio of nonsynonymous substitutions is observed under a positive selection regime promoted by virus-host interactions39. Since SNVs continue to arise in RNA virus populations mainly driven by errors made by the virus replication complex that lead to genetic diversity, we compared the mutational profiles of clades I and II40. We identified several SNVs across nonstructural and structural protein genes that were exclusive to each clade. Clade II presented more SNVs (n=27) than clade I (n=13), of which three are nonsynonymous substitutions (E1-T288I; nsP2-P352A, and nsP4-A43V). Literature research revealed that E1-T288I change was previously identified in a 2017 sequence from Iran and also in CHIKV sequences collected in 2016 from infected cancer patients in Rio de Janeiro, Brazil41,42. The nsP2-P352A substitution was also present in sequences collected between 2016 and 2017 in Rio de Janeiro14,43. In turn, clade I sequences contain six non-synonymous mutations across nonstructural (nsP2 and nsP4) and structural protein genes (capsid, 6k and E2). The nsP4-V555I change was previously detected in sequences from Thailand in 2008–200944. The E2-L248F from clade I has been reported in Asian lineage sequences from Colombia (2014–2015) and Philippines (2012)45,46. Isolates from Thailand, Indonesia, Lao PDR, Cameroon and India also presented the 6K-I54V mutation observed in clade I44,47–50. Despite the detection of these mutations in different countries (indicating homoplasy) by other previous studies, there is no information about the functional impact of such substitutions on CHIKV fitness, thus warranting further experimental studies to elucidate the potential effects of SNVs on lineage-specific evolutionary adaptation. It has been argued that not only non-synonymous mutations have the potential to promote adaptive changes but also synonymous mutations can lead to changes in the viral RNA that can drive differential viral gene expression51.
Mutational analysis of the 422 sequences generated in this study revealed a higher amount of C>T and T>C transitions followed by A>G and G>A substitutions in the CHIKV genome and several genome positions presenting these transitions with higher frequency across all new sequences. Moreover, Clade II has a significantly higher median frequency of C>T, A>G, and T>C transitions compared to clade I. Although this study cannot experimentally establish the significance of these transitions for CHIKV evolutionary adaptation, other studies have associated this mutational pattern with the action of host antiviral immune repose mediated by AID/APOBEC and ADAR families of deaminases52,53. These enzymes are part of the interferon-stimulated innate immune response and promote viral genome transitions mutations by catalyzing the deamination of adenosine to inosine to cause A>G/T>C (by ADAR) substitutions or deamination of cytosine to uracil that leads to C>T/G>A (by AID/APOBEC) mutations54. This RNA editing process has been experimentally observed targeting specific viral sequences of SARS-CoV-2 to produce C>T transitions and increasing viral replication in Caco-2 cells, thus promoting viral increased fitness and adaptative evolution55. However, specific information about the effect of these RNA editing processes on the CHIKV genome remains elusive, although the APOBEC3A gene has been observed up-regulated in the expression profile of CHIKV-infected patients56. Despite arguments that these transitions happen mainly in phylogenetically uninformative sites, the available evidence indicates that RNA editing processes might act as a significant driver of viral sequence diversity and evolutionary adaptation through the introduction of nucleotide changes53,57.
Recurring CHIKF epidemics, as indicated by the seasonal peak patterns displayed in the case time-series plot, are evidence that the virus is endemic in Brazil. Human mobility, population immunity, vector suitability, vegetation coverage, site socioeconomic status, and viral sequence variation are factors considered to mediate the dispersal of CHIKV in Brazil26,58,59. Although we did not find the E1:226V Aedes Albopictus-adaptive mutations in the Brazilian sequences, the high abundance in the region of widely spread competent vectors, such as A. aegypti and Ae. Albopictus, together with favorable climatic and social conditions in large urban centers create conditions that modify the adaptive landscape of CHIKV, which in turn can allow the continued expansion of the ECSA lineage in the country with a resultant increased impact on public health23,60,61. Therefore, public health measures should be undertaken to ensure continuous genomic surveillance of circulating CHIKV variants which can help to identify viral transmission routes where focused vector control strategies could be employed to reduce the risk of recurring CHIKF epidemics.
Limitations
Although our study presented the results of a Bayesian phylogeographic and mutational profile analysis performed on 422 new sequences collected from 12 Brazilian states, not all states were evenly represented in our dataset, which might limit our estimates relative to divergence time and ancestral location reconstructions, prompting careful interpretation of the results presented here. Ongoing sequencing efforts across the country could reduce this disparity in the future. Moreover, although single nucleotide substitutions identified among the new sequences offer insights into the evolutionary dynamics of CHIKV in Brazil, further functional studies need to be undertaken to elucidate the actual adaptive effect of these mutations.
Methods
Sample collection
Residual samples (serum or plasma) were obtained from the epidemiological surveillance routine of the Brazilian Central Public Health Laboratories (LACEN) from different states (Alagoas, Bahia, Goiás, Paraíba, Paraná, Pernambuco, Piauí, Maranhão, Minas Gerais, Mato Grosso do Sul, Rio Grande do Norte, and Sergipe). These samples were submitted to nucleic acid purification using the MagMax Viral RNA Mini kit (Thermo Fischer Scientific), following the manufacturer’s recommendations, and were previously screened by each LACEN. CHIKV RT-qPCR positive samples were selected for sequencing based on the cycle threshold value ≤ 30 and the availability of demographic metadata such as sex, age, and municipality of residency. These demographic patient data were provided by LACENs and were collected through a questionnaire filled out at local health care services.
Ethical statement
The project was approved by the Pan American World Health Organization (PAHO) and the Brazilian Ministry of Health (MoH) as part of the arboviral genomic surveillance efforts within the terms of Resolution 510/2016 of CONEP (Comissão Nacional de Ética em Pesquisa, Ministério da Saúde; National Ethical Committee for Research, Ministry of Health). This authorizes the use of clinical samples collected in the Brazilian Central Public Health Laboratories to accelerate knowledge building and contribute to surveillance and outbreak response. The study protocol was reviewed and approved by Research Ethics Committee of the Universidade Federal de Minas Gerais with approval No. 32912820.6.1001.5149. Personally identifying information were de-identified in the datasets and tables in a way that minimizes the risk of unintended disclosure of identity of individuals and information about them.
cDNA synthesis and whole genome sequencing
Extracted RNA from positive CHIKV samples were provided by collaborating LACENs and submitted to cDNA synthesis and PCR, using a sequencing protocol based on multiplex PCR tiling amplicon approach design for MinION nanopore sequencing62. All reactions were performed at biosafety level 2 facilities and using no template controls. PCR products were purified using 1x AMpure beads Beckman Coulter, UK) and quantified using Qubit 3.0 instrument (Life Technologies) and the Qubit dsDNA High Sensitivity assay. DNA library preparation was performed on all amplified samples using the Ligation Sequencing Kit (Oxford Nanopore Technologies). Individual samples were barcoded using the Native Barcoding Kit (NBD104, Oxford Nanopore Technologies, Oxford, UK). Sequencing library was loaded onto a R9.4 flow cell and data were collected for up to 48 sequencing hours.
Generation of consensus sequences
Basecalling of raw FAST5 files and demultiplex of barcodes were performed using the software Guppy (https://github.com/nanoporetech). Consensus sequences were generated by a hybrid assembling approach implemented on Genome Detective (https://www.genomedetective.com/)63.
Phylogenetic reconstruction
We used MAFFT to align 422 new sequences (with coverage over >60% according to Thézé et al.66, samples 736.22_RED, FS0116, and FS0132 were discarded) in addition to 1,565 CHIKV whole genome sequences publicly available in NCBI up to August 2022, forming a global dataset (n=1,987) that includes all lineages. This global dataset was used to infer a Maximum Likelihood (ML) phylogeny using the IQ-TREE 2.1.1 software64,65. Statistical support for tree nodes was estimated using the ultrafast bootstrap (UFBoot) feature implemented in IQ-TREE with 1,000 replicates. We then used the ML tree from the global dataset to extract the Brazilian ECSA clade and use it to form a second dataset (total n=713, 706 Brazilian sequences, 2 from Haiti, and 5 from Paraguay; sequences with genome coverage >60% according to Thézé et al.66) which was used to infer a time-scaled phylogeny using BEAST v1.10.4. First, we investigated the temporal signal regressing root-to-tip genetic distances from this ML tree against sample collection dates using TempEst v.1.5.167. Secondly, we employed a stringent model selection analysis using both path-sampling (PS) and stepping-stone (SS) procedures to estimate the most appropriate molecular clock model for the Bayesian phylogenetic analysis68. For the Bayesian analysis, the uncorrelated relaxed molecular clock was chosen as indicated by estimating marginal likelihoods, also employing the HKY+G4 nucleotide substitution model, and the nonparametric Bayesian Skyline coalescent model. We combined two independent runs of 200 million states each69. The convergence of MCMC chains was checked using Tracer70. Maximum clade credibility (MCC) trees were summarized using TreeAnnotator after discarding 10% as burn-in.
CHIKV ECSA lineage movements across Brazil were investigated using the Bayesian phylogeographic approach with a discrete trait phylogenetic model. A trait file was used to discretize sequences sapling location by five Brazilian regions (North, Northeast, Southeast, South and Midwest). For this analysis, we downsampled our Brazilian ECSA clade to a dataset containing 471 sequences to maximize the temporal signal in the dataset. MCMC analyses were performed in BEAST v1.10.4, running in duplicate for 200 million interactions and sampling every 20,000 steps in the chain. Convergence for each run was assessed in Tracer. MCC trees for each run were summarized using TreeAnnotator after discarding the initial 10% as burn-in. Finally, we used SPREAD 4 tool to map spatiotemporal information embedded in the MCC trees71.
Transmission network analysis
A transmission network was reconstructed, using the StrainHub tool, from transition states summarized from the Bayesian phylogeography72. Centrality metrics on the tree nodes were also estimated for the network.
Comparative mutational analysis
For comparative mutational analysis, we assembled separate alignment datasets for each subclade (clade I = 327; clade II = 168) and for all new sequences (n=422). The sequence datasets were compared against the NCBI reference strain NC_004162.2 using MALVIRUS73. We filtered and selected substitutions with an occurrence frequency of 100% across the whole dataset and substitutions with a frequency above 60% across the envelope genes E1 and E2.
Selection pressure analysis
Since the ratio of non-synonymous (dN) to synonymous (dS) nucleotide substitutions can be used to study selection pressure on genomic sequences, we used the HYPHY software package that employs statistical methods that estimate the dN/dS to detect diversifying selection74. For that we performed an alignment of the envelope gene sequences of 720 Brazilian isolates and used BUSTED (restricting the analysis to each subclade I or II), an alignment-wide method implemented in HYPHY, that aims to detect evidence of episodic diversifying selection75. For comparison, we also used a different method, called MEME, a site-level approach that aims to detect evidence of both pervasive and episodic diversifying selection at individual sites (also restricting the analysis to each subclade I or II)76.
Epidemic curves from Chikungunya cases reported in Brazil
Data of weekly notified and laboratory-confirmed cases of infection by CHIKV in Brazil from 2014 to 2022 were supplied by the Brazilian Ministry of Health (BrMoH)(Ministério da Saúde, 2022). These data were used to calculate incidence and to plot time series charts.
Supplementary Material
Acknowledgements
This work was supported by The Pan American Health Organization (PAHO/WHO), the Brazilian Ministry of Health grant SCON2021-00180 (Coordenação Geral de Laboratório de Saúde Pública-CGLAB and Coordenação Geral de Vigilância de Arboviroses-CGARB), the National Institutes of Health USA grant U01 AI151698 for the United World Arbovirus Research Network (UWARN). We thank the State Health Secretariats and the epidemiological surveillance services of Brazilian states and municipalities for all the support provided. JX is supported by Coordenação de Aperfeiçoamento de Pessoal de Nível Superior-Brasil (CAPES)-Finance Code 001. MG is funded by PON “Ricerca e Innovazione” 2014–2020.
Footnotes
Declaration of interests
The authors declare no competing interests.
Data and code availability
New sequences have been deposited in GenBank under accession numbers [available soon]. Input files used for the phylogenetic and mutational profile analyses will be available at the GitHub repository https://github.com/genomicsurveillance.
References
- 1.Mourad O., Makhani L., and Chen L.H. (2022). Chikungunya: an emerging public health concern. Curr. Infect. Dis. Rep. 24, 217–228. 10.1007/s11908-022-00789-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Chala B., and Hamde F. (2021). Emerging and Re-emerging Vector-Borne Infectious Diseases and the Challenges for Control: A Review. Front. Public Health 9, 715759. 10.3389/fpubh.2021.715759. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.World Health Organization (2017). Global vector control response 2017–2030 (World Health Organization; ). [Google Scholar]
- 4.World Health Organization (2022). 10 proposals to build a safer world together – Strengthening the Global Architecture for Health Emergency Preparedness, Response and Resilience (WHO Health Emergencies Programme; ). [Google Scholar]
- 5.Adelino T.É.R., Giovanetti M., Fonseca V., Xavier J., de Abreu Á.S., do Nascimento V.A., Demarchi L.H.F., Oliveira M.A.A., da Silva V.L., de Mello A.L.E.S., et al. (2021). Field and classroom initiatives for portable sequence-based monitoring of dengue virus in Brazil. Nat. Commun. 12, 2296. 10.1038/s41467-021-22607-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Faria N.R., Quick J., Claro I.M., Thézé J., de Jesus J.G., Giovanetti M., Kraemer M.U.G., Hill S.C., Black A., da Costa A.C., et al. (2017). Establishment and cryptic transmission of Zika virus in Brazil and the Americas. Nature 546, 406–410. 10.1038/nature22401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Costa É.A., Giovanetti M., Silva Catenacci L., Fonseca V., Aburjaile F.F., Chalhoub F.L.L., Xavier J., Campos de Melo Iani F., da Cunha E Silva Vieira M.A., Freitas Henriques D., et al. (2021). West nile virus in brazil. Pathogens 10. 10.3390/pathogens10070896. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Faria N.R., Kraemer M.U.G., Hill S.C., Goes de Jesus J., Aguiar R.S., Iani F.C.M., Xavier J., Quick J., du Plessis L., Dellicour S., et al. (2018). Genomic and epidemiological monitoring of yellow fever virus transmission potential. Science 361, 894–899. 10.1126/science.aat7115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Giovanetti M., Pereira L.A., Adelino T.É.R., Fonseca V., Xavier J., de Araújo Fabri A., Slavov S.N., da Silva Lemos P., de Almeida Marques W., Kashima S., et al. (2022). A retrospective overview of zika virus evolution in the midwest of brazil. Microbiol. Spectr. 10, e0015522. 10.1128/spectrum.00155-22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Naveca F.G., Claro I., Giovanetti M., de Jesus J.G., Xavier J., Iani F.C. de M., do Nascimento V.A., de Souza V.C., Silveira P.P., Lourenço J., et al. (2019). Genomic, epidemiological and digital surveillance of Chikungunya virus in the Brazilian Amazon. PLoS Negl. Trop. Dis. 13, e0007065. 10.1371/journal.pntd.0007065. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Xavier J., Giovanetti M., Fonseca V., Thézé J., Gräf T., Fabri A., Goes de Jesus J., Lima de Mendonça M.C., Damasceno Dos Santos Rodrigues C., Mares-Guia M.A., et al. (2019). Circulation of chikungunya virus East/Central/South African lineage in Rio de Janeiro, Brazil. PLoS ONE 14, e0217871. 10.1371/journal.pone.0217871. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Nunes M.R.T., Faria N.R., de Vasconcelos J.M., Golding N., Kraemer M.U.G., de Oliveira L.F., Azevedo R. do S. da S., da Silva D.E.A., da Silva E.V.P., da Silva S.P., et al. (2015). Emergence and potential for spread of Chikungunya virus in Brazil. BMC Med. 13, 102. 10.1186/s12916-015-0348-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Pan American Health Organization (2023). Cases of chikungunya virus disease. PAHO/WHO Data - Weekly Report. https://www3.paho.org/data/index.php/en/mnu-topics/chikv-en/550-chikv-weekly-en.html.
- 14.Souza T.M.L., Vieira Y.R., Delatorre E., Barbosa-Lima G., Luiz R.L.F., Vizzoni A., Jain K., Miranda M.M., Bhuva N., Gogarten J.F., et al. (2019). Emergence of the East-Central-South-African genotype of Chikungunya virus in Brazil and the city of Rio de Janeiro may have occurred years before surveillance detection. Sci. Rep. 9, 2760. 10.1038/s41598-019-39406-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.de Lima S.T.S., de Souza W.M., Cavalcante J.W., da Silva Candido D., Fumagalli M.J., Carrera J.-P., Simões Mello L.M., De Carvalho Araújo F.M., Cavalcante Ramalho I.L., de Almeida Barreto F.K., et al. (2021). Fatal outcome of chikungunya virus infection in brazil. Clin. Infect. Dis. 73, e2436–e2443. 10.1093/cid/ciaa1038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Robishaw J.D., Alter S.M., Solano J.J., Shih R.D., DeMets D.L., Maki D.G., and Hennekens C.H. (2021). Genomic surveillance to combat COVID-19: challenges and opportunities. Lancet Microbe 2, e481–e484. 10.1016/S2666-5247(21)00121-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Xavier J., Fonseca V., Bezerra J.F., do Monte Alves M., Mares-Guia M.A., Claro I.M., de Jesus R., Adelino T., Araújo E., Cavalcante K.R.L.J., et al. (2021). Chikungunya virus ECSA lineage reintroduction in the northeasternmost region of Brazil. Int. J. Infect. Dis. 105, 120–123. 10.1016/j.ijid.2021.01.026. [DOI] [PubMed] [Google Scholar]
- 18.Garcia-Beltran W.F., Lam E.C., St Denis K., Nitido A.D., Garcia Z.H., Hauser B.M., Feldman J., Pavlovic M.N., Gregory D.J., Poznansky M.C., et al. (2021). Multiple SARS-CoV-2 variants escape neutralization by vaccine-induced humoral immunity. Cell 184, 2372–2383.e9. 10.1016/j.cell.2021.03.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Powers A.M. (2018). Vaccine and therapeutic options to control chikungunya virus. Clin. Microbiol. Rev. 31. 10.1128/CMR.00104-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.ClinicalTrials.gov (2022). Pivotal Study to Evaluate Safety and Immunogenicity of a Live-Attenuated Chikungunya Virus Vaccine Candidate in Adults - Full Text View - ClinicalTrials.gov. ClinicalTrials.gov. https://clinicaltrials.gov/ct2/show/NCT04546724?term=NCT04546724&draw=2&rank=1.
- 21.Bennett S.R., McCarty J.M., Ramanathan R., Mendy J., Richardson J.S., Smith J., Alexander J., Ledgerwood J.E., de Lame P.-A., Royalty Tredo S., et al. (2022). Safety and immunogenicity of PXVX0317, an aluminium hydroxide-adjuvanted chikungunya virus-like particle vaccine: a randomised, double-blind, parallel-group, phase 2 trial. Lancet Infect. Dis. 22, 1343–1355. 10.1016/S1473-3099(22)00226-2. [DOI] [PubMed] [Google Scholar]
- 22.Tsetsarkin K.A., Chen R., Yun R., Rossi S.L., Plante K.S., Guerbois M., Forrester N., Perng G.C., Sreekumar E., Leal G., et al. (2014). Multi-peaked adaptive landscape for chikungunya virus evolution predicts continued fitness optimization in Aedes albopictus mosquitoes. Nat. Commun. 5, 4084. 10.1038/ncomms5084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Pancetti F.G.M., Honório N.A., Urbinatti P.R., and Lima-Camara T.N. (2015). Twenty-eight years of Aedes albopictus in Brazil: a rationale to maintain active entomological and epidemiological surveillance. Rev. Soc. Bras. Med. Trop. 48, 87–89. 10.1590/0037-8682-0155-2014. [DOI] [PubMed] [Google Scholar]
- 24.Carvalho R.G., Lourenço-de-Oliveira R., and Braga I.A. (2014). Updating the geographical distribution and frequency of Aedes albopictus in Brazil with remarks regarding its range in the Americas. Mem. Inst. Oswaldo Cruz 109, 787–796. 10.1590/0074-0276140304. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Grubaugh N.D., Ladner J.T., Lemey P., Pybus O.G., Rambaut A., Holmes E.C., and Andersen K.G. (2019). Tracking virus outbreaks in the twenty-first century. Nat. Microbiol. 4, 10–19. 10.1038/s41564-018-0296-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Charlys da Costa A., Thézé J., Komninakis S.C.V., Sanz-Duro R.L., Felinto M.R.L., Moura L.C.C., Barroso I.M. de O., Santos L.E.C., Nunes M.A. de L., Moura A.A., et al. (2017). Spread of chikungunya virus east/central/south african genotype in northeast brazil. Emerging Infect. Dis. 23, 1742–1744. 10.3201/eid2310.170307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Maeda A.Y., Nogueira J.S., Campos K.R., Camargo C.H., da Silva Vasami F.G., Arvigo A.P.B., Santos M.B.N., Abbud A., and Sacchi C.T. (2022). Circulation of Chikungunya virus East-Central-South African genotype during the 2020–21 outbreak in São Paulo State, Brazil. Journal of Clinical Virology Plus 2, 100070. 10.1016/j.jcvp.2022.100070. [DOI] [Google Scholar]
- 28.de Castro Moreira D., Junior F.F.D., Júnior J.R.P., Jorge F.A., Dos Santos Rando F., Thomazella M.V., Presibella M.M., Riediger I.N., Fernandez M.A., and Bertolini D.A. (2022). Genetic characterization of Chikungunya virus circulating in individuals from Paraná, Brazil. Braz. J. Microbiol. 53, 641–645. 10.1007/s42770-022-00680-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Teixeira M.G., Andrade A.M.S., Costa M. da C.N., Castro J.-N.S.M., Oliveira F.L.S., Goes C.S.B., Maia M., Santana E.B., Nunes B.T.D., and Vasconcelos P.F.C. (2015). East/Central/South African genotype chikungunya virus, Brazil, 2014. Emerging Infect. Dis. 21, 906–907. 10.3201/eid2105.141727. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.White S.K., Mavian C., Salemi M., Morris J.G., Elbadry M.A., Okech B.A., Lednicky J.A., and Dunford J.C. (2018). A new “American” subgroup of African-lineage Chikungunya virus detected in and isolated from mosquitoes collected in Haiti, 2016. PLoS ONE 13, e0196857. 10.1371/journal.pone.0196857. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Gräf T., Vazquez C., Giovanetti M., de Bruycker-Nogueira F., Fonseca V., Claro I.M., de Jesus J.G., Gómez A., Xavier J., de Mendonça M.C.L., et al. (2021). Epidemiologic history and genetic diversity origins of chikungunya and dengue viruses, paraguay. Emerging Infect. Dis. 27, 1393–1404. 10.3201/eid2705.204244. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.de Oliveira E.C., Fonseca V., Xavier J., Adelino T., Morales Claro I., Fabri A., Marques Macario E., Viniski A.E., Campos Souza C.L., Gomes da Costa E.S., et al. (2021). Short report: Introduction of chikungunya virus ECSA genotype into the Brazilian Midwest and its dispersion through the Americas. PLoS Negl. Trop. Dis. 15, e0009290. 10.1371/journal.pntd.0009290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Spielman S.J., Weaver S., Shank S.D., Magalis B.R., Li M., and Kosakovsky Pond S.L. (2019). Evolution of viral genomes: interplay between selection, recombination, and other forces. Methods Mol. Biol. 1910, 427–468. 10.1007/978-1-4939-9074-0_14. [DOI] [PubMed] [Google Scholar]
- 34.Yu X., and Cheng G. (2022). Adaptive Evolution as a Driving Force of the Emergence and Re-Emergence of Mosquito-Borne Viral Diseases. Viruses 14. 10.3390/v14020435. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Chen R., Plante J.A., Plante K.S., Yun R., Shinde D., Liu J., Haller S., Mukhopadhyay S., and Weaver S.C. (2021). Lineage Divergence and Vector-Specific Adaptation Have Driven Chikungunya Virus onto Multiple Adaptive Landscapes. MBio 12, e0273821. 10.1128/mBio.02738-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Hakim M.S., Annisa L., Gazali F.M., and Aman A.T. (2022). The origin and continuing adaptive evolution of chikungunya virus. Arch. Virol. 167, 2443–2455. 10.1007/s00705-022-05570-z. [DOI] [PubMed] [Google Scholar]
- 37.Quiroz J.A., Malonis R.J., Thackray L.B., Cohen C.A., Pallesen J., Jangra R.K., Brown R.S., Hofmann D., Holtsberg F.W., Shulenin S., et al. (2019). Human monoclonal antibodies against chikungunya virus target multiple distinct epitopes in the E1 and E2 glycoproteins. PLoS Pathog. 15, e1008061. 10.1371/journal.ppat.1008061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Tsetsarkin K.A., Vanlandingham D.L., McGee C.E., and Higgs S. (2007). A single mutation in chikungunya virus affects vector specificity and epidemic potential. PLoS Pathog. 3, e201. 10.1371/journal.ppat.0030201. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Duggal N.K., and Emerman M. (2012). Evolutionary conflicts between viruses and restriction factors shape immunity. Nat. Rev. Immunol. 12, 687–695. 10.1038/nri3295. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Collins N.D., Beck A.S., Widen S.G., Wood T.G., Higgs S., and Barrett A.D.T. (2018). Structural and nonstructural genes contribute to the genetic diversity of RNA viruses. MBio 9. 10.1128/mBio.01871-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Pouriayevali M.H., Rezaei F., Jalali T., Baniasadi V., Fazlalipour M., Mostafavi E., Khakifirouz S., Mohammadi T., Fereydooni Z., Tavakoli M., et al. (2019). Imported cases of Chikungunya virus in Iran. BMC Infect. Dis. 19, 1004. 10.1186/s12879-019-4637-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Familiar-Macedo D., Gama B.E., Emmel V.E., Vera-Lozada G., Abdelhay E., Martins I.S., and Hassan R. (2022). Molecular aspects of Chikungunya virus infections in cancer patients. Mem. Inst. Oswaldo Cruz 117, e210383. 10.1590/0074-02760210383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Cunha M.S., Cruz N.V.G., Schnellrath L.C., Medaglia M.L.G., Casotto M.E., Albano R.M., Costa L.J., and Damaso C.R. (2017). Autochthonous transmission of east/central/south african genotype chikungunya virus, brazil. Emerging Infect. Dis. 23, 1737–1739. 10.3201/eid2310.161855. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Pongsiri P., Auksornkitti V., Theamboonlers A., Luplertlop N., Rianthavorn P., and Poovorawan Y. (2010). Entire genome characterization of Chikungunya virus from the 2008–2009 outbreaks in Thailand. Trop. Biomed. 27, 167–176. [PubMed] [Google Scholar]
- 45.Rodas J.D., Kautz T., Camacho E., Paternina L., Guzmán H., Díaz F.J., Blanco P., Tesh R., and Weaver S.C. (2016). Genetic Characterization of Northwestern Colombian Chikungunya Virus Strains from the 2014–2015 Epidemic. Am. J. Trop. Med. Hyg. 95, 639–646. 10.4269/ajtmh.16-0091. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Tan K.-K., Sy A.K.D., Tandoc A.O., Khoo J.-J., Sulaiman S., Chang L.-Y., and AbuBakar S. (2015). Independent emergence of the cosmopolitan asian chikungunya virus, philippines 2012. Sci. Rep. 5, 12279. 10.1038/srep12279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Hakim M.S., Annisa L., and Aman A.T. (2023). The evolution of chikungunya virus circulating in Indonesia: Sequence analysis of the orf2 gene encoding the viral structural proteins. Int. Microbiol. 10.1007/s10123-023-00337-1. [DOI] [PubMed] [Google Scholar]
- 48.Khan N., Bhat R., Jain V., Raghavendhar B S., Patel A.K., Nayak K., Chandele A., Murali-Krishna K., and Ray P. (2021). Epidemiology and molecular characterization of chikungunya virus from human cases in North India, 2016. Microbiol. Immunol. 65, 290–301. 10.1111/1348-0421.12869. [DOI] [PubMed] [Google Scholar]
- 49.Calvez E., Bounmany P., Somlor S., Xaybounsou T., Viengphouthong S., Keosenhom S., Brey P.T., Lacoste V., and Grandadam M. (2022). Multiple chikungunya virus introductions in Lao PDR from 2014 to 2020. PLoS ONE 17, e0271439. 10.1371/journal.pone.0271439. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Agbodzi B., Yousseu F.B.S., Simo F.B.N., Kumordjie S., Yeboah C., Mosore M.-T., Bentil R.E., Prieto K., Colston S.M., Attram N., et al. (2021). Chikungunya viruses containing the A226V mutation detected retrospectively in Cameroon form a new geographical subclade. Int. J. Infect. Dis. 113, 65–73. 10.1016/j.ijid.2021.09.058. [DOI] [PubMed] [Google Scholar]
- 51.Faure G., Ogurtsov A.Y., Shabalina S.A., and Koonin E.V. (2017). Adaptation of mRNA structure to control protein folding. RNA Biol. 14, 1649–1654. 10.1080/15476286.2017.1349047. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Chaw S.-M., Tai J.-H., Chen S.-L., Hsieh C.-H., Chang S.-Y., Yeh S.-H., Yang W.-S., Chen P.-J., and Wang H.-Y. (2020). The origin and underlying driving forces of the SARS-CoV-2 outbreak. J. Biomed. Sci. 27, 73. 10.1186/s12929-020-00665-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Simmonds P., and Ansari M.A. (2021). Extensive C->U transition biases in the genomes of a wide range of mammalian RNA viruses; potential associations with transcriptional mutations, damage- or host-mediated editing of viral RNA. PLoS Pathog. 17, e1009596. 10.1371/journal.ppat.1009596. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Pecori R., Di Giorgio S., Paulo Lorenzo J., and Nina Papavasiliou F. (2022). Functions and consequences of AID/APOBEC-mediated DNA and RNA deamination. Nat. Rev. Genet. 23, 505–518. 10.1038/s41576-022-00459-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Kim K., Calabrese P., Wang S., Qin C., Rao Y., Feng P., and Chen X.S. (2022). The roles of APOBEC-mediated RNA editing in SARS-CoV-2 mutations, replication and fitness. Sci. Rep. 12, 14972. 10.1038/s41598-022-19067-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Soares-Schanoski A., Baptista Cruz N., de Castro-Jorge L.A., de Carvalho R.V.H., Santos C.A.D., Rós N. da, Oliveira Ú., Costa D.D., Santos C.L.S.D., Cunha M.D.P., et al. (2019). Systems analysis of subjects acutely infected with the Chikungunya virus. PLoS Pathog. 15, e1007880. 10.1371/journal.ppat.1007880. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Piontkivska H., Wales-McGrath B., Miyamoto M., and Wayne M.L. (2021). ADAR Editing in Viruses: An Evolutionary Force to Reckon with. Genome Biol. Evol. 13. 10.1093/gbe/evab240. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Churakov M., Villabona-Arenas C.J., Kraemer M.U.G., Salje H., and Cauchemez S. (2019). Spatio-temporal dynamics of dengue in Brazil: Seasonal travelling waves and determinants of regional synchrony. PLoS Negl. Trop. Dis. 13, e0007012. 10.1371/journal.pntd.0007012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Freitas L.P., Schmidt A.M., Cossich W., Cruz O.G., and Carvalho M.S. (2021). Spatio-temporal modelling of the first Chikungunya epidemic in an intra-urban setting: The role of socioeconomic status, environment and temperature. PLoS Negl. Trop. Dis. 15, e0009537. 10.1371/journal.pntd.0009537. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.de Melo Ximenes M. de F.F., de Araújo Galvão J.M., Inacio C.L.S., Macêdo E Silva V.P., Pereira R.L. do N., Pinheiro M.P.G., de Medeiros Silva M.M., and Gomes C.E.S. (2020). Arbovirus expansion: New species of culicids infected by the Chikungunya virus in an urban park of Brazil. Acta Trop. 209, 105538. 10.1016/j.actatropica.2020.105538. [DOI] [PubMed] [Google Scholar]
- 61.Costa-da-Silva A.L., Ioshino R.S., Petersen V., Lima A.F., Cunha M.D.P., Wiley M.R., Ladner J.T., Prieto K., Palacios G., Costa D.D., et al. (2017). First report of naturally infected Aedes aegypti with chikungunya virus genotype ECSA in the Americas. PLoS Negl. Trop. Dis. 11, e0005630. 10.1371/journal.pntd.0005630. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Quick J., Grubaugh N.D., Pullan S.T., Claro I.M., Smith A.D., Gangavarapu K., Oliveira G., Robles-Sikisaka R., Rogers T.F., Beutler N.A., et al. (2017). Multiplex PCR method for MinION and Illumina sequencing of Zika and other virus genomes directly from clinical samples. Nat. Protoc. 12, 1261–1276. 10.1038/nprot.2017.066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Vilsker M., Moosa Y., Nooij S., Fonseca V., Ghysens Y., Dumon K., Pauwels R., Alcantara L.C., Vanden Eynden E., Vandamme A.-M., et al. (2019). Genome Detective: an automated system for virus identification from high-throughput sequencing data. Bioinformatics 35, 871–873. 10.1093/bioinformatics/bty695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Nguyen L.-T., Schmidt H.A., von Haeseler A., and Minh B.Q. (2015). IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274. 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Katoh K., Kuma K., Toh H., and Miyata T. (2005). MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Res. 33, 511–518. 10.1093/nar/gki198. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Thézé J., Li T., du Plessis L., Bouquet J., Kraemer M.U.G., Somasekar S., Yu G., de Cesare M., Balmaseda A., Kuan G., et al. (2018). Genomic epidemiology reconstructs the introduction and spread of zika virus in central america and mexico. Cell Host Microbe 23, 855–864.e7. 10.1016/j.chom.2018.04.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Rambaut A., Lam T.T., Max Carvalho L., and Pybus O.G. (2016). Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen). Virus Evol. 2, vew007. 10.1093/ve/vew007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Baele G., Lemey P., Bedford T., Rambaut A., Suchard M.A., and Alekseyenko A.V. (2012). Improving the accuracy of demographic and molecular clock model comparison while accommodating phylogenetic uncertainty. Mol. Biol. Evol. 29, 2157–2167. 10.1093/molbev/mss084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Suchard M.A., Lemey P., Baele G., Ayres D.L., Drummond A.J., and Rambaut A. (2018). Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evol. 4, vey016. 10.1093/ve/vey016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Rambaut A., Drummond A.J., Xie D., Baele G., and Suchard M.A. (2018). Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Syst. Biol. 67, 901–904. 10.1093/sysbio/syy032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Nahata K.D., Bielejec F., Monetta J., Dellicour S., Rambaut A., Suchard M.A., Baele G., and Lemey P. (2022). SPREAD 4: online visualisation of pathogen phylogeographic reconstructions. Virus Evol. 8, veac088. 10.1093/ve/veac088. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.de Bernardi Schneider A., Ford C.T., Hostager R., Williams J., Cioce M., Çatalyürek Ü.V., Wertheim J.O., and Janies D. (2020). StrainHub: a phylogenetic tool to construct pathogen transmission networks. Bioinformatics 36, 945–947. 10.1093/bioinformatics/btz646. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Ciccolella S., Denti L., Bonizzoni P., Della Vedova G., Pirola Y., and Previtali M. (2022). MALVIRUS: an integrated application for viral variant analysis. BMC Bioinformatics 22, 625. 10.1186/s12859-022-04668-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Kosakovsky Pond S.L., Poon A.F.Y., Velazquez R., Weaver S., Hepler N.L., Murrell B., Shank S.D., Magalis B.R., Bouvier D., Nekrutenko A., et al. (2020). HyPhy 2.5-A Customizable Platform for Evolutionary Hypothesis Testing Using Phylogenies. Mol. Biol. Evol. 37, 295–299. 10.1093/molbev/msz197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Murrell B., Weaver S., Smith M.D., Wertheim J.O., Murrell S., Aylward A., Eren K., Pollner T., Martin D.P., Smith D.M., et al. (2015). Gene-wide identification of episodic selection. Mol. Biol. Evol. 32, 1365–1371. 10.1093/molbev/msv035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Murrell B., Wertheim J.O., Moola S., Weighill T., Scheffler K., and Kosakovsky Pond S.L. (2012). Detecting individual sites subject to episodic diversifying selection. PLoS Genet. 8, e1002764. 10.1371/journal.pgen.1002764. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
New sequences have been deposited in GenBank under accession numbers [available soon]. Input files used for the phylogenetic and mutational profile analyses will be available at the GitHub repository https://github.com/genomicsurveillance.