Abstract
In Mexico, the BA.4 and BA.5 Omicron variants dominated the fifth epidemic wave (summer 2022), superseding BA.2, which had circulated during the inter-wave period. The present study uses genome sequencing and statistical and phylogenetic analyses to examine these variants' abundance, distribution, and genetic diversity in Mexico from April to August 2022. Over 35 % of the sequenced genomes in this period corresponded to the BA.2 variant, 8 % to the BA.4 and 56 % to the BA.5 variant. Multiple subvariants were identified, but the most abundant, BA.2.9, BA.2.12.1, BA.5.1, BA.5.2, BA.5.2.1 and BA.4.1, circulated across the entire country, not forming geographical clusters. Contrastingly, other subvariants exhibited a geographically restricted distribution, most notably in the Southeast region, which showed a distinct subvariant dynamic. This study supports previous results showing that this region may be a significant entry point and contributed to introducing and evolving novel variants in Mexico. Furthermore, a differential distribution was observed for certain subvariants among specific States through time, which may have contributed to the overall increased diversity observed during this wave compared to the previous ones. This study highlights the importance of sustaining genomic surveillance to identify novel variants that may impact public health.
Keywords: emerging sublineages, genomic surveillance, Mexico, Omicron, SARS-COV-2
Data Summary
All 7022 SARS-CoV-2 sequence data generated in this work have been deposited to the Global Initiative on Sharing All Influenza Data (GISAID, https://www.gisaid.org/) with the accession EPI_SET_230621bg and to GenBank. The accession number and metadata for all these sequences are provided in Table S1, available in the online version of this article, which also contains the corresponding GenBank accession numbers. We acknowledge the other laboratory-generated genomes by providing GISAID acknowledgement tables in Table S2 that include a snapshot of the data provided along a hyperlink to access the contributors of each individual sequence with details such as accession number, virus name, collection date, originating laboratory, submitting laboratory and the list of authors. Epidemiological data of Mexico was obtained through the Secretaría de Salud website (https://www.gob.mx/salud/documentos/datos-abiertos-152127). All supporting data has been provided within the article or through supplementary data files. Two supplementary tables are available with the online version of this article.
Impact Statement.
The genomic surveillance of SARS-CoV-2 has helped identify variants that are of concern to public health and accurately track the viruses’ spread. However, the dynamics of the virus transmission change as the pandemic progresses and population mobility normalizes. This work describes Omicron subvariant circulation in Mexico after the fourth wave and through the end of the fifth wave, in accordance with the fourth wave, which was dominated by the BA.1 Omicron subvariant, and contrary to reports from previous waves, the most abundant subvariants of BA.2, BA.4 and BA.5 exhibited nationwide distribution. However, minor subvariants had a localized circulation and were associated with introductions from a single geographical region; mainly, the southeast had a different dispersion dynamic, suggesting that it remains a point of entry for novel viruses to Mexico, and surveillance efforts will need to focus on this region.
Introduction
Multiple variants of the severe acute respiratory syndrome coronavirus 2 virus (SARS-CoV-2) have been responsible for nearly 700 million cases and over 6.87 million deaths since December 2019 (Our World in Data, https://ourworldindata.org; accessed 6 March 2023). As of the time of writing, 2992 lineages have emerged globally according to the Phylogenetic Assignment of Named Global Outbreak (PANGO) classification (https://cov-lineages.org/lineage_list.html; accessed on 6 March 2023). The World Health Organization (WHO) has designated five of these SARS-CoV-2 viruses as Variants of Concern (VOCs) based on their increased epidemic potential, disease severity and immune escape, which contribute to faster spreading and replacement of prior variants, starting new epidemiological outbreaks, and requiring adjustments of public health protocols [1]. Alpha (B.1.1.7) was the first VOC to be detected [2] in December 2020, followed by Beta (December 2020) [3], Gamma (January 2021) [4], Delta (May 2021) [5] and Omicron (November 2021) [6]. Omicron (B.1.1.529 in the Pango nomenclature) was first reported in South Africa and Botswana in November 2021 [6]. It has 46 point mutations and three deletions (changes of two to three amino acids) compared to the Wuhan reference strain [7]; 30 of these changes are present on the spike protein alone, representing a major evolutionary leap compared with previous VOCs. In this work, a mutation is considered to belong to a variant if at least 90 % of the samples of the variant harbour it.
Immediately after its detection, Omicron was subclassified into variants BA.1, BA.2 and BA.3. These variants share 31 amino acid mutations and three deletions, in addition to their own set of mutations. In winter 2021, the BA.1 variant and its subvariants started a new epidemiological surge worldwide. Afterward, BA.2 superseded them as the predominant variant in the world [8, 9], allowing the epidemic wave of BA.1 to continue in some countries [10, 11], or causing a separate wave in other countries [12, 13], or leading to a small number of cases without an associated epidemic wave in others [14, 15]. The genomic sequences of BA.1 and BA.2 variants (and their subvariants) differ by 23 amino acids [6], resulting in enhanced replication capacity and transmissibility of BA.2 [9, 16, 17]. However, other reports indicated that people infected with this variant had a lower or equivalent death or hospitalization risk compared to BA.1 [18, 19] and a similar drop in the vaccine’s effectiveness, which was significantly lower for all Omicron variants [20].
Eventually, two additional variants, BA.4 and BA.5, emerged from the BA.2 clade [21] and quickly became dominant, driving a new epidemiological wave on a global scale. Compared to the BA.2 spike protein, the BA.4 and BA.5 sequences contain two additional amino acid changes (L452R and F486V), one deletion (69-70del) and one reversion (R493Q). These two variants were shown to have a higher infection rate in vaccinated or previously infected people [16, 22, 23] and a higher risk of hospitalization relative to BA.1 [10, 24].
Despite having a lower mortality rate than previous VOCs, the global spread of Omicron and its variants still poses a major challenge to public health. Genomic surveillance is therefore required to monitor the development and evolution of novel variants, which helps to assess the efficacy of diagnostic methods, the potential of immune escape, and the transmissibility of the virus. As previously described, the BA.1 variant (and its descending subvariants) caused Mexico’s fourth wave of infections [25]. On the other hand, BA.5 and, to a lesser extent, BA.4 were responsible for the fifth wave. In contrast, BA.2 dominated during the inter-wave interval and, in some States, at the start of the fifth wave, before BA.4 and BA.5 became dominant in the country. This study examines Mexico’s introduction, diversity, spread and distribution of these three variants.
Methods
Epidemiological data
Official records of daily cases and deaths were obtained from the open COVID-19 database from the General Epidemiological Directorate (Dirección General de Epidemiologia) (https://www.gob.mx/salud/documentos/datos-abiertos-152127) from 9 September 2022, update. The information was plotted according to the sample collection date using a 7-day moving average sliding window with a 1-day step. Epidemiological data processing and visualizations were created using the R programming language v. 4.2.2.
Study design and ethical consideration
The samples and metadata used in this study are part of the national public health response to COVID-19, collected by the CoViGen-Mex consortium under the Official Mexican Standard NOM 017-SSA2-2012 (17) for the epidemiological surveillance programme. The samples were anonymous prior to the start of the study. In hospitals and clinics of the Mexican Social Security Institute (IMSS), oropharyngeal or nasopharyngeal swab samples were collected from all 32 states of Mexico. All IMSS hospitals are required to conduct an initial SARS-CoV-2 antigen test on 100 % of hospitalized patients and 10 % of outpatients. Then, 100 and 1 % of positive samples, respectively, undergo PCR testing. Between 1 April and 31 August 2022, 10 472 samples with a cycle threshold (Ct) value of 30 or less were collected. From these samples, if available, approximately 1400 were selected at random each month for sequencing, assuming the same percentage per state. A total of 7859(70.1 %) IMSS samples were processed for sequencing. Lastly, 59 samples from the Center for Research in Food and Development (CIAD) and 370 samples from the National Institute of Respiratory Diseases (INER) were included.
High-throughput sequencing and subvariant identification
In total, 8288 SARS-CoV-2 samples were amplified with the Illumina COVIDSeq kit (Illumina, San Diego, CA, USA) using ARTIC 4.1 primers according to the manufacturer’s instructions (https://artic.network). The libraries were sequenced using Illumina NextSeq 550 or Miniseq platforms with 2×150 paired-end runs in differentinstitutions, forming the Consorcio de Vigilancia Genomica de Mexico (CoViGen-Mex). Basecalling was conducted with BCL Convert to generate FASTQ files. Adapters, low-quality bases, and duplicate reads were eliminated using FASTP v1.2 [26]. The remaining reads were mapped with Bowtie2 v2.3.4.3 [27] against the Wuhan-Hu-1 (NC_045512.1) reference genome to obtain consensus sequences using iVar (v1.3.1) [28], with the following parameters: base score Q>20, 20 minimum read coverage for basecalling, and a threshold under the majority rule (0.5 frequency) for single nucleotide polymorphisms (SNP). A total of 7022 sequences were uploaded to the EpiCoV database of the Global Initiative on Sharing All Influenza Data (GISAID) repository (https://www.gisaid.org/), as well as the GenBank database (Table S1)op[]\/i j4fvcsxz, bvfcwd with a mean genome coverage of 97.9 %, and a mean average sequencing depth of 2210.9 (±1001.5). For subvariant identification, the Pangolin software programme was utilized (https://github.com/cov-lineages/pangolin; accessed on 10 May 2022).
SARS-CoV-2 Mexican sequences data set
A set of 9002 additional complete sequences from Mexico were downloaded from the GISAID database (accessed on 19 September 2022), covering the same period from 1 April to 30 August 2022. From this set, 23 Delta-classified genomes were discarded, resulting in 8979 additional genomes (Table S2). The 7022 sequenced genomes reported in this study and generated by the CoViGen-Mex consortium were added to this dataset, tallying 16 001 genomes from Mexico (Tables S1 and S2). The correlation between the frequency of variants in each State between these two datasets was 0.90(±0.5). The States of Aguascalientes, Coahuila, Nuevo León and San Luis Potosí were excluded from the correlation because in Aguascalientes, the consortium had sequenced fewer than ten genomes, and in the other four States fewer than 20 additional genomes were reported by other entities. Nayarit was excluded from all further analyses since it had only ten genomes. Notably, the sampling strategy carried out by other entities is distinct from that of the consortium; for example, some receive their samples from hospitals, some from private diagnostic laboratories, while others actively search for imported cases, among others. Despite the difference in sampling, the high correlation suggests that the genomic diversity observed during this period and described in the article has no sampling bias.
Regional lineage distribution
To assess the differences in lineage distribution, a cluster analysis of the relative variant frequencies in each State was performed using R v.4.0.5 [29]. A State distances matrix was generated using the Vegan v.2.5.7 package [30]. Different parameters were evaluated: (a) dissimilarity distance measure; (b) method in agnes algorithm for assigning nodes to clusters, using the Cluster v.2.1.2 package [31]; (c) the number of optical clusters evaluated from two to eight groups. From the factoextra package [32], ‘silhouette widths’ were generated to evaluate the concept within-cluster cohesion and between-cluster isolation. Small positive or negative silhouette widths suggest poor State-cluster matches, while large positive widths indicate good fits. Results showed that Cao index was the best distance measure; the Ward method had the highest agglomerative coefficient (AC); and four clusters were the optimal, with an AC of 92 % and an average silhouette width of 0.49.
Mutation, evolution and phylogenetic analyses
The genomes were analysed with NextClade CLI 1.9.0 (accessed on 20 November 2022) [33], with the default command-line parameters and relative to the Wuhan-Hu-1 reference genome to identify subvariants-specific mutations, insertions and deletions. For this purpose, NextClade was used to perform sequence alignment, variant calling and clade assignment on SARS-CoV-2 genomes while considering the geographical origin and mutational analysis of the genomes.
The covSPECTRUM logistic regression model was used, with the default parameters, to estimate the relative growth advantage per week of variants compared to co-circulating variants by using the estimated proportion of SARS-CoV-2 sequences with a specific mutation between 1 April and 31 August 2022, in Mexico [7] based on GISAID data. The provided relative growth reflects the advantage of variants spreading primarily through local transmission. In addition, an estimated logistic growth rate (per day) was estimated under the assumptions of intrinsic transmission advantage, immune evasion and a prolonged infectious period [34].
For each BA.2, BA.4 and BA.5 subvariant, an analysis to study their introductions to Mexico and their region of origin was carried out. To this end, the first 150 sequences of each variant detected in Mexico were selected, and geographical redundancy was eliminated to generate a set of 100 sequences that were used to identify closely related sequences with Audacity Instant available at http://www.gisaid.org. The resultant sequences were aligned using mafft v.7 [35] against the reference sequence and then processed with CD-HIT [36] to remove redundancy. A maximum-likelihood phylogenetic tree was inferred using iqtree2 [37] with the substitution model GTR+F+R3. The location of each sequence was set to the continent of origin, except for those from Mexico and the USA, given the large number of people who routinely cross the Mexico–USA border. An ancestral state reconstruction of the location was done using PastML [38], and the resulting trees were visualized in FigTree v1.4.4 (https://github.com/rambaut/figtree/issues/130).
To characterize the evolutionary spread of BA.4 and BA.5 variants circulating in Mexico, an individual subsample of sequences from their most abundant subvariants in the country was used. The sequences were selected to ensure they were representative of all states and months. Then, a subsample of the same subvariants from the rest of the world was included (Table S2). Finally, an alignment and a phylogenetic tree was done in the same way as described previously. The phylogenies were plotted using ggtree in R v.4.0.5 [29], colouring the tips by sampling location. Maps showing subvariants’ geographical distribution were constructed in R using the package mxmaps (https://github.com/diegovalle/mxmaps).
Statistical analysis
All genomic information was expressed as counts and relative frequencies (percent). Using the median with interquartile data (IQR), the central tendency of these non-normally distributed data was described. Groups of states with a greater prevalence of a particular variant were identified as those with a higher interquartile relative frequency, as measured by month. The non-parametric Mann–Whitney U-test was utilized for group comparison. A P-value less than 0.05 was statistically significant (two-tailed). Statistical analyses were performed using the package ggplot in R v.4.0.5.
Results
Epidemiological information
By September 2022, Mexico had traversed five epidemiological waves during the COVID-19 pandemic (Fig. 1a). Its fourth wave (Week [W] 51/December 2021 – W12/March 2022), driven by the BA.1 variant, reported the highest record of the 7-day moving average of daily cases, peaking at 62 134 on 15 January 2023, followed by the fifth wave (W22/Jun – W35/August 2022), which peaked at 30 475 cases on 10 July 2022. In contrast, the inter-wave period between the fourth and fifth waves (spanning W13/April – W21/May) was characterized by the lowest number of daily new confirmed cases in Mexico since the onset of the pandemic. Contrastingly, these two epidemic waves have recorded the lowest number of hospitalizations and deaths as of this date, with peaks of 555 (30 January 2022) and 87 (28 July 2022) daily deaths, respectively (7-day moving average), compared to 798 (17 July 2020), 1432 (22 January 2021) and 812 (12 August 2021) deaths in the preceding three waves. This decline in mortality was probably related to the increase in population immunity acquired either through vaccination or previous infections or a possible decrease in virulence of the circulating variants (Fig. 1a).
Fig. 1.
Epidemic data and SARS-CoV-2 subvariants circulating in Mexico, using a 7-day rolling average on a sliding window with a 1-day step. (a) Average daily COVID-19-positive cases and deaths in Mexico are shown in green and fuchsia, respectively. The years are denoted in the bars below (light grey, 2020; grey, 2021; dark grey, 2022). (b) Relative genome frequency of SARS-CoV-2 subvariants during the fifth epidemic surge and its preceding inter-wave period. Subvariants with <1 % of national abundance are collated as ‘Others’.
Virus genetic diversity and geographical distribution in the fifth wave
The circulation of the SARS-CoV-2 variants in Mexico during the first 2 years of the pandemic has been described previously [25, 39–41]. Fig. 1b depicts the relative frequency of subvariants in Mexico between April and August 2022, which includes the last inter-wave period and the fifth wave. During this time, the BA.2, BA.4 and BA.5 variants (and related subvariants) dominated the infections in the country. A total of 89 distinct SARS-CoV-2 Omicron subvariants were identified (Table S1), but only a few circulated throughout the country.
The emergence of the BA.2 variant occurred at the beginning of the inter-wave period. In some other countries, BA.2 caused a second Omicron wave different from the BA.1 wave [12, 13]. In Mexico, BA.2 spread predominantly between waves (March to May 2022) and was displaced at the beginning of the fifth wave by BA.4 and BA.5. Forty-six different subvariants of BA.2 were identified in Mexico, with a cumulative relative frequency of 24 % in April, 90 % in May, and later descending to 52 % in June (Fig. 1a). The two most widespread BA.2 subvariants were BA.2.9 and BA.2.12.1 (Table 1 and Fig. 1b).
Table 1.
Monthly frequency (%) of Omicron subvariants in Mexico between April and August 2022
|
Variant |
Subvariant |
No. of genome sequences (frequency in %) |
||||
|---|---|---|---|---|---|---|
|
April / 22 |
May / 22 |
June / 22 |
July / 22 |
August / 22 |
||
|
BA.1 |
BA.1 |
271(34.3 %) |
82(4.0 %) |
21(0.4 %) |
4(0.1 %) |
6(0.2 %) |
|
BA.2 |
BA.2.9 |
150(19.0 %) |
463(22.6 %) |
447(9.0 %) |
55(1.1 %) |
9(0.3 %) |
|
BA.2.12.1 |
39(4.9 %) |
679(33.1 %) |
1590(32.0 %) |
333(6.9 %) |
44(1.3 %) |
|
|
BA.2.Others |
320(40.6 %) |
707(34.5 %) |
555(11.2 %) |
111(2.3 %) |
39(1.2 %) |
|
|
BA.4 |
BA.4.1 |
1(0.1 %) |
20(1.0 %) |
226(4.5 %) |
240(5.0 %) |
146(4.3 %) |
|
BA.4.2 |
0(0.0 %) |
0(0.0 %) |
40(0.8 %) |
129(2.7 %) |
52(1.5 %) |
|
|
BA.4.4 |
0(0.0 %) |
0(0.0 %) |
27(0.5 %) |
94(1.9 %) |
39(1.2 %) |
|
|
BA.4.Others |
0(0.0 %) |
19(0.9 %) |
215(4.3 %) |
50(1.0 %) |
36(1.1 %) |
|
|
BA.5 |
BA.5.1 |
1(0.1 %) |
50(2.4 %) |
1169(23.5 %) |
1914(39.5 %) |
1397(41.4 %) |
|
BA.5.1.1 |
0(0.0 %) |
0(0.0 %) |
23(0.5 %) |
230(4.7 %) |
108(3.2 %) |
|
|
BA.5.2 |
0(0.0 %) |
1(0.0 %) |
180(3.6 %) |
526(10.9 %) |
547(16.2 %) |
|
|
BA.5.2.1 |
0(0.0 %) |
5(0.2 %) |
194(3.9 %) |
505(10.4 %) |
510(15.1 %) |
|
|
BF.1 |
0(0.0 %) |
1(0.0 %) |
64(1.3 %) |
147(3.0 %) |
54(1.6 %) |
|
|
BE.1 |
0(0.0 %) |
3(0.1 %) |
43(0.9 %) |
107(2.2 %) |
110(3.3 %) |
|
|
BE.2 |
0(0.0 %) |
0(0.0 %) |
5(0.1 %) |
47(1.0 %) |
32(0.9 %) |
|
|
BA.5.Others |
0(0.0 %) |
11(0.5 %) |
165(3.3 %) |
351(7.2 %) |
242(7.2 %) |
|
In some States, BA.2 subvariants, such as BA.2.12.1 and BA.2.9, drove the rise of the fifth wave (Fig. 2), showing a cumulative relative frequency higher than the interquartile (>67 %) in June, the beginning of the wave. These States are located either in the West, on the Pacific coast (Baja California, Baja California Sur, Colima, Jalisco, Michoacan, Oaxaca and Sinaloa) or in the East, in the Gulf of Mexico (Campeche, Veracruz and Yucatan). The median abundance in June was significantly higher in these States than in the rest median=71 %(95 % CI: 65 %,72 %), vs. median=45 %(95 % CI: 41–55 %), Mann–Whitney U test P<0.00001. Fig. 2a depicts SARS-CoV-2 subvariant circulation from April to August 2022 in these States with a higher abundance of BA.2(10 out of 32 States; see the map to the right of the figure for their location). The BA.2 variant reached a cumulative relative frequency of 83.1 % in June and 32.2 % during the epidemiological peak of July. On the other hand, (Fig. 2b) depicts the rest of the country (States with a lower BA.2 abundance), where BA.2 reached 52.2 and 11.2 % at the beginning and peak of the wave, respectively. In addition, (Fig. 2a) shows that BA.2.9 circulated during the late inter-wave period, while BA.2.12.1 also extended further into the fifth wave.
Fig. 2.
Comparison of SARS-CoV-2 genomic subvariant diversity in States with high and low BA.2 circulation in Mexico during the fifth wave of infections. (a) States where BA.2 was more abundant at the beginning (black arrow) and peak (red arrow) of the wave. The map on the right highlights those States. (b) Remaining States, where BA.2 was not as abundant during the start of the wave and was early displaced by BA.5 (States shown in white on the map).
In May 2022, BA.5 began to expand, overtaking BA.2 by late June as the most abundant variant (Fig. 1b). This replacement coincided with the acceleration of the fifth epidemiological wave (Fig. 1a), with BA.5.1 being the most frequent subvariant. BA.4 emerged around the same time as BA.5, but its national relative frequency remained below 10 % throughout the wave, with BA.4.1 being the most common of its subvariants. In contrast, the BA.5 subvariants reached over 90 % by the end of August 2022 (Table 1). This observed increase is in line with the growth advantage analysis results, which indicated that BA.4 and BA.5 variants have a higher daily growth rate than BA.2, at 0.06(95 % CI: 0.06–0.07) and 0.09(95 % CI: 0.09–0.10), respectively [7]. These values correspond to an estimated relative weekly growth advantage of 57 %(95 % CI: 53–60 %) and 90 %(95 % CI: 86–93 %) for BA.4 and BA.5 over BA.2, respectively [34].
Analysis of the introduction of BA.2, BA.4 and BA.5 subvariants to Mexico
During the fifth wave, no distinct geographic regions (groups of contiguous States) of SARS-CoV-2 virus subvariant circulation were observed (Fig. S1), as was the case during the fourth wave with the variant BA.1 in Mexico [25]. Even though four groups were formed, one of them, in blue, contains most of the States. Another cluster, in red, consists of dispersed states located from north to south. Moreover, the minor or negative values of the silhouette width measure in several states suggested that these states were incorrectly grouped. Interestingly, the lack of geographical clustering was observed for the most abundant subvariants, such as BA.2.12,1, BA.4.1, BA.5.1, BA.5.2 and BA.5.2.1 (Table 1). However, other subvariants with a lesser abundance, such as BA.2.9, BA.4.2, BA.4.4 and BA.5.1.1, showed a distinctive circulation in non-neighbouring states. It is not easy to assess the reasons for the differences in variant spread because numerous stochastic events are possible. One area worth exploring was analysing the relative significance of importation events from different geographical areas versus virus spread within Mexico. To determine the origins of all Mexican major subvariants (clades composed by a minimum of two sequences from Mexico directly descending from another sequence of Mexico), the ancestral states of sequence location were reconstructed as described in Methods. Table 2 lists the number of introductions that resulted in Mexican clades and their most likely origin.
Table 2.
Clades of Mexican sequences (local transmission), along with the percentage of their ancestral nodes in each locality
|
Variant |
Subvariant |
No. clades |
USA |
Europe |
North america |
South America |
Asia |
Ambiguous |
|---|---|---|---|---|---|---|---|---|
|
BA.2 |
BA.2.9 |
10 |
5(50.0 %) |
1(10.0 %) |
0(0.0 %) |
2(20.0 %) |
1(10.0 %) |
1(10.0 %) |
|
BA.2.12.1 |
20 |
19(95.0 %) |
0(0.0 %) |
0(0.0 %) |
0(0.0 %) |
0(0.0 %) |
1(5.0 %) |
|
|
BA.4 |
BA.4.1 |
11 |
3(27.3 %) |
4(36.4 %) |
0(0.0 %) |
1(9.1 %) |
0(0.0 %) |
3(27.3 %) |
|
BA.4.2 |
12 |
11(92.0 %) |
0(0.0 %) |
0(0.0 %) |
0(0.0 %) |
0(0.0 %) |
1(8.0 %) |
|
|
BA.4.4 |
15 |
9(60.0 %) |
0(0.0 %) |
0(0.0 %) |
1(6.7 %) |
0(0.0 %) |
5(33.3 %) |
|
|
BA.5 |
BA.5.1 |
10 |
2(20.0 %) |
3(30.0 %) |
0(0.0 %) |
2(20.0 %) |
1(10.0 %) |
2(20.0 %) |
|
BA.5.1.1 |
16 |
14(88.0 %) |
0(0.0 %) |
0(0.0 %) |
0(0.0 %) |
0(0.0 %) |
2(12.0 %) |
|
|
BA.5.2 |
17 |
1(0.06 %) |
6(35.0 %) |
0(0.0 %) |
0(0.0 %) |
3(18.0 %) |
7(41.0 %) |
|
|
BA.5.2.1 |
12 |
4(33.0 %) |
3(25.0 %) |
0(0.0 %) |
2(17.0 %) |
1(8.0 %) |
2(17.0 %) |
|
|
BF.1 |
10 |
8(80.0 %) |
1(10.0 %) |
0(0.0 %) |
0(0.0 %) |
0(0.0 %) |
1(10.0 %) |
|
|
BE.1 |
11 |
2(18.2 %) |
4(36.4 %) |
1(9.1 %) |
0(0.0 %) |
0(0.0 %) |
4(36.4 %) |
|
|
BE.2 |
11 |
11(100 %) |
0(0.0 %) |
0(0.0 %) |
0(0.0 %) |
0(0.0 %) |
0(0.0 %) |
Some subvariants frequently had their origins in the USA, in agreement with the regular number of people crossing the Mexico–USA border and the influx of tourists from the USA. Other subvariants had their origins in Europe and may have also been imported through tourism, followed by others whose ancestral origins were in South America and Asia. Of note, the analysis could not discern the origin of certain Mexican clades, particularly for BA.4.1, BA.4.4, BA.5.2 and BE.1.
In the case of BA.2 subvariants, BA.2.9 was introduced at least ten times, resulting in Mexican clades that indicate further spread within Mexico (Fig. S2). Half of these introductions originated from the USA, and the rest came from other continents (Fig. S2 and Table 2). In contrast, BA.2.12.1 was mainly imported from the USA (Table 2). As shown in Fig. S3, this subvariant was widespread in the USA, and most sequences sampled with Audacity Instant were from this country.
Regarding BA.4, the most widespread subvariant in the country, BA.4.1, was introduced to Mexico from different continents, mainly from Europe (see Fig. S4). In contrast, BA.4.2 and BA.4.4 were mainly imported from the USA, and in both instances, the largest Mexican sequence clades originated in the USA (Figs S5 and S6).
The origin of the introduction of BA.5 subvariants as BA.5.1 was highly variable, reflecting the worldwide circulation of this subvariant, as indicated by the geographical diversity of the sequences sampled with Audacity Instant (Table 2 and Fig. S7). BA.5.1.1 was predominantly introduced from the USA, despite the detection of most of its sequences in southern Mexico (Fig. S8 and below). On the other hand, the origins of many BA.5.2 introductions could not be determined, but 35 % were from Europe (Fig. S9). The origin of BA.5.2.1 was also diverse, without a single dominant region, as shown in Fig. S10.
BF.1 is particularly interesting since it shows a large clade of Mexican sequences whose geographical origin cannot be established (Fig. S11). Other Mexican clades, predominantly of USA origin, were found despite their detection primarily in the south of Mexico (see Characterization of the BA.4 and BA.5 variants in Mexico below). In contrast, Mexican BE.1 clades originated mainly in Europe (Fig. S12) and circulated primarily in central Mexico. Finally, BE.2 viruses were introduced from the USA, where most subsampled sequences originated (Fig. S13).
Characterization of the BA.4 and BA.5 variants in Mexico
In April 2022, the BA.4 variant and its subvariants were identified in Mexico with a low national relative frequency (less than 1 %) and a maximal abundance in June and July (around 10 %). However, in Coahuila, Durango, Jalisco, Queretaro, Sonora, Veracruz and Zacatecas, the cumulative relative frequency of BA.4 exceeded the interquartile abundance value of 20 % in June and 17 % in July. The median abundance of BA.4 in these States was significantly higher than in the rest of the country median=30 %(95 % CI: 31 %,38 % vs. median=10 %(95 % CI: 9–13 %), Mann–Whitney U test P<0.00001. In these States, BA.5 reached a maximum relative frequency of 70 % in late August (Fig. 3a). In contrast, in States with low BA.4 circulation, BA.5 accounted for 90 % of all identified genomes in the same period (Fig. 3b). The difference is probably due to the early introduction of BA.4 in these states, and the minor advantage of BA.5 over BA.4, which is only a daily growth rate of 0.02(95 % CI: 0.02–0.02 %). In states with limited circulation of BA.4, BA.5 competed with BA.2, and was able to displace it faster. Likewise, in these states with higher BA.4 abundance, at least three of its subvariants circulated at different points (Fig. 3a), whereas in the remaining states only one variant circulated.
Fig. 3.
Comparison of SARS-CoV-2 genomic subvariant diversity in States with high and low BA.4 circulation in Mexico during the fifth wave of infections. (a) Sequenced subvariants in the States where BA.4 relative frequency was >20 %, which are shown in fuchsia in the map on the right. (b) Sequenced subvariants in the remaining States, with a lower abundance of BA.4, which are shown in white on the map.
Six subvariants of BA.4 circulated in the country (Table S1), BA.4.1 being the most common, followed by BA.4.2 and BA.4.4. To determine how the BA.4 variant spread, a phylogenetic reconstruction was carried out (Fig. 4a), using all the sequences from Mexico of these three subvariants and a set of sequences from other countries obtained during the same period. The phylogeny shows a few clusters of Mexican sequences from a single or a few States, suggesting local transmission. These groups are marked with arrows in (Fig. 4a). However, these events were not common enough to overcome a country-wide dispersion dynamic for BA.4.1, which was the most abundant subvariant. In contrast, BA.4.2 and BA.4.4 had more restricted geographical distributions (Fig. 4b, c). The relative frequency of BA.4.2 reached a maximum of 15 % in the states of Jalisco and Zacatecas (Fig. 4b), while BA.4.4 exceeded 22 % in Durango and Coahuila (Fig. 4c). In the rest of the country, the frequency of these two subvariants remained lower than 6 %.
Fig. 4.
BA.4 variant in Mexico during fifth wave. (a) Time-scaled maximum-likelihood phylogenetic reconstruction of the three most abundant BA.4 subvariants in Mexico (BA.4.1, BA.4.2 and BA.4.4). The place of origin is shown in the inner circle, while the subvariants are displayed in the outer circle. (b) Geographical distribution of BA.4.2 and BA.4.4. (c) Longitudinal abundance of subvariants where BA.4.2 and BA.4.4 were abundant. States without BA.4 sequences are shown in grey.
In Mexico, the BA.5 variant was first identified in April 2022. During the fifth wave, 24 subvariants were identified, summing a relative frequency of 0.1 % in April, 3.5 % in May, 37.1 % in June, 79.0 % in July, and 90.0 % in August. By August 2022, its four most abundant subvariants were BA.5.1, BA.5.1.1, BA.5.2 and BA.5.2.1, accounting for 72.7 % of the July reported sequences (Table 1). A phylogenetic reconstruction, abundance maps and stacked density plots were utilized to characterize the longitudinal and regional distribution of BA.5 in Mexico (Fig. 5). Like BA.4, the phylogeny suggests multiple local transmission events, with clades of only Mexican sequences (arrows in Fig. 5a).
Fig. 5.
BA.5 variant in Mexico during fifth wave. (a) Time-scaled maximum-likelihood phylogenetic reconstruction of the seven most abundant BA.5 subvariants (BA.5.1, BA.5.1.1, BA.5.2, BA.5.2.1, BE.1 (alias of B.1.1.529.5.3.1.1), BE.2 (alias of B.1.1.529.5.3.1.2) and BF.1 (alias of B.1.1.529.5.2.1.1). The place of origin is shown in the inner circle, while the subvariants are displayed in the outer circle. Panels (b) through (e) reflect the geographical distribution and longitudinal abundance of a specific subvariant. (b) BA.5.1.1. (c) BE.1. (d) BE.2. (e) BF.1. States without sequences are in grey.
Moreover, BA.5.1, BA.5.2 and BA.5.2.1 subvariants were widespread across the country (Fig. 5a), but other subvariants had a more restricted circulation. For example, BA.5.1.1 circulated predominantly in the southeast, particularly in the States of Chiapas and Quintana Roo, with a maximum relative frequency exceeding 30 % (Fig. 5b), followed by Yucatan (>18 %) and Campeche (>9 %). Furthermore, BE.1 (alias of B.1.1.529.5.3.1.1) circulated mainly in Puebla and Tlaxcala (central Mexico; Fig. 5c), while BE.2 (B.1.1.529.5.3.1.2) in Chihuahua (northern Mexico; Fig. 5d), with peaks reaching more than 18 and 37 %, respectively. On the other hand, BF.1 (B.1.1.529.5.2.1.1) was frequent in the South (Fig. 5c), especially in Tabasco (with >50 %) and Yucatan (>15 %) (Fig. 5e).
Comparative mutation profiles of BA.2, BA.4 and BA.5 subvariants
In the 16 001 Mexican SARS-CoV-2 genomes reported between April and August 2022, 12 119 different nucleotide and 6835 amino acid mutations were identified relative to the Wuhan-Hu-1 reference genome, but only 189 and 121, respectively, had a relative frequency above 1 %. Only amino acid substitutions and deletions with an abundance greater than 25 % for each subvariant were analysed, however the complete list of mutations can be found in Table S3. All genomes from variants BA.2, BA.4 and BA.5 shared 48 amino acid mutations and four deletions of three amino acids each (frequency proportion >0.8), with 27 mutations and one deletion occurring in the spike (S) protein. The relative frequency of these canonical mutations in the Mexican genomes is shown in (Fig. 6a). This observation is consistent with the reference genomes of these variants [7].
Fig. 6.
Mutation profiles of BA.2, BA.4 and BA.5 subvariants in Mexico. (a) Shared mutations in all three variants. (b) Proportions of defining mutations in each subvariant (above 0.8) and other minor mutations identified only in Mexico denotated with the sign *.
BA.2 variant had two additional specific amino acid changes (ORF1a:L3201F and S:Q493R), while BA.4 had three (ORF1a:Del141/143, ORF7b:L11F and N:P151S), and BA.5 had one (M:D3N) (Fig. 6b). BA.2 and BA.4 shared one mutation (ORF5:D61L), whereas BA.4 and BA.5 shared three (S:Del69/70, S:L452R, and S:F496V). In addition, the genomes of BA.2.9, BA.2.12, BA.4.1, BA.4.2, BA.4.4, BA.5.1, BA.5.1.1, BA.5.2, BA.5.2.1, BF.1, BE.1 and BE.2 subvariants contain between one and four additional specific mutations (Fig. 6b). Interestingly, a few additional substitutions with a lower frequency proportion (<0.4 %) were identified in BA.2.9, BA.5.1, BA.5.2.1, BF.1 and BE.1 (* in Fig. 6b), which may have been acquired in Mexico as they were absent from sequences from other countries.
Discussion
The fifth wave of the COVID-19 pandemic in Mexico (June–August 2022) had the second-largest average number of new cases per day, just below the fourth wave (December 2021–March 2022); however, it had the lowest numbers of hospitalizations and deaths. On the other hand, the period between the fourth and fifth waves had the fewest daily cases reported since the start of the pandemic. During this inter-wave time, BA.2 dominated the variant landscape, with BA.2.9 and BA.2.12.1 being the predominant subvariants in the country. The first one was introduced multiple times from different parts of the world, while the latter one was most frequently introduced from the USA, where it reached a relative frequency of 62 % [7]. Like in other countries [14], in Mexico, BA.2 spread was not associated with increased reported cases and circulated predominantly in the inter-wave period. This variant was later superseded by BA.4 and BA.5, which are closely related to BA.2 and share many mutations. Although the reason for the lack of an epidemic peak associated with BA.2 is unknown, probably its introduction to the country shortly after the fourth wave, which resulted in a very high number of infections, together with the application of booster vaccine shots to a large part of the population in the previous months, were contributing factors [25]. Interestingly, BA.2 and its subvariants continued to circulate halfway into the fifth wave in the States of the Pacific coast (Baja California, Baja California Sur, Colima, Jalisco, Michoacan, Oaxaca and Sinaloa) and Gulf of Mexico (Campeche, Veracruz and Yucatan) but were eventually displaced by BA.5 subvariants, probably due to the growth advantage of the latter.
While the first three waves of the pandemic in Mexico showed strong geographical clustering of lineage circulation (specific variants were strongly associated with specific regions) [39, 40], in this study, it was found that the most abundant subvariants in the inter-wave period (BA.2.12.1 and BA.2.9) and fifth wave (BA.4.1, BA.5.1 and BA.5.2) did not exhibit geographical clustering. The reasons for these viral dispersion disparities are complex. For instance, lifting restrictions and sanitary measures in Mexico and around the world may increase mobility, resulting in more introductions of the same variant into the country from different regions of the world and more clusters of local transmission. This may speed the spread of viruses with nearly the same genomic sequence in different parts of the country. This contrasts with a scenario of worldwide mobility restrictions with fewer successful introductions, resulting in viruses spreading mostly through continuous chains of transmission within Mexico as well as locally acquired mutations that cause geographical clustering. For instance, it has been reported that seven introductions resulted in transmission chains for the Alpha variant, whereas Gamma and Delta had two and six, respectively [42]. Although the methodology employed in the present study differs and is focused on each subvariant’s early introduction events, this work identified 10 to 17 Mexican clades in these early trees. Further analysis will be needed to assess how other factors, such as previous immunity and population mobility, affect virus spread as the pandemic progresses, but here we provide data that suggests the number of successful introductions that result in local transmissions play a role in viral dispersion.
On the other hand, some less frequently observed subvariants, such as BA.4.2, BA.4.4, BA.5.1.1, BE.1, BE.2 and BF.1, were more abundant in specific Mexican States for at least 1 month, suggesting restricted local circulation. This is supported by the identification of Mexican subclades in the phylogenies, and their origins were more from one specific region. The States in which these subvariants circulated varied; in the case of BA.4.2 and BA.4.4 was in contiguous States in Mexico’s central regions, whereas BA.5.1.1 BE.1, BE.2 and BF.1 in the southeast or single isolated States. The results of this study support previous ones [25, 39, 40], showing that the southeast region may be a major entry point and contributed to the introduction and evolution of novel variants in Mexico [43].
Interestingly, BA.5 did not exceed 70 % in States where BA.4 circulation was significant, regardless of subvariant, whereas it accounted for over 90 % in those that did not. This observation is most likely due to the early entry of BA.4 in these States and BA.5’s marginal growth rate advantage of 0.02 over BA.4 [7], which was insufficient to overcome BA.4 during the fifth wave. However, compared to BA.2, BA.5 showed a relative growth rate advantage of 0.1 per day [7], allowing it to outgrow this variant. The difference lies in the specific mutations that give BA.5 and BA.4 an edge over the BA.2 variant. The presence of L452R and F486V mutations in the spike (S) protein, which are defined changes in the BA.4 and BA5 variants, has been linked to their ability to outcompete other variants; mutation S:L452R has independently evolved within the Delta and Omicron viruses under convergent evolution, and an alternative related mutation (S:L452Q) is instead present in BA.2.12.1. The transmission advantage of these viruses has been suggested to be attributable to these two spike mutations [44, 45]. In addition, these mutations may increase the receptor affinity of BA.4 and BA.5 subvariants [46], which most probably contributes to their rapid spread. Despite having identical mutations in their spike proteins, BA.5 spread more successfully than BA.4 in Mexico and worldwide. Although most differential mutations remain uncharacterized, mutation ORF6:D61L has been described as an interferon (IFN) antagonist that inhibits mRNA export via its interaction with the nuclear pore complex. This mutation compromises the function of ORF6, thereby preventing the virus’s innate immune escape, which has been suggested to contribute to the difference in the expansion rate of BA.4 and BA.5 [47].
The emergence of novel subvariants of the SARS-CoV-2 virus, such as Omicron, demonstrates the virus’s rapid rate of mutation and emphasizes the necessity for continuous surveillance. These subvariants were found to be more transmissible, infectious, and have a higher immune evasion. Regular and timely sequencing of the virus at local scales and worldwide is still essential to monitor the evolution of SARS-CoV-2 virus in real-time and to determine mitigation strategies to minimize infections and reduce healthcare demand in Mexico. Moreover, understanding the genetic makeup of the virus and its subvariants can provide critical insights into potential vaccine targets and inform the development of next-generation vaccines and treatments.
Supplementary Data
Funding information
This study was partially funded by grants ‘Vigilancia Genómica del Virus SARS- CoV-2 en México-2023’ (PP-F003; to CFA) from the National Council for Science and Technology- México—CONACyT (to CFA), ‘Analysis of potential recombinant SARS-CoV-2 variants circulating in Mexico’ from the AHF Global Public Health Institute (to BT and SZ), and ECTZ184596 from the Agence Nationale de Recherche sur le Sida et les Hépatites Virales (ANRS) (to BR and CFA). This research was also supported by Mexico’s national epidemiological surveillance system. R.G.L. has been awarded a CONACYT postdoctoral fellowship through ProNacEs #I1000/023/2021 and A.G.S.L. (408350).
Acknowledgements
We would like to acknowledge the other authors from the originating laboratories who obtained the samples and the submitting laboratories who sequenced the genomes shared via GISAID, on which this research is based visit 10.55876/gis8.230622ah, 10.55876/gis8.230608wv, and 10.55876/gis8.231002av for a complete list of contributors to each sequence and its details, including the originating and submitting laboratories, and author lists. Jerome Verleyen, Roberto Bahena, Juan Manuel Hurtado, and Verónica Jacinto provided valuable computer assistance. The Dirección General de Cómputo y Tecnologías de la Información (DGTIC-UNAM) project LANCAD-UNAM-DGTIC-350 provided Supercomputing resources in Miztli.
Author contributions
Conceptualization: B.T., S.Z., C.F.A. Data curation: B.T., S.Z., R.G.L., A.G.S.L. Formal analysis: B.T., S.Z. Funding acquisition: B.T., S.Z., B.R., C.F.A. Investigation: J.E.M.M., B.G.G., A.H.E., A.S.F., A.G.S.L., G.M.M., H.D.Z., C.D.M., R.A.H., S.L. Methodology: J.E.M.M., A.G.S.L., B.R., G.M.M., H.D.Z., C.D.M., R.A.H. Project administration: C.F.A. Resources: J.E.M.M., B.G.G., A.H.E., A.S.F., S.L. Software: B.T., S.Z., R.G.L. Supervision: B.T., J.E.M.M., B.G.G., A.H.E., A.S.F., S.L., C.F.A. Validation: R.G.L., B.R. Visualization: B.T., S.Z., R.G.L. Writing – original draft: B.T., S.Z., Writing – review & editing: R.G.L., J.E.M.M., B.G.G., A.H.E., A.S.F., A.G.S.L., B.R., G.M.M., H.D.Z., C.D.M., R.A.H., S.L., C.F.A.
Conflicts of interest
The authors state that they have no conflicts of interest. The funding institutions were not involved in the design of the study, data collection, analysis, interpretation, manuscript writing, or decision to publish the results.
Ethical statement
The samples used in this study, as well as their accompanying metadata, are part of the national Public Health response to COVID-19 gathered by the Mexican Consortium for Genomic Surveillance (CoViGen-Mex) for the national epidemiological surveillance programme under Mexican Official Norm NOM-017-SSA2-2012.
Consent to publish
Informed consent was waived because all samples used in this study were unlinked from any personal identifiers prior to the commencement of the study.
Footnotes
Abbreviations: AC, agglomerative coefficient; CoViGen-Mex, Mexican Consortium for Genomic Surveillance; GISAID, Global Initiative on Sharing All Inlfuenza Data; IMSS, Mexican Social Security Institute; IQR, interquartile range; PANGO, Phylogenetic Assignment of Named Global Outbreak; PCR, Polymearase chain reaction; SARS-CoV-2, Severe Acute Respiratory Syndrome Coronavirus 2; VOC, Variants of Concern; WHO, World Health Organization.
References
- 1.Tracking SARS-CoV-2 variants. World Health Organization Weekly epidemiological update on COVID-19. [ February 22; 2022 ]. https://www.who.int/en/activities/tracking-SARS-CoV-2-variants/ n.d. accessed.
- 2.Volz E, Mishra S, Chand M, Barrett JC, Johnson R, et al. Assessing transmissibility of SARS-CoV-2 lineage B.1.1.7 in England. Nature. 2021;593:266–269. doi: 10.1038/s41586-021-03470-x. [DOI] [PubMed] [Google Scholar]
- 3.Tegally H, Wilkinson E, Giovanetti M, Iranzadeh A, Fonseca V, et al. Detection of a SARS-CoV-2 variant of concern in South Africa. Nature. 2021;592:438–443. doi: 10.1038/s41586-021-03402-9. [DOI] [PubMed] [Google Scholar]
- 4.Faria NR, Mellan TA, Whittaker C, Claro IM, Candido D da S, et al. Genomics and epidemiology of the P.1 SARS-CoV-2 lineage in Manaus, Brazil. Science. 2021;372:815–821. doi: 10.1126/science.abh2644. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Cherian S, Potdar V, Jadhav S, Yadav P, Gupta N, et al. SARS-CoV-2 spike mutations, L452R, T478K, E484Q and P681R, in the second wave of COVID-19 in Maharashtra, India. Microorganisms. 2021;9:1542. doi: 10.3390/microorganisms9071542. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Viana R, Moyo S, Amoako DG, Tegally H, Scheepers C, et al. Rapid epidemic expansion of the SARS-CoV-2 omicron variant in southern Africa. Nature. 2022;603:679–686. doi: 10.1038/s41586-022-04411-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Chen C, Nadeau S, Yared M, Voinov P, Xie N, et al. CoV-Spectrum: analysis of globally shared SARS-CoV-2 data to identify and characterize new variants. Bioinformatics. 2022;38:1735–1737. doi: 10.1093/bioinformatics/btab856. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Ito K, Piantham C, Nishiura H. Estimating relative generation times and reproduction numbers of omicron BA.1 and BA.2 with respect to Delta variant in Denmark. Math Biosci Eng. 2022;19:9005–9017. doi: 10.3934/mbe.2022418. [DOI] [PubMed] [Google Scholar]
- 9.Yamasoba D, Kimura I, Nasser H, Morioka Y, Nao N, et al. Virological characteristics of the SARS-CoV-2 omicron BA.2 spike. Cell. 2022;185:2103–2115. doi: 10.1016/j.cell.2022.04.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Lustig A, Vattiato G, Maclaren O, Watson LM, Datta S, et al. Modelling the impact of the omicron BA.5 subvariant in New Zealand. J R Soc Interface. 2023;20:20220698. doi: 10.1098/rsif.2022.0698. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Jassat W, Abdool Karim SS, Ozougwu L, Welch R, Mudara C, et al. Trends in cases, hospitalization and mortality related to the omicron BA.4/BA.5 sub-variants in South Africa. medRxiv. 2022 doi: 10.1101/2022.08.24.22279197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.de Menezes MT, Moreira FRR, Whittaker C, Santos FM, Queiroz DC, et al. Dynamics of early establishment of SARS-CoV-2 VOC omicron lineages in Minas Gerais, Brazil. Viruses. 2023;15:585. doi: 10.3390/v15020585. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Elliott P, Eales O, Steyn N, et al. Twin peaks: the omicron SARS-Cov-2 BA.1 and BA.2 epidemics in England. Science. 1979 doi: 10.1126/SCIENCE.ABQ4411/SUPPL_FILE/SCIENCE.ABQ4411_MDAR_REPRODUCIBILITY_CHECKLIST.PDF. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Davies M-A, Morden E, Rousseau P, Arendse J, Bam J-L, et al. Outcomes of laboratory-confirmed SARS-CoV-2 infection during resurgence driven by omicron lineages BA.4 and BA.5 compared with previous waves in the western cape province, South Africa. Int J Infect Dis. 2023;127:63–68. doi: 10.1016/j.ijid.2022.11.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Torres C, Nabaes Jodar M, Acuña D, Montaño RMZ, Culasso ACA, et al. Omicron waves in Argentina: dynamics of SARS-CoV-2 lineages BA.1, BA.2 and the emerging BA.2.12.1 and BA.4/BA.5. Viruses. 2023;15:312. doi: 10.3390/v15020312. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Lyngse FP, Kirkeby CT, Denwood M, Christiansen LE, Mølbak K, et al. Household transmission of SARS-CoV-2 omicron variant of concern subvariants BA.1 and BA.2 in Denmark. Nat Commun. 2022;13:1–7. doi: 10.1038/s41467-022-33498-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Kimura I, Yamasoba D, Tamura T, Nao N, Suzuki T, et al. Virological characteristics of the SARS-CoV-2 omicron BA.2 subvariants, including BA.4 and BA.5. Cell. 2022;185:3992–4007. doi: 10.1016/j.cell.2022.09.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Webster HH, Nyberg T, Sinnathamby MA, Aziz NA, Ferguson N, et al. Hospitalisation and mortality risk of SARS-COV-2 variant omicron sub-lineage BA.2 compared to BA.1 in England. Nat Commun. 2022;13:1–6. doi: 10.1038/s41467-022-33740-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Strasser ZH, Greifer N, Hadavand A, Murphy SN, Estiri H. Estimates of SARS-CoV-2 omicron BA.2 subvariant severity in New England. JAMA Netw Open. 2022;5:e2238354. doi: 10.1001/jamanetworkopen.2022.38354. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Kirsebom FCM, Andrews N, Stowe J, Toffa S, Sachdeva R, et al. COVID-19 vaccine effectiveness against the omicron (BA.2) variant in England. Lancet Infect Dis. 2022;22:931–933. doi: 10.1016/S1473-3099(22)00309-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Tegally H, Moir M, Everatt J, Giovanetti M, Scheepers C, et al. Emergence of SARS-CoV-2 omicron lineages BA.4 and BA.5 in South Africa. Nat Med. 2022;28:1785–1790. doi: 10.1038/s41591-022-01911-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Khan K, Karim F, Ganga Y, Bernstein M, Jule Z, et al. Omicron BA.4/BA.5 escape neutralizing immunity elicited by BA.1 infection. Nat Commun. 2022;13:1–7. doi: 10.1038/s41467-022-33371-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Andeweg SP, de Gier B, Vennema H, van Walle I, van Maarseveen N, et al. Higher risk of SARS-CoV-2 Omicron BA.4/5 infection than of BA.2 infection after previous BA.1 infection, the Netherlands, 2 May to 24 July 2022. Euro Surveill. 2023;28:2200724. doi: 10.2807/1560-7917.ES.2023.28.7.2200724. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Russell SL, Klaver BRA, Harrigan SP, Kamelian K, Tyson J, et al. Clinical severity of omicron subvariants BA.1, BA.2, and BA.5 in a population-based cohort study in British Columbia, Canada. J Med Virol. 2023;95:e28423. doi: 10.1002/jmv.28423. [DOI] [PubMed] [Google Scholar]
- 25.Zárate S, Taboada B, Rosales-Rivera M, García-López R, Muñoz-Medina JE, et al. Omicron-BA.1 dispersion rates in Mexico varied according to the regional epidemic patterns and the diversity of local Delta subvariants. Viruses. 2023;15:243. doi: 10.3390/v15010243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Chen S, Zhou Y, Chen Y, Gu J. Bioinformatics. Vol. 34. Oxford University Press; 2018. Fastp: an ultra-fast all-in-one FASTQ preprocessor; pp. i884–i890. vol. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Grubaugh ND, Gangavarapu K, Quick J, Matteson NL, De Jesus JG, et al. An amplicon-based sequencing framework for accurately measuring intrahost virus diversity using PrimalSeq and iVar. Genome Biol. 2019;20:1–19. doi: 10.1186/s13059-018-1618-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Yu G, Smith DK, Zhu H, Guan Y, Lam TTY. Ggtree: an R package for visualization and annotation of Phylogenetic trees with their covariates and other associated data. Methods Ecol Evol. 2017;8:28–36. doi: 10.1111/2041-210X.12628. [DOI] [Google Scholar]
- 30.Oksanen J, Blanchet F, Kindt R. vegan: Community Ecology Package. R package version. [ February 24; 2016 ]. https://scholar.google.com/scholar?cluster=3119008020093914981&hl=en&oi=scholarr#0 n.d. accessed.
- 31.R-packages - Cluster. [ Oct 2; 2023 ]. https://svn.r-project.org/R-packages/trunk/cluster/ n.d. accessed.
- 32.Parvej MS, Nakamura H, Alam MA, et al. Factoextra: extract and visualize the results of multivariate data analyses. R Package Version. 2016;1 doi: 10.1128/AEM.02796-18. [DOI] [Google Scholar]
- 33.Aksamentov I, Roemer C, Hodcroft EB, Neher RA. Nextclade: clade assignment, mutation calling and quality control for viral genomes. JOSS. 2021;6:3773. doi: 10.21105/joss.03773. [DOI] [Google Scholar]
- 34.Althaus CL, Baggio S, Reichmuth ML, Hodcroft EB, Riou J, et al. A tale of two variants: spread of SARS-CoV-2 variants Alpha in Geneva, Switzerland, and Beta in South Africa. Epidemiology. 2021 doi: 10.1101/2021.06.10.21258468. [DOI] [Google Scholar]
- 35.Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Fu L, Niu B, Zhu Z, Wu S, Li W. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics. 2012;28:3150–3152. doi: 10.1093/bioinformatics/bts565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, et al. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol Biol Evol. 2020;37:1530–1534. doi: 10.1093/molbev/msaa131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Ishikawa SA, Zhukova A, Iwasaki W, Gascuel O. A fast likelihood method to reconstruct and visualize ancestral scenarios. Mol Biol Evol. 2019;36:2069–2085. doi: 10.1093/molbev/msz131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Taboada B, Zárate S, Iša P, Boukadida C, Vazquez-Perez JA, et al. Genetic analysis of SARS-CoV-2 variants in Mexico during the first year of the COVID-19 pandemic. Viruses. 2021;13:2161. doi: 10.3390/v13112161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Taboada B, Zárate S, García-López R, Muñoz-Medina JE, Sanchez-Flores A, et al. Dominance of three sublineages of the SARS-CoV-2 Delta variant in Mexico. Viruses. 2022;14:1165. doi: 10.3390/v14061165. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Zárate S, Taboada B, Muñoz-Medina JE, Iša P, Sanchez-Flores A, et al. The Alpha variant (B.1.1.7) of SARS-CoV-2 failed to become dominant in Mexico. Microbiol Spectr. 2022;10:e0224021. doi: 10.1128/spectrum.02240-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Castelán-Sánchez HG, Delaye L, Inward RPD, Dellicour S, Gutierrez B, et al. Comparing the evolutionary dynamics of predominant SARS-CoV-2 virus lineages co-circulating in Mexico. Elife. 2023;12:e82069. doi: 10.7554/eLife.82069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.García-López R, Rivera-Gutiérrez X, Rosales-Rivera M, Zárate S, Muñoz-Medina JE, et al. SARS-CoV-2 BW lineage, a fast-growing omicron variant from southeast Mexico bearing relevant escape mutations. Infection. 2023;51:1549–1555. doi: 10.1007/s15010-023-02034-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Cao Y, Yisimayi A, Jian F, Song W, Xiao T, et al. BA.2.12.1, BA.4 and BA.5 escape antibodies elicited by omicron infection. Nature. 2022;608:593–602. doi: 10.1038/s41586-022-04980-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Hachmann NP, Miller J, Collier A-R, Barouch DH. Neutralization escape by SARS-CoV-2 omicron subvariant BA.4.6. N Engl J Med. 2022;387:86–88. doi: 10.1056/NEJMc2212117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Wang Q, Guo Y, Iketani S, Nair MS, Li Z, et al. Antibody evasion by SARS-CoV-2 omicron subvariants BA.2.12.1, BA.4 and BA.5. Nature. 2022;608:603–608. doi: 10.1038/s41586-022-05053-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Kehrer T, Cupic A, Ye C, Yildiz S, Bouhhadou M, et al. Impact of SARS-CoV-2 ORF6 and its variant polymorphisms on host responses and viral pathogenesis. bioRxiv. 2022:2022.10.18.512708. doi: 10.1101/2022.10.18.512708. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.






