Abstract
Genomic surveillance of SARS-CoV-2 provides genomic information on circulating variants. Since the recent emergence of the Omicron (B.1.1.529) variant, it has provided data on this lineage’s genomic and epidemiological characteristics. However, the arrival and genomic diversity of this variant in South America are scarcely known. Therefore, this study determined the genomic diversity and phylogenetic relationships of 21,615 Omicron genomes available in public databases. We found that in South America, BA.1 (n = 15,449, 71%) and BA.1.1 (n = 6257, 29%) are the dominant sublineages, with several mutations that favor transmission and antibody evasion. In addition, the estimated arrival of these lineages on the continent in late September 2021 points to cryptic transmission. This event may have contributed to the dispersal of Omicron sublineages and the acquisition of new mutations. Considering the genomic and epidemiological characteristics of these lineages, especially those with a high number of mutations in their genome, it is important to conduct studies and surveillance on their dynamics to identify the mechanisms of mutation acquisition and their impact on public health.
Keywords: SARS-CoV-2, Omicron sublineages, South America, nucleotide diversity, phylogenomic analysis
1. Introduction
Genomic surveillance is one of the tools that provide real-time information on the circulating variants of SARS-CoV-2. This information allows understanding of its genomic diversity, dispersal, and transmission mechanisms [1]. Additionally, together with public data, it allows for characterizing and dating different lineages that impact public health and/or can modulate the spread of the virus. Particularly with variants of concern (VOCs), which have higher transmissibility and virulence rates [1,2], genomic surveillance has provided insight into the genomic characteristics and global diversity of these variants [3,4].
Recently, a new emerging lineage (B.1.1.529) has spread globally, causing an increase in the number of COVID-19 cases. This VOC, known as Omicron, harbors a high number of mutations that favor immune evasion, increased transmissibility and decreased vaccine effectiveness in preventing infection [5,6]. Moreover, this variant has shown several dispersal and global transmission events [7] associated with the different Omicron sublineages (BA.1, BA.1.1, BA.2 and BA.3). Compared to the sublineages of other VOCs (e.g., Delta), these have unique genomic characteristics that have generated a public health impact [7]. Therefore, surveillance and genomic studies have focused on analyzing the characteristics that have enabled Omicron to cause such a significant public health burden.
Despite the vast number of studies about the surveillance of Omicron in some regions around the world, the genomic diversity and introduction date of this variant into South American countries remain unknown. Therefore, this study aimed to characterize the diversity of Omicron sublineages circulating in South America using comparative genomics of public genomes available for the region.
2. Materials and Methods
We analyzed 21,615 high-quality, complete Omicron genomes from South America available in the GISAID database until 28 February 2022 (Table S1). These genomes were analyzed according to the schemes of Muñoz et al. (2021) and Ramírez et al. (2020) [4,8]. These schemes consisted of SARS-CoV-2 lineage typing using the PANGOLIN-v1.9 tool (https://github.com/cov-lineages/pangolin (accessed on 10 May 2022)), and alignment and phylogenetic analysis using the Nextclade tool v1.5.4 with the default command line (https://docs.nextstrain.org/projects/nextclade/en/stable/user/nextclade-cli.html (accessed on 10 May 2022)), which performs sequence alignment, variant calling, clade assignment and maximum-likelihood (ML) tree placement. The schemes also include genome georeferencing and mutational analysis of the SARS-CoV-2 coding regions.
Subsequently, we estimated the potential introduction date and dispersion dynamics of Omicron into the continent using TreeTime software [9], with a fixed clock rate of 0.8 × 10⁻³ (SD = 0.4 × 10⁻³) substitutions per site per year [10], a strict clock (SC) under a coalescent skyline tree prior, and a rerooting step that minimizes the residuals of the root-to-tip regression. For this analysis, we used the alignment and ML tree from Nextclade, a dataset of 2365 reference genomes belonging to other SARS-CoV-2 lineages (Table S1) available from the phylogenomic dataset of auspice.us (https://auspice.us/ (accessed on 10 May 2022)), and the collection of genomes analyzed here to obtain an ML time-scaled phylogeny. Six iterations were run during the TreeTime analysis, and the marginal date estimates of ancestral states were inferred with 95% confidence intervals (95% CI).
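The rooting and clock assumptions described above can be illustrated with a simple root-to-tip regression. The following is a minimal sketch in Python, not the TreeTime implementation: it fits divergence against sampling date by ordinary least squares, reads the slope as the clock rate and the x-intercept as the root date. The tip dates, noise values and root date are hypothetical toy values; only the rate of 0.8 × 10⁻³ is taken from the text.

```python
# Toy root-to-tip regression: divergence (subs/site) vs. sampling date (decimal year).
# All tip values below are hypothetical; only CLOCK_RATE comes from the Methods.

CLOCK_RATE = 0.8e-3   # substitutions per site per year (fixed rate used in the study)
ROOT_DATE = 2021.74   # hypothetical root date, used only to simulate the toy tips


def root_to_tip_regression(dates, distances):
    """Least-squares fit of distance = rate * date + intercept."""
    n = len(dates)
    mean_t = sum(dates) / n
    mean_d = sum(distances) / n
    cov = sum((t - mean_t) * (d - mean_d) for t, d in zip(dates, distances))
    var = sum((t - mean_t) ** 2 for t in dates)
    rate = cov / var
    intercept = mean_d - rate * mean_t
    root_date = -intercept / rate  # date at which the fitted divergence is zero
    residuals = [d - (rate * t + intercept) for t, d in zip(dates, distances)]
    return rate, root_date, residuals


# Simulated tips: expected divergence under the clock plus small hypothetical noise.
tip_dates = [2021.90, 2021.95, 2022.00, 2022.05, 2022.10, 2022.15]
noise = [1e-6, -2e-6, 1e-6, 2e-6, -3e-6, 1e-6]
tip_dists = [CLOCK_RATE * (t - ROOT_DATE) + e for t, e in zip(tip_dates, noise)]

rate, root_date, residuals = root_to_tip_regression(tip_dates, tip_dists)
print(f"estimated rate: {rate:.2e} subs/site/year")
print(f"estimated root date: {root_date:.2f}")
```

In a real analysis the root position itself is chosen so that these residuals are as small as possible, and the rate is either estimated or, as in this study, fixed in advance.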
3. Results
We analyzed 21,615 Omicron genome assemblies from 14 South American countries (Table S1). These genomes corresponded to three main Omicron sublineages (BA.1, BA.1.1 and BA.2), with abundances that varied across South American countries (Figure 1a). BA.1 was significantly predominant in four countries: Brazil (n = 11,618; 85.6%), Colombia (n = 452; 56.6%), Paraguay (n = 80; 98.8%) and Peru (n = 1033; 58.7%) (Table S2 and Figure 1b). Meanwhile, BA.1.1 was significantly predominant in five countries: Argentina (n = 1034; 70.1%), Chile (n = 1448; 55.2%), Ecuador (n = 268; 57.4%), French Guiana (n = 242; 81.8%), and Trinidad and Tobago (n = 37; 69.8%) (Table S2 and Figure 1b). Few BA.2 genomes were reported on the continent; hence, the abundance of this sublineage was not significant (Table S2). We also found variation among countries in the proportions of Omicron reports by lineage (Figure 1b); BA.1 and BA.1.1 were first reported in late November 2021 in Brazil and Chile, while BA.2 was first reported in early January 2022 in Brazil.
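The per-country abundances above are proportions of lineage counts within each country. As a minimal sketch, assuming a metadata table reduced to (country, lineage) pairs, the tally could be computed as follows. The records are hypothetical toy data, not the contents of Table S1 or Table S2, and the function names are illustrative.

```python
from collections import Counter, defaultdict

# Hypothetical toy records (country, Pango lineage); not the study's data.
records = [
    ("Brazil", "BA.1"), ("Brazil", "BA.1"), ("Brazil", "BA.1.1"), ("Brazil", "BA.2"),
    ("Chile", "BA.1.1"), ("Chile", "BA.1.1"), ("Chile", "BA.1"),
]


def lineage_proportions(rows):
    """Return {country: {lineage: (count, percent)}} from (country, lineage) pairs."""
    counts = defaultdict(Counter)
    for country, lineage in rows:
        counts[country][lineage] += 1
    summary = {}
    for country, tally in counts.items():
        total = sum(tally.values())
        summary[country] = {
            lineage: (n, round(100 * n / total, 1)) for lineage, n in tally.items()
        }
    return summary


def dominant_lineage(summary, country):
    """Lineage with the highest genome count in the given country."""
    return max(summary[country].items(), key=lambda kv: kv[1][0])[0]


summary = lineage_proportions(records)
for country in summary:
    print(country, dominant_lineage(summary, country), summary[country])
```

Percentages computed this way depend on how many genomes each country has deposited, so small totals give unstable proportions.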
The mutational analysis showed fifty-five non-synonymous amino acid substitutions (Figure 2a). Forty-one of these substitutions (75%) were shared, with seven present in at least eleven countries and two substitutions in six countries. Additionally, we identified four unique substitutions in French Guiana and one in 15% of the genomes from Brazil (Figure 2a). When analyzing the different Omicron sublineages, we observed twenty-five substitutions shared among sublineages (Figure 2b), most of them in the Spike gene. Furthermore, twenty-two of these shared substitutions were found between BA.1 and BA.1.1, with sixteen (72%) located in the Spike. Finally, we found thirty-one unique non-synonymous substitutions, twenty-seven of them (87%) identified only in BA.2, mostly found in the Spike (Figure 2b).
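The shared and unique substitutions described above amount to set intersections and differences across sublineages. The sketch below shows the idea on small illustrative subsets of Spike substitutions; these subsets are chosen for the example and are not the study's full mutation lists, and the helper names are hypothetical. The `percent` helper simply reproduces the arithmetic behind the proportions quoted in the text.

```python
# Illustrative subsets of Spike substitutions per sublineage (not the study's full lists).
substitutions = {
    "BA.1": {"S373P", "G446S", "S477N", "N679K", "L981F"},
    "BA.1.1": {"R346K", "S373P", "G446S", "S477N", "N679K", "L981F"},
    "BA.2": {"S373P", "T376A", "D405N", "R408S", "S477N", "N679K"},
}


def shared_by_all(groups):
    """Substitutions present in every sublineage."""
    return set.intersection(*groups.values())


def unique_to(groups, name):
    """Substitutions found in one sublineage and in no other."""
    others = set().union(*(s for k, s in groups.items() if k != name))
    return groups[name] - others


def percent(part, whole):
    """Percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


print("shared by all:", sorted(shared_by_all(substitutions)))
print("unique to BA.2:", sorted(unique_to(substitutions, "BA.2")))
print("27 of 31 unique substitutions:", percent(27, 31), "%")
```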
The Omicron genomes analyzed in this study grouped into five main monophyletic clusters, labeled C1–C5 (Figure 3), plus a minor divergent cluster. The minor divergent cluster included 48 genomes, mostly reference genomes (n = 17; 35.4%) and genomes from Brazil (n = 14; 29.2%) belonging to BA.2 (Table S3). This was the only cluster that included the analyzed genomes of this sublineage (Table S3). Clusters C1 and C3 consisted predominantly of genomes from Brazil (96.2% and 86.4%, respectively), with a minimal proportion from other South American countries. The genomes of these clusters were predominantly BA.1, with an abundance of more than 50% (Table S3). In clusters C2, C4 and C5, Brazil still contributed the most genomes, but the distribution was more homogeneous, with an increased abundance from countries such as Chile (C2 and C5), Peru (C4) and Argentina (C5). These clusters were mainly composed of sublineages BA.1.15 (n = 1139; 67.9%), BA.1 (n = 2244; 98%) and BA.1.1 (n = 5500; 78.7%), respectively (Table S3). Although these clusters had a higher abundance of BA.1 and BA.1.1, they also contained genomes from lineages divergent from these sublineages (Table S3).
The time-scaled phylogeny obtained from TreeTime provided the introduction date, based on the node date assigned to the most recent common ancestor (MRCA) (Figure S1). Overall, the estimated arrival date of the Omicron variant in South America was the end of September 2021 (Table S4 and Figure S1), and the introduction of the pruned clades containing C1 sequences was dated to 29 September 2021 (95% CI = 11 September 2021 to 13 October 2021). The putative introduction dates of the clusters circulating in South America are described in Supplementary Table S5. These findings suggest that the Omicron variant was first circulating in Brazil and subsequently spread to other South American countries such as Chile, Peru and Argentina by mid-October and November 2021.
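Node dates in time-scaled phylogenies are often handled as decimal years, so converting between decimal years and calendar dates is a routine step when reading estimates such as the one above. The following sketch shows one way to do the conversion and to compute the width of the reported 95% CI. The function names are hypothetical, and the conversion convention (fraction of the calendar year elapsed) is an assumption that may differ slightly from the one used by a given tool.

```python
from datetime import date, timedelta


def to_decimal_year(d):
    """Calendar date -> decimal year, as the elapsed fraction of that year."""
    start = date(d.year, 1, 1)
    end = date(d.year + 1, 1, 1)
    return d.year + (d - start).days / (end - start).days


def from_decimal_year(x):
    """Decimal year -> calendar date, rounded to the nearest day."""
    year = int(x)
    start = date(year, 1, 1)
    end = date(year + 1, 1, 1)
    return start + timedelta(days=round((x - year) * (end - start).days))


# Dates reported in the text for the clades containing C1 sequences.
point = date(2021, 9, 29)
ci_low, ci_high = date(2021, 9, 11), date(2021, 10, 13)

print("point estimate as decimal year:", round(to_decimal_year(point), 4))
print("95% CI width in days:", (ci_high - ci_low).days)
```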
4. Discussion
Genomic monitoring of SARS-CoV-2 variants in different regions of the world allows understanding of the dynamics and genetic mechanisms of each variant [1], especially for Omicron, which has several mechanisms that promote increased transmission and immune escape [7,11]. Furthermore, this variant has an estimated transmission rate between 5 and 8 [12], which means that Omicron is transmitted rapidly, leading to an increase in the number of associated cases. In South America, we found that BA.1, BA.1.1 and BA.2 are circulating across the continent, and the first two are predominant in particular countries (Figure 1b). These lineages are characterized by a high number of mutations in the genome that facilitate rapid transmission and evasion of the immune system [5]; most are in the Spike gene [11]. The analyzed genomes of these sublineages carry various mutations of epidemiological interest (S373P, G446S, S477N, N679K and L981F) (Figure 2b) that are involved in transmission and antibody escape [13]. This would imply an impact on the transmission dynamics of the virus on the continent. Despite the dramatic increase in reported cases under vaccination schedules, there has been no increase in the number of deaths or hospitalizations [14]. Thus, future studies should focus on these dynamics, considering the vaccination programs and genomic characteristics of Omicron in each country.
As for BA.2, it is interesting to note the low number of genomes reported so far, its circulation in only a few countries on the continent (Table S1 and Figure 1b) and its placement in the minor divergent cluster (Figure 3). Unlike BA.1 and BA.1.1, this lineage has a greater number of unique mutations, mainly in the Spike gene, promoting more effective immune evasion and transmissibility [7,8,9,10,11,12,13]. Moreover, this number of unique mutations might contribute to the divergent clustering of this lineage, which differs from the other clusters that contain the BA.1 and BA.1.1 genomes. Although circulation of the BA.3 lineage, which has genomic characteristics similar to those of BA.1 and BA.1.1 [7], has not been reported to date, it might circulate on the continent in the future and have an impact similar to that of BA.1 and BA.1.1. Considering the characteristics and public health impact of the Omicron sublineages, especially BA.2, these lineages might continue to circulate and disperse on the continent. Therefore, genomic and epidemiological studies are needed to monitor their dynamics in the region.
The emergence of Omicron in South America might be explained by dispersal pathways, such as international flights, that facilitate the transmission and circulation of SARS-CoV-2 between countries [2,15] and hence the arrival of new variants. In addition, the characteristics of Omicron may have facilitated its rapid spread across the continent [16]. The introduction seems to have occurred in Brazil at the end of September 2021. However, this finding contrasts with the estimate reported in Nextstrain (7 September 2021 in Guyana (CI = 8 June 2021 to 16 October 2021)) (https://nextstrain.org/ncov/gisaid/south-america/all-time (accessed on 10 May 2022)) and predates the first case reported on the continent (24 November 2021) (Table S5) [17]. Following its arrival on the continent, Omicron spread to other countries between October and December (Tables S4 and S5) while acquiring mutations considered unique to each geographical location at the amino acid level (Figure 2a). Therefore, considering the genomic characteristics and the recent reports of BA.2, this sublineage might have emerged in December 2021.
Although we obtained information on the relationships and evolutionary times of the genomes analyzed from the continent, the dataset includes countries with few high-quality genomes available until the end of February 2022. Therefore, the variability in the number of genomes available in each country might affect the phylogenetic results, especially when analyzing dispersal events between countries and comparing them with real-time data from other genomic analysis programs. Nevertheless, the available data provide relevant information on the emergence and possible clustering of sublineages in South America. In the future, unique and representative genomes from each geographic area will be needed to analyze dispersal dates and Omicron assemblages in each country.
The genomic landscape of Omicron in South America provides mutational and phylogenetic information about circulating lineages. Previous studies have reported similar results with sublineages of other VOCs in the region, where the circulation of these sublineages in diverse regions of the continent has favored the acquisition of different genomic characteristics, which might affect molecular diagnosis and favor the emergence of new lineages [3,4]. Furthermore, genetic variation, especially at the amino acid level, might have an impact on public health [3]. Herein, we found unique and shared mutations by sublineage and country (Figure 2), most of which are in the Spike gene. This might be related to vaccination rates per country, promoting the acquisition of new mutations [18]. However, future studies are needed to determine the impact of vaccination schedules on the genomic structure of Omicron.
In conclusion, South America is mainly dominated by the BA.1 and BA.1.1 lineages, which emerged from Brazil at the end of September 2021. Furthermore, at the genomic level, these sublineages present unique and/or shared mutations, which might affect molecular diagnosis, transmissibility and even favor the emergence of new sublineages. Therefore, future studies and surveillance should focus on the current genomic dynamics of Omicron sublineages and their potential impact on public health in South American countries.
Acknowledgments
We thank the investigators worldwide for their SARS-CoV-2 genome sequencing efforts. We acknowledge the use of GISAID-deposited genomes, following their policy, in the Supplementary Materials.
Supplementary Materials
The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/v14061234/s1, Table S1: Metadata of Omicron lineages (BA.1, BA.1.1 and BA.2) genomes from South America analyzed in this work: Omicron genomes from South America (n = 21,615) and SARS-CoV-2 lineages reference genomes (n = 2365); Table S2: Number of Omicron lineage genomes reported for each country; Table S3: Number of genomes of the Omicron sublineages in each cluster obtained from the phylogenetic analysis; Table S4: Dates of the introduction of Omicron’s circulating clusters to the South American continent; Table S5: Dates of the first Omicron genome sequenced for each South American country analyzed; Figure S1: Maximum-likelihood (ML) time-scaled phylogeny. This figure highlights with a diamond the approximate date of Omicron’s emergence in South America (29 September 2021).
Author Contributions
Conceptualization, J.D.R., M.M. and N.L.; methodology, M.M., A.L.R., L.H.P., N.B., S.A.C. and N.L.; data curation, A.L.R. and N.L.; formal analysis, M.M., A.L.R., L.H.P., N.B., S.A.C. and N.L.; Supervision, J.D.R.; Project administration, J.D.R.; writing—original draft preparation, N.L.; writing—review and editing, J.D.R., M.M. and N.L. All authors have read and agreed to the published version of the manuscript.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Conflicts of Interest
The authors declare no conflict of interest.
Funding Statement
This work was supported by Dirección de Investigación e Innovación from Universidad del Rosario. This project was funded by the Universidad del Rosario in the framework of its strategic plan RUTA2025. Thanks to the President and the University council for leading the strategic projects (JDR).
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1. Oude Munnink B.B., Worp N., Nieuwenhuijse D.F., Sikkema R.S., Haagmans B., Fouchier R.A.M., Koopmans M. The next Phase of SARS-CoV-2 Surveillance: Real-Time Molecular Epidemiology. Nat. Med. 2021;27:1518–1524. doi: 10.1038/s41591-021-01472-w.
- 2. Li J., Lai S., Gao G.F., Shi W. The Emergence, Genomic Diversity and Global Spread of SARS-CoV-2. Nature. 2021;600:408–418. doi: 10.1038/s41586-021-04188-6.
- 3. Perbolianachis P., Ferla D., Arce R., Ferreiro I., Costábile A., Paz M., Simón D., Moreno P., Cristina J. Phylogenetic Analysis of SARS-CoV-2 Viruses Circulating in the South American Region: Genetic Relations and Vaccine Strain Match. Virus Res. 2022;311:198688. doi: 10.1016/j.virusres.2022.198688.
- 4. Muñoz M., Patiño L.H., Ballesteros N., Paniz-Mondolfi A., Ramírez J.D. Characterizing SARS-CoV-2 Genome Diversity Circulating in South American Countries: Signatures of Potentially Emergent Lineages? Int. J. Infect. Dis. 2021;105:329–332. doi: 10.1016/j.ijid.2021.02.073.
- 5. He X., Hong W., Pan X., Lu G., Wei X. SARS-CoV-2 Omicron Variant: Characteristics and Prevention. MedComm. 2021;2:838–845. doi: 10.1002/mco2.110.
- 6. Andrews N., Stowe J., Kirsebom F., Toffa S., Rickeard T., Gallagher E., Gower C., Kall M., Groves N., O’Connell A.-M., et al. COVID-19 Vaccine Effectiveness against the Omicron (B.1.1.529) Variant. N. Engl. J. Med. 2022;386:1532–1546. doi: 10.1056/NEJMoa2119451.
- 7. Abbas Q., Kusakin A., Sharrouf K., Jyakhwo S., Komissarov A.S. Follow-up Investigation and Detailed Mutational Characterization of the SARS-CoV-2 Omicron Variant Lineages (BA.1, BA.2, BA.3 and BA.1.1). bioRxiv. 2022. doi: 10.1101/2022.02.25.481941.
- 8. Ramírez J.D., Muñoz M., Hernández C., Flórez C., Gomez S., Rico A., Pardo L., Barros E.C., Paniz-Mondolfi A.E. Genetic Diversity Among SARS-CoV2 Strains in South America May Impact Performance of Molecular Detection. Pathogens. 2020;9:580. doi: 10.3390/pathogens9070580.
- 9. Sagulenko P., Puller V., Neher R.A. TreeTime: Maximum-Likelihood Phylodynamic Analysis. Virus Evol. 2018;4:vex042. doi: 10.1093/ve/vex042.
- 10. Hill V., Rambaut A. Phylodynamic Analysis of SARS-CoV-2. [(accessed on 6 May 2022)]. Available online: https://virological.org/t/phylodynamic-analysis-of-sars-cov-2-update-2020-03-06/420.
- 11. Kumar S., Karuppanan K., Subramaniam G. Omicron (BA.1) and Sub-Variants (BA.1, BA.2 and BA.3) of SARS-CoV-2 Spike Infectivity and Pathogenicity: A Comparative Sequence and Structural-Based Computational Assessment. bioRxiv. 2022. doi: 10.1101/2022.02.11.480029.
- 12. Ribeiro Xavier C., Sachetto Oliveira R., da Fonseca Vieira V., Lobosco M., Weber dos Santos R. Characterisation of Omicron Variant during COVID-19 Pandemic and the Impact of Vaccination, Transmission Rate, Mortality, and Reinfection in South Africa, Germany, and Brazil. BioTech. 2022;11:12. doi: 10.3390/biotech11020012.
- 13. Cui Z., Liu P., Wang N., Wang L., Fan K., Zhu Q., Wang K., Chen R., Feng R., Jia Z., et al. Structural and Functional Characterizations of Infectivity and Immune Evasion of SARS-CoV-2 Omicron. Cell. 2022;185:860–871.e13. doi: 10.1016/j.cell.2022.01.019.
- 14. Christensen P.A., Olsen R.J., Long S.W., Snehal R., Davis J.J., Ojeda Saavedra M., Reppond K., Shyer M.N., Cambric J., Gadd R., et al. Signals of Significantly Increased Vaccine Breakthrough, Decreased Hospitalization Rates, and Less Severe Disease in Patients with Coronavirus Disease 2019 Caused by the Omicron Variant of Severe Acute Respiratory Syndrome Coronavirus 2 in Houston, Texas. Am. J. Pathol. 2022;192:642–652. doi: 10.1016/j.ajpath.2022.01.007.
- 15. Bai Y., Du Z., Xu M., Wang L., Wu P., Lau E.H.Y., Cowling B.J., Meyers L.A. International Risk of SARS-CoV-2 Omicron Variant Importations Originating in South Africa. medRxiv. 2021. doi: 10.1101/2021.12.07.21267410.
- 16. Tian D., Sun Y., Xu H., Ye Q. The Emergence and Epidemic Characteristics of the Highly Mutated SARS-CoV-2 Omicron Variant. J. Med. Virol. 2022;94:2376–2383. doi: 10.1002/jmv.27643.
- 17. Quarleri J., Galvan V., Delpino M.V. Omicron Variant of the SARS-CoV-2: A Quest to Define the Consequences of Its High Mutational Load. GeroScience. 2022;44:53–56. doi: 10.1007/s11357-021-00500-4.
- 18. Wang R., Chen J., Hozumi Y., Yin C., Wei G.-W. Emerging Vaccine-Breakthrough SARS-CoV-2 Variants. ACS Infect. Dis. 2022;8:546–556. doi: 10.1021/acsinfecdis.1c00557.