Abstract
SARS-CoV-2 has caused a high number of deaths in several countries. In Brazil, there were 37 million confirmed cases of COVID-19 and 700,000 deaths caused by the disease. The population size and heterogeneity of the Brazilian population should be considered in epidemiological surveillance due to the varied tropism of the virus. As such, municipalities and states must be factored in for their unique specificities, such as socioeconomic conditions and population distribution. Here, we investigate the spatiotemporal dispersion of emerging SARS-CoV-2 lineages and their dynamics in each microregion from Sergipe state, northeastern Brazil, in the first 3 years of the pandemic. We analyzed 586 genomes sequenced between March 2020 and November 2022 extracted from the GISAID database. Phylogenetic analyses were carried out for each data set to reconstruct evolutionary history. Finally, the existence of a correlation between the number of lineages and infection cases by SARS-CoV-2 was evaluated. Aracaju, the largest city in northeastern Brazil, had the highest number of samples sequenced. This represented 54.6% (320) of the genomes, and consequently, the largest number of lineages identified. Studies also analyzed the relationship between mean lineage distributions and mean monthly infections, daily cases, daily deaths, and hospitalizations of vaccinated and unvaccinated patients. For this, a correlation matrix was created. Results revealed that the increase in the average number of SARS-CoV-2 variants was related to the average number of SARS-CoV-2 cases in both unvaccinated and vaccinated individuals. Thus, our data indicate that it is necessary to maintain epidemiological surveillance, especially in capital cities, since they have a high rate of circulation of resident and non-resident inhabitants, which contributes to the dynamics of the virus.
Keywords: phylogenetic analysis, SARS-CoV-2 genomes, epidemiology, genomic surveillance, Sergipe
Introduction
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) emerged in China in late 2019 and rapidly spread across the globe, leading the World Health Organization (WHO) to declare a pandemic state on 11 March 2020 (1, 2). The virus has been widespread, causing waves of infections in almost all regions of the world (3). The first cases were confirmed in the state of São Paulo in February 2020. After that, actions were taken by the Ministry of Health in order to contain the emerging epidemic (4). As of today, 37.9 million cases in Brazil have resulted in 706,531 deaths, representing a mortality rate of 441.3 individuals per 100,000 inhabitants (accessed on 28 October 2023; available in https://covid.saude.gov.br//). This high mortality rate is related to the lack of a national policy against the disease, the increasing population mobility, especially in large urban centers, the return of face-to-face work activities, difficulties in implementing individual and community preventive measures to reduce the spread of COVID-19, and delays in vaccination have contributed to the emergence and spread of SARS-CoV-2 variants of concern (VOCs) across the country over time (5).
In Brazil, the pandemic was characterized by the co-circulation of multiple variants over time (6). The emergence of new variants was directly related to adaptive mutations in the viral genome that modified the pathogenic potential of SARS-CoV-2. A single amino acid change can dramatically affect a virus’s ability to evade the immune system and complicate the clinical status of infected individuals (7). Alpha (B.1.1.7), Beta (B.1.351), Gamma (P.1), Delta (B.1.617.2), and omicron (B.1.1.529) lineages were important variants associated with greater transmissibility or virulence, reduced neutralization by antibodies obtained through natural infection or vaccination, ability to avoid detection, and/or decreased therapeutic or vaccination efficacy (8).
Monitoring SARS-CoV-2 was possible due to recent technological and scientific advances in genome sequencing and bioinformatics tools, allowing almost real-time genomic surveillance and tracking the emergence and replacement dynamics of variant emergence and prevalence among populations (9). Several studies focusing on genomic surveillance have provided crucial information to understand the dynamics of SARS-CoV-2 lineages in the states or regions of Brazil due to the large differences between inter- and intra-state population sizes, concentration, and dynamics of human movement (10, 11). This proposal has been shown to be relevant to determine the spread of the virus based on the specific characteristics of the state in a refined resolution (11). In the Brazilian Northeast, 7.4 million cases of COVID-19 and 136,000 deaths have already been reported. Bahia, Ceará, and Pernambuco are the states with the highest incidence of cases and deaths in the region. In Sergipe state, 363,329 individuals were diagnosed with COVID-19, resulting in 6,539 deaths (accessed on 28 October 2023; available at https://covid.saude.gov.br//). At the moment, a single study has been identified in the literature related to genomic surveillance in Sergipe, and this analyzed genomes sequenced between March 2020 and February 2021 (5). This demonstrates the necessity to implement new research aimed at understanding the effects of the pandemic.
Therefore, this study aimed to assess the dynamics of SARS-CoV-2 variants from 2020 to 2022 in the state of Sergipe within Brazil. Knowledge gained would identify viral evolutionary patterns and behavior as it relates to epidemiological impacts.
Methods
Study area
Sergipe is located in northeastern Brazil and has a land area of 21,938,188 km2 and an estimated population of 2,338,474 inhabitants. The state has 75 municipalities and is divided into 13 microregions (Agreste de Itabaiana, Agreste de Lagarto, Aracaju, Baixo Cotinguiba, Boquim, Carira, Cotinguiba, Estância, Japaratuba, Nossa Senhora das Dores, Propriá, Sergipana do Sertão do São Francisco, and Tobias Barreto) (Figure 1). The microregion of Aracaju is made up of the capital (Aracaju), and the municipalities of Barra dos Coqueiros, Nossa Senhora do Socorro, and São Cristóvão, forming the metropolitan region of Aracaju, which represents approximately 36% of the state population1.
Data collection
Full-length SARS-CoV-2 genomes from February 2020 to November 2022 were obtained from the GISAID database2. Only complete genomes and complete collection data were used. The sequences were evaluated individually, considering the lineage, which was determined by the Pangolin software3, municipality, and collection date. Soon after, the genomes were separated by year, giving rise to three data sets. The Circos program (12) was used to visualize the distribution of the genomes by strains and municipalities.
In order to correlate the number of lineages of SARS-CoV-2 with the average of infections by months, daily cases, daily deaths, and admissions of vaccinated and unvaccinated patients, the data were uploaded to a cross-country database of COVID-194 (13, 14). The Pearson correlation test was performed using a native stats (V.4.0.3) package available in R software, and the grouped stacked bars with the abundance of lineage between months were performed and represented using the ggplot2 package (15, 16) and the correlation matrix was performed using corrplot package (17). All the differences with p-values <0.05 were considered statistically significant.
Phylogenetic analyses
Multiple sequence alignment was carried out using MAFFT v.7 with auto and add fragments parameters (18). The sequence from Wuhan-Hu-1 (NC_045512.2) was then added as an outgroup. Subsequently, the maximum likelihood (ML) phylogenetic trees were built using IQ-TREE v2.1.2 (19). The nucleotide substitution models TN + F, GTR + F + I + I + R4, and TIM + F + I + I + R3 were selected using ModelFinder in IQ-TREE2 v2.1.2 for the SARS-CoV-2 genomes of 2020, 2021, and 2022, respectively (20). Clade support was estimated using 1,000 replicates of bootstrap. The tree was visualized and edited using the iTOL v.4 tool (21). The haplotype network was created with PopART software version 1.7 (22) using the median-joining method to identify the existence of shared haplotypes.
Spatial analysis
The maps to represent the spatial distribution of SARS-CoV-2 lineages were constructed using the QGIS software version 3.18.2, with the cartographic projection corresponding to the Universal Reference System SIRGAS 2000. The cartographic projection used corresponded to the Universal Transverse Mercator (UTM) system, Terra Datum horizontal model (SIRGAS 2000) to segment by municipalities and states were collected from the databases of the Brazilian Institute of Geography and Statistics (IBGE).
Results
Genomic surveillance of SARS-CoV-2 variants in Sergipe
For this analysis, 586 SARS-CoV-2 viral genomes were evaluated and classified into 36 variant lineages (Figure 2). Sequences have been distributed in 47 municipalities, representing 62.7% of the total. Most of the genomes obtained from the GISAID database have their origin in the Aracaju microregion, as can be seen in Table 1.
Table 1.
Microregions | Municipalities | 2020 | 2021 | 2022 | % |
---|---|---|---|---|---|
Aracaju | Aracaju | 19 | 252 | 49 | 69.6% |
Barra dos Coqueiros | 02 | 11 | 22 | ||
São Cristovão | – | 02 | 12 | ||
Nossa Senhora do Socorro | 01 | 25 | 13 | ||
Estância | Estância | 01 | 05 | – | 2.6% |
Itaporanga D’ajuda | 01 | 07 | 01 | ||
Agreste de Lagarto | Lagarto | 01 | 01 | 05 | 2.2% |
Riachão do Dantas | – | 02 | 04 | ||
Baixo Cotinguiba | Laranjeiras | 01 | 04 | – | 1.9% |
Riachuelo | 01 | 01 | – | ||
Maruim | – | 01 | – | ||
Santo Amaro das Brotas | – | 01 | – | ||
Carmópolis | – | – | 02 | ||
Propriá | Telha | 01 | 01 | – | 1.9% |
Amparo de São Francisco | – | 01 | – | ||
Canhoba | – | 01 | – | ||
Cedro de São João | – | 03 | – | ||
Nossa Senhora de Lourdes | – | 01 | 01 | ||
Propriá | – | 02 | – | ||
Boquim | Tomar do Geru | 01 | 02 | – | 4.4% |
Boquim | – | 06 | 02 | ||
Cristinápolis | – | 02 | – | ||
Itabaianinha | – | 06 | – | ||
Salgado | – | 03 | 03 | ||
Umbaúba | – | – | 01 | ||
Nossa Senhora das Dores | Aquidabã | – | 02 | – | 0.5% |
Nossa Senhora das Dores | – | – | 01 | ||
Sergipana do Sertão do São Francisco | Canindé de São Francisco | – | 14 | – | 4.8% |
Nossa Senhora da Glória | 01 | 05 | – | ||
Gararu | – | 01 | – | ||
Monte Alegre de Sergipe | – | 03 | – | ||
Porto da Folha | – | 04 | – | ||
Cotinguiba | Capela | – | 04 | 03 | 1.9% |
Divina Pastora | – | – | 04 | ||
Carira | Carira | – | 01 | – | 3.4% |
Frei Paulo | – | 03 | 03 | ||
Ribeirópolis | – | 08 | 04 | ||
Pinhão | – | – | 01 | ||
Agreste de Itabaiana | Itabaiana | – | 04 | 09 | 2.9% |
Areia Branca | – | – | 02 | ||
Macambira | – | – | 01 | ||
Malhador | – | – | 01 | ||
Tobias Barreto | Simão Dias | – | 11 | 01 | 3.4% |
Tobias Barreto | – | 04 | 02 | ||
Poço Verde | – | 01 | 01 | ||
Japaratuba | Pirambu | – | 01 | – | 0.5% |
Japaratuba | – | – | 02 | ||
Total | 30 | 406 | 150 |
In 2020, five lineages were detected circulating in Sergipe, B.1 (11 sequences, 36.7%) was the most frequent, followed by B.1.1 (6 sequences, 20%), B.1.1.33 (6 sequences, 20%), B.1.1.28 (4 sequences, 13.3%), and B.1.212 (3 sequences, 10%) (Figure 2). A total of 30 genomes were available on the GISAID database. In total, 19 of those 30 genomes were related to samples from Aracaju (Supplementary Figure S1). Genomic sequences have also been observed in 10 other municipalities (Figure 3). At first, B.1 was identified in the state on 12 March 2020 during the first wave. This sample belongs to an individual who resided in Aracaju with a travel history to Europe (Spain).
For 2021, 406 sequences were used to create the datasets and subsequently classified into 16 viral variant lineages. In total, 212 samples were identified as the P.1 gamma variant, representing approximately 52.2% (Figure 2). Initially, the circulation of this variant was registered on 17 January 2021 in the municipality of Aracaju during the second wave. Delta sequences have been registered in Sergipe between January and September. P.1.2 (56 sequences, 13.8%) and P.2 Zeta variant (50 sequences, 12.3%) were also highly represented (Figure 4). This variant was predominant in infection cases from September and December. A total of 57 genomes of the AY.* lineages were found in the GISAID database. This is distributed in four strains (AY.34.1.1, AY.99.1, AY.99.2, and AY.101). AY.99.2 (45 sequences, 11.1%) was prevalent during this period. Lineages AY.34.1.1, AY.99.1, and AY.101 represented approximately 2.9% of the total genomes (Figure 2). All other strains identified in 2021 represent approximately 10.6% (43 sequences). Aracaju was the municipality with the highest number of strains circulating when compared to other localities (Figure 4). Lineages were also identified in 37 other cities (Supplementary Figure S2).
An alignment with 150 genomes was created using the genomes of 2022, and it was possible to identify 18 lineages distributed in 27 municipalities (Figure 5) (Supplementary Figure S3). In January, nine sublineages of the Omicron variant were identified as circulating. The first variant sample detected was on 3 January 2022. The lineage BA.1.1 (33 sequences, 22%) was the most frequent during the third wave, followed by BA.1 (28 sequences, 18.7%), BA.5.2 (21 sequences, 14%), and BA.5.2.1 (18 sequences, 12%). All other lineages identified represented approximately 33.3% (50 sequences) (Figure 2).
Evolutionary analysis of SARS-CoV-2 lineages
The maximum likelihood phylogenetic tree was constructed to confirm the SARS-CoV-2 variant classification that circulated between February 2020 and November 2022 in the state of Sergipe. Considering the sequences from 2020, the phylogenetic analysis suggested five distinct well-supported groups (B.1, B.1.1, B.1.212, B.1.1.28, and B.1.1.33) (Figure 6). The haplotype network has been constructed with the purpose of characterizing the ancestral relationships maintained between the lineages. Five well-defined clades (B.1, B.1.1, B.1.212, B.1.1.28, and B.1.1.33) were identified. Notably, haplotype sharing was not observed among the sequences from these different strains (Figure 7).
For the genomes from 2021, the ML tree revealed five main well-supported clades. One clade was composed only of the delta variant. N.9, B.1.1.28, and P.2 lineages were divided into different clades with significant support values (Figure 8). B.1.1.28 was identified as a common ancestor of P.1 and P.2. A clade represented by sublineages relative to P.1 (P.1.7, P.1.14, and P.1.2) was observed. However, sequences belonging to lineages B.1.1, B.1.1.33, and P.4 have not demonstrated significant bootstrap values. P.7 clade showed high support value, and its genetic pattern is associated with the P.2 lineage. The haplotype network revealed five heterogeneous clusters, where P.1 was associated with P.1.2, P.1.14, and P.1.7. Another cluster was observed with P.2 and P.7 lineages. N.9, B.1.1.28, and all AY.* remained isolated in the phylogenetic tree (Figure 9).
Analyzing the genomes from 2022, it was possible to observe differences, revealing two distinct clades (Figure 10); both groups were significantly supported. A clade was formed by genomes from lineages BA.5, BA.5.1, BA.5.2, BA.5.2.1, BA.2, BA.2.12.1, BA.2.56, BA.4, and BA.4.1, and another clade was formed by genomes from lineages BA.1, BA.1.1, BA.1.14, BA.1.14.1, BA.1.14.2, BA.1.15, BA.1.17.2, BA.1.1.1, and BA.1.1.14. The haplotype network also suggested two clusters. One cluster was composed by BA.5, BA.5.1, BA.5.2, BA.5.2.1, BA.2, BA.2.12.1, BA.2.56, BA.4, and BA.4.1. On the other hand, the other cluster was formed by lineages BA.1, BA.1.1, BA.1.14, BA.1.14.1, BA.1.14.2, BA.1.15, BA.1.17.2, BA.1.1.1, and BA.1.1.14. In addition, it was observed genomes from different lineages sharing haplotypes, such as BA.1.1|10,322,472, BA.1.1|10,322,464, BA.1.14|10,322,474, BA.5.2.1|15,279,654, BA.5.2.1|15,202,013, BA.5.2.1|15,802,440, and BA.5.2.1|15,802,439 (Figure 11). In the BA.1.1|10,322,472 genome, genetic patterns associated with BA.1 and BA.1.1 have been identified, suggesting the maintenance of ancestral relationship.
Correlation analysis of SARS-CoV-2 lineages and infection cases in Sergipe
A correlation matrix analysis was employed to examine the relationship between the average distributions of viral lineages and the average number of individuals infected, as indicated by registered cases reported by global epidemics (Figure 12). Furthermore, we explored relationships among the averages of hospital admissions in vaccinated and unvaccinated patients as well as the averages of daily deaths and daily cases (Figure 12). Notably, the averages of registered cases exhibited a correlation cluster among all reported cases as well as between vaccinated and unvaccinated patients. Additionally, the averages of daily cases and deaths showed a direct correlation. Hospital admissions in vaccinated patients were associated with clusters of registered cases, while hospital admissions in unvaccinated patients were linked to daily deaths and daily cases. The observed increase in the number of variant lineages during the study period was directly correlated with the averages of infections.
Discussion
This is a pioneering study of the state of Sergipe. The evolutionary history of circulating SARS-CoV-2 genomes over the last 3 years was reconstructed using phylogenetic analyses based on ML and the median-joining method. Our data support that B.1 was the first lineage detected in Sergipe (as recorded in Aracaju on 12 March 2020). However, Gurgel et al. (23) report in their study that the first case of COVID-19 in the state of Sergipe may have occurred a few months earlier, as samples from asymptomatic individuals sent for blood tests between the months of January and April 2020 by reasons unrelated to COVID-19 showed the presence of SARS-CoV-2 immunoglobulins (IgM and IgG) before the notification of clinical cases in the state. The country registered its first COVID-19 case in late February in São Paulo (4). In Bahia, the first confirmed SARS-CoV-2 infections occurred on 28 February, being the first case in the Northeast region5.
The lineages B.1, B.1.1, B.1.1.33, B.1.1.28, and B.1.212 were prevalent until mid-August 2020. In an analysis carried out by Dos Santos et al. (5), B.1 (58.5%), B.1.1.33 (17.1%), B.1.1.119 (12.2%), B.1.1.28 (9.8%), and B.1.212 (2.4%) were dominant from March to August 2020 in Sergipe. In late 2020, the variants zeta (P.2) (24) and gamma (P.1) (25), descendants of lineage variants B.1.1.28, emerged and were associated with the second phase of the pandemic. P.1 was identified in the state of Amazonas in mid-December 2020, with a proposed emergence around November (25, 26).
P.1 was dominant in infection cases by SARS-CoV-2, having their circulation on 17 January 2021. Some studies proposed that the emergence of three mutations E484K, N501Y, and K417T in the Spike protein allowed the virus to escape from the host immune response (27–29). In mid-January 2021, samples from 11 suspected cases and their contact reporting a travel history to/from Amazonas state were screened at the Central Laboratory of Health of the Bahia state (LACEN-BA). Genetic evidence has confirmed for the first time the circulation of the P.1 in the Brazilian Northeast (30). The study conducted by Dos Santos et al. (5) also confirms that P.1 circulated in Aracaju (Sergipe) on 17 January 2021 in a sample belonging to a resident from the city of Manaus (Amazonas) who traveled to Sergipe to visit his family (5).
Our results from the evolutionary analysis showed that B.1.1.28 is a common ancestor of P.1 and P.2. In the studies conducted by Varela et al. (31) and Harvey et al. (32), it was indicated that P.1 and P.2 descend from B.1.1.28 although they have different times of appearance and share the S: E484K mutation. A cluster formed by P.1, P.1.1, P.1.2, and P.1.7 demonstrates the ancestral relationship between these lineages, as reported by Varela et al. (31) and Machado et al. (11). In the present study, a shared ancestry between P.2 and P.7 was also observed similar to that suggested by Lamarca et al. (33).
Assessing the results, it was suggested that AY.99.2 (11.1%) became dominant in cases from September to December 2021. There are signs that the delta variant emerged in October 2020 on the Asian continent as has been classified by WHO (33). In Brazil, the first community-sustained transmission chains of the delta variant were registered in June 2021 in the state of Rio de Janeiro (34), and it has been widely detected in other Brazilian states over time (see footnote 5). Among the delta variants, the AY.99.2 was the most dominant, reaching 58% of all sublineages sampled during the period (35). Some evidence demonstrate that AY.99.2 emerged in Brazil; the first SARS-CoV-2 genomes from this lineage available in the GISAID database are from samples collected in April 2021 in the northeastern state of Ceará (36). Studies have identified mutations in the spike protein of the delta sublineages found in Brazil, and the most common mutations mentioned are T19R, T95I, E156G, DEL157/158, L452R, T478K, D614G, Q677H, P681R, D950N, V1104L, and L1265F7. Some of these mutations can be related to viral fitness advantages such as enhanced viral entry, pathogenesis, and immune escape (37, 38). Despite that, the number of hospitalizations declined from 6.9% (January to June 2021) to 3.6% (July to December 2021) during the community transmission of the delta variant with the progression of vaccination in the second half of 2021 in Sergipe (39).
The phylogenetic tree revealed that the samples of the delta variant formed a monophyletic clade. In an analysis using the neighbor-joining method with genomes from the delta lineage, a compatible structuring with a monophyletic clade was shown, and the omicron variant emerged from it (40).
In this study, the lineage BA.1.1 was reported as most frequent, followed by BA.1, BA.5.2, and BA.5.2.1. Genomic surveillance detected that in February 2022, the omicron variant was majority; 99.8% of the samples analyzed around the country being positive for the variant (see footnote 5). From January to September, BA.1 (4,253 genomes) and BA.1.1 (2,521 genomes) were also prevalent in infection cases by SARS-CoV-2 in the northeast Brazil (see footnote 5). Since the beginning of the pandemic, the SARS-CoV-2 genome has been rapidly evolving. This is mostly due to the inherent polymerase mistakes and host immune selection factors (32). The omicron variant is the most mutated variant containing more than 60 mutations in its genome. In total, 32 of these mutations lie within the receptor binding domain (RBD) of the spike protein (41, 42). The large number of mutations associated with the spike RBD domain can be related to infectivity rates, high transmission capacity, and rapid dispersal potential (26). Among the mutations identified in the omicron variants, 14 are exclusive and found in all the omicron variants (43).
Our phylogenetic tree for omicron VOC suggests the existence of two main clades, one composed of the sublineages linked to BA.1 and the other associated with BA.2, BA.4, and BA.5 variants. In a study developed by Veneziano et al. (44) with Omicron SARS-CoV-2 genomes in Italy, a group composed of BA.1 and another by BA.2, BA.4, and BA.5 lineages was identified. Six mutations have been identified in all the omicron variants, excluding omicron BA.1: Del24-26, V213G, T376A, S371F, D405N, and R408S (43). These mutations may have contributed to the structure of the clades of the phylogenetic tree and also in the haplotype network, as observed in other organisms (45, 46). Notably, our study presents a limitation due to the fact that in some months of 2020 and 2022, there were no records of SARS-CoV-2 genomes in the GISAID database. However, there is agreement between our genomic surveillance results and those observed in other states of Brazil.
Finally, the increase in the average number of SARS-CoV-2 lineages during the studied periods is related to the average number of infections in both unvaccinated and vaccinated individuals. Tarkowski et al. (47) demonstrated that vaccinated individuals presented higher levels of IgG against viral proteins of spike protein-1 (S1) and receptor-binding domain (RBD), which resulted in a better immune response to B.1 and P.1 variants although immune activation is less noticeable in response to the B.1.617.2 variant. A similar study revealed differences in the efficiency of humoral activity in vaccinated individuals against B.1.617.1, B.1.617.2, B.1.351, and P.1 lineages due to mutations in the spike protein (S) (48). Unvaccinated individuals are intrinsically associated with daily cases and deaths. Martins-Filho et al. (39) studied the dynamics of hospitalizations and the predominance of delta and omicron variants in the Northeast of Brazil and found that during the circulation of the delta variant (July to December 2021), the majority of deaths occurred in people who were not vaccinated or who had not completed the vaccination schedule. Furthermore, in 2020, vaccines were scarce, with high hospitalization rates (46% of the population with active infection) (39).
Conclusion
Our data suggest a correlation between the increase in the mean number of variant lineage strains and the mean number of infections in unvaccinated and vaccinated individuals. It is important to note that 3 years after the beginning of the SARS-CoV-2 pandemic, and despite the availability of several vaccines for 2 years, the restrictive measures to contain SARS-CoV-2 spreading were met with several challenges among most countries. In particular, the recent variants are generating new outbreaks of infection, even in countries where the level of vaccinations is high. However, it becomes necessary for continuous monitoring of the most predominant SARS-CoV-2 lineages as well as their specific dynamic and processes of evolution. Therefore, this knowledge gain and continual analysis of variant lineages is imperative for epidemiologists to define public health measures, perform adequate diagnostic tests, and strategically employ vaccines (49).
Despite the number of positive cases of COVID-19 in Sergipe, these did not have minimum values to be submitted for genetic sequencing; and we were unable to establish a stratified correlation between the number of lineages and the severity of COVID-19 cases in both vaccinated and unvaccinated individuals. This correlation could have demonstrated how lineage variability impacts the severity of infections.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.
Ethics statement
Ethical approval was not required for the study involving humans in accordance with the local legislation and institutional requirements. Written informed consent to participate in this study was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and the institutional requirements.
Author contributions
MF and VQ: conceptualization. MF, KF, LS, FN, and JR: investigation. MF, KF, and ER: methodology. MF and KF: formal analysis. MF: project administration. MF and TM: writing – original draft. MF, TM, ML, MA, SS, KF, JR, LS, and CS: writing – review and editing. All authors contributed to the article and approved the submitted version.
Acknowledgments
We thank Marilda Agudo Mendonça Teixeira de Siqueira and Paola Cristina Resende for coordinating sequencing at the Genomics Platform of the Respiratory Virus and Measles Laboratory at the Osvaldo Cruz Foundation (Fiocruz-RJ).
Footnotes
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpubh.2023.1222152/full#supplementary-material
References
- 1.Wu F, Zhao S, Yu B, Chen YM, Wang W, Song ZG, et al. A new coronavirus associated with human respiratory disease in China. Nature. (2020) 579:265–9. doi: 10.1038/s41586-020-2008-3, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.World Health Organization (WHO) . WHO director-General’s opening remarks at the media briefing on COVID-19 – 11 march 2020. (2020). Available at: https://www.who.int/director-general/speeches/detail/who-director-general-s-opening-remarks-at-themedia-briefing-on-covid-19-11-march-2020. (Accessed April 2, 2023)
- 3.He X, Lau EHY, Wu P, Deng X, Wang J, Hao X, et al. Temporal dynamics in viral shedding and transmissibility of COVID-19. Nat Med. (2020) 26:672–5. doi: 10.1038/s41591-020-0869-5 [DOI] [PubMed] [Google Scholar]
- 4.Jesus JG, Sacchi C, Candido DDS, Claro IM, Sales FCS, Manuli ER, et al. Importation and early local transmission of COVID-19 in Brazil, 2020. Rev Inst Med Trop Sao Paulo. (2020) 62:e30. doi: 10.1590/s1678-9946202062030, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Dos Santos CA, Bezerra GVB, de Azevedo MARRA, Alves JC, Tanajura DM, Martins-Filho PR. SARS-CoV-2 genomic surveillance in Northeast Brazil: timing of emergence of the Brazilian variant of concern P1. J Travel Med. (2021) 28:1–3. doi: 10.1093/jtm/taab066, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Alcantara LCJ, Nogueira E, Shuab G, Tosta S, Fristch H, Pimentel V, et al. SARS-CoV-2 epidemic in Brazil: how the displacement of variants has driven distinct epidemic waves. Virus Res. (2022) 315:198785. doi: 10.1016/j.virusres.2022.198785, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Rodriguez-Morales AJ, Cardona-Ospina JA, Gutiérrez-Ocampo E, Villamizar-Peña R, Holguin-Rivera Y, Escalera-Antezana JP, et al. Clinical, laboratory and imaging features of COVID-19: a systematic review and meta-analysis. Travel Med Infect Dis. (2020) 34:101623. doi: 10.1016/j.tmaid.2020.101623, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Galán-Huerta KA, Flores-Treviño S, Salas-Treviño D, Bocanegra-Ibarias P, Rivas-Estilla AM, Pérez-Alba E, et al. Prevalence of SARS-CoV-2 variants of concern and variants of interest in COVID-19 breakthrough infections in a Hospital in Monterrey. Mexico Viruses. (2022) 14:154. doi: 10.3390/v14010154, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Oude Munnink BB, Sikkema RS, Nieuwenhuijse DF, Molenaar RJ, Munger E, Molenkamp R, et al. Transmission of SARS-CoV-2 on mink farms between humans and mink and back to humans. Science. (2021) 371:172–7. doi: 10.1126/science.abe5901, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Giovanetti M, Slavov SN, Fonseca V, Wilkinson E, Tegally H, Patané JSL, et al. Genomic epidemiology of the SARS-CoV-2 epidemic in Brazil. Nat Microbiol. (2022) 7:1490. doi: 10.1038/s41564-022-01191-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Machado LC, Dezordi FZ, de Lima GB, de Lima RE, Silva LCA, Pereira LM, et al. Spatiotemporal transmission of SARS-CoV-2 lineages during 2020-2021 in Pernambuco – Brazil. MedRxiv. (2023) 25:831. doi: 10.1101/2023.01.25.23284831 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, et al. Circos: an information aesthetic for comparative genomics. Genome Res. (2009) 19:1639–45. doi: 10.1101/gr.092759.109, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Hasell J, Mathieu E, Beltekian D, Macdonald B, Giattino C, Ortiz-Ospina E, et al. A cross-country database of COVID-19 testing. Sci Data. (2020) 7:345. doi: 10.1038/s41597-020-00688-8, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Fukutani KF, Barreto ML, Andrade BB, Queiroz ATL. Correlation between SARS-Cov-2 vaccination, COVID-19 incidence and mortality: tracking the effect of vaccination on population protection in real time. Front Genet. (2021) 12:679485. doi: 10.3389/fgene.2021.679485 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Wickham H. ggplot2: Elegant graphics for data analysis. Berlin: Springer; (2016). [Google Scholar]
- 16.Césare N, Mota TF, Lopes FFL, Lima ACM, Luzardo R, Quintanilha LF, et al. Longitudinal profiling of the vaccination coverage in Brazil reveals a recent change in the patterns hallmarked by differential reduction across regions. Int J Infect Dis. (2020) 98:275. doi: 10.1016/j.ijid.2020.06.092 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Wei T, Simko V. R package ‘corrplot’: visualization of a correlation matrix. (Version 0.92). (2021), Available at: https://github.com/taiyun/corrplot
- 18.Katoh K, Rozewicki J, Yamada KD. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform. (2019) 20:1160–6. doi: 10.1093/bib/bbx108, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, von Haeseler A, et al. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol Biol Evol. (2020) 37:1530–4. doi: 10.1093/molbev/msaa015, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Kalyaanamoorthy S, Minh BQ, Wong TKF, Von Haeseler A, Jermiin LS. ModelFinder: fast model selection for accurate phylogenetic estimates. Nat Methods. (2017) 14:587–9. doi: 10.1038/nmeth.4285, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Letunic I, Bork P. Interactive tree of life (iTOL) v4: recent updates and new developments. Nucleic Acids Res. (2019) 47:W256–9. doi: 10.1093/nar/gkz239, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Leigh JW, Bryant D. PopART: full-feature software for haplotype network construction. Methods Ecol Evol. (2015) 6:1110–6. doi: 10.1111/2041-210x.12410 [DOI] [Google Scholar]
- 23.Gurgel RQ, de Sá LC, Souza DRV, Martins AF, Matos ILS, Lima AGA, et al. SARS-CoV-2 has been circulating in northeastern Brazil since February 2020: evidence for antibody detection in asymptomatic patients. J Infect. (2021) 82:186–230. doi: 10.1016/j.jinf.2020.11.037, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Voloch CM, da Silva FR, LGP DA, Cardoso CC, Brustolini OJ, Gerber AL, et al. Genomic characterization of a novel SARS-CoV-2 lineage from Rio de Janeiro. Brazil J Virol. (2021) 95:e00119–21. doi: 10.1128/JVI.00119-21, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Faria NR, Mellan TA, Whittaker C, Claro IM, Candido DDS, Mishra S, et al. Genomics and epidemiology of the P.1 SARS-CoV-2 lineage in Manaus. Brazil Sci. (2021) 372:815. doi: 10.1126/science.abh2644, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Naveca FG, Nascimento V, de Souza VC, Corado AL, Nascimento F, Silva G, et al. COVID-19 in Amazonas, Brazil, was driven by the persistence of endemic lineages and P.1 emergence. Nat Med. (2021) 27:1230–8. doi: 10.1038/s41591-021-01378-7, PMID: [DOI] [PubMed] [Google Scholar]
- 27.Nonaka CKV, Franco MM, Gräf T, de Lorenzo Barcia CA, de Ávila MRN, de Sousa KAF, et al. Genomic evidence of SARS-CoV-2 reinfection involving E484K spike mutation. Brazil Emerg Infect Dis. (2021) 27:1522–4. doi: 10.3201/eid2705.210191, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Resende PC, Bezerra JF, Teixeira Vasconcelos RH, Arantes I, Appolinario L, Mendonça AC, et al. Severe acute respiratory syndrome coronavirus 2 P.2 lineage associated with reinfection case, Brazil, June-October 2020. Emerg Infect Dis. (2021) 27:1789–94. doi: 10.3201/eid2707.210401, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Greaney AJ, Starr TN, Gilchuk P, Zost SJ, Binshtein E, Loes AN, et al. Complete mapping of mutations to the SARS-CoV-2 spike receptor-binding domain that escape antibody recognition. Cell Host Microbe. (2021) 29:44–57.e9. doi: 10.1016/j.chom.2020.11.007, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Tosta S, Giovanetti M, Brandão Nardy V, Reboredo de Oliveira da Silva L, Kelly Astete Gómez M, Gomes Lima J, et al. Short report: early genomic detection of SARS-CoV-2P.1 variant in Northeast Brazil. PLoS Negl Trop Dis. (2021) 15:e0009591. doi: 10.1371/journal.pntd.0009591, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Varela APM, Prichula J, Mayer FQ, Salvato RS, Sant’Anna FH, Gregianini TS, et al. SARS-CoV-2 introduction and lineage dynamics across three epidemic peaks in southern Brazil: massive spread of P.1. Infect Genet Evol. (2021) 96:105144. doi: 10.1016/j.meegid.2021.105144, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Harvey WT, Carabelli AM, Jackson B, Gupta RK, Thomson EC, Harrison EM, et al. SARS-CoV-2 variants, spike mutations and immune escape. Nat Rev Microbiol. (2021) 19:409–24. doi: 10.1038/s41579-021-00573-0, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Lamarca AP, de Almeida LGP, Francisco RDS, Jr Lima LFA, Scortecci KC, Perez VP, et al. Genomic surveillance of SARS-CoV-2 tracks early interstate transmission of P.1 lineage and diversification within P.2 clade in Brazil. PLoS Negl Trop Dis. (2021) 15:e0009835. doi: 10.1371/journal.pntd.0009835, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Singh J, Rahman SA, Ehtesham NZ, Hira S, Hasnain SE. SARS-CoV-2 variants of concern are emerging in India. Nat Med. (2021) 27:1131–3. doi: 10.1038/s41591-021-01397-4 [DOI] [PubMed] [Google Scholar]
- 35.Chen C, Nadeau S, Yared M, Voinov P, Xie N, Roemer C, et al. CoV-Spectrum: analysis of globally shared SARS-CoV-2 data to identify and characterize new variants. Bioinformatics. (2021) 38:1735–7. doi: 10.1093/bioinformatics/btab856, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Romano CM, de Oliveira CM, da Silva LS, Levi JE. Early emergence and dispersal of Delta SARS-CoV-2 lineage AY.99.2 in Brazil. Front Med. (2022) 9:930380. doi: 10.3389/fmed.2022.930380, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Grant RA, Morales-Nebreda L, Markov NS, Swaminathan S, Querrey M, Guzman ER, et al. Circuits between infected macrophages and T cells in SARS-CoV-2 pneumonia. Nature. (2021) 590:635–41. doi: 10.1038/s41586-020-03148-w, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Li D, Edwards RJ, Manne K, Martinez DR, Schäfer A, Alam SM, et al. In vitro and in vivo functions of SARS-CoV-2 infection-enhancing and neutralizing antibodies. Cells. (2021) 184:4203. doi: 10.1016/j.cell.2021.06.021, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Martins-Filho PR, Quintans-Júnior LJ, BDS S, Barboza WS, Cavalcante TF, et al. Dynamics of hospitalizations and in-hospital deaths from COVID-19 in Northeast Brazil: a retrospective analysis based on the circulation of SARS-CoV-2 variants and vaccination coverage. Epidemiol Health. (2022) 44:e2022036. doi: 10.4178/epih.e2022036, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Kandeel M, Mohamed MEM, Abd El-Lateef HM, Enugopala KN, El-Beltagi HS. Omicron variant genome evolution and phylogenetics. J Med Virol. (2022) 94:1627. doi: 10.1002/jmv.27515, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Tsang AK, Cheng PK, Mak GC, Leung PK, Yip PC, Lam ET, et al. Unusual high number of spike protein mutations for the SARS-CoV-2 strains detected in Hong Kong. J Clin Virol. (2022) 148:105081. doi: 10.1016/j.jcv.2022.105081, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Wang L, Cheng G. Sequence analysis of the emerging SARS-CoV-2 variant omicron in South Africa. J Med Virol. (2021) 94:1728–33. doi: 10.1002/jmv.27516 [DOI] [PubMed] [Google Scholar]
- 43.Caputo E, Mandrich L. Structural and phylogenetic analysis of SARS-CoV-2 spike glycoprotein from the most widespread variants. Life. (2022) 12:1245. doi: 10.3390/life12081245, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Veneziano C, Marascio N, De Marco C, Quaresima B, Biamonte F, Trecarichi EM, et al. The spread of SARS-CoV-2 omicron variant in CALABRIA: a spatio-temporal report of viral genome evolution. Viruses. (2023) 15:408. doi: 10.3390/v15020408, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.de Souza Freitas MT, Ríos-Velasquez CM, Costa CR, Figueirêdo CA, Jr, Aragão NC, da Silva LG, et al. Phenotypic and genotypic variations among three allopatric populations of Lutzomyia umbratilis, main vector of Leishmania guyanensis. Parasit Vectors. (2015) 8:448. doi: 10.1186/s13071-015-1051-7, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Lima Costa CR, Jr FMT, Santiago Figueirêdo CA, Jr ANC, da Silva LG, Marcondes CB, et al. Genetic structuring and fixed polymorphisms in the gene period among natural populations of Lutzomyia longipalpis in Brazil. Parasit Vectors. (2015) 8:193. doi: 10.1186/s13071-015-0785-6, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Tarkowski M, de Jager W, Schiuma M, Covizzi A, Lai A, Gabrieli A, et al. Anti-SARS-CoV-2 immunoglobulin isotypes, and neutralization activity against viral variants, according to BNT162b2-vaccination and infection history. Front Immunol. (2021) 12:793191. doi: 10.3389/fimmu.2021.793191 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Liu C, Ginn HM, Dejnirattisai W, Supasa P, Wang B, Tuekprakhon A, et al. Reduced neutralization of SARS-CoV-2 B.1.617 by vaccine and convalescent serum. Cells. (2021) 184:4220. doi: 10.1016/j.cell.2021.06.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Borges LP, Martins AF, de Melo MS, de Oliveira MGB, Neto JMR, Dósea MB, et al. Seroprevalence of SARS-CoV-2 IgM and IgG antibodies in an asymptomatic population in Sergipe. Brazil Rev Panam Salud Publica. (2020) 44:e108. doi: 10.26633/RPSP.2020.108, PMID: [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.