Skip to main content
Genetics and Molecular Biology logoLink to Genetics and Molecular Biology
. 2022 Mar 9;45(1):e20210309. doi: 10.1590/1678-4685-GMB-2021-0309

Evolutionary history of the SARS-CoV-2 Gamma variant of concern (P.1): a perfect storm

Yuri Yépez 1, * , Mariana Marcano-Ruiz 1, * , Rafael S Bezerra 2, * , Bibiana Fam 1, João PB Ximenez 2, Wilson A Silva Jr 2,3, Maria Cátira Bortolini 1
PMCID: PMC8908351  PMID: 35266951

Abstract

Our goal was to describe in more detail the evolutionary history of Gamma and two derived lineages (P.1.1 and P.1.2), which are part of the arms race that SARS-CoV-2 wages with its host. A total of 4,977 sequences of the Gamma strain of SARS-CoV-2 from Brazil were analyzed. We detected 194 sites under positive selection in 12 genes/ORFs: Spike, N, M, E, ORF1a, ORF1b, ORF3, ORF6, ORF7a, ORF7b, ORF8, and ORF10. Some diagnostic sites for Gamma lacked a signature of positive selection in our study, but these were not fixed, apparently escaping the action of purifying selection. Our network analyses revealed branches leading to expanding haplotypes with sites under selection only detected when P.1.1 and P.1.2 were considered. The P.1.2 exclusive haplotype H_5 originated from a non-synonymous mutational step (H3509Y) in H_1 of ORF1a. The selected allele, 3509Y, represents an adaptive novelty involving ORF1a of P.1. Finally, we discuss how phenomena such as epistasis and antagonistic pleiotropy could limit the emergence of new alleles (and combinations thereof) in SARS-COV-2 lineages, maintaining infectivity in humans, while providing rapid response capabilities to face the arms race triggered by host immuneresponses.

Keywords: Gamma, P.1, evolution, COVID-19

Introduction

According to the World Health Organization (WHO) and multiple researchers, the estimated average mortality rate, considering detectable/reported cases, for COVID-19 is lower (2.72%) than the disease caused by MERS-CoV (34.4%) and SARS-CoV (9.6%) (Xiao et al., 2020; ECDC, 2021a,b; Krishnamoorthy et al., 2021; Awadasseid et al., 2021). This number remains low even considering that the number of deaths caused by COVID-19 may be underestimated by 50%, as seen in Wuhan, China (Liu, J. et al., 2021). Despite this relatively low mortality rate, SARS-CoV-2 infection has led to the deaths of 4,219,578 people (WHO, 2021a). Over the past 18 months since the first reported COVID-19 case, the WHO still recognizes that the global public health risks associated with COVID-19 remain very high (WHO, 2021b). Comparatively, SARS-CoV and MERS-CoV infected 8,098 and 2,566 people and killed 774 and 866 people, respectively (WHO, 2003; Alfaraj et al., 2019; WHO, 2020; Petersen et al., 2020).

SARS-CoV, MERS-CoV, and SARS-Cov-2 have high mutation rates (0.80-2.38 × 10-3 substitutions per site per year (Zhao et al., 2004; Cotten et al., 2014; Li R et al., 2020). These mutation rates are of the same order of magnitude as other RNA viruses, and can lead to the acquisition of enhanced virulence and high evolvability, favoring changes in the host and rapid dispersion. A successful zoonotic spillover also depends on the vulnerability of the new host’s defenses, and ecological and climatic conditions. Human populations also have cultural habits, with some of them facilitating the transmission of pathogens, i.e., hugs, kisses, sharing food (Olival et al., 2017; Duffy, 2018). Thus, Homo sapiens has become a potentially easy target of new pathogens in modern times because of its large population size, urbanization, ease of mobility of people between cities, countries, and continents, and close contact with wild, semi-wild, and domesticated animals. These conditions were in place for SARS-CoV, MERS-CoV, and SARS-CoV-2 so that the interspecific barriers were overcome, and related diseases have been reported (Kan et al., 2005; Zaki et al., 2012; Hedman et al., 2021).

However, there is a notable difference in the outbreak trajectories associated with these β-COVs, specializing in infecting humans and causing severe respiratory syndrome symptoms, as mentioned above. No complete scenario explaining such differences is well understood. However, it is possible to suggest that certain potential drivers, shaped by microevolutionary phenomena, can turn a local epidemic into a global pandemic, as found with SARS-CoV-2: stronger tropism involving host cells, high transmissibility, elevated transmission rates from asymptomatic individuals, substantial viral load, and relatively low lethality all powerful triggers for the emergence of evolutionarily successful viral lineages. All these conditions/factors together represent a “perfect storm”.

COVs may have originated millions of years ago (Wertheim et al., 2013); as such, their natural hosts, species of vertebrates, have been under attack for an equivalent time. The success of zoonotic spillover, that is, transmitting a pathogen from a vertebrate animal to a human and vice versa, is a systematic process. Thus, COVs and vertebrates, including primates, are expected to have a long evolutionary history of biological arms races (Meyerson and Sawyer, 2011; Enard et al., 2016; Wang W. et al., 2020). That is, the host defense system wins at a given evolutionary moment and context; in another, the pathogen attack system wins. There is constant selective pressure on the losing side to change, resulting in these two antagonistic stages alternating perpetually, unless one species involved goes extinct, putting an end to the ongoing arms race. This phenomenon can be recognized as a form of ‘Red Queen’ dynamics (Van Valen, 1973).

Regarding SARS-CoV-2, the coronavirus RaTG13 of the brown bat (Rhinolophs affinis) is a potential ancestor (Ji et al., 2020; Li R. et al., 2020; Wu et al., 2020; Zhou et al., 2020).Their genomes have 97.41% identity (Malaiyan et al., 2021); however, at least five amino acid (aa) substitutions (F486L, Q493Y, S494R, N501D, and Y505H) at critical sites of the Spike (S) glycoprotein receptor-binding domain (RBD) of RaTG13 are crucial for the Wuhan-SARS-CoV-2 lineage to acquire high tropism with its cognate human cell receptor (cell-surface peptidase angiotensin-converting enzyme 2, ACE2; Wan et al., 2020). The first strain of SARS-CoV-2 identified in Wuhan, China, in December 2019, was also characterized as having five critical amino acid differences in its RBD when compared with SARS-CoV (L455Y, F486L, Q493N, S494D, N501T; SARS-CoV-2 and SARS-CoV aa respectively; Wan et al., 2020; Andersen et al., 2020). However, at present, the intermediate host of SARS-CoV-2 remains unknown, and even the origin of SARS-CoV-2 is still controversial; many theories have been propagated, although unfortunately for non-scientific reasons. Despite this, similar to other novel viral pathogens that have caused epidemics or pandemics involving human populations, the overwhelming conclusion is that SARS-CoV-2 can be found in human hosts through a series of unhappy accidental encounters with animals (Rasmussen, 2021). For a more recent review on this topic, see Lytras et al. (2021) and Holmes et al. (2021).

The S glycoprotein of SARS-CoV-2 contains a cleavage motif for furin proteases (Coutard et al., 2020; Lau et al., 2020), that mediates efficient fusion of the virus with human cell membranes (Yan et al., 2020). In contrast, in SARS-CoV, furin-mediated cleavage of the S glycoprotein does not occur naturally (Simmons et al., 2004). Follis et al. (2006) introduced a functional furin cleavage site in the SARS-CoV S glycoprotein and observed potentiation of membrane fusion activity. Lau et al. (2020) showed that in vitro, SARS-CoV-2 replication is compromised in cells with deletions or point mutations involving the S1/S2 junction. The authors also highlighted the role of natural selection in shaping viral molecular characteristics (Lau et al., 2020).

In April 2020, Fam et al. (2020) reported the conservation of 30 ACE2 sites with records relevant for interactions with SARS-CoV-like S glycoproteins in H. sapiens populations (Fam et al., 2020). The authors concluded that SARS-CoV-2 has a similar potential to infect humans. Subsequent investigations corroborated these findings; although ACE2 single polymorphisms have been identified in a number of human populations, none of these SNPs markedly altered interactions between ACE2 and the SARS-CoV-2 S glycoprotein (Hashizume et al., 2021). The contrast between low ACE2 diversity in the target species and the relative diversity between species has been interpreted as a sign of the long evolutionary arms race between CoVs and potential mammalian hosts (Fam et al., 2020). However, it was predicted that successive SARS-CoV-2 mutations would occur, impacting the course of the COVID-19 pandemic, including the dynamics of the evolutionary arms race between the virus and its more recent host, H. sapiens (Fam et al., 2020).

The combined impact of these SARS-CoV-2 and human ACE2 characteristics, shaped by evolutionary forces is responsible for the magnitude of the COVID-19 pandemic. For instance, SARS-CoV-2 has high tropism with human ACE2 (Andersen et al., 2020; Wan et al., 2020) and higher transmissibility (R0 = 2.5) than SARS-CoV (2.4), and even higher than the influenza virus (2.0), which caused the 1918 pandemic (Petersen et al., 2020). In addition, 59% of all SARS-CoV-2 transmissions came from asymptomatic individuals (35% from pre-symptomatic individuals and 24% from individuals who never developed symptoms), another powerful trigger for accelerated transmission (Johansson et al., 2021).

In April 2021, the WHO began to recognize the existence of SARS-CoV-2 VOCs and other variants of interest (VOIs) that pose enormous challenges to the management of the outbreak itself, vaccination, and the treatment of symptoms and outcomes linked to the major COVID-19 second/third wave that has plagued several countries more recently (Wang M et al., 2020; Awadasseid et al., 2021; WHO and other health agencies).

The first VOC identified was Beta (B.1.351; Nextstrain clade 20H/V2) in South Africa, which appeared in May-August 2020 (Tegally et al., 2020; World Health Organization, 2021c). This derived lineage is characterized by nine changes in the S glycoprotein beyond the 614G allele already present in its parental lineage, which was dominant in South Africa (Tegally et al., 2020).

The WHO and other reports identified several key SARS-CoV-2 S mutations in Beta (B.1.351): D80A, D215G, 241/243del, K417N, E484K, N501Y, D614G, A701V (WHO, 2021c,d,e,f).

Lineage Alpha (B.1.1.7; Nextstrain clade 20I/V1) emerged in southeast England in September-November 2020 and is rapidly spreading toward fixation (Davies et al., 2021a; World Health Organization, 2021b,c). Nine key mutations involving the S glycoprotein have been identified: 69/70del, 144del, N501Y, A570D, D614G, P681H, T716I, S982A, and D1118H (WHO, 2021c,d,e,f).

In early January 2021, Japanese authorities reported that four people from Amazonas, Brazil, had a new SARS-CoV-2 variant. Currently, the WHO has identified the Gamma lineage (B.1.1.28.1, P.1 or Gamma; Nextstrain clade 20J/V3) with the following key S mutations: L18F, T20N, P26S, D138Y, R190S, K417T, E484K, N501Y, D614G, H655Y, T1027I, and V1176F. Gamma probably originated in the state of Amazonas in November 2020 and in the capital Manaus. In this city, lineage B.1.1.195 was replaced by B1.1.28, and then by B.1.1.28.1 (Gamma) in <2 months (Faria et al., 2021b). Soon after, Gamma quickly spread to other Brazilian states (Faria et al., 2021a, b; WHO, 2021c,d,e,f).

In May 2021, a new VOC was recognized: Delta (B.1.617, Nextstrain clade 21A) which has three sub-lineages: B.1.617.1, B.1.617.2, and B.1.617.3. Key S mutations: L452R, D614G, P681R, ± (E484Q, Q107H, T19R, del157/158, T478K, D950N; WHO, 2021c,d,e,f). Delta lineages were first reported in India in October 2020. The Delta VOC was responsible for the second devastating wave of COVID-19 in India (Ranjan et al., 2021). In June 2021, researchers and WHO members speculated that this VOC could become dominant worldwide because of its increased transmissibility (Campbell et al., 2021; Reuters, 2021).

These VOCs have alleles that confer high transmissibility with some capacity for a second attack (reinfection), even after vaccination (Wibmer et al., 2021; Davies et al., 2021a; Jassat et al., 2021; Davies et al., 2021a; Davies et al., 2021b; Graham et al., 2021; ECDC, 2021a; NERVTAG, 2021;Challen et al., 2021;Bager et al., 2021; Faria et al., 2021a,b; Buss et al., 2021; Taylor, 2021; Martins et al., 2021; Franceschi et al., 2021; Montagutelli et al., 2021; Bogler et al., 2020; Bivins et al., 2020).

In November 2021, the WHO classified the Omicron lineage as a VOC. Omicron is evolving, and the current situation in terms of epidemiology and transmissibility, clinical severity, risk of reinfection and potential impact on diagnostics, vaccines and therapeutics is quite preliminary, but will continue to be refined as more data become widely available (WHO, 2021h).

As genomic screening of SARS-CoV-2 strains expands, it has become possible to identify at least some causes and characteristics that have led to the current COVID-19 pandemic with dramatic consequences for humanity. In addition, several studies have tested natural selection involving the SARS-CoV-2 genome based on the proportion of synonymous to non synonymous substitutions (dN/dS) (Tang et al., 2020; Chaw et al., 2020; Li X.et al., 2020; Sohpal, 2021; Yi et al., 2021). The most prominent signal to emerge from these investigations is the combination of diversifying positive selection and purifying selection, depending on the site considered, within the gene encoding the S glycoprotein. Other studies have compared selective patterns in S with those in other regions of the viral genome. For instance, Koçhan et al. (2021) showed that the S gene, unlike the others, had higher dN/dS ratios throughout the evolution of the pandemic.

In the present study, we investigated 4,977 SARS-COV-2 genomic sequences from Brazil and identified as belonging to the VOC Gamma lineage. Our goal was to determine in greater detail the evolutionary trajectory of this Brazilian autochthonous GammaVOC and its derived sub-lineages, P.1.1, P.1.2.

Material and Methods

We downloaded 4,977 SARS-CoV-2 genomic sequences from Brazil, already classified as Gamma lineage, and publicly available on the GISAID platform (Elbe and Buckland-Merrett, 2017) (downloaded on May 25,2021). Next, we aligned the genomic sequences with the Wuhan genome as a reference (NC_045512.2) using MAFFT v7 software (Katoh et al., 2019) using default parameters.

We chose sequence selection criteria based on lineage assignments made using the PANGO Lineages (PANGOLIN) platform to confirm the sequence lineages (O’Toole et al., 2021). The PANGOLIN tools have robust and reproducible criteria, are constantly revised and updated, and when applied to our dataset, resulted in the conservation of a larger number of sequences and consequently resulted in less loss of variability in the genomic alignment, in contrast to sequence classification criteria based on diagnostic sites or signature mutations (Faria et al., 2021a,b; WHO, 2021c,d,e,f). We also considered possible scenarios in which Gamma diagnostic sites could present selection signals.

Subsequently, we generated two datasets from the initial alignment: (a) only with sequences assigned by PANGOLIN as Gamma VOC (P.1); and (b) with sequences designated as Gamma and also including those designated as the derived lineages P.1.1 and P.1.2. The latter lineage has nine recognized mutations in addition to the P.1 lineage-defining ones: ORF1ab (synC1150T, synC1912T, D762G, T1820I), ORF3a (D155Y, S180F), M (synC26954T), N (synC28789T), and S glycoprotein (A262S)(de Almeida et al., 2021). P.1.1 has 21 mutations, all found in Gamma. One difference involves two other alleles of the N gene, 203K and 204R, found in Gamma but not in the P.1.1 sub-lineage. The dataset with Gamma and derived lineages will be referred to as a “complete dataset” to facilitate reading. Additionally, for phylogeographic analyses, we worked with a third dataset that included more P.1.1 sequences deposited in the GISAID platform after May 25 (sequences downloaded until June 17, 2021).

Selection tests

We used ModelFinder (Kalyaanamoorthy et al., 2017) to select the best-fit substitution model and inferred maximum likelihood trees of genomic sequence alignments of the two datasets using IQ-TREE2 with default settings (Minh et al., 2013). We then filtered the alignment by gene and open reading frame (ORF) using SARS-CoV-2 sequences available in NCBI gene records. We performed a series of selection analyses using HYPHY 2.5.31 (Kosakovsky et al., 2020). We implemented three site-level methods: FUBAR (Murrell et al., 2013) and FEL (Kosakovsky and Frost, 2005), to detect sites subject to pervasive positive (diversifying) and negative selection, and MEME (Murrell et al., 2012), a method at the codon level to detect pervasive and episodic positive selection (diversifying). The MEME, FUBAR, and FEL tests use the ratio of non-synonymous to synonymous substitutions (dN/dS) as metrics. We established a p-value ≤ 0.1 for FEL and MEME for statistical significance thresholds and a posterior probability threshold ≥ 0.9 for FUBAR (Spielman et al., 2019).

Notably, the dN/dS ratio was originally developed for the analysis of genetic sequences of divergent species. However, viral nucleotide substitution rates can be up to millions of times greater than those of their hosts. This rapid evolution is mainly due to high mutation rates. Despite controversies (Kryazhimskiy and Plotkin, 2008), approaches based on the dN/dS ratio are also appropriate for use in the context of virus populations, including to rescue the evolutionary history of SARS-CoV-2 (Kumar et al., 2020).

Network analysis

Haplotype networks are used to visualize genealogical relationships at the intraspecific level and to make inferences regarding the biogeography and history of SARS-CoV-2 Gamma populations. We used this methodology to visualize potential signs of expansion associated with lineages and/or groups of sequences (haplotypes) that carry positively selected alleles. Specifically, to analyze the networks in conjunction with the selection tests (FUBAR, FEL, and MEME), we used the following criteria: we chose haplotypes composed of >7 sequences, and subsequently, the mutations that led to these haplotypes from the main haplotype(s) were noted.

Haplotype detection was performed using DnaSP v6 software (Rozas et al., 2017), considering only the variable sites and disregarding gaps or missing data. Subsequently, we constructed networks using NETWORK software v10.2 (Fluxus Technology, 2020) using the median-joining algorithm (Bandelt et al., 1995). Maximum parsimony calculations (Polzin and Daneschmand, 2003) were used to identify unnecessary median vectors and links. Additionally, to simplify the complex network, we employed the “Frequency>1” criterion (Fluxus Technology, 2020), which ignores unique sequences in the dataset.

Phylogeography of P.1.1 and P.1.2

To better understand the spread of P.1.1 and P.1.2 variants across spatiotemporal scales within Brazil, we employed a discrete diffusion model (Lemey et al., 2009) that maps phylogenetic nodes to their inferred locations of origin, as implemented in the software package BEAST v1.8.4. (Drummond et al., 2012). We worked with two separate data sets, one for P.1.1 (collected until June 17, 2021) and another for P.1.2 (collected until June 6, 2021), both available on the GISAID platform. More P.1.1 sequences were identified, but no additional P.1.2 sequences considering those already downloaded on May 25, 2021. The genomes were classified into lineages using PANGOLIN, as previously described. The sequences classified as P.1.1 and P.1.2 were selected for a total of 18 and 224 sequences in each dataset, respectively. Subsequently, alignments were performed for each lineage using MAFT v7 as described above.

We employed the GTR + G model of nucleotide substitution, a uniform strict molecular clock (8-10 × 10-4 substitutions/site/year) (Naveca et al., 2021), and assumed an exponential population size growth coalescent process. We considered one discretization scheme considering the state’s capital where the samples were taken, using an asymmetric substitution model allowing Bayesian stochastic search variable selection (BSSVS). Markov chain Monte Carlo (MCMC) was run for >80 million generations and sampled every 10,000 generations. The convergence and mixing properties were inspected using Tracer 1.7.1 (Rambaut et al., 2018). After discarding 10% of sampled trees as burn-in, a maximum clade credibility (MCC) tree was obtained using TreeAnnotator 1.8.4 included in the BEAST v1.8.4 package. We used the resulting MCC tree as input for SpreaD3 v0.9.7.1 (Spatial Phylogenetic Reconstruction of Evolutionary Dynamics using Data-Driven Documents, Bielejec et al., 2018), which allowed us to analyze and visualize the reconstructions resulting from Bayesian inference of the variants between February and June 2021.

Results

We detected certain changes involving the sites that define the Gamma VOC according to Faria et al. (2021a, b) and the WHO (2021c) (Table 1). For instance, of the 4,977 sequences reported as Gamma in GISAID, we observed that 1,442 did not have a complete set of key mutations (alleles) that characterize it: ~29% relative to the initial number of sequences. Notably, 922 (18.5%) sequences presented a reverse mutation at position 417, that is, lysine residues (K) rather than threonines (T) in the S glycoprotein. We also observed 37 (~0.68%) sequences with the 614D allele (aspartic acid residue present in the Wuhan-SARS-CoV-2 sequence). 417T and 614G are considered diagnostic alleles for Gamma (WHO, 2021b,c,e).

Table 1. Observed sequences without P.1 (Gamma) mutation signatures*.

Gene Gamma signature mutations Number of sequences without signature mutations %
ORF1ab synT733C 5 0.10
synC2749T 88 1.77
S1188L 94 1.89
K1795Q 1 0.02
del11288-11296(3675-3677) SGF 192 3.86
synC12778T 116 2.33
synC13860T 87 1.75
E5665D 90 1.81
Spike L18F 6 0.12
T20N 88 1.77
P26S 6 0.12
D138Y 209 4.20
R190S 38 0.76
K417T 922 18.53
E484K 15 0.30
N501Y 11 0.22
D614G 37 0.68
H655Y 4 0.08
T1027I 4 0.08
N P80R 48 0.96
ins28269-28273 - -
ORF8 E92K 94 1.89

*According to Faria et al. (2021a, b) and WHO, 2021c,d,e,f. Total sequences: 4,977.

When working with public datasets from various research and diagnostic centers, it is difficult to know whether the loss of one or more diagnostic alleles represents a methodological artifact or a real phenomenon involving the natural evolution of the Gamma lineage. Therefore, the situation is likely to be a combination of both. However, to minimize the chance of error and considering our approach to identify evolutionary footprints, we chose to classify lineages using the PANGOLIN platform (Table 2). The PANGOLIN tool preserves intra-clade diversity, as >93% of the available sequences were classified as belonging to the Gamma lineage. In addition, 6.4% of them could be assigned to other lineages, including lineages derived from Gamma, such as P.1.1 and P.1.2 (Table 2).

Table 2 - . Classification of sequences according to PANGOLIN.

Lineage Number of sequences %
B.1 3 0.06
B.1.1 58 1.17
B.1.1.28 92 1.85
B.1.1.74 1 0.02
B.1.560 1 0.02
B.1.566 1 0.02
P.1 4,660 93.63
P.1.1 15 0.30
P.1.2 146 2.93

Notably, using the PANGOLIN tool, 33 and 902 sequences with alleles 614D and 417K, respectively, were preserved, indicating that these allelic reversions to an ancestral state may represent an actual phenomenon within Gamma.

Table 3 and Table S1 present a comparison of the positively selected sites, considering the two datasets (Gamma and the complete dataset). The same genes in both datasets presented sites under positive selection (S, N, M, E, ORF1a, ORF1b, ORF3, ORF6, ORF7a, ORF7b, ORF8, and ORF10; Table S1).

Table 3 -. Comparison between positively selected sites detected in VOC P.1 (Gamma) and its potential derived-lineages P.1.1 and P.1.2.

Positive selection Gamma Gamma, P.1.1, P.1.2
Sites detected by at least one of the methods 197 214
Sites detected by all three methods 77 52
MEME detected sites for at least 1 branch 126 92
Sites detected only by MEME (possible episodic selection) 23 20
Diagnostic sites that are under selection 9 12

Using the MEME method, Faria et al. (2021b) found 18 sites with positive selection signatures in N, ORF1a, ORF1b, and ORF3, including some Gamma S diagnostic sites. Notably, in our analysis, a number of these sites (e.g., E484K) did not exhibit evidence of positive selection.

Interestingly, diagnostic sites that lost the positive selection signal in our study were not entirely fixed, and apparently they were not on the way to becoming fixed within the Gamma clade. This was already noticeable through the data presented in Table 1, but these suggestions become more robust because the diagnostic sites that lose positive selection signatures do not appear to be subject to the powerful action of purifying selection (Table S2).

Our next step was to generate networks involving all 12 genes that showed sites whose diversity seemed to be the result of positive selection. We chose to show those that met the criteria of haplotypes composed of >7 sequences and with some signal of expansion based on network topology for more detail, see Material and Methods). For instance, star-like clusters of nodes surrounding a founder node are classic scenarios that reveal expansion events. It is also known that star phylogeny networks present many rare haplotypes, with one or a few mutational steps from a central haplotype at high frequency (Bandelt et al., 1995). However, because of the frequent transmission of viruses, rapid lineage expansion involving rare and random alleles can occur due to founder effects (Croucher and Didelot, 2015; Campbell et al., 2021). However, despite the expected scenario involving random phenomena, we hypothesized that the adaptive advantage led, at least in part, to the expansion of lineages within Gamma.

Table 4 shows the sites that met these criteria in both datasets [Gamma and Gamma/P1.1/P.1.2; S (L5F, T572I, A845S), ORF3 (L83F, K16N, L85F, D27Y), ORF1a (G150S, G519S, P2046T, L642F, K3353R), ORF1b (L314P, A1643V, D1264E, T1774I)], while others were unique to one or another dataset, suggesting potentially different evolutionary pathways from the ancestor (Gamma) and its two more recently derived lineages.

Table 4 -. Haplotypes with more than seven sequences, a signal of expansion based on networks, and alleles in sites under positive selection.

Gamma sequences Gamma, P.1.1, and P.1.2 sequences
Spike
Haplotype N RNA sequence position/ Aminoacid change Brazilian region Haplotype1 N RNA sequence position/ Aminoacid change Brazilian region
6 12 1715 /T572I SE 5 67 2533/A845S SE
15 66 2533 /A845S SE 10 9 80/A27V S, SE, N, NE
19 30 13/L5F SE, S, CW, N, NE 13 12 1715/T572I SE
45 31 1841 /G614D S 23 31 13/L5F SE, S, CW, N, NE
40 9 35/S12F SE, S
101 9 2533/A845S SE
ORF3
2 161 249/L83F SE, S, CW 2 226 249/L83F CW, S, SE
11 12 161/A54V SE, NE 3 17 48/K16N SE
13 16 604/V202L SE 20 10 255/L85F SE
15 10 255/L85F SE 36 33 79/D27Y SE
19 27 97/A33S SE
28 15 48/K16N SE
32 33 79/D27Y SE
100 8 524/T175I SE
ORF1a
26 76 9626/A3209V SE, NE, S 5 9 10525/H3509Y SE, NE
35 15 448/G150S SE, NE 64 15 448/G150S SE, NE
183 12 12521/T4174I SE, NE, CW, N 79 30 6137/P2046L2 SE, NE, CW
204 10 1555/G519S SE 89 19 10058/K3353R SE, NE
206 13 6136/P2046T SE 222 10 1555/G519S SE, CW
210 9 1924/L642F SE, CW 224 13 6136/P2046T2 SE
173 17 10058/K3353R SE, NE 228 9 1924/L642F SE, CW
ORF1b
2 50 941/L314P SE,N,CW,S 11 56 941/L314P CW, SE, N, S
21 9 4928/A1643V SE, N, S 32 8 5484/K1828T4 SE
43 74 3792/D1264E S, SE 39 9 4928/A1643V SE, S, NE
253 8 5321/T1774I SE 59 8 5483/K1828T4 SE
60 74 3792/D1264E S, SE, CW, NE
65 35 653/P218L SE, N, CW, NE
117 19 246/K82N SE
135 11 5321/T1774I3 SE
261 8 5321/ T1774I3 SE

N: Number of sequences present in each Haplotype. Brazilian regions: SE: Southeast; S: South; Central-West: CW; N: North; NE: Northeast. In bold are the changes in common between both datasets. 1Networks with haplotypes considering the complete dataset (Gamma plus P1.1 and P1.2 sequences) can be seen in Figure S1 A-D. 2A non-synonymous mutation in different position at the same codon. 3While one mutation clearly occurs in a sequence of an ancestral haplotype still existing, the other comes from a median vector automatically generated by the software. A median vector represents potential ancestral sequence/haplotype, but not represented in present sampling. 4A non-synonymous mutation in different position at the same codon.

The simplified networks (Figure S1 A-D), considering our complete dataset, revealed a known star-like pattern, as well as some thought-provoking findings for gene/ORFs S, ORF3, ORF1a, and ORF1b. Some rare reticulations may be due to parallel mutations, homoplasy, recombination, and/or methodological errors. For example, the synonymous substitution C1818T in the RNA sequence is recurrent in S and causes several reticulations (Figure S1 A), but we do not know if it represents a natural SARS-CoV-2 genomic mutational hotspot, or a simple annotation or sequencing error.

Haplotypes 5 and 23 (H_5 and H_23, Figure S1 A) have the selected sites A845S and L5F in the S. H_23 is present in all Brazilian regions, whereas H_5 is present only in the south-east, the most densely populated region of the country. It is noteworthy that the positive selection signal was not lost when considering the derived lineages (Table 4), similar to what happens with the ORF3 selected sites L83F and D27Y; ORF1a selected sites G150S, G519S, P2046T/L, and K3353R, as well as ORF1b selected sites L314P, D1264E, and T1774I (Table 4),all of which were present in haplotypes with some level of expansion (Figure S1 B: H_2, H_36; Figure S1 C: H_64, H_222, H_79, H_89; Figure S1 D: H_11, H_60, H_261, respectively). Other branches that led to haplotypes with signs of expansion had sites under positive selection detected only when the complete data were analyzed (Table 4). For example, H_5 originated from a non-synonymous mutational step (position 10525 of the RNA sequence/ H3509Y) from H_1 of ORF1a (Figure S1 C). ORF1b also presented positively selected sites under these conditions (H_32, 5484,5483/K1828T; H_65, 652/ P218L; H_117, 246/ K82N).

Notably, some critical diagnostic sites, whose positive selection signals were detected by us (Table S1), did not seem to be relevant in the networks, indicating no apparent sign of expansion (according to our criteria), at least so far (e.g., ORF1a T1820I and ORF3 D155Y diagnostic sites of P.1.2; de Almeida et al. 2021).

On the other hand, Table S3 shows that only the P.1.2-derived lineage assembles a more exclusive set of sequences: 27% and 100% of the ORF3 and ORF1a sequences, identified by PANGOLIN as P.1.2, clustered in H_2 (Figure S1 B) and H_5 (Figure S1 C). Notably, H_5 (ORF1a) was composed of sequences with the potentially positively selected allele 3509Y. This finding indicated that P.1.2 has at least one site, not previously identified, that characterizes it as a target of natural selection and can endow a favorable fitness to this emerging lineage.

Concomitantly, our analysis revealed a large number of sites under purifying selection (Table S2), which maintain the general SARS-CoV-2 “status quo,” as a specialist to infect humans (Fam et al., 2020).

Phylogeographic analysis of P.1.1 and P.1.2 sequences (Figure S2 A and B) revealed that the former seems to have originated in the state of Goiás (central-west region, CW) and spread mainly to the southern (S) and south-eastern (SE) states, not reaching other areas; however, the number of sequences used in the analysis of P.1.1 was low, and this result should be taken with caution when making larger inferences. Variant P.1.2 seems to have originated in Rio Grande do Sul, the southernmost state in Brazil, with a broader distribution, although mostly restricted to the country’s east. Notably, H_5 (ORF1a), composed only of P.1.2 3509Y sequences, was present in the southeast and northeast regions of Brazil (Figure S1 C). This result suggested that H3509Y of ORF1a occurred after P.1.2 had dispersed from the S to the southwest (SW) and northeast (NE) regions.

Discussion

The perfect storm described herein, culminating in the ongoing tragedy of the COVID-19 pandemic, was only possible because of particular evolutionary events in the trajectory of SARS-CoV-2 that maintained its high transmissibility and relatively low lethality. The dominance of certain SARS-CoV-2 lineages over time relative to others is robust evidence of this fact and that the evolutionary arms race between SARS-CoV-2 and its host is in full swing. In this scenario, we must also compute the “Homo sapiens reaction,” including natural immune responses (probably shaped for millions of years due to likely recurrent attacks of CoVs; Meyerson and Sawyer, 2011; Enard et al., 2016; Wang W. et al., 2020) and those induced by current large-scale vaccine deployment, among other containment and pharmacological measures. In addition, Gamma was found in Brazil, a territory already swept by other SARS-CoV-2 lineages; the interaction between different sets of Gamma × non-Gamma lineages could also play an important role in determining whether a Gamma lineage would be able to expand.

Because of limited genome size, as seen in viruses, relatively few nucleotide sites are free to vary, despite high mutation rates (Eigen, 1996; Holmes, 2003). Thus, convergent evolution (e.g., the parallel occurrence of identical alleles in distant lineages) is relatively common among RNA viruses (Cuevas et al., 2002). Weinreich et al. (2006) demonstrated how many mutational pathways are repeatedly selected in pathogenic microorganisms. The phenomena of epistasis (non-additive interactions among alleles) also dictates the complex course of viral evolution (Holmes and Rambaut, 2004; Fragata et al., 2019). In the presence of epistasis, the fitness effect of a combination of multiple non-interacting alleles of the same or related genes/pathways corresponds to the sum of fitness effects of the individual alleles (Ferretti et al., 2018; Østman et al., 2012; Fragata et al., 2019). Thus, epistasis limits the possible routes to high fitness, and orders which sequences of consecutive mutations are feasible, but can also open new functional evolutionary paths, promoting adaptive novelty (Usmanova et al., 2015; Starr and Thornton, 2016; Ferretti et al., 2016; Fragata et al., 2019). Sanjuán et al (2004) showed details of how the architecture of fitness in RNA viruses depends on epistasis. Recently, epistasis has been cited as a phenomenon behind the rapid and successful dispersion of the Omicron variant (BBC News Mundo, 2021). Another important phenomenon in the evolution of viral genomes is antagonistic pleiotropy, i.e., mutations beneficial in one host may be deleterious in others. Antagonistic pleiotropy may limit the range of adaptations and promote the evolution of specialization (Santiago et al., 2009). This concept is different from that of classic pleiotropy, characterized by genes/ORFs that affect more than one independent trait, a mechanism that promotes substantial modulation of variation (Wagner and Zhang, 2011), which likely also impacts the evolution of SARS-CoV-2. Below, we discuss how examples presented in this work illustrate the action of such mechanisms.

It is known that a tyrosine (Y) residue at position 501 (located in the receptor-binding domain, RBD, of the S protein), rather than an asparagine (N; present in Wuhan SARS-CoV-2 genome sequence), is present in Alpha, Beta, and Gamma VOCs, potentially creating opportunities for these to become more infectious and partially resistant to therapeutics blocking RBD-ACE2 interactions (Liu H. et al., 2021; Narayanan and Procko, 2021). 501Y endows these lineages with the ability to escape from therapeutically relevant antibodies and the host immune system (Wibmer et al., 2021). Other derived lineages also harbor the N501Y mutation, signaling its recurrence (Lemmermann et al., 2021). VOCs Beta and Gamma share an additional RDB mutation, K417N/T (Abdool and de Oliveira, 2021), while UK researchers detected cases of Alpha lineages with the 484K allele, which also characterizes the Beta and Gamma lineages (Shamsian, 2021; Faria et al., 2021b). Some studies have shown that lineages containing the 417N allele are more effective in escaping antibody neutralization than Gamma, which presents the 417T allele (Garcia-Beltran et al., 2021; Liu Y. et al., 2021). In addition, the 484K allele creates a new binding site for the amino acid at position 75 in human ACE2. This interaction seems stronger than the binding between ACE2 and the original main site located at position 501 (Ferrareze et al., 2021; Nelson et al., 2021). Other studies have shown that 484K reduces neutralization by polyclonal antibodies (Greaney et al., 2021; Oliveira et al., 2021). Oliveira et al. (2021) identified an immune dominant epitope (S415-429) recognized by 68% of sera from convalescent Brazilians infected with the ancestral SARS-CoV-2 lineage. This immune dominant RBD region harbors a mutational hotspot site at position 417. The same authors also performed simulations that indicated impaired RBD binding to previous infection- or vaccine-induced neutralizing antibodies in both Beta (417N) and Gamma (417T) VOCs (Oliveira et al., 2021). It is noteworthy that 614G started to appear as a diagnostic allele for VOCs Alpha, Beta, and Gamma only in the WHO report of April 27 (2021 g). Many recurring mutations lead to the exchange of amino acids D > G at position 614 of the SARS-CoV-2 S glycoprotein. Lineages with the 614G allele were rare before March 2020, but became dominant after May 2020, suggesting that the 614G allele might improve viral fitness (Plante et al., 2020). More recently, Xu et al. (2021) reported that the 614G allele could lower the energy barrier for conformational transformation from the RDB closed pre-fusion state to the fusion-prone open state, resulting in even greater affinity of SARS-CoV-2 S protein binding to ACE2. The SARS-CoV-2 lineage from Wuhan harbors the D614 allele.

Some positively selected sites have also been highlighted in studies by the GISAID platform group that applied the MEME and FEL methods to a database of approximately 390,000 sequences per gene/ORF (Gamma sequences included). This team of researchers detected 213 sites with positive selection signatures, 27% of which were located in S; the majority had the same aa changes as we observed (Pond, 2020). For example, the positively selected site T175I (ORF3; Table S1), also referred to as a site where epitopes overlap, the minor allele (isoleucine-I) is found on other continents (Pond, 2020), indicating that it is recurrent in several lineages in addition to Gamma. The recurrence of identical alleles in lineages which are otherwise relatively phylogenetically distant appears to be a constant.

All of these studies reinforce the functionality of these positions, and the same mutational pathways are repeatedly selected for. We found in Gamma sequences ancestral alleles at all of these sites (417, 484, 501, and 614), but in different combinations. In other words, in most cases, there are alternations between alleles (amino acid residues) present in the Wuhan sequence and derived alleles. Our results also showed no negative selection signals involving these positions (Table S2), indicating a certain margin for the evolvability (evolutionary capacity) of the SARS-CoV-2 S protein, but within an evolutionary trajectory also modulated by the phenomena of epistasis and pleiotropy.

Relatively few alternatives can represent adaptive novelties in S (for example glutamine (Q) at position 484 in Delta). The potentially positively selected allele 3509Y (ORF1a) in nine P.1.2 sequences that form an expanding haplotype also illustrates another adaptive novelty (Figure S1 C). The role of CoV ORF1a does not seem to be limited to viral transcription and replication. Graham et al. (2008) demonstrated its function in virulence, virus-cell interactions, and alterations to virus-host responses. More recent investigations (Emam et al., 2021) demonstrate that proteins encoded by ORF1a and ORF1b together (ORF1ab) are involved in SARS-CoV-2 pathogenicity and infectivity. Only functional studies will define whether there is a gain in fitness in P.1.2 due to 3509Y. These findings suggest a limitation to the emergence of variability inSARS-CoV-2 modulated by positive selection involving critical positions, as well as the likely existence of compensatory alleles due to epistasis and/or pleiotropy (both antagonistic and classical).

We also observed that many lineages with potentially advantageous alleles succumbed, while others, for no obvious or apparent reasons, expanded (Figure S1 A-D). This finding clearly illustrates the effect of stochastic events (i.e., a rapid expansion of lineages with rare neutral alleles occurring due to founder effects after they reach a large urban center), already predicted to be important in viral evolution (Croucher and Didelot, 2015; Campbell et al., 2021). These two powerful forces (natural selection and random events) are not mutually exclusive because a variant under selection can spread more rapidly in urban and densely populated areas.

Finally, our findings should be considered with caution. For instance, although widely used in population studies involving organisms with high mutation rates (Kumar et al., 2020), methods based on the dN/d S ratio at the population level may result in false-positive results (Kryazhimskiy and Plotkin, 2008). In addition, the quantitative and qualitative complexity of positive selection regimes using the present population genomic data cannot be accessed through our analysis.

Acknowledgments

This research was supported financially by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES - Finance code 001), the Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq), the Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP), and the Instituto de Pesquisa do Câncer de Guarapuava. The funders had no role in the study design, data collection and analysis, decision to publish, or manuscript preparation.

Supplementary Material.

The following online material is available for this article:

Table S1 - . Results for positive selection tests (FUBAR, MEME and FEL).
Table S2 -. Results for negative selection tests (FUBAR and FEL).
Table S3 - . P.1.1 and P.1.2 sequences in genes/ORFs with sites under positive selection.
Figure S1 -. Networks: a) Spike; b) ORF3; c) ORF1a; d) ORF1b.
Figure S2 - . Geographical dispersion of lineages P.1.1 (A) and P.1.2(B).

References

  1. Abdool Karim S, de Oliveira T. New SARS-CoV-2 variants - clinical, public health, and vaccine implications. N Engl J Med. 2021;384:1866–1868. doi: 10.1056/NEJMc2100362. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Alfaraj SH, Al-Tawfiq JA, Assiri AY, Alzahrani NA, Alanazi A, Memish ZA. Clinical predictors of mortality of Middle East Respiratory Syndrome Coronavirus (MERS-CoV) infection: A cohort study. Travel Med Infect Dis. 2019;29:48–50. doi: 10.1016/j.tmaid.2019.03.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Andersen KG, Rambaut A, Lipkin WI, Holmes EC, Garry RF. The proximal origin of SARS-CoV-2. Nat Med. 2020;26:450–452. doi: 10.1038/s41591-020-0820-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Awadasseid A, Wu Y, Tanaka Y, Zhang W. SARS-CoV-2 variants evolved during the early stage of the pandemic and effects of mutations on adaptation in Wuhan populations. Int J Biol Sci. 2021;17:97–106. doi: 10.7150/ijbs.47827. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bager P, Wohlfahrt J, Fonager J, Albertsen M, YssingMichaelsen T, HoltenMøller C, Ethelberg S, Legarth R, Fischer Button MS, Gubbels SM, et al. Increased risk of hospitalization associated with infection with SARS-CoV-2 Lineage B.1.1.7 in Denmark. SSRN Electron J. 2021 doi: 10.2139/ssrn.3792894. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bandelt HJ, Forster P, Sykes BC, Richards MB. Mitochondrial portraits of human populations using median networks. Genetics. 1995;141:743–753. doi: 10.1093/genetics/141.2.743. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bielejec F, Baele G, Vrancken B, Suchard MA, Rambaut A, Lemey P. SpreaD3: Interactive visualization of spatiotemporal history and trait evolutionary processes. Mol Biol Evol. 2016;33:2167–2169. doi: 10.1093/molbev/msw082. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bivins A, Greaves J, Fischer R, Yinda KC, Ahmed W, Kitajima M, Munster VJ, Bibby K. Persistence of SARS-CoV-2 in water and wastewater. Environ Sci Technol Lett. 2020;7:937–942. doi: 10.1021/acs.estlett.0c00730. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Bogler A, Packman A, Furman A, Gross A, Kushmaro A, Ronen A, Dagot C, Hill C, Vaizel-Ohayon D, Morgenroth E, et al. Rethinking wastewater risks and monitoring in light of the COVID-19 pandemic. Nat Sustain. 2020;3:981–990. [Google Scholar]
  10. Buss LF, Prete CA, Abrahim CM, JrA Mendrone, Salomon T, Almeida-Neto C, de França RFO, Belotti MC, Carvalho MPS, Costa AG, et al. Three-quarters attack rate of SARS-CoV-2 in the Brazilian Amazon during a largely unmitigated epidemic. Science. 2021;371:288–292. doi: 10.1126/science.abe9728. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Campbell F, Archer B, Laurenson-Schafer H, Jinnai Y, Konings F, Batra N, Pavlin B, Vandemaele K, Kerkhove MD, Van Jombart T, et al. Increased transmissibility and global spread of SARS-CoV-2 variants of concern as at June 2021. Euro Surveill. 2021;26:2100509. doi: 10.2807/1560-7917.ES.2021.26.24.2100509. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Challen R, Brooks-Pollock E, Read JM, Dyson L, Tsaneva-Atanasova K, Danon L. Risk of mortality in patients infected with SARS-CoV-2 variant of concern 202012/1: Matched cohort study. BMJ. 2021;372(579) doi: 10.1136/bmj.n579. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Chaw SM, Tai JH, Chen SL, Hsieh CH, Chang SY, Yeh SH, Yang WS, Chen PJ, Wang HY. The origin and underlying driving forces of the SARS-CoV-2 outbreak. J Biomed Sci. 2020;27:73. doi: 10.1186/s12929-020-00665-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Cotten M, Watson SJ, Zumla AI, Makhdoom HQ, Palser AL, Ong SH, Al Rabeeah A, Alhakeem RF, Assiri A, Al-Tawfiq JA, et al. Spread, circulation, and evolution of the Middle East respiratory syndrome coronavirus. mBio. 2014;5:e1062-13. doi: 10.1128/mBio.01062-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Coutard B, Valle C, Lamballerie X, Canard B, Seidah NG, Decroly E. The spike glycoprotein of the new coronavirus 2019-nCoV contains a furin-like cleavage site absent in CoV of the same clade. Antiviral Res. 2020;176:104742. doi: 10.1016/j.antiviral.2020.104742. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Croucher NJ, Didelot X. The application of genomics to tracing bacterial pathogen transmission. Curr Opin Microbiol. 2015;23:62–67. doi: 10.1016/j.mib.2014.11.004. [DOI] [PubMed] [Google Scholar]
  17. Cuevas JM, Elena SF, Moya A. Molecular basis of adaptive convergence in experimental populations of RNA viruses. Genetics. 2002;162:533–542. doi: 10.1093/genetics/162.2.533. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Davies NG, Abbott S, Barnard RC, Jarvis CI, Kucharski AJ, Munday JD, Pearson CAB, Russell TW, Tully DC, Washburne AD, et al. Estimated transmissibility and impact of SARS-CoV-2 lineage B.1.1.7 in England. Science. 2021;372:eabg3055. doi: 10.1126/science.abg3055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Davies NG, Jarvis CI, CMMID COVID-19 Working Group, Edmunds WJ, Jewell NP, Diaz-Ordaz K, Keogh RH. Increased mortality in community-tested cases of SARS-CoV-2 lineage B.1.1.7. Nature. 2021;593:270–274. doi: 10.1038/s41586-021-03426-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Drummond AJ, Suchard M A, Xie D, Rambaut A. Bayesian Phylogenetics with BEAUti and the BEAST 1.7. Mol Biol Evol. 2012;29:1969–1973. doi: 10.1093/molbev/mss075. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Duffy S. Why are RNA virus mutation rates so damn high? PLoS Biol. 2018;16:e3000003. doi: 10.1371/journal.pbio.3000003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Eigen M. On the nature of virus quasispecies. Trends Microbiol. 1996;4:216–218. doi: 10.1016/0966-842X(96)20011-3. [DOI] [PubMed] [Google Scholar]
  23. Elbe S, Buckland-Merrett G. Data, disease and diplomacy: GISAID’s innovative contribution to global health. Glob Chall. 2017;1:33–46. doi: 10.1002/gch2.1018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Emam M, Oweda M, Antunes A, El-Hadidi M. Positive selection as a key player for SARS-CoV-2 pathogenicity: Insights into ORF1ab, S and E genes. Virus Res. 2021;302:198472. doi: 10.1016/j.virusres.2021.198472. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Enard D, Cai L, Gwennap C, Petrov DA. Viruses are a dominant driver of protein adaptation in mammals. Elife. 2016;5:e12469. doi: 10.7554/eLife.12469. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Fam BSO, Vargas-Pinilla P, Amorim CEG, Sortica VA, Bortolini MC. ACE2 diversity in placental mammals reveals the evolutionary strategy of SARS-CoV-2. Genet Mol Biol. 2020;43:e20200104. doi: 10.1590/1678-4685-GMB-2020-0104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Faria NR, Mellan TA, Whittaker C, Claro IM, Candido D da S, Mishra S, Crispim MAE, Sales FCS, Hawryluk I, McCrone JT, et al. Genomics and epidemiology of the P.1 SARS-CoV-2 lineage in Manaus, Brazil. Science. 2021;372:815–821. doi: 10.1126/science.abh2644. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Ferrareze PAG, Franceschi VB, Mayer A de M, Caldana GD, Zimerman RA, Thompson CE. E484K as an innovative phylogenetic event for viral evolution: Genomic analysis of the E484K spike mutation in SARS-CoV-2 lineages from Brazil. Infect Genet Evol. 2021;93:104941. doi: 10.1016/j.meegid.2021.104941. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Ferretti L, Schmiegelt B, Weinreich D, Yamauchi A, Kobayashi Y, Tajima F, Achaz G. Measuring epistasis in fitness landscapes: The correlation of fitness effects of mutations. J Theor Biol. 2016;396:132–143. doi: 10.1016/j.jtbi.2016.01.037. [DOI] [PubMed] [Google Scholar]
  30. Ferretti L, Weinreich D, Tajima F, Achaz G. Evolutionary constraints in fitness landscapes. Heredity (Edinb) 2018;121:466–481. doi: 10.1038/s41437-018-0110-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Follis KE, York J, Nunberg JH. Furin cleavage of the SARS coronavirus spike glycoprotein enhances cell-cell fusion but does not affect virion entry. Virology. 2006;350:358–369. doi: 10.1016/j.virol.2006.02.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Fragata I, Blanckaert A, Louro MAD, Liberles DA, Bank C. Evolution in the light of fitness landscape theory. Trends Ecol Evol. 2019;34:69–82. doi: 10.1016/j.tree.2018.10.009. [DOI] [PubMed] [Google Scholar]
  33. Franceschi VB, Caldana GD, Perin C, Horn A, Peter C, Cybis GB, Ferrareze PAG, Rotta LN, Cadegiani FA, Zimerman RA, et al. Predominance of the SARS-CoV-2 lineage P.1 and its sublineage P.1.2 in patients from the metropolitan region of Porto Alegre, Southern Brazil in March 2021: A phylogenomic analysis. medRxiv. 2021 doi: 10.1101/2021.05.18.21257420. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Garcia-Beltran WF, Lam EC, Denis KSt, Nitido AD, Garcia ZH, Hauser BM, Feldman J, Pavlovic MN, Gregory DJ, Poznansky MC, et al. Multiple SARS-CoV-2 variants escape neutralization by vaccine-induced humoral immunity. Cell. (9) 2021;184:2372–2383. doi: 10.1016/j.cell.2021.03.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Graham MS, Sudre CH, May A, Antonelli M, Murray B, Varsavsky T, Kläser K, Canas LS, Molteni E, Modat M, et al. Changes in symptomatology, reinfection, and transmissibility associated with the SARS-CoV-2 variant B.1.1.7: An ecological study. Lancet Public Health. 2021;6:e335-e345. doi: 10.1016/S2468-2667(21)00055-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Graham RL, Sparks JS, Eckerle LD, Sims AC, Denison MR. SARS coronavirus replicase proteins in pathogenesis. Virus Res. 2008;133:88–100. doi: 10.1016/j.virusres.2007.02.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Greaney AJ, Loes AN, Crawford KHD, Starr TN, Malone KD, Chu HY, Bloom JD. Comprehensive mapping of mutations in the SARS-CoV-2 receptor-binding domain that affect recognition by polyclonal human plasma antibodies. Cell Host Microbe. (6) 2021;29:463–476. doi: 10.1016/j.chom.2021.02.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Hashizume M, Gonzalez G, Ono C, Takashima A, Iwasaki M. Population-specific ACE2 single-nucleotide polymorphisms have limited impact on SARS-CoV-2 Infectivity In Vitro. Viruses. 2021;13:67. doi: 10.3390/v13010067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Hedman HD, Krawczyk E, Helmy YA, Zhang L, Varga C. Host diversity and potential transmission pathways of SARSCoV-2 at the human-animal interface. Pathogens. 2021;10:180. doi: 10.3390/pathogens10020180. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Holmes EC. Error thresholds and the constraints to RNA virus evolution. Trends Microbiol. 2003;11:543–546. doi: 10.1016/j.tim.2003.10.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Holmes EC, Rambaut A. Viral evolution and the emergence of SARS coronavirus. Philos Trans R Soc Lond B Biol Sci. 2004;359:1059–1065. doi: 10.1098/rstb.2004.1478. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Holmes EC, Goldstein SA, Rasmussen AL, Robertson DL, Crits-Christoph A, Wertheim JO, Anthony SJ, Barclay WS, Boni MF, Doherty PC, et al. The origins of SARS-CoV-2: A critical review. Cell. 2021;184:4848–4856. doi: 10.1016/j.cell.2021.08.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Jassat W Mudara C, Ozougwu L Tempia S, Blumberg L Davies MA, Pillay Y Carter T, Morewane R Wolmarans, et al. Increased mortality among individuals hospitalised with COVID-19 during the second wave in South Africa. medRxiv. 2021 doi: 10.1101/2021.03.09.21253184. [DOI] [Google Scholar]
  44. Ji W, Wang W, Zhao X, Zai J, Li X. Cross-species transmission of the newly identified coronavirus 2019-nCoV. J Med Virol. 2020;92:433–440. doi: 10.1002/jmv.25682. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Johansson MA, Quandelacy TM, Kada S, Prasad PV, Steele M, Brooks JT, Slayton R B, Biggerstaff M, Butler JC. SARS-CoV-2 Transmission From People Without COVID-19 Symptoms. JAMA Netw Open. 2021;4:e2035057. doi: 10.1001/jamanetworkopen.2020.35057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Kalyaanamoorthy S, Minh BQ, Wong TKF, von Haeseler A, Jermiin LS. ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat Methods. 2017;14:587–589. doi: 10.1038/nmeth.4285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Kan B, Wang M, Jing H, Xu H, Jiang X, Yan M, Liang W, Zheng H, Wan K, Liu Q, et al. Molecular evolution analysis and geographic investigation of severe acute respiratory syndrome coronavirus-like virus in palm civets at an animal market and on farms. J Virol. 2005;79:11892–11900. doi: 10.1128/JVI.79.18.11892-11900.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Katoh K, Rozewicki J, Yamada KD. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform. 2019;20:1160–1166. doi: 10.1093/bib/bbx108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Koçhan N, Eskier D, Suner A, Karakülah G, Oktay Y. Different selection dynamics of S and RdRp between SARS-CoV-2 genomes with and without the dominant mutations. Infect Genet Evol. 2021;91:104796. doi: 10.1016/j.meegid.2021.104796. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Kosakovsky SLP, Frost SDW. Not so different after all: A comparison of methods for detecting amino acid sites under selection. Mol Biol Evol. 2005;22:1208–1222. doi: 10.1093/molbev/msi105. [DOI] [PubMed] [Google Scholar]
  51. Kosakovsky SLP, Poon AFY, Velazquez R, Weaver S, Hepler NL, Murrell B, Shank SD, Magalis BR, Bouvier D, Nekrutenko A, et al. HyPhy 2.5-A customizable platform for evolutionary hypothesis testing using phylogenies. Mol Biol Evol. 2020;37:295–299. doi: 10.1093/molbev/msz197. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Krishnamoorthy P, Raj AS, Roy S, Kumar NS, Kumar H. Comparative transcriptome analysis of SARS-CoV, MERS-CoV, and SARS-CoV-2 to identify potential pathways for drug repurposing. Comput Biol Med. 2021;128:104123. doi: 10.1016/j.compbiomed.2020.104123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Kryazhimskiy S, Plotkin JB. The population genetics of dN/dS. PLoS Genet. 2008;4:e1000304. doi: 10.1371/journal.pgen.1000304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Kumar R, Verma H, Singhvi N, Sood U, Gupta V, Singh M, Sood U, Kumari R, Hira P, Nagar S, et al. Comparative genomic analysis of rapidly evolving SARS-COV-2 reveals mosaic pattern of phylogeographical distribution. mSystem. 2020;5:e00505-20. doi: 10.1128/mSystems.00505-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Lau S-Y, Wang P, Mok BWY, Zhang AJ, Chu H, Lee ACY, Deng S, Chen P, Chan K-H, Song W, et al. Attenuated SARS-CoV-2 variants with deletions at the S1/S2 junction. Emerg Microbes Infect. 2020;9:837–842. doi: 10.1080/22221751.2020.1756700. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Lemey P, Rambaut A, Drummond A J, Suchard MA. Bayesian phylogeography finds its roots. PLoS Comput Biol. 2009;5:e1000520. doi: 10.1371/journal.pcbi.1000520. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Lemmermann N, Lieb B, Laufs T, Renzaho A, Runkel S, Kohnen W, Linke M, Gerber S, Schweiger S, Michel A, et al. SARS-CoV-2 genome surveillance in Mainz, Germany, reveals convergent origin of the N501Y spike mutation in a hospital setting. medRxiv. 2021 doi: 10.1101/2021.02.11.21251324. [DOI] [Google Scholar]
  58. Li R, Qiao S, Zhang G. Analysis of angiotensin-converting enzyme 2 (ACE2) from different species sheds some light on cross-species receptor usage of a novel coronavirus 2019-nCoV. J Infect. 2020;80:469–496. doi: 10.1016/j.jinf.2020.02.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Li X, Giorgi EE, Marichannegowda MH, Foley B, Xiao C, Kong XP, Chen Y, Gnanakaran S, Korber B, Gao F. Emergence of SARS-CoV-2 through recombination and strong purifying selection. Sci Adv. 2020;6:eabb9153. doi: 10.1126/sciadv.abb9153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Liu H, Wei P, Zhang Q, Chen Z, Aviszus K, Downing W, Peterson S, Reynoso L, Downey GP, Frankel SK, et al. 501Y.V2 and 501Y.V3 variants of SARS-CoV-2 lose binding to bamlanivimabin vitro. MAbs. 2021;13:1919285. doi: 10.1080/19420862.2021.1919285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Liu J, Zhang L, Yan Y, Zhou Y, Yin P, Qi J, Wang L, Pan J, You J, Yang J, et al. Excess mortality in Wuhan city and other parts of China during the three months of the covid-19 outbreak: Findings from nationwide mortality registries. BMJ. 2021;372(415) doi: 10.1136/bmj.n415. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Liu Y, Liu J, Xia H, Zhang X, Fontes-Garfias CR, Swanson KA, Cai H, Sarkar R, Chen W, Cutler M, et al. Neutralizing activity of BNT162b2-elicited serum. N Engl J Med. 2021;384:1466–1468. doi: 10.1056/NEJMc2102017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Lytras S, Xia W, Hughes J, Jiang X, Robertson DL. The animal origin of SARS-CoV-2. Science. 2021;373:968–970. doi: 10.1126/science.abh0117. [DOI] [PubMed] [Google Scholar]
  64. Malaiyan J, Arumugam S, Mohan K, Radhakrishnan G. An update on the origin of SARS-CoV-2: Despite closest identity, bat (RaTG13) and pangolin derived coronaviruses varied in the critical binding site and O-linked glycan residues. J Med Virol. 2021;93:499–505. doi: 10.1002/jmv.26261. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Martins AF, Zavascki AP, Wink PL, Volpato FCZ, Monteiro FL, Rosset C, De-Paris F, Ramos ÁK, Barth AL. Detection of SARS-CoV-2 lineage P.1 in patients from a region with exponentially increasing hospitalisation rate, February 2021, Rio Grande do Sul, Southern Brazil. Euro Surveill. 2021;26:2100276. doi: 10.2807/1560-7917.ES.2021.26.12.2100276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Meyerson NR, Sawyer SL. Two-stepping through time: mammals and viruses. Trends Microbiol. 2011;19:286–294. doi: 10.1016/j.tim.2011.03.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, von Haeseler A, Lanfear R. IQ-TREE 2: New models and efficient methods for phylogenetic inference in the genomic era. Mol Biol Evol. 2020;37:1530–1534. doi: 10.1093/molbev/msaa015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Montagutelli X, Prot M, Levillayer L, Salazar EB, Jouvion G, Conquet L, Donati F, Albert M, Gambaro F, Behillil S, et al. The B1.351 and P.1 variants extend SARS-CoV-2 host range to mice. bioRxiv. 2021 doi: 10.1101/2021.03.18.436013. [DOI] [Google Scholar]
  69. Murrell B, Moola S, Mabona A, Weighill T, Sheward D, Kosakovsky Pond SL, Scheffler K. FUBAR: A fast, unconstrained Bayesian approximation for inferring selection. Mol Biol Evol. 2013;30:1196–1205. doi: 10.1093/molbev/mst030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Murrell B, Wertheim JO, Moola S, Weighill T, Scheffler K, Pond SLK. Detecting individual sites subject to episodic diversifying selection. PLoS Genet. 2012;8:e1002764. doi: 10.1371/journal.pgen.1002764. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Narayanan KK, Procko E. Deep mutational scanning of viral glycoproteins and their host receptors. Front Mol Biosci. 2021;8:636660. doi: 10.3389/fmolb.2021.636660. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Nelson G, Buzko O, Spilman P, Niazi K, Rabizadeh S, Soon-Shiong P. Molecular dynamic simulation reveals E484K mutation enhances spike RBD-ACE2 affinity and the combination of E484K, K417N and N501Y mutations (501Y.V2 variant) induces conformational change greater than N501Y mutant alone, potentially resulting in an escap. bioRxiv. 2021 doi: 10.1101/2021.01.13.426558. [DOI] [Google Scholar]
  73. O’Toole Á, Scher E, Underwood A, Jackson B, Hill V, McCrone JT, Colquhoun R, Ruis C, Abu-Dahab K, Taylor B, et al. Assignment of epidemiological lineages in an emerging pandemic using the pangolin tool. Virus Evol. 2021;7:veab064. doi: 10.1093/ve/veab064. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Olival KJ, Hosseini PR, Zambrana-Torrelio C, Ross N, Bogich TL, Daszak P. Host and viral traits predict zoonotic spillover from mammals. Nature. 2017;546:646–650. doi: 10.1038/nature22975. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Oliveira JR, Machado RRG, Arcuri HA, Magawa JY, Daher IP, Urbanski AH, Schmitz GJH, Silva RCV, Durigon EL, Boscardin SB, et al. Immunodominant B cell epitope in a hotspot mutation site and mechanism of immune escape for SARS-CoV-2. medRxiv. 2021 doi: 10.1101/2021.03.11.21253399. [DOI] [Google Scholar]
  76. Østman B, Hintze A, Adami C. Impact of epistasis and pleiotropy on evolutionary adaptation. Proc Biol Sci. 2012;279:247–256. doi: 10.1098/rspb.2011.0870. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Petersen E, Koopmans M, Go U, Hamer DH, Petrosillo N, Castelli F, Storgaard M, Al Khalili S, Simonsen L. Comparing SARS-CoV-2 with SARS-CoV and influenza pandemics. Lancet Infect Dis. 2020;20:e238-e244. doi: 10.1016/S1473-3099(20)30484-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Plante JA, Liu Y, Liu J, Xia H, Johnson BA, Lokugamage KG, Zhang X, Muruato AE, Zou J, Fontes-Garfias CR, et al. Spike mutation D614G alters SARS-CoV-2 fitness. Nature. 2020;592:7852. doi: 10.1038/s41586-020-2895-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Polzin T, VahdatiDaneshmand S. On Steiner trees and minimum spanning trees in hypergraphs. Oper Res Lett. 2003;31:12–20. [Google Scholar]
  80. Rambaut A, Drummond AJ, Xie D, Baele G, Suchard MA. Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Syst Biol. 2018;67:901–904. doi: 10.1093/sysbio/syy032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Ranjan R, Sharma A, Verma MK. Characterization of the second wave of COVID-19 in India. medRxiv. 2021 doi: 10.1101/2021.04.17.21255665. [DOI] [Google Scholar]
  82. Rasmussen AL. On the origins of SARS-CoV-2. Nat Med. 2021;27:9. doi: 10.1038/s41591-020-01205-5. [DOI] [PubMed] [Google Scholar]
  83. Rozas J, Ferrer-Mata A, Sánchez-DelBarrio JC, Guirao-Rico S, Librado P, Ramos-Onsins SE, Sánchez-Gracia A. DnaSP 6: DNA sequence polymorphism analysis of large data sets. Mol Biol Evol. 2017;34:3299–3302. doi: 10.1093/molbev/msx248. [DOI] [PubMed] [Google Scholar]
  84. Sanjuán R, Moya A, Santiago FE. The contribution of epistasis to the architecture of fitness in an RNA virus. Proc Natl Acad Sci U S A. 2004;101:15376–15379. doi: 10.1073/pnas.0404125101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Santiago FE, Patricia AR, Jasna L. The evolution of viruses in multi-host fitness landscapes. Open Virol J. 2009;3:1–6. doi: 10.2174/1874357900903010001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Shamsian N. One small step for man... J Wound Care. 2021;30:91–92. doi: 10.12968/jowc.2021.30.2.91. [DOI] [PubMed] [Google Scholar]
  87. Simmons G, Reeves JD, Rennekamp AJ, Amberg SM, Piefer AJ, Bates P. Characterization of severe acute respiratory syndrome-associated coronavirus (SARS-CoV) spike glycoprotein-mediated viral entry. Proc Natl Acad Sci U S A. 2004;101:4240–4245. doi: 10.1073/pnas.0306446101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Sohpal VK. Comparative study: nonsynonymous and synonymous substitution of SARS-CoV-2, SARS-CoV, and MERS-CoV genome. Genomics Inform. (15) 2021;19 doi: 10.5808/gi.20058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Spielman SJ, Weaver S, Shank SD, Magalis BR, Li M, Pond SLK. In: Evolutionary Genomics. Methods in Molecular Biology. Anisimova M, editor. Vol. 1910. Humana Press; New York, NY: 2019. Evolution of viral genomes: Interplay between selection, recombination, and other forces. [DOI] [PubMed] [Google Scholar]
  90. Starr TN, Thornton JW. Epistasis in protein evolution. Protein Sci. 2016;25:1204–1218. doi: 10.1002/pro.2897. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Tang X, Wu C, Li X, Song Y, Yao X, Wu X, Duan Y, Zhang H, Wang Y, Qian Z, et al. On the origin and continuing evolution of SARS-CoV-2. Natl Sci Rev. 2020;7:1012–1023. doi: 10.1093/nsr/nwaa036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  92. Taylor L. Covid-19: Is Manaus the final nail in the coffin for natural herd immunity? BMJ. 2021;372(394) doi: 10.1136/bmj.n394. [DOI] [PubMed] [Google Scholar]
  93. Tegally H, Wilkinson E, Giovanetti M, Iranzadeh A, Fonseca V, Giandhari J, Doolabh D, Pillay S, San EJ, Msomi N, et al. Emergence and rapid spread of a new severe acute respiratory syndrome-related coronavirus 2 (SARS-CoV-2) lineage with multiple spike mutations in South Africa. medRxiv. 2020 doi: 10.1101/2020.12.21.20248640. [DOI] [Google Scholar]
  94. Usmanova DR, Ferretti L, Povolotskaya IS, Vlasov PK, Kondrashov FA. A model of substitution trajectories in sequence space and long-term protein evolution. Mol Biol Evol. 2015;32:542–554. doi: 10.1093/molbev/msu318. [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Van Valen L. A new evolutionary law. Evol Theor. 1973;1:1–30. [Google Scholar]
  96. Wagner GP, Zhang J. The pleiotropic structure of the genotype-phenotype map: the evolvability of complex organisms. Nat Rev Genet. 2011;12:204–213. doi: 10.1038/nrg2949. [DOI] [PubMed] [Google Scholar]
  97. Wan Y, Shang J, Graham R, Baric RS, Li F. Receptor recognition by the novel coronavirus from Wuhan: An analysis based on decade-long structural studies of SARS coronavirus. J Virol. 2020;94:e00127-20. doi: 10.1128/JVI.00127-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Wang M, Li M, Ren R, Li L, Chen E-Q, Li W, Ying B. International expansion of a Novel SARS-CoV-2 Mutant. J Virol. 2020;94:e00567-20. doi: 10.1128/JVI.00567-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  99. Wang W, Zhao H, Han GZ. Host-virus arms races drive elevated adaptive evolution in viral receptors. J Virol. 2020;94:e00684-20. doi: 10.1128/JVI.00684-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  100. Weinreich DM, Delaney NF, De Pristo MA, Hartl DL. Darwinian evolution can follow only very few mutational paths to fitter proteins. Science. 2006;312:111–114. doi: 10.1126/science.1123539. [DOI] [PubMed] [Google Scholar]
  101. Wertheim JO, Chu DKW, Peiris JSM, Pond SLK, Poon LM. A case for the ancient origin of coronaviruses. J Virol. 2013;87:7039–7045. doi: 10.1128/JVI.03273-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Wibmer CK, Ayres F, Hermanus T, Madzivhandila M, Kgagudi P, Oosthuysen B, Lambson BE, de Oliveira T, Vermeulen M, van der Berg K, et al. SARS-CoV-2 501Y.V2 escapes neutralization by South African COVID-19 donor plasma. Nat Med. 2021;27:622–625. doi: 10.1038/s41591-021-01285-x. [DOI] [PubMed] [Google Scholar]
  103. Wu A, Peng Y, Huang B, Ding X, Wang X, Niu P, Meng J, Zhu Z, Zhang Z, Wang J, et al. Genome composition and divergence of the Novel Coronavirus (2019-nCoV) originating in China. Cell Host Microbe. 2020;27:325–328. doi: 10.1016/j.chom.2020.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Xiao J, Fang M, Chen Q, He B. SARS, MERS and COVID-19 among healthcare workers: A narrative review. J Infect Public Health. 2020;13:843–848. doi: 10.1016/j.jiph.2020.05.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  105. Xu C, Wang Y, Liu C, Zhang C, Han W, Hong X, Wang Y, Hong Q, Wang S, Zhao Q, et al. Conformational dynamics of SARS-CoV-2 trimeric spike glycoprotein in complex with receptor ACE2 revealed by cryo-EM. Sci Adv. 2021;7:eabe5575. doi: 10.1126/sciadv.abe5575. [DOI] [PMC free article] [PubMed] [Google Scholar]
  106. Yan R, Zhang Y, Li Y, Xia L, Guo Y, Zhou Q. Structural basis for the recognition of SARS-CoV-2 by full-length human ACE2. Science. 2020;367:1444–1448. doi: 10.1126/science.abb2762. [DOI] [PMC free article] [PubMed] [Google Scholar]
  107. Yi K, Kim SY, Bleazard T, Kim T, Youk J, Ju YS. Mutational spectrum of SARS-CoV-2 during the global pandemic. Exp Mol Med. 2021;53:1229–1237. doi: 10.1038/s12276-021-00658-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  108. Zaki AM, van Boheemen S, Bestebroer TM, Osterhaus ADME, Fouchier RAM. Isolation of a novel coronavirus from a man with pneumonia in Saudi Arabia. N Engl J Med. 2012;367:1814–1820. doi: 10.1056/NEJMoa1211721. [DOI] [PubMed] [Google Scholar]
  109. Zhao Z, Li H, Wu X, Zhong Y, Zhang K, Zhang YP, Boerwinkle E, Fu YX. Moderate mutation rate in the SARS coronavirus genome and its implications. BMC Evol Biol. 2004;4:21. doi: 10.1186/1471-2148-4-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  110. Zhou P, Yang XL, Wang XG, Hu B, Zhang L, Zhang W, Si HR, Zhu Y, Li B, Huang C-L, et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020;579:270–273. doi: 10.1038/s41586-020-2012-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  111. BBC News Mundo Ómicron: Qué es laepistasis y por qué es la clave para entender quétanpeligrosa es lanueva variante delcoronavirus. 2021. [7 Dec 2021]. BBC News Mundo (2021) Ómicron: Qué es laepistasis y por qué es la clave para entender quétanpeligrosa es lanueva variante delcoronavirus, https://www.bbc.com/mundo/noticias-59554525 .
  112. de Almeida LGP, Lamarca AP, da Silva FJrR, Cavalcante L, Gerber AL, Guimarães AP de C, Terra Machado D, Alves C, Mariani D, Felix Cruz T, et al. GenomicSurveillanceof SARS-CoV-2 in theStateof Rio de Janeiro, Brazil: technical briefing - SARS-CoV-2 coronavirus / nCoV-2019 GenomicEpidemiology - Virological. 2021. [8 Nov 2021]. (2021) GenomicSurveillanceof SARS-CoV-2 in theStateof Rio de Janeiro, Brazil: technical briefing - SARS-CoV-2 coronavirus / nCoV-2019 GenomicEpidemiology - Virological, https://virological.org/t/genomic-surveillance-of-sars-cov-2-in-the-state-of-rio-de-janeiro-brazil-technical-briefing/683 .
  113. ECDC European Centre for Disease Prevention and Control (2021a) COVID-19 situation update worldwide, as of week 30, updated 5 August 2021. 2021. [8 Nov 2021]. European Centre for Disease Prevention and Control (2021a) COVID-19 situation update worldwide, as of week 30, updated 5 August 2021, https://www.ecdc.europa.eu/en/geographical-distribution-2019-ncov-cases .
  114. ECDC European Centre for Disease Prevention and Control (2021b) Risk Assessment: Risk related to the spread of new SARS-CoV-2 variants of concern in the EU/EEA - first update. [8 Nov 2021]. European Centre for Disease Prevention and Control (2021b) Risk Assessment: Risk related to the spread of new SARS-CoV-2 variants of concern in the EU/EEA - first update, https://www.ecdc.europa.eu/en/publications-data/covid-19-risk-assessment-spread-new-variants-concern-eueea-first-update .
  115. Faria NR, Morales Claro I, Candido D, Moyses Franco LA, Andrade PS, Coletti TM, Silva CA, Sales FC, Manuli ER, Aguiar RS, et al. Genomic characterisation of an emergent SARS-CoV-2 lineage in Manaus: Preliminary findings - SARS-CoV-2 coronavirus / nCoV-2019 Genomic Epidemiology - Virological. 2021. [8 Nov 2021]. (2021a) Genomic characterisation of an emergent SARS-CoV-2 lineage in Manaus: Preliminary findings - SARS-CoV-2 coronavirus / nCoV-2019 Genomic Epidemiology - Virological, https://virological.org/t/genomic-characterisation-of-an-emergent-sars-cov-2-lineage-in-manaus-preliminary-findings/586 .
  116. Fluxus Technology Network. v10.2. 2020. [8 Nov 2021]. https://www.fluxus-engineering.com/index.htm .
  117. Naveca F, da Costa C, Nascimento V, Souza V, Corado A, Nascimento F, Costa Á, Duarte D, Silva G, Mejía M, et al. SARS-CoV-2 reinfectionbythe new VariantofConcern (VOC) P.1 in Amazonas, Brazil - SARS-CoV-2 coronavirus/ nCoV-2019 GenomicEpidemiology - Virological. 2021. [8 Nov 2021]. 1 in Amazonas, Brazil - SARS-CoV-2 coronavirus/ nCoV-2019 GenomicEpidemiology - Virological, https://virological.org/t/sars-cov-2-reinfection-by-the-new-variant-of-concern-voc-p-1-in-amazonas-brazil/596 .
  118. NERVTAG NERVTAG paper on COVID-19 variant of concern B.1.1.7 - GOV.UK. 2021. [8 Nov 2021]. https://www.gov.uk/government/publications/nervtag-paper-on-covid-19-variant-of-concern-b117 .
  119. Pond S Natural selection analysis of global SARS-CoV-2/COVID-19 enabled by data from GISAID. 2020. [8 Nov 2021]. Pond S (2020) Natural selection analysis of global SARS-CoV-2/COVID-19 enabled by data from GISAID, https://observablehq.com/@spond/revised-sars-cov-2-analytics-page .
  120. Reuters Delta COVID variant becoming globally dominant, WHO official says. 2021. [8 Nov 2021]. Reuters (2021) Delta COVID variant becoming globally dominant, WHO official says, https://www.reuters.com/world/delta-covid-variant-becoming-globally-dominant-says-who-official-2021-06-18/
  121. World Health Organization Consensus document on the epidemiology of severe acute respiratory syndrome (SARS), Department of Communicable Disease Surveillance and Response. 2003. [8 Nov 2021]. World Health Organization (2003) Consensus document on the epidemiology of severe acute respiratory syndrome (SARS), Department of Communicable Disease Surveillance and Response, https://www.who.int/csr/sars/en/WHOconsensus.pdf .
  122. World Health Organization MERS Situation Update, 2020. [8 Nov 2021]. World Health Organization (2020) MERS Situation Update, https://applications.emro.who.int/docs/WHOEMCSR326E-eng.pdf?ua=1 .
  123. World Health Organization Weekly epidemiological update on COVID-19 - 3 August 2021. 2021. [5 August, 2021]. World Health Organization (2021a) Weekly epidemiological update on COVID-19 - 3 August 2021, https://www.who.int/publications/m/item/weekly-epidemiological-update-on-covid-19---3-august-2021 .
  124. World Health Organization Weekly epidemiological update on COVID-19 - 13 July 2021. 2021. [14 July, 2021]. World Health Organization (2021b) Weekly epidemiological update on COVID-19 - 13 July 2021, https://www.who.int/publications/m/item/weekly-epidemiological-update-on-covid-19---13-july-2021 .
  125. World Health Organization Weekly epidemiological update on COVID-19 - 25 May 2021. 2021. [27 May, 2021]. World Health Organization (2021c) Weekly epidemiological update on COVID-19 - 25 May 2021, https://www.who.int/publications/m/item/weekly-epidemiological-update-on-covid-19---25-may-2021 .
  126. World Health Organization Weekly epidemiological update on COVID-19 - 29 June 2021. 2021. [1 July, 2021]. World Health Organization (2021d) Weekly epidemiological update on COVID-19 - 29 June 2021, https://www.who.int/publications/m/item/weekly-epidemiological-update-on-covid-19---29-june-2021 .
  127. World Health Organization Tracking SARS-CoV-2 variants. 2021. [August 11, 2021]. World Health Organization (2021e) Tracking SARS-CoV-2 variants, https://www.who.int/en/activities/tracking-SARS-CoV-2-variants .
  128. World Health Organization Weekly epidemiological update on COVID-19 - 1 June 2021. 2021. [5 June, 2021]. World Health Organization (2021f) Weekly epidemiological update on COVID-19 - 1 June 2021, https://www.who.int/publications/m/item/weekly-epidemiological-update-on-covid-19---1-june-2021 .
  129. World Health Organization Weekly epidemiological update on COVID-19 - 27 April 2021. 2021. [2 May, 2021]. World Health Organization (2021g) Weekly epidemiological update on COVID-19 - 27 April 2021, https://www.who.int/publications/m/item/weekly-epidemiological-update-on-covid-19---27-april-2021 .
  130. World Health Organization Weekly epidemiological update on COVID-19 - 7 December 2021. 2021. [8 December, 2021]. World Health Organization (2021h) Weekly epidemiological update on COVID-19 - 7 December 2021, https://www.who.int/publications/m/item/weekly-epidemiological-update-on-covid-19---7-december-2021 .

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Table S1 - . Results for positive selection tests (FUBAR, MEME and FEL).
Table S2 -. Results for negative selection tests (FUBAR and FEL).
Table S3 - . P.1.1 and P.1.2 sequences in genes/ORFs with sites under positive selection.
Figure S1 -. Networks: a) Spike; b) ORF3; c) ORF1a; d) ORF1b.
Figure S2 - . Geographical dispersion of lineages P.1.1 (A) and P.1.2(B).

Articles from Genetics and Molecular Biology are provided here courtesy of Sociedade Brasileira de Genética

RESOURCES