Abstract
Uruguay controlled the viral dissemination during the first nine months of the SARS-CoV-2 pandemic. Unfortunately, towards the end of 2020, the number of daily new cases exponentially increased. Herein, we analyzed the country-wide genetic diversity of SARS-CoV-2 between November 2020 and April 2021. We identified that the most prevalent viral variant during the first epidemic wave in Uruguay (December 2020–February 2021) was a B.1.1.28 sublineage carrying Spike mutations Q675H + Q677H, now designated as P.6, followed by lineages P.2 and P.7. P.6 probably arose around November 2020, in Montevideo, Uruguay’s capital department, and rapidly spread to other departments, with evidence of further local transmission clusters; it also spread sporadically to the USA and Spain. The more efficient dissemination of lineage P.6 with respect to P.2 and P.7 and the presence of mutations (Q675H and Q677H) in the proximity of the key cleavage site at the S1/S2 boundary suggest that P.6 may be more transmissible than other lineages co-circulating in Uruguay. Although P.6 was replaced by the variant of concern (VOC) P.1 as the predominant lineage in Uruguay since April 2021, the monitoring of the concurrent emergence of Q675H + Q677H in VOCs should be of worldwide interest.
Keywords: SARS-CoV-2, B.1.1.28, Spike, Q675H, Q677H, Uruguay, phylogeography, phylogenetics
1. Introduction
By the end of 2020 and the beginning of 2021, several studies reported the emergence of novel SARS-CoV-2 variants of interest (VOIs) and concern (VOCs) with different missense mutations and deletions in the Spike (S) protein that impact viral transmissibility and escape from previous host’s immune responses, among other features [1,2]. In Brazil, the SARS-CoV-2 lineages B.1.1.28 and B.1.1.33 dominated the first epidemic wave [3,4], but were replaced by VOC P.1 (WHO name: Gamma) and P.2 (former VOI Zeta), both descendants of lineage B.1.1.28, by the end of 2020 and beginning of 2021 [5]. So far, five B.1.1.28 descendant sublineages carrying mutations of concern have emerged. The VOC P.1, which harbors the mutations of concern S:K417T/E484K/N501Y among its lineage defining mutations [6], originated in the Amazonas state in mid-November [7,8] and rapidly spread across Brazil and to over 50 countries globally [9]. The lineages P.2, P.4, and P.5, carrying the concerning amino acid changes S:E484K, S:L452R, and S:E484Q/N501T, respectively, were also initially detected in samples from Brazil [10,11,12,13,14,15]. The lineage P.3 (former VOI Theta) emerged in the Philippines, and it includes substitutions S:E484K/N501Y/P681H among the lineage-defining mutations; the first sample was collected on 8 January 2021, and later it further spread to the USA, Germany, and Malaysia, among other countries [16]. One additional B.1.1.28 descendant clade that emerged in southern Brazil carrying mutation N:P13L was recently defined as a new Pango P.7 lineage [17,18].
Uruguay was able to control the early viral dissemination during the first nine months of the SARS-CoV-2 pandemic by implementing a successful Test, Trace, and Isolation strategy (TETRIS). The low number of total cases, contained outbreaks, and few deaths were characteristic for this first period [19,20]. At the beginning, viral diversity was high, with cocirculation of strains A.2, A.5, B.1, B.1.195, and B.31, introduced mostly through Montevideo, Uruguay’s capital city and connection hub through its international airport and harbor [21]. Later, multiple introductions of SARS-CoV-2 lineages B.1.1.28 and B.1.1.33 of Brazilian origin were detected in Uruguay, mainly along the 1068 km long Uruguayan–Brazilian dry border, and these lineages became predominant between May and July 2020 [22]. Towards the end of 2020, the number of active cases exponentially increased, from an average of 60 cases per day during October and November to more than 400 during December [19], concomitant with the loss of the TETRIS safety zone [23,24]. SARS-CoV-2 positive daily new cases decreased around mid-February after more stringent mobility measures were taken by the government [20,25]; but the total number of cases stayed outside the TETRIS zone and a second exponential growth period started in March 2021, coinciding with the introduction and dissemination of VOC P.1 [26]. Summer-related social gatherings and relaxed social distancing are some of the proposed reasons to explain the first epidemic wave in Uruguay, but there is currently a gap in knowledge concerning the potential influence of virological factors.
To understand the SARS-CoV-2 diversity associated with the first COVID-19 epidemic wave in Uruguay, we conducted a retrospective epidemiological and genomic analysis of SARS-CoV-2 complete genomes from COVID-19 patients diagnosed between November 2020 and April 2021. Our study revealed that a novel B.1.1.28 clade harboring two nonsynonymous changes in the Spike protein: Q675H and Q677H, now designated as lineage P.6 [27], was the most prevalent SARS-CoV-2 variant by the end of 2020 and beginning of 2021. Lineages P.2 and P.7 were also detected at lower prevalence during the first epidemic wave in Uruguay. The Q675H and Q677H mutations are in the proximity of the polybasic cleavage site at the S1/S2 boundary, a region of biological relevance for virus replication [28], and also arose independently in many other SARS-CoV-2 VOIs circulating worldwide. These findings suggest that local emergence and spread of a more transmissible P.6 variant might have had a non-negligible role in the first epidemic wave of COVID-19 in Uruguay.
2. Materials and Methods
2.1. Ethics Statement
This work was done by the Inter-Institutional Working Group (IiWG) for SARS-CoV-2 genomic surveillance in Uruguay, which involves a diagnostic network, expertise and resources to handle large-scale sequencing, computational scientists for genomic analysis, and an affordable and decentralized “in-house” qPCR test designed to detect known VOCs [26]. Residual deidentified RNA samples from SARS-CoV-2 positive patients were remitted to the Institut Pasteur de Montevideo (IPMon). IPMon was validated by the Ministry of Health of Uruguay as an approved center providing diagnostic testing for COVID-19. All samples were deidentified before receipt by the study investigators. All relevant ethical guidelines were appropriately followed. Additionally, the project was approved by the Ethics Committee of the Sanatorio Americano SASA (Uruguay) on the 29 April 2021. Ethical approval was given, and signed informed consent was obtained from the participants.
2.2. SARS-CoV-2 Samples
In total, 260 SARS-CoV-2 RNA samples (Table S1) were recovered from nasopharyngeal–throat combined swabs collected from clinically ill or asymptomatic individuals that resided in different Uruguayan departments and were diagnosed from November 2020 to April 2021 in Uruguay. As the IiWG began working in March 2021, the availability of earlier samples was conditional to what was kept at any laboratory of the IiWG diagnostic network. Positive RNA samples were reverse transcribed using SuperScript™ II Reverse Transcriptase (Thermo Fisher Scientific Inc., Waltham, MA, USA) or the LunaScript® RT SuperMix Kit (New England Biolabs, Ipswich, MA, USA). A negative control was included at this point and carried throughout the protocol.
2.3. Genome Sequencing
Sequencing libraries were prepared according to the classic ARTIC protocol described by Quick J. [29,30], the 2000 bp long amplicon version described by Resende P.C. [31], or the sequencing protocol using a 1200 bp amplicon “midnight” primer set, with the Nanopore Rapid kit as described by Freed N. and Silander O. [32,33] (Table S1). The final library was eluted in EB buffer (ONT) and quantified using a fluorometric assay. Recommended amounts of library were loaded into a FLO-MIN106D R9.4.1 flowcell and sequenced on the MinION Mk1C or GridION X5 sequencing platforms (ONT). Basecalling and demultiplexing were performed with Guppy 4.3.2 or higher [34] using the high or super accuracy mode. Consensus genomes were generated using the poreCov pipeline 0.7.0 or higher [35,36,37,38,39,40,41,42,43,44], and Nanopolish was used for consensus generation. Complete sequences with up to 15% of Ns were kept for further analysis. All genomes obtained in this study were uploaded to the EpiCoV database in the GISAID initiative under the accession numbers specified in Table S1.
2.4. SARS-CoV-2 Lineage Assignment
SARS-CoV-2 full-length consensus sequences were manually curated in specific genome positions, such as clade-defining mutations. Genotyping was performed according to Rambaut et al. [45] using the Pangolin application [46,47], and later confirmed using maximum likelihood (ML) analysis.
2.5. Phylogenetic and Phylogeographic Analysis of B.1.1.28
Uruguayan B.1.1.28 sequences (n = 212; samples assigned as lineage B.1.1.28, P.6, or P.7 in Table S1) were next analyzed in the context of additional B.1.1.28 sequences from Uruguay and Brazil, downloaded from the EpiCoV database of the GISAID initiative [48] (Table S3 showing EpiCoV/GISAID acknowledgments). Downloaded B.1.1.28 sequences from Uruguay (n = 143) were complete, with full collection date information and sampled before 31 May 2021. Sequences from Brazil (n = 1428) were complete and high quality with full collection date information, and were sampled before 31 May 2021 (Table S2). Additionally, we downloaded four B.1.1.28 from the USA (n = 2), Spain (n = 1), and Belgium (n = 1) that also harbored both S:Q675H and S:Q677H mutations (Tables S2 and S3). Alignment was performed with MAFFT v7.471 [49]. Maximum likelihood phylogenetic analysis of the 1787 B.1.1.28 sequences was performed with IQ-TREE version 1.6.12 under the model GTR + F + R3 of nucleotide substitution selected by the built-in ModelFinder option [50]. Branch support was assessed by the approximate likelihood-ratio test based on a Shimodaira–Hasegawa-like procedure (SH-aLRT) with 1000 replicates [51]. The tree root was established with the sequence EPI_ISL_416036 with the earliest collection date of 5 March 2020. This tree was time-scaled using TreeTime 0.8.3.1 [52], applying a fixed clock rate of 8 × 10−4 substitutions/site/year [53,54], and keeping polytomies. The time-scaled tree was then employed for the ancestral character state reconstruction (ACR) of epidemic locations with PastML v.1.9.15 [55], using the marginal posterior probabilities approximation (MPPA) method with an F81-like model. Brazilian sequences were grouped according to the region: South, Southeast, Central West, North, and Northeast. A time-scaled Bayesian phylogeographic analysis was next performed to infer the geographical source and dissemination pattern of the Uruguayan B.1.1.28 + Q675H + Q677H (now P.6) samples, and to estimate the time of their most recent common ancestors (TMRCA). Phylogenetic trees were estimated in BEAST v1.10 [56] using the GTR + F + I nucleotide substitution model, the nonparametric Bayesian skyline model as the coalescent tree prior [57], a strict molecular clock model with a uniform substitution rate prior (8–10 × 10−4 substitutions/site/year), and a reversible discrete phylogeographic model (using Uruguayan departments as epidemic locations) [58] with a continuous-time Markov chain (CTMC) rate reference prior [59]. MCMC chains were run for 100 million generations, and convergence (effective sample size > 200) in parameter estimates was assessed using Tracer v1.7 [60]. The maximum clade credibility (MCC) tree was summarized with TreeAnnotator v1.10 [61] and visualized using FigTree v1.4.4 [62]. Additional visualizations were implemented in the R environment with treeio 1.16.2 and ggtree 3.0.4 Bioconductor packages [63].
2.6. Phylogenetic and Phylogeographic Analysis of P.2
Uruguayan P.2 sequences (n = 79) were analyzed in the context of 1272 additional Brazilian P.2 sequences downloaded from EpiCoV/GISAID; sequences were complete and high quality with full collection date information, and were sampled before 31 May 2021 (Tables S2 and S3). Five Brazilian sequences with a collection date before September 2020 were discarded (the collection date might have been incorrect). Alignment and ML phylogenetic analysis were performed as above. The tree root was established with the sequence EPI_ISL_2344425 with the earliest collection date of 1 September 2020. The tree was time-scaled using TreeTime [52] and the ACR of epidemic locations inferred with PastML [55], as before. Brazilian locations were grouped according to the five Brazilian regions mentioned above. The resulting tree with inferred locations was visualized using FigTree [62].
2.7. Lineage Prevalence of Available Uruguayan Samples
To assess the prevalence of B.1.1.28 + Q675H + Q677H clade in the context of the different SARS-CoV-2 lineages circulating in Uruguay, we used the lineage assignments obtained for the 260 samples presented in this study, 335 additional samples already available at EpiCoV/GISAID, and 342 records available at the IiWG domain. As only one B.1.1.28 (not carrying Q675H + Q677H mutations) was identified in May 2021, we kept our analysis from November 2020 to April 2021. The dataset comprised a total of 937 SARS-CoV-2 genotypes, and prevalence (relative frequency) of each lineage was calculated monthly, from November to April. The “Others” category included B.1, B.1.1, B.1.1.1, B.1.1.34, B.1.177, B.1.177.12, and B.1.238.
In the case of geographic distribution of P.6 and P.2 variants (for the Uruguayan maps), the number of cases per department were obtained using samples from this study and also available at the IiWG domain, from November 2020 to April 2021 (n = 174 for P.6 and n = 70 for P.2).
2.8. Determination of Prevalence of Q675H + Q677H
To assess the prevalence of co-occurring S:Q675H and S:Q677H in worldwide SARS-CoV-2 genomes, we downloaded from EpiCoV/GISAID (accessed on 7 July 2021) 129 complete genomes, with high quality and full collection date information. We removed redundant samples with the ones included in this study, obtaining a final dataset of 259 sequences (Tables S2 and S3).
2.9. Structural Representation of the Spike Protein
The molecular model of SARS-CoV-2 Spike glycoprotein was taken from the D. E. Shaw Research database (DESRES-ANTON-11021566) [64]. The visual rendering was done with VMD 1.9.3 [65].
3. Results
At the IiWG, we sequenced 663 SARS-CoV-2 positive samples detected in Uruguay between November 2020 and April 2021 (Figure S1A and Table S1), which were classified in the following lineages: 333 (50%) P.1, 180 (27%) B.1.1.28, 70 (11%) P.2, 32 (5%) P.7, and 39 (6%) other B.1-derived lineages. The mutational profile of B.1.1.28 sequences identified 174 (26%) genomes carrying amino acid changes S:Q675H and S:Q677H that compose a new Pango lineage designated as P.6 [27,45]. Lineage P.6 was widely spread throughout the country, being detected in 12 out of 19 Uruguayan departments from 2 December 2020 to 26 April 2021; but was particularly prevalent in Montevideo, the capital department of Uruguay, and the surrounding metropolitan area (Canelones) (Figure 1A,B, right panel). Lineages P.2 and P.7, by contrast, were more frequently detected outside the metropolitan region (Figure 1B, right panel). Analysis of these 663 SARS-CoV-2 Uruguayan sequences determined by the IiWG, plus additional Uruguayan sequences obtained at the EpiCoV database in GISAID [48], revealed a changing molecular epidemiological pattern over time (Figure 1B, to the left). Lineage P.7 was the most prevalent variant (55%) detected from late November to late December 2020, but was rapidly replaced by lineage P.6, the relative frequency of which increased from 26% in December 2020 to 76% in February 2021. Lineage P.2 was first detected in Uruguay in December 2020 and reached a relatively high prevalence during the summer season, with a maximum frequency of 36% in January 2021. As previously described [26], the VOC P.1 was first detected in Uruguay in February 2021 and then quickly outcompeted the other SARS-CoV-2 variants circulating in the country, becoming the predominant lineage in April 2021.
To identify the probable geographic source of B.1.1.28, P.6, and P.7 lineages detected in Uruguay, all Uruguayan sequences here obtained (n = 212, Table S1) were combined with complete genome sequences of those lineages available at the EpiCoV database in GISAID sampled in Uruguay (n = 143) and Brazil (n = 1428), and with all B.1.1.28 sequences sampled worldwide that carried mutations Q675H and Q677H (USA = 2, Spain = 1 and Belgium = 1; Table S2). The ML phylogeographic analysis supported at least 20 independent introductions of this lineage B.1.1.28 (n = 14) and P.7 (n = 6) from Brazil into Uruguay that mostly resulted in singletons, dyads, or small clades (n < 5 sequences), with no evidence of extensive dissemination in Uruguay (Figure 1C and Figure S2). One B.1.1.28 introduction, however, was successfully established and originated the lineage P.6 (SH-aLRT = 98) that comprised all Uruguayan sequences (n = 314), as well as three sequences collected in the USA and Spain carrying mutations S:Q675H + Q677H. According to our analysis, the lineage P.6 was most likely introduced from the southeastern Brazilian region (ACR-location marginal probability (LMP) = 0.99) and was disseminated from Uruguay to the USA (two independent times, ACR-LMP ≥ 0.99) and Spain (ACR-LMP = 0.99) (Figure 1C and Figure S3). Of note, the B.1.1.28 + Q675H + Q677H collected in Belgium did not belong to clade P.6, indicating an independent and recurrent appearance of both mutations (see below; Figure 1C). We also identified one successful introduction of lineage P.7, probably from Southern Brazil (ACR-LMP = 0.99), that originated the highly supported (SH-aLRT = 100) Uruguayan clade designated as UYP.7 composed by 27 Uruguayan sequences plus eight sequences from Southern Brazil (Figure 1C and Figure S3). We also performed a ML phylogeographic analysis of lineage P.2 sequences sampled in Uruguay (n = 79) and Brazil (n = 1267), and identified three successful introductions, most likely from Southern Brazil (ACR-LMP > 0.85), that originated the highly supported (SH-aLRT > 0.92) Uruguayan clades UY-IP.2 (n = 35), UY-IIP.2 (n = 30), and UY-IIIP.2 (n = 12) (Figure S4). It was noteworthy that most sequences branching within Uruguayan clades UYP.7 (89%), UY-IP.2 (100%) and UY-IIP.2 (83%) were sampled outside the metropolitan region.
To better understand the origin and spread of lineage P.6, we performed a Bayesian phylogeographic analysis of all P.6 Uruguayan sequences produced in this work that had a geographic source available (n = 165), and six Brazilian basal sequences. The spatiotemporal reconstruction suggested that an ancestral B.1.1.28 virus was probably introduced into Uruguay from Brazil around October 2020, and after a short period of local evolution, the lineage P.6 ancestor arose in Montevideo (posterior state probability (PSP) = 0.94) around 9 November 2020 (95% HPD: 20 October–26 November) (Figure 2A). Lineage P.6 was next disseminated from Montevideo to the surrounding metropolitan area and also to more distant Uruguayan departments. The TMRCA of major transmission clusters outside the metropolitan region was traced to 23 December 2020 (95% HPD: 12 December 2020 to 30 December 2020) in Rocha and 30 December 2020 (95% HPD: 19 December 2020 to 9 January 2021) in Salto (Figure 2A,B). The introduction and dispersion of lineage P.6 in each department coincided with the increase in new COVID-19 cases reported daily (Figure 2A). Given that VOI P.2 was also cocirculating, we reinforce that there was a low spatiotemporal overlap of P.6 and P.2 in Montevideo and Salto, though this was not the case for Rocha (Figure S1C). The lineage P.6 was characterized by eight lineage-defining genetic changes in addition to S:Q675H and S:Q677H, including a total of five nonsynonymous mutations (Figure 2B). Of note, eight out of 10 lineage P.6-defining mutations (including S:Q675H) were also identified in a basal B.1.1.28 sequence sampled in Rio de Janeiro in January 2021, and were thus probably present in the ancestral virus that arrived from Brazil; while the remaining two mutations (ORF1ab: C8980T and S:Q677H) were fixed during the early local transmission in Uruguay. We also identified one additional mutation (ORF3a:M260I) that was fixed at a later step during evolution of P.6 in Uruguay (Figure 2B).
Structural analysis of the SARS-CoV-2 Spike glycoprotein showed that residues Q675H and Q677H were within the subdomain SD2 of each protomer constituting the homotrimer (Figure 2C). Particularly, they were located at the beginning of a very flexible loop (residues 675–690) [66], which embraced the solvent-accessible polybasic furin cleavage site [66]. These mutations were close to two experimentally observed O-glycosylation sites at T676 and T678 [67,68], and at the same domain of the D614G mutation [69] (Figure 2C, inset). Substitution S:Q677H has been reported as a recurrent mutation arising independently in many SARS-CoV-2 lineages, including several VOIs, circulating worldwide by the end of 2020 [70,71]. A search in the EpiCoV database (accessed on 7 July 2021) for high-quality SARS-CoV-2 genomes carrying both mutations S:Q675H and S:Q677H recovered 85 non-Uruguayan sequences. Overall, the pair S:Q675H + Q677H appeared to be distributed in 12 different countries (in decreasing frequency order: Uruguay, England, USA, Belgium, India, Australia, Switzerland, Spain, Netherlands, Japan, Germany, and France) and in 13 different Pango lineages (in decreasing frequency order: P.6, B.1.36, B.1.2, C.36, B.1.538, B.1.1.316, B.1.526 (VOI Iota), B.1.525 (VOI Eta), B.1.243, B.1.1.70, B.1.1.7 (VOC Alpha), B.1.1.63, and B.1) (Figure S5).
4. Discussion
Since March 2020, Uruguay had been successful at keeping the COVID-19 pandemic in check. Closed international borders and an aggressive contact-tracing system, among other government measures, were able to avoid virus transmission growing exponentially [19,20]. Brazil has been a COVID-19 hotspot in South America, and the 1068 km long Uruguayan–Brazilian dry border allowed the rapid local establishment of SARS-CoV-2 Brazilian lineages B.1.1.28 and B.1.1.33, initially associated with a few outbreaks that occurred in the departments bordering Brazil [22]. However, by the end of 2020, the pandemics worsened, with a clear increase of daily cases in December that extended until mid-February 2021. Summer-related social gatherings and relaxed social-distancing measures were some of the proposed reasons to explain the epidemic growth [20]. In this study, we described a new B.1.1.28 sublineage, designated P.6, that probably arose in Montevideo by November 2020 and spread throughout the country. Lineage P.6 comprised most (60%) Uruguayan virus genomes recovered between November 2020 and February 2021, and its spread coincided with national and local (as shown for Montevideo, Rocha, and Salto departments) increases in daily SARS-CoV-2 cases during the first epidemic wave.
Dispersion of lineage P.6 in Uruguay could have been fueled by changes in human behavior coinciding with the end of the austral spring and the ambiance of relaxed restrictions, as was demonstrated for variant 20E(EU1), which emerged in Spain and spread through Europe in the boreal summer of 2020 [72]. This hypothesis, however, failed to explain why lineage P.6 outcompeted both lineage P.7, which was the most prevalent variant in Uruguay in late 2020, and lineage P.2, which became the dominant SARS-CoV-2 strain in many Brazilian states by the end of 2020 [5,13]. An alternative hypothesis is that lineage P.6 was successfully spread in Uruguay because it was initially established in the capital city of Montevideo, which comprises nearly half of Uruguayan inhabitants and is strongly connected with all other departments. By contrast, lineages P.2 and P.7 were probably initially established outside the metropolitan region, and this may have reduced their chance of spreading at the country level. Finally, we hypothesized that the combined presence of amino acid changes S:Q675H + Q677H might have also produced a more transmissible P.6 variant, contributing to the rapid increase in the lineage dominance observed between December 2020 and February 2021.
We are not aware of any experimental assay that assessed the effect of mutations S:Q675H + Q677H on the viral fitness, but independent data indicated that these amino acid changes (either one or both of them) might facilitate viral transmissibility. These mutations are in close proximity to the polybasic cleavage site at the S1/S2 boundary that can be processed by furin and other proteases like TMPRSS2, mediating efficient entry into cells and increasing human-to-human transmission [73,74,75]. Mutations Q675H and Q677H might alter the properties of this nearby protease-cleavage site through changes in the structure conformation, glycosylation, and/or phosphorylation processes already known to have a role in cleavage regulation [73]. Recently, the promotion of syncytium formation and virus infectivity has been shown for the D614G mutation, which is known to impact structural and thermodynamic aspects of the Spike [69,76,77,78,79,80] and to enhance the protease cleavage, likely by allosterically increasing the binding to furin [81,82,83]. Knowing that histidine residues function as pH sensors in other viruses [84], Q675H and Q677H mutations might also provide some synergic structural changes in the dynamics of the subdomain SD2, enhancing the effects of mutation D614G.
Convergent evolution is a hallmark of positive selection, and we identified the independent appearance of both S:Q675H and S:Q677H in 12 additional SARS-CoV-2 lineages. Moreover, mutations close to or at the polybasic cleavage site at the S1/S2 boundary have been reported in several VOCs and VOIs, including: Alpha (S:P681H), Beta (A701V), Delta (P681R), Eta (Q677H), Iota (A701V), Kappa (P681R), and Theta (P.3, P681H). These findings suggest that the S1/S2 boundary is a region particularly relevant for selection of mutations that resulted in more transmissible SARS-CoV-2 variants. Consistent with this notion, a recent study that used a reverse genetic system and primary human airway cultures identified mutation S:P681R as a significant determinant for enhanced viral replication fitness of the VOC Delta, and supported that Spike mutations that potentially affect furin cleavage efficiency must be closely monitored for future variant surveillance [85]. Notably, another study that developed an innovative model on epidemiological variables integrating the effect of Spike amino acid changes in viral fitness forecasted that mutations Q675H and Q677H could appear in emerging SARS-CoV-2 VOCs in the following months [86]. These observations underscore the importance of future experimental studies to assess the functional impact of Spike mutations Q675H and Q677H on virus infectivity and transmissibility.
In summary, this study described the emergence and local spread of lineage P.6, a new B.1.1.28-derived lineage carrying Spike mutations Q675H + Q677H, in Uruguay that coincided with the first exponential growth phase of the country’s COVID-19 epidemic, which started by November 2020 and lasted until mid-February 2021. P.6 was the second recognized B.1.1.28-descendent lineage, together with lineage P.3, that emerged outside Brazil. An ancestral B.1.1.28 virus carrying mutation Q675H was probably introduced from southeastern Brazil into Montevideo, Uruguay’s capital city, and by November 2020 the virus already fixed mutation Q677H and spread across the entire country, originating lineage P.6. We propose that simultaneous presence of Spike mutations Q675H and Q677H might confer to lineage P.6 a higher infectivity and increased transmissibility, which, combined with the establishment in the populated metropolitan region, contributed to its swift dissemination in Uruguay. Although the lineage P.6 was substituted by the VOC P.1 as the most prevalent lineage in Uruguay since April 2021, the concurrent emergence of Spike mutations Q675H and Q677H in VOIs and/or VOCs circulating worldwide should be closely monitored.
Acknowledgments
The authors wish to thank all the health care workers and scientists who have worked hard to deal with this pandemic threat, the GISAID team, and all the EpiCoV database’s submitters (the GISAID acknowledgments table containing the sequences used in this study is shown in Table S3). We thank Christian Brandt from the Institute for Infectious Diseases and Infection Control (Jena University Hospital) for adapting their poreCov Nextflow pipeline to our requirements and quickly fixing reported bugs. We thank Maria Jose Benitez-Galeano for fruitful discussions. We thank the Zurich-Santander Foundation for their support.
Supplementary Materials
The following are available online at https://www.mdpi.com/article/10.3390/v13091801/s1. Figure S1. (A) Monthly COVID-19 cases reported in Uruguay from November 2020 to April 2021 (y-axis to the left). In pink, total number of sequences generated in this study. In blue, total number of sequences generated by our group, including several not yet publicly available (y-axis to the right). (B) Uruguayan map with the number of P.2 sequences generated by the IiGW as indicated in the legend. Some department names are written accordingly. (C) Uruguayan map showing departments with simultaneous incidence of P.6 and P.2 based on data generated by the IiGW. The barplots show the absolute frequency of P.6 and P.2 monthly, from December 2020 to April 2021, in a few selected departments (Salto, Montevideo, and Rocha). Figure S2. Maximum likelihood tree of 1783 B.1.1.28 sequences found in Brazil and Uruguay. Additionally, four sequences from the USA, Spain, and Belgium were included. The root of the tree was established using the earliest sequence from Brazil, with a collection date of 5 March 2020. The Uruguayan P.6 clade and Brazilian basal sequences are highlighted in purple and pink, respectively. In addition, taxon names for samples collected in the USA and Spain are shown in blue. Uruguayan P.7 main clade and additional sequences are shown in green; Brazilian P.7 clade is shown in light green. The remaining Uruguayan sequences through the ML tree are indicated by a blue taxon name, and supported clades are highlighted in blue. Figure S3. Schematic representation of migration events during dissemination of SARS-CoV-2 lineage B.1.1.28 between Uruguay and Brazil (including recently assigned P.6 and P.7 samples). The migration events were inferred by ancestral character reconstruction obtained through a maximum likelihood method implemented in PastML. Each node in the network is identified by location and number of sequences within different phylogenetic subclusters. Arrows indicate migration events deduced from location state changes across the B.1.1.28 ML tree. The shade of gray identifies marginal probabilities, and the numbers quantify the migration events connecting respective locations (no numbers represent one single event). Nodes are colored according to their location. BR: Brazil. Figure S4. Maximum likelihood phylogeographic analysis of lineage P.2 samples (n = 1346) from Uruguay (n = 79) and Brazil (n = 1267) inferred by ancestral character reconstruction method implemented in PastML. Tips and branches are colored according to sampling location and the most probable location state of their descendent nodes, respectively, as indicated in the legend. Shaded boxes highlight the major P.2 clades in Uruguay (labels UY-I, UY-II, and UY-III in the tree correspond to clades UY-IP2, UY-IIP2, and UY-IIIP2 mentioned in the main manuscript, respectively). The time-scaled tree was rooted with the earliest P.2 sequence (collection date: 1 September 2020). Branch lengths are drawn to scale indicating nucleotide substitutions per site per year. Figure S5. Geographical and Pango lineage distribution of worldwide SARS-CoV-2 genome sequences carrying mutations Q675H + Q677H. Barplots indicate the number of observed cases per country and lineage. The analysis is based on the 174 B.1.1.28 + Q675H + Q677H sequences from this study and an additional 85 complete and high quality genomes, collected worldwide and carrying Q675H + Q677H independently of the assigned lineage. These sequences were obtained from GISAID on 7 July 2021. B.1.1.28 corresponds to the newly designated P.6 Pango lineage plus one independent arisen sequence collected in Belgium. Table S1. Information on SARS-CoV-2 samples from Uruguay used in this study. GISAID accession identifier, IiWG identifier, collection date, and geographic origin in Uruguay (department) and Pango lineage are indicated. Additionally, # indicates sequences obtained with the standard ARTIC protocol, #c indicates those obtained with the standard ARTIC protocol with long amplicons, and * indicates the samples processed using the “midnight” primer set, using the Nanopore Rapid kit. Table S2. Information about Uruguayan, Brazilian, and worldwide samples obtained from the EpiCoV database in GISAID and used in this study (accessed on 7 July 2021). Table S3. GISAID (EpiCoV database) acknowledgments table.
Author Contributions
Conceptualization: N.R. (Natalia Rego), G.B., L.S.; methodology, C.S., M.P., A.C., A.F., P.P., V.N.; software, I.F.; formal analysis, N.R. (Natalia Rego), C.S., M.P., A.C., A.F., I.A, D.M., P.P., M.R.M., G.B.; investigation, N.R. (Natalia Rego), C.S., M.P., A.C., A.F., P.P., T.F.-C., V.N., M.B., R.A., M.A. (Mailen Arleo), T.P., N.R. (Natalia Reyes), M.N.B., A.L., V.B., A.M., O.C., N.N., J.H., M.D., M.B.G., L.G., M.M. (Mauricio Mendez), M.P.T., J.Z., B.R., M.M. (Matias Maidana), M.A. (Martina Alonso), P.S., I.A., E.P., D.M., C.A., J.M., H.A., R.C., G.B., P.M., G.M., G.I., L.S.; resources, V.N., M.B., R.A., M.A. (Mailen Arleo), T.P., N.R. (Natalia Reyes), M.N.B., A.L., V.B., A.M., O.C., N.N., J.H., M.D., M.B.G., L.G., M.M. (Mauricio Mendez), M.P.T., J.Z., B.R., M.M. (Matias Maidana), M.A. (Martina Alonso), P.S., C.A., J.M., H.A., R.C.; data curation, N.R. (Natalia Rego), C.S., I.F., T.F.-C.; writing—original draft preparation, N.R. (Natalia Rego), L.S.; writing—review and editing, C.S., T.F.-C., M.R.M., D.M., G.B., P.M., G.M., G.I.; visualization, M.R.M.; supervision, P.M., G.M., G.I., L.S.; funding acquisition, P.M., G.M, G.I. All authors have read and agreed to the published version of the manuscript.
Funding
This work was supported by FOCEM-Fondo para la Convergencia Estructural del Mercosur (COF03/11).
Institutional Review Board Statement
The National Ministry of Health (Uruguay) is the main health Institution in our country. It is a dedicated ethics oversight body, and granted us the ethical approval for this work. All necessary patient/participant consent was obtained, and the appropriate institutional forms were archived.
Informed Consent Statement
Informed consent was obtained from all subjects involved in the study.
Data Availability Statement
All SARS-CoV-2 genome sequences have been submitted to the EpiCoV/GISAID database with accession numbers indicated in Table S1.
Conflicts of Interest
The authors declare no conflict of interest.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.CDC Coronavirus Disease 2019 (COVID-19) [(accessed on 17 August 2021)]; Available online: https://www.cdc.gov/coronavirus/2019-ncov/variants/variant-info.html.
- 2.Janik E., Niemcewicz M., Podogrocki M., Majsterek I., Bijak M. The Emerging Concern and Interest SARS-CoV-2 Variants. Pathogens. 2021;10:633. doi: 10.3390/pathogens10060633. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Candido D.S., Claro I.M., de Jesus J.G., Souza W.M., Moreira F.R.R., Dellicour S., Mellan T.A., du Plessis L., Pereira R.H.M., Sales F.C.S., et al. Evolution and Epidemic Spread of SARS-CoV-2 in Brazil. Science. 2020;369:1255–1260. doi: 10.1126/science.abd2161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Resende P.C., Delatorre E., Gräf T., Mir D., Motta F.C., Appolinario L.R., da Paixão A.C.D., da Fonseca Mendonça A.C., Ogrzewalska M., Caetano B., et al. Evolutionary Dynamics and Dissemination Pattern of the SARS-CoV-2 Lineage B.1.1.33 During the Early Pandemic Phase in Brazil. Front. Microbiol. 2021;11:3565. doi: 10.3389/fmicb.2020.615280. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Rede Genomica Fiocruz. [(accessed on 17 August 2021)]. Available online: http://www.genomahcov.fiocruz.br/
- 6.Fujino T., Nomoto H., Kutsuna S., Ujiie M., Suzuki T., Sato R., Fujimoto T., Kuroda M., Wakita T., Ohmagari N. Novel SARS-CoV-2 Variant in Travelers from Brazil to Japan. Emerg. Infect. Dis. 2021;27:1243. doi: 10.3201/eid2704.210138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Faria N.R., Mellan T.A., Whittaker C., Claro I.M., Candido D.D.S., Mishra S., Crispim M.A.E., Sales F.C.S., Hawryluk I., McCrone J.T., et al. Genomics and Epidemiology of the P.1 SARS-CoV-2 Lineage in Manaus, Brazil. Science. 2021;372:815–821. doi: 10.1126/science.abh2644. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Naveca F.G., Nascimento V., de Souza V.C., de Lima Corado A., Nascimento F., Silva G., Costa Á., Duarte D., Pessoa K., Mejía M., et al. COVID-19 in Amazonas, Brazil, Was Driven by the Persistence of Endemic Lineages and P.1 Emergence. Nat. Med. 2021;27:1230–1238. doi: 10.1038/s41591-021-01378-7. [DOI] [PubMed] [Google Scholar]
- 9.Cov-Lineages. [(accessed on 17 August 2021)]. Available online: https://cov-lineages.org/lineage.html?lineage=P.1.
- 10.Cov-Lineages. [(accessed on 17 August 2021)]. Available online: https://cov-lineages.org/lineage.html?lineage=P.2.
- 11.Voloch C.M., da Silva Francisco R., de Almeida L.G.P., Cardoso C.C., Brustolini O.J., Gerber A.L., de C Guimarães A.P., Mariani D., da Costa R.M., Ferreira O.C., et al. Genomic Characterization of a Novel SARS-CoV-2 Lineage from Rio de Janeiro, Brazil. J. Virol. 2021;95 doi: 10.1128/JVI.00119-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Lamarca A.P., de Almeida L.G.P., da S. Francisco R., Lima L.F.A., Scortecci K.C., Perez V.P., Brustolini O.J., Sousa E.S.S., Secco D.A., Santos A.M.G., et al. Genomic Surveillance of SARS-CoV-2 Tracks Early Interstate Transmission of P.1 Lineage and Diversification within P.2 Clade in Brazil. [(accessed on 17 August 2021)];medRxiv. 2021 doi: 10.1101/2021.03.21.21253418. Available online: https://www.medrxiv.org/content/10.1101/2021.03.21.21253418v2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Franceschi V.B., Ferrareze P.A.G., Zimerman R.A., Cybis G.B., Thompson C.E. Mutation hotspots and spatiotemporal distribution of SARS-CoV-2 lineages in Brazil, February 2020–2021. Virus Res. 2021;304:198532. doi: 10.1016/j.virusres.2021.198532. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.da Silva Francisco R., Jr., Lamarca A.P., de Almeida L.G.P., Cavalcante L., Machado D.T., Martins Y., Brustolini O., Gerber A.L., de C Guimarães A.P., Gonçalves R.B., et al. Turnover of SARS-CoV-2 Lineages Shaped the Pandemic and Enabled the Emergence of New Variants in the State of Rio de Janeiro, Brazil. [(accessed on 17 August 2021)];medRxiv. 2021 doi: 10.1101/2021.07.20.21260890. Available online: https://www.medrxiv.org/content/10.1101/2021.07.20.21260890v1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Potential New B.1.1.28 Sub-Lineage with L452R in Brazil Issue #68·Cov-Lineages/Pango-Designation. [(accessed on 17 August 2021)]. Available online: https://github.com/cov-lineages/pango-designation/issues/68.
- 16.Tablizo F.A., Kim K.M., Lapid C.M., Castro M.J.R., Yangzon M.S.L., Maralit B.A., Ayes M.E.C., Cutiongco-de la Paz E.M., De Guzman A.R., Yap J.M.C., et al. Genome Sequencing and Analysis of an Emergent SARS-CoV-2 Variant Characterized by Multiple Spike Protein Mutations Detected from the Central Visayas Region of the Philippines. [(accessed on 17 August 2021)];medRxiv. 2021 doi: 10.1101/2021.03.03.21252812. Available online: https://www.medrxiv.org/content/10.1101/2021.03.03.21252812v2. [DOI] [Google Scholar]
- 17.da Silva Francisco R., Jr., Benites L.F., Lamarca A.P., de Almeida L.G.P., Hansen A.W., Gularte J.S., Demoliner M., Gerber A.L., de C Guimarães A.P., Antunes A.K.E., et al. Pervasive Transmission of E484K and Emergence of VUI-NP13L with Evidence of SARS-CoV-2 Co-Infection Events by Two Different Lineages in Rio Grande Do Sul, Brazil. Virus Res. 2021;296:198345. doi: 10.1016/j.virusres.2021.198345. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Sant’Anna F.H., Varela A.P.M., Prichula J., Comerlato J., Comerlato C.B., Roglio V.S., Pereira G.F.M., Moreno F., Seixas A., Wendland E.M. Emergence of the Novel SARS-CoV-2 Lineage VUI-NP13L and Massive Spread of P.2 in South Brazil. Emerg. Microbes Infect. 2021;10:1431–1440. doi: 10.1080/22221751.2021.1949948. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Estadisticasuy. [(accessed on 26 July 2021)]. Available online: https://guiad-covid.github.io/estadisticasuy.html.
- 20.Taylor L. Why Uruguay Lost Control of COVID. Nature. 2021;595:21. doi: 10.1038/d41586-021-01714-4. [DOI] [PubMed] [Google Scholar]
- 21.Elizondo V., Harkins G.W., Mabvakure B., Smidt S., Zappile P., Marier C., Maurano M.T., Perez V., Mazza N., Beloso C., et al. SARS-CoV-2 Genomic Characterization and Clinical Manifestation of the COVID-19 Outbreak in Uruguay. Emerg. Microbes Infect. 2021;10:51–65. doi: 10.1080/22221751.2020.1863747. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Mir D., Rego N., Resende P.C., Tort F., Fernández-Calero T., Noya V., Brandes M., Possi T., Arleo M., Reyes N., et al. Recurrent Dissemination of SARS-CoV-2 Through the Uruguayan–Brazilian Border. Front. Microbiol. 2021;12:653986. doi: 10.3389/fmicb.2021.653986. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Fraser C., Riley S., Anderson R.M., Ferguson N.M. Factors That Make an Infectious Disease Outbreak Controllable. Proc. Natl. Acad. Sci. USA. 2004;101:6146–6151. doi: 10.1073/pnas.0307506101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Grantz K.H., Lee E.C., McGowan L.D., Lee K.H., Metcalf C.J.E., Gurley E.S., Lessler J. Maximizing and Evaluating the Impact of Test-Trace-Isolate Programs: A Modeling Study. PLoS Med. 2021;18:e1003585. doi: 10.1371/journal.pmed.1003585. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Uruguay Medidas. [(accessed on 16 December 2020)]. Available online: https://medios.presidencia.gub.uy/tav_portal/2020/noticias/AH_204/Medidas%2016.12.2020.pdf.
- 26.Rego N., Costábile A., Paz M., Salazar C., Perbolianachis P., Spangenberg L., Ferrés I., Arce R., Fajardo A., Arleo M., et al. Real-Time Genomic Surveillance for SARS-CoV-2 Variants of Concern, Uruguay. Emerg. Infect. Dis. 2021;27 doi: 10.3201/eid2711.211198. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Cov-Lineages. [(accessed on 17 August 2021)]. Available online: https://cov-lineages.org/lineage.html?lineage=P.6.
- 28.Coutard B., Valle C., de Lamballerie X., Canard B., Seidah N.G., Decroly E. The Spike Glycoprotein of the New Coronavirus 2019-NCoV Contains a Furin-like Cleavage Site Absent in CoV of the Same Clade. Antiviral Res. 2020;176:104742. doi: 10.1016/j.antiviral.2020.104742. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Quick J. NCoV-2019 Sequencing Protocol v3 (LoCost) 2020. [(accessed on 31 August 2021)]. Available online: https://www.protocols.io/view/ncov-2019-sequencing-protocol-v3-locost-bh42j8ye/metrics.
- 30.Tyson J.R., James P., Stoddart D., Sparks N., Wickenhagen A., Hall G., Choi J.H., Lapointe H., Kamelian K., Smith A.D., et al. Improvements to the ARTIC Multiplex PCR Method for SARS-CoV-2 Genome Sequencing Using Nanopore. BioRxiv. 2020 doi: 10.1101/2020.09.04.283077. [DOI] [Google Scholar]
- 31.Resende P. Long Reads Nanopore Sequencing to Recover SARS-CoV-2 Whole Genome. Protocols. 2020 doi: 10.17504/protocols.io.bfy7jpzn. [DOI] [Google Scholar]
- 32.Freed N., Silander O. SARS-CoV2 Genome Sequencing Protocol (1200bp Amplicon “Midnight” Primer Set, Using Nanopore Rapid Kit) V5. [(accessed on 31 August 2021)]. Available online: https://www.protocols.io/view/sars-cov2-genome-sequencing-protocol-1200bp-amplic-btsrnnd6?version_warning=no.
- 33.Freed N.E., Vlková M., Faisal M.B., Silander O.K. Rapid and Inexpensive Whole-Genome Sequencing of SARS-CoV-2 Using 1200 Bp Tiled Amplicons and Oxford Nanopore Rapid Barcoding. Biol. Methods Protoc. 2020;5:bpaa014. doi: 10.1093/biomethods/bpaa014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Oxford Nanopore Technologies. [(accessed on 2 June 2021)]. Available online: https://nanoporetech.com/
- 35.GitHub-Replikation/PoreCov: SARS-CoV-2 Workflow for Nanopore Sequence Data. [(accessed on 23 July 2021)]. Available online: https://github.com/replikation/poreCov.
- 36.Hufsky F., Lamkiewicz K., Almeida A., Aouacheria A., Arighi C., Bateman A., Baumbach J., Beerenwinkel N., Brandt C., Cacciabue M., et al. Computational Strategies to Combat COVID-19: Useful Tools to Accelerate SARS-CoV-2 and Coronavirus Research. Brief. Bioinform. 2021;22:642–663. doi: 10.1093/bib/bbaa232. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Wood D.E., Lu J., Langmead B. Improved Metagenomic Analysis with Kraken 2. Genome Biol. 2019;20:257. doi: 10.1186/s13059-019-1891-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Loman N.J., Quick J., Simpson J.T. A Complete Bacterial Genome Assembled de Novo Using Only Nanopore Sequencing Data. Nat. Methods. 2015;12:733–735. doi: 10.1038/nmeth.3444. [DOI] [PubMed] [Google Scholar]
- 39.Di Tommaso P., Chatzou M., Floden E.W., Barja P.P., Palumbo E., Notredame C. Nextflow Enables Reproducible Computational Workflows. Nat. Biotechnol. 2017;35:316–319. doi: 10.1038/nbt.3820. [DOI] [PubMed] [Google Scholar]
- 40.Kurtzer G.M., Sochat V., Bauer M.W. Singularity: Scientific Containers for Mobility of Compute. PLoS ONE. 2017;12:e0177459. doi: 10.1371/journal.pone.0177459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Ferguson J.M., Gamaarachchi H., Nguyen T., Gollon A., Tong S., Aquilina-Reid C., Bowen-James R., Deveson I.W. InterARTIC: An Interactive Web Application for Whole-Genome Nanopore Sequencing Analysis of SARS-CoV-2 and Other Viruses. bioRxiv. 2021 doi: 10.1101/2021.04.21.440861. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Li H. Minimap2: Pairwise Alignment for Nucleotide Sequences. Bioinformatics. 2018;34:3094–3100. doi: 10.1093/bioinformatics/bty191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Ondov B.D., Bergman N.H., Phillippy A.M. Interactive Metagenomic Visualization in a Web Browser. BMC Bioinform. 2011;12:385. doi: 10.1186/1471-2105-12-385. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Brandt C., Krautwurst S., Spott R., Lohde M., Jundzill M., Marquet M., Hölzer M. PoreCov-An Easy to Use, Fast, and Robust Workflow for SARS-CoV-2 Genome Reconstruction via Nanopore Sequencing. Front. Genet. 2021;12:711437. doi: 10.3389/fgene.2021.711437. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Rambaut A., Holmes E.C., O’Toole Á., Hill V., McCrone J.T., Ruis C., du Plessis L., Pybus O.G. A Dynamic Nomenclature Proposal for SARS-CoV-2 Lineages to Assist Genomic Epidemiology. Nat. Microbiol. 2020;5:1403–1407. doi: 10.1038/s41564-020-0770-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.COG-UK. [(accessed on 26 July 2021)]. Available online: https://pangolin.cog-uk.io/
- 47.O’Toole Á., Scher E., Underwood A., Jackson B., Hill V., McCrone J.T., Colquhoun R., Ruis C., Abu-Dahab K., Taylor B., et al. Assignment of Epidemiological Lineages in an Emerging Pandemic Using the Pangolin Tool. Virus Evol. 2021:veab064. doi: 10.1093/ve/veab064. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Shu Y., McCauley J. GISAID: Global Initiative on Sharing All Influenza Data—from Vision to Reality. Eurosurveillance. 2017;22:30494. doi: 10.2807/1560-7917.ES.2017.22.13.30494. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Katoh K., Standley D.M. MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability. Mol. Biol. Evol. 2013;30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Nguyen L.-T., Schmidt H.A., von Haeseler A., Minh B.Q. IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies. Mol. Biol. Evol. 2015;32:268–274. doi: 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Anisimova M., Gascuel O. Approximate Likelihood-Ratio Test for Branches: A Fast, Accurate, and Powerful Alternative. Syst. Biol. 2006;55:539–552. doi: 10.1080/10635150600755453. [DOI] [PubMed] [Google Scholar]
- 52.Sagulenko P., Puller V., Neher R.A. TreeTime: Maximum-Likelihood Phylodynamic Analysis. Virus Evol. 2018;4 doi: 10.1093/ve/vex042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Duchene S., Featherstone L., Haritopoulou-Sinanidou M., Rambaut A., Lemey P., Baele G. Temporal Signal and the Phylodynamic Threshold of SARS-CoV-2. Virus Evol. 2020;6 doi: 10.1093/ve/veaa061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Time Dependence of SARS-CoV-2 Substitution Rates—SARS-CoV-2 Coronavirus/NCoV-2019 Evolutionary History. [(accessed on 23 July 2021)]. Available online: https://virological.org/t/time-dependence-of-sars-cov-2-substitution-rates/542.
- 55.Ishikawa S.A., Zhukova A., Iwasaki W., Gascuel O. A Fast Likelihood Method to Reconstruct and Visualize Ancestral Scenarios. Mol. Biol. Evol. 2019;36:2069–2085. doi: 10.1093/molbev/msz131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Suchard M.A., Lemey P., Baele G., Ayres D.L., Drummond A.J., Rambaut A. Bayesian Phylogenetic and Phylodynamic Data Integration Using BEAST 1.10. Virus Evol. 2018;4 doi: 10.1093/ve/vey016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Drummond A.J., Rambaut A., Shapiro B., Pybus O.G. Bayesian Coalescent Inference of Past Population Dynamics from Molecular Sequences. Mol. Biol. Evol. 2005;22:1185–1192. doi: 10.1093/molbev/msi103. [DOI] [PubMed] [Google Scholar]
- 58.Lemey P., Rambaut A., Drummond A.J., Suchard M.A. Bayesian Phylogeography Finds Its Roots. PLoS Comput. Biol. 2009;5:e1000520. doi: 10.1371/journal.pcbi.1000520. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Ferreira M.A.R., Suchard M.A. Bayesian Analysis of Elapsed Times in Continuous-Time Markov Chains. Can. J. Stat. 2008;36:355–368. doi: 10.1002/cjs.5550360302. [DOI] [Google Scholar]
- 60.Rambaut A., Drummond A.J., Xie D., Baele G., Suchard M.A. Posterior Summarization in Bayesian Phylogenetics Using Tracer 1.7. Syst. Biol. 2018;67:901–904. doi: 10.1093/sysbio/syy032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Bouckaert R., Vaughan T.G., Barido-Sottani J., Duchêne S., Fourment M., Gavryushkina A., Heled J., Jones G., Kühnert D., Maio N.D., et al. BEAST 2.5: An Advanced Software Platform for Bayesian Evolutionary Analysis. PLoS Comput. Biol. 2019;15:e1006650. doi: 10.1371/journal.pcbi.1006650. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.FigTree. [(accessed on 26 July 2021)]. Available online: http://tree.bio.ed.ac.uk/software/figtree/
- 63.Yu G., Smith D.K., Zhu H., Guan Y., Lam T.T.-Y. Ggtree: An r Package for Visualization and Annotation of Phylogenetic Trees with Their Covariates and Other Associated Data. Methods Ecol. Evol. 2017;8:28–36. doi: 10.1111/2041-210X.12628. [DOI] [Google Scholar]
- 64.D. E. Shaw Research Technical Data Molecular Dynamics Simulations Related to SARS-CoV-2. [(accessed on 26 July 2021)]. Available online: https://www.deshawresearch.com/downloads/download_trajectory_sarscov2.cgi/
- 65.Humphrey W., Dalke A., Schulten K. VMD: Visual Molecular Dynamics. J. Mol. Graph. 1996;14:33–38. doi: 10.1016/0263-7855(96)00018-5. [DOI] [PubMed] [Google Scholar]
- 66.Lemmin T., Kalbermatter D., Harder D., Plattet P., Fotiadis D. Structures and Dynamics of the Novel S1/S2 Protease Cleavage Site Loop of the SARS-CoV-2 Spike Glycoprotein. J. Struct. Biol. X. 2020;4:100038. doi: 10.1016/j.yjsbx.2020.100038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Bagdonaite I., Thompson A.J., Wang X., Søgaard M., Fougeroux C., Frank M., Diedrich J.K., Yates J.R., Salanti A., Vakhrushev S.Y., et al. Site-Specific O-Glycosylation Analysis of SARS-CoV-2 Spike Protein Produced in Insect and Human Cells. Viruses. 2021;13:551. doi: 10.3390/v13040551. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Sanda M., Morrison L., Goldman R. N- and O-Glycosylation of the SARS-CoV-2 Spike Protein. Anal. Chem. 2021;93:2003–2009. doi: 10.1021/acs.analchem.0c03173. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Gobeil S.M.-C., Janowska K., McDowell S., Mansouri K., Parks R., Stalls V., Kopp M.F., Manne K., Li D., Wiehe K., et al. Effect of Natural Mutations of SARS-CoV-2 on Spike Structure, Conformation, and Antigenicity. Science. 2021 doi: 10.1126/science.abi6226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Hodcroft E.B., Domman D.B., Snyder D.J., Oguntuyo K.Y., Diest M.V., Densmore K.H., Schwalm K.C., Femling J., Carroll J.L., Scott R.S., et al. Emergence in Late 2020 of Multiple Lineages of SARS-CoV-2 Spike Protein Variants Affecting Amino Acid Position 677. medRxiv. 2021 doi: 10.1101/2021.02.12.21251658. [DOI] [Google Scholar]
- 71.Detection of the Recurrent Substitution Q677H in the Spike Protein of SARS-CoV-2 in Cases Descended from the Lineage B.1.429—SARS-CoV-2 Coronavirus/NCoV-2019 Genomic Epidemiology. [(accessed on 22 July 2021)]. Available online: https://virological.org/t/detection-of-the-recurrent-substitution-q677h-in-the-spike-protein-of-sars-cov-2-in-cases-descended-from-the-lineage-b-1-429/660.
- 72.Hodcroft E.B., Zuber M., Nadeau S., Vaughan T.G., Crawford K.H.D., Althaus C.L., Reichmuth M.L., Bowen J.E., Walls A.C., Corti D., et al. Spread of a SARS-CoV-2 Variant through Europe in the Summer of 2020. Nature. 2021:1–6. doi: 10.1038/s41586-021-03677-y. [DOI] [PubMed] [Google Scholar]
- 73.Örd M., Faustova I., Loog M. The Sequence at Spike S1/S2 Site Enables Cleavage by Furin and Phospho-Regulation in SARS-CoV2 but Not in SARS-CoV1 or MERS-CoV. Sci. Rep. 2020;10:16944. doi: 10.1038/s41598-020-74101-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Tang T., Jaimes J.A., Bidon M.K., Straus M.R., Daniel S., Whittaker G.R. Proteolytic Activation of SARS-CoV-2 Spike at the S1/S2 Boundary: Potential Role of Proteases beyond Furin. ACS Infect. Dis. 2021;7:264–272. doi: 10.1021/acsinfecdis.0c00701. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Peacock T.P., Goldhill D.H., Zhou J., Baillon L., Frise R., Swann O.C., Kugathasan R., Penn R., Brown J.C., Sanchez-David R.Y., et al. The Furin Cleavage Site in the SARS-CoV-2 Spike Protein Is Required for Transmission in Ferrets. Nat. Microbiol. 2021;6:899–909. doi: 10.1038/s41564-021-00908-w. [DOI] [PubMed] [Google Scholar]
- 76.Zhang J., Cai Y., Xiao T., Lu J., Peng H., Sterling S.M., Walsh R.M., Rits-Volloch S., Zhu H., Woosley A.N., et al. Structural Impact on SARS-CoV-2 Spike Protein by D614G Substitution. Science. 2021;372:525–530. doi: 10.1126/science.abf2303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Mansbach R.A., Chakraborty S., Nguyen K., Montefiori D.C., Korber B., Gnanakaran S. The SARS-CoV-2 Spike Variant D614G Favors an Open Conformational State. Sci. Adv. 2021;7:eabf3671. doi: 10.1126/sciadv.abf3671. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Yurkovetskiy L., Wang X., Pascal K.E., Tomkins-Tinch C., Nyalile T.P., Wang Y., Baum A., Diehl W.E., Dauphin A., Carbone C., et al. Structural and Functional Analysis of the D614G SARS-CoV-2 Spike Protein Variant. Cell. 2020;183:739–751.e8. doi: 10.1016/j.cell.2020.09.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Benton D.J., Wrobel A.G., Roustan C., Borg A., Xu P., Martin S.R., Rosenthal P.B., Skehel J.J., Gamblin S.J. The Effect of the D614G Substitution on the Structure of the Spike Glycoprotein of SARS-CoV-2. Proc. Natl. Acad. Sci. USA. 2021;118 doi: 10.1073/pnas.2022586118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Zhang L., Jackson C.B., Mou H., Ojha A., Peng H., Quinlan B.D., Rangarajan E.S., Pan A., Vanderheiden A., Suthar M.S., et al. SARS-CoV-2 Spike-Protein D614G Mutation Increases Virion Spike Density and Infectivity. Nat. Commun. 2020;11:6013. doi: 10.1038/s41467-020-19808-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Gobeil S.M.-C., Janowska K., McDowell S., Mansouri K., Parks R., Manne K., Stalls V., Kopp M.F., Henderson R., Edwards R.J., et al. D614G Mutation Alters SARS-CoV-2 Spike Conformation and Enhances Protease Cleavage at the S1/S2 Junction. Cell Rep. 2021;34:108630. doi: 10.1016/j.celrep.2020.108630. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Mohammad A., Alshawaf E., Marafie S.K., Abu-Farha M., Abubaker J., Al-Mulla F. Higher Binding Affinity of Furin for SARS-CoV-2 Spike (S) Protein D614G Mutant Could Be Associated with Higher SARS-CoV-2 Infectivity. Int. J. Infect. Dis. 2021;103:611–616. doi: 10.1016/j.ijid.2020.10.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Cheng Y.-W., Chao T.-L., Li C.-L., Wang S.-H., Kao H.-C., Tsai Y.-M., Wang H.-Y., Hsieh C.-L., Lin Y.-Y., Chen P.-J., et al. D614G Substitution of SARS-CoV-2 Spike Protein Increases Syncytium Formation and Virus Titer via Enhanced Furin-Mediated Spike Cleavage. mBio. 2021;12:e00587-21. doi: 10.1128/mBio.00587-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Kampmann T., Mueller D.S., Mark A.E., Young P.R., Kobe B. The Role of Histidine Residues in Low-PH-Mediated Viral Membrane Fusion. Structure. 2006;14:1481–1487. doi: 10.1016/j.str.2006.07.011. [DOI] [PubMed] [Google Scholar]
- 85.Liu Y., Liu J., Johnson B.A., Xia H., Ku Z., Schindewolf C., Widen S.G., An Z., Weaver S.C., Menachery V.D., et al. Delta Spike P681R Mutation Enhances SARS-CoV-2 Fitness over Alpha Variant. [(accessed on 26 June 2021)];bioRxiv. 2021 doi: 10.1101/2021.08.12.456173. Available online: https://www.biorxiv.org/content/10.1101/2021.08.12.456173v1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Maher M.C., Bartha I., Weaver S., di Iulio J., Ferri E., Soriaga L., Lempp F.A., Hie B.L., Bryson B., Berger B., et al. Predicting the Mutational Drivers of Future SARS-CoV-2 Variants of Concern. medRxiv. 2021 doi: 10.1101/2021.06.21.21259286. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All SARS-CoV-2 genome sequences have been submitted to the EpiCoV/GISAID database with accession numbers indicated in Table S1.