Abstract
In spring 2021, an increasing number of infections was observed caused by the hitherto rarely described SARS-CoV-2 variant A.27 in south-west Germany. From December 2020 to June 2021 this lineage has been detected in 31 countries. Phylogeographic analyses of A.27 sequences obtained from national and international databases reveal a global spread of this lineage through multiple introductions from its inferred origin in Western Africa. Variant A.27 is characterized by a mutational pattern in the spike gene that includes the L18F, L452R and N501Y spike amino acid substitutions found in various variants of concern but lacks the globally dominant D614G. Neutralization assays demonstrate an escape of A.27 from convalescent and vaccine-elicited antibody-mediated immunity. Moreover, the therapeutic monoclonal antibody Bamlanivimab and partially the REGN-COV2 cocktail fail to block infection by A.27. Our data emphasize the need for continued global monitoring of novel lineages because of the independent evolution of new escape mutations.
Subject terms: SARS-CoV-2, Epidemiology, Phylogenetics, Next-generation sequencing, Immune evasion
The A.27 SARS-CoV-2 lineage spread globally in 2021 but did not become dominant. Here, the authors show that A.27 shares some mutations in the spike gene that are present in variants of concern, but lacks the D614G mutation, indicating independent evolution of immune escape properties.
Introduction
The continuing pandemic spread of SARS-CoV-2, the causative agent of coronavirus disease 2019 (COVID-19), has a devastating global impact on life, health care systems and economies by causing significant morbidity and mortality in the human population. SARS-CoV-2 is an enveloped, positive-sense single-stranded RNA virus and infects host cells via binding of the viral spike glycoprotein (S) to the angiotensin-converting enzyme 2 (ACE2) receptor and proteolytic activation through cellular proteases1,2. The mature S protein is cleaved into two subunits S1 and S2 and organized as a homotrimer in the viral particle3. While S1 forms a globular structure essential for ACE2 binding, S2 mediates membrane fusion. Both the receptor-binding domain (RBD) and the N-terminal domain (NTD)4 are targeted by neutralizing antibodies in sera of convalescent and vaccinated individuals5,6. Thus, multiple RBD-specific monoclonal antibodies (mAb) are assessed in clinical trials or are approved to treat COVID-19, including Bamlanivimab (LY-Cov-555) in combination with Etesevimab (LY-COV016)7 and the REGN-COV2 mAb cocktail (REGN10933 and REGN10987)8.
Early in the pandemic, SARS-CoV-2 acquired the S D614G substitution that has been associated with increased transmissibility and set the genetic foundation for the large number of B.1 derived lineages9,10. As the pandemic progressed the genomic diversity of SARS-CoV-2 increased significantly and several variants of concern (VOCs) and variants of interest (VOIs) emerged. These variants may be associated with higher transmissibility, can lead to more severe disease and/or significantly escape from antibody-mediated immunity, thereby reducing the effectiveness of available vaccines and treatments with mAbs11–13. Prominent examples are the B.1.1.7 (Alpha) and B.1.617.2 (Delta) variants that dominated global infections in late 2020 and 2021. These variants are characterized by specific patterns of concerning S mutations: apart from the D614G substitution, lineage B.1.1.7 has the N501Y amino acid substitution associated with increased affinity to ACE214,15 and two deletions in the NTD, among other changes. Moreover, a B.1.1.7 sub-lineage with an additional E484K substitution in the RBD has been detected in multiple countries. The E484K amino acid change is also found in other VOCs/VOIs and has been shown to reduce antibody neutralization16. A prominent amino acid change in the S protein of the B.1.617.2 variant is L452R that is also found in various other lineages. This mutation was shown to enhance infectivity in vitro and decrease neutralization by sera of COVID-19 patients and vaccinees17,18.
Here, we describe the detection, inferred origin and phenotypic characteristics of SARS-CoV-2 lineage A.27, which was primarily identified in Germany19 and France20 in spring 2021. This variant emerged in late 2020 and spread to over 30 countries. Through travel history-aware phylogeographic reconstruction, we estimate Western Africa as the likely origin of this lineage, from where it spread to other regions. A.27 is characterized by a mutational profile including L18F, L452R and N501Y in the S protein that combines genetic changes found in various VOCs/VOIs, while lacking the D614G substitution present in most other lineages. Our data demonstrate that A.27 can partially escape the neutralization by sera of vaccinees and recovered COVID-19 patients, and by therapeutic mAbs. This study emphasizes the importance of continued molecular surveillance to quickly detect antibody escape variants that threaten global vaccination efforts.
Results
Detection of SARS-CoV-2 lineage A.27
In early 2021, sequencing of material from SARS-CoV-2 infected individuals in Germany increased, with about one third of sequences generated in the south-western state Baden-Wuerttemberg (BW) located at the French border (Fig. 1a/b). Within the molecular surveillance program of the Robert Koch Institute (RKI, Public Health Institute Germany), 851/178,264 (0.48%) sequences were classified as lineage A.27 from 18th of January to 1st of June 2021. The A.27 lineage was initially defined in January 2021 following an outbreak in Mayotte21, an overseas region of France located in the Indian Ocean between the coast of Mozambique and Madagascar. Most of the German A.27 sequences originated in BW (81.4%), and in the northern state of Schleswig-Holstein (SH) (6.8%) (Fig. 1a). In the beginning of 2021, the frequency of A.27 cases steadily increased, reaching up to 6% of all sequences generated in BW and SH (Fig. 1c). Afterwards, the relative detection rate of this variant decreased while the frequency of VOC B.1.1.7 consistently increased (Supplementary Fig. 1). From December 2020 to June 2021, A.27 was reported in 31 countries. Apart from the German sequences, 535 A.27 sequences had been deposited in the GISAID database22. The earliest sequences were reported in Denmark (Europe) in December 2020 and in the Western African countries Senegal, Burkina Faso and Togo. Interestingly, the majority of A.27 sequences deposited in GISAID between January and April 2021 originated from France (n = 263), indicating a similarly rapid spread in France compared to Germany (Fig. 1d).
Phylogeographic analyses reveal West Africa as the likely origin of A.27
To further characterize the global spread of A.27 and to estimate the potential origin of this lineage, we performed maximum-likelihood (ML) phylogenetic and Bayesian phylogeographic analyses incorporating available travel data of 12 A.27 infected patients (Supplementary Table 1). A preliminary unrooted ML phylogenetic analysis of 1386 complete A.27 genomes and a representative set of 2516 sequences from an Africa-focused Nextstrain build, capturing the global SARS-CoV-2 diversity, showed sufficient temporal signal in a root-to-tip regression analysis. Therefore, we estimated a global time-calibrated phylogeny, which dated its most recent common ancestor (tMRCA) to the second half of November 2020. The predicted evolutionary rate of 7.60e−04 substitutions per site and year was in line with previous estimates for SARS-CoV-2 (ref. 23). We subsequently performed a more targeted ML phylogenetic analysis on a subtree of the inferred global phylogeny that contained all A.27 and 25 non-A.27 genomes. The analysis of this subset also had a sufficient temporal signal and a time-calibrated tree showed that A.27 is monophyletic and further diversified in early 2021 (Supplementary Fig. 2). Location-specific clusters within Germany or France indicated possible independent introductions of A.27 into Europe.
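The root-to-tip regression mentioned above can be illustrated with a minimal sketch. It assumes that sampling dates and root-to-tip distances have already been extracted from a rooted ML tree; the function names and the three example tips are hypothetical and are not taken from the study. The slope of the fitted line approximates the evolutionary rate and the x-intercept approximates the tMRCA.

```python
from datetime import date


def decimal_year(d: date) -> float:
    """Convert a calendar date to a fractional year (e.g. 2021.5)."""
    start = date(d.year, 1, 1)
    end = date(d.year + 1, 1, 1)
    return d.year + (d - start).days / (end - start).days


def root_to_tip_regression(dates, distances):
    """Ordinary least-squares fit of root-to-tip distance against sampling time.

    Returns (rate, tmrca, r_squared): rate is the slope in substitutions per
    site and year, tmrca is the x-intercept of the fitted line.
    """
    n = len(dates)
    mean_x = sum(dates) / n
    mean_y = sum(distances) / n
    sxx = sum((x - mean_x) ** 2 for x in dates)
    syy = sum((y - mean_y) ** 2 for y in distances)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(dates, distances))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r_squared = (sxy * sxy) / (sxx * syy)
    return slope, -intercept / slope, r_squared


# Hypothetical tips placed exactly on a clock line (rate 8e-4, root at 2020.75).
tip_dates = [decimal_year(date(2021, 1, 15)),
             decimal_year(date(2021, 3, 1)),
             decimal_year(date(2021, 5, 20))]
tip_dists = [8e-4 * (t - 2020.75) for t in tip_dates]
rate, tmrca, r2 = root_to_tip_regression(tip_dates, tip_dists)
```

A high coefficient of determination together with a positive slope is what is usually read as sufficient temporal signal; real data scatter around the line rather than lying on it.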
Subsampled travel history-aware Bayesian phylogeographic analyses of the A.27 subtree (Supplementary Fig. 2) using BEAST 1.10.5 (ref. 24) estimated the ancestral origin of the A.27 lineage in Western Africa (Fig. 2a and Supplementary Fig. 3). The subsampling was performed to take the sampling bias toward the high proportion of sequences from Germany and France into account (Fig. 2b). This analysis predicted a tMRCA for A.27 in late September 2020 (95% Highest Posterior Density interval (HPD) ranging between mid-August 2020 and late October 2020) with an evolutionary rate of 8.15e−04 (95% HPD: [7.04e−04; 9.33e−04]) substitutions per site and year. The earliest introduction of A.27 into Germany likely occurred in the third week of November 2020 (95% HPD covering the second half of November 2020), while the introduction into France happened slightly earlier around the beginning of November 2020 (95% HPD between early October 2020 and mid-November 2020). This places the estimated tMRCAs a few months before the first confirmed cases in both countries (January 4th and January 6th of 2021 for France and Germany, respectively). Therefore, we estimate that A.27 was introduced into Europe between 6 and 8 weeks after its common tMRCA.
We estimate that A.27 was initially transmitted within Western Africa (Fig. 2a and Supplementary Fig. 3) before spreading to other regions. The spread from West Africa was estimated by the expected number of transitions between all regions (Markov jumps). We confirmed a large number of introductions out of West Africa and multiple separate introduction events in Germany and France, leading to the different large German and French clades (Fig. 2c and Supplementary Fig. 4). Other African regions could have been the source for seeding introductions into Europe, such as Mayotte seeding into France, although this was not consistently supported (Supplementary Data 1). However, we also inferred introductions from France to the Benelux Union and Northern Africa, as well as introductions into Asia-Pacific (APAC) from the Benelux Union. Although no A.27 sequences were available to us from Southern Africa, our travel history-aware phylogeographic analysis showed consistent and strong support for a spread to this region from Western Europe, which in turn led to an introduction into regions of the Benelux Union. This bi-directional exchange of lineages between African regions and the European continent can also be observed for Northern and Central Africa. This is in contrast with Western and Eastern Africa, which were seen as exclusive sources of lineages in Europe in our reconstruction. In conclusion, A.27 likely originated in Western Africa in late 2020, from where it spread to multiple countries around the globe resulting in large clusters in Germany and France in spring 2021.
Proportion of hospitalized patients infected with A.27 or B.1.1.7 in Germany
As part of the German molecular surveillance program, sequences uploaded to the RKI are linked to case-based data reported by local public health authorities into the electronic surveillance system for infectious disease. We compared available patient data of 329 sequenced A.27 and 56,453 patient specimens of VOC B.1.1.7 between January and June of 2021. Interestingly, in the set of sequences that were randomly selected and limited to not fully vaccinated patients, those infected with A.27 (n = 100) were on average 5.6 years older than B.1.1.7 (n = 17,512) infected individuals (Fig. 3a). There was no difference in gender distribution between A.27 and B.1.1.7 patients (Fig. 3b). Accordingly, we compared the proportion of hospitalized B.1.1.7 and A.27 infected patients. We found no significant differences in the proportion of hospitalization, 8.9% for B.1.1.7 and 6.2% for A.27, respectively (Fig. 3c). Analysis of A.27 and B.1.1.7 infections shows that A.27 infections also preferentially occur in older individuals, with the risk of hospital admission increasing with age.
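As an illustration of how two hospitalization proportions can be compared, the sketch below applies a pooled two-proportion z-test. This is an assumption made for illustration only: the text does not state which test was applied, and the counts used here are hypothetical values chosen to give proportions close to the reported 6.2% and 8.9%.

```python
import math


def two_proportion_z_test(events_a, n_a, events_b, n_b):
    """Two-sided z-test for the difference between two proportions.

    Returns (p_a, p_b, z, p_value). Uses the pooled standard error, which is
    a large-sample approximation; an exact test may be preferable when one
    group is small.
    """
    p_a = events_a / n_a
    p_b = events_b / n_b
    pooled = (events_a + events_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_a - p_b) / se
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided tail probability
    return p_a, p_b, z, p_value


# Hypothetical counts: 6 of 97 A.27 cases and 1,500 of 16,854 B.1.1.7 cases.
p_a27, p_b117, z, p_value = two_proportion_z_test(6, 97, 1500, 16854)
```

With a small A.27 group, a difference of a few percentage points does not reach significance in this sketch, which mirrors the limitation noted for the available metadata.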
A.27 is characterized by a mutation profile similar to current VOCs and VOIs
We characterized the mutational profile of A.27 based on 1386 full genome A.27 sequences. The nucleotide profiles were determined in comparison to Wuhan-Hu-1 using covSonar (https://gitlab.com/s.fuchs/covsonar) and aligned to each other (Supplementary Fig. 5). Lineage-specific mutations were defined as mutations present in ≥75% of all sequences (Table 1). The A.27 lineage is characterized by 26 mutations including seven non-synonymous mutations in the S gene, a frameshift in ORF3a and a deletion in ORF8. The frameshift in ORF3a is located at the C-terminus in a region that has so far not been resolved in available cryo-electron microscopy analyses25 and leads to a 14 amino-acid truncated protein. The deletion in ORF8 also resides at the C-terminal end and translates into a deletion of an aspartate and phenylalanine involved in the stabilization of the ORF8 dimerization interface26 (Supplementary Fig. 6). Three of the seven S substitutions, L18F, L452R, and N501Y, are of particular interest (Fig. 4a–c). The L18F amino acid substitution is found within the first of five loops of the NTD supersite5,27 and the L452R and N501Y mutations are located in the receptor-binding motif (RBM) which interacts with the human ACE2 protein1,4,28. Both regions are targets for neutralizing antibodies27,29,30, and L18F and L452R have been previously associated with antibody escape and L452R additionally with increased infectivity17,18,31. Furthermore, N501Y was suggested to enhance the binding affinity to ACE2 (refs. 14,15). These three mutations are also found in multiple VOCs and VOIs (Fig. 4c). L18F is present in the VOCs B.1.351 and P.1, the L452R substitution is found in high frequencies in B.1.617.2 and related AY lineages, and N501Y is known from B.1.1.7, B.1.351 and P.1. One of the hallmarks of A.27 is the absence of the S D614G substitution present in the globally dominating B.1-derived lineages, indicating an independent acquisition of the other spike mutations.
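The frequency threshold used to define lineage-specific mutations can be sketched as follows. This is a minimal illustration, assuming each genome has already been reduced to a set of nucleotide changes relative to Wuhan-Hu-1. It is not the covSonar implementation, and the four toy profiles are hypothetical.

```python
from collections import Counter


def lineage_defining_mutations(profiles, threshold=0.75):
    """Return the mutations present in at least `threshold` of all profiles.

    `profiles` is a list of per-genome mutation collections; each mutation is
    counted at most once per genome.
    """
    counts = Counter(m for profile in profiles for m in set(profile))
    n = len(profiles)
    return sorted(m for m, c in counts.items() if c / n >= threshold)


# Hypothetical toy data: four genomes, one of which lacks C23520T.
profiles = [
    {"C21614T", "T22917G", "A23063T", "C23520T"},
    {"C21614T", "T22917G", "A23063T", "C23520T"},
    {"C21614T", "T22917G", "A23063T", "C23520T"},
    {"C21614T", "T22917G", "A23063T", "G11083T"},
]
at_75 = lineage_defining_mutations(profiles, 0.75)
at_90 = lineage_defining_mutations(profiles, 0.90)
```

In this toy input, C23520T passes the 75% threshold but not the 90% threshold, which is the same kind of difference that separates the two columns of Table 1.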
Table 1.
75% of sequences | 90% of sequences | gene | amino acid substitution |
---|---|---|---|
A361G | A361G | ORF1a | synonymous |
C1122T | C1122T | ORF1a | P286L |
C2509T | C2509T | ORF1a | synonymous |
C8782T | C8782T | ORF1a | synonymous |
A9204G | A9204G | ORF1a | D2980G |
A11217G | A11217G | ORF1a | N3651S |
C16293T | – | ORF1b | synonymous |
C16466T | C16466T | ORF1b | P5401L |
A18366G | A18366G | ORF1b | synonymous |
A20262G | A20262G | ORF1b | synonymous |
C21614T | C21614T | S | L18F |
G22468T | G22468T | S | synonymous |
T22917G | T22917G | S | L452R |
A23063T | A23063T | S | N501Y |
C23520T | – | S | A653V |
C23525T | C23525T | S | H655Y |
G23948T | G23948T | S | D796Y |
G25218T | G25218T | S | G1219V |
T25541C | T25541C | ORF3a | V50A |
del:26160:8 | – | ORF3a | del257/258fsX6 |
C27247T | C27247T | ORF6 | synonymous |
T28144C | T28144C | ORF8 | L84S |
del:28248:6 | del:28248:6 | ORF8 | del:119/120 |
A28273T | A28273T | NCR | synonymous |
G28878A | G28878A | N | S202N |
G29742A | – | NCR | synonymous |
The A.27 Black Forest isolate is attenuated in vivo
To characterize the biological features of the A.27 lineage, we isolated this variant from an oropharyngeal swab. The sample was derived from a patient living in the Black Forest area in South Germany who was treated at the University Medical Center of Freiburg. Virus isolation was performed on VeroE6 cells and followed by one cell culture passage to produce high titre stocks. We performed whole genome sequencing of the initial patient material and the derived virus stocks to analyze if the virus had acquired cell culture adaptations during isolation. Analysis of the variant frequencies found in the respective samples showed a ~60% variant frequency for the G11083T substitution in ORF1ab after isolation. Although likely selected during isolation, this mutation was already present in low frequencies in the patient material (Fig. 5a). Moreover, the virus isolate exhibited high genomic stability throughout the isolation process and all lineage-defining mutations were confirmed. The A.27 Black Forest isolate reached similar titres in both VeroE6 and Calu3 cells, comparable with a prototypic B.1 isolate (Muc-IMB-1)32 that only harbors the S D614G substitution in its viral genome and four different VOCs (Fig. 5b/c). A clear exception was the B.1.351 variant, which showed a 100-fold reduced viral titre 3 days post infection compared with A.27 on Calu3 cells (Fig. 5c) as previously reported33. Furthermore, cells infected with the B.1 and A.27 isolates were analyzed by confocal immunofluorescence microscopy (Fig. 5d). Both virus isolates showed a diffuse cytosolic accumulation of the S and N proteins 8 h post infection. The frameshift in ORF3a in the A.27 isolate might result in an altered cellular localization of this viral protein. Therefore, we additionally stained for ORF3a, but found no differences between B.1 and A.27 infected cells, indicating that the missing C-terminal amino acids do not impact its localization. 
This is in line with previous results showing a comparable cellular localization of wild type ORF3a and a C-terminal deletion mutant of ORF3a34. A previous study showed that ORF3a represents an important virulence factor in human ACE2 transgenic (hACE2) mice35. Therefore, we hypothesized that the ORF3a frameshift in the A.27 isolate could lead to an attenuated phenotype in vivo. To test this hypothesis, we compared the pathogenicity of B.1, B.1.1.7 and A.27 isolates in hACE2 transgenic mice36. Mice were infected with an intranasal inoculum containing 132 plaque forming units (pfu) per virus variant and weight loss and survival were monitored. B.1 and B.1.1.7 infected mice showed pronounced weight loss and all mice reached humane endpoints between 6 and 7 days post infection (Fig. 5e/f). Intriguingly, 75% of A.27 infected mice only transiently lost weight and recovered from the infection demonstrating that A.27 is severely attenuated in vivo compared to the B.1 and B.1.1.7 isolates, possibly due to the deletion in ORF3a.
A.27 escaped neutralization by patient sera and therapeutic antibodies
Lineage A.27 has two mutations in its RBD that translate to L452R and N501Y. Previous binding studies of RBD mutants with mAb and sera showed decreased binding for both mutations6,31 (Supplementary Fig. 7a/b). To estimate the effect of the mutations found in the S gene of A.27 in the context of virus neutralization, serial dilutions of sera from convalescent COVID-19 patients (Fig. 6a and Supplementary Fig. 8) or BioNTech BNT162b2 vaccinees (Fig. 6b and Supplementary Fig. 9) were analyzed by plaque reduction assays. The potential escape was assessed by comparing the A.27 Black Forest isolate to the prototypic B.1 isolate that harbors the D614G mutation. The neutralizing titres resulting in 50% plaque reduction (NT50) of sera from convalescent COVID-19 patients and from vaccinees were significantly reduced, on average two- to three-fold, against the A.27 isolate compared to the B.1 isolate. Notably, the resistance of A.27 toward antibody neutralization was similar in either group as there were no significant differences in the NT50 values between convalescent and vaccinee sera (Fig. 6c). Furthermore, we assessed the escape of A.27, B.1 and four different VOC isolates from the neutralizing capacity of the mAbs LY-COV555 (ref. 37) and the REGN-COV2 cocktail (REGN10933, REGN10987)8 which can be used to treat COVID-19 patients. In strong contrast to B.1 and B.1.1.7, the A.27 as well as the B.1.351, B.1.617.2 and P.1 isolates completely escaped the neutralizing effect of LY-COV555 (Fig. 6d). For REGN10933, B.1.351 and P.1 displayed a pronounced escape (Fig. 6e). Furthermore, the neutralizing capacity of REGN10987 against A.27 and B.1.617.2 was slightly reduced (Fig. 6f). The observed differences for the individual REGN-COV2 antibodies could be compensated by a 1:1 combination of both antibodies, mimicking the actual treatment regimen8 (Fig. 6g). 
Based on the NT50 values, REGN10987 showed an overall broad and strong neutralizing capacity while LY-COV555 failed to neutralize most variants (Table 2). Besides neutralization, a prime function of antigen-bound (complexed) IgG is the activation of Fc-gamma receptors (FcγRs) present on various immune cells such as monocyte-derived cells and Natural Killer (NK) cells. One of the most potent antiviral immune mechanisms mediated by the Fc-part of complexed IgG (Fcγ) is antibody-dependent cellular cytotoxicity elicited by NK cells expressing FcγRIII/CD16. Therefore, we assessed the potential of the above mAbs to activate CD16 in a cell-based FcγR-activation reporter assay. Inactivated virions from different strains were titrated and immobilized on ELISA plates and incubated with the respective mAbs at a fixed concentration. CD16 reporter cells were then cultured on opsonized virions and IL-2 production was measured as an indicator of receptor activation as described previously38. Directly immobilized mAbs served as a positive control and showed that all mAbs are equally able to activate CD16 (Fig. 6h). Incubation on opsonized A.27 and B.1.617.2 virions resulted in reduced CD16 activation for LY-COV555 in line with the neutralization data (Fig. 6i). However, for the other isolates there was no direct correlation between CD16 activation and neutralizing capacity. Considering that all mAbs were able to activate CD16, this showed that residual binding of the therapeutic antibodies to the antigen could still result in CD16 activation. Taken together, these data argue for a pronounced escape of A.27 from neutralizing antibodies similar to other VOCs.
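To make the NT50 readout of the serum plaque reduction assays concrete, the sketch below estimates the reciprocal serum dilution at which plaque counts fall to 50% of the no-serum control. It interpolates linearly in log10 dilution between the two bracketing dilutions. This interpolation is an assumption made for illustration, since the fitting procedure is not described in this section, and all plaque counts are hypothetical.

```python
import math


def nt50(dilutions, plaque_counts, control_count):
    """Reciprocal serum dilution giving 50% plaque reduction.

    `dilutions` are reciprocal dilutions in increasing order (e.g. 32 for
    1:32). Returns None if 50% reduction is never crossed in the series.
    """
    reductions = [1 - c / control_count for c in plaque_counts]
    points = list(zip(dilutions, reductions))
    for (d_lo, r_lo), (d_hi, r_hi) in zip(points, points[1:]):
        if r_lo >= 0.5 > r_hi:
            # Fraction of the way from d_lo to d_hi at which 50% is reached.
            frac = (r_lo - 0.5) / (r_lo - r_hi)
            log_d = math.log10(d_lo) + frac * (math.log10(d_hi) - math.log10(d_lo))
            return 10 ** log_d
    return None


# Hypothetical plaque counts for one serum tested against two isolates.
dilutions = [32, 64, 128, 256]
nt50_b1 = nt50(dilutions, [5, 20, 60, 90], control_count=100)
nt50_a27 = nt50(dilutions, [20, 60, 90, 97], control_count=100)
fold_reduction = nt50_b1 / nt50_a27
```

The ratio of the two titres is the fold reduction in neutralization; in this toy series it is about two-fold, the lower end of the range reported for sera tested against A.27.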
Table 2.
Isolate | LY-COV555 | REGN10933 | REGN10987 | REGN-COV2 |
---|---|---|---|---|
A.27 | > 10 µg/ml | 0.12 µg/ml | 0.05 µg/ml | 0.05 µg/ml |
B.1 | 0.11 µg/ml | 0.09 µg/ml | 0.02 µg/ml | 0.03 µg/ml |
B.1.1.7 | 0.06 µg/ml | 0.02 µg/ml | 0.01 µg/ml | 0.02 µg/ml |
B.1.351 | > 10 µg/ml | 4.70 µg/ml | 0.01 µg/ml | 0.05 µg/ml |
P.1 | > 10 µg/ml | > 10 µg/ml | 0.01 µg/ml | 0.02 µg/ml |
B.1.617.2 | > 10 µg/ml | 0.01 µg/ml | 0.04 µg/ml | 0.02 µg/ml |
Discussion
Here, we report the epidemiology, inferred origin and phenotypic characteristics of SARS-CoV-2 lineage A.27, whose genome contains several substitutions that have also been observed in various VOCs and VOIs. The root of the pandemic lies in the parental A lineage with the characteristic C8782T and T28144C mutations. However, lineages derived from A are rare, with only a few thousand sequences reported worldwide. In the COVID-19 pandemic, infections are currently dominated by B.1-derived lineages harboring the prominent D614G mutation9. This makes the mutational pattern of lineage A.27 of particular interest as it shows the independent acquisition of the same mutations found in B.1-derived VOCs and VOIs in a different genomic background. A similar pattern of concerning mutations in an emerging A-derived lineage has so far only been described for the A.23/A.23.1 lineage first discovered in Uganda and Rwanda39,40. The early detection in multiple countries in Western Africa in late December 2020, an outbreak in January 2021 in Mayotte and A.27 infections of Belgian military personnel returning from Mali pointed to a suspected origin in the African continent41,42. Through carefully crafted phylogeographic analyses that exploit individual travel histories of patients infected with A.27, we here provide support for the origin of A.27 in Western Africa. After completion of our phylogenetic and phylogeographic analyses, on the 17th of September 2021, three additional A.27 genomes from Burkina Faso appeared on GISAID with sampling dates on the 16 and the 19th of December 2020. They additionally confirm the early circulation and our inferred origin of A.27 in West Africa toward the end of 2020. The entire backbone of the A.27 phylogeny was estimated to be located in Western Africa, with the virus consequently spreading from there to other, mostly European, regions. 
From Western Africa, we observed that A.27 spread to Europe through multiple separate introduction events, eventually forming several large German and French clades. Interestingly, the Markov jump analysis did not support bi-directional seeding events between the two adjacent countries, which was further supported by the clear phylogenetic separation of the French and German clades. Note that due to the effect of sampling bias, for example as a result of varying sequencing efforts between countries, the major German and French clades do not necessarily signify that the spread of A.27 was largely confined to these two countries. This was confirmed by the individual travel histories we collected, with two infected patients testing positive in the Netherlands after traveling back from South Africa, although South Africa did not yet report any A.27 genomes.
While the origin of most French A.27 clades also lay in Western Africa, one of the larger French clades seemed to be more closely related to Eastern-African sequences, a fact also corroborated by the Markov jump analysis. This Eastern-African clade corresponds to an outbreak in Mayotte, a French overseas territory in the Indian Ocean, indicating the possibility that one or more travel cases from Mayotte to France led to the introduction of A.27 into France. Notably, only one out of five replicates (see Supplementary Data 1) of our phylogeographic analysis provides strong support (Bayes factor > 20) for A.27 being introduced into France via Mayotte, with the other four replicates showing positive support (Bayes factor > 3). Known travel cases from Mayotte to France of patients infected with A.27 could have led to more conclusive evidence, but unfortunately, we were not able to obtain such individual travel histories for the purpose of our travel history-aware phylogeographic analyses. Of note, we were also not able to obtain travel history for the earliest A.27 sample in Denmark which we assume to be a travel case given that all of the other early A.27 samples came from West Africa.
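The Bayes factor thresholds quoted above can be put into a small worked sketch. It assumes the common definition of a Bayes factor as the ratio of posterior odds to prior odds that a given transition rate is included in the model. The prior and posterior probabilities below are hypothetical, and the exact prior used in the analysis is not stated in this section.

```python
def bayes_factor(posterior_prob, prior_prob):
    """Posterior odds divided by prior odds of a rate being included.

    Both probabilities must lie strictly between 0 and 1.
    """
    posterior_odds = posterior_prob / (1 - posterior_prob)
    prior_odds = prior_prob / (1 - prior_prob)
    return posterior_odds / prior_odds


def support_category(bf):
    """Map a Bayes factor to the two support levels used in the text."""
    if bf > 20:
        return "strong"
    if bf > 3:
        return "positive"
    return "not supported"


# Hypothetical replicates sharing a prior inclusion probability of 0.1.
bf_high = bayes_factor(0.8, 0.1)
bf_low = bayes_factor(0.4, 0.1)
categories = [support_category(bf_high), support_category(bf_low)]
```

Two replicates can therefore fall into different support categories even when both favour the same introduction route, which is the situation described for the Mayotte to France transition.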
Within Germany, particularly in BW, we observed a constant increase in A.27 sequences over several weeks followed by a rapid decline of A.27. Since the increase of B.1.1.7 started earlier, this decline occurred with the start of the third wave in Germany, which was mainly driven by B.1.1.7 and led to its dominance. This suggests a fitness advantage of A.27 in comparison to other lineages but a disadvantage compared to the B.1.1.7 lineage. A possible explanation for this intermediate fitness phenotype could be the acquisition of the N501Y mutation in the absence of D614G. Both mutations are present in VOCs B.1.1.7, B.1.351 and P.1 and might have additive effects. N501Y increases the affinity of the viral S for ACE2 (ref. 14) and D614G is thought to prevent premature dissociation of the S trimer, leading to a higher infectivity and transmissibility10,43,44. We detected a lower, but not significantly lower, proportion of hospitalization among A.27 compared to B.1.1.7 cases, but a significantly higher mean age. However, these observations were clearly limited by the small amount of available metadata. To investigate if both viruses had a comparable virulence, we analyzed the growth of the A.27 isolate in cell culture as well as its pathogenicity in hACE2 transgenic mice. The A.27 and B.1.1.7 isolates demonstrated a comparable viral replication in cell culture but A.27 was significantly attenuated in mice compared to B.1 and B.1.1.7. This indicates that despite comparable viral replication in cell culture, A.27 possesses features which decrease its pathogenicity in mice. Genomic changes like the frameshift in ORF3a and the deletion in ORF8 might contribute to this phenotype. The lack of position 119/120 in the accessory protein ORF8 might lead to a decreased stability of the homodimer and a partial loss of function26. 
Interestingly, B.1.617.2 lacks the two ORF8 amino acids 120/21, strongly indicating a convergent evolution for deletions in this region or the lack of a selective pressure to maintain these amino acids. ORF8 seems to be dispensable for viral replication and deletions/stop codons could represent an adaptation to the human host45. ORF3a has been shown to induce apoptosis and block autophagy34,46. Our immunofluorescence analysis suggests a similar cellular localization and expression of ORF3a in B.1 and A.27 infected cells. However, the frameshift in ORF3a might impair some of the protein’s functions. An attenuation due to a crippled ORF3a would be in line with a previous study showing that ORF3a and ORF6 are the major contributors to viral pathogenesis in hACE2 transgenic mice35. Implications of these amino acid changes for human disease are presently unclear. Functional characterizations of mutations in accessory viral proteins of SARS-CoV-2 are urgently needed to better understand their impact on virulence and pathogenicity of SARS-CoV-2 in humans.
Vaccinations are currently the major instrument to combat the COVID-19 pandemic and treatments with mAbs are a potent therapeutic option to treat COVID-19. As such, antibody escape mutations in circulating variants could further fuel the pandemic. A.27 harbors multiple mutations in the viral S, the major target for neutralizing antibodies, raising the question whether mAbs and sera from COVID-19 patients or from vaccinees will protect from an A.27 infection. Our data suggest that A.27 can escape antibody-mediated immunity. We observed a consistent decrease of the neutralizing capacity of sera from COVID-19 patients and vaccinees. Moreover, A.27 completely escaped the neutralization of LY-COV555 and partially of REGN10987. Importantly, B.1.617.2 escaped these antibodies in a similar manner, suggesting that the L452R mutation present in both variants facilitates this escape47. This is in agreement with deep mutational scanning data that showed decreased binding of LY-COV555 to the L452R mutant (ref. 31). Interestingly, the VOCs B.1.351 and P.1 escaped LY-COV555 and REGN10933, which was likely facilitated by the E484K mutation present in both variants. This suggests that L452R and E484K lead to an escape from LY-COV555 (ref. 31) and to a partial resistance to either REGN10987 or REGN10933, respectively. The FcγR activation assay showed that this escape is to some extent independent of the CD16 activation. However, reduced neutralization can result in reduced CD16 activation as observed for LY-COV555 and A.27 or B.1.617.2, indicating that key immunological mechanisms linked to opsonisation can be additionally impaired in non-neutralizing mAbs. Importantly, the combination of both REGN-COV2 antibodies sufficiently neutralized A.27 as well as all tested VOCs, emphasizing that REGN-COV2 but not LY-COV555 should be used to treat COVID-19 patients suffering from an infection with these variants. 
This also argues for using different mAb preparations when no clear clinical effect is observed, as the lineage might harbor mutations that are refractory to a particular preparation. Notably, NTD polymorphisms might also decrease the neutralizing capacity of sera as multiple studies have detected NTD-specific antibodies in COVID-19 patients48 and vaccinees49. L18F lies within the first of five loops of the NTD supersite5,27 and could contribute to the escape from antibody-mediated immunity as previously suggested50.
The present study analyzed the genomic profile and biological features of the A.27 lineage which was primarily detected in France and Germany in spring 2021. Our phylogeographic analyses that were able to exploit individual travel histories provided evidence that these sequences stem from separate introduction events out of Western Africa, which we estimate to be the geographic origin of A.27. Importantly, our data further suggest that A.27 is less susceptible to SARS-CoV-2-specific antibodies and that COVID-19 patients and vaccinees might not be fully protected against this variant. The presence of concerning S mutations in an A-derived lineage supports the notion that the same escape mutations can appear in relatively distant genomic backgrounds with similar phenotypic consequences. Therefore, global molecular surveillance has to continue to detect novel variants and to support assessing their risk for the human population.
Methods
Data acquisition
To assess the prevalence of lineage A.27 in Germany, all SARS-CoV-2 full-genome sequences that were submitted to the RKI (n = 851) in 2021 until the 1st of June (Supplementary Data 2) were classified using the PANGO lineage assignment (pangolin version: 3.0.3, pangoLEARN: 2021-05-27)51,52. The relative frequency of A.27 and B.1.1.7 sequences in comparison to all submitted sequences was assessed for each federal state and each week of 2021. Sequencing data was linked to patient metadata obtained from local health authorities as part of the genomic surveillance program of the RKI via the national electronic reporting system for surveillance of notifiable infectious diseases (SurvNet)53. Anonymized data of the hospitalization status, sex and age were extracted. Furthermore, as of 2021-08-24 all additionally available 535 A.27 sequences and associated metadata for samples taken until 2021-06-01 were downloaded from GISAID (Supplementary Table 2). The Germany map was downloaded from https://gadm.org/maps/DEU.html and the maps were visualized using tmap54. Patient metadata of hospitalization status, sex and age was analyzed with GraphPad Prism.
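The per-state, per-week relative frequency described above can be sketched in a few lines. This is a minimal illustration and not the analysis code used for the study: the function name `weekly_lineage_share`, the record layout and the example records are assumptions made for illustration, and weeks are taken to be ISO calendar weeks.

```python
from collections import Counter
from datetime import date


def weekly_lineage_share(records, lineage):
    """Return {(state, iso_year, iso_week): share of `lineage`}.

    `records` is an iterable of (federal_state, sampling_date, pango_lineage)
    tuples; this layout is a hypothetical stand-in for the real metadata.
    """
    totals = Counter()
    hits = Counter()
    for state, sampled, assigned in records:
        iso_year, iso_week, _ = sampled.isocalendar()
        key = (state, iso_year, iso_week)
        totals[key] += 1
        if assigned == lineage:
            hits[key] += 1
    # Only state/week bins with at least one sequence appear in the result.
    return {key: hits[key] / totals[key] for key in totals}


# Hypothetical example records (not data from the study).
example = [
    ("Baden-Wuerttemberg", date(2021, 3, 1), "A.27"),
    ("Baden-Wuerttemberg", date(2021, 3, 3), "B.1.1.7"),
    ("Baden-Wuerttemberg", date(2021, 3, 8), "B.1.1.7"),
    ("Bavaria", date(2021, 3, 2), "A.27"),
]
shares = weekly_lineage_share(example, "A.27")
```

Counting sequences per bin before dividing keeps the denominator explicit, which matters because bins with very few submitted sequences yield unstable frequencies.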
Phylogenetic analysis and time-calibrated phylogenetic tree reconstruction
To put the A.27 sequences within the global context, we combined all 851 A.27 sequences from the RKI with 535 A.27 sequences from GISAID, along with 2545 sequences from the Africa-focused Nextstrain build (https://nextstrain.org/ncov/gisaid/africa)55 for a total of 3907 sequences after removing duplicate entries. We limited our selection of sequences from GISAID and Nextstrain to those collected up to June 1st to remain consistent with the sampling period of the RKI sequences. We used this selection of sequences to infer an unrooted phylogenetic tree using IQTREE2 v2.1.056 under a GTR model with empirical frequencies and four-category FreeRate model of site heterogeneity, which was selected as the best fitting model using IQTREE’s ModelTest functionality. The resulting phylogeny was then time calibrated using TreeTime v.0.7.457, rooting the tree on the “Wuhan/Hu-1/2019” isolate, following the Nextstrain SARS-CoV-2 workflow (https://github.com/nextstrain/ncov) and assuming a strict molecular clock and a skyline coalescent model. TreeTime detected one GISAID sequence (EPI_ISL_1586901) and three RKI sequences (IMS-10020-CVDP-DCAB86B5-00C8-496F-9B16-297546A77DF2, IMS-10122-CVDP-DF8FAF93-3173-40A6-85D8-B50274A72B20, IMS-10122-CVDP-5B313DE6-7B34-4BAA-81BD-DEEE937126EC) as outliers, which we subsequently removed from further downstream analyses. Such a relatively low number of outliers is to be expected as the Nextstrain workflow already performs a similar data cleaning step. We visualized the resulting phylogeny using baltic (https://github.com/evogytis/baltic).
Travel history-aware phylogeographic reconstruction
From the full time-calibrated ML phylogeny, we selected the subtree containing all A.27 sequences, along with 25 non-A27 ancestral sequences (n = 1383) to perform a more targeted reconstruction to determine the geographic origin of the A.27 lineage. To this end, we aimed to perform a Bayesian phylogeographic reconstruction using BEAST v1.10.558. We note that over 50% of the sequences in the A.27 clade were collected in Germany (n = 884), with France a close second (n = 263) (see Fig. 2b and Supplementary Fig. 2). Such severe sampling bias is known to affect discrete phylogeographic reconstruction59, leading to overconfidence in inferring oversampled locations as ancestral in the phylogeny60. To mitigate this, we employed a subsampling scheme where we removed identical sequences and limited our dataset to a maximum of 10 randomly selected sequences per week for these two locations. We performed this subsampling procedure five times, to exclude the possibility of accidentally sampling a highly unlikely scenario. This yielded final datasets of between 560 and 565 sequences from 31 countries, on which we performed travel history-aware phylogeographic reconstruction24. However, estimating transition rates between locations that have very few sequences may be subject to poor mixing61. In order to avoid this issue, we aggregated certain locations into larger regions (with the categorization based mainly on the UN geoschemes). For example, sequences from Denmark, Sweden, UK and Ireland were grouped together as belonging to “Northern Europe”. We refer to Supplementary Table 3 for a detailed description of which countries were grouped into which regions and how many sequences were included in total per region. 
This process resulted in a total of 14 regions being considered in the phylogeographic analysis: Asia-Pacific (APAC), Benelux (Belgium, Netherlands and Luxembourg), Eastern Africa, Middle Africa, Southern Africa, Western Africa, North America, Eastern Europe, Southern Europe, Western Europe, France and Germany. We decided not to group France and Germany into a larger region, given that they are countries of interest for this study. For twelve sequences in our dataset that were sequenced in the Netherlands and Belgium, we obtained travel information, indicating cases in which a patient had travelled in the days preceding diagnosis. Two patients had returned from South Africa to the Netherlands, two from Burkina Faso to Belgium (and one other of whom a household member returned from Burkina Faso to Belgium and tested positive) and seven from Mali to Belgium (Fig. 2a and Supplementary Table 1).
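The subsampling scheme described above (removing identical sequences, then capping the two oversampled locations at 10 randomly selected sequences per week, repeated five times) could look roughly as follows. This is a sketch under stated assumptions, not the procedure actually run: the record keys, the function name `subsample`, the choice to deduplicate across the whole dataset and the use of one random seed per replicate are all illustrative assumptions.

```python
import random
from collections import defaultdict


def subsample(records, capped_locations, max_per_week=10, seed=0):
    """Deduplicate sequences, then cap oversampled locations per week.

    `records` is a list of dicts with the (hypothetical) keys
    'id', 'location', 'week' and 'sequence'.
    """
    rng = random.Random(seed)

    # Step 1: keep only the first record of each identical sequence
    # (assumption: duplicates are removed across the whole dataset).
    seen = set()
    unique = []
    for rec in records:
        if rec["sequence"] not in seen:
            seen.add(rec["sequence"])
            unique.append(rec)

    # Step 2: records from capped locations are binned by (location, week);
    # all other records are kept as they are.
    kept = []
    bins = defaultdict(list)
    for rec in unique:
        if rec["location"] in capped_locations:
            bins[(rec["location"], rec["week"])].append(rec)
        else:
            kept.append(rec)

    # Step 3: draw at most `max_per_week` records from each bin.
    for key in sorted(bins):
        group = bins[key]
        if len(group) > max_per_week:
            group = rng.sample(group, max_per_week)
        kept.extend(group)
    return kept


def replicate_subsamples(records, capped_locations, n_replicates=5):
    """Repeat the subsampling with a different seed for each replicate."""
    return [subsample(records, capped_locations, seed=s) for s in range(n_replicates)]
```

Repeating the draw with different seeds mirrors the idea of checking that conclusions do not hinge on one unlucky random subset.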
With these regions and individual travel histories in place, we performed travel history-aware discrete phylogeographic analysis24,59 using BEAST 1.10.558, while employing the BEAGLE 3.2.0 high-performance computational library62 to improve performance. We made use of Bayesian stochastic search variable selection to simultaneously determine which migration rates are zero depending on the evidence in the data and to efficiently infer the ancestral locations, in addition to providing a Bayes factor test to identify significant non-zero migration rates. We also estimated the expected number of transitions (known as Markov jumps63) between all regions in the dataset. On the sequence data partition, we made use of a general time-reversible substitution model with estimated base frequencies and among-site rate heterogeneity64, along with a relaxed molecular clock model with an underlying lognormal distribution65. We used the following prior specifications for these analyses: a non-parametric skygrid coalescent model (for which we employed Hamiltonian Monte Carlo estimation66), a gamma (shape = 0.001; scale = 1000) prior on the skygrid precision parameter, Dirichlet (α1, … αK = 1.0; K equal to the number of states) priors on the transition rates for the GTR substitution model and the frequencies for the GTR nucleotide-substitution model, an exponential (mean = 0.5) prior on the shape parameter of the discretized gamma distribution to model among-site rate heterogeneity, a Poisson prior (mean = 13) on the sum of non-zero rates between regions, a CTMC reference prior on the mean evolutionary rate67 and an exponential (mean = 1/3) prior on its standard deviation.
For the travel history-aware phylogeographic model, we treated the departure time of the patient as a random variable, conditioned on a normal prior distribution with a mean of 10 days before sampling date (based on a mean incubation time of 5 days and a constant ascertainment period of 5 days between symptom onset and testing68) and a standard deviation of 3 days. We truncated the distribution to be positive (back-in-time), in order to avoid an infection time at a later date than the corresponding sampling time.
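To make the shape of this departure-time prior concrete, the following sketch evaluates and samples a normal distribution with mean 10 days and standard deviation 3 days, truncated to positive values. It is an illustration only and does not reproduce the BEAST implementation; the function names and the rejection-sampling approach are assumptions chosen for simplicity.

```python
import math
import random

MEAN_DAYS = 10.0  # 5 days incubation + 5 days ascertainment, as stated in the text
SD_DAYS = 3.0


def normal_cdf(x, mean, sd):
    """Cumulative distribution function of a normal distribution."""
    return 0.5 * (1.0 + math.erf((x - mean) / (sd * math.sqrt(2.0))))


def truncated_normal_pdf(x, mean=MEAN_DAYS, sd=SD_DAYS):
    """Density of the normal prior restricted to x > 0 (days before sampling)."""
    if x <= 0.0:
        return 0.0
    mass_above_zero = 1.0 - normal_cdf(0.0, mean, sd)
    z = (x - mean) / sd
    density = math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))
    # Renormalise so that the truncated density integrates to one.
    return density / mass_above_zero


def sample_departure_offset(rng, mean=MEAN_DAYS, sd=SD_DAYS):
    """Draw a positive number of days before sampling by rejection sampling."""
    while True:
        draw = rng.gauss(mean, sd)
        if draw > 0.0:
            return draw


rng = random.Random(1)
offsets = [sample_departure_offset(rng) for _ in range(1000)]
```

With these parameters the truncation removes very little probability mass, so its main effect is to rule out departure times later than the sampling date.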
Each of these phylogeographic analysis replicates ran for a total of 560 million iterations, with the Markov chains being sampled every 50,000th iteration, in order to reach an effective sample size (ESS) of at least 200 for all relevant parameters, as determined by Tracer 1.769. We used TreeAnnotator to construct maximum clade credibility (MCC) trees for each replicate.
Analysis of A.27 lineage-defining mutations and lineage comparison
The nucleotide and amino acid profiles of the 1386 A.27 sequences were determined in comparison to Wuhan-Hu-1 using covSonar (https://gitlab.com/s.fuchs/covsonar). The R package stringr was used to extract the nucleotide mutations and to define INDELs. The profiles were matched with the R package dplyr, mutations with a frequency below 1% were excluded and the resulting matrix was visualized with the R package pheatmap. We extracted mutations that were present in at least 75% of the mutation profiles and defined them as lineage-defining mutations. Furthermore, the data produced by covSonar were used to compare the A.27 amino acid profile in the viral spike gene with different VOCs and VOIs. Here, the amino acid profile was subset for the viral spike gene and the frequency of the mutations was calculated, again excluding frequencies below 1%. The amino acid mutation frequencies in the viral spike of A.27 and the different VOCs and VOIs were downloaded from outbreak.info70 on 2021-07-13. Outbreak.info calculates these frequencies based on all available sequence data from GISAID. We replaced the A.27 frequencies with our calculated frequencies, as these include data from both GISAID and the RKI, and visualized the mutation frequencies in the viral spike again with the R package pheatmap.
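The frequency filtering and the definition of lineage-defining mutations can be illustrated with a short sketch. The study used R (stringr, dplyr, pheatmap) on covSonar output; the Python below is only a stand-in to show the arithmetic. The function names, the profile representation as sets of labels, the example profiles and the reading of the threshold as "at least 75%" are assumptions.

```python
from collections import Counter


def mutation_frequencies(profiles):
    """Fraction of genomes carrying each mutation.

    `profiles` is a list with one set of mutation labels per genome.
    """
    counts = Counter()
    for profile in profiles:
        counts.update(set(profile))  # count each mutation once per genome
    n_genomes = len(profiles)
    return {mutation: count / n_genomes for mutation, count in counts.items()}


def drop_rare(frequencies, min_frequency=0.01):
    """Exclude mutations observed in fewer than `min_frequency` of genomes."""
    return {m: f for m, f in frequencies.items() if f >= min_frequency}


def lineage_defining(frequencies, threshold=0.75):
    """Mutations present in at least `threshold` of the profiles."""
    return sorted(m for m, f in frequencies.items() if f >= threshold)


# Hypothetical profiles for four genomes (frequencies are made up).
profiles = [
    {"S:L18F", "S:L452R", "S:N501Y"},
    {"S:L452R", "S:N501Y"},
    {"S:L452R", "S:N501Y"},
    {"S:L452R", "hypothetical:X1Y"},
]
freqs = drop_rare(mutation_frequencies(profiles))
defining = lineage_defining(freqs)
```

Separating the 1% display filter from the 75% definition threshold keeps the two cut-offs independently adjustable.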
Visualization of viral protein structures
The EM structure of the closed state of the trimeric SARS-CoV-2 spike protein (10.2210/pdb6vxx/pdb) and the dimeric 2.04 Å crystal structure of ORF8 (10.2210/pdb7JTL/pdb) were downloaded from the Protein Data Bank and visualized with UCSF ChimeraX version 1.1 (2020-09-09).
Cell culture
Virus isolation, cell culture and mouse infection experiments with SARS-CoV-2 were performed under Biosafety Level 3 protocols at the Institute of Virology, Freiburg, approved by the Regierungspraesidium Tuebingen (No. 25-27/8973.10-18 and UNI.FRK.05.16-29). Adherent African green monkey kidney VeroE6 cells (ATCC CRL-1586™) and human lung Calu3 cells (ATCC HTB-55™) were cultured in 1× Dulbecco’s modified Eagle medium (DMEM) containing 5% or 10% fetal calf serum (FCS), respectively. To isolate SARS-CoV-2 from patient material, filtered throat swab samples of patients with previous SARS-CoV-2 A.27 (EPI_ISL_3200835) or Delta variant B.1.617.2 infections (EPI_ISL_2535433) were inoculated on VeroE6 cells (2 × 10⁶ cells) in 4 ml DMEM with 2% FCS and incubated at 37 °C and 5% CO2 for 4–6 days until a cytopathic effect was visible. The culture supernatant was cleared and stored at −80 °C. Virus titres were determined by plaque assay on VeroE6 cells. Furthermore, the following SARS-CoV-2 isolates were used: Muc-IMB-1, lineage B.1 (EPI_ISL_406862 Germany/BavPat1/2020)32, kindly provided by Roman Woelfel, Bundeswehr Institute of Microbiology; Alpha variant B.1.1.7 (EPI_ISL_751799) and Beta variant B.1.351 (hCoV-19/Germany/NW-RKI-I-0029/2020; ID: EPI_ISL_803957), provided by Donata Hoffmann and Martin Beer, Friedrich-Loeffler-Institute, Riems; and Gamma variant P.1 (EPI_ISL_3980444) provided by Michael Schindler, Institute for Medical Virology and Epidemiology, Tuebingen. All virus stocks used for experiments were inspected for mutations compared to the parental virus isolate using whole genome Illumina sequencing. For the analysis of viral growth, VeroE6 or Calu3 cells were inoculated in six well plates with a moi of 0.001 and supernatant collected at 8 h, 24 h, 48 h and 72 h post-infection. Viral titres were then determined by plaque assay on VeroE6 cells.
BW5147 mouse thymoma cells (kindly provided by Ofer Mandelboim, Hadassah Hospital, Jerusalem, Israel) stably express human FcγR ectodomains genetically fused to the CD3ζ signalling module38. Cells were maintained at 3 × 10⁵ to 9 × 10⁵ cells/ml in Roswell Park Memorial Institute medium (RPMI GlutaMAX, Gibco) supplemented with 10% (vol/vol) FCS, sodium pyruvate (1×, Gibco), 100 U/ml penicillin-streptomycin (Gibco) and β-mercaptoethanol (0.1 mM, Gibco). Cells were cultured at 37 °C, 5% CO2. All cell lines were routinely tested for mycoplasma.
Evaluation of the neutralizing capacity of sera and monoclonal antibodies
Serological neutralization tests were performed with patient sera collected after resolved infection with SARS-CoV-2 or sera of vaccinated individuals ~10–50 days post-vaccination with the second dose of the BNT162b2 mRNA vaccine (Pfizer/BioNTech). Neutralizing antibody titres were determined by a plaque reduction assay. Serial twofold dilutions of the sera were incubated for 1 h with 100 pfu of the SARS-CoV-2 isolates. The serum-virus mixture was then used to infect VeroE6 for 90 min at room temperature. The inoculum was removed and the cells overlaid with 0.6% oxoid agar for 48 h at 37 °C. Cells were fixed with 3.7% formaldehyde and stained with crystal violet. The reduction in counted plaque numbers was determined in comparison to an untreated mock-infected control without serum. Neutralization titres of mAb were determined by incubation of the respective SARS-CoV-2 isolates with tenfold dilutions of the individual antibodies (10¹–10⁻⁴ µg/ml). The plaque reduction assay was performed as described above, replacing the serum dilutions with serial dilutions of the mAbs at concentrations of 10¹–10⁻⁴ µg/ml. To evaluate the neutralizing capacity and determine the neutralizing titre 50 (NT50), a non-linear least-squares regression (constraints: 0 and 100) was performed. For sera, the mean of each dilution for all sera was determined and plotted to visualize the overall tendency. The fold difference was calculated as the quotient of the NT50 for B.1 and A.27 for the individual sera.
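As an illustration of how an NT50 can be obtained from plaque reduction data, the sketch below fits a two-parameter logistic curve whose bottom and top are fixed at 0 and 100% by least squares. It is a pure-Python stand-in and not the fitting software or exact model used in the study; the curve parameterisation, the simple zooming grid search, the function names and the synthetic example data are all assumptions.

```python
import math


def neutralization(dilution, nt50, hill):
    """Predicted percent plaque reduction at a reciprocal serum dilution.

    The curve is constrained to run from 100% (low dilution) to 0% (high dilution).
    """
    return 100.0 / (1.0 + (dilution / nt50) ** hill)


def sum_of_squares(dilutions, observed, nt50, hill):
    """Least-squares objective for one parameter pair."""
    return sum((neutralization(d, nt50, hill) - y) ** 2
               for d, y in zip(dilutions, observed))


def fit_nt50(dilutions, observed, rounds=8, steps=20):
    """Estimate (NT50, Hill slope) with a repeatedly zoomed grid search."""
    log_lo = math.log10(min(dilutions)) - 1.0
    log_hi = math.log10(max(dilutions)) + 1.0
    hill_lo, hill_hi = 0.2, 5.0
    best_log, best_hill = log_lo, hill_lo
    for _ in range(rounds):
        log_step = (log_hi - log_lo) / steps
        hill_step = (hill_hi - hill_lo) / steps
        best_sse = float("inf")
        for i in range(steps + 1):
            log_nt50 = log_lo + i * log_step
            for j in range(steps + 1):
                hill = hill_lo + j * hill_step
                sse = sum_of_squares(dilutions, observed, 10.0 ** log_nt50, hill)
                if sse < best_sse:
                    best_sse, best_log, best_hill = sse, log_nt50, hill
        # Zoom in on a window of two grid steps around the current optimum.
        log_lo, log_hi = best_log - 2 * log_step, best_log + 2 * log_step
        hill_lo = max(0.05, best_hill - 2 * hill_step)
        hill_hi = best_hill + 2 * hill_step
    return 10.0 ** best_log, best_hill


def fold_difference(nt50_reference, nt50_variant):
    """Quotient of the reference NT50 (e.g. B.1) and the variant NT50 (e.g. A.27)."""
    return nt50_reference / nt50_variant


# Synthetic, noise-free example data (not measurements from the study).
dilutions = [20, 40, 80, 160, 320, 640, 1280]
observed = [neutralization(d, 160.0, 1.2) for d in dilutions]
nt50_estimate, hill_estimate = fit_nt50(dilutions, observed)
```

A gradient-based optimizer would be the more usual choice; the grid search is used here only to keep the example free of external dependencies.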
Fc receptor activation assay
Virus stocks were concentrated by ultracentrifugation at 100,000 × g for 2 h at 4 °C and the pellet was dissolved in PBS. The concentrated virus stocks were inactivated with 0.1% β-propiolactone for 16 h at 4 °C followed by 2 h at 37 °C. IgG or the inactivated virions were titrated in PBS and incubated on an ELISA plate for 1 h at 37 °C for coating. Plates were then blocked in PBS with 10% FCS for 1 h at RT. Immobilized virions were opsonized by incubation with 20 ng/µl of the respective mAbs (IgG) for 2 h at RT, followed by incubation with mouse BW5147 reporter cells stably expressing human FcγR ectodomains genetically fused to CD3ζ for 16 h in an incubator (37 °C, 5% CO2). Immobilized IgG was incubated with reporter cells directly. Secreted mIL-2 was quantified via anti-mIL-2 sandwich ELISA as described previously38,71.
Whole genome sequencing
cDNA was produced from extracted RNA of oropharyngeal swab or cell culture supernatant samples using random hexamer primers and Superscript III (ThermoFisher) followed by a PCR tiling the entire SARS-CoV-2 genome (ARTIC V3 primer sets; https://github.com/artic-network/artic-ncov2019). This produced ~400 bp long, overlapping amplicons that were subsequently used to prepare the sequencing library. The amplicons were purified with AMPure magnetic beads (Beckman Coulter). Afterwards the QIAseq FX DNA Library Kit (Qiagen) was used to prepare indexed paired-end libraries for Illumina sequencing. Normalized and pooled sequencing libraries were denatured with 0.2 N NaOH. These libraries were sequenced on an Illumina MiSeq using the 300-cycle MiSeq Reagent Kit v2.
The de-multiplexed raw reads were subjected to a custom Galaxy pipeline, which is based on bioinformatics pipelines on usegalaxy.eu72. The raw reads were pre-processed with fastp v.0.20.173 and mapped to the SARS-CoV-2 Wuhan-Hu-1 reference genome (Genbank: NC_045512) using BWA-MEM v.0.7.1774. For datasets produced with the ARTIC v3 protocol, primer sequences were trimmed with ivar trim v1.9 (https://andersen-lab.github.io/ivar/html/manualpage.html). Variants (SNPs and INDELs) were called with the ultrasensitive variant caller LoFreq v2.1.575, demanding a minimum base quality of 30 and a coverage of at least 20×. Afterwards, the called variants were filtered based on a minimum variant frequency of 10% and on the support of strand bias. The effects of the mutations were automatically annotated in the vcf files with SnpEff v.4.3.176. Finally, consensus sequences were constructed with bcftools v.1.1.077. Regions with low coverage (<20×) or variant frequencies between 30 and 70% were masked with N. The final consensus sequences have been deposited in the GISAID database (www.gisaid.org).
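The masking rule at the end of this workflow can be sketched as follows. The snippet is a simplified illustration rather than the Galaxy/bcftools implementation: it handles single-nucleotide positions only, the function name and input layout are assumptions, low coverage is taken to mean fewer than 20 reads (in line with the 20× minimum used for variant calling), and the 30 to 70% bounds are treated as inclusive.

```python
def mask_consensus(consensus, coverage, variant_freqs,
                   min_coverage=20, ambiguous=(0.30, 0.70)):
    """Replace unreliable consensus positions with 'N'.

    consensus     -- consensus sequence as a string
    coverage      -- read depth per position (same length as consensus)
    variant_freqs -- {0-based position: allele frequency} for called SNPs
    """
    if len(consensus) != len(coverage):
        raise ValueError("consensus and coverage must have the same length")
    low, high = ambiguous
    masked = []
    for position, base in enumerate(consensus):
        frequency = variant_freqs.get(position)
        too_shallow = coverage[position] < min_coverage
        mixed = frequency is not None and low <= frequency <= high
        masked.append("N" if too_shallow or mixed else base)
    return "".join(masked)


# Hypothetical toy input (not data from the study).
toy_consensus = "ACGTACGT"
toy_coverage = [50, 50, 10, 50, 50, 50, 19, 20]
toy_variants = {1: 0.5, 4: 0.95, 5: 0.3}
toy_masked = mask_consensus(toy_consensus, toy_coverage, toy_variants)
```

Masking intermediate-frequency sites avoids committing the consensus to either allele when the sample may contain a mixed population or contamination.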
The clades of the reconstructed viral genomes were classified with the Pangolin webserver (pangolin.cog-uk.io). An in-house R script was also used to plot the variant frequencies that were detected by LoFreq as a heatmap (github.com/jonas-fuchs/SARS-CoV-2-analyses). This tool is also available on usegalaxy.eu (“Variant Frequency Plot”).
Immunofluorescence analysis
VeroE6 cells seeded on glass coverslips were either infected with SARS-CoV-2 isolates at a moi of 0.1 or left uninfected. At 8 h post infection, cells were fixed in 4% paraformaldehyde in PBS, permeabilized with 0.3% Triton X-100 and blocked in 10% FCS. SARS-CoV-2 N- (Rockland #200-401-A50, 1:1000), S- (Rockland #600-401-MS8, 1:250) and ORF3a-specific primary antibodies (https://mrcppu-covid.bio/, 1:100) and AF568-labeled goat-anti-rabbit (Invitrogen, #A11011, 1:400) secondary antibody as well as AF488-labeled Phalloidin (Hypermol, #8813-01, 1:400) were used for staining. The coverslips were embedded in Diamond Antifade Mountant with 4′,6-diamidino-2-phenylindole (DAPI) (ThermoFisher, #P36971). Fluorescence images were generated using a LSM800 confocal laser-scanning microscope (Zeiss) equipped with a 63X, 1.4 NA oil objective and Airyscan detector and processed with Zen blue software (Zeiss) and ImageJ/Fiji.
Infection of K18-hACE2 transgenic mice
Transgenic (K18-hACE2)2Prlmn mice36 were purchased from The Jackson Laboratory and bred locally. Hemizygous 8–12-week-old males were used in accordance with the guidelines of the Federation for Laboratory Animal Science Associations and the National Animal Welfare Body. All experiments were in compliance with the German animal protection law and approved by the animal welfare committee of the Regierungspraesidium Freiburg (permit G-20/91). Mice were anesthetized using isoflurane and infected intranasally (i.n.) with virus dilutions in 40 µl PBS containing 0.1% BSA. Mice were monitored daily and euthanized if severe symptoms were observed or body weight loss exceeded 25% of the initial weight.
Plotting and statistical analysis
All plots and statistics were generated with GraphPad Prism v8.4.2 or RStudio (R version 4.0.2).
Ethical statement
The project has been approved by the ethical committee of the Albert-Ludwigs-Universität, Freiburg, Germany. Written informed consent was obtained from all participants and the study was conducted according to federal guidelines, local ethics committee regulations (Albert-Ludwigs-Universität, Freiburg, Germany: No. F-2020-09-03-160428 and no. 322/20) and the Declaration of Helsinki (1975). All routine virological laboratory testing of patient specimens (virus isolation and next-generation sequencing) was performed in the Diagnostic Department of the Institute of Virology, University Medical Center, Freiburg (Local ethics committee no. 1001913). Convalescent sera and sera of vaccinees were obtained from the Hepatology-Gastroenterology-Biobank as part of the Freeze-Biobank Consortium at the University Medical Center Freiburg. Written informed consent was obtained from all blood donors prior to inclusion.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Acknowledgements
We thank Roman Woelfel (Bundeswehr Institute of Microbiology) for providing the B.1 (Muc-IMB-1) isolate; Donata Hoffmann and Martin Beer (Friedrich-Loeffler-Institut, Insel Riems) for providing the B.1.1.7 and B.1.351 isolates; Michael Schindler (Institute for Medical Virology and Epidemiology, Tuebingen) for providing the P.1 isolate; Markus Hoffmann (Goettingen) for the Calu-3 cells; Todd Giardiello (Rockland Immunochemicals PA) for providing anti-N and anti-S specific rabbit antisera; and James Hastie (MRC Protein Phosphorylation and Ubiquitylation Unit, College of Life Sciences, University of Dundee) for providing polyclonal sheep antibody targeting ORF3a (see https://mrcppu-covid.bio/). We gratefully acknowledge the authors from the originating laboratories responsible for obtaining the specimens and the submitting laboratories where genetic sequence data were generated and shared via the DESH hub of the RKI and GISAID (Supplementary Data 2 and Supplementary Table 3). We acknowledge the contribution of all local and state public health authorities, laboratories, and health workforce who have submitted COVID-19 case-based data to the German notification system. We would like to thank Bas B. Oude Munnink (Department of Viroscience, WHO Collaborating Centre for Arbovirus and Viral Hemorrhagic Fever Reference and Research, Rotterdam, Netherlands) for providing travel history information of A.27 cases. We are furthermore grateful for the sequencing efforts from our colleagues Judd F. Hultquist, Ramon Lorenzo-Redondo and Lacy M. Simons (Center for Pathogen Genomics and Microbial Evolution, Institute for Global Health, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA), Olubusuyi M. Adewumi (Department of Virology, College of Medicine, University of Ibadan, Ibadan, Nigeria), Dr. Halatoko Wemboo INH (Institut National d’Hygiene, Lomé, Togo), Dagnra Anoumou Yaotsè (Biolim Université de Lomé, Lomé, Togo), Ilhem Boutiba, Riadh Gouider, Sameh Trabelsi, and Henda Triki (Laboratory of BioInformatics, bioMathematics and bioStatistics (BIMS)) and Dr. Justin Lee (CDC Atlanta). We would furthermore like to acknowledge the excellent technical assistance of Valentina Wagner and Annette Ohnemus. The authors are grateful to Zsolt Ruzsics, Walter Haas and Otto Haller for helpful comments on the paper. This work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation, grant number PA 2274/4-1), by the Bundesministerium fuer Bildung und Forschung (BMBF) through the Deutsches Zentrum fuer Luft- und Raumfahrt, Germany to M.P. and M.S. (DLR, grant number 01KI2077) and to A.S.O. and C.A.K. (ANDEMIA; grant number 01KA1606). S.L.H. acknowledges support from the Research Foundation - Flanders (“Fonds voor Wetenschappelijk Onderzoek - Vlaanderen,” G0D5117N). G.B. acknowledges support from the Internal Funds KU Leuven (Grant No. C14/18/094). G.B. and N.B. acknowledge support from the Research Foundation - Flanders (“Fonds voor Wetenschappelijk Onderzoek - Vlaanderen,” G0E1420N, G098321N). The funders had no role in the study design, data analysis, data interpretation, and in the writing of this report.
Author contributions
J.F., M.H., M.P., L.K., and T.K. designed the study and contributed to experiment design and data interpretation. M.H., J.F., S.L.H., N.B., S.C., S.K., and G.B. collected sequence and associated metadata. S.L.H., N.B. and G.B. performed phylogeographic analyses. J.F., L.K., M.H., S.K., S.C., and A.W. performed statistical analyses of patient metadata or analyzed next-generation sequencing data. A.P., A.L., M.Sa., N.N., J.K.O., R.R., A.B., C.A.K., A.O., E.S.L., V.E., and E.A.O. provided sequencing data of SARS-CoV-2. T.K., L.K., J.B., D.S., S.U., S.W., G.K., P.K., M.H.U., and L.J. performed experiments and analysed the data. J.F., L.K., T.K., P.K., S.L.H., and N.B. wrote the paper. J.F., G.D., and S.L.H. visualized the data. M.P., M.Sc., and G.K. were involved in funding acquisition.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Data availability
All necessary data and information are given in the paper. Source data are provided with this paper. Input XML files of the phylogeographic analysis are supplied in Supplementary Data 1. The sequence data were submitted to the GISAID database and are publicly available (Supplementary Table 2). Note that due to sequencing or reconstruction errors (e.g., causing frameshifts) not all A.27 genome sequences obtained from external laboratories could be uploaded to GISAID. However, all sequences and metadata obtained from the RKI are also available via https://github.com/robert-koch-institut/SARS-CoV-2-Sequenzdaten_aus_Deutschland, including all A.27 sequences used in this study (Supplementary Data 2). Raw sequencing data have been submitted to the European Nucleotide Archive (https://www.ebi.ac.uk/ena/browser) under the study accession number ERP134884.
Code availability
The script to visualize the variant frequencies is publicly available (github.com/jonas-fuchs/SARS-CoV-2-analyses, v1.0) and implemented on usegalaxy.eu (Variant Frequency Plot).
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Tamara Kaleta, Lisa Kern, Samuel Leandro Hong.
Contributor Information
Marcus Panning, Email: marcus.panning@uniklinik-freiburg.de.
Jonas Fuchs, Email: jonas.fuchs@uniklinik-freiburg.de.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-022-28766-y.
References
- 1. Wang Q, et al. Structural and functional basis of SARS-CoV-2 entry by using human ACE2. Cell. 2020;181:894–904. doi: 10.1016/j.cell.2020.03.045.
- 2. Hoffmann M, et al. SARS-CoV-2 cell entry depends on ACE2 and TMPRSS2 and is blocked by a clinically proven protease inhibitor. Cell. 2020;181:271–280. doi: 10.1016/j.cell.2020.02.052.
- 3. Ke Z, et al. Structures and distributions of SARS-CoV-2 spike proteins on intact virions. Nature. 2020;588:498–502. doi: 10.1038/s41586-020-2665-2.
- 4. Lan J, et al. Structure of the SARS-CoV-2 spike receptor-binding domain bound to the ACE2 receptor. Nature. 2020;581:215–220. doi: 10.1038/s41586-020-2180-5.
- 5. McCallum M, et al. N-terminal domain antigenic mapping reveals a site of vulnerability for SARS-CoV-2. Cell. 2021;184:2332–2347.e16. doi: 10.1016/j.cell.2021.03.028.
- 6. Greaney AJ, et al. Comprehensive mapping of mutations in the SARS-CoV-2 receptor-binding domain that affect recognition by polyclonal human plasma antibodies. Cell Host Microbe. 2021;29:463–476. doi: 10.1016/j.chom.2021.02.003.
- 7. Gottlieb RL, et al. Effect of bamlanivimab as monotherapy or in combination with etesevimab on viral load in patients with mild to moderate COVID-19: a randomized clinical trial. JAMA. 2021;325:632–644. doi: 10.1001/jama.2021.0202.
- 8. Weinreich DM, et al. REGN-COV2, a neutralizing antibody cocktail, in outpatients with Covid-19. N. Engl. J. Med. 2021;384:238–251. doi: 10.1056/NEJMoa2035002.
- 9. Korber B, et al. Tracking changes in SARS-CoV-2 spike: evidence that D614G increases infectivity of the COVID-19 virus. Cell. 2020;182:812–827. doi: 10.1016/j.cell.2020.06.043.
- 10. Zhou B, et al. SARS-CoV-2 spike D614G change enhances replication and transmission. Nature. 2021;592:122–127. doi: 10.1038/s41586-021-03361-1.
- 11. Lustig Y, et al. Neutralising capacity against Delta (B.1.617.2) and other variants of concern following Comirnaty (BNT162b2, BioNTech/Pfizer) vaccination in health care workers, Israel. Eurosurveillance. 2021;26:2100557. doi: 10.2807/1560-7917.ES.2021.26.26.2100557.
- 12. Geers D, et al. SARS-CoV-2 variants of concern partially escape humoral but not T-cell responses in COVID-19 convalescent donors and vaccinees. Sci. Immunol. 2021;6:eabj1750.
- 13. Bager P, et al. Risk of hospitalisation associated with infection with SARS-CoV-2 lineage B.1.1.7 in Denmark: an observational cohort study. Lancet Infect. Dis. 2021;21:1507–1517.
- 14. Ali F, Kasry A, Amin M. The new SARS-CoV-2 strain shows a stronger binding affinity to ACE2 due to N501Y mutant. Med. Drug Discov. 2021;10:100086. doi: 10.1016/j.medidd.2021.100086.
- 15. Gu H, et al. Adaptation of SARS-CoV-2 in BALB/c mice for testing vaccine efficacy. Science. 2020;369:1603–1607. doi: 10.1126/science.abc4730.
- 16. Jangra S, et al. SARS-CoV-2 spike E484K mutation reduces antibody neutralisation. Lancet Microbe. 2021;2:e283–e284.
- 17. Deng X, et al. Transmission, infectivity, and neutralization of a spike L452R SARS-CoV-2 variant. Cell. 2021;184:3426–3437.e8.
- 18. Li Q, et al. The impact of mutations in SARS-CoV-2 spike on viral infectivity and antigenicity. Cell. 2020;182:1284–1294. doi: 10.1016/j.cell.2020.07.012.
- 19. Mallm J-P, et al. Local emergence and decline of a SARS-CoV-2 variant with mutations L452R and N501Y in the spike protein. medRxiv. 2021. doi: 10.1101/2021.04.27.21254849.
- 20. Colson P, et al. Spreading of a new SARS-CoV-2 N501Y spike variant in a new lineage. Clin. Microbiol. Infect. 2021;27:1352.e1–1352.e5.
- 21. E., S.-L. Potential new lineage causing a cluster in Mayotte 2021 [194] https://github.com/cov-lineages/pango-designation/issues/11.
- 22. Shu Y, McCauley J. GISAID: Global initiative on sharing all influenza data–from vision to reality. Eurosurveillance. 2017;22:30494. doi: 10.2807/1560-7917.ES.2017.22.13.30494.
- 23. Duchene S, et al. Temporal signal and the phylodynamic threshold of SARS-CoV-2. Virus Evol. 2020;6:veaa061. doi: 10.1093/ve/veaa061.
- 24. Lemey P, et al. Accommodating individual travel history and unsampled diversity in Bayesian phylogeographic inference of SARS-CoV-2. Nat. Commun. 2020;11:1–14. doi: 10.1038/s41467-020-18877-9.
- 25. Kern DM, et al. Cryo-EM structure of SARS-CoV-2 ORF3a in lipid nanodiscs. Nat. Struct. Mol. Biol. 2021;28:573–582.
- 26. Flower TG, et al. Structure of SARS-CoV-2 ORF8, a rapidly evolving immune evasion protein. Proc. Natl Acad. Sci. 2021;118:e2021785118. doi: 10.1073/pnas.2021785118.
- 27. Cerutti G, et al. Potent SARS-CoV-2 neutralizing antibodies directed against spike N-terminal domain target a single supersite. Cell Host Microbe. 2021;29:819–833. doi: 10.1016/j.chom.2021.03.005.
- 28. Yan R, et al. Structural basis for the recognition of SARS-CoV-2 by full-length human ACE2. Science. 2020;367:1444–1448. doi: 10.1126/science.abb2762.
- 29. Brouwer PJM, et al. Potent neutralizing antibodies from COVID-19 patients define multiple targets of vulnerability. Science. 2020;369:643–650. doi: 10.1126/science.abc5902.
- 30. Premkumar L, et al. The receptor binding domain of the viral spike protein is an immunodominant and highly specific target of antibodies in SARS-CoV-2 patients. Sci. Immunol. 2020;5:eabc8413.
- 31. Starr TN, Greaney AJ, Dingens AS, Bloom JD. Complete map of SARS-CoV-2 RBD mutations that escape the monoclonal antibody LY-CoV555 and its cocktail with LY-CoV016. Cell Rep. Med. 2021;2:100255. doi: 10.1016/j.xcrm.2021.100255.
- 32. Böhmer MM, et al. Investigation of a COVID-19 outbreak in Germany resulting from a single travel-associated primary case: a case series. Lancet Infect. Dis. 2020;20:920–928. doi: 10.1016/S1473-3099(20)30314-5.
- 33. de Souza GAP, et al. Emerging SARS-CoV-2 Genotypes Show Different Replication Patterns in Human Pulmonary and Intestinal Epithelial Cells. Viruses. 2022;14:23. doi: 10.3390/v14010023.
- 34. Zhang Y, et al. The SARS-CoV-2 protein ORF3a inhibits fusion of autophagosomes with lysosomes. Cell Discov. 2021;7:1–12. doi: 10.1038/s41421-021-00268-z.
- 35. Silvas JA, et al. Contribution of SARS-CoV-2 accessory proteins to viral pathogenicity in K18 hACE2 transgenic mice. J. Virol. 2021;JVI-00402.
- 36. Winkler ES, et al. SARS-CoV-2 infection of human ACE2-transgenic mice causes severe lung inflammation and impaired function. Nat. Immunol. 2020;21:1327–1335. doi: 10.1038/s41590-020-0778-2.
- 37. Chen P, et al. SARS-CoV-2 neutralizing antibody LY-CoV555 in outpatients with Covid-19. N. Engl. J. Med. 2021;384:229–237. doi: 10.1056/NEJMoa2029849.
- 38. Kolb P, et al. Human cytomegalovirus antagonizes activation of Fcγ receptors by distinct and synergizing modes of IgG manipulation. Elife. 2021;10:e63877. doi: 10.7554/eLife.63877.
- 39. Bugembe DL, et al. Emergence and spread of a SARS-CoV-2 lineage A variant (A.23.1) with altered spike protein in Uganda. Nat. Microbiol. 2021;6:1094–1101.
- 40.Butera Y, et al. Genomic sequencing of SARS-CoV-2 in Rwanda reveals the importance of incoming travelers on lineage diversity. Nat. Commun. 2021;12:5705. doi: 10.1038/s41467-021-25985-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Anoh, E. A. et al. SARS-CoV-2 variants of concern, variants of interest and lineage A. 27 are on the rise in Côte d’Ivoire. medRxiv10.1101/2021.05.06.21256282 (2021).
- 42.Pirnay J-P, et al. Variant Analysis of SARS-CoV-2 Genomes from Belgian Military Personnel Engaged in Overseas Missions and Operations. Viruses. 2021;13:1359. doi: 10.3390/v13071359. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Zhang J, et al. Structural impact on SARS-CoV-2 spike protein by D614G substitution. Sci. (80-.) 2021;372:525–530. doi: 10.1126/science.abf2303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Ozono S, et al. SARS-CoV-2 D614G spike mutation increases entry efficiency with enhanced ACE2-binding affinity. Nat. Commun. 2021;12:1–9. doi: 10.1038/s41467-021-21118-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Pereira F. Evolutionary dynamics of the SARS-CoV-2 ORF8 accessory gene. Infect. Genet. Evol. 2020;85:104525. doi: 10.1016/j.meegid.2020.104525. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Ren Y, et al. The ORF3a protein of SARS-CoV-2 induces apoptosis in cells. Cell. Mol. Immunol. 2020;17:881–883. doi: 10.1038/s41423-020-0485-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Hoffmann, M. et al. SARS-CoV-2 variant B. 1.617 is resistant to Bamlanivimab and evades antibodies induced by infection and vaccination. Cell Rep. 36, 109415 (2021). [DOI] [PMC free article] [PubMed]
- 48.Suryadevara N, et al. Neutralizing and protective human monoclonal antibodies recognizing the N-terminal domain of the SARS-CoV-2 spike protein. Cell. 2021;184:2316–2331. doi: 10.1016/j.cell.2021.03.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Amanat, F. et al. SARS-CoV-2 mRNA vaccination induces functionally diverse antibodies to NTD, RBD, and S2. Cell184, 3936–3948.e10 (2021). [DOI] [PMC free article] [PubMed]
- 50.Wu J, et al. The Antigenicity of Epidemic SARS-CoV-2 Variants in the United Kingdom. Front. Immunol. 2021;12:2205. doi: 10.3389/fimmu.2021.687869. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Rambaut A, et al. A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat. Microbiol. 2020;5:1403–1407. doi: 10.1038/s41564-020-0770-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.O’Toole Á, et al. Assignment of epidemiological lineages in an emerging pandemic using the pangolin tool. Virus Evol. 2021;7:veab064. doi: 10.1093/ve/veab064. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Krause G, et al. SurvNet electronic surveillance system for infectious disease outbreaks, Germany. Emerg. Infect. Dis. 2007;13:1548. doi: 10.3201/eid1310.070253. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Tennekes M. tmap: Thematic Maps in R. J. Stat. Softw. 2018;84:1–39. [Google Scholar]
- 55.Hadfield J, et al. Nextstrain: real-time tracking of pathogen evolution. Bioinformatics. 2018;34:4121–4123. doi: 10.1093/bioinformatics/bty407. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Minh BQ, et al. IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 2020;37:1530–1534. doi: 10.1093/molbev/msaa015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Sagulenko P, Puller V, Neher RA. TreeTime: Maximum-likelihood phylodynamic analysis. Virus Evol. 2018;4:vex042. doi: 10.1093/ve/vex042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Suchard MA, et al. Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evol. 2018;4:vey016. doi: 10.1093/ve/vey016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Lemey P, Rambaut A, Drummond AJ, Suchard MA. Bayesian phylogeography finds its roots. PLoS Comput. Biol. 2009;5:e1000520. doi: 10.1371/journal.pcbi.1000520. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.De Maio N, Wu C-H, O’Reilly KM, Wilson D. New routes to phylogeography: a Bayesian structured coalescent approximation. PLoS Genet. 2015;11:e1005421. doi: 10.1371/journal.pgen.1005421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Lynch, S. M. Evaluating Markov Chain Monte Carlo Algorithms and Model Fit. in Introduction to Applied Bayesian Statistics and Estimation for Social Scientists 131–164 (Springer, 2007).
- 62.Ayres DL, et al. BEAGLE 3: improved performance, scaling, and usability for a high-performance computing library for statistical phylogenetics. Syst. Biol. 2019;68:1052–1061. doi: 10.1093/sysbio/syz020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Minin VN, Suchard MA. Counting labeled transitions in continuous-time Markov models of evolution. J. Math. Biol. 2008;56:391–412. doi: 10.1007/s00285-007-0120-8. [DOI] [PubMed] [Google Scholar]
- 64.Yang Z. Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: approximate methods. J. Mol. Evol. 1994;39:306–314. doi: 10.1007/BF00160154. [DOI] [PubMed] [Google Scholar]
- 65.Drummond AJ, Ho SYW, Phillips MJ, Rambaut A. Relaxed phylogenetics and dating with confidence. PLoS Biol. 2006;4:e88. doi: 10.1371/journal.pbio.0040088. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Baele, G., Gill, M. S., Lemey, P. & Suchard, M. A. Hamiltonian Monte Carlo sampling to estimate past population dynamics using the skygrid coalescent model in a Bayesian phylogenetics framework. Wellcome Open Res. 5, (2020). [DOI] [PMC free article] [PubMed]
- 67.Ferreira MAR, Suchard MA. Bayesian analysis of elapsed times in continuous‐time Markov chains. Can. J. Stat. 2008;36:355–368. [Google Scholar]
- 68.Lauer SA, et al. The incubation period of coronavirus disease 2019 (COVID-19) from publicly reported confirmed cases: estimation and application. Ann. Intern. Med. 2020;172:577–582. doi: 10.7326/M20-0504. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Rambaut A, Drummond AJ, Xie D, Baele G, Suchard MA. Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Syst. Biol. 2018;67:901. doi: 10.1093/sysbio/syy032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Alaa Abdel Latif, Julia L. et al. Hughes, and the C. for V. S. B. Outbreak.info, (https://outbreak.info/compare-lineages).
- 71.Corrales-Aguilar E, et al. A novel assay for detecting virus-specific antibodies triggering activation of Fcγ receptors. J. Immunol. Methods. 2013;387:21–35. doi: 10.1016/j.jim.2012.09.006. [DOI] [PubMed] [Google Scholar]
- 72.Kumar, A., Bangash, A. H. & Gruening, B. Community Research Amid COVID-19 Pandemic: Genomics Analysis of SARS-CoV-2 over Public GALAXY server. 2020050343 10.20944/preprints202005.0343.v1 (2020).
- 73.Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34:i884–i890. doi: 10.1093/bioinformatics/bty560. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Li H, Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Wilm, A. et al. LoFreq: a sequence-quality aware, ultra-sensitive variant caller for uncovering cell-population heterogeneity from high-throughput sequencing datasets. Nucleic Acids Res. (2012) 10.1093/nar/gks918. [DOI] [PMC free article] [PubMed]
- 76.Cingolani P, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. (Austin) 2012;6:80–92. doi: 10.4161/fly.19695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Li H, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
Data Availability Statement
All necessary data and information are given in the paper. Source data are provided with this paper. Input XML files of the phylogeographic analysis are supplied in Supplementary Data 1. The sequence data were submitted to the GISAID database and are publicly available (Supplementary Table 2). Note that, due to sequencing or reconstruction errors (e.g., causing frameshifts), not all A.27 genome sequences obtained from external laboratories could be uploaded to GISAID. However, all sequences and metadata obtained from the RKI, including all A.27 sequences used in this study (Supplementary Data 2), are also available via https://github.com/robert-koch-institut/SARS-CoV-2-Sequenzdaten_aus_Deutschland. Raw sequencing data have been submitted to the European Nucleotide Archive (https://www.ebi.ac.uk/ena/browser) under the study accession number ERP134884.
The script to visualize the variant frequencies is publicly available (github.com/jonas-fuchs/SARS-CoV-2-analyses, v1.0) and implemented on usegalaxy.eu (Variant Frequency Plot).