Graphical abstract
Keywords: SARS-CoV-2, Virus evolution, N-terminal domain mutations, Immune escape
Abstract
During the past two years, the world has been ravaged by a global pandemic caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Acquired mutations in the SARS-CoV-2 genome affecting virus infectivity and/or immunogenicity have led to a number of novel strains with higher transmissibility compared to the original Wuhan strain. Mutations in the receptor binding domain (RBD) of the SARS-CoV-2 spike protein have been extensively studied in this context. However, mutations and deletions within the N-terminal domain (NTD) located adjacent to the RBD are less studied. Many of these are found within certain β sheet-linking loops, which are surprisingly long in SARS-CoV-2 in comparison to SARS-CoV and other related β coronaviruses. Here, we perform a structural and epidemiological study of novel strains carrying mutations and deletions within these loops. We identify short and long-distance interactions that stabilize the NTD loops and form a critical epitope that is essential for the recognition by a wide variety of neutralizing antibodies from convalescent plasma. Among the different mutations/deletions found in these loops, Ala 67 and Asp 80 mutations as well as His 69/Val 70 and Tyr 144 deletions have been identified in different fast-spreading strains. Similarly, deletions in amino acids 241–243 and 246–252 have been found to affect the network of NTD loops in strains with high transmissibility. Our structural findings provide insight regarding the role of these mutations/deletions in altering the epitope structure and thus affecting the immunoreactivity of the NTD region of spike protein.
Research in context
Evidence before the study
The COVID-19 worldwide pandemic is already into the fourth wave, with novel highly aggressive strains expanding rapidly. Mutations in the RBD of the SARS-CoV-2 spike protein have been extensively studied in the context of higher infectability. However, accumulated evidence underlines the critical role of NTD mutations and deletions in the immunogenicity of the spike protein.
Added value of this study
In this study we find that certain loops within the N-terminal domain of the SARS-CoV-2 spike protein have evolutionary diverged in comparison to other beta-coronaviruses and particularly SARS-CoV. These highly flexible loops are in close proximity and contribute to various interactions that stabilize a surface-exposed tertiary structure. A super-epitope recognized by most neutralizing antibodies from convalescent plasma is formed and stabilized by specific amino acid residues within these loops, implying a critical role for epitope structure. For their length, these loops accumulate a disproportionably high number of mutations, driving SARS-CoV-2 evolution. Mutations and deletions affecting these amino acid residues are predicted to promote structural changes to the super-epitope region. Such mutations/deletions are common in fast-spreading SARS-CoV-2 strains associated with immune system escape.
Implications of all the available evidence
Together with available experimental data, our findings implicate mutations and deletions of NTD loop regions in immune evasion, and provide insights into the design of novel antibodies effective against these emerging strains.
1. Introduction
Coronaviruses (family: Coronaviridae) are RNA viruses with unusually large genome (∼30 Kb), which is of positive or plus-sense [1]. Coronaviruses are divided into four subtypes (α, β, γ and δ) and can infect various hosts from birds to mammals, and cause severe morbidity and mortality. They came under scrutiny in the past two decades due to recurrent incidents of widespread infection of humans. The most common human coronaviruses are the subtypes α 229E and NL63, and subtypes β HKU1 and OC43, which infect the respiratory tract of people throughout the globe causing common cold symptoms. More recently, β coronavirus strains of zoonotic (animal) origin have emerged as serious threats of human life: MERS-CoV (Middle East Respiratory Syndrome, MERS), SARS-CoV (Severe Acute Respiratory Syndrome, SARS) and SARS-CoV-2 (Coronavirus Disease 2019, COVID-19) [2]. Although the overall mortality of MERS and SARS-CoV is much higher, SARS-CoV-2, which emerged in the Wuhan province in China [3], has infected over 205,5 million people and caused close to 4.3 million deaths worldwide (https://coronavirus.jhu.edu/map.html).
SARS-CoV and SARS-CoV-2 share almost 80% sequence identity and both enter the host cells through interaction of the S (spike) protein with the angiotensin-converting enzyme 2 (ACE2) [4]. The S-protein shares 80% protein similarity between the two SARS strains and binds ACE2 with a similar affinity [5].
It is widely accepted that genetic variability and evolution within the positive strand RNA viruses is mainly driven by the low fidelity of RNA replication, as the RNA-dependent RNA polymerase (RdRP) is prone to high error rates [6]. Unlike other RNA viruses, where replication is primarily dependent on the RdRp, in coronaviruses, non-structural proteins (nsps) that include processivity factors (nsp7-8), a helicase (nsp13), a single-strand binding protein (nsp9), a proofreading exonuclease (nsp14) and other cofactors (e.g. nsp 10, nsp16) form a replication complex with the RdRp (that has proof-reading activity and corrects errors by the viral RdRp [7], [8]. As a result, coronaviruses are characterized by a 10-fold lower mutation rate compared to other RNA viruses. Despite the proofreading activity in RNA replication , the estimated mutation rate for SARS-CoV is 4x10-4 nucleotide substitutions/site/year [9], while for SARS-CoV-2 it is 1.12 × 10−3 mutations per site/year [10].
Recent studies have provided strong evidence that the SARS-CoV-2 spike protein NTD contains epitopes recognized by neutralizing antibodies produced by host adaptive immune response [11]. Moreover, it has been proposed that this region of the spike protein is dynamically involved in host cell surface adhesion, mediating interactions with glycan groups of the cellular glycoenvironment [12]. Therefore, tracking genetic variation in SARS-CoV-2 NTD is important for monitoring emerging strains with potentially higher capability for immune escape or with higher infectivity. To this direction, we investigated the evolution of NTD in β coronaviruses. By comparing the SARS-CoV and SARS-CoV-2 spike structures and analyzing the available mutation data for SARS-CoV-2, we discovered that specific NTD loop elements o which are evolutionary diverged in the SARS-CoV-2 clade, display high mutation rates and drive genetic variation. Interestingly, specific mutations and deletions in these loops are associated with an altered NTD structure, and affect an epitope region that is common among many different antibodies targeting the NTD region of the spike protein. The corresponding epidemiological data for these mutants revealed that specific NTD mutations and deletions have been positively selected over the last waves of COVID-19 pandemic.
2. Methods
2.1. Sequence alignment and secondary structure analysis
For sequence alignment of the NTD in SARS-related coronaviruses, spike protein sequences from SARS-CoV (PDB ID: 5X58), SARS-CoV-2(PDB ID: 6VSB), Bat-SL-CoVZC45 (ID: AVP78031), Bat-SL-CoVZXC21 (ID: AVP78042), Bat-SL-CoVBM48 (ID: NC_014470.1), Bat-SL-CoVHKU3 (ID: Q3LZX1.1) and Bat-SL-CoVRp3/2004 (ID: Q3I5J5.1) were used. Multiple sequence alignment was performed with the Clustal Omega program (version 1.2.4) [13]. For multiple sequence alignment visualization, the Jalview 2.11 software (http://www.jalview.org/) was used. To investigate the structural divergence of the SARS-CoV-2 NTD with respect to SARS-CoV, the secondary structure of the SARS-CoV NTD, as resolved by the cryo-EM structure (PDB ID: 5X4S), was compared to the cryo-EM structures of SARS-CoV-2 (PDB ID: 6VYB). Phylogenetic tree generation was performed using the ClustalW2 program.
2.2. Structural analysis and interloop interactions in SARS-CoV-2 NTD
The COVID-3D online platform (http://biosig.unimelb.edu.au/covid3d/), which implements the Arpeggio program for the calculation of interatomic interactions [14], was used for the visualization of residue interactions between the β3-β4, β9-β10 and β14-β15 loops. This analysis was performed on PDB files corresponding to the cryo-EM structure of SARS-CoV-2 at closed (PDB ID: 6VXX) and open (PDB ID: 6VSB) state after molecular modelling. According to the provided information by the COVID-3D platform, the structures were optimized by Maestro (https://www.schrodinger.com/maestro, Schrodinger suite, v. 2021-1) generating the PDB structures QHD43416_CLOSED (closed state) and QHD4316_ACE2_BOAT (open state) [15]. Surface electrostatic partial charges were generated with the Poisson-Boltzmann (APBS) method [16]. Visualization of SARS-CoV-2 spike protein trajectories was also performed using the BioExcel-CV19 platform designed to provide web-access to atomistic-MD trajectories for macromolecules involved in the COVID-19 disease (https://bioexcel-cv19.bsc.es/#/)
2.3. Mutation analysis
Global mutation data for SARS-CoV-2 NTD sequences were retrieved from the Global Initiative for Sharing All Influenza Data (GISAID) at https://www.gisaid.org/epiflu-applications/phylodynamics/. The frequency over time of SARS-CoV-2 strains harboring mutations on β3-β4, β9-β10 and β14-β15 loops was analyzed through GISAID from January 2020 to July 2021). Nextstrain and WHO nomenclature regarding the major SARS-CoV-2 clades was applied to our analysis for specific variants [23], [24].
2.4. Structural analysis of epitopes recognized by NTD targeting antibodies
Structural figures related to the epitopes recognized by NTD targeting antibodies were made using Maestro (Schrodinger suite, v. 2021–1) and the following cryo-EM data: 4A8 (PDB ID:7C2L), 5–24 (PDB ID:7L2F), 2–17 (PDB ID:7LQW) and 4–8 (PDB ID:7LQV).
2.5. Ethics
Ethical approval was unnecessary because this work is a meta-analysis of publicly available data.
2.6. Role of funders
The authors received no financial support for this research.
3. Results
3.1. Specific loop regions drive the evolutionary divergence of spike NTD protein in β coronaviruses
Sequencing of SARS-CoV-2 genomes from Wuhan patients in China and phylogenetic analysis of representative β coronavirus genomes (Genus: Betacoronavirus) revealed that the subgenus Sarbecovirus of the genus Betacoronavirus could be classified into three dinstict clades. Clade 1 consists of two SARS-CoV-related strains from Rhinolophus sp from Bulgaria (BM-48) and Kenya (BtKY72). Human SARS-CoV-2 sequences and two bat-SARS-like strains from eastern China (bat-SL-CoVZC45 and bat-SL-CoVZXC21) form the clade 2, while SARS-CoV strains from humans and bat SARS-like coronaviruses from southwestern China form the clade 3 [17].
To investigate whether the protein alignment of the spike NTD in β coronavirus members displays a similar phylogenetic profile, we performed a comparative sequence analysis among the spike NTDs of representative β coronaviruses strains. In this direction, residues 15–305 of the SARS-CoV-2 spike protein (Fig. 1A) were aligned against the NTD sequences of SARS and members of clade 1 (BM48), clade 2 (BM48ZC45, ZXC21) and clade 3 (HKU3, Rp3) Bat-SARS like coronavirus strains. As Fig. 1B indicates, the protein alignment of these sequences showed that SARS-CoV-2 and the related bat-derived strains ZC45 and ZXC21 cluster together, while SARS-CoV and the bat SARS-like sequences cluster separately, demonstrating a similar phylogenetic pattern as the one acquired with the genomic sequences.
Integrating the secondary structure elements of SARS-CoV (as identified from the PDB: 5X4S structure) to this sequence alignment, we observed that overall, the NTD is well conserved within the majority of the β sheets and loops, with the exception of loops separating the β3-β4, β9-β10 and β14-β15 sheets (Fig. 1C). More specifically, SARS-CoV-2 β3-β4 and β14-β15 loops display an extended length compared to SARS-CoV. While an extended β3-β4 loop is also shared among all bat SARS-CoV members, β14-β15 loop extension is restricted only in SARS-Cov-2. To identify the structural differences in the NTD divergent regions between SARS-CoV-2 and SARS-CoV, the cryo-electron microscopy (Cryo-EM) structure of the SARS-CoV-2 spike protein [18] was compared against the SARS-CoV NTD crystal structure [19]. This structural comparison revealed that the length of β3-β4, β9-β10 and β14-β15 loops in SARS-CoV-2 has been evolutionarily extended with regard to SARS-CoV (Fig. 1D).
3.2. Interloop interactions and conformational stability of β3-β4, β9-β10 and β14-β15 domains
According to the cryo-EM structure of the SARS-CoV-2 trimeric spike complex and its proposed conformational states [18], [20], these loops are highly flexible and exposed on the outer surface of trimeric spike complex, away from the RBD. The protein surface corresponding to these interacting loops is hydrophilic and possesses a positive potential, due to the presence of several charged/hydrophilic amino acids (Fig. 2A). Notably, the β3-β4 and β14-β15 loops (amino acids 62–80 and 242–263, respectively) are in close proximity, stabilized by electrostatic interactions between amino acids in both loops (Fig. 2B).
Our structural analysis revealed that a network of electrostatic and hydrophobic interactions between several residues of loops β14-β15 and β3-β4 mediate interloop communication. More specifically, the β3-β4 loop amino acid residues Ala 67 and His 69 interact with Leu 242, Tyr 248, Gly 261, Ala 262, Ala 263 and Tyr 265 amino acids of the β14-β15 loop (see representative interactions in Fig. 3A, B). Additionally, Asp 80, also in the β3-β4 loop, interacts with Leu 242 in the β14-β15 loop further stabilizing this tertiary structure(Fig. 3C). Finally, Val 143, in the β9-β10 loop, interacts with multiple amino acids of the β14-β15 loop (Leu 244, His 245, Tyr 248 and Leu 249), further stabilizing interloop interaction (Fig. 3D).
Interestingly, 4A8, one of the best characterized neutralizing antibodies [11], was the first member of NTD targeting antibodies to recognize a discontinuous epitope (ID: 1087268, Immune Database and Analysis Resource-IEDB) encompassing β9-β10 and β14-β15 amino acids (Tyr 144, Tyr 145, His 146, Lys 147, Lys 150, Trp 152, His 245, Arg 246, Ser 247, Tyr 248, Leu 249). These amino acids are positioned in the recognition interface within 4 Å distance from the antibody. As mentioned above, several of the epitope amino acids participate in an extensive network of interactions with residues Ala 67, His 69 and Asp 80 (Fig. 4A), and Val 143 (Fig. 4B), suggesting that these interactions are important for antibody recognition.
Recent studies have highlighted the immunogenic properties of NTD and, besides 4A8, a wide panel of neutralizing antibodies have been identified to recognize a NTD supersite [21]. Our structural analysis of epitopes recognized by COV57 [22], S2L28, S2M28, S2X333 [23], 2–17, 5–24, 4–8 and 2–51 [24] antibodies revealed that β3-β4, β9-β10 and β14-β15 loops have an important role in the formation of this universal epitope (Fig. 5A, B). Therefore, mutations or deletions that affect amino acids in these loops are expected to remodel this epitope and alter the binding affinity of these antibodies. Masking of exposed amino acid residues by glycosylation has been described as a general mechanism of viral immune evasion [25]. It is thus important to mention that two glycosylation sites have been identified in loops β3-β4 and β14-β15, at residues Asn 74 and Asn 149 [26].
3.3. Amino acid variability on β3-β4, β9-β10 and β14-β15 loops modulates the evolutionary dynamics of the spike protein NTD region in SARS-COV-2 strains
To investigate the distribution of identified SARS-CoV-2 mutations in the NTD secondary structure, mutation data from 2,022,459 high-quality genomic sequences were analyzed through the GISAID SARS-CoV-2 database (https://www.gisaid.org/) as of July 9th, 2021, and aligned to the 1–350 aa sequence of the SARS-CoV-2 spike protein (YP_009724390). We observed that approximately 46.4% of identified NTD non-synonymous mutations in different SARS-CoV-2 strains are found within the β3-β4, β9-β10 and β14-β15 loop regions, indicating a higher degree of variation in these elements compared to other NTD secondary structure elements (Fig. 6A-C). Since the β3-β4, β9-β10 and β14-β15 loops are positioned away from the NTD core and the interaction surface with the RBD, mutations therein may be under reduced selective pressure. On the other hand, most of the identified loop variants affect the solvent-accessible surface of the spike protein, and these mutations can potentially affect the dynamics of intermolecular interactions with sugars or antibodies. This is particularly important for RNA viruses, as it is known that high mutation rate on their proteins is associated with escape from host immune response, higher virulence and altered tissue tropism [27], [28], [29]. Interestingly, mutations in the β3-β4 and β9-β10 loops display a higher frequency compared to the β14-β15 one, suggesting a more dynamic role of mutations in these two regions during SARS-CoV-2 evolution.
3.4. Specific mutations within the β3-β4 and β9-β10 loops can lead to conformational changes
Since we observed that mutations within the β3-β4 and β9-β10 loops are quite frequent, we sought to investigate whether specific mutations in these regions can modulate the conformational stability of NTD. As mentioned above, our analysis on the 3D structure of the spike protein revealed that Ala 67, Asp 80 and Val 143 residues maintain a rigid network of interloop interactions. Therefore, mutations in these amino acids may alter the stability of β3-β4/β14-β15 interactions. Indeed, substitution of Ala by Val at position 67 (A67V) is associated with the establishment of novel hydrophobic interactions with Ile 100, Phe 79 and Ala 263 residues (Supplementary Fig. 1A). Moreover, a number of interactions of Asp 80 (Supplementary Fig. 1B) are lost upon mutation to Tyr (Supplementary Fig. 1C) or Ala (Supplementary Fig. 1D). Specifically, D80Y abolishes interactions with His 66, while D80A does not interact with Phe 65, His 66 and Pro 82. The association of these mutants with a rewired network of interactions may induce conformational changes. On the other hand, substitution of Val by Phe at position 143 (V143F) is predicted to result in loss of aromatic and hydrophobic interactions with Leu 244, His 245 and Leu 249.
3.5. Epidemiological data provide evidence that specific mutations and deletions within the β3-β4, β9-β10 and β14-β15 loops have undergone a positive selection during SARS-CoV-2 evolution
The high mutation rate of viruses provides unique opportunities for natural selection of strains based on greater stability, higher transmission rates and immune escape. As a result, certain variants show increasing representation within the population, through a positive selection [30].
As of June 2021, Nextstrain has identified 13 major clades (19A–B, 20A–20J and 21A). During SARS-CoV-2 evolution, different mutated strains emerged within these clades, displaying high transmissibility, and thus affecting COVID-19 epidemiology. A global monitoring of these strains, also known as variants of concern (VOCs) and variants of interest (VOIs), has been established as a response to COVID-19 pandemic, and a single Greek letter naming scheme has been adopted by the World Health Organization (WHO) for easier labelling. By July 15, 2021, a high global distribution of Alpha (Pango lineage designation B.1.1.7), Beta (B.1.351), Gamma (B.1.1.28.1) and Delta (B.1.617.2) VOCs has been reported (WHO epidemiological updates: https://www.who.int/en/activities/tracking-SARS-CoV-2-variants/). According to the released reports of WHO epidemiological data, transmissibility in Alpha, Beta, Gamma and Delta strains has increased by 29%, 25%, 38% and 97%, respectively, compared to the original Wuhan strain [31]. Similarly, multiple VOIs such as Eta (B.1.525), Iota (B.1.526), Kappa (B.1.617.1) and Lambda (C.37) have been identified to represent an emerging risk, due to their increasing prevalence. Epidemiological evidence also exists that genomic properties of the B.1.1.318, B1.1.375 and B.1.1.616 strains have a significant impact on transmissibility, and these strains have been characterized as variants under monitoring.
The association of the reported VOCs/VOIs and the currently designated variants under monitoring B.1.1.318, B1.1.375 and B.1.1.616 with NTD mutations/deletions in β3-β4, β9-β10 and β14-β15 loops revealed that besides Gamma strain, all the other highly aggressive strains are characterized by multiple alterations on these three loop regions that form the universal NTD epitope (Fig. 7A). Notably, deletions are distributed among all loop regions with a specific enrichment for Δ69-70 and Δ144 in β3-β4 and β9-β10 loops respectively, which are present in several different VOCs/VOIs (Alpha, Eta, B.1.375, B.1.1.616). Moreover, a co-occurrence of Δ69-70 with Δ144 exists in Alpha and Eta strains. As a result, the frequency of these two deletions in SARS-COV-2 genotyped sequences from patients has become very high in recent months (Fig. 7B, C). Besides Δ144, the β9-β10 loop, which has a critical role in the formation of the NTD antigenic supersite, seems to display different patterns of deletions in aggressive strains, as Δ144-145 and Δ156-157 have also been identified in the B.1.1.616 and Delta strains. Interestingly, deletions in the β14-β15 loop have been found so far only in the Lambda strain harboring the Δ246-252 deletion (Supplementary Fig. 2). Notably, Δ241-243, which is present in the Beta strain, affects a region adjacent to the β14-β15 loop, while its impact on the supersite formation has not been studied yet.
In VOCs/VOIs and variants under monitoring, the above pattern of deletions coexists with specific missense mutations. More specifically, mutations in the β3-β4 loop that are associated with the genetic properties of these strains include A67V, G75V, T76I and D80A. On the other hand, G142D, W152C and E154K within the β9-β10, and R246I and D253G in the β14-β15 loop, are the missense mutations identified in the other two loops (Fig. 7A). Despite the mutation/deletion overlap among strains, the different combinations of the genetic variations generate complex virus genotypes (Fig. 7B, C and Supplementary Fig. 2), while the functional implications of these combinations are difficult to assess.
These acquired alterations are expected to modify the network of interloop and intraloop interactions, and thus to induce structural remodeling of the NTD antigenic supersite. In fact, deletions within the above loops have been proposed to decrease the binding and neutralization potency of COVID-19 patient convalescent sera or monoclonal antibodies (mAbs). For this reason, it is considered that these deletions facilitate immune escape and are thus positively selected [32], [33]. On the other hand, recent studies have highlighted the role of specific missense mutations in structural changes of the NTD antigenic supersite. Functional data have also correlated these changes with immune escape. For instance, the W152C mutation introduces a free cysteine that can form new disulfide bonds. In the Epsilon (B.1.427/B.1.429) strain, an alternative disulfide bond between C136 and W152C is proposed to drive NTD conformational changes that lead to immune escape from NTD targeting neutralizing antibodies [34]. Moreover, G142D has also been shown to alter the binding of NTD targeting antibodies [23]. Notably, Ala 67 and Asp 80 residues in the β3-β4 loop, which we predict to maintain interloop interactions in the NTD supersite, are mutated in Eta (A67V) and Beta (D80A) strains. In line with our analysis, an NTD remodeling induced by D80A has also been proposed recently [35].
4. Discussion
SARS-CoV-2 is genetically related to SARS-CoV, a deadly coronavirus that emerged in late 2002 and caused an outbreak of severe acute respiratory syndrome. SARS-CoV was highly lethal but after intense public health measures, was eradicated in 2003 [36]. The new coronavirus SARS-CoV-2 is less deadly but far more transmissible [37]. Moreover, while SARS‐CoV appears to infect pneumocytes and enterocytes of the small intestine [38], SARS-CoV-2 can infect multiple organs such as the intestine, liver, kidney and blood vessels [39], [40].
To identify divergent structure elements on the SARS-CoV-2 spike protein NTD that could potentially modulate interactions with the host, we performed a comparative sequence and structural analysis on SARS-CoV and SARS-CoV-2 NTDs. As expected, NTD sequences of SARS-CoV and SARS-CoV-2 are highly similar. The most striking difference is the length of loops β3-β4, β9-β10 and β14-β15 which are significantly longer in the SARS-CoV-2 and in certain bat coronaviruses with genomes closely related to SARS-CoV-2, indicating that the structural evolution of these elements is characteristic for the SARS-CoV-2 clade identity. The low degree of homology between sequences corresponding to the respective loops from bat and human β-coronaviruses, suggests that amino acid variations in these elements have a major impact on divergence of spike proteins within the SARS-CoV-2 clade. Although the cryo-EM data suggest that these divergent loop regions are part of a highly flexible NTD region, our molecular modeling data indicate that a network of electrostatic and hydrophobic interactions between several residues of β3-β4 and β9-β10 loops with residues of the β14-β15 loop mediate an interloop communication that provides a relative stability. Residues Ala 67, His 69 and Asp 80 in β3-β4 and Val 143 in the β9-β10 loop were identified to play an important role in these interactions.
It is well accepted that antibodies targeting the RBD confer significantly to the neutralizing activity of convalescent sera [41], [42]. In a recent study however, Voss and colleagues analyzed the proteomic profile of IgGs in convalescent sera and demonstrated that the response is directed predominantly (>80%) against epitopes residing outside the RBD, and include the NTD region and the S2 domain [43]. The same study also reported that anti-NTD antibodies contribute critically to neutralization and their protection is related to their relative levels in plasma.
To this direction, numerous immunological studies have demonstrated that NTD-targeting antibodies elicited in COVID-19 patients are effective neutralizing antibodies (nAbs). In a recent study by Liu and colleagues, almost half of the neutralizing Abs isolated from plasma derived from 40 COVID-19 patients exhibited binding to NTD but not to RBD [24]. Neutralizing antibodies 4A8 [11], COV57 [22], 2–17, 5–24, 4–8 [24] and possibly others [44] target the exposed NTD surface of the SARS-CoV-2 spike protein. Moreover, while RBD-targeting neutralizing antibodies recognize distinct epitopes, various neutralizing antibodies against the NTD target a common site comprising primarily loops β9-β10 and β14-β15 (N3 and N5 according to Chi et al) [21]. It has been postulated that this region is highly immunogenic in part because it is glycan free, which allows epitope recognition. Moreover, the high flexibility of the loops allows the peptide to assume multiple conformations accommodating recognition by several antibodies. The neutralizing activity of NTD antibodies relies on hindrance, which prevents the spike protein from binding to the ACE-2 receptor [21].
It is well accepted that the conformation of epitopes is essential for the neutralizing activity of antibody responses against SARS-CoV [45], whereas specific mutants can escape the neutralizing activity of certain antibodies [46]. Since conformation changes in the β3-β4, β9-β10 and β14-β15 loops could possibly alter NTD antibody recognition, we analyzed the GISAID mutation data in order to study the sequence variation profile of SARS-CoV-2 divergent loops. Our analysis revealed a disproportionate high rate of mutations in these loops, indicating a dynamic role in spike sequence divergence and evolution within SARS-CoV-2 clade that could enhance immune escape
Based on our molecular modeling of the spike 3D structure, Ala 67, Asp 80 and Val 143 residues maintain a rigid network of interactions. In this context, A67V, D80Y, D80A and V143F variants are predicted to rewire the network of these interactions, and to either promote the establishment of new hydrophobic interconnections (A67V) or induce a loss of intraloop hydrogen bonds (D80Y, D80A) and interloop hydrophobic interactions (V143F). This might hinder NTD recognition by neutralizing antibodies from convalescent plasma. In this context, recent studies revealed that in frame deletions of NTD amino acid sequence in SARS-CoV-2 strains that affect β3-β4 (Δ69-70), β9-β10 (Δ141-144, Δ146) and β14-β15 loops (Δ243-244) are associated with immune escape in patients [32].
In order to identify whether SARS-CoV-2 strains harboring mutations in NTD loops are associated with greater prevalence, we investigated Nextstrain data with a focus on WHO reported VOCs (Alpha, Beta, Gamma, Delta), VOIs (Eta, Iota, Kappa and Lambda) and strains under monitoring (B.1.1.318, B1.1.375 and B.1.1.616) that are characterized by high transmissibility. Notably, for many of these prevalent strains, recent functional studies have established a link between their high transmissibility and their ability to escape from RBD and NTD targeting neutralizing antibodies. Regarding the escape from NTD targeting antibodies, strong experimental evidence exists for Alpha and Beta [35], [47], [48], [49], as well as for Delta [50], [51] and Epsilon [34] strains.
This analysis revealed that with the exception of the Gamma strain, all other highly aggressive strains harbor multiple mutations and deletions in β3-β4, β9-β410 and β14-β15 loops. The high prevalence of deletions in these loop regions, may indicate that deletions might be subjected to a stronger selection for antigenic drift than missense mutations.
The structural analysis of complexes between neutralizing antibodies and the NTD suggests that the β9-β410 and β14-β15 loops are closer to the antibody binding surface than the β3-β4 loop, which is not directly involved in interactions with antibody residues. However, our structural analysis revealed an important role for the β3-β4 residues Ala 67, His 69 and Asp 80 in interloop interactions that stabilize the β14-β15 loop. Since β14-β15 loop is a critical part of the NTD antigenic supersite, β3-β4 loop mutations may have a critical role in immune escape. Interestingly, Ala 67 and Asp 80 are mutated in Eta (A67V) and Beta (D80A) strains, while His 69 is deleted (Δ69-70) in Alpha, Eta and B.1.375 strains. Our analysis also indicated that Val 143 interacts with multiple amino acids of the β14-β15 loop (Leu 244, His 245, Tyr 248 and Leu 249), and thus stabilizes it in a folding critical for antigenicity. Although, Val 143 is not mutated within the group of VOCs/VOIs and other variants under monitoring, mutations of neighboring residues Gly 142 and Tyr 144 are present in Alpha (Δ144), Eta (Δ144), Delta (G142D), B.1.1616 (Δ144-145) and B.1.617.1 (G142D) strains.
We acknowledge that all computational data presented here merit experimental validation. Nevertheless, we have uncovered important aspects regarding virus interaction with the host immune system. Our findings could facilitate the generation of monoclonal antibodies and vaccines with better profile against novel fast-spreading SARS-CoV-2 variants.
CRediT authorship contribution statement
Apostolos Klinakis: Conceptualization, Writing - original draft, Writing - review & editing, Visualization. Zoe Cournia: Software, Data curation, Visualization, Writing - original draft. Theodoros Rampias: Conceptualization, Software, Data curation, Writing - original draft, Writing - review & editing, Visualization.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgments
Acknowledgements
The authors thank Dr. Rommie Amaro for providing Molecular Dynamics simulations trajectories and structures of the glycan shielded SARS-CoV-2 spike protein and Dr. Modesto Orozco for access to the BioExcel-CV19 platform.
Contributors
All authors designed and performed the analysis. AK and TR wrote the manuscript.
Footnotes
Supplementary data to this article can be found online at https://doi.org/10.1016/j.csbj.2021.10.004.
Contributor Information
Apostolos Klinakis, Email: aklinakis@bioacademy.gr.
Theodoros Rampias, Email: trampias@bioacademy.gr.
Appendix A. Supplementary data
The following are the Supplementary data to this article:
References
- 1.Gorbalenya A.E., Enjuanes L., Ziebuhr J., Snijder E.J. Nidovirales: evolving the largest RNA virus genome. Virus Res. 2006;117(1):17–37. doi: 10.1016/j.virusres.2006.01.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Petrosillo N., Viceconte G., Ergonul O., Ippolito G., Petersen E. COVID-19, SARS and MERS: are they closely related? Clin Microbiol Infect. 2020;26(6):729–734. doi: 10.1016/j.cmi.2020.03.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Huang C., Wang Y., Li X., Ren L., Zhao J., Hu Y. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet. 2020;395(10223):497–506. doi: 10.1016/S0140-6736(20)30183-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Zhou P., Yang X.-L., Wang X.-G., Hu B., Zhang L., Zhang W. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020;579(7798):270–273. doi: 10.1038/s41586-020-2012-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Walls A.C., Park Y.-J., Tortorici M.A., Wall A., McGuire A.T., Veesler D. Structure, function, and antigenicity of the SARS-CoV-2 spike glycoprotein. Cell. 2020;181(2):281–292.e6. doi: 10.1016/j.cell.2020.02.058. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Lauring A.S., Frydman J., Andino R. The role of mutational robustness in RNA virus evolution. Nat Rev Microbiol. 2013;11(5):327–336. doi: 10.1038/nrmicro3003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Hartenian E., Nandakumar D., Lari A., Ly M., Tucker J.M., Glaunsinger B.A. The molecular virology of coronaviruses. J Biol Chem. 2020;295(37):12910–12934. doi: 10.1074/jbc.REV120.013930. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Subissi L., Posthuma C.C., Collet A., Zevenhoven-Dobbe J.C., Gorbalenya A.E., Decroly E. One severe acute respiratory syndrome coronavirus protein complex integrates processive RNA polymerase and exonuclease activities. Proc Natl Acad Sci U S A. 2014;111(37):E3900–E3909. doi: 10.1073/pnas.1323705111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Salemi M., Fitch W.M., Ciccozzi M., Ruiz-Alvarez M.J., Rezza G., Lewis M.J. Severe acute respiratory syndrome coronavirus sequence characteristics and evolutionary rate estimate from maximum likelihood analysis. J Virol. 2004;78(3):1602–1603. doi: 10.1128/JVI.78.3.1602-1603.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Koyama T., Platt D., Parida L. Variant analysis of SARS-CoV-2 genomes. Bull World Health Organ. 2020;98(7):495–504. doi: 10.2471/BLT.20.253591. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Chi X., Yan R., Zhang J., Zhang G., Zhang Y., Hao M. A neutralizing human antibody binds to the N-terminal domain of the Spike protein of SARS-CoV-2. Science. 2020;369(6504):650–655. doi: 10.1126/science.abc6952. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Awasthi M., Gulati S., Sarkar D.P., Tiwari S., Kateriya S., Ranjan P. The sialoside-binding pocket of SARS-CoV-2 spike glycoprotein structurally resembles MERS-CoV. Viruses. 2020;12(9):909. doi: 10.3390/v12090909. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Sievers F., Higgins D.G. Clustal Omega for making accurate alignments of many protein sequences. Protein Sci. 2018;27(1):135–145. doi: 10.1002/pro.3290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Jubb H.C., Higueruelo A.P., Ochoa-Montaño B., Pitt W.R., Ascher D.B., Blundell T.L. Arpeggio: a web server for calculating and visualising interatomic interactions in protein structures. J Mol Biol. 2017;429(3):365–371. doi: 10.1016/j.jmb.2016.12.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Portelli S., Olshansky M., Rodrigues C.H.M., D’Souza E.N., Myung Y., Silk M. Exploring the structural distribution of genetic variation in SARS-CoV-2 with the COVID-3D online resource. Nat Genet. 2020;52(10):999–1001. doi: 10.1038/s41588-020-0693-3. [DOI] [PubMed] [Google Scholar]
- 16.Baker N.A., Sept D., Joseph S., Holst M.J., McCammon J.A. Electrostatics of nanosystems: application to microtubules and the ribosome. Proc Natl Acad Sci U S A. 2001;98(18):10037–10041. doi: 10.1073/pnas.181342398. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Lu R., Zhao X., Li J., Niu P., Yang B., Wu H. Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding. Lancet. 2020;395(10224):565–574. doi: 10.1016/S0140-6736(20)30251-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Wrapp D., Wang N., Corbett K.S., Goldsmith J.A., Hsieh C.-L., Abiona O. Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation. Science. 2020;367(6483):1260–1263. doi: 10.1126/science.abb2507. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Yuan Y., Cao D., Zhang Y., Ma J., Qi J., Wang Q. Cryo-EM structures of MERS-CoV and SARS-CoV spike glycoproteins reveal the dynamic receptor binding domains. Nat Commun. 2017;8(1):15092. doi: 10.1038/ncomms15092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Cai Y., Zhang J., Xiao T., Peng H., Sterling S.M., Walsh R.M. Distinct conformational states of SARS-CoV-2 spike protein. Science. 2020;369(6511):1586–1592. doi: 10.1126/science.abd4251. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Cerutti G., Guo Y., Zhou T., Gorman J., Lee M., Rapp M. Potent SARS-CoV-2 neutralizing antibodies directed against spike N-terminal domain target a single supersite. Cell Host Microbe. 2021;29(5):819–833. doi: 10.1016/j.chom.2021.03.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Barnes C.O., West A.P., Huey-Tubman K.E., Hoffmann M.A.G., Sharaf N.G., Hoffman P.R. Structures of human antibodies bound to SARS-CoV-2 spike reveal common epitopes and recurrent features of antibodies. Cell. 2020;182(4):828–842.e16. doi: 10.1016/j.cell.2020.06.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.McCallum M., De Marco A., Lempp F.A., Tortorici M.A., Pinto D., Walls A.C. N-terminal domain antigenic mapping reveals a site of vulnerability for SARS-CoV-2. Cell. 2021;184(9):2332–2347.e16. doi: 10.1016/j.cell.2021.03.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Liu L., Wang P., Nair M.S., Yu J., Rapp M., Wang Q. Potent neutralizing antibodies against multiple epitopes on SARS-CoV-2 spike. Nature. 2020;584(7821):450–456. doi: 10.1038/s41586-020-2571-7. [DOI] [PubMed] [Google Scholar]
- 25.Watanabe Y., Bowden T.A., Wilson I.A., Crispin M. Exploitation of glycosylation in enveloped virus pathobiology. Biochim Biophys Acta, Gen Subj. 2019;1863(10):1480–1497. doi: 10.1016/j.bbagen.2019.05.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Watanabe Y., Allen J.D., Wrapp D., McLellan J.S., Crispin M. Site-specific glycan analysis of the SARS-CoV-2 spike. Science. 2020;369(6501):330–333. doi: 10.1126/science.abb9983. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Duffy S. Why are RNA virus mutation rates so damn high? PLoS Biol. 2018;16(8):e3000003. doi: 10.1371/journal.pbio.3000003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Novella I.S., Presloid J.B., Taylor R.T. RNA replication errors and the evolution of virus pathogenicity and virulence. Curr Opin Virol. 2014;9:143–147. doi: 10.1016/j.coviro.2014.09.017. [DOI] [PubMed] [Google Scholar]
- 29.Rasschaert D., Duarte M., Laude H. Porcine respiratory coronavirus differs from transmissible gastroenteritis virus by a few genomic deletions. J Gen Virol. 1990;71(11):2599–2607. doi: 10.1099/0022-1317-71-11-2599. [DOI] [PubMed] [Google Scholar]
- 30.Domingo E., Sheldon J., Perales C. Viral quasispecies evolution. Microbiol Mol Biol Rev. 2012;76(2):159–216. doi: 10.1128/MMBR.05023-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Campbell F., Archer B., Laurenson-Schafer H., Jinnai Y., Konings F., Batra N. Increased transmissibility and global spread of SARS-CoV-2 variants of concern as at June 2021. Euro Surveill. 2021;26(24) doi: 10.2807/1560-7917.ES.2021.26.24.2100509. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.McCarthy K.R., Rennick L.J., Nambulli S., Robinson-McCarthy L.R., Bain W.G., Haidar G. Recurrent deletions in the SARS-CoV-2 spike glycoprotein drive antibody escape. Science. 2021;371(6534):1139–1142. doi: 10.1126/science.abf6950. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Avanzato V.A., Matson M.J., Seifert S.N., Pryce R., Williamson B.N., Anzick S.L. Case study: prolonged infectious SARS-CoV-2 shedding from an asymptomatic immunocompromised individual with cancer. Cell. 2020;183(7):1901–1912.e9. doi: 10.1016/j.cell.2020.10.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.McCallum M., Bassi J., De Marco A., Chen A., Walls A.C., Di Iulio J. SARS-CoV-2 immune evasion by the B.1.427/B.1.429 variant of concern. Science. 2021;373(6555):648–654. doi: 10.1126/science.abi7994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Cai Y., Zhang J., Xiao T., Lavine C.L., Rawson S., Peng H. Structural basis for enhanced infectivity and immune evasion of SARS-CoV-2 variants. Science. 2021;373(6555):642–648. doi: 10.1126/science.abi9745. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.May R.M., McLean A.R., Pattison J., Weiss R.A., Anderson R.M., Fraser C. Epidemiology, transmission dynamics and control of SARS: the 2002–2003 epidemic. Philos Trans R Soc Lond B Biol Sci. 2004;359(1447):1091–1105. doi: 10.1098/rstb.2004.1490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Petersen E., Koopmans M., Go U., Hamer D.H., Petrosillo N., Castelli F. Comparing SARS-CoV-2 with SARS-CoV and influenza pandemics. Lancet Infect Dis. 2020;20(9):e238–e244. doi: 10.1016/S1473-3099(20)30484-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.To K.F., Tong J.HM., Chan P.KS., Au F.WL., Chim S.SC., Allen Chan K.C. Tissue and cellular tropism of the coronavirus associated with severe acute respiratory syndrome: an in-situ hybridization study of fatal cases. J Pathol. 2004;202(2):157–163. doi: 10.1002/path.1510. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Lamers M.M., Beumer J., van der Vaart J., Knoops K., Puschhof J., Breugem T.I. SARS-CoV-2 productively infects human gut enterocytes. Science. 2020;369(6499):50–54. doi: 10.1126/science.abc1669. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Puelles V.G., Lütgehetmann M., Lindenmeyer M.T., Sperhake J.P., Wong M.N., Allweiss L. Multiorgan and Renal Tropism of SARS-CoV-2. N Engl J Med. 2020;383(6):590–592. doi: 10.1056/NEJMc2011400. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Piccoli L., Park Y.J., Tortorici M.A., Czudnochowski N., Walls A.C., Beltramello M. Mapping neutralizing and immunodominant sites on the SARS-CoV-2 spike receptor-binding domain by structure-guided high-resolution serology. Cell. 2020;183(4):1024–42 e21. doi: 10.1016/j.cell.2020.09.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Barnes C.O., Jette C.A., Abernathy M.E., Dam K.-M., Esswein S.R., Gristick H.B. SARS-CoV-2 neutralizing antibody structures inform therapeutic strategies. Nature. 2020;588(7839):682–687. doi: 10.1038/s41586-020-2852-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Voss W.N., Hou Y.J., Johnson N.V., Delidakis G., Kim J.E., Javanmardi K. Prevalent, protective, and convergent IgG recognition of SARS-CoV-2 non-RBD spike epitopes. Science. 2021;372(6546):1108–1112. doi: 10.1126/science.abg5268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Brouwer P.J.M., Caniels T.G., van der Straten K., Snitselaar J.L., Aldon Y., Bangaru S. Potent neutralizing antibodies from COVID-19 patients define multiple targets of vulnerability. Science. 2020;369(6504):643–650. doi: 10.1126/science.abc5902. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.He Y., Lu H., Siddiqui P., Zhou Y., Jiang S. Receptor-binding domain of severe acute respiratory syndrome coronavirus spike protein contains multiple conformation-dependent epitopes that induce highly potent neutralizing antibodies. J Immunol. 2005;174(8):4908–4915. doi: 10.4049/jimmunol.174.8.4908. [DOI] [PubMed] [Google Scholar]
- 46.ter Meulen J., van den Brink E.N., Poon L.L.M., Marissen W.E., Leung C.S.W., Cox F. Human monoclonal antibody combination against SARS coronavirus: synergy and coverage of escape mutants. PLoS Med. 2006;3(7):e237. doi: 10.1371/journal.pmed.0030237. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Wang P., Nair M.S., Liu L., Iketani S., Luo Y., Guo Y. Antibody resistance of SARS-CoV-2 variants B.1.351 and B.1.1.7. Nature. 2021;593(7857):130–135. doi: 10.1038/s41586-021-03398-2. [DOI] [PubMed] [Google Scholar]
- 48.Graham C., Seow J., Huettner I., Khan H., Kouphou N., Acors S. Neutralization potency of monoclonal antibodies recognizing dominant and subdominant epitopes on SARS-CoV-2 Spike is impacted by the B.1.1.7 variant. Immunity. 2021;54(6):1276–1289.e6. doi: 10.1016/j.immuni.2021.03.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Supasa P., Zhou D., Dejnirattisai W., Liu C., Mentzer A.J., Ginn H.M. Reduced neutralization of SARS-CoV-2 B.1.1.7 variant by convalescent and vaccine sera. Cell. 2021;184(8):2201–2211.e7. doi: 10.1016/j.cell.2021.02.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Planas D., Veyer D., Baidaliuk A., Staropoli I., Guivel-Benhassine F., Rajah M.M. Reduced sensitivity of SARS-CoV-2 variant Delta to antibody neutralization. Nature. 2021;596(7871):276–280. doi: 10.1038/s41586-021-03777-9. [DOI] [PubMed] [Google Scholar]
- 51.Mlcochova P., Kemp S., Dhar M.S., Papa G., Meng B., Ferreira I. SARS-CoV-2 B.1.617.2 Delta variant replication and immune evasion. Nature. 2021:1–8. doi: 10.1038/s41586-021-03944-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.