Skip to main content
eLife logoLink to eLife
. 2021 Aug 31;10:e70362. doi: 10.7554/eLife.70362

Sterically confined rearrangements of SARS-CoV-2 Spike protein control cell invasion

Esteban Dodero-Rojas 1, Jose N Onuchic 1,2,3,4,, Paul Charles Whitford 5,6,
Editors: Donald Hamelberg7, José D Faraldo-Gómez8
PMCID: PMC8456623  PMID: 34463614

Abstract

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is highly contagious, and transmission involves a series of processes that may be targeted by vaccines and therapeutics. During transmission, host cell invasion is controlled by a large-scale (200–300 Å) conformational change of the Spike protein. This conformational rearrangement leads to membrane fusion, which creates transmembrane pores through which the viral genome is passed to the host. During Spike-protein-mediated fusion, the fusion peptides must be released from the core of the protein and associate with the host membrane. While infection relies on this transition between the prefusion and postfusion conformations, there has yet to be a biophysical characterization reported for this rearrangement. That is, structures are available for the endpoints, though the intermediate conformational processes have not been described. Interestingly, the Spike protein possesses many post-translational modifications, in the form of branched glycans that flank the surface of the assembly. With the current lack of data on the pre-to-post transition, the precise role of glycans during cell invasion has also remained unclear. To provide an initial mechanistic description of the pre-to-post rearrangement, an all-atom model with simplified energetics was used to perform thousands of simulations in which the protein transitions between the prefusion and postfusion conformations. These simulations indicate that the steric composition of the glycans can induce a pause during the Spike protein conformational change. We additionally show that this glycan-induced delay provides a critical opportunity for the fusion peptides to capture the host cell. In contrast, in the absence of glycans, the viral particle would likely fail to enter the host. This analysis reveals how the glycosylation state can regulate infectivity, while providing a much-needed structural framework for studying the dynamics of this pervasive pathogen.

Research organism: Virus

Introduction

The current COVID-19 pandemic is being driven by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). While vaccine and treatment development will help mitigate the immediate impact of this disease, long-term strategies for its eradication will rely on an understanding of the factors that control transmission. The need to isolate the molecular constituents that govern SARS-CoV-2 dynamics is widely recognized, where the global scientific community has undergone its most rapid transformation in recent history. This unprecedented redirection of scientific inquiry has rapidly provided atomic-resolution structures of SARS-CoV-2 proteins at various stages of infection (Wrapp et al., 2020; Walls et al., 2020; Wang et al., 2020; Lan et al., 2020; Yuan et al., 2020; Watanabe et al., 2020a), as well as computational analysis of specific conformational states (Casalino et al., 2020; Schlick et al., 2020; Roy et al., 2020; Ali and Vijayan, 2020; Verkhivker, 2020; Shin et al., 2020; Turoňová et al., 2020). Despite these advances, our understanding of the mechanism by which SARS-CoV-2 enters the host cell is limited.

Central to the function of SARS-CoV-2 is host-cell recognition by the Spike protein, which results in virus-host membrane fusion and transfer of the viral genome. In the active virion, the Spike protein assembly (S) is a threefold symmetric homo-trimer (Wrapp et al., 2020), where each protein contains approximately 1200 residues (Figure 1). The complex is anchored to the viral membrane envelope by a transmembrane (TM) helical bundle, while the remaining regions reside on the exterior of the viral particle. Cleavage at the S1/S2 and S2’ sites leads to activation of the Spike protein, where the resulting subunits (S1 and S2) maintain contact through non-bonded interactions (Figure 1A; Shang et al., 2020). The receptor binding domain (RBD) in S1 can then associate with the ACE2 receptor (Letko et al., 2020; Lan et al., 2020), which triggers S1 release from S2 (Figure 1B). While the order of S2’ cleavage and S1/S2 dissociation is not known, it is generally thought that both processes occur prior to any large-scale rearrangements of S2. During the subsequent global structural rearrangement of S2, the fusion peptides must bind and recruit a host cell (Figure 1C; Basso et al., 2016).

Figure 1. Spike-protein-mediated membrane fusion.

Figure 1.

(A) The active Spike protein assembly is composed of the subunits S1 (white surface) and S2 (cartoon representation) (Walls et al., 2020), which remain bound through nonbonded interactions. Numerous glycosylation sites (glycans shown in orange) are present in the Head Group (HG) and Heptad Repeat 2 (HR2) regions. The modeled glycans are consistent with previous studies (Casalino et al., 2020; Turoňová et al., 2020). (B) Upon recognition of the ACE2 receptor and cleavage at the S2’ site, S1 dissociates. In addition to the HG and HR2 regions, S2 is composed of the Heptad Repeat 1 (HR1), Linker (L), Fusion Peptide (FP), Connector (CR), and Transmembrane (TM) regions. Since the HR2 and TM regions were not resolved in the prefusion structure (PDB ID: 6VXX [Walls et al., 2020]), they were modeled as a helical bundle, consistent with previous studies (Casalino et al., 2020). (C) Release of S1 allows for the FPs to associate with and recruit the host membrane. The HG and HR1 regions undergo a large-scale rotation (>90 degrees), which leads to fusion of the host and viral membranes. Since the CR and FP regions were not resolved in the postfusion structure (PDB ID: 6M3W [Fan et al., 2020]), they were modeled as an extended helical bundle.

Most of the current SARS-CoV-2 therapies and vaccines have focused on the ACE2 recognition stage of virus invasion. An alternate strategy is to target the conformational change in S2 that induces membrane fusion. In this direction, it has been found that disrupting formation of the HR1-HR2 six-helical bundle through introduction of inhibitor peptides can halt expression of the virus in biological samples (Xia et al., 2020). This suggests there is great potential for impeding viral entry by targeting intermediates of the Spike conformation change. Accordingly, understanding the conformational rearrangements associated with membrane fusion can provide new targets for medicinal applications. However, probing intermediate stages of fusion has proven to be extremely difficult. As a result, there has yet to be an experimental or computational/simulation study reported that provides direct insights into the mechanistic aspect of the S2 conformational change.

The fusion process involves global reorganization of S2 (Figure 1 and Appendix 1—figure 1). This includes dissociation of the fusion peptides (FP) from the head group (HG), disordering of Heptad Repeat 2 (HR2), rotation of the head group relative to the viral membrane and then reordering of Heptad Repeat 1 (HR1), HR2 and Connecting Region (CR) into an extended helical arrangement. During this elaborate process, the fusion peptides associate with the host membrane, where subsequent ‘zippering’ of a HR1-HR2 superhelical structure likely provides energy to recruit the host membrane. In the postfusion structure, the TM, CR, and FP regions adopt proximal positions, allowing them to facilitate membrane fusion.

While high-resolution structures have been resolved for the Spike protein in the prefusion and postfusion conformations, the precise structural mechanism of fusion is unknown. As a consequence, the molecular factors that control this process have yet to be determined. For example, while many post-translational modifications (glycans) have been identified (13 and 9 on each S1 and S2 monomer)(Watanabe et al., 2020a; Watanabe et al., 2020b; Walls et al., 2020; Wrapp et al., 2020), there has not been an investigation into their role during the fusion step of infection. However, simulations of the prefusion protein have shown how glycans may shield the Spike protein and prevent recognition by the immune system (Casalino et al., 2020). In addition, studies have provided evidence that glycans may serve as activators for the lectin pathway (Malaquias et al., 2021; Lenza et al., 2020). While glycans have been implicated in other aspects of the viral ‘life’ cycle, it is not known whether they directly impact the host-entry process.

There are various challenges that have impeded the direct study of conformational changes in the S2 subunit. In terms of structural methods, due to the transient character of S2 intermediates, all previously reported structures are of the prefusion or postfusion states (Wrapp et al., 2020; Walls et al., 2020; Fan et al., 2020). While one could envision applying a range of simulation methods to gain insights into the dynamics of the transition, the size of the Spike protein makes many strategies intractable. For example, conventional explicit-solvent simulations can be used to study the detailed energetics of small proteins (Lindorff-Larsen et al., 2011). However, such highly detailed simulations of the full S2 trimer would be extremely computationally demanding, which would preclude the possibility of simulating the full conformational process. With this limitation, it can be advantageous to apply models that have simplified representations of the energetics, such as structure-based models. In a structure-based model, the potential energy function is defined based on knowledge of stable (i.e. experimental) conformations (Clementi et al., 2000). In the cases of protein folding and functional dynamics, these models have been able to provide descriptions of mechanisms that are consistent with experimental measures for various systems. The success of these simplified models to capture folding dynamics is a reflection of the strong limitations that are imposed by molecular sterics and the complexity of folded conformations (Shea et al., 1999; Gosavi et al., 2006). Inspired by studies of folding, all-atom variants were later used to simulate rearrangements in large assemblies, such as ribosomes (Nguyen and Whitford, 2016; Levi et al., 2020) or viral capsids (Noel et al., 2016; Whitford et al., 2020). Despite the simplicity of the models, mechanistic aspects of the dynamics are often robust to the model parameters. Similar to the study of folding, robustness can be understood as arising from the atomic resolution of the models, where steric interactions strongly limit the possible mechanistic properties (Levi et al., 2019). Motivated by the observation of sterics-associated robustness during folding and functional processes, we will adopt this modeling strategy to study the dynamics of the S2 subunit as it transitions between the prefusion and postfusion conformations.

Here, we performed molecular dynamics simulations with an all-atom structure-based model to determine whether the steric composition of glycans can have a meaningful influence on SARS-CoV-2 Spike-protein-mediated membrane fusion. Simulations were initiated with the Spike protein in the prefusion conformation, while the energetics were defined to favor the postfusion conformation (shown in Figure 1C). It is important to emphasize that the prefusion model used as a starting point is intended to represent a state in which S2’ cleavage and S1 dissociation have occurred. While the precise timing of these steps is unknown, we assume they can occur prior to any significant conformational changes in the S2 subunit. By comparing the dynamics with and without glycans present, we show how the steric composition of the glycans can extend the lifetime of a critical intermediate in which the head appears to become sterically-caged. This leads to a transient pause that may increase the probability of recruiting the host cell. These calculations provide physical evidence that the glycosylation state is a critical factor that determines infectivity of SARS-CoV-2.

Results

Simulating the membrane-fusion-associated conformational change of SARS-CoV-2 Spike protein

In order to characterize the mechanism of Spike-protein-mediated membrane fusion, we employed an all-atom structure-based model (Whitford et al., 2009; Noel et al., 2016) and simulated transitions between the prefusion and postfusion conformations (Figure 2B and Video 1). In a structure-based (Gō-like) model, some/all of the energetic interactions are defined based on knowledge of specific stable (experimentally resolved) structures. In the context of protein folding, applying these types of models (Clementi et al., 2000) is supported by the principle of minimal frustration (Bryngelson and Wolynes, 1989; Bryngelson et al., 1995). However, to warrant their application to study conformational transitions, it is necessary to recognize that the models describe the effective energetics of each system (Hyeon and Thirumalai, 2011; Di Pierro et al., 2018; Chan et al., 2011). That is, by explicitly defining the molecular interactions to stabilize the endpoint conformations, the models are intended to provide a first-order approximation to the energetics. In the presented model, only interactions that are specific to the prefusion and postfusion configurations were defined to be stable. For the TM region, stabilizing prefusion-specific interactions allow it to serve as an anchor between the Spike protein and the viral membrane. An implicit membrane potential was also introduced to restrain the TM to a plane. Even though an orientation bias was not introduced, the TM region generally remains nearly perpendicular to the viral membrane surface (Appendix 1—figure 2). Finally, all non-TM interactions were defined to stabilize the postfusion conformation (Figure 1). Qualitatively, this model describes the Spike protein as a loaded (non-linear) spring that is released upon cleavage of the S2’ site and dissociation of S1 (Figure 2A). While the potential energy in the model is downhill, molecular sterics can still lead to pronounced free-energy barriers that control the kinetics (Whitford and Onuchic, 2015; Levi et al., 2019).

Figure 2. Simulating Spike-protein-mediated membrane fusion. Simulations with an all-atom structure-based model (Whitford et al., 2009; Noel et al., 2016) allow for transitions between prefusion and postfusion configurations to be observed.

Figure 2.

(A) Schematic representation of the energetics in the structure-based model. The postfusion configuration was defined as the global potential energy minimum. The pre-cleavage state (green) is assumed to be stable, where cleavage and release of S1 leads to an unstable prefusion configuration (black). While, in the employed model, stabilizing energetic terms favor the postfusion configuration, steric interactions between the protein and glycans may impede the motion (black vs. red). (B) Representative simulation (1 of 1000) of the pre-to-post transition. Spatial rmsd from the post configuration (excluding TM, CR, and FPs) is shown, as a function of time. The simulation included explicit glycans, as well as an effective viral membrane potential. (C) After an initial relaxation phase (panel B), the head (red) appears to become caged by the HR2 strands (gray), allowing it to sample configurations near the viral membrane (pink). While the host membrane (yellow) was not included in the simulations, it is depicted for illustrative purposes. (D) After reaching the caged ensemble, the head escapes and the HR1-HR2 superhelix assembles. The position of the head group, relative to the TM region (blue), is described by the cylindrical coordinates rhead and zhead. The origin is defined as the geometric center of TM, and the cylindrical axis is perpendicular to the viral membrane. See Materials and methods for details. (E) Probability distribution of simulated events (with glycans) reveals an obligatory cage-like intermediate.

Video 1. This video shows a representative simulation (1 of 1000) of the fully-glycosylated S2 subunit of the SARS-CoV-2 protein as it transitions from the prefusion to the postfusion configuration.

Download video file (14.3MB, mp4)

Before describing the simulated events, it is valuable to discuss the analysis of energetic frustration (Ferreiro et al., 2007; Parra et al., 2016) in the Spike protein, which supports the application of an unfrustrated model. Frustration analysis of the prostfusion structure indicates there is a low degree of frustration in HR1, HG, L and most of HR2 (Appendix 1—figure 3). In contrast, for the prefusion conformation, there is a higher degree of frustration in HR1 and HR2. This suggests that HR1 and HR2 are only marginally stable in the prefusion conformation after S2’ cleavage and S1 dissociation. In the structure-based model, interactions with these regions are defined to only stabilize the postfusion conformation, which is consistent with the reduced degree of frustration in the postfusion state. While this analysis supports the use of an unfrustrated model to study dynamics, we find that FP, CR and the C-terminal end of HR2 are frustrated in the postfusion conformation. For HR2, the higher level of frustration in the C-terminal region may be understood in terms of its local environment. That is, frustration analysis is based on the energetics of proteins in solvent. However, the C-terminal end of HR2 is positioned adjacent to the viral membrane, which likely introduces interactions that can stabilize its structure. This interpretation is supported by frustration analysis of other proteins, where highly-frustrated regions have been found to frequently engage in binding interactions (Ferreiro et al., 2007). Consistent with this, FP engages in host membrane interactions, again consistent with an elevated level of energetic frustration (Gorgun et al., 2021). Similarly, due to its proximity to FP, CR is also likely to engage in membrane interactions, which is suggested by the elevated level of predicted frustration. As a final note, for the postfusion conformation, CR and FP were modeled based on helical template structures, where imperfections in the structures may contribute to higher levels of predicted frustration. In summary, the majority of the postfusion structure is predicted to be minimally frustrated, which supports the use of a structure-based model. Further, as discussed in subsequent sections, the current study is focused on the dynamics of structure formation in HR1, HG, L, and HR2, whereas CR and FP remain largely disordered in the simulated events. Accordingly, any frustration in these regions that was not included in the model is not likely to influence the primary finding of the current study.

To investigate the dynamics of Spike-protein-mediated membrane fusion, we simulated thousands of transitions between the prefusion and postfusion conformations of S2 (Video 1). To describe the rearrangements, we considered the distance between HG and the viral membrane (zhead), as well as the displacement of HG parallel to the membrane (rhead; Figure 2D). The probability distribution as a function of rhead and zhead (Figure 2E) shows a clear ordering of HG rearrangements. Each simulation was initialized in the prefusion conformation (rhead=0, zhead15 nm). From there, the head moves towards the viral membrane (i.e. decreasing values of zhead), and the HR2 strands appear to enclose HG. In this ‘caged’ ensemble, the long axis of HG remains roughly perpendicular to the membrane (Appendix 1—figure 4D). During initial relaxation of HG, the fusion peptides simultaneously extend toward the host membrane (Figure 2C). After relaxation of HG, it then rotates away from its vertical orientation (increasing values of rhead; Figure 2D) by passing between two of the HR2 strands. As the head rotates (Appendix 1—figure 4E-G), the FP and CR regions are drawn toward the viral membrane. The simulations were terminated when all non-CR and non-FP residues adopted their postfusion orientations. Since the CR and FP regions were not resolved in the postfusion structure (Fan et al., 2020), the simulations describe formation of all experimentally resolved structural elements.

In simulations, the ordering of conformational events is robust to the presence of glycans. When glycans were explicitly included, there were only minor differences in the range of HG configurations that are sampled (Figure 2E vs. Appendix 1—figure 5). In both cases, HG initially relaxes towards the viral membrane before rotating towards the host as shown in Figure 2 and Appendix 1—figure 5.

Glycans induce a long-lived sterically caged intermediate

We find that glycans can reduce the kinetics of HG rearrangements by introducing a dynamic steric cage that confines HG to a position near the viral membrane (Figure 3A). This caging process gives rise to prolonged sampling of an intermediate (Figure 2E; zhead6 nm, rhead4 nm) in which the long axis of HG is roughly perpendicular to the viral membrane (Appendix 1—figure 4). The lifetime of the caged intermediate is given by τcage=τexit-τenter (Figure 3B). τenter is defined as the time at which the assembly enters the intermediate (i.e. when zhead first decreases below 6.5 nm). τexit is the time at which rhead first exceeds 5 nm, indicating the head has been displaced outside of the cage-like formation (Figure 2D). For the representative trajectory show in Figure 3B, τcage is roughly 37 µs. See Materials and methods for estimation of time units in this model.

Figure 3. Glycan-induced caging of the head domain. Glycans impede head rearrangement by introducing a steric cage.

Figure 3.

(A) Snapshot from the caged ensemble illustrates the high density of glycans surrounding the head. (B) To define the duration of each caging event (τcage=τexit-τenter), we measured zhead and rhead (Figure 2D). Based on the 2D probability distribution (Figure 2E), the system was defined as entering the cage when zhead first drops below 6.5 nm: τenter. τexit is the time at which the head moves laterally, relative to the trans-membrane region (rhead>5 nm). (C) Distribution of τcage values when glycans are present. There is an extended tail at large values (100-500μs). (D) When glycans are absent, the τcage values are narrowly distributed around short timescales.

The lifetime of the caged intermediate strongly depends on the presence of glycans. For the glycan-free system, τcage values were narrowly distributed, where τ¯cage=6.7μs (Figure 3D). When glycans are present, the distribution has a tail that extends to much larger values (100–500 µs; Figure 3C), and τ¯cage increases nearly 5-fold (29.7 µs). To isolate the origins of this effect, we repeated our simulations with subsets of glycans present. In one set of simulations, only glycans on HG were included, while the other set only included glycans on HR2 (Appendix 1—figure 6). Interestingly, the HG-glycan model exhibited timescales that were comparable to those obtained for the fully glycosylated system. In addition, the HR2-glycan model yielded timescales that were comparable to those obtained when S2 is not glycosylated (Appendix 1—figure 6). These comparisons reveal that the glycan-associated increase in excluded volume of HG is a dominant factor that determines the kinetics of interconversion between prefusion and postfusion conformations. Finally, glycans are likely to exhibit a degree of attraction under cellular conditions, which may lead to slower intramolecular diffusion. Accordingly, the predicted glycan-induced reduction in kinetics should represent a lower-bound on the influence of glycosylation.

It is important to emphasize that the apparent glycan-dependent kinetics may be fully attributed to steric effects. That is, while the protein energetics were explicitly defined to favor the postfusion conformation, glycans were not assigned energetically-preferred conformations. Rather, the potential energy of the glycans only ensured that stereochemistry and excluded volume were preserved. In addition, the excluded volume interactions were purely repulsive. Thus, the observed reduction in rate for HG motion is due solely to the excluded volume of the glycans, and not the formation of stabilizing interactions. In terms of modeling considerations, the ability to attribute this effect entirely to steric interactions means that the effect will be present in simulations with any atomic-resolution model. While the precise kinetic properties of the system will depend on the details of each model, the influence of glycan sterics that is predicted by the structure-based model will be robust to the exact energetic representation that is applied.

Glycan cage promotes host membrane capture

The simulated trajectories suggest that glycan-associated attenuation of head rearrangements can facilitate host membrane recruitment and fusion. As described above, we find that the steric composition of the glycans introduces a highly crowded environment, which can transiently cage the HG domain (Figure 3A). We additionally find that initial relaxation of HG is rapid (Figure 3B) where caging introduces a pause that allows the HR1, FP and CR regions to sample extended configurations. To describe structure formation of the HR1 region, we calculated the fraction of postfusion-specific contacts that are formed as a function of time, QHR1. Calculating the number of ‘native’ contacts formed is motivated by protein folding studies, which have shown it to be a reliable measure of structure formation (Cho et al., 2006). When glycans are absent, HG frequently exits the cage prior to reaching the postfusion structure of HR1 (QHR1=1200-1300; Figure 4B). In contrast, when glycans are present, HR1 is typically fully-formed (QHR1>1400) before HG exits the steric cage (Figure 4A). By caging HG in a position that is perpendicular to the viral membrane (Figure 3A), glycans help to ensure that the newly assembled HR1 helical coil remains directed towards the host membrane. This orientation of HR1 may serve to facilitate host membrane capture by the FP and CR regions.

Figure 4. Caging of head allows for extension of HR1 helix.

Figure 4.

(A) Distribution of QHR1 (number of postfusion-specific HR1 contacts) values when glycans are present. Distribution describes the first frame in each simulation for which the head is outside of the steric cage. In all but three simulations, nearly all HR1 contacts (>1420 of 1489) are formed upon exit of the cage. (B) Distribution when glycans are not included. When glycans are absent, it is common that HR1 is not completely formed (i.e. QHR1< 1350) prior to HG escape. (C) Representative snapshot of a caged structure in which HR1 is fully formed and extended toward the host. (D) Representative snapshot of a glycan-free case where the head escapes prior to fully forming HR1. As a result, HR1 can adopt bent configurations.

To quantify the likelihood that the Spike protein will associate with and recruit the host, we considered the extension of each fusion peptide from the viral membrane. For this, we first defined a putative host-membrane distance, dhost, which was set to discrete values (22–38 nm). Again, while the host membrane was not included in these simulations, this host-membrane distance is an indicator of the region that needs to be visited by the FP to successfully bind the host. We then determined whether the distance between the viral membrane and FP (dFP, Figure 5A) exceeded dhost for each of the three FP tails in the assembly (Figure 5B). Pcapture was then defined as the probability that at least one FP extends to the host membrane (Figure 5C). We use the notation Pcapture, since one expects that the extension of the FPs will be correlated with the probability that the Spike protein successfully captures the host cell.

Figure 5. Glycans promote host capture.

Figure 5.

(A) Snapshot of the glycosylated Spike protein with the head domain in a caged configuration (glycans not shown). Caging allows the fusion peptide tails to extend toward and engage the host membrane. dFP is the distance of the center of mass of each fusion peptide from the viral membrane surface. To calculate the probability of host capture, different values of the virion-host distance (dhost) were considered. (B) Representative simulated trajectory, showing dFP for each of the fusion peptides in a single S2 subunit. For reference, a dhost value of 30 nm is indicated by a dashed line. (C) The probability of membrane fusion is expected to be proportional to the probability that at least one tail extends to the host membrane (dFP>dhost). There is a higher probability of extending to larger dhost values when glycans are present (black vs. red curves). This is due to the glycan-induced delay of head rotation (Figure 2), which ensures the HR1 helix remains directed towards the host as the FPs sample extended configurations. (D–F) The probability that 1, 2, or 3 FPs exceed dhost. In all cases, the presence of glycans shifts the distribution to larger values of dhost, indicating an increased probability of capturing the host. This reveals a critical role for glycans during cell invasion.

We find that glycosylation of S2 significantly increases the probability that a FP will extend to the host membrane (Figure 5C). We further partitioned the capture distributions by calculating the probability that exactly 1, 2, or 3 FPs cross the host membrane (Figure 5D–F). Interestingly, we find that all distributions exhibit significant differences for 26<dhost<34 nm. In the absence of glycans, the probability of associating 3 FPs is nearly 0 in this range, while the probability increases to ∼0.1 when glycans are present (Figure 5F). Similarly, the probability that exactly 2 FPs will cross the host membrane is ∼0 for dhost>28 nm, when glycans are absent. The probability then increases to ∼0.1 when glycans are present (Figure 5E). Finally, for 26<dhost<33, the probability that one FP will cross the host membrane is increased by ∼0.2 when glycans are present (Figure 5D). To ask whether this differential extension of the FP is due to tilting of TM, we calculated the distribution of angles between the TM bundle and the viral membrane (Appendix 1—figure 2). This revealed that the TM tilting distributions were similar for the two systems, which indicates that the differential dynamics of the FPs can not be attributed to this aspect of the model. Finally, while the Spike Protein has been experimentally observed to spontaneously transition from the prefusion to postfusion configuration (Cai et al., 2020), the probabilities reported above describe events that occur when the Spike Protein is activated through host-virus interactions.

The glycan-dependent probability of host-membrane association suggests several features of the fusion process. Cryotomography imaging has revealed that the virus-host inter-membrane distance is approximately 30 nm during infection (Turoňová et al., 2020). Based on this, our simulations indicate that, if the Spike protein were not glycosylated, it is most likely that none of the FPs would associate with the host. Since Spike protein rearrangements are irreversible, these failed attempts would represent lost opportunities to infect the cell. Therefore, these simulations suggest the probability of infection would drop substantially if the Spike were not glycosylated. Consistent with this, experimental measurements have found that inhibiting the production of glycans decreases the efficiency of host cell entry (Yang et al., 2020). To assess whether the influence of glycans on FP dynamics is robust, we considered a variant of our model in which the TM helices may dissociate from each other. Specifically, the intra-TM harmonic interactions were replaced with weaker 6–12 Lennard-Jones interactions (Appendix 1—figure 7). With this modified-TM potential, we simulated 1000 transitions for the glycan-present and glycan-absent systems (2000 events, in total). These simulations show that the differential dynamics of the FPs is robust to the precise description of the TM bundle.

When glycans are absent, there is a marginal probability that only one or two FPs will reach the membrane (3D-E), where the other FPs would likely transition directly to their postfusion orientations without engaging the host. Such a process has been described as a ‘cooperative’ mechanism in fusion proteins class I, such as Hemagglutinin A, where only a fraction of the FPs anchor to the host membrane, while the remaining FPs bind to the viral membrane (Lin et al., 2014). However, when glycans are present, there is a non-negligible probability that three FPs will reach the host membrane. When all three FPs attach to the host membrane, the dynamics may be described in terms of the so-called ‘sequential’ mechanism of fusion (Lin et al., 2014). Together, these observations demonstrate how the steric contribution of glycans is critical to the mechanism and likelihood of cell invasion by SARS-CoV-2.

In terms of the mechanistic features of membrane fusion, one can expect that FP binding to a host membrane will impact the probability that subsequent FPs will also bind. To explore this point, we introduced a second flat-bottom potential (Appendix Equation 1) to describe the host membrane. Using this extended model, we simulated 1000 pre-to-post transitions for the glycosylated Spike protein. We find that the probability of unsuccessful activation (i.e. 0 FPs captured) is not affected by the host potential (Appendix 1—figure 8). However, introducing the effective host membrane potential significantly increases the probability that all three fusion peptides reach the host. This demonstrates how anchoring the first FP helps the Spike protein maintain an orientation that favors additional FP binding events. Together, this analysis suggests that the primary mode for Spike protein mediated membrane fusion is through use of a ‘sequential’ mechanism.

Discussion

The ongoing COVID-19 pandemic requires the rapid identification of molecular factors that enable infection. A necessary step during infection involves virus/cell membrane fusion, which is mediated by a major conformational change of the Spike protein. Here, we propose a mechanism where, after cleavage and dissociation of S1, sufficient time has to be made available for the fusion peptides to reach the cell membrane, before the conformational change in S2 can complete. Using all-atom models with simplified energetics, we have shown how the steric composition of post-translational modifications may introduce the delay necessary for such a mechanism to be utilized. This glycan-induced pause appears to allow for an extended window during which the fusion peptides may search for the host cell (Figure 6). In simulations that did not include glycans, the Spike protein was most likely to adopt the postfusion configuration without extending the fusion peptides towards the host. Thus, in the glycan-free case, the protein can bypass the sterically-caged intermediate, leading to failed attempts to capture the host cell. These findings suggest an interesting theoretical prediction that the precise glycan composition is a critical factor that determines transmissibility of SARS-CoV-2.

Figure 6. Schematic of fusion mechanism of the Spike protein.

Figure 6.

Initial activation of the Spike protein (left) is associated with release of S1, which is triggered by cleavage at the S2’ site and ACE2 receptor binding. When glycans are present (top), HG will enter a caged ensemble where the FPs search for, and capture, the host membrane. HG can then escape the cage, which draws the viral and host membranes together and leads to fusion. In the absence of glycans (bottom), HG can bypass the caged ensemble, resulting in a failed attempt to recruit the host.

The current predictions suggest that the steric composition of the Spike protein and glycans can guide the global dynamics of host-membrane capture. While these results are intuitive, it is possible that non-specific stabilizing interactions (i.e. not found in the pre, or post, fusion conformations) can have a notable influence on the rearrangement. For example, there may be specific long-lived non-native interactions that can transiently maintain the orientation of the HG region and facilitate FP capture of the host. Another limitation of the current analysis is that our model describes interactions between the Spike and effective membrane regions in terms of a short-range effect. If long-range electrostatic interactions dominate the FP-membrane association process, then it is possible that an alternate sequence of events may be observed. Finally, another avenue for further study would be to consider so-called ‘multi-basin’ structure-based models (Whitford et al., 2007). In such approaches, each element of the Spike protein would have interactions that stabilize both the pre and postfusion conformations. This would allow one to identify the influence of competing stabilizing interactions that may impede the ‘downhill’ dynamics of the current model. For example, introducing prefusion contacts in the HR2 region would likely delay entry of HG into the cage. Similarly, prefusion interactions could extend the time required for the FPs to initially dissociate from S2. With these open questions in mind, it will be interesting to see the extent to which various factors can enhance or attenuate the steric signatures that are described in the current study. In this context, the presented simulations provide a foundation for understanding and quantifying the relative contributions of each biophysical factor during this large-scale motion.

With the range of possible contributors to Spike protein dynamics, there are clear opportunities for novel experiments to reveal the precise influence of glycosylation on Spike protein mediated cell entry. To test the predictions of the current study, one may consider applying site-specific mutations, in order to inhibit glycan binding at individual residues. These altered Spike protiens could then be integrated into pseudovirus particles (Yang et al., 2020), which would allow one to measure the impact of specific glycans on the ability of the Spike to associate with a cell. If sterics dominate the dynamics, as predicted by the structure-based model, then mutations to HG glycan sites should significantly reduce the probability of membrane capture.

In addition to providing immediate insights into the influence of glycans on Spike protein dynamics, the current simulations establish a foundation for experimentally and theoretically investigating other factors that may influence cell invasion. To give one example, the presented models may be extended to account for electrostatic and solvation effects. With ongoing advances in high-performance computing, combined with the relatively low computational cost associated with these models, many variations may be explored in the coming months that will help elucidate the full range of factors that control this deadly pathogen.

Materials and methods

Structural modeling of the spike protein

Since complete structures of the full-length SARS-CoV-2 Spike protein have not been resolved experimentally, for either the prefusion or postfusion states, structural modeling steps were applied prior to performing simulations. For this, we used a cryo-EM structure of the prefusion assembly (PDB ID: 6VXX [Walls et al., 2020]), which lacks residues 828–853 and 1148–1233 (found in HG, and the HR2 and TM regions). For the postfusion state, we used a structural model of the SARS-CoV-1 system (PDB ID: 6M3W [Fan et al., 2020]) as a template for constructing a homology model of the SARS-CoV-2 system. However, the postfusion model was lacking residues 772–918 and 1197–1233 (FP, CR and TM). In addition, since available structures only partially resolved the base of each glycan, we constructed structural models of the complete glycans for both states (pre and post). For the prefusion structure, models of the TM and HR2 regions were constructed using the homology modeling webtool of SWISS-MODEL (Waterhouse et al., 2018). Consistent with the study of Casalino et al., 2020, both regions were modeled as coiled coils, where the sequence was assigned Uniprot (UniProt Consortium, 2019) sequence P0DTC2-1. This was accomplished with the automodel module of Modeller 9.24 (Sali and Blundell, 1993), with restraints included to preserve symmetry. The resulting model was threefold symmetric, where the RMSD between monomers was less than 1 Å. For the postfusion structure, unresolved residues in FP and CR were modeled as helical regions, using the automodel module of Modeller 9.24 with symmetry restraints imposed. CR and FP were modeled as coiled coils connected by short disordered loops. Homology models were constructed based on the structure of a coiled coil template (PDB ID: 2WPQ [Hartmann et al., 2009]). The TM strands were assigned alpha helical structures using Modeller 9.24. As a note, the postfusion configuration of the TM region was not used to define any aspect of the structure-based model.

Glycans were added to both structural models using the Glycan Reader Charmm server (Park et al., 2019). The same glycan composition was used as in other recent studies (Casalino et al., 2020; Watanabe et al., 2020a). A complete list of modeled glycans can be found in Appendix table 1. The glycosylated structural models of the prefusion and postfusion systems that were generated in this study are provided in the Supplementary Material.

All-atom structure-based model

All simulations employed an all-atom structure-based model to describe the Spike protein, with additional restraints imposed on the TM region, as well as an effective viral membrane potential. To describe the energetics of the protein, a structure-based model was constructed based on the postfusion model, using the default parameters in SMOG 2 (described in Noel et al., 2016). Several modifications were introduced to the force field, as described below. Non-default parameters were assigned for bond lengths and angles, as well as planar dihedrals. Rather than using the values found in the cryo-EM structure, bond lengths and angles were given the values found in the Amber03 force field (Duan et al., 2003). The strengths of non-planar dihedrals and contacts were consistent with earlier implementations of the structure-based model (Noel et al., 2016), these interactions are further described in the SI (Appendix Equation 2). Contacts were identified using the Shadow algorithm (Noel et al., 2012). A complete description of this variant of the model is described in Whitford et al., 2020. Force field definition files, which include glycans, are available for download at https://smog-server.org (SMOG2 force field repository ID: AA_glycans_Dodero21.v1). To ensure that the TM region remains in a helical bundle arrangement, contacts in the TM region were replaced by harmonic interactions, with distances taken from the prefusion conformation. To mimic the presence of a viral membrane, a flat-bottom potential shown in Appendix Equation 1 was imposed on the TM region to limit the movement to be inside the putative membrane region. Also, to avoid non-TM residues from crossing the effective membrane, a repulsive inverted harmonic flat bottom potential, beginning at the putative position of the viral membrane surface, was applied to atoms in HG. The potential was set to 0 at zhead=2 and the harmonic constant was set to two reduced energy units per nm2.

MD simulations

All simulations were performed using the GROMACS software package (v2020.2) (Lindahl et al., 2001; Hess et al., 2008) with source code modifications to implement the Gaussian-based flat bottom potential (Appendix Equation 1 and Appendix 1—figure 9). Input files for Gromacs were generated using the SMOG 2 software package (Noel et al., 2016), while additional in-house scripts were used to subsequently modify the force field. Simulations of seven different systems were performed: glycan-free, fully-glycosylated, HR2-glycosylated, HG-glycosylated, glycan-free with anharmonic TM interactions, glycans present with anharmonic TM interactions and glycans present with an effective host membrane potential. A total of 1000 transitions between the prefusion and postfusion structures were simulated for each system/model (7000 simulations, in total). Each system was first energy minimized using steepest descent energy minimization. Simulations were then performed using Langevin Dynamics protocols, with a reduced temperature of 0.58 (70 Gromacs units). In preliminary simulations, it was found that the assembly begins to unfold at a temperature of around 0.8. A timestep of 0.002 was used, and each simulation was continued until rhead reached a value greater than 8 nm, which indicated that HG had escaped from the HR1 cage. To estimate the effective simulated timescale, we use the conversion factor of 1 reduced unit being equivalent to 1 ns (Yang et al., 2019), which was previously obtained based on the comparison of diffusion coefficients in a SMOG model and explicit-solvent simulations.

Structural metrics

The following coordinates were used to describe the global rearrangement of the Spike protein:

  • zhead : To calculate zhead, the vector between the centers of mass of TM (residues 1203–1233) and HG (residues 1033–1129) was calculated and then decomposed into components that are perpendicular and parallel to the membrane plane. zhead is the component that is perpendicular to the plane.

  • rhead : To calculate rhead, the vector between the centers of mass of TM (residues 1203–1233) and HG (residues 1033–1129) was calculated and then decomposed into components that are perpendicular and parallel to the membrane plane. rhead is the component that is parallel to the plane.

  • θ : Angle formed between the first principal axis of HG (residues 1033–1129) and the vector normal to the membrane. A value of 0 indicates that the HG is perpendicular to the viral membrane.

  • QHR1: Number of postfusion-specific contacts formed (within 1.2 times the distance in the post-fusion conformation).

Frustration analysis

The frustration analysis was performed using the Frustratometer Web Server (Parra et al., 2016). The analysis reports the degree of frustration around each residue. For two atoms to be defined as a contact, their distance must be less than 5Å. The degree of frustration is calculated based on energetic profiles obtained with the AWSEM force field (Davtyan et al., 2012).

Acknowledgements

Work at the Center for Theoretical Biological Physics was supported by the NSF (Grant PHY-2019745). JNO was supported by the National Science Foundation (NSF) grants CHE-1614101 and PHY-1522550 and by the Welch Foundation (Grant C-1792). PCW was supported by NSF grant MCB-1915843. JNO is a Cancer Prevention Research in Texas Scholar in Cancer Research. We also acknowledge generous support from the Northeastern University Discovery cluster and Northeastern University Research Computing staff. Also, we are grateful for generous computational resources and support provided by the AMD COVID-19 HPC Fund program.

Appendix 1

Results

TM tilt angle distributions

In the virtual membrane model used in this study, the TM region is restrained to a 5-nm-thick region, centered about the x-y plane. Since the TM is free to move along the plane, it is possible that the TM region can exhibit transient tilting, with respect the z direction. To address the potential effect of TM tilt on the overall character of the pre-to-post transition, we defined the tilt in terms of the angle formed by the z-axis and the axis of the TM. The TM axis is defined as the vector connecting the center of mass of the Cα atoms of residues 1213 (one in each chain) and the center of mass of the the Cα atoms of residues 1233. Then we computed the distribution of tilt angles for the glycan-present and glycan-absent simulations (Appendix 1—figure 2).

We found that the tilt angle distributions for both sets of simulations are similar, indicating that the effect of the TM tilt will be comparable for the glycosylated or deglycosylated systems. Accordingly, TM tilting does not contribute to the observed differential extension of the FPs. In addition, it is important to note that the tilt angle is less than 40° for more than 90% of the sample configurations. Taking into account the length of the TM region (∼2.6 nm), a tilt of 40° would only account for a retraction of the Spike by ∼0.6 nm. As apparent in Figure 5 of the main text, independent of the effect of tilting, the simulated Spike protein would still reach physiological relevant distances (∼30 nm).

Effect of a virtual host membrane on FP capture probabilities

It may be expected that after one FP binds to the host membrane, the other FPs will be more likely to bind as well. In order to quantify this effect, we implemented a virtual host membrane potential that can capture each FP. The host membrane was modeled as a flat-bottom potential (Appendix Equation S1) of width ∼5 nm. Since recent computational studies (Gorgun et al., 2021) suggest that host membrane binding is irreversible, we set the depth of the potential acting on each atom to 4 kBT. Since this is stronger than the accessible thermal energy, the simulated capture events were observed to occur irreversibly.

After introducing this potential, we simulated 1000 pre-to-post transitions of the glycosylated Spike protein. As a control, we first verified that the inclusion of the host membrane potential did not affect the probability that the Spike will misfire (i.e. 0 FPs captured). As shown in Appendix 1—figure 8, the probability that the Spike fails to capture the host membrane is comparable when the membrane potential is absent or present. With regards to the number of FPs that are captured in successful events, we find that the presence of the virtual host membrane substantially increases the number of times that 3 FPs reach the host.

Methods

Membrane flat-bottom potential

V(z)={K[1exp((zwm2)22σ2)],wm2<z0,wm2<z<wm2K[1exp((z+wm2)22σ2)],z<wm2 (S1)

Here, z is the distance of the center of mass of TM from the center of the effective membrane. wm represents the membrane width (5 nm), and the depth of the potential K was set to 2 reduced energy units (4 kBT), while σ=0.2 nm.

Single-basin structure-based model potential

V=bondsϵr2(rijrij,0)2+anglesϵθ2(θijkθijk,0)2++impropersϵχimp2(χijklχijkl,0)2+planarϵχplanar2FP(φijkl)++backboneϵbbFD(ϕijklϕijkl,0)+sidechainsϵscFD(ϕijklϕijkl,0)++contactsϵC[(σijrij)122(σijrij)6]+noncontactsϵnc(σncrij)12. (S2)

Here, we define FD(ϕ)=[1-cos(ϕ)]+12[1-cos(3ϕ)], as used in earlier implementations of structure-based models (Noel et al., 2016), and FP(φ)=[1cos(2φ)]. The bonded parameters, such as bond lengths and angles (rij,0 and θijk,0), were obtained from the AMBER03 force field (Duan et al., 2003). The planar dihedrals were maintained by cosine potentials of periodicity 2. All non-planar dihedral angles were assigned 1–3 periodicity cosine dihedral potentials (FD) with minima corresponding to the postfusion conformation ϕijkl,0. Stabilizing 6–12 Lennard-Jones interactions were introduced for all atom pairs that are in contact in the postfusion conformation, with minima set to the distances found in the postfusion conformation σij.

Generated structural models

The structural models of the complete glycosylated systems are included as SI documents.

prefusion model The prefusion model is included in the file prefusion_Spike_DoderoRojas.pdb.

postfusion model The postfusion model is included in the file postfusion_Spike_DoderoRojas.pdb.

Appendix 1—table 1. N-glycan listing.

Complete list of N-glycans included in the simulations.

N706 aDMan(1→6)[aDMan(1→3)]aDMan(1→6)[aDMan(1→3)]
bDMan(1→4)bDGlcNAc(1→4)bDGlcNAc(1→)
N717 aDMan(1→6)[aDMan(1→3)]aDMan(1→6)[aDMan(1→2)aDMan(1→3)]
bDMan(1→4)bDGlcNAc(1→4)bDGlcNAc(1→)
N801 aDMan(1→6)[aDMan(1→3)]aDMan(1→6)[aDMan(1→3)]
bDMan(1→4)bDGlcNAc(1→4)bDGlcNAc(1→)
N1074 aDMan(1→6)[aDMan(1→3)]aDMan(1→6)[aDMan(1→3)]
bDMan(1→4)bDGlcNAc(1→4)bDGlcNAc(1→)
N1098 aDNeu5Ac(2→6)bDGal(1→4)bDGlcNAc(1→2)aDMan(1→3)[aDMan(1→6)
[aDMan(1→3)]aDMan(1→6)]bDMan(1→4)bDGlcNAc(1→4)bDGlcNAc(1→)
N1134 bDGlcNAc(1→2)aDMan(1→6)[bDGlcNAc(1→2)aDMan(1→3)]bDMan(1→4)
bDGlcNAc(1→4)[aLFuc(1→6)]bDGlcNAc(1→)
N1158 bDGlcNAc(1→2)aDMan(1→6)[bDGlcNAc(1→2)aDMan(1→3)]bDMan(1→4)
bDGlcNAc(1→4)bDGlcNAc(1→)
N1173 bDGlcNAc(1→6)[bDGlcNAc(1→2)]aDMan(1→6)[bDGlcNAc(1→4)[bDGlcNAc(1→2)]
aDMan(1→3)]bDMan(1→4)bDGlcNAc(1→4)[aLFuc(1→6)]bDGlcNAc(1→)
N1198 aDNeu5Ac(2→6)bDGal(1→4)bDGlcNAc(1→6)[bDGal(1→4)bDGlcNAc(1→2)]
aDMan(1→6)[bDGal(1→4)bDGlcNAc(1→4)[bDGal(1→4)bDGlcNAc(1→2)]
aDMan(1→3)]bDMan(1→4)bDGlcNAc(1→4)[aLFuc(1→6)]bDGlcNAc(1→)
Appendix 1—figure 1. Definitions of domains within the S2 protein.

Appendix 1—figure 1.

(A) Prefusion S2 subunit structure of the Spike protein. (B) Postfusion S2 subunit structure of the Spike protein. (C) Sequence range of the Head Group (HG), Fusion Peptide (FP), Connecting Region (CR), Heptad Repeat 1 (HR1), Linker Region (LR), Heptad Repeat 2 (HR2), and Transmembrane Region (TM).

Appendix 1—figure 2. TM tilt angle distributions.

Appendix 1—figure 2.

(A) Distribution of TM tilt angles (defined in SI results section 1.1) sampled during simulations when glycans are present. (B) Distribution of TM tilt angles sampled during simulations when glycans are absent.

Appendix 1—figure 3. Predicted degree of frustration, by residue.

Appendix 1—figure 3.

Density of highly frustrated contacts in a 5Å sphere per residue for prefusion (black) and postfusion (red) S2 subunit structures. Dashed line represents the S2’ cleavage site.

Appendix 1—figure 4. HG rotation.

Appendix 1—figure 4.

(A-C) Single time trace of zhead, rhead and the HG principal axis polar angle, θ. (D–G) Snapshots of the orientation of HG, relative to the membrane. During the prefusion-to-postfusion transition, the head rotates from an orientation in which it is pointing toward the membrane, to an orientation where it is pointing away. Structural snapshots illustrate various orientations during the transition.

Appendix 1—figure 5. Probability distribution when glycans are absent.

Appendix 1—figure 5.

Distribution calculated from 1000 independent simulations without glycans.

Appendix 1—figure 6. Relative influence of glycans on HR2 and HG.

Appendix 1—figure 6.

(A) Structural model with only glycans shown on HG. (B) Structural model with only HR2 glycans present. (C) Distribution of timescales with only HG glycans present. (D) Distribution with only HR2 glycans present. (E) Probability that dFP>dhost for at least one FP. There is a higher probability of extending to dhost when only HG glycans are present than when only HR2 glycans are present (black vs. red curves).

Appendix 1—figure 7. Glycans promote host capture with dissociated TM region.

Appendix 1—figure 7.

(A-D) Even in the case where the TM strands are able to dissociate, the presence of the glycans increases the probability that the FPs will capture the host membrane. 1000 transitions were simulated for each system. (E–F) Snapshots of glycosylated Spike protein where the TM strands dissociate from one another (glycans are not shown).

Appendix 1—figure 8. Comparison of probability of FP capture with, and without, a host membrane potential.

Appendix 1—figure 8.

Probability of capture, calculated from 1000 simulated transitions. (A) Model with no host membrane, using dhost=27 to define capture. (B) Model with host membrane potential as defined in Equation S1, which traps (i.e. potential begins to decrease) FPs at ∼27 nm from the virtual viral membrane. Both (A) and (B) show approximately the same probability of 0 FPs being captured, while (B) shows a drastic increase on the probability that capture will involve all three FPs.

Appendix 1—figure 9. Effective potential for TM confinement in a virtual viral membrane.

Appendix 1—figure 9.

The flat-bottom region represents the virtual membrane of width wm=5 nm, this region allows the TM motif to move freely between the planes z=wm2 and z=-wm2, beyond the flat region an energetic penalty is included to restrain the TM to escape the virtual membrane. Energy is measured in reduced energy units.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Jose N Onuchic, Email: jonuchic@rice.edu.

Paul Charles Whitford, Email: p.whitford@northeastern.edu.

Donald Hamelberg, Georgia State University, United States.

José D Faraldo-Gómez, National Heart, Lung and Blood Institute, National Institutes of Health, United States.

Funding Information

This paper was supported by the following grants:

  • National Science Foundation CHE-1614101 to Jose N Onuchic.

  • National Science Foundation PHY-1522550 to Jose N Onuchic.

  • Welch Foundation C-1792 to Jose N Onuchic.

  • National Science Foundation MCB-1915843 to Paul Charles Whitford.

  • National Science Foundation PHY-2019745 to Jose N Onuchic.

  • Cancer Prevention and Research Institute of Texas to Jose N Onuchic.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Formal analysis, Methodology, Writing - original draft, Writing - review and editing.

Conceptualization, Supervision, Methodology, Writing - review and editing.

Conceptualization, Formal analysis, Supervision, Methodology, Writing - original draft, Writing - review and editing.

Additional files

Supplementary file 1. Structural Model of Prefusion Structure.
elife-70362-supp1.pdb (1.1MB, pdb)
Supplementary file 2. Structural Model of Postfusion Structure.
elife-70362-supp2.pdb (1.1MB, pdb)
Transparent reporting form

Data availability

The current manuscript is a computational study. Simulations were prepared with SMOG 2 (free, open-source), which is available at smog-server.org. Force field templates for SMOG 2 are available for download through the SMOG 2 Force Field Repository on the smog-server page. Simulations were performed using Gromacs (free, open-source).

References

  1. Ali A, Vijayan R. Dynamics of the ACE2–SARS-CoV-2/SARS-CoV spike protein interface reveal unique mechanisms. Scientific Reports. 2020;10:14214. doi: 10.1038/s41598-020-71188-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Basso LG, Vicente EF, Crusca E, Cilli EM, Costa-Filho AJ. SARS-CoV fusion peptides induce membrane surface ordering and curvature. Scientific Reports. 2016;6:37131. doi: 10.1038/srep37131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bryngelson JD, Onuchic JN, Socci ND, Wolynes PG. Funnels, pathways, and the energy landscape of protein folding: a synthesis. Proteins: Structure, Function, and Genetics. 1995;21:167–195. doi: 10.1002/prot.340210302. [DOI] [PubMed] [Google Scholar]
  4. Bryngelson JD, Wolynes PG. Intermediates and barrier crossing in a random energy model (with applications to protein folding) The Journal of Physical Chemistry. 1989;93:6902–6915. doi: 10.1021/j100356a007. [DOI] [Google Scholar]
  5. Cai Y, Zhang J, Xiao T, Peng H, Sterling SM, Walsh RM, Rawson S, Rits-Volloch S, Chen B. Distinct conformational states of SARS-CoV-2 spike protein. Science. 2020;369:1586–1592. doi: 10.1126/science.abd4251. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Casalino L, Gaieb Z, Goldsmith JA, Hjorth CK, Dommer AC, Harbison AM, Fogarty CA, Barros EP, Taylor BC, McLellan JS, Fadda E, Amaro RE. Beyond shielding: the roles of glycans in the SARS-CoV-2 spike protein. ACS Central Science. 2020;6:1722–1734. doi: 10.1021/acscentsci.0c01056. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Chan HS, Zhang Z, Wallin S, Liu Z. Cooperativity, local-nonlocal Coupling, and nonnative interactions: principles of protein folding from coarse-grained models. Annual Review of Physical Chemistry. 2011;62:301–326. doi: 10.1146/annurev-physchem-032210-103405. [DOI] [PubMed] [Google Scholar]
  8. Cho SS, Levy Y, Wolynes PG. P versus Q: structural reaction coordinates capture protein folding on smooth landscapes. PNAS. 2006;103:586–591. doi: 10.1073/pnas.0509768103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Clementi C, Nymeyer H, Onuchic JN. Topological and energetic factors: what determines the structural details of the transition state ensemble and "en-route" intermediates for protein folding? An investigation for small globular proteins. Journal of Molecular Biology. 2000;298:937–953. doi: 10.1006/jmbi.2000.3693. [DOI] [PubMed] [Google Scholar]
  10. Davtyan A, Schafer NP, Zheng W, Clementi C, Wolynes PG, Papoian GA. AWSEM-MD: protein structure prediction using coarse-grained physical potentials and bioinformatically based local structure biasing. The Journal of Physical Chemistry B. 2012;116:8494–8503. doi: 10.1021/jp212541y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Di Pierro M, Potoyan DA, Wolynes PG, Onuchic JN. Anomalous diffusion, spatial coherence, and viscoelasticity from the energy landscape of human chromosomes. PNAS. 2018;115:7753–7758. doi: 10.1073/pnas.1806297115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Duan Y, Wu C, Chowdhury S, Lee MC, Xiong G, Zhang W, Yang R, Cieplak P, Luo R, Lee T, Caldwell J, Wang J, Kollman P. A point-charge force field for molecular mechanics simulations of proteins based on condensed-phase quantum mechanical calculations. Journal of Computational Chemistry. 2003;24:1999–2012. doi: 10.1002/jcc.10349. [DOI] [PubMed] [Google Scholar]
  13. Fan X, Cao D, Kong L, Zhang X. Cryo-EM analysis of the post-fusion structure of the SARS-CoV spike glycoprotein. Nature Communications. 2020;11:3618. doi: 10.1038/s41467-020-17371-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Ferreiro DU, Hegler JA, Komives EA, Wolynes PG. Localizing frustration in native proteins and protein assemblies. PNAS. 2007;104:19819–19824. doi: 10.1073/pnas.0709915104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Gorgun D, Lihan M, Kapoor K, Tajkhorshid E. Binding mode of SARS-CoV-2 fusion peptide to human cellular membrane. Biophysical Journal. 2021;120:2914–2926. doi: 10.1016/j.bpj.2021.02.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Gosavi S, Chavez LL, Jennings PA, Onuchic JN. Topological frustration and the folding of interleukin-1 beta. Journal of Molecular Biology. 2006;357:986–996. doi: 10.1016/j.jmb.2005.11.074. [DOI] [PubMed] [Google Scholar]
  17. Hartmann MD, Ridderbusch O, Zeth K, Albrecht R, Testa O, Woolfson DN, Sauer G, Dunin-Horkawicz S, Lupas AN, Alvarez BH. A coiled-coil motif that sequesters ions to the hydrophobic core. PNAS. 2009;106:16950–16955. doi: 10.1073/pnas.0907256106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Hess B, Kutzner C, van der Spoel D, Lindahl E. GROMACS 4: Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation. Journal of Chemical Theory and Computation. 2008;4:435–447. doi: 10.1021/ct700301q. [DOI] [PubMed] [Google Scholar]
  19. Hyeon C, Thirumalai D. Capturing the essence of folding and functions of biomolecules using coarse-grained models. Nature Communications. 2011;2:487. doi: 10.1038/ncomms1481. [DOI] [PubMed] [Google Scholar]
  20. Lan J, Ge J, Yu J, Shan S, Zhou H, Fan S, Zhang Q, Shi X, Wang Q, Zhang L, Wang X. Structure of the SARS-CoV-2 spike receptor-binding domain bound to the ACE2 receptor. Nature. 2020;581:215–220. doi: 10.1038/s41586-020-2180-5. [DOI] [PubMed] [Google Scholar]
  21. Lenza MP, Oyenarte I, Diercks T, Quintana JI, Gimeno A, Coelho H, Diniz A, Peccati F, Delgado S, Bosch A, Valle M, Millet O, Abrescia NGA, Palazón A, Marcelo F, Jiménez-Osés G, Jiménez-Barbero J, Ardá A, Ereño-Orbea J. Structural characterization of N-Linked glycans in the receptor binding domain of the SARS-CoV-2 spike protein and their interactions with human lectins. Angewandte Chemie International Edition. 2020;59:23763–23771. doi: 10.1002/anie.202011015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Letko M, Marzi A, Munster V. Functional assessment of cell entry and receptor usage for SARS-CoV-2 and other lineage B betacoronaviruses. Nature Microbiology. 2020;5:562–569. doi: 10.1038/s41564-020-0688-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Levi M, Noel JK, Whitford PC. Studying ribosome dynamics with simplified models. Methods. 2019;162-163:128–140. doi: 10.1016/j.ymeth.2019.03.023. [DOI] [PubMed] [Google Scholar]
  24. Levi M, Walak K, Wang A, Mohanty U, Whitford PC. A steric gate controls P/E hybrid-state formation of tRNA on the ribosome. Nature Communications. 2020;11:5706. doi: 10.1038/s41467-020-19450-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Lin X, Eddy NR, Noel JK, Whitford PC, Wang Q, Ma J, Onuchic JN. Order and disorder control the functional rearrangement of influenza hemagglutinin. PNAS. 2014;111:12049–12054. doi: 10.1073/pnas.1412849111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Lindahl E, Hess B, van der Spoel D. GROMACS 3.0: a package for molecular simulation and trajectory analysis. Journal of Molecular Modeling. 2001;7:306–317. doi: 10.1007/s008940100045. [DOI] [Google Scholar]
  27. Lindorff-Larsen K, Piana S, Dror RO, Shaw DE. How fast-folding proteins fold. Science. 2011;334:517–520. doi: 10.1126/science.1208351. [DOI] [PubMed] [Google Scholar]
  28. Malaquias MAS, Gadotti AC, Motta-Junior JDS, Martins APC, Azevedo MLV, Benevides APK, Cézar-Neto P, Panini do Carmo LA, Zeni RC, Raboni SM, Fonseca AS, Machado-Souza C, Moreno-Amaral AN, de Noronha L. The role of the lectin pathway of the complement system in SARS-CoV-2 lung injury. Translational Research. 2021;231:55–63. doi: 10.1016/j.trsl.2020.11.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Nguyen K, Whitford PC. Steric interactions lead to collective tilting motion in the ribosome during mRNA-tRNA translocation. Nature Communications. 2016;7:10586. doi: 10.1038/ncomms10586. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Noel JK, Whitford PC, Onuchic JN. The shadow map: a general contact definition for capturing the dynamics of biomolecular folding and function. The Journal of Physical Chemistry B. 2012;116:8692–8702. doi: 10.1021/jp300852d. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Noel JK, Levi M, Raghunathan M, Lammert H, Hayes RL, Onuchic JN, Whitford PC. SMOG 2: a versatile software package for generating Structure-Based models. PLOS Computational Biology. 2016;12:e1004794. doi: 10.1371/journal.pcbi.1004794. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Park SJ, Lee J, Qi Y, Kern NR, Lee HS, Jo S, Joung I, Joo K, Lee J, Im W. CHARMM-GUI glycan modeler for modeling and simulation of carbohydrates and glycoconjugates. Glycobiology. 2019;29:320–331. doi: 10.1093/glycob/cwz003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Parra RG, Schafer NP, Radusky LG, Tsai MY, Guzovsky AB, Wolynes PG, Ferreiro DU. Protein frustratometer 2: a tool to localize energetic frustration in Protein Molecules, now with electrostatics. Nucleic Acids Research. 2016;44:W356–W360. doi: 10.1093/nar/gkw304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Roy S, Jaiswar A, Sarkar R. Dynamic asymmetry exposes 2019-nCoV prefusion spike. The Journal of Physical Chemistry Letters. 2020;11:7021–7027. doi: 10.1021/acs.jpclett.0c01431. [DOI] [PubMed] [Google Scholar]
  35. Sali A, Blundell TL. Comparative protein modelling by satisfaction of spatial restraints. Journal of Molecular Biology. 1993;234:779–815. doi: 10.1006/jmbi.1993.1626. [DOI] [PubMed] [Google Scholar]
  36. Schlick T, Zhu Q, Jain S, Yan S. Structure-Altering mutations of the SARS-CoV-2 frame shifting RNA element. bioRxiv. 2020 doi: 10.1101/2020.08.28.271965. [DOI] [PMC free article] [PubMed]
  37. Shang J, Wan Y, Luo C, Ye G, Geng Q, Auerbach A, Li F. Cell entry mechanisms of SARS-CoV-2. PNAS. 2020;117:11727–11734. doi: 10.1073/pnas.2003138117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Shea JE, Onuchic JN, Brooks CL. Exploring the origins of topological frustration: design of a minimally frustrated model of fragment B of protein A. PNAS. 1999;96:12512–12517. doi: 10.1073/pnas.96.22.12512. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Shin D, Mukherjee R, Grewe D, Bojkova D, Baek K, Bhattacharya A, Schulz L, Widera M, Mehdipour AR, Tascher G, Geurink PP, Wilhelm A, van der Heden van Noort GJ, Ovaa H, Müller S, Knobeloch KP, Rajalingam K, Schulman BA, Cinatl J, Hummer G, Ciesek S, Dikic I. Papain-like protease regulates SARS-CoV-2 viral spread and innate immunity. Nature. 2020;587:657–662. doi: 10.1038/s41586-020-2601-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Turoňová B, Sikora M, Schürmann C, Hagen WJH, Welsch S, Blanc FEC, von Bülow S, Gecht M, Bagola K, Hörner C, van Zandbergen G, Landry J, de Azevedo NTD, Mosalaganti S, Schwarz A, Covino R, Mühlebach MD, Hummer G, Krijnse Locker J, Beck M. In situ structural analysis of SARS-CoV-2 spike reveals flexibility mediated by three hinges. Science. 2020;370:203–208. doi: 10.1126/science.abd5223. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. UniProt Consortium UniProt: a worldwide hub of protein knowledge. Nucleic Acids Research. 2019;47:D506–D515. doi: 10.1093/nar/gky1049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Verkhivker GM. Molecular simulations and network modeling reveal an allosteric signaling in the SARS-CoV-2 spike proteins. Journal of Proteome Research. 2020;19:4587–4608. doi: 10.1021/acs.jproteome.0c00654. [DOI] [PubMed] [Google Scholar]
  43. Walls AC, Park YJ, Tortorici MA, Wall A, McGuire AT, Veesler D. Structure, function, and antigenicity of the SARS-CoV-2 spike glycoprotein. Cell. 2020;181:281–292. doi: 10.1016/j.cell.2020.02.058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Wang Q, Zhang Y, Wu L, Niu S, Song C, Zhang Z, Lu G, Qiao C, Hu Y, Yuen KY, Wang Q, Zhou H, Yan J, Qi J. Structural and functional basis of SARS-CoV-2 entry by using human ACE2. Cell. 2020;181:894–904. doi: 10.1016/j.cell.2020.03.045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Watanabe Y, Allen JD, Wrapp D, McLellan JS, Crispin M. Site-specific glycan analysis of the SARS-CoV-2 spike. Science. 2020a;369:330–333. doi: 10.1126/science.abb9983. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Watanabe Y, Berndsen ZT, Raghwani J, Seabright GE, Allen JD, Pybus OG, McLellan JS, Wilson IA, Bowden TA, Ward AB, Crispin M. Vulnerabilities in coronavirus glycan shields despite extensive glycosylation. Nature Communications. 2020b;11:2688. doi: 10.1038/s41467-020-16567-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, Gumienny R, Heer FT, de Beer TAP, Rempfer C, Bordoli L, Lepore R, Schwede T. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Research. 2018;46:W296–W303. doi: 10.1093/nar/gky427. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Whitford PC, Miyashita O, Levy Y, Onuchic JN. Conformational transitions of adenylate kinase: switching by cracking. Journal of Molecular Biology. 2007;366:1661–1671. doi: 10.1016/j.jmb.2006.11.085. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Whitford PC, Noel JK, Gosavi S, Schug A, Sanbonmatsu KY, Onuchic JN. An all-atom structure-based potential for proteins: bridging minimal models with all-atom empirical forcefields. Proteins: Structure, Function, and Bioinformatics. 2009;75:430–441. doi: 10.1002/prot.22253. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Whitford PC, Jiang W, Serwer P. Simulations of phage T7 capsid expansion reveal the role of molecular sterics on dynamics. Viruses. 2020;12:1273. doi: 10.3390/v12111273. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Whitford PC, Onuchic JN. What protein folding teaches Us about biological function and molecular machines. Current Opinion in Structural Biology. 2015;30:57–62. doi: 10.1016/j.sbi.2014.12.003. [DOI] [PubMed] [Google Scholar]
  52. Wrapp D, Wang N, Corbett KS, Goldsmith JA, Hsieh CL, Abiona O, Graham BS, McLellan JS. Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation. Science. 2020;367:1260–1263. doi: 10.1126/science.abb2507. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Xia S, Liu M, Wang C, Xu W, Lan Q, Feng S, Qi F, Bao L, Du L, Liu S, Qin C, Sun F, Shi Z, Zhu Y, Jiang S, Lu L. Inhibition of SARS-CoV-2 (previously 2019-nCoV) infection by a highly potent pan-coronavirus fusion inhibitor targeting its spike protein that harbors a high capacity to mediate membrane fusion. Cell Research. 2020;30:343–355. doi: 10.1038/s41422-020-0305-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Yang H, Bandarkar P, Horne R, Leite VBP, Chahine J, Whitford PC. Diffusion of tRNA inside the ribosome is position-dependent. The Journal of Chemical Physics. 2019;151:085102. doi: 10.1063/1.5113814. [DOI] [PubMed] [Google Scholar]
  55. Yang Q, Hughes TA, Kelkar A, Yu X, Cheng K, Park S, Huang WC, Lovell JF, Neelamegham S. Inhibition of SARS-CoV-2 viral entry upon blocking N- and O-glycan elaboration. eLife. 2020;9:e61552. doi: 10.7554/eLife.61552. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Yuan M, Wu NC, Zhu X, Lee CD, So RTY, Lv H, Mok CKP, Wilson IA. A highly conserved cryptic epitope in the receptor binding domains of SARS-CoV-2 and SARS-CoV. Science. 2020;368:630–633. doi: 10.1126/science.abb7269. [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision letter

Editor: Donald Hamelberg1
Reviewed by: Kei-ichi OKAZAKI2

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Acceptance summary:

This paper describes a vital step in the infection cycle of the Covid-19 virus, providing useful insights into how the virus enters the cell and the importance of the modification to viral proteins with glycans. Despite the use of a simplified model to describe the system, this study provides a thorough examination of the conformational changes of the Covid-19 Spike protein, knowledge that could be exploited for drug design purposes and would otherwise have been impossible to obtain with a more detailed model.

Decision letter after peer review:

Thank you for submitting your article "Sterically-Connned Rearrangements of SARS-CoV-2 Spike Protein Control Cell Invasion" for consideration by eLife. Your article has been reviewed by 3 peer reviewers, and the evaluation has been overseen by a Reviewing Editor and a Senior Editor. The following individual involved in review of your submission has agreed to reveal their identity: Kei-ichi OKAZAKI (Reviewer #2).

The reviewers have discussed their reviews with one another, and the Reviewing Editor has drafted this to help you prepare a revised submission.

Essential revisions:

1) For data reproducibility, both pre-fusion and post-fusion structure models should be provided as coordinate files in the SI files. Based on limited available information, authors made complete structure models, which itself has considerable values for subsequent studies.

(1a) Related to this, it is not explicitly stated that the structure drawn in Figure 1C is the post-fusion model the authors used in this work. If not, the post-fusion structure must be provided for example alongside Figure S1.

(1b) In the structure in Figure 1C, 3 TMs look asymmetric. Since TMs are not included in the original PDB model, authors modeled it. How this particular asymmetric configuration appeared?

(2) In the simulation, TMs are restrained to the implicitly-assumed membrane plane, which is reasonable. However, in the snapshots in Figures 2 and 4, the tilt angle of TMs does not seem to be restrained. Is it possible that this slant TMs may affect the overall orientation of the spike proteins and thus could affect the dFP? Can author comment on it?

(3) In the analysis of Figure 5, once one FP captures the host membrane, this must anchor the FP subsequently. This anchoring must affect the dynamics of the rest of spike protein including the other FPs. Therefore, in reality, there must have higher probabilities of multiple FP captures (2FP and 3FP). Can authors conduct some simple simulations in which FP is anchored once it reaches the host cell membrane.

(4) In the description of all-atom structure-based model, The meanings of "1-3 dihedral potentials" and "6-12 interactions" are not clearly understood.

(5) It was difficult to understand the viral membrane potential described in Equation S1. What does "*" mean in the upper equation and does K take a positive or negative value? It might help to include a plot of the potential along z.

(6) It would be useful to have a detailed Discussion section on caveats of the model/potential alternate mechanisms as well as possible predictions which can be tested which include the following facts/questions. The alternate mechanisms need not be simulated. They can be speculations based on the present model. The indicated citations are just suggestions and may not represent the latest results. The authors should check for those.

(6a) The starting prefusion structure for the simulations is a hypothetical structure of what the spike would look like post cleavage. This should be emphasized. Does the frustration analysis indicate that some regions will already be unfolded pre-cleavage? For instance HR2 could already be unfolded before cleavage and then will it be able to partially cage some part of the spike? And what would this do to the mechanism?

(6b) What changes in mechanism could occur if this were a dual structure-based model instead of a single structure-based one? Do any of these seem physically reasonable?

(6c) Does the importance of glycosylation decrease if HR1 folds faster than the timescale of uncaging of HG without the glycosylation? What increase in contact (or dihedral) strengths or contact to dihedral ratios would be required for this to happen? Are these increases physically reasonable?

(6d) The TM helices are pinned into a trimeric conformation. What would happen if they are not and can move away from each other? See for instance: https://www.biorxiv.org/content/10.1101/2021.06.07.447334v1

(6e) Some percentage of spikes are naturally in the post-fusion conformation. See for instance: https://science.sciencemag.org/content/369/6511/1586 How does that fit into the simulation results?

(6f) Please look through current literature (since this is a fast moving field) for the role of spike sugars in fusion. See for instance https://elifesciences.org/articles/61552 Are there experimental results which support the present model or can the authors suggest experiments which can test the model specifically in the context of sugar placement?

(7) Throughout: Viral membrane envelopes are not called "capsids".

(8) Lines 52-57: These sentences imply an order to the cleavage/ACE2 binding events (ACE2 binding happens after cleavage). Has this been proven? If yes, please give a reference. If not, please reword.

(9) Figure 1: Please also give PDB IDs for the structures right here.

(10) Lines 153 onwards: Since at least some model justification depends on frustrated contacts, it would be useful to explain frustrated contacts in some detail and bring the supplementary figure into the main paper (if no page limit constraints exist).

(11) Figure 5B: Please state if this figure is for a glycosylated protein.

(12) Line 290: Please remove the word dramatic. It's not a quantifiable amount.

(13) Lines 301-305: I don't understand what happens when FPs transition to a post-fusion conformation without engaging the host membrane. What does this mean in the context of the present model? And what happens in hemagglutinin? It would be useful to have a clarification of these sentences and a detailed explanation.

Reviewer #1:

Authors performed all-atom structure-based molecular dynamics simulation of the conformational change process of SARS-CoV-2 spike protein from its pre-fusion form to the post-fusion form, with and without glycans bound to the spike protein. Authors found that the bound glycans provide considerable steric barrier in the transition, which prolongs the transition compared to the case without the bound glycan. Interestingly, this intermediate configuration, called caged state, has high probability to extend its fusion peptide towards the host cell membrane. Thus the bound glycan enhances the probability for the spike protein to capture the host cell membrane.

Using a simplified energy function, i.e. the structure-based model, authors could simulate extremely large-scale conformational change numerous times, which robustly shows the role of the bound glycans., which strengthens their finding.

On the other hand, any attraction interactions of glycans with protein amino acids are completely ignored in this simplified energy function, which is a clear limitation of the current study. These interaction, if included, could give much longer pause in the caged intermediate. Thus, real effect of the bound glycans may be even stronger.

Reviewer #2:

Dodero-Rojas et al. investigated a large-scale conformational change of the SARS-Cov-2 Spike protein involved in membrane fusion to its host cell. They used a structure-based all-atom model to simulate the large-scale conformational transition between prefusion and post-fusion conformations, which is impossible to simulate with conventional simulation methods. From extensive simulations, they identified the "caged" intermediate, which is realized through steric interactions of glycans. It was clearly shown from various control simulations that the caged intermediate has a much shorter lifetime without glycans, and glycans attached to the head group (HG) are mainly responsible for the stable intermediate. Furthermore, they showed that the caged intermediate facilitates capturing of the host membrane by extending the fusion peptides (FP) in the perpendicular direction to the membrane. The probability of capturing the host membrane by FP is higher with the stable caged intermediate in the presence of glycans for the virus-host inter-membrane distance of 30 nm, consistent with the cryotomography observations. Overall, the author's claims and conclusions are justified by their data. The strength of this work is that they simulated the unprecedently large conformational transition thousands of times. The weakness is that they used a simplified model that might miss physical interactions like electrostatic interactions. However, this work can establish a foundation for more detailed simulations with precise physicochemical interactions.

Reviewer #3:

The conformational transition of the SARS-CoV2 spike protein from its prefusion state to its post-fusion state helps the viral envelope fuse with the host membrane and eject the viral RNA into the host cell. This conformational transition is difficult to study in its entirety using either experimental techniques or standard simulation methods. Here, the authors use starting structures modelled on the prefusion structure and a potential energy function encoding the post-fusion structure to characterize the spike conformational transition using molecular dynamics (MD) simulations. Such structure-based models simplify the potential energy function and are able to simulate timescales many orders of magnitude longer than those seen in standard atomistic MD simulations. The spike simulations show that a helical region (HR2) C-terminal to the spike head (HG) which links HG to the transmembrane segment (TM; embedded in the viral membrane) unfolds and cages HG close to the viral membrane. Glycosylation increases the size and roughens the surface of HG allowing it to be caged for longer by HR2. This allows the helical region (HR1) N-terminal to HG to fold into a long helical trimer and potentially reach out to and latch onto the host membrane more effectively suggesting that glycosylation enables efficient fusion.

Structure-based models encode the protein structure in the potential energy function simplifying it. They also do not encode interactions which are not present in the structure. In the model used here, this implies that not all interactions present in the starting prefusion structure are stabilizing. The viral membrane is encoded implicitly while the host membrane is excluded and the sugars are included using connected beads which incorporate the sugar structure but ignore any further attractive interactions. However, such simplified models have a grounding in protein folding theory. Additionally, it is such simplifications which allow the model to simulate the spike conformational transition. Finally, similar models have previously been successfully used to understand the conformational transitions of large molecular machines such as the ribosome.

The authors have successfully simulated the spike conformational transition, shown a likely order of events that occur during this conformational transition and illustrated the potential importance of spike glycosylation to SARS-CoV-2 fusion.

eLife. 2021 Aug 31;10:e70362. doi: 10.7554/eLife.70362.sa2

Author response


Essential revisions:

1) For data reproducibility, both pre-fusion and post-fusion structure models should be provided as coordinate files in the SI files. Based on limited available information, authors made complete structure models, which itself has considerable values for subsequent studies.

We are pleased that our structural models can be of utility to other groups. Accordingly, as part of the SI, we now provide PDB files of the structural models used in the study.

(1a) Related to this, it is not explicitly stated that the structure drawn in Figure 1C is the post-fusion model the authors used in this work. If not, the post-fusion structure must be provided for example alongside Figure S1.

Thank you for noting this ambiguity in the text. To clarify this point, we have introduced two modifications.

– The last paragraph of the introduction now specifies that the post-fusion structure used to define the force field is shown in Figure 1C.

“Simulations were initiated with the Spike protein in the prefusion conformation, while the energetics were defined to favor the post-fusion conformation (shown in Figure 1C)”

– As recommended, we also now show this structure alongside the pre-fusion structure in Figure S1.

(1b) In the structure in Figure 1C, 3 TMs look asymmetric. Since TMs are not included in the original PDB model, authors modeled it. How this particular asymmetric configuration appeared?

The TM region for the post-fusion structure was modelled as alpha-helices without enforcing a symmetry constraint. However, since the energetics of the TM were defined entirely by the pre-fusion conformation, the post-fusion representation of the TM is only intended to depict the putative position of the TM after fusion occurs. To clarify this point, in the Methods and Materials section “Structural Modeling of the Spike Protein” we now explicitly state that, for the post-fusion model, the CR and FP regions were modelled with symmetry restraints imposed, while the TM was modelled as helices without symmetry restraints. Page 12 reads:

“For the post-fusion structure, unresolved residues in FP and CR were modeled as helical regions, using the automodel module of Modeller 9.24 with symmetry restraints imposed. […] As a note, the post-fusion configuration of the TM region was not used to define any aspect of the structure-based model.”

(2) In the simulation, TMs are restrained to the implicitly-assumed membrane plane, which is reasonable. However, in the snapshots in Figures 2 and 4, the tilt angle of TMs does not seem to be restrained. Is it possible that this slant TMs may affect the overall orientation of the spike proteins and thus could affect the dFP? Can author comment on it?

Thank you for raising this interesting point. As noted, the TM motif is restrained to a plane without an orientation bias. To determine whether tilting of the TM region has a significant impact on the reach of the FPs, we performed additional analysis of the simulations and found that there is no distinguishable difference in orientations within the two sets (glycans present, or absent) of simulations. This comparison is now presented in the SI Results section entitled “TM tilt angle distributions” (page 1), which includes a new SI Figure (Figure S2). As described in the SI, the glycosylated and degycosylated Spike proteins exhibit similar ranges of TM tilting, which is consistent with the reach of each FPs being equally affected by the TM tilt. In addition, the tilt angle formed with the plane normal rarely exceeds 45°. Accordingly, this indicates that the differential reach of the glycosylated and unglycosylated systems will be robust to possible tilting of the TM in the simulations.

To address this point, we have also added the following passage to the main text (page 8):

“To ask whether this differential extension of the FP is due to tilting of TM, we calculated the distribution of angles between the TM bundle and the viral membrane (Figure S2). This revealed that the TM tilting distributions were similar for the two systems, which indicates that the differential dynamics of the FPs can not be attributed to this aspect of the model..”

(3) In the analysis of Figure 5, once one FP captures the host membrane, this must anchor the FP subsequently. This anchoring must affect the dynamics of the rest of spike protein including the other FPs. Therefore, in reality, there must have higher probabilities of multiple FP captures (2FP and 3FP). Can authors conduct some simple simulations in which FP is anchored once it reaches the host cell membrane.

This is a very interesting point. It is indeed expected that the anchoring of one FP will affect the probability of the other FPs binding to the host membrane. In order to explore this relationship, we extended our model, such that each FP can irreversibly bind the host mem brane. This was enabled by introducing a second flat-bottom potential (Equation S1) to describe the host membrane region. We then used this model to perform 1000 simulations of the glycosylated Spike protein. As expected, the probability that at least one FP reaches the host membrane is not affected by the introduction of the host membrane potential (Figure S8). However, as suggested by the reviewers, there is a significant increase in the probability that all three FPs reach the host membrane region. These results are now described, in detail, in a new SI section entitled “Effect of a virtual host membrane on FP capture probabilities”. We have also added a new paragraph to the main text that describes these additional calculations (page 10).

“In terms of the mechanistic features of membrane fusion, one can expect that FP binding to a host membrane will impact the probability that subsequent FPs will also bind. […] Together, this analysis suggests that the primary mode for Spike protein mediated membrane fusion is through use of a “sequential” mechanism.”

(4) In the description of all-atom structure-based model, The meanings of "1-3 dihedral potentials" and "6-12 interactions" are not clearly understood.

Thank you for pointing this ambiguous notation. We expanded the description of the model, and included the full form of the structure-based model potential on page 2 of the SI.

(5) It was difficult to understand the viral membrane potential described in Equation S1. What does "*" mean in the upper equation and does K take a positive or negative value? It might help to include a plot of the potential along z.

Thank you for nothing this typesetting issue. We now include a corrected version of the membrane potential on page 2 of the SI. Also, following the reviewers’ suggestions we included a plot of the potential (Figure S9).

(6) It would be useful to have a detailed Discussion section on caveats of the model/potential alternate mechanisms as well as possible predictions which can be tested which include the following facts/questions. The alternate mechanisms need not be simulated. They can be speculations based on the present model. The indicated citations are just suggestions and may not represent the latest results. The authors should check for those.

Thank you for raising this suggestion. Accordingly, we now include an extended discussion of the caveats of the model (page 11). We also expanded the discussion by including experimental strategies that could be used to explore the effect of Spike glycosylation on the membrane fusion process (page 11), and thereby test the predictions of the current study.

“The current predictions suggest that the steric composition of the Spike protein and glycans can guide the global dynamics of host-membrane capture. […] If sterics dominate the dynamics, as predicted by the structure-based model, then mutations to HG glycan sites should significantly reduce the probability of membrane capture.”

(6a) The starting prefusion structure for the simulations is a hypothetical structure of what the spike would look like post cleavage. This should be emphasized.

We included in the last paragraph of the introduction (page 4) that the pre-fusion structure is a possible snapshot of the system after cleavage and S1 dissociation.

“It is important to emphasize that the prefusion model used as a starting point is intended to represent a state in which S2’ cleavage and S1 dissociation have occurred. While the precise timing of these steps is unknown, we assume they can occur prior to any significant conformational changes in the S2 subunit.”.

Does the frustration analysis indicate that some regions will already be unfolded pre-cleavage? For instance HR2 could already be unfolded before cleavage and then will it be able to partially cage some part of the spike? And what would this do to the mechanism?

The frustration analysis shows that the most frustrated region of HR2 corresponds to residues near the TM boundary. Since the frustration analysis is based on protein sequences in solution, one explanation for this localized frustration is that this region is only marginally stable in solution, and likely engages in stabilizing interactions with the viral membrane. In support of this interpretation, tomographic imaging of HR2 has revealed the presence of two distinct kinks, called the ”ankle” and ”knee” [1], while the remainder is described well by a coiled coil motif.

(6b) What changes in mechanism could occur if this were a dual structure-based model instead of a single structure-based one? Do any of these seem physically reasonable?

This is a very interesting question. If a multi-basin model were implemented, where the prefusion conformation were stabilized, we would expect several substeps of the rearrangement to slow down. As described in the earlier reply (Comment 6), we have added discussion on these possibilities in the main text.

(6c) Does the importance of glycosylation decrease if HR1 folds faster than the timescale of uncaging of HG without the glycosylation? What increase in contact (or dihedral) strengths or contact to dihedral ratios would be required for this to happen? Are these increases physically reasonable?

Thank you for raising this interesting point. In our “downhill” model, we observe that the HR1 folds rapidly, where there are no obvious large-scale barriers. In the absence of an apparent free-energy barrier for the formation of HR1 (inferred by probing the number of contacts formed), increasing the slope of the downhill potential will have minimal effects on the mean- first passage time. While one could artificially strengthen the HR1 post contacts, doing so would also lead to a hyperstable helical bundle, where folding temperatures would be non-physical (e.g. 600K). Accordingly, one could only argue that the HR1 contact strengths should be increased by 10-20% (at most), which would only affect the HR1 formation time by a few percent (assuming diffusive dynamics [2] on a downhill 1D free-energy surface). Since the current model may be described as a “fast-forming HR1” model, it is more likely that the predicted impact of glycosylation will be more pronounced in solution.

(6d) The TM helices are pinned into a trimeric conformation. What would happen if they are not and can move away from each other? See for instance: https://www.biorxiv.org/content/10.1101/2021.06.07.447334v1

To address this question, we generated another variant of our model, where the harmonic intra-TM interactions were replaced with marginally-stable 6-12 Lennard-Jones interactions. With this model, we simulated 1000 transitions for the glycan-present and glycan-absent systems (2000 events, in total). We find that, even if the TM strands dissociate from one another, the FPs reach further in the glycosylated system. These results are shown in a new SI figure (Figure S7). We also added the following text to the main manuscript (page 8).

“To assess whether the influence of glycans on FP dynamics is robust, we considered a variant of our model in which the TM helices may dissociate from each other. […] These simulations show that the differential dynamics of the FPs is robust to the precise description of the TM bundle.”

(6e) Some percentage of spikes are naturally in the post-fusion conformation. See for instance: https://science.sciencemag.org/content/369/6511/1586 How does that fit into the simulation results?

In Cai et al. 2020, the post-fusion structures appear in virons under mild detergent conditions, where spontaneous activation can occur without host-virus interactions. In our study, we consider activation upon host-virus interaction. Accordingly, the Spike proteins that fail to reach the host boundary in our simulations describe a subpopulation of Spike protein that do not spontaneously adopt a post-fusion state in the absence of a host. Page 8 now reads:

“Finally, while the Spike Protein has been experimentally observed to spontaneously transition from the prefusion to post-fusion configuration (Cai et al. 2020), the probabilities reported above describe events that occur when the Spike Protein is activated through host-virus interactions.”

(6f) Please look through current literature (since this is a fast moving field) for the role of spike sugars in fusion.

As recommended by the reviewers, we went through the newest literature. Some of these new papers served as foundation to address multiple revisions. As an example, the interaction of the fusion peptides with the host membrane has been studied in [3], which allowed us to justify our host membrane potential used to answer previous points.

See for instance https://elifesciences.org/articles/61552.

We are glad that the reviewers have brought this paper to our attention. Its results fit well with our findings, as the inhibition of glycan elaboration reduced the viral entry drastically. We expanded the Results section to include this supportive experimental finding. Page 8 now reads:

“Therefore, these simulations suggest the probability of infection would drop substantially if the Spike was not glycosylated. Consistent with this, experimental measurements have found that inhibiting the production of glycans decreases the efficiency of host cell entry (Yang et al., 2020).”

Are there experimental results which support the present model or can the authors suggest experiments which can test the model specifically in the context of sugar placement?

Thank you for this suggestion. As described for Comment 6 (above), we included a section with possible experimental tests that could further quantify the influence of glycosylation on the Spike protein rearrangement.

(7) Throughout: Viral membrane envelopes are not called "capsids".

Thank you for raising the distinction, we corrected the wording issue.

(8) Lines 52-57: These sentences imply an order to the cleavage/ACE2 binding events (ACE2 binding happens after cleavage). Has this been proven? If yes, please give a reference. If not, please reword.

To the best of our knowledge, the order of events has not been unambiguously determined experimentally. For our purposes, we simply assume they both complete prior to any large-scale deformations of S2. To clarify this point, we have rephrased to avoid confusion. Page 2 now reads:

“While the order the S2’ cleavage and S1/S2 dissociation is not known, it is generally thought that both processes occur prior to any large-scale rearrangements of S2.”

(9) Figure 1: Please also give PDB IDs for the structures right here.

The PDB IDs are now included to the caption of Figure 1.

(10) Lines 153 onwards: Since at least some model justification depends on frustrated contacts, it would be useful to explain frustrated contacts in some detail and bring the supplementary figure into the main paper (if no page limit constraints exist).

Thank you for raising this point. While the results of frustration analysis support the use of a structure-based model, the original text gave the impression that the model was defined based on frustration analysis. To make it clear that we did not introduce frustrated contacts, we have significantly revised the passage in question (pages 4-6).

“Before describing the simulated events, it is valuable to discuss the analysis of energetic frustration(Ferreiro et al., 2007;Parra et al., 2016) in the Spike protein, which supports the application of an unfrustrated model. […] Accordingly, any frustration in these regions that was not included in the model is not likely to influence the primary finding of the current study.”

(11) Figure 5B: Please state if this figure is for a glycosylated protein.

Thank you for pointing this out. It is a glycosylated protein. We have updated the caption to reflect this point.

(12) Line 290: Please remove the word dramatic. It's not a quantifiable amount.

We removed the word ”dramatic”.

(13) Lines 301-305: I don't understand what happens when FPs transition to a post-fusion conformation without engaging the host membrane. What does this mean in the context of the present model? And what happens in hemagglutinin? It would be useful to have a clarification of these sentences and a detailed explanation.

We thank you for pointing this confusing phrasing. This part of the main text is now rephrased to clarify the two possible modes of capture found in fusion protein class I, such as hemagglutinin. The first mode is ”cooperative” where less than 3 FPs are attached to the host membrane. The ”sequential” mode refers to the case where all 3 FPs reach and capture the host membrane. To clarify this, we have replaced the text on page 9 with:

“When glycans are absent, there is a marginal probability that only one or two FPs will reach the membrane (Figure 5D-E), where the other FPs would likely transition directly to their post-fusion orientations without engaging the host. […] When all 3 FPs attach to the host membrane, the dynamics may be described in terms of the so-called “sequential” mechanism of fusion (Lin et al., 2014).”

References:

1. B. Turonova, M. Sikora, C. Schurmann, W. J. Hagen, S. Welsch, F. E. Blanc, S. von Bulow, M. Gecht, K. Bagola, C. Horner, et al., Science 370, 203 (2020).

2. J. D. Bryngelson and P. G. Wolynes, The Journal of Physical Chemistry 93, 6902 (1989).

3. D. Gorgun, M. Lihan, K. Kapoor, and E. Tajkhorshid, Biophysical Journal (2021).

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Supplementary file 1. Structural Model of Prefusion Structure.
    elife-70362-supp1.pdb (1.1MB, pdb)
    Supplementary file 2. Structural Model of Postfusion Structure.
    elife-70362-supp2.pdb (1.1MB, pdb)
    Transparent reporting form

    Data Availability Statement

    The current manuscript is a computational study. Simulations were prepared with SMOG 2 (free, open-source), which is available at smog-server.org. Force field templates for SMOG 2 are available for download through the SMOG 2 Force Field Repository on the smog-server page. Simulations were performed using Gromacs (free, open-source).


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES