Skip to main content
ACS AuthorChoice logoLink to ACS AuthorChoice
. 2025 Jul 17;65(14):7712–7733. doi: 10.1021/acs.jcim.5c00990

Exploring the Intrinsic Structural Plasticity and Conformational Dynamics of Human Beta Coronavirus Spike Glycoproteins

Yago Ferreira e Silva †,, Harold Hilarion Fokoue , Paulo Ricardo Batista †,‡,*
PMCID: PMC12308813  PMID: 40673918

Abstract

The spike (S) glycoprotein of human beta coronaviruses (HCoVs) is central to viral entry, receptor engagement, and immune evasion. Here, we present an in-depth computational analysis of spike conformational dynamics across HCoVs, with a focus on SARS-CoV-2 and its variants. Leveraging a large cryo-EM structural ensemble and integrative modeling approaches, we dissect the intrinsic plasticity and variant-specific motions of the spike protein. Our results show that, despite substantial sequence divergence, HCoV spikes retain the ability to sample open and closed receptor-binding domain (RBD) states. For SARS-CoV-2, a hinge-like RBD opening motion dominates the conformational landscape, modulating ACE2 accessibility. Ensemble and single-structure normal modes revealed conserved dynamic domains and hinge regions and showed strong agreement with experimental structural transitions. Ligand binding rather than the D614G mutation was the principal driver of RBD opening, with multiple open RBDs observed predominantly in ligand-bound states. Notably, Omicron spike structures favored closed RBDs in the apo form but remained capable of ligand-induced opening. Dynamical network analysis identified an Omicron-specific remodeling of interdomain communication, altering the mechanical connectivity between RBD, NTD, and S2 subunits. Analysis of single-experiment multimodel cryo-EM data from the Beta variant captured temperature-dependent metastable states, validating ensemble-based modeling. Finally, hybrid molecular dynamics simulations successfully reproduced the spike experimentally observed in conformational space, unlike standard MD. These findings offer mechanistic insight into spike conformational dynamics, supporting the design of variant-adapted therapeutics and vaccines.


graphic file with name ci5c00990_0011.jpg


graphic file with name ci5c00990_0009.jpg

1. Introduction

The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), first identified in late 2019, is the causative agent of the ongoing COVID-19 pandemic. As a member of the family, SARS-CoV-2 shares similarities with other human beta coronaviruses (HCoVs) responsible for significant diseases, such as SARS-CoV and MERS-CoV. SARSCoV-2 has a positive-sense single-stranded RNA genome that encodes four structural proteins: nucleocapsid, envelope, membrane, and spike, along with 16 nonstructural proteins and nine accessory proteins. These proteins contribute to the virus’s ability to replicate, assemble, and evade the host’s immune system.

The spike (S) protein is crucial for the virus’s entry into host cells. It protrudes from the viral surface in its prefusion state and binds to the angiotensin-converting enzyme 2 (ACE2) receptor, which is expressed on cells of the respiratory tract. Upon receptor binding, the spike undergoes conformational changes that trigger the fusion of the viral and host cell membranes, allowing the viral RNA to enter the host cell. The S protein is also a major target of the immune response, making it the focal point for the development of vaccines and therapeutics. Structurally, the S protein is a homotrimer, with each protomer consisting of two subunits: S1, responsible for receptor binding, and S2, which mediates membrane fusion. The S1 subunit contains the N-terminal domain (NTD), the receptor-binding domain (RBD), and two subdomains (C1 and C2) (Figure S1). The RBD is critical because it directly interacts with ACE2 through a specific region called the receptor-binding motif (RBM). Understanding the dynamics of this interaction has been the subject of intense study, given its importance for viral entry. The S protein’s RBD fluctuates between closed (down) and open (up) states. In the open conformation, the RBD is exposed and ready to engage ACE2, while the closed state conceals the RBD, preventing receptor interaction and minimizing immune detection. Cryo-electron microscopy (cryo-EM) studies have shown that the spike trimer can bind up to three ACE2 receptors, with each protomer capable of binding one ACE2 molecule when the RBD is in its open state.

Early in the pandemic, the S protein of the Wuhan-1 reference strain (hereafter referred to as “original”) acquired the D614G mutation, which shifted the equilibrium toward a more open RBD conformation, increasing the virus’s transmissibility. , This mutation allowed greater structural stability in a region known as the 630 loop (residues 620–640), enhancing viral fitness and enabling the rapid acquisition of additional mutations. , As a result, the D614G mutation has become a hallmark of many subsequent SARS-CoV-2 variants. Several notable previously circulating variants of concern have emerged, each distinguished by key mutations in the S protein. These include: Alpha (B.1.1.7), first detected in the UK, which increased transmissibility and mortality; Beta (B.1.351), first identified in South Africa, featuring enhanced immune evasion due to shared mutations with Alpha; Gamma (P.1), first detected in Brazil, which introduced 17 unique mutations in the S protein and shared mutations with Alpha and Beta; Delta (B.1.617.2), first identified in India, which rapidly became the dominant variant globally due to its high transmission rate; and Omicron (B.1.1.529), identified in November 2021 in South Africa, which rapidly replaced Delta due to its high number of mutations, particularly in the S protein, including more than 30 mutations, as well as three deletions and one insertion. Since February 2022, the SARS-CoV-2 Omicron variant has accounted for over 98% of the publicly available sequences. Currently, these earlier variants (Alpha, Beta, Gamma, Delta) are no longer classified as variants of interest or variants under monitoring, as the focus has shifted to newer Omicron sublineages.

The Omicron variant mutations, particularly N501Y, T478 K, and K417N, have significantly increased its ability to evade the immune response, enhancing both transmissibility and immune escape. The variant includes several sublineages, such as BA.1, BA.2, BA.4, BA.5, and the recombinant strain XBB, each of which carries distinct mutations that affect viral fitness, binding affinity to ACE2, and immune evasion. For instance, BA.1 exhibited 39 mutations (15 in the RBD), while BA.2 and BA.5 showed enhanced transmissibility despite having fewer mutations in the spike protein. XBB and its subvariants, such as XBB.1 and XBB.1.5, exhibit superior immune evasion compared to earlier Omicron sublineages like BA.5, making them particularly concerning in terms of their potential to cause breakthrough infections. ,

Despite significant progress in solving structures of spike variants, structural data alone cannot fully explain the dynamics of S protein conformational changes, particularly the transition between closed and open RBD states. Given that S protein dynamics is crucial to its function, molecular dynamics (MD) simulations offer valuable insights into its flexibility, residue interactions, and how mutations alter its behavior. Recent MD simulations have provided insights into the structural flexibility of the S protein, revealing allosteric sites and epitopes that may serve as druggable targets. These studies have been pivotal in elucidating how mutations like D614G and those found in the Omicron variant affect the S protein’s ability to mediate viral entry. Single-molecule experiments showed that Omicron RBD frequently adopts a closed conformation, reducing exposure to neutralizing antibodies, and compensates for fewer ACE2 binding interactions by maintaining longer attachment times, which enhances immune evasion and viral attachment. However, understanding the S protein’s conformational dynamics remains a challenge.

A key question in SARS-CoV-2 biology is how the spike protein’s conformational dynamics enable adaptability across variants despite extensive sequence variation. This study addresses that by integrating large-scale structural data with advanced conformational analysis to examine how plasticity is influenced by mutations, ligand binding, and temperature. We explore whether distinct dynamic profiles, such as those in Omicron and beta, represent adaptive mechanisms balancing immune evasion and receptor engagement. Focusing on HCoVs, we assess spike flexibility, its impact on ligand affinity, and the conformational landscape of the SARS-CoV-2 variants. Using single-structure and ensemble normal-mode analysis, along with hybrid MD simulations, we characterize the dynamic behavior of key variants. Our findings highlight the unique conformational features of the Omicron and offer new insights into the structure–function relationship of spike dynamics, with implications for viral transmission and immune escape.

2. Materials and Methods

2.1. Experimental Structural Ensemble of the Spike Proteins

To study the plasticity of the human beta coronaviruses, the spike experimental structures were retrieved, classified, and analyzed as follows (see the experimental workflow in Figure S2). The scripts are available as Supporting Information (https://github.com/yago52/tutorial_spike).

2.1.1. Ensemble Building

The ensemble of protomeric structures for the full SARS-CoV-2 spike protein (head fragment) was generated using the buildPDBEnsemble function from ProDy (version 2.3). Each chain was processed individually, with the 6VXX structure (chain A) serving as a reference. A sequence identity cutoff of 90% and a minimum sequence overlap of 50% were applied, followed by gap-based filtering to retain structures with at least 90% sequence coverage. The individual chain ensembles were subsequently merged into a final ensemble, comprising a total of 2.872 structures. For other human beta coronaviruses, the same script and reference structure were used, but a sequence identity cutoff of 20% was applied instead.

We also created a new ensemble comprising single-experiment multimodel cryo-EM structures corresponding to the Beta variant (B.1.351). All 20 models from each PDB entry were included, resulting in a total of 120 conformations (considering all protomers), derived from samples equilibrated during vitrification at two distinct temperatures: 4 (9GDX) and 37 °C (9GDY).

2.1.2. Classification of Ensemble Structures

To analyze the structural diversity of the spike protein, two ensembles of experimental structures were created: one comprising non-SARS-CoV-2 human beta coronaviruses and another containing only SARS-CoV-2 structures. A list of PDB-IDs for coronavirus spike proteins from SARS-CoV, SARS-CoV-2, and MERS-CoV was obtained from Cov3d, a specialized database that is updated weekly. Alternatively, the PDB-IDs for OC43 and HKU1 were directly obtained from the Protein data bank (PDB) and UniProt databases.

Cov3d provides a series of tables with filtering options, including (i) whether the spike protein is complexed with a ligand (specifying ligand name, type, and associated chain(s)); (ii) whether the structure represents a full-length protein or a fragment; and (iii) SARS-CoV-2 classification by variant. The tables were processed with the Pandas package (version 2.1.1) in Python (version 3.9).

2.1.2.1. Apo and Ligand-Bound Structures

The ensemble structures were classified based on the ligand using the data from the Cov3d. Such information was related to each PDB-ID, but within the trimeric bound structures, we evaluate whether each protomer has interactions with the ligands by calculating the number of contacts; if the protomer has no contact with the ligand, it is classified as an apo protomer. The antibodies were classified in classes according to domain targeting.

2.1.2.2. Mutation on 614th Residue

The residue at position 614 was identified for all of the complete structures in the ensemble using ProDy. The data set was subdivided into three primary groups: 614D, 614G, and 614­(C, N, T, Y, A, L, P, E, and I).

2.1.2.3. SARS-CoV-2 Variant Classification

The initial variant classification for the SARS-CoV-2 structures was obtained from the Cov3d database. However, approximately 45% of the PDB entries in the ensemble were not classified by Cov3d. To ensure comprehensive classification, we performed a sequence-based analysis of the entire ensemble. Complete .fasta sequences of all spike protein structures were retrieved from the PDB server, and a multiple sequence alignment (MSA) was performed using MUSCLE with a Biopython script. Short segments at the N- and C-terminal ends were truncated (residues 1–13 and >1142, UniProt code P0DTC2). Based on the MSA, a sequence distance matrix was constructed using the buildSeqidMatrix function from ProDy, and a phylogenetic tree was generated with the calcTree function, applying the weighted algorithm. Clustering was performed using the scipy.cluster.hierarchy.dendrogram function from the SciPy package. Clusters consistently contained either a single variant type previously classified by Cov3d or unclassified structures. For clusters with single classifications, all members were labeled with the same variant as the preclassified entries. In clusters where no Cov3d-classified entries were present, variant information was manually verified through PDB annotations and the corresponding literature. This comprehensive approach enabled consistent classification across the entire ensemble.

2.2. Prediction of Intrinsic Motions of SARS-CoV-2 Spike

2.2.1. Single Structure Normal-Mode Analysis

NMA was used to probe the intrinsic flexibility of the representative spike conformers. By computing vibrational modes from the elastic network of individual structures, it reveals the built-in potential for large-scale motion encoded by the spike topology and contacts. The 20 slowest nontrivial normal modes were calculated with the ProDy package using the anisotropic network model (ANM) for the ensemble reference structure (6VXX_A). First, the Hessian matrix was built (buildHessian method) based on the Cα Cartesian atomic coordinates, using a 15 Å pairwise-interactions cutoff and a gamma spring constant of 1 N/m; and the modes were calculated by diagonalizing this matrix (calcModes method).

2.2.2. Ensemble Normal Modes

The ensemble normal modes integrate structural diversity and highlight conserved, statistically dominant dynamic features across different conformational states and variants. They were calculated using the calcEnsembleENMs function in the ProDy package using the Gaussian network model (GNM) method. The Kirchhoff matrix was constructed with a 10 Å cutoff for the pairwise atomic interactions and a spring constant of 1 N/m. For each ensemble member, the 20 slowest nontrivial normal modes were computed. The modes calculated for each structure (referred to as modesets) were automatically matched to a modeset and ordered accordingly to the reference structure modes.

2.3. Ensemble Analysis

2.3.1. Principal Component Analysis (PCA)

PCA was applied to the SARS-CoV-2 experimental cryo-EM structural ensemble to extract the dominant motions sampled across spike variants. This data-driven method captures the observed conformational diversity by reducing the dimensionality of the structural data set without relying on predefined force fields or energy models. PCA was performed over the Cα-atom Cartesian atomic coordinates using the ProDy package. The covariance matrix was built (buildCovariance method), and the PCs were calculated by diagonalizing this matrix (calcModes method).

2.3.2. RBD Conformation Classification

PC1 projection was used to classify RBD conformations: protomers with values >5 as open; and ≤5 as closed states. This threshold was chosen based on the natural inflection point between these two populations, supported by the RMSD and RBD angle correlations.

2.3.3. Global Structural Measurements and Collective Variables (CVs) Used To Describe Spike Flexibility

2.3.3.1. Root Mean Square Deviation (RMSD)

The spike alpha carbon RMSD was calculated based on the reference structure using the ProDy function calcRMSD.

2.3.3.2. Radius of Gyration Calculation (Rg)

The Rg of each structure was calculated by using the ProDy calcGyradius function.

2.3.3.3. NTD/RBD Interdomain Distance

The Euclidean interdomain distance between the NTD (residues 27–303) and the RBD (residues 319–541) centers of mass (CoMs) was computed using a custom Python script based on the ProDy toolkit.

2.3.3.4. Distances between the RBDs or NTDs within the Trimmer

As described in ref , six collective variables (CVs) were defined to represent distances between the CoMs of the RBDs and NTDs within the spike trimer. CV1–CV3 correspond to RBD–RBD distances between chains 1–2, 2–3, and 3–1, respectively; CV4–CV6 analogously represent NTD–NTD distances between the same chain pairs (Figure S3A).

2.3.3.5. Area of the Triangle Formed by RBDs or NTDs within the Spike Trimmers

To quantify the spatial arrangement of RBDs or NTDs within each spike trimer, we calculated the area of the triangle defined by the CoM of each RBD or NTD within the spike trimmer for each structure in the ensemble using Heron’s formula, as follows: SRBD=s(sCV1)(sCV2)(sCV3) , where s is the triangle semiperimeter: s = (CV1 + CV2 + CV3)/2. Similarly, for the area of the triangle formed by each NTD (S NTD), the CVs 4, 5, and 6 were used instead of CVs 1, 2, and 3.

2.3.3.6. RBD Angle (Residues 405, 620, and 991)

This angle has been shown to reflect ACE2 accessibility and track RBD conformational transitions. Due to the absence of residue 622 in our structural ensemble, we used the angle formed by residues 405, 620, and 991 as a representative metric (Figure S3B).

2.3.3.7. Vector-Based Analysis

A robust CV proposed by ref was shown to efficiently capture spike conformational changes. This CV is built using the CoMs from 9 structurally relevant regions of the spike protein: NTD (27:43 and 54:271), NTD-β (116:129 and 169:172), NTD′ (44:53 and 272:293), RBD (330:443 and 503:528), RBD-α (403:410), SD1 (323:329 and 529:590), SD2 (294:322 and 591:696), S2-β (717:727 and 1047:1071), and CD (711:716 and 1072:1122), according to Figure S3C.

2.3.4. Number of Contacts between Protomers and Ligands

Using ligand information from Cov3D, a Python script employing the ProDy and Pandas libraries was developed to analyze the structural ensemble. For each structure, when a ligand was detected, the corresponding chain was identified, and the number of heavy atoms from each spike protein residue located within 4 Å of any ligand heavy atom was computed.

2.3.5. Dynamical Domain Analysis and Hinge Point Detection

The dynamical domains were classified employing the DynDom program, which analyzes structural transitions between two conformers. The normal-mode-displaced structures and the two experimental structures were uploaded and analyzed in the server. The hinge points associated with the conformational changes were also predicted using the DynDom.

2.3.6. Clustering Analyses

The SARS-CoV-2 spike ensemble structures were clustered using the g_cluster module of the GROMACS package, with the gromos algorithm and a 1.5 Å RMSD cutoff.

2.3.7. Overlap and Subspace Overlap between Modes and PCs

The ProDy package was used to calculate the overlap (CalcOverlap function) between the two sets of modes. The first set corresponds to the first 5 PCs from the SARS-CoV-2 experimental ensemble, while the second corresponds to the 5 slowest nontrivial modes of the reference structure. The modes were normalized prior to overlap calculation. A graphical table was done directly using the function showOverlapTable. The subspace overlap was calculated (calcSubspaceOverlap function) between each PC and the subspace spanned by the first five normal modes.

2.3.8. Ensemble Spectral Overlap

The spectral overlap was calculated to compare the dynamics of the individual ensemble members using the function calcEnsembleSpectralOverlaps, according to the formula: s(A,B)=1d(A,B)trA+trB , where s is the spectral overlap between NM (or a set of modes) of conformation A and B, d is the difference in the covariance of A and B matrices, and tr denotes the trace of the corresponding matrix. The arccosine of this value provides a distance metric, allowing for the construction of a spectral distance matrix by computing the overlap for all pairs in a mode ensemble. This matrix can then be used to generate a dynamics-based “phylogenetic” tree.

2.3.9. Signature Dynamics Analysis

2.3.9.1. Signature Profiles

The dynamic signatures of the first three slowest nontrivial ensemble normal modes were calculated individually and collectively using the showSignatureMode and their corresponding cross-correlation matrix with showSignatureCrossCorr ProDy functions.

2.3.9.2. Comparing Ensemble Sequence, Dynamics, and Structure

Three matrices were calculated: i. pairwise RMSD, using ProDy ensemble’s method getRMSDs (pairwise = True); ii. spectral overlap distance (20 nontrivial slowest mods) calcEnsembleSpectralOverlaps (distance = True); iii. sequence distance (seqdist_matrix), generated from the sequence identity matrix (seqdist_matrix = 1 – seqid_matrix), which was constructed from the ensemble MSA.

2.3.10. Dynamical Network Analysis

2.3.10.1. Network Generation from Ensemble Normal-Mode Analysis Data

A consistent structural ensemble was generated from the ProDy-derived SARS-CoV-2 models (Section ) using the pdbaln function from the Bio3D package, which was used for all analysis of this section. The 20 slowest nontrivial normal modes for each structure were calculated using the nma function with the HCA model. These modes were used to calculate the dynamic cross-correlation matrix (DCCM) via the dccm function, where each element (C ij ) represents the degree of dynamic coupling between residues i and j. A value of C ij = 1 indicates that the fluctuations of residues i and j are completely correlated; C ij = −1, is completely anticorrelated; and C ij = 0 is not correlated. Filters on the DCCM (filter.dccm function) were applied based on groups: RBD conformation (closed and open); ligands (apo, antibodies, receptors); 614th residue (614D, 614G, others); and SARS-CoV-2 variant (Omicron and non-Omicron). The correlation threshold was set to 0.3 (cutoff.cij = 0.3), and all residue pairs, including neighboring residues (scut = 0), were retained.

2.3.10.2. Protein Dynamic Correlation Network Construction and Community Analysis

Network construction. Correlation network analysis (CNA) was performed using the Bio3D cna function using the random walk clustering method. The filtered DCCMs were used as input for a residue-based network community organization detection.

Network refinement. To identify equivalent communities among the groups, networks were refined using the remodel.cna function. Residue pairs involving four sequential neighbors were excluded (scut = 4). Nodes were defined by spike protein domains and interdomain regions via the member parameter. The input correlation matrix was defined as the sum of all intercommunity numeric square matrices (C ij Sum), containing absolute values from the atomic correlation matrix for each community (cij.community), using method = “sum”. Edge colors reflected the normalized dynamic correlation between groups (col.edge = “feature”), with a correlation threshold of 0.25.

PCA of an array of DCCMs from the ensemble normal modes. PCA was performed using the pca.array function on a series of DCCMs, each derived from normal modes calculated for all ensemble structures. The function returns M eigenvalues and eigenvectors, where M is the number of input matrices, and each eigenvector has a dimension of N(N – 1)/2, with N being the number of matrix rows/columns.

2.4. Standard MD Simulation Trajectories

Two 10 μs MD simulation trajectories of the trimeric structure of the SARS-CoV-2 S protein were carried out by David Shaw’s group and deposited under the codes DESRES-ANTON-11021566 and DESRES-ANTON-11021571 and were obtained from the site (https://covid.molssi.org/org-contributions/). These simulations were performed with the AMBER program using Amber ff99SB-ILDN and Glycam force fields. TIP3P water models were used in all of the simulations. The first simulation was started with the S protein in the closed state (PDB 6VXX); and the second, from a partially open state (PDB 6VYB). The C and N termini were completed with amide and acetyl groups, respectively. Counterions were inserted to neutralize the system, reaching a 0.15 M NaCl concentration. The simulations were maintained at 310 K using the NPT ensemble.

2.5. MDeNM Simulations and Conformational Free Energy Estimation

2.5.1. System Preparation, Minimization, and Equilibration

The initial structures of the fully glycosylated spike protein head-only models (residue 1–1146) for the 6VSB structure were taken from the COVID-19 Proteins Library of the CHARMM-GUI Archive (https://www.charmm-gui.org/?doc=archive&lib=covid19). Preparation steps were conducted with the CHARMM-GUI Web server. Each system was placed in a cubic box with a 14 Å layer of TIP3P water molecules. Counterions were inserted to neutralize the system, reaching a 0.15 M NaCl concentration. Then, an energy-minimization protocol was performed, starting with the conjugate-gradient algorithm, keeping protein heavy atoms harmonically restrained with a force constant of 50 kcal mol–1 Å–2 to avoid structural distortions. The following steps using the same algorithm were carried out using decreasing force constants (up to 2.5 kcal mol–1 Å–2). Then, the atomic velocities were assigned accordingly to a Maxwell–Boltzmann distribution corresponding to 50 K and then slowly increased to 300 K during a 1 ns heating MD, using a 1 fs integration time. In the equilibration step, the positional restraints were gradually decreased to zero during the first half of a 3 ns constant temperature MD, while in the remaining part all restraints were removed.

2.5.2. MDeNM Simulations

The MD with excited normal modes (MDeNM) simulations were performed using the MDexciteR tool (https://github.com/mcosta27/MDexciteR). This tool is based on the combination of R internal libraries and functions from the Bio3D package, allowing users to generate and process files to run MDeNM with an external MD software, the NAMD v.2.3. MDeNM simulations were performed in the NPT in explicit solvent using periodic boundary conditions. van der Waals interactions were calculated up to 10 Å, being approximated until 12 Å by using a switching function. Electrostatic interactions were treated with the PME algorithm using a 10 Å cutoff. The SETTLE and SHAKE algorithms were used during MD simulations to fix bonds involving hydrogen atoms in water molecules and protein. Pressure was kept constant at 1 atm during equilibration and production using the Langevin piston method. In these steps, the temperature was also kept constant at 300 K using the Langevin thermostat with a damping coefficient of 1 ps–1. Each of 400 replicates corresponds to 20 cycles of excitations (linear combinations of the first 3 nontrivial normal modes) for a time of 2 ps (totaling 40 ps per replicate), with an excitation temperature of 2 K. Additionally, the structures, configuration files, and input scripts required to perform the MDeNM simulations are provided and can be accessed directly at: https://github.com/yago52/tutorial_spike/blob/main/mdenm_spike.tar.gz.

2.5.3. Free Energy Landscape Analysis

The free energy landscape (FEL) calculation protocol involved clustering the concatenated structures generated by MDexciteR, followed by relaxation MD simulations initiated from each cluster centroid. The GROMOS clustering algorithm was used with an RMSD cutoff of 1 Å to identify representative conformations. Each of the 137 centroid structures was subjected to a 1 ns unrestrained production MD simulation, with trajectory frames recorded at 2 ps intervals. These trajectories were concatenated into a single file for subsequent analysis.

FELs were computed by using a custom R script. The free energy difference (ΔGα) of a given state α relative to the most populated reference state was calculated accordingly

Gα=kBTln[P(qα)Pmax(q)] 1

where k B is the Boltzmann constant, T is the temperature of the simulations (300 K), and P(q α) is an estimate of the probability density function obtained from bidimensional kernel density estimates of projections onto NM vectors. P max(q) corresponds to the probability of the most frequently visited state.

2.6. Figures and Movies

All figures and movies were generated using VMD (version 1.9.4a51), the open-source version of PyMol; and the python plots (matplotlib, , seaborn and ProDy tools); and R (4.4.1) plots using the base graphics functions and Bio3D tools.

3. Results and Discussion

3.1. Human Beta Coronaviruses Share Similar RBD Plasticity

HCoVs, including SARS-CoV, SARS-CoV-2, MERS-CoV, OC43, and HKU1, show significant variations in their spike protein sequences, primarily within the S1 subunit. Notably, the overall spike sequences of SARS-CoV and SARS-CoV-2 share approximately 76% identity, while their identity with MERS-CoV and other endemic HCoVs (OC43 and HKU1) is lower, around 30–40%. , To investigate how sequence variations influence the structural plasticity of the spike protein across HCoVs, we constructed an ensemble of all of the experimentally available 3D structures. This ensemble comprises 2872 SARS-CoV-2 conformations, 114 of SARS-CoV, 78 of MERS-CoV, 45 of HKU1, and 39 of OC43. A detailed list of these PDB-IDs, along with relevant annotations, is provided in Table S1.

PCA of Cα-atom Cartesian coordinates showed that all HCoV spike proteins, except OC43, sampled both closed and open RBD conformations, highlighting their shared functional flexibility (Figure A). Although OC43 structures were only experimentally observed in the closed state, the limited availability of full-length OC43 spike structures may hinder its experimental observation.

1.

1

RBD plasticity of human beta coronaviruses. (A) PC projections of spike protomeric structures from HCoVs onto PC1 and PC2. (top) SARS-CoV (yellow), MERS-CoV (blue), OC43 (purple), and HKU1 (red); (bottom) SARS-CoV-2. Light gray contour lines represent the SARS-CoV-2 density distribution. A dashed line marks the boundary between closed (green) and open (red) states. (B) Distribution of PC1 projections (top); RMSD distribution showing two distinct conformational populations (bottom); the dashed lines separate open and closed RBD conformations.

Moreover, recent evidence suggests OC43 can transition to an open state, exposing the S1B domain (RBD equivalent), as indicated by neutralizing antibodies targeting cryptic S1B epitopes. Unlike SARS-CoV-2, both OC43 and HKU1 belong to the subgenus. For HKU1, spike opening is not spontaneous but is instead triggered by sialoglycan binding, raising the possibility of a similar ligand-dependent regulatory mechanism in OC43.

3.2. NTD/RBD Motions Govern SARS-CoV-2 Experimental Dynamics

The SARS-CoV-2 structural ensemble consists of 2872 conformations derived from 968 unique PDB entries. The maximum structural variation within the ensemble reaches an RMSD of 17.5 Å, compared to the reference structure (Figure S4A). Sorting the ensemble by RMSD reveals two distinct populations (Figure S4B and Movie S1). RMSD density plots confirm this separation: a major population (57.9%, 1663 structures) with the RBD in the closed conformation and a second population with the RBD in the open state (Figure B).

PCA performed over the SARS-CoV-2 ensemble Cartesian coordinates identified the primary motion (PC1), which resembles a hinge or pivoting movement where the RBD rotates outward to expose the ACE2-binding surface (Figure and Movie S2). This transition occurs around specific hinge regions near the central β-sheet core of the RBD and its connection to subdomain C1 of the S1 subunit. PC1 accounts for more than 84% of the total variance (Figure S5B) and effectively distinguishes between the open and closed RBD states (Figure ). An open RBD conformation is defined as a PC1 projection greater than 5 Å (Figure ).

2.

2

SARS-CoV-2 spike intrinsic motions: close correspondence between experiments and normal modes. (A) Directions of the first two normal modes (bottom) and experimental PCs (top), represented by red arrows (scaled to 5 Å RMSD) on the spike structure shown as a green tube. (B) RMSF profiles for PCs (left) and normal modes (right), with colored bars indicating spike structural domains: NTD (red), RBD (royal blue), RBM (cyan), and S2 (green). (C) Overlap between the first five PCs and normal modes, shown as a blue-shaded matrix; a zoomed-in version with numeric values is displayed on the right.

The similarity between the RMSD and PC1 density plots (Figure B) indicates that the primary PC effectively captures the dominant global conformational variations reflected by the RMSD. Further analysis revealed strong linear correlations between PC1 and several structural descriptors: RMSD (R = 0.99), spike radius of gyration (R = 0.90), and the interdomain distance between the NTD and RBD (R = 0.86) (Figure S6). These findings underscore the central role of PC1 in describing the principal conformational transitions of the spike protein, particularly a hinge-like motion of the RBD that modulates exposure of the ACE2 binding site by changing its proximity to the NTD. In contrast, PC2 primarily captures an interdomain opening/closing motion between the NTD and RBD, as illustrated in Figure and Movie S3. Negative PC2 values correspond to shorter interdomain distances. However, when considering the full ensemble, PC2 exhibits only a weak correlation with the NTD-RBD interdomain distance (R = 0.40) (Figure S6B). Interestingly, when the ensemble is separated into open and closed RBD conformations, PC2 shows a strong correlation within each subgroup (R = 0.81 and 0.82, respectively). This suggests that while PC2 does capture relevant interdomain dynamics, its relationship with structural metrics becomes more evident when analyzing distinct conformational states individually. In other words, this intrinsic motion occurs within both RBD states independently of the broader conformational transition between them, which is primarily described by PC1. Together, PC1 and PC2 account for over 90% of the total conformational variability in the ensemble and thus define an essential subspace for describing the experimental dynamics of the SARS-CoV-2 spike protein.

A computational study previously defined an RBD angle formed by residues 405, 620, and 991 (Figure S3B) as a geometric descriptor capable of distinguishing ACE2 accessibility and monitoring the RBD conformational transition pathway. Based on predicted conformational pathways, the authors proposed two RBD angle ranges: an ACE2-inaccessible range (31.6° to 52.2°) and an ACE2-accessible range (52.2°–84.8°).

In contrast, our analysis is based on an ensemble of experimentally determined SARS-CoV-2 spike structures. We calculated the RBD angle across this ensemble and observed that its density distribution closely resembles that of PC1 (Figure S4D), clearly separating open and closed states into two distinct populations. Importantly, a strong linear correlation was observed between the RBD angle and PC1 (R = 0.99) (Figure S4C), further validating the RBD angle as a robust descriptor of spike conformational dynamics.

To empirically assess ACE2 accessibility, we analyzed the RBD angle distribution in spike structures experimentally resolved in complex with ACE2. These structures exhibited RBD angles ranging from 49.7° to 74.7°, with a density distribution concentrated between approximately 55° and 72°. This experimentally derived range aligns well with the previously predicted ACE2-accessible interval and supports the use of the RBD angle as a proxy for the ACE2 binding potential. Notably, this range excludes the mink spike variant (Y453F), where the mink ACE2 is bound to a spike in a closed conformation (PDB: 8T22) with an RBD angle of 34.8°, falling within the computationally defined inaccessible region. Altogether, these findings reinforce the conclusion that binding of ACE2 preferentially stabilizes the open RBD conformation of the SARS-CoV-2 spike protein.

Notably, PC1, the RBD angle, and the interdomain NTD–RBD distance are strongly correlated and capture the same essential RBD opening motion. For interpretability and consistency, we prioritized RBD angle and PC1 as principal descriptors in our biological analyses, using RMSD and interdomain distance only as supporting metrics when relevant. These combined projections define a conformational subspace that efficiently describes the spike’s functional transition between receptor-inaccessible and receptor-accessible states.

A vector-based analysis considering the CoMs of nine specific spike structural domains (Figure S3C) was recently proposed as a CV to describe spike flexibility. Based on this CV, we generated a PDB file for each structure in the ensemble, containing the Cartesian coordinates of the CoMs of the nine domains, resulting in a trajectory. The same was done from the CoMs of domains used to calculate the dynamic network of the SARS-CoV-2 spike. We then performed PCA on these trajectories and compared it to the one from the SARS-CoV-2 spike ensemble (Cα atoms), revealing a highly similar profile (Figure S7). Because the PC vectors obtained from each analysis differ in dimensionality, direct calculation of their overlap was not feasible.

However, we assessed the similarity of their respective 2D projections onto the first two PCs using three complementary analyses: i. Procrustes, which yielded a disparity score of 0.0049 for the vector model and 0.0151 for the network model (with lower values indicating better alignment); ii. Pearson correlation, treating the 2D coordinates as 1D vectors, resulting in a correlation coefficient of 0.998 and 0.994 for the models, respectively; and iii. the RMSD, which quantified the average Euclidean distance between corresponding 2D points from the two projections, giving a value of 0.383 and 1.893 Å. Altogether, these analyses show that both CVs (vector-based and derived from the dynamical network) are robust to describe the RBD conformational change. The vector-base pattern is the most similar compared to the PCA from the Cartesian coordinates of the ensemble structures; however, the dynamical network CV has all of the spike residues represented.

To further characterize the conformational landscape of the SARS-CoV-2 spike protein ensemble, an RMSD-based clustering analysis was performed to identify recurrent and distinct structural states. This method effectively grouped similar conformations, enabling the identification of both dominant populations and rare structural variants. The analysis yielded 240 distinct clusters, with the 10 most populated clusters accounting for approximately 80% of all conformations in the ensemble (Figure S8 and Movie S6). The largest cluster alone contained over 1100 structures, representing roughly 40% of the total ensemble. Beyond these prevalent conformations, the analysis revealed around 35 moderately populated clusters and 156 singleton clusters composed of unique structures, highlighting the extensive conformational heterogeneity captured in the ensemble.

3.3. Normal Modes Describe Spike Intrinsic Spike Motions

The first two lowest-frequency normal modes calculated from the ensemble reference structure effectively capture the RBD and NTD motions observed in the experimental ensemble (Figure A and Movies S2–S5). Notably, when comparing their directions, there is a close correspondence between the normal modes and the first two principal components (PCs): normal mode 2 resembles PC1 (overlap of 0.73), while normal mode 1 corresponds to PC2 (0.68) (Figure C). This similarity is also confirmed by comparing individually or collectively the DCCMs derived from the two first PCs and normal modes (Figure S5A). However, this correspondence is less evident when inspecting their RMSF profiles, due to differences in amplitude (Figure B), as PC1 alone accounts for approximately 85% of the total ensemble variance (Figure S5B).

The correspondence between intrinsic dynamics and experimental motions of the spike protein was assessed by comparing the first five lowest-frequency normal modes with the essential subspace defined by PCA of the experimental ensemble (Figure S5C). The overlap matrix showed that PC1 aligns best with mode 2 and PC2 aligns best with mode 1, indicating that distinct normal modes capture the principal experimental motions. Projection of both sets onto a common conformational space (Figure C) further revealed strong overlap and consistent structural directionality. These findings confirm that low-frequency normal modes effectively reproduce the dominant experimental transitions and provide mechanistic insight into the large-scale conformational shifts of the spike protein.

Normal mode analysis was used to perform domain analysis and predict hinge points in the spike with the DynDom server. For modes 1 and 2, three dynamic domains were identified: residues 29–318 (NTD) and 592–700 (C2); residues 319–591 (RBD + C1); residues 701–1145 (S2) (Movies S7 and S8). Predicted hinge points were found at residues 317–323 (RBD), 584–594 (C1/C2), and 706–708 (S2). A similar analysis of two spike experimental structures, 6VXX_A (closed) and 6VYB_B (open), revealed two dynamic domains: one for the RBD and the other for the rest of the protein (Movie S9). The hinge regions align with those observed in cryo-EM maps of spike structures with different RBD conformations.

3.4. Ensemble Normal Modes and Signature Analysis of the SARS-CoV-2 Spike

Since proteins exist as dynamic ensembles, ensemble normal modes incorporate this intrinsic plasticity, ensuring that the normal modes align with experimentally observed transitions. In contrast to single-structure NMA, which is highly sensitive to the chosen conformation, ensemble-based methods reduce bias by averaging structural variations.

From the ensemble normal modes, signature profiles can be obtained by calculating the mean and standard deviation of properties such as mode shapes and mean square fluctuations. For the SARS-CoV-2, we calculate the average and standard deviation of the shape of the first three modes (individually or collectively) over all conformations (Figure S9). The signature profile of the lowest-frequency mode primarily describes collective motion of the S2 subunit relative to S1. The signature profile of mode 2 captures an NTD/RBD motion, while mode 3 describes a more complex motion of the S2 subunit. The cumulative signature profile and DCCM derived from the first three ensemble normal modes show an anticorrelated motion between S2 and S1; in addition, the RBD and NTD motions are strongly positively correlated.

The main difference between PCs and normal modes (including the ensemble normal modes) lies in the behavior of the S2 domain. Within the experimental ensemble, the S2 domain of one protomer engages in extensive trimeric interactions, forming a large contact area that stabilizes this region. Structurally, the HR1 and CH regions form a continuous α-helix, with their three copies assembling into a long central three-stranded coiled-coil. This tight trimeric interaction (observed in experimental structures) restricts S2 flexibility. In contrast, in normal-mode analysis based on the protomeric structure, the S2 subunit has fewer contacts and may fluctuate more freely. As a result, normal modes predict greater S2 flexibility compared with PCs, which reflect the constrained motions observed in the experimental ensemble.

To compare modes within the ensemble normal mode data set, spectral overlap, also known as covariance overlap, was used as a more sophisticated measure instead of simple overlap analysis. Additionally, advanced ensemble analysis was performed using matrices based on pairwise metrics: sequence identity (or distance), RMSD, and normal mode similarity (dissimilarity). These matrices were computed for the SARS-CoV-2 ensemble and sorted by spectral overlap distance, RMSD, and sequence identity (see Section ) (Figure S10). This approach provides a comprehensive assessment of the structural and dynamical relationships among ensemble members.

The RMSD-ordered matrix from the ensemble distinctly separates it into two groups: a major homogeneous cluster of RBD-closed structures and another containing open conformations. When sorted by spectral overlap distance, a smaller subset of structures exhibits greater spectral similarity, while a larger, more populated cluster shows greater spectral variation. Sequence-based comparison further reveals a distinct subgroup with significant sequence divergencespecifically, the Omicron variant, which displays around 40 residue differences from other ensemble members. Altogether, these matrices provide valuable insights into sequence, structural, and dynamic relationships within the SARS-CoV-2 ensemble.

3.4.1. Ensemble Normal Mode-Driven Dynamical Networks Analysis

Normal mode analysis predicts the intrinsic motions a protein can undergo based on its structure. These predicted movements can be used to construct a DCCM, which captures how the motions of different protein regions are related. Regions that tend to move in the same direction exhibit strong positive correlations, while those moving in opposite directions show strong negative correlations. Applying PCA to the DCCM does not yield direct information about physical displacements but instead reveals dominant patterns of dynamic coupling within the structure. In this context, the resulting PCs reflect groups of residues or domains exhibiting coordinated or opposing motion tendencies, as inferred from the ensemble of normal modes. Each PC thus highlights dynamically coupled regions that are potentially involved in concerted functional motions. However, unlike PCA performed on a covariance matrix of atomic displacements, this approach does not provide explicit information about displacement magnitudes or directions.

To investigate the dynamic segregation of functional regions, the values of the first PCs were projected using the DCCM derived from the ensemble of normal modes (Figure S11). PC1 in this space recapitulates the pattern observed in the PC1 derived from the full ensemble analysis of Cartesian coordinates (Figure ), where PC1 effectively separates the protomers based on the conformational state of the RBD. This agreement indicates that the normal modes capture key differences between structural states, particularly in the RBD, due to distinct dynamical signatures.

To further explore how dynamic coupling among spike residues varies across conformational states, ligand-binding classes, and D614G mutation status, comparative dynamical networks were constructed based on normalized correlation variability, using the DCCM from the normal mode ensemble (Figure ). In these networks, edge colors indicate residue pairs whose correlations differ significantly across structural groups, revealing an internal dynamic plasticity. In the RBD-closed state (Figure A), a dense pattern of highly correlated connections is observed between the RBD and the central helical (CH) domain, located near the trimer’s CoM.

3.

3

Protein network analysis of the SARS-CoV-2 experimental ensemble. (A–C) Network representations of the SARS-CoV-2 spike protomers from the experimental ensemble: (A) with RBDs in closed and open conformations. (B) Filtered by ligand-binding interactions (e.g., apo, receptors, antibodies); (C) filtered by residue 614 variants (e.g., aspartic acid, glycine, and others). All networks highlight correlated edges, colored by group. (D) A domain bar below annotates the spike regions represented in the networks. Nodes were defined by spike protein domains and interdomain regions: residues 27–303, NTD; 304–318, ID1; 319–541, RBD; 542–591, C1; 592–686, C2; 687–716, ID2; 717–757, ID3; 758–815, ID4; 816–855, FP; 856–919, ID5; 920–970, HR1; 971–1035, CH; and 1036–1147, CTh.

This suggests a stabilizing role for the CH domain in the closed conformation. In contrast, the RBD-open network displays a loss of these RBD-CH correlations, with the RBD instead showing stronger connections to other S1 subunit domains. Notably, the NTD exhibits an increased correlation with multiple regions of the protomer in the open state, indicating that the NTD may play a more central role in interdomain communication when the RBD is exposed.

On the other hand, the networks constructed for ligand-bound groups (Figure B) clearly show distinct dynamical profiles. Structures bound to receptors, primarily ACE2, exhibit widespread, highly correlated connections across nearly all domains and interdomains, consistent with the substantial conformational rearrangements required to achieve and maintain the open state of the RDB. In contrast, antibody-bound structures show no high correlated connections, reflecting the diversity of antibody epitopes present in the protomer of spike, which can target different spike regions independently of the RBD conformation. Apo structures show a more selective pattern with strong correlations limited to a few domains: RBD, C2, and some interdomains, indicating a less flexible trimer state in the absence of ligands.

Lastly, for the groups classified by residue 614 identity (Figure C), showing how the D614G mutation and other less seen substitutions or engineered influences spike dynamics. Structures of the other group exhibit a connectivity pattern similar to that of receptor-bound spikes, likely because many of these structures were designed to stabilize particular RBD conformations. The 614D group shows strong connections in NTD and CH domains with some other domains, reminiscent of the tightly packed closed-state network, supporting the idea that this ancestral form favors reduced conformational flexibility. In contrast, the 614G network displays only a single prominent edge colored between the RBD and C2 domains. This marked reduction in dynamic interdomain connectivity may reflect a more flexible or destabilized core that facilitates spontaneous RBD opening, consistent with the enhanced receptor accessibility and infectivity associated with the D614G mutation.

3.5. Ligand Binding Modulates Spike RBD Conformation

To evaluate the influence of ligand binding on the spike structure, the ensemble was divided into four categories based on ligand interactions: apo (unbound), bound to antibodies, bound to ACE2, and bound to non-ACE2 receptors. Since the Cov3D database provides ligand-binding information at the trimer level, all conformations were reclassified by calculating the number of residues in contact with the ligand. If a conformation has zero contacts, then it is classified as apo.

Within the SARS-CoV-2 ensemble, 48.96% of the conformations are in the unbound state (apo), while 42.62% are bound to antibodies, 7% are bound to ACE2, and 1.43% are bound to other receptors. Analysis of the projection of each group structure onto PC1, but not PC2, indicates that receptor or ACE2 binding (with a few outliers) stabilizes the spike in an open conformation (Figure A,B).

4.

4

Ligand binding-induced changes in SARS-CoV-2 experimental conformations. (A,B) Distribution of protomeric ensemble structures onto PC1 and PC2 (A) and corresponding boxplots (B): apo (blue); and ligand-bound structures, including antibody (red), receptor (pink), and ACE2 (green). Outliers are represented as gray points. (C) Number of spike residues in contact with ligands (within 4 Å) across structures. Marker color (viridis scale) and size indicate the number of contacts projected onto the PC 1–2 space.

Among the outliers, only one closed structure bound to ACE-2, which is a mink variant (PDB-ID 8T22), while seven receptor-bound structures (8BON, 7JZN, 7JZL, 7ZRV, 7ZSS, 7KL9, and 7ZSS) also adopt a closed conformation. In contrast, the apo conformations predominantly remain in the closed state. Notably, antibodies bind both open and closed RBDs with stronger interactions and more contacts than receptor-bound structures (Figure C). Due to their larger size, antibodies can engage multiple RBDs within the trimer. One notable outlier (8W4F) shows high binding (∼180 contacts) with a trivalent nanobody cluster, stabilizing all three chains in the closed state.

All protein–ligand interactions occur within the S1 subunit, primarily in the RBD, which binds the human ACE2 receptor (Figure S12A). Structures complexed with ACE2 interact exclusively within the RBD, focusing on the RBM. In contrast, antibody-bound structures display broader interaction surfaces, engaging an additional domain, the NTD. Analysis of spike interactions with antibody heavy (H) and light chainskappa (K) and lambda (L)reveals a similar binding pattern, predominantly targeting the RBD and interacting with the NTD (Figure S12B). However, the K chain forms fewer contacts with the NTD compared to the H and L chains.

3.6. Structural Impact of Spike Mutations

Given the significance of the D614G mutation, which early in the pandemic shifted the spike protein toward a more open RBD conformation, , we analyzed RBD conformations in both protomeric and trimeric spike structures under two conditions: i. the unbound (Apo) form and ii. complexed with ligands (Lig). Protomer-level analyses are used to isolate the intrinsic effect of ligand binding on individual spike monomers, independent of intersubunit interactions. This approach enables the comparison of RBD conformational preferences across thousands of structures with or without ligand contact. In contrast, trimer-level analyses allow us to evaluate cooperative effectsfor instance, how ligand binding to one protomer may influence the conformational state of adjacent RBDsthus capturing interprotomer coupling that underlies spike functional regulation.

3.6.1. Protomeric Level Analyses

Among the ensemble structures, 45.19% contain the original 614D, while 46.55% carry the 614G mutation. The remaining 8.25% consists of other mutations at the position 614, including engineered variants designed to stabilize specific RBD states (Figure A). It was expected that a higher percentage of structures in the 614G group would exhibit the open RBD conformation. , However, this number is around 39%, which is similar to that of the 614D group (approximately 43%). In both cases, the closed conformation proved to be more stable.

5.

5

Impact of residue 614 mutation on SARS-CoV-2 spike RBD conformation. (A) Projection of protomeric ensemble structures (classified by the residue at 614 position) onto the PC 1–2 space. Original (Wuhan, black) and variant (red) structures are shown, with gray contour lines representing the ensemble density. (B,C) Distribution of trimeric spike structures by RBD conformation: red (at least one RBD open) and green (all RBDs closed), including apo and ligand-bound forms; (C) grayscale gradients indicate the number of protomers with open RBD conformations.

Although previous studies have reported that the D614G mutation promotes RBD opening and enhances ACE2 accessibility and infectivity, our ensemble-based analysis did not reveal a substantial difference in the frequency of open conformations between the D614 and G614 spike protomers. This apparent discrepancy may be explained in part by methodological differences. First, our analysis is based on experimentally resolved structures deposited in the PDB, which are inherently biased toward ligand- or antibody-bound statesconditions that frequently stabilize open RBD conformations.

As a result, this sampling bias may obscure subtle differences in spontaneous opening propensities between D614 and G614 in the absence of binding partners. Second, our approach evaluates protomeric conformations individually, decoupled from trimeric cooperative effects that may be critical for capturing the functional consequences of D614G substitution. Prior studies reporting increased openness of the G614 spike have often relied on full-trimer cryo-EM classifications or functional assays measuring entry efficiency, which capture aspects of conformational equilibrium and cooperativity not directly reflected in static protomer snapshots. Together, these considerations underscore the importance of interpreting structural ensemble data in the context of their sampling constraints and highlight the value of complementary experimental approaches to validating dynamic and functional hypotheses.

The number of unbound (apo) structures is quite different between the two groups: ∼40% 614G; and more than 60% for 614D. From these apo structures, around 81% (614D) and 86% (614G) are in the closed conformation, while for the ligand-bound structures, this percentage is around 38% for 614D and 30% for 614G.

To further assess whether the 614G mutation is a defining feature of SARS-CoV-2 variants, we compared the 614D with the original structures and the 614G with the variant strains, revealing a highly similar RBD profile in both cases (Figure A), confirming this mutation as a hallmark for SARS-CoV-2 variants.

3.6.2. Trimmer Level

Analysis of all SARS-CoV-2 trimeric spike structures in the ensemble reveals that in the Apo group, 51.4% have all three protomers in the closed RBD conformation, whereas 84.9% of ligand-bound (Lig) structures have at least one RBD in the open state (Figure B). These findings highlight the significant impact of ligand binding on the stabilization of the open RBD conformation. Such a ligand effect is similarly observed across spike proteins carrying either the 614D or 614G residue.

For ligand-bound structures, the proportion of fully closed trimers decreases further as expected: only 18.4% of 614D and 9.5% of 614G. These observations support previous studies reporting that the D614G substitution increases RBD accessibility with or without the presence of ligand, likely by destabilizing interprotomer interactions that constrain the closed RBD conformation. ,, On the other hand, among apo structures, 67.0% of the 614D group remain fully closed, while this proportion drops to 47.4% in 614G, indicating a higher baseline RBD flexibility in the 614G.

Additionally, to investigate other determinants of RBD conformation, we assessed the influence of ligand binding in both the original and mutant structures. To get further insights into how ligand binding and the D614G mutation influence the degree of spike conformational change at the trimer level, we analyzed the distribution of open protomers across groups (Figure C). In the apo state, the fully open trimers with all three RBDs in the open conformation are absent across all subgroups, suggesting that this configuration is not observed spontaneously. In contrast, ligand-bound structures could adopt the fully open trimeric configuration with certain frequency, with 29.4% of 614D, 29.9% of 614G, and 24.5% of other mutations in this residue. These results confirm that the simultaneous opening of all RBDs is a ligand-stabilized state rather than a spontaneous equilibrium feature, in agreement with cryo-EM studies showing that trivalent engagement (e.g., by three ACE2 or multivalent antibody interactions) promotes and stabilizes this conformation. , Moreover, the similar proportions of fully open trimers between 614D and 614G in the ligand-bound group suggest that while the D614G mutation enhances spontaneous RBD opening, it does not significantly alter the extent to which ligands can drive the spike into the fully open state.

Our results reveal that ligand-induced RBD opening does not occur uniformly across the spike trimer but instead follows an asymmetric, locally triggered mechanism. At the protomeric level, RBD opening is strongly associated with the binding of the ligand to that specific subunit. However, at the trimeric level, we observe that full RBD opening involving all three protomers is rare and occurs almost exclusively when all subunits are ligand-bound. In many partially open trimer states (e.g., 1‑up or 2‑up conformations), only the ligand-bound protomer(s) exhibit an open RBD, while unbound protomers remain in the closed conformation. This pattern suggests a model of sequential or asymmetric activation, where ligand engagement facilitates a localized conformational change without requiring full concerted motion across the trimer. Nevertheless, the structural integration of the spike suggests some degree of interprotomer influence, potentially stabilizing or modulating the neighboring subunit behavior. These findings align with allosteric coupling and partial cooperativity in spike activation and might have implications for how spike-mediated receptor engagement and immune recognition are regulated.

3.7. Conformational Heterogeneity Across SARS-CoV-2 Variants

The experimental SARS-CoV-2 ensemble was subclassified into variant groups using Cov3D data combined with an MSA-based clustering strategy (see Section ). The variant-specific structures were selected in the PCs 1–2 space with different colors and the percentage representation of the total ensemble (Figure A). Structures from the original (Wuhan) strain have a cluster on negative PC2 values and lower PC1 projections, indicating a closed RBD conformation with minimal interdomain separation between NTD and RBD. This configuration suggests a tighter packing and limited flexibility. In contrast, the Omicron variant populates vastly the same PC2 values but with higher PC1, indicating a new cluster with open RBD conformation but with lower interdomain distance. This observed shift with RBD exposure and lower interdomain flexibility could potentially contribute to altered ACE2 binding kinetics and immune escape strategies.

6.

6

Variant-driven modulation of RBD conformational dynamics. (A) Projection of variant spike structures onto PC1–PC2, colored by variant (percentages are indicated). Gray contours show the ensemble density distribution. (B) Density plots of S RBD and S NTD areas of trimeric spikes. Solid lines show apo, while dotted lines with transparent fill the ligand-bound structures.

To further examine how individual variants distribute within the principal component space, we analyzed the PC1 and PC2 projections across the ensemble using violin plots, separated by RBD conformation (Figure S13). In the PC1 dimension (Figure S13A), structures in the closed RBD conformation generally cluster similarly to the original strain, indicating that most variants maintain a comparable structural baseline in this state.

The Omicron variant shows a modest shift toward more negative PC1 values in the closed state, suggesting a subtle structural compaction. The Kappa variant displays the widest distribution among closed structures and is slightly shifted toward more positive PC1 values, indicating greater conformational variability even without the RBD opening. Among open-state structures, Omicron exhibits the broadest distribution, followed by Gamma and Beta, reflecting increased heterogeneity in their open conformations. Most other variants show PC1 profiles similar to those of the original strain in the open state, with the Alpha variant slightly skewed toward lower values.

In the PC2 dimension (Figure S13B), the original strain presents the widest distribution overall, but its closed and open structures have overlapping profiles centered near zero, suggesting limited interdomain variation between states. For most variants, PC2 distributions differ more distinctly between the closed and open states. A general trend emerges where closed structures tend to have higher PC2 values, while open structures shift toward lower PC2 values. The Omicron variant stands out with the most pronounced separation: its closed-state structures show a narrow distribution at moderately negative PC2 values, while its open-state structures expand into the lowest PC2 space. This indicates a marked conformational reorganization upon RBD opening, reinforcing the notion that Omicron exhibits distinct dynamics compared to the original strain and other variants.

To dissect these conformational differences more precisely, we evaluated collective variables that describe the trimer architecture in both apo and ligand-bound states, as previously proposed 50. Specifically, we calculated the areas formed by the CoMs of the RBDs (S RBD) and NTDs (S NTD) in each trimeric structure (Figure B). The S RBD distributions in the apo structures revealed one to three distinct populations, depending on the variant. Among them, the original, Delta, and Kappa variants exhibited narrower distributions with lower variance, indicative of more stabilized trimer in the absence of ligands. For most variants, including Alpha, Beta, and Delta, ligand binding induced a broader distribution with additional peaks and a dislocation to higher areas, suggesting enhanced flexibility and RBD rearrangements in response to binding events. However, for the original, Omicron, and Gamma variants, the S RBD distributions remained largely unchanged upon ligand binding, implying reduced conformational plasticity or a pre-existing structural equilibrium that favors specific RBD orientations.

In the case of S NTD, the original variant again showed minimal differences between the apo and ligand-bound states, reinforcing the notion of a structurally rigid NTD arrangement. For most variants, ligand binding led to a moderate increase in the S NTD values, consistent with NTD expansion often associated with trimer opening. Interestingly, the Omicron and Alpha variants deviated from this trend: their S NTD distributions shifted toward lower values upon ligand binding, suggesting a more compact NTD configuration. This observation points to a variant-specific structural adaptation in which ligand interaction may induce local compaction in some spike domains rather than global expansion. Together, these results highlight distinct conformational behaviors across variants, with Omicron in particular showing limited change in both S RBD and S NTD upon ligand binding, with a possible preorganized conformation that supports efficient receptor engagement and immune evasion.

3.7.1. Single-Experiment Cryo-EM Multiple Metastates of the Beta Spike Variant

Recent advances in cryo-electron microscopy (cryo-EM) have made it possible to resolve multiple conformational states (metastates) of macromolecular complexes within a single experimental setup, capturing both discrete and continuous structural variations. Among the SARS-CoV-2 spike glycoprotein structures deposited in the PDB, two entries9GDX and 9GDYstand out for containing multiple models. These models correspond to the spike protein of the Beta variant (B.1.351), derived from samples equilibrated during vitrification at two distinct temperatures: 4 °C (9GDX) and 37 °C (9GDY). These multiple models were generated using a deep learning-based framework, HetSIREN, designed to resolve structural heterogeneity by learning a low-dimensional conformational latent space directly from particle images.

In our analysis of the SARS-CoV-2 spike ensemble, we automatically considered only the first model of each entry. However, to fully explore the conformational landscape captured by the cryo-EM-derived multiple models, we constructed a new ensemble comprising all 20 models from each PDB entry (9GDY and 9GDX), resulting in a total of 120 conformations (considering all protomers). We compare this ensemble with the entire SARS-CoV-2 ensemble and with its beta variant conformations (43 trimmers, around 130 conformations). First, we performed a PCA over the cryo-EM multimodel ensemble, and the first two PCs presented a high overlap with those from the entire SARS-CoV-2 ensemble, 0.98 and 0.86, respectively (Figure S5D). The PC projection of each structure on ensemble subspace revealed a similar profile of the 9GDX models when compared to the beta variant from the SARS-CoV-2 ensemble (Figure S14A), presenting a mix of closed and open structures.

The 9GDX ensemble revealed a mixture of 1‑up and 2‑up conformations of the spike trimer, whereas in 9GDY, a distinct conformational landscape dominated by the 3‑down state was observed. Moreover, 9GDY presented a minor population adopting the 1‑up conformation (Figure S14B). Beyond the differences in RBD conformations, additional temperature-dependent structural variability was observed in the NTD, which was particularly pronounced in the 9GDY structures. This ensemble sampled structures with more negative values of PC2, which correspond to NTD/RBD motions (Figure S14C). Notably, the 1‑up conformation at 37 °C exhibited a more restricted opening compared to its counterpart at 4 °C, underscoring the influence of temperature on the conformational flexibility and metastability of the spike trimer.

3.7.2. Structural and Functional Variability of Omicron Subvariants

To investigate conformational variability within the Omicron lineage, we projected the subvariants onto the same PV space (PC1–PC2) used for the global ensemble analysis (Figure S15). In the low-PC1 region associated with the closed RBD conformation, all Omicron subvariant structures remain confined to a similar zone, indicating structural conservation in this state. In contrast, in the high-PC1 region associated with the open RBD conformation, the sublineages diverge more visibly. BA.1 and BA.2 subvariants extend into the extreme negative range of PC2 values, while XBB subvariants occupy a region with the highest PC1 values among the Omicron group. These patterns suggest that although all Omicron subvariants share a preference for RBD openness, they exhibit subtle conformational differences that may reflect mutations affecting the dynamics of the protomer. Notably, the early sublineages appear to contribute more strongly to the exploration of extreme PC2 values, a feature that is diminished or absent in more recent sublineages such as XBB, pointing to a potential evolutionary narrowing of the conformational landscape within the lineage.

3.7.3. Omicron Variant Displays Distinct Dynamics

The Omicron variant of SARS-CoV-2 present in the experimental ensemble displays markedly dynamic behavior compared to other variants, as revealed by ensemble NM-based dynamical network analyses (Figure A). The generated dynamical networks for both Omicron and non-Omicron spike structures with the same communities revealed substantial differences within the S1 subunit.

7.

7

Omicron variants display distinct RBD dynamics. (A) Dynamical network analysis of Omicron and non-Omicron variant structures, highlighting differences in interdomain correlations. Communities were remodeled according to the dynamical domain bar shown below the plots. Nodes were defined by spike protein domains and interdomain regions: residues 27–303, NTD; 304–318, ID1; 319–541, RBD; 542–591, C1; 592–686, C2; 687–716, ID2; 717–757, ID3; 758–815, ID4; 816–855, FP; 856–919, ID5; 920–970, HR1; 971–1035, CH; and 1036–1147, CTh. (B) Comparison of RBD conformations of trimeric structures (all, apo, and ligand-bound): red (at least one RBD open) and green (all RBDs closed).

In Omicron structures, interdomain correlations between NTD, RBD, and neighboring subdomains were notably altered, getting more correlated, and paths that communicate between RBD and the S2 subunit were more weakened, suggesting a rearrangement of collective motions. These changes likely arise from the high number of mutations in Omicron, compared to other variants and the Wuhan spike. Notably, mutations such as S371L, S373P, and S375F have been implicated in enhancing the stability of RBD conformations, thereby modulating the spike flexibility. These alterations may contribute to Omicron’s ability of the omicron to evade neutralizing antibodies while maintaining efficient ACE2 binding.

3.7.3.1. Apo Omicron Variant Preferentially Stabilizes RBD in the Closed Conformation

Complementing the network analysis, a bar plot comparison of RBD conformational states within trimeric spike structures of the structures of the SARS-CoV-2 ensemble is shown in Figure B. These data indicate that the Omicron variant predominantly adopts a closed RBD conformation in its apo form, contrasting with other non-Omicron structures that more frequently exhibit open RBD states. This closed state preference in Omicron is consistent with the cryo-EM study showing a higher proportion of spike proteins with all RBDs in the down position, which may facilitate immune evasion by concealing key epitopes. However, upon binding to ligands, Omicron’s spike protein can transition to open RBD conformations more easily than the other group, indicating that ligand engagement can overcome the conformational constraints imposed by its mutations, consistent with ref . These findings underscore the dynamic adaptability of the Omicron spike protein, balancing immune evasion with receptor accessibility.

3.7.3.2. Omicron Interactions with RBD (or Non-RBD)-Targeting Antibodies

To assess how Omicron’s conformational plasticity may influence epitope accessibility and immune recognition, we performed a structural analysis of spike–antibody complexes using the Cov3D classification system and epitope definitions from Chen et al. and Barnes et al. Antibodies were grouped into RBD-targeting classes (Classes 1–4, with Classes 1 and 2 specifically engaging the RBM) and non-RBD-targeting categories (NTD, SD1, NTD/SD2, and S2). We compared the Omicron and non-Omicron variant complexes to identify potential differences in conformational preferences and contact patterns.

As shown in Figure S16A, Class 1 and 4 antibodies preferentially bind to open RBD conformations, whereas Class 2 and 3 antibodies engage in both open and closed states. Antibodies targeting the S2 region primarily interact with glycans in the HR1 domain and exhibit minimal direct contact with the spike protein surface. Despite Omicron’s distinct conformational preference for the RBD-closed state in the apo form, we did not observe statistically significant differences in the number of spike–antibody contacts between Omicron and non-Omicron variants across epitope classes (Figure S16B). A residue-level comparison of spike–antibody interfaces (Figure S17) revealed largely conserved contact patterns, with the most notable difference observed in NTD-targeting antibodies: Omicron structures exhibited increased interaction with residues located at the N-terminal region of the domain (with a peak around residue 82).

Overall, these findings indicate that the Omicron retains the established class-specific binding modes of antibody engagement, while modest shifts in NTD epitope usage may contribute to immune evasion. Importantly, prior studies have demonstrated that targeting the RBM specifically mimicking the ACE2-binding interface can be an effective antiviral strategy. Our results highlight that conformational accessibility of RBM-targeted epitopes depends on dynamic RBD motions and variant-specific stabilization patterns, which should be considered in therapeutic design.

The dynamic features identified in this study have potential applications for the rational design of vaccines and therapeutics targeting SARS-CoV-2 and emerging variants. Stabilization of the spike protein in a specific conformational state, particularly the RBD-closed form, has been a key strategy in current vaccine development. Our findings reinforce that the Omicron variant naturally favors this closed conformation, suggesting that vaccine immunogen design must account for altered epitope exposure and reduced interdomain flexibility. Conversely, therapeutic antibodies may need to target conserved or cryptic epitopes accessible in both closed and transiently open states or exploit the identified hinge regions and interdomain correlations to destabilize functionally essential conformations.

3.8. Characterizing the RBD Transition Landscape through MD Simulations

3.8.1. Standard Microsecond-Long MD Simulations Fail to Sample RBD Open/Closed Transition

We analyzed two 10 μs-long standard MD simulations of the SARS-CoV-2 trimer spike protein starting from all‑down and 1‑up RBD structure (PDB-IDs 6VXX and 6VYB, respectively), projecting the trajectories onto the essential space defined by the PC1–PC2 space (Figure A). In both cases, the conformational sampling remained confined near their respective initial states, with no significant transitions observed between closed and open RBD conformations. Simulation 1 (starting from 6VXX) explored a narrow region corresponding to closed conformations, while simulation 2 (starting from 6VYB) had chain B (RBDup) remain trapped within an open region without the transition to a closed state. These results highlight the inherent limitations of conventional MD simulations in capturing large-scale conformational transitions of the spike protein within accessible time scales, emphasizing the need for enhanced sampling strategies to explore its functional dynamics more completely.

8.

8

Conformational sampling of RBD transitions by standard and hybrid MD. (A) PCA projections of two 10 μs standard MD simulations starting from different structures (6VXX: left; 6VYB: right), shown as jet-colored density contours. Gray contours represent experimental ensemble projections. (B) Hybrid NM-based MD trajectories (red dots) projected onto the first two normal modes, with representative clusters shown as blue dots. Right: corresponding FEL illustrating the thermodynamic distribution of sampled conformations. Light gray dots indicate experimental ensemble structures.

3.8.2. Hybrid MD Simulations Fully Explore Spike Experimental Conformational Space

A recent benchmarking study highlighted MDeNM’s capacity to reproduce large-scale conformational changes in complex biomolecular systems, reinforcing its value as a predictive tool for studying protein flexibility and function. Notably, metadynamics simulations using normal modes as CVs show good agreement with MDeNM results, although MDeNM tends to be more computationally efficient. Furthermore, MDeNM has been demonstrated to effectively capture protein dynamics on the microsecond time scale, making it a powerful approach for sampling slow conformational transitions. A comprehensive overview of NM-based enhanced sampling methods is provided in ref .

To overcome the sampling limitations observed in standard MD simulations, we employed MD with excited normal modes (MDeNM) to explore the conformational landscape of the SARS-CoV-2 spike protein (Figure B). MDeNM trajectories efficiently sampled a broad range of conformations, encompassing both closed and open RBD states observed experimentally. Clustering of these trajectories revealed representative structures that span the full extent of the essential space defined by the first two normal modes. Furthermore, an FEL constructed from the simulations of representative structures demonstrated all of the space present in the experimental SARS-CoV-2 ensemble.

In summary, we highlight that standard MD simulations initiated from either the all-closed or partially open (1‑up) conformations remain kinetically trapped around their starting states and fail to explore transitions between open and closed RBD conformations, even over 10 μs time scales. In contrast, MDeNM, by selectively injecting energy along low-frequency normal modes, efficiently samples both closed and open conformations, fully covering the conformational space observed in experimental structures. This underscores the predictive and exploratory power of MDeNM in capturing large-scale, low-probability transitions that are functionally relevant for spike activation but not accessible by standard MD under practical time scales.

These findings demonstrate that hybrid MD simulations are capable of capturing the full range of experimentally observed spike dynamics, providing a powerful tool for studying the conformational flexibility critical to spike function and viral infectivity. Moreover, the use of enhanced sampling techniques, such as MDeNM, to predict structural flexibility could guide the identification of allosteric sites or conformational traps, expanding the repertoire of antiviral strategies beyond the receptor-binding interface.

4. Conclusions

In this study, we conducted a comprehensive structural and dynamical analysis of HCoV spike proteins with a particular focus on SARS-CoV-2 and its variants. Our investigation was driven by the central question of how sequence variation, ligand interactions, and environmental factors shape the conformational plasticity of the spike protein and how these changes could impact viral infectivity, immune escape, and therapeutic design.

By constructing and analyzing a large ensemble of experimentally determined structures (over 3000 conformations), we demonstrated that despite considerable sequence divergence, spike proteins from different HCoVs exhibit a conserved capacity to sample both closed and open RBD conformations. For SARS-CoV-2, PC and normal-mode analyses revealed that the dominant conformational transition involves a hinge-like motion of the RBD relative to the NTD. This movement is captured by PC1 and is strongly correlated with geometric descriptors such as RMSD, interdomain distance, and RBD angle, all of which describe the same essential transition.

Importantly, our analyses showed that ligand binding is consistently associated with open RBD conformations in both protomeric and trimeric spike structures. The strong correlation between ligand engagement and RBD opening supports a model in which binding events contribute to stabilizing functionally relevant open states. Our separation of protomer and trimer analyses allowed us to assess the intrinsic influence of binding on individual protomers while also exploring potential cooperative effects within the trimer, including the impact of ligand binding in one protomer on the conformational states of neighboring subunits.

Nonetheless, our ensemble strategy also presents a series of limitations. PDB structural data are inherently biased toward ligand-bound or antibody-bound spike conformations, which may overrepresent open RBD states and underrepresent functionally relevant, unbound open states. Additionally, structural heterogeneity in experimental conditions and construct design (e.g., stabilizing mutations) could influence the observed conformational distributions. While we attempt to account for these factors through stratified analyses and enhanced simulations, such biases must be considered when generalizing conformational trends across variants or inferring functional mechanisms from structural ensembles alone.

A key outcome of this study is the identification of variant-specific conformational and dynamical signatures. Notably, the Omicron variant adopts a unique compact open-RBD conformation with reduced interdomain separation, a state not previously described. Our ensemble-based network analysis further revealed rewiring of allosteric communication pathways in Omicron, including weakened coupling between the RBD and S2 subunit, alongside increased correlation within S1 domains. In contrast, the broader conformational landscape observed in variants such as Delta and Gamma may reflect a more balanced trade-off between immune exposure and receptor accessibility. These dynamic signatures underscore how SARS-CoV-2 variants fine-tune spike plasticity to optimize fitness. Furthermore, the identification of hinge regions and correlated domain motions across variants could inform vaccine strategies aimed at stabilizing the spike in specific conformations or guide therapeutic design targeting motion-restricting regions to impair viral entry.

Using single-experiment multimodel cryo-EM data sets from the Beta variant, we captured metastable states of the Beta variant spike, with a clear temperature-dependent shift between open and closed RBD conformations. Comparison with the broader SARS-CoV-2 ensemble revealed that the low-temperature cryo-EM multimodels closely matched the distribution and essential dynamical space occupied by Beta variant structures. This strong correspondence validates both ensemble-based and single-experiment multiple-state cryo-EM approaches for accurately capturing spike conformational heterogeneity and metastability, suggesting that targeting metastable states may be a viable strategy.

Our findings also contextualize the effects of the D614G mutation, showing that while this substitution may modestly increase the population of open RBD conformations, ligand binding remains the stronger determinant of the RBD state within both mutant and wild-type structures. Furthermore, we demonstrated that trimeric spike structures with three RBDs in the open state are observed only in the presence of ligands, underscoring the cooperative nature of spike activation.

Finally, we showed that standard MD simulations fail to capture large-scale RBD transitions within accessible time scales, whereas our hybrid simulations using MdeNM successfully sampled the full range of experimentally observed conformations. This highlights the predictive power of our ensemble-based hybrid approach for exploring spike flexibility and identifying potential conformational intermediates of functional relevance. Altogether, this work provides mechanistic and structural insights into the conformational behavior of SARS-CoV-2 spike variants, revealing how mutations, binding events, and environmental factors dynamically shape spike function. These findings contribute to our understanding of variant-specific viral adaptation and offer valuable directions for the rational design of vaccines and therapeutics that target the conserved and flexible regions of spike protein.

Supplementary Material

ci5c00990_si_001.pdf (13.1MB, pdf)
ci5c00990_si_002.zip (112.7MB, zip)
ci5c00990_si_003.xlsx (201.9KB, xlsx)

Acknowledgments

All authors would like to thank CAPES, CNPQ, FAPERJ, FIOCRUZ, and Oswaldo Cruz Institute, especially the postgraduation program in Computational and System Biology. We also thank the Scientific Computational Program for the support. We are especially grateful to Reviewer 2 for the valuable suggestion to include the complementary antibody analysis.

The scripts used to download, process, and build the structural ensemble of SARS-CoV-2 spike glycoprotein structurescomprising 72 conformations derived from the 20 most representative PDB cluster entriesare available at: https://github.com/yago52/tutorial_spike. This repository also includes the full ensemble of SARS-CoV-2 spike structures, as well as scripts for analysis, plotting, and dynamical network analysis. Additionally, the structures, configuration files, and input scripts required to perform the MDeNM simulations are provided and can be accessed directly at: https://github.com/yago52/tutorial_spike/blob/main/mdenm_spike.tar.gz. Two 10 μs molecular dynamics trajectories of the trimeric SARS-CoV-2 spike protein, generated by D. E. Shaw Research, were obtained from the COVID-19 Molecular Structure and Therapeutics Hub (https://covid.molssi.org/org-contributions/) and are available under the data set codes DESRES-ANTON-11021566 and DESRES-ANTON-11021571.

The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jcim.5c00990.

  • Spike structural organization, analysis workflows, PCA and NM comparisons, clustering results, residue contact maps, and conformational analyses by variant (PDF)

  • Molecular animations visualizing SARS-CoV-2 spike ensemble dynamics, PC motions, NM displacements, representative cluster structures, and domain analysis (ZIP)

  • Annotation of HCoV spike protein PDB structures used in the study, including resolution, chain ID, organism, RBD state, ligand information, and classification (XLSX)

§.

Faculdade de Tecnologia, Universidade Estadual de Campinas, 13484-332, Limeira, São Paulo, Brazil

YFS performed research, analyzed data, and wrote the paper. HHF performed the hybrid MD simulations, analyzed data, and reviewed the paper. PRB designed and performed research, analyzed data, and wrote the paper. All authors have given approval to the final version of the manuscript.

Fundação Carlos Chagas Filho de Amparo à Pesquisa do Estado do Rio de Janeiro. The Article Processing Charge for the publication of this research was funded by the Coordenacao de Aperfeicoamento de Pessoal de Nivel Superior (CAPES), Brazil (ROR identifier: 00x0ma614).

The authors declare no competing financial interest.

Published as part of Journal of Chemical Information and Modeling special issue “Computational Chemistry in the Global South: The Latin American Perspective”.

References

  1. Lai C. C., Shih T. P., Ko W. C., Tang H. J., Hsueh P. R.. Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) and Coronavirus Disease-2019 (COVID-19): The Epidemic and the Challenges. Int. J. Antimicrob. Agents. 2020;55(3):105924. doi: 10.1016/j.ijantimicag.2020.105924. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Goyal R., Gautam R. K., Chopra H., Dubey A. K., Singla R. K., Rayan R. A., Kamal M. A.. Comparative Highlights on MERS-CoV, SARS-CoV-1, SARS-CoV-2, and NEO-CoV. EXCLI J. 2022;21:1245–1272. doi: 10.17179/excli2022-5355. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Naqvi A. A. T., Fatima K., Mohammad T., Fatima U., Singh I. K., Singh A., Atif S. M., Hariprasad G., Hasan G. M., Hassan M. I.. Insights into SARS-CoV-2 Genome, Structure, Evolution, Pathogenesis and Therapies: Structural Genomics Approach. Biochim. Biophys. Acta, Mol. Basis Dis. 2020;1866(10):165878. doi: 10.1016/j.bbadis.2020.165878. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Li X., Yuan H., Li X., Wang H.. Spike Protein Mediated Membrane Fusion during SARS-CoV-2 Infection. J. Med. Virol. 2023;95(1):e28212. doi: 10.1002/jmv.28212. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Martínez-Flores D., Zepeda-Cervantes J., Cruz-Reséndiz A., Aguirre-Sampieri S., Sampieri A., Vaca L.. SARS-CoV-2 Vaccines Based on the Spike Glycoprotein and Implications of New Viral Variants. Front. Immunol. 2021;12:701501. doi: 10.3389/fimmu.2021.701501. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Calvaresi V., Wrobel A. G., Toporowska J., Hammerschmid D., Doores K. J., Bradshaw R. T., Parsons R. B., Benton D. J., Roustan C., Reading E., Malim M. H., Gamblin S. J., Politis A.. Structural Dynamics in the Evolution of SARS-CoV-2 Spike Glycoprotein. Nat. Commun. 2023;14(1):1–14. doi: 10.1038/s41467-023-36745-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Lan J., Ge J., Yu J., Shan S., Zhou H., Fan S., Zhang Q., Shi X., Wang Q., Zhang L., Wang X.. Structure of the SARS-CoV-2 Spike Receptor-Binding Domain Bound to the ACE2 Receptor. Nature. 2020;581(7807):215–220. doi: 10.1038/s41586-020-2180-5. [DOI] [PubMed] [Google Scholar]
  8. Wrobel A. G.. Mechanism and Evolution of Human ACE2 Binding by SARS-CoV-2 Spike. Curr. Opin. Struct. Biol. 2023;81:102619. doi: 10.1016/j.sbi.2023.102619. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Walls A. C., Park Y.-J., Tortorici M. A., Wall A., McGuire A. T., Veesler D.. Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein. Cell. 2020;183(6):1735. doi: 10.1016/j.cell.2020.11.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Jackson C. B., Zhang L., Farzan M., Choe H.. Functional Importance of the D614G Mutation in the SARS-CoV-2 Spike Protein. Biochem. Biophys. Res. Commun. 2021;538:108–115. doi: 10.1016/j.bbrc.2020.11.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Mansbach R. A., Chakraborty S., Nguyen K., Montefiori D. C., Korber B., Gnanakaran S.. The SARS-CoV-2 Spike Variant D614G Favors an Open Conformational State. Sci. Adv. 2021;7(16):eabf3671. doi: 10.1126/sciadv.abf3671. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Ozono S., Zhang Y., Ode H., Sano K., Tan T. S., Imai K., Miyoshi K., Kishigami S., Ueno T., Iwatani Y., Suzuki T., Tokunaga K.. SARS-CoV-2 D614G Spike Mutation Increases Entry Efficiency with Enhanced ACE2-Binding Affinity. Nat. Commun. 2021;12(1):1–9. doi: 10.1038/s41467-021-21118-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Winger A., Caspari T.. The Spike of ConcernThe Novel Variants of SARS-CoV-2. Viruses. 2021;13(6):1002. doi: 10.3390/v13061002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Davies N. G., Abbott S., Barnard R. C., Jarvis C. I., Kucharski A. J., Munday J. D., Pearson C. A. B., Russell T. W., Tully D. C., Washburne A. D., Wenseleers T., Gimma A., Waites W., Wong K. L. M., van Zandvoort K., Silverman J. D., Diaz-Ordaz K., Keogh R., Eggo R. M., Funk S., Jit M., Atkins K. E., John Edmunds W.. CMMID COVID-19 Working Group; COVID-19 Genomics UK (COG-UK) Consortium. Estimated Transmissibility and Impact of SARS-CoV-2 Lineage B.1.1.7 in England. Science. 2021;372:eabg3055. doi: 10.1126/science.abg3055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Tegally H., Wilkinson E., Giovanetti M., Iranzadeh A., Fonseca V., Giandhari J., Doolabh D., Pillay S., San E. J., Msomi N., Mlisana K., von Gottberg A., Walaza S., Allam M., Ismail A., Mohale T., Glass A. J., Engelbrecht S., Van Zyl G., Preiser W., Petruccione F., Sigal A., Hardie D., Marais G., Hsiao N.-Y., Korsman S., Davies M.-A., Tyers L., Mudau I., York D., Maslo C., Goedhals D., Abrahams S., Laguda-Akingba O., Alisoltani-Dehkordi A., Godzik A., Wibmer C. K., Sewell B. T., Lourenço J., Alcantara L. C. J., Kosakovsky Pond S. L., Weaver S., Martin D., Lessells R. J., Bhiman J. N., Williamson C., de Oliveira T.. Detection of a SARS-CoV-2 Variant of Concern in South Africa. Nature. 2021;592(7854):438–443. doi: 10.1038/s41586-021-03402-9. [DOI] [PubMed] [Google Scholar]
  16. Faria N. R., Mellan T. A., Whittaker C., Claro I. M., Candido D. d. S., Mishra S., Crispim M. A. E., Sales F. C. S., Hawryluk I., McCrone J. T., Hulswit R. J. G., Franco L. A. M., Ramundo M. S., de Jesus J. G., Andrade P. S., Coletti T. M., Ferreira G. M., Silva C. A. M., Manuli E. R., Pereira R. H. M., Peixoto P. S., Kraemer M. U. G., Gaburo N. Jr, Camilo C. d. C., Hoeltgebaum H., Souza W. M., Rocha E. C., de Souza L. M., de Pinho M. C., Araujo L. J. T., Malta F. S. V., de Lima A. B., Silva J. d. P., Zauli D. A. G., Ferreira A. C. d. S., Schnekenberg R. P., Laydon D. J., Walker P. G. T., Schlüter H. M., dos Santos A. L. P., Vidal M. S., Del Caro V. S., Filho R. M. F., dos Santos H. M., Aguiar R. S., Proença-Modena J. L., Nelson B., Hay J. A., Monod M., Miscouridou X., Coupland H., Sonabend R., Vollmer M., Gandy A., Prete C. A. Jr, Nascimento V. H., Suchard M. A., Bowden T. A., Pond S. L. K., Wu C. H., Ratmann O., Ferguson N. M., Dye C., Loman N. J., Lemey P., Rambaut A., Fraiji N. A., Carvalho M. d. P. S. S., Pybus O. G., Flaxman S., Bhatt S., Sabino E. C.. Genomics and Epidemiology of the P.1 SARS-CoV-2 Lineage in Manaus, Brazil. Science. 2021;372:815–821. doi: 10.1126/science.abh2644. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Mlcochova P., Kemp S. A., Dhar M. S., Papa G., Meng B., Ferreira I. A. T. M., Datir R., Collier D. A., Albecka A., Singh S., Pandey R., Brown J., Zhou J., Goonawardane N., Mishra S., Whittaker C., Mellan T., Marwal R., Datta M., Sengupta S., Ponnusamy K., Radhakrishnan V. S., Abdullahi A., Charles O., Chattopadhyay P., Devi P., Caputo D., Peacock T., Wattal C., Goel N., Satwik A., Vaishya R., Agarwal M., Chauhan H., Dikid T., Gogia H., Lall H., Verma K., Dhar M. S., Singh M. K., Soni N., Meena N., Madan P., Singh P., Sharma R., Sharma R., Kabra S., Kumar S., Kumari S., Sharma U., Chaudhary U., Sivasubbu S.. et al. SARS-CoV-2 B.1.617.2 Delta Variant Replication and Immune Evasion. Nature. 2021;599(7883):114–119. doi: 10.1038/s41586-021-03944-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Viana R., Moyo S., Amoako D. G., Tegally H., Scheepers C., Althaus C. L., Anyaneji U. J., Bester P. A., Boni M. F., Chand M., Choga W. T., Colquhoun R., Davids M., Deforche K., Doolabh D., du Plessis L., Engelbrecht S., Everatt J., Giandhari J., Giovanetti M., Hardie D., Hill V., Hsiao N.-Y., Iranzadeh A., Ismail A., Joseph C., Joseph R., Koopile L., Kosakovsky Pond S. L., Kraemer M. U. G., Kuate-Lere L., Laguda-Akingba O., Lesetedi-Mafoko O., Lessells R. J., Lockman S., Lucaci A. G., Maharaj A., Mahlangu B., Maponga T., Mahlakwane K., Makatini Z., Marais G., Maruapula D., Masupu K., Matshaba M., Mayaphi S., Mbhele N., Mbulawa M. B., Mendes A., Mlisana K., Mnguni A., Mohale T., Moir M., Moruisi K., Mosepele M., Motsatsi G., Motswaledi M. S., Mphoyakgosi T., Msomi N., Mwangi P. N., Naidoo Y., Ntuli N., Nyaga M., Olubayo L., Pillay S., Radibe B., Ramphal Y., Ramphal U., San J. E., Scott L., Shapiro R., Singh L., Smith-Lawrence P., Stevens W., Strydom A., Subramoney K., Tebeila N., Tshiabuila D., Tsui J., van Wyk S., Weaver S., Wibmer C. K., Wilkinson E., Wolter N., Zarebski A. E., Zuze B., Goedhals D., Preiser W., Treurnicht F., Venter M., Williamson C., Pybus O. G., Bhiman J., Glass A., Martin D. P., Rambaut A., Gaseitsiwe S., von Gottberg A., de Oliveira T.. Rapid Epidemic Expansion of the SARS-CoV-2 Omicron Variant in Southern Africa. Nature. 2022;603(7902):679–686. doi: 10.1038/s41586-022-04411-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Statement on the update of WHO’s working definitions and tracking system for SARS-CoV-2 variants of concern and variants of interest. https://www.who.int/news/item/16-03-2023-statement-on-the-update-of-who-s-working-definitions-and-tracking-system-for-sars-cov-2-variants-of-concern-and-variants-of-interest (accessed Jan 17, 2025). [PMC free article] [PubMed]
  20. Starr T. N., Greaney A. J., Hilton S. K., Ellis D., Crawford K. H. D., Dingens A. S., Navarro M. J., Bowen J. E., Tortorici M. A., Walls A. C., King N. P., Veesler D., Bloom J. D.. Deep Mutational Scanning of SARS-CoV-2 Receptor Binding Domain Reveals Constraints on Folding and ACE2 Binding. Cell. 2020;182(5):1295.e20–1310.e20. doi: 10.1016/j.cell.2020.08.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Cao Y., Wang J., Jian F., Xiao T., Song W., Yisimayi A., Huang W., Li Q., Wang P., An R., Wang J., Wang Y., Niu X., Yang S., Liang H., Sun H., Li T., Yu Y., Cui Q., Liu S., Yang X., Du S., Zhang Z., Hao X., Shao F., Jin R., Wang X., Xiao J., Wang Y.. et al. Omicron Escapes the Majority of Existing SARS-CoV-2 Neutralizing Antibodies. Nature. 2021;602(7898):657–663. doi: 10.1038/s41586-021-04385-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Araf Y., Akter F., Tang Y.-D., Fatemi R., Parvez M. S. A., Zheng C., Hossain M. G.. Omicron Variant of SARS-CoV-2: Genomics, Transmissibility, and Responses to Current COVID-19 Vaccines. J. Med. Virol. 2022;94(5):1825–1832. doi: 10.1002/jmv.27588. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Kumar S., Karuppanan K., Subramaniam G.. Omicron (BA.1) and Sub-Variants (BA.1.1, BA.2, and BA.3) of SARS-CoV-2 Spike Infectivity and Pathogenicity: A Comparative Sequence and Structural-Based Computational Assessment. J. Med. Virol. 2022;94(10):4780–4791. doi: 10.1002/jmv.27927. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Yang J., Hong W., Lei H., He C., Lei W., Zhou Y., Zhao T., Alu A., Ma X., Li J., Yang L., Wang Z., Wang W., Lu G., Shen G., Lu S., Wu G., Shi H., Wei X.. Low Levels of Neutralizing Antibodies against XBB Omicron Subvariants after BA.5 Infection. Signal Transduction Targeted Ther. 2023;8(1):1–12. doi: 10.1038/s41392-023-01495-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Machado R. R. G., Candido E. ´. D., Aguiar A. S., Chalup V. N., Sanches P. R., Dorlass E. G., Amgarten D. E., Pinho J. R. R., Durigon E. L., Oliveira D. B. L.. Immune Evasion of SARS-CoV-2 Omicron Subvariants XBB.1.5, XBB.1.16 and EG.5.1 in a Cohort of Older Adults after ChAdOx1-S Vaccination and BA.4/5 Bivalent Booster. Vaccines. 2024;12(2):144. doi: 10.3390/vaccines12020144. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Rath S. L., Kumar K.. Investigation of the Effect of Temperature on the Structure of SARS-CoV-2 Spike Protein by Molecular Dynamics Simulations. Front. Mol. Biosci. 2020;7:583523. doi: 10.3389/fmolb.2020.583523. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Turoňová B., Sikora M., Schürmann C., Hagen W. J. H., Welsch S., Blanc F. E. C., von Bülow S., Gecht M., Bagola K., Hörner C., van Zandbergen G., Landry J., de Azevedo N. T. D., Mosalaganti S., Schwarz A., Covino R., Mühlebach M. D., Hummer G., Locker J. K., Beck M.. In Situ Structural Analysis of SARS-CoV-2 Spike Reveals Flexibility Mediated by Three Hinges. Science. 2020;370:203–208. doi: 10.1126/science.abd5223. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Parihar A., Ahmed S. S., Sharma P., Choudhary N. K., Akter F., Ali M. A., Sonia Z. F., Khan R.. Plant-Based Bioactive Molecules for Targeting of Endoribonuclease Using Steered Molecular Dynamic Simulation Approach: A Highly Conserved Therapeutic Target against Variants of SARS-CoV-2. Mol. Simul. 2023;49:1267. doi: 10.1080/08927022.2022.2113811. [DOI] [Google Scholar]
  29. Kim S., Liu Y., Lei Z., Dicker J., Cao Y., Zhang X. F., Im W.. Differential Interactions between Human ACE2 and Spike RBD of SARS-CoV-2 Variants of Concern. J. Chem. Theory Comput. 2021;17:7972–7979. doi: 10.1021/acs.jctc.1c00965. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Pipitò L., Rujan R.-M., Reynolds C. A., Deganutti G.. Molecular Dynamics Studies Reveal Structural and Functional Features of the SARS-CoV-2 Spike Protein. BioEssays. 2022;44(9):2200060. doi: 10.1002/bies.202200060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Mahmoudi Gomari M., Rostami N., Omidi-Ardali H., Arab S. S.. Insight into Molecular Characteristics of SARS-CoV-2 Spike Protein Following D614G Point Mutation, a Molecular Dynamics Study. J. Biomol. Struct. Dyn. 2022;40:5634–5642. doi: 10.1080/07391102.2021.1872418. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Abduljalil J. M., Elghareib A. M., Samir A., Ezat A. A., Elfiky A. A.. How Helpful Were Molecular Dynamics Simulations in Shaping Our Understanding of SARS-CoV-2 Spike Protein Dynamics? Int. J. Biol. Macromol. 2023;242(Pt 4):125153. doi: 10.1016/j.ijbiomac.2023.125153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Sakkiah S., Guo W., Pan B., Ji Z., Yavas G., Azevedo M., Hawes J., Patterson T. A., Hong H.. Elucidating Interactions Between SARS-CoV-2 Trimeric Spike Protein and ACE2 Using Homology Modeling and Molecular Dynamics Simulations. Front. Chem. 2021;8:622632. doi: 10.3389/fchem.2020.622632. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Yang Y., Zhang Y., Qu Y., Zhang C., Liu X.-W., Zhao M., Mu Y., Li W.. Key Residues of the Receptor Binding Domain in the Spike Protein of SARS-CoV-2 Mediating the Interactions with ACE2: A Molecular Dynamics Study. Nanoscale. 2021;13(20):9364–9370. doi: 10.1039/D1NR01672E. [DOI] [PubMed] [Google Scholar]
  35. Mansbach R. A., Chakraborty S., Nguyen K., Montefiori D. C., Korber B., Gnanakaran G.. The SARS-CoV-2 Spike Variant D614G Favors an Open Conformational State. Biophys. J. 2021;120(3):298a. doi: 10.1016/j.bpj.2020.11.1904. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Neto D. F. d. L., Fonseca V., Jesus R., Dutra L. H., Portela L. M. d. O., Freitas C., Fillizola E., Soares B., Abreu A. L. d., Twiari S., Azevedo V., Goes-Neto A., de Medeiros A. C., Lopes N. P., Zanotto P. M. d. A., Kato R. B.. Molecular Dynamics Simulations of the SARS-CoV-2 Spike Protein and Variants of Concern: Structural Evidence for Convergent Adaptive Evolution. J. Biomol. Struct. Dyn. 2023;41:5789–5801. doi: 10.1080/07391102.2022.2097955. [DOI] [PubMed] [Google Scholar]
  37. Teruel N., Mailhot O., Najmanovich R. J.. Modelling Conformational State Dynamics and Its Role on Infection for SARS-CoV-2 Spike Protein Variants. PLoS Comput. Biol. 2021;17(8):e1009286. doi: 10.1371/journal.pcbi.1009286. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Kodchakorn K., Kongtawelert P.. Molecular Dynamics Study on the Strengthening Behavior of Delta and Omicron SARS-CoV-2 Spike RBD Improved Receptor-Binding Affinity. PLoS One. 2022;17(11):e0277745. doi: 10.1371/journal.pone.0277745. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Peng C., Zhu Z., Shi Y., Wang X., Mu K., Yang Y., Zhang X., Xu Z., Zhu W.. Computational Insights into the Conformational Accessibility and Binding Strength of SARS-CoV-2 Spike Protein to Human Angiotensin-Converting Enzyme 2. J. Phys. Chem. Lett. 2020;11(24):10482–10488. doi: 10.1021/acs.jpclett.0c02958. [DOI] [PubMed] [Google Scholar]
  40. Ferreira J. C., Villanueva A. J., Al Adem K., Fadl S., Alzyoud L., Ghattas M. A., Rabeh W. M.. Identification of Novel Allosteric Sites of SARS-CoV-2 Papain-like Protease (PLpro) for the Development of COVID-19 Antivirals. J. Biol. Chem. 2024;300(11):107821. doi: 10.1016/j.jbc.2024.107821. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Zhu R., Canena D., Sikora M., Klausberger M., Seferovic H., Mehdipour A. R., Hain L., Laurent E., Monteil V., Wirnsberger G., Wieneke R., Tampé R., Kienzl N. F., Mach L., Mirazimi A., Oh Y. J., Penninger J. M., Hummer G., Hinterdorfer P.. Force-Tuned Avidity of Spike Variant-ACE2 Interactions Viewed on the Single-Molecule Level. Nat. Commun. 2022;13(1):1–17. doi: 10.1038/s41467-022-35641-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Zhang S., Krieger J. M., Zhang Y., Kaya C., Kaynak B., Mikulska-Ruminska K., Doruker P., Li H., Bahar I.. ProDy 2.0: Increased Scale and Scope after 10 Years of Protein Dynamics Modelling with Python. Bioinformatics. 2021;37(20):3657–3659. doi: 10.1093/bioinformatics/btab187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Gowthaman R., Guest J. D., Yin R., Adolf-Bryfogle J., Schief W. R., Pierce B. G.. CoV3D: A Database of High Resolution Coronavirus Protein Structures. Nucleic Acids Res. 2021;49(D1):D282–D287. doi: 10.1093/nar/gkaa731. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Berman H. M., Westbrook J., Feng Z., Gilliland G., Bhat T. N., Weissig H., Shindyalov I. N., Bourne P. E.. The Protein Data Bank. Nucleic Acids Res. 2000;28(1):235–242. doi: 10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. The UniProt Consortium. UniProt: The Universal Protein Knowledgebase in 2025. Nucleic Acids Res. 2025;53(D1):D609–D617. doi: 10.1093/nar/gkae1010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Mckinney, W. pandas: a Foundational Python Library for Data Analysis and Statistics; January 2011, accessed 01/17/2025. [Google Scholar]
  47. Edgar R. C.. MUSCLE: Multiple Sequence Alignment with High Accuracy and High Throughput. Nucleic Acids Res. 2004;32(5):1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Cock P. J. A., Antao T., Chang J. T., Chapman B. A., Cox C. J., Dalke A., Friedberg I., Hamelryck T., Kauff F., Wilczynski B., de Hoon M. J. L.. Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics. 2009;25(11):1422–1423. doi: 10.1093/bioinformatics/btp163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Virtanen P., Gommers R., Oliphant T. E., Haberland M., Reddy T., Cournapeau D., Burovski E., Peterson P., Weckesser W., Bright J., van der Walt S. J., Brett M., Wilson J., Millman K. J., Mayorov N., Nelson A. R. J., Jones E., Kern R., Larson E., Carey C. J., Polat İ., Feng Y., Moore E. W., VanderPlas J., Laxalde D., Perktold J., Cimrman R., Henriksen I., Quintero E. A., Harris C. R., Archibald A. M., Ribeiro A. H., Pedregosa F., van Mulbregt P.. et al. SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nat. Methods. 2020;17(3):261–272. doi: 10.1038/s41592-019-0686-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Olivos-Ramirez G. E., Cofas-Vargas L. F., Madl T., Poma A. B.. Conformational and Stability Analysis of SARS-CoV-2 Spike Protein Variants by Molecular Simulation. Pathogens. 2025;14(3):274. doi: 10.3390/pathogens14030274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Henderson R., Edwards R. J., Mansouri K., Janowska K., Stalls V., Gobeil S. M. C., Kopp M., Li D., Parks R., Hsu A. L., Borgnia M. J., Haynes B. F., Acharya P.. Controlling the SARS-CoV-2 Spike Glycoprotein Conformation. Nat. Struct. Mol. Biol. 2020;27(10):925–933. doi: 10.1038/s41594-020-0479-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Lee R. A., Razaz M., Hayward S.. The DynDom Database of Protein Domain Motions. Bioinformatics. 2003;19(10):1290–1291. doi: 10.1093/bioinformatics/btg137. [DOI] [PubMed] [Google Scholar]
  53. Christen M., Hünenberger P. H., Bakowies D., Baron R., Bürgi R., Geerke D. P., Heinz T. N., Kastenholz M. A., Kräutler V., Oostenbrink C., Peter C., Trzesniak D., van Gunsteren W. F.. The GROMOS Software for Biomolecular Simulation: GROMOS05. J. Comput. Chem. 2005;26(16):1719–1751. doi: 10.1002/jcc.20303. [DOI] [PubMed] [Google Scholar]
  54. Hess B.. Convergence of Sampling in Protein Simulations. Phys. Rev. E:Stat., Nonlinear, Soft Matter Phys. 2002;65(3):031910. doi: 10.1103/PhysRevE.65.031910. [DOI] [PubMed] [Google Scholar]
  55. Grant B. J., Skjaerven L., Yao X.-Q.. The Bio3D Packages for Structural Bioinformatics. Protein Sci. 2021;30(1):20–30. doi: 10.1002/pro.3923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Hinsen K., Petrescu A.-J., Dellerue S., Bellissent-Funel M.-C., Kneller G. R.. Harmonicity in Slow Protein Dynamics. Chem. Phys. 2000;261(1–2):25–37. doi: 10.1016/S0301-0104(00)00222-6. [DOI] [Google Scholar]
  57. Salomon-Ferrer R., Case D. A., Walker R. C.. An Overview of the Amber Biomolecular Simulation Package. Wiley Interdiscip. Rev.: Comput. Mol. Sci. 2013;3(2):198–210. doi: 10.1002/wcms.1121. [DOI] [Google Scholar]
  58. Costa M. G. S., Batista P. R., Bisch P. M., Perahia D.. Exploring Free Energy Landscapes of Large Conformational Changes: Molecular Dynamics with Excited Normal Modes. J. Chem. Theory Comput. 2015;11:2755–2767. doi: 10.1021/acs.jctc.5b00003. [DOI] [PubMed] [Google Scholar]
  59. Costa M. G. S., Batista P. R., Gomes A., Bastos L. S., Louet M., Floquet N., Bisch P. M., Perahia D.. MDexciteR: Enhanced Sampling Molecular Dynamics by Excited Normal Modes or Principal Components Obtained from Experiments. J. Chem. Theory Comput. 2023;19:412. doi: 10.1021/acs.jctc.2c00599. [DOI] [PubMed] [Google Scholar]
  60. Phillips J. C., Hardy D. J., Maia J. D. C., Stone J. E., Ribeiro J. V., Bernardi R. C., Buch R., Fiorin G., Hénin J., Jiang W., McGreevy R., Melo M. C. R., Radak B. K., Skeel R. D., Singharoy A., Wang Y., Roux B., Aksimentiev A., Luthey-Schulten Z., Kalé L. V., Schulten K., Chipot C., Tajkhorshid E.. Scalable Molecular Dynamics on CPU and GPU Architectures with NAMD. J. Chem. Phys. 2020;153(4):044130. doi: 10.1063/5.0014475. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Humphrey W., Dalke A., Schulten K.. VMD: Visual Molecular Dynamics. J. Mol. Graphics. 1996;14(1):33–38. doi: 10.1016/0263-7855(96)00018-5. [DOI] [PubMed] [Google Scholar]
  62. Schrodinger, LLC . PyMOL; Schrodinger, LLC, 2010. [Google Scholar]
  63. Hunter J. D.. Matplotlib: A 2D Graphics Environment. Comput. Sci. Eng. 2007;9(3):90–95. doi: 10.1109/MCSE.2007.55. [DOI] [Google Scholar]
  64. Waskom M.. Seaborn: Statistical Data Visualization. J. Open Source Softw. 2021;6(60):3021. doi: 10.21105/joss.03021. [DOI] [Google Scholar]
  65. Hicks J., Klumpp-Thomas C., Kalish H., Shunmugavel A., Mehalko J., Denson J.-P., Snead K. R., Drew M., Corbett K. S., Graham B. S., Hall M. D., Memoli M. J., Esposito D., Sadtler K.. Serologic Cross-Reactivity of SARS-CoV-2 with Endemic and Seasonal Betacoronaviruses. J. Clin. Immunol. 2021;41(5):906–913. doi: 10.1007/s10875-021-00997-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Kaur N., Singh R., Dar Z., Bijarnia R. K., Dhingra N., Kaur T.. Genetic Comparison among Various Coronavirus Strains for the Identification of Potential Vaccine Targets of SARS-CoV2. Infect., Genet. Evol. 2021;89:104490. doi: 10.1016/j.meegid.2020.104490. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Wang C., Hesketh E. L., Shamorkina T. M., Li W., Franken P. J., Drabek D., van Haperen R., Townend S., van Kuppeveld F. J. M., Grosveld F., Ranson N. A., Snijder J., de Groot R. J., Hurdiss D. L., Bosch B.-J.. Antigenic Structure of the Human Coronavirus OC43 Spike Reveals Exposed and Occluded Neutralizing Epitopes. Nat. Commun. 2022;13(1):1–15. doi: 10.1038/s41467-022-30658-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Pronker M. F., Creutznacher R., Drulyte I., Hulswit R. J. G., Li Z., van Kuppeveld F. J. M., Snijder J., Lang Y., Bosch B.-J., Boons G.-J., Frank M., de Groot R. J., Hurdiss D. L.. Sialoglycan Binding Triggers Spike Opening in a Human Coronavirus. Nature. 2023;624(7990):201–206. doi: 10.1038/s41586-023-06599-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Melero R., Sorzano C. O. S., Foster B., Vilas J.-L., Martínez M., Marabini R., Ramírez-Aportela E., Sanchez-Garcia R., Herreros D., del Caño L., Losana P., Fonseca-Reyna Y. C., Conesa P., Wrapp D., Chacon P., McLellan J. S., Tagare H. D., Carazo J.-M.. Continuous Flexibility Analysis of SARS-CoV-2 Spike Prefusion Structures. IUCrJ. 2020;7(6):1059–1069. doi: 10.1107/S2052252520012725. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Batista P. R., Robert C. H., Maréchal J. D., Hamida-Rebaï M. B., Pascutti P. G., Bisch P. M., Perahia D.. Consensus Modes, a Robust Description of Protein Collective Motions from Multiple-Minima Normal Mode Analysisapplication to the HIV-1 Protease. Phys. Chem. Chem. Phys. 2010;12(12):2850–2859. doi: 10.1039/B919148H. [DOI] [PubMed] [Google Scholar]
  71. Batista P. R., Pandey G., Pascutti P. G., Bisch P. M., Perahia D., Robert C. H.. Free Energy Profiles along Consensus Normal Modes Provide Insight into HIV-1 Protease Flap Opening. J. Chem. Theory Comput. 2011;7:2348–2352. doi: 10.1021/ct200237u. [DOI] [PubMed] [Google Scholar]
  72. Skjærven L., Yao X.-Q., Scarabelli G., Grant B. J.. Integrating Protein Structural Dynamics and Evolutionary Analysis with Bio3D. BMC Bioinf. 2014;15(1):399. doi: 10.1186/s12859-014-0399-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Zhang S., Li H., Krieger J. M., Bahar I.. Shared Signature Dynamics Tempered by Local Fluctuations Enables Fold Adaptability and Specificity. Mol. Biol. Evol. 2019;36(9):2053–2068. doi: 10.1093/molbev/msz102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Wrapp D., Wang N., Corbett K. S., Goldsmith J. A., Hsieh C.-L., Abiona O., Graham B. S., McLellan J. S.. Cryo-EM Structure of the 2019-nCoV Spike in the Prefusion Conformation. Science. 2020;367:1260. doi: 10.1126/science.abb2507. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Korber B., Fischer W. M., Gnanakaran S., Yoon H., Theiler J., Abfalterer W., Hengartner N., Giorgi E. E., Bhattacharya T., Foley B., Hastie K. M., Parker M. D., Partridge D. G., Evans C. M., Freeman T. M., de Silva T. I., McDanal C., Perez L. G., Tang H., Moon-Walker A., Whelan S. P., LaBranche C. C., Saphire E. O., Montefiori D. C., Angyal A.. et al. Tracking Changes in SARS-CoV-2 Spike: Evidence That D614G Increases Infectivity of the COVID-19 Virus. Cell. 2020;182(4):812–827e19. doi: 10.1016/j.cell.2020.06.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Yurkovetskiy L., Wang X., Pascal K. E., Tomkins-Tinch C., Nyalile T. P., Wang Y., Baum A., Diehl W. E., Dauphin A., Carbone C., Veinotte K., Egri S. B., Schaffner S. F., Lemieux J. E., Munro J. B., Rafique A., Barve A., Sabeti P. C., Kyratsous C. A., Dudkina N. V., Shen K., Luban J.. Structural and Functional Analysis of the D614G SARS-CoV-2 Spike Protein Variant. Cell. 2020;183(3):739–751e8. doi: 10.1016/j.cell.2020.09.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Benton D. J., Wrobel A. G., Roustan C., Borg A., Xu P., Martin S. R., Rosenthal P. B., Skehel J. J., Gamblin S. J.. The Effect of the D614G Substitution on the Structure of the Spike Glycoprotein of SARS-CoV-2. Proc. Natl. Acad. Sci. U.S.A. 2021;118(9):e2022586118. doi: 10.1073/pnas.2022586118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Yang T.-J., Yu P.-Y., Chang Y.-C., Hsu S.-T. D.. D614G Mutation in the SARS-CoV-2 Spike Protein Enhances Viral Fitness by Desensitizing It to Temperature-Dependent Denaturation. J. Biol. Chem. 2021;297(4):101238. doi: 10.1016/j.jbc.2021.101238. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Gobeil S. M.-C., Janowska K., McDowell S., Mansouri K., Parks R., Manne K., Stalls V., Kopp M. F., Henderson R., Edwards R. J., Haynes B. F., Acharya P.. D614G Mutation Alters SARS-CoV-2 Spike Conformation and Enhances Protease Cleavage at the S1/S2 Junction. Cell Rep. 2021;34(2):108630. doi: 10.1016/j.celrep.2020.108630. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Mäkelä A. R., Uğurlu H., Hannula L., Kant R., Salminen P., Fagerlund R., Mäki S., Haveri A., Strandin T., Kareinen L., Hepojoki J., Kuivanen S., Levanov L., Pasternack A., Naves R. A., Ritvos O., Österlund P., Sironen T., Vapalahti O., Kipar A., Huiskonen J. T., Rissanen I., Saksela K.. Intranasal Trimeric Sherpabody Inhibits SARS-CoV-2 Including Recent Immunoevasive Omicron Subvariants. Nat. Commun. 2023;14(1):1637. doi: 10.1038/s41467-023-37290-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Zhou T., Tsybovsky Y., Gorman J., Rapp M., Cerutti G., Chuang G.-Y., Katsamba P. S., Sampson J. M., Schön A., Bimela J., Boyington J. C., Nazzari A., Olia A. S., Shi W., Sastry M., Stephens T., Stuckey J., Teng I.-T., Wang P., Wang S., Zhang B., Friesner R. A., Ho D. D., Mascola J. R., Shapiro L., Kwong P. D.. Cryo-EM Structures of SARS-CoV-2 Spike without and with ACE2 Reveal a pH-Dependent Switch to Mediate Endosomal Positioning of Receptor-Binding Domains. Cell Host Microbe. 2020;28(6):867–879e5. doi: 10.1016/j.chom.2020.11.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Punjani A., Fleet D. J.. 3DFlex: Determining Structure and Motion of Flexible Proteins from Cryo-EM. Nat. Methods. 2023;20(6):860–870. doi: 10.1038/s41592-023-01853-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Kinman L. F., Powell B. M., Zhong E. D., Berger B., Davis J. H.. Uncovering Structural Ensembles from Single-Particle Cryo-EM Data Using cryoDRGN. Nat. Protoc. 2023;18(2):319–339. doi: 10.1038/s41596-022-00763-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Punjani A., Fleet D. J.. 3D variability analysis: Resolving Continuous Flexibility and Discrete Heterogeneity from Single Particle Cryo-EM. J. Struct. Biol. 2021;213(2):107702. doi: 10.1016/j.jsb.2021.107702. [DOI] [PubMed] [Google Scholar]
  85. Kimanius D., Schwab J.. Confronting Heterogeneity in Cryogenic Electron Microscopy Data: Innovative Strategies and Future Perspectives with Data-Driven Methods. Curr. Opin. Struct. Biol. 2024;86:102815. doi: 10.1016/j.sbi.2024.102815. [DOI] [PubMed] [Google Scholar]
  86. Herreros D., Mata C. P., Noddings C., Irene D., Krieger J., Agard D. A., Tsai M.-D., Sorzano C. O. S., Carazo J. M.. Real-Space Heterogeneous Reconstruction, Refinement, and Disentanglement of CryoEM Conformational States with HetSIREN. bioRxiv. 2024:10.1101/2024.09.16.613176. doi: 10.1101/2024.09.16.613176. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Zhao Z., Zhou J., Tian M., Huang M., Liu S., Xie Y., Han P., Bai C., Han P., Zheng A., Fu L., Gao Y., Peng Q., Li Y., Chai Y., Zhang Z., Zhao X., Song H., Qi J., Wang Q., Wang P., Gao G. F.. Omicron SARS-CoV-2 Mutations Stabilize Spike up-RBD Conformation and Lead to a Non-RBM-Binding Monoclonal Antibody Escape. Nat. Commun. 2022;13(1):4958. doi: 10.1038/s41467-022-32665-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Hong Q., Han W., Li J., Xu S., Wang Y., Li Z., Wang Y., Zhang C., Huang Z., Cong Y.. Molecular Basis of SARS-CoV-2 Omicron Variant Receptor Engagement and Antibody Evasion and Neutralization. bioRxiv. 2022:10.1101/2022.01.10.475532. doi: 10.1101/2022.01.10.475532. [DOI] [Google Scholar]
  89. Lee M., Major M., Hong H.. Distinct Conformations of SARS-CoV-2 Omicron Spike Protein and Its Interaction with ACE2 and Antibody. Int. J. Mol. Sci. 2023;24(4):3774. doi: 10.3390/ijms24043774. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Chen Y., Zhao X., Zhou H., Zhu H., Jiang S., Wang P.. Broadly Neutralizing Antibodies to SARS-CoV-2 and Other Human Coronaviruses. Nat. Rev. Immunol. 2023;23(3):189–199. doi: 10.1038/s41577-022-00784-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Barnes C. O., Jette C. A., Abernathy M. E., Dam K.-M. A., Esswein S. R., Gristick H. B., Malyutin A. G., Sharaf N. G., Huey-Tubman K. E., Lee Y. E., Robbiani D. F., Nussenzweig M. C., West A. P. Jr, Bjorkman P. J.. SARS-CoV-2 Neutralizing Antibody Structures Inform Therapeutic Strategies. Nature. 2020;588(7839):682–687. doi: 10.1038/s41586-020-2852-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  92. Valiente P. A., Nim S., Lee J., Kim S., Kim P. M.. Targeting the Receptor-Binding Motif of SARS-CoV-2 with D-Peptides Mimicking the ACE2 Binding Helix: Lessons for Inhibiting Omicron and Future Variants of Concern. J. Chem. Inf. Model. 2022;62(15):3618–3626. doi: 10.1021/acs.jcim.2c00500. [DOI] [PubMed] [Google Scholar]
  93. Kaynak B. T., Krieger J. M., Dudas B., Dahmani Z. L., Costa M. G. S., Balog E., Scott A. L., Doruker P., Perahia D., Bahar I.. Sampling of Protein Conformational Space Using Hybrid Simulations: A Critical Assessment of Recent Methods. Front. Mol. Biosci. 2022;9:832847. doi: 10.3389/fmolb.2022.832847. [DOI] [PMC free article] [PubMed] [Google Scholar]
  94. Krieger J. M., Doruker P., Scott A. L., Perahia D., Bahar I.. Towards Gaining Sight of Multiscale Events: Utilizing Network Models and Normal Modes in Hybrid Methods. Curr. Opin. Struct. Biol. 2020;64:34–41. doi: 10.1016/j.sbi.2020.05.013. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

ci5c00990_si_001.pdf (13.1MB, pdf)
ci5c00990_si_002.zip (112.7MB, zip)
ci5c00990_si_003.xlsx (201.9KB, xlsx)

Data Availability Statement

The scripts used to download, process, and build the structural ensemble of SARS-CoV-2 spike glycoprotein structurescomprising 72 conformations derived from the 20 most representative PDB cluster entriesare available at: https://github.com/yago52/tutorial_spike. This repository also includes the full ensemble of SARS-CoV-2 spike structures, as well as scripts for analysis, plotting, and dynamical network analysis. Additionally, the structures, configuration files, and input scripts required to perform the MDeNM simulations are provided and can be accessed directly at: https://github.com/yago52/tutorial_spike/blob/main/mdenm_spike.tar.gz. Two 10 μs molecular dynamics trajectories of the trimeric SARS-CoV-2 spike protein, generated by D. E. Shaw Research, were obtained from the COVID-19 Molecular Structure and Therapeutics Hub (https://covid.molssi.org/org-contributions/) and are available under the data set codes DESRES-ANTON-11021566 and DESRES-ANTON-11021571.


Articles from Journal of Chemical Information and Modeling are provided here courtesy of American Chemical Society

RESOURCES