Skip to main content
PLOS One logoLink to PLOS One
. 2021 May 5;16(5):e0247147. doi: 10.1371/journal.pone.0247147

Bridging between material properties of proteins and the underlying molecular interactions

Guang Song 1,2,*
Editor: Claudio M Soares3
PMCID: PMC8099097  PMID: 33951045

Abstract

In this work, we develop a novel method that bridges between material properties of proteins, particularly the modulus of elasticity, and the underlying molecular interactions. To this end, we employ both an all-atom normal mode analysis (NMA) model with the CHARMM force field and an elastic solid model for proteins and protein interfaces. And the “bridge” between the two models is a common physical property predictable by both models: the magnitude of thermal vibrations. This connection allows one to calibrate the Young’s moduli of proteins and protein interface regions. We find that the Young’s moduli of proteins are in the range of a few Gpa to 10 Gpa, while the Young’s moduli of the interface regions are several times smaller. The work is significant as it represents the first attempt to systematically compute the elastic moduli of proteins from molecular interactions.

1 Introduction

Solid is one of the fundamental states of matter. Atoms in solids are packed tightly and kept together by physical interactions, such as ionic bonds (as in sodium chloride), covalent bonds (as in diamond), metalic bonds (as in metals), hydrogen bonds (as in ice), or van der Waals interactions (as in organic compounds) [1]. Solids held together mostly by van der Waals interactions are called van der Waals solids.

The type of force that keeps a solid together determines its material properties, such as elasticity. Solids held together by metalic, ionic, or covalent bonds appear stiff and have a much higher elastic moduli than solids held together mostly by non-bonded interactions such as proteins [2].

The elasticity of proteins has been studied in three primary ways, namely, protein fibers or fibrils [3], protein crystals [4], and protein capsids [5].

Among protein fibers, some exhibit high extensibility and have a very low Young’s modulus, on the order of only 1–10 MPa. These fibers usually contain unstructured regions or high mobility motifs, undergo α-helix to β-strand transitions, or even unfold under external forces, and consequently they appear very soft (1-10 Mpa) [3]. Other protein fibers are significantly different as they “form highly regular, nearly crystalline arrangement of monomer units without regions that can extend, change secondary structure or unfold” [3] and thus appear much stiffer. Experimental studies showed these fibers, such as actin [6], tubulin, collagen [2], and keratin [7], fibrin [3], β-lactoglobulin fibrils [8], had an elastic modulus in the range of a few Gpa. The Young’s moduli of these fiber proteins were obtained through direct measurement of force under stretching, mostly using atomic force spectroscopy (AFM) [8, 9].

The mechanical properties of proteins were studied also using protein crystals, mostly the crystals of hen egg-white lysozyme (HEWL), a model protein. Table 1 summarizes the techniques used, proteins studied, and results obtained. It lists also results from the compressibility studies done in solvent [10] and on a crystal structure solved at high pressure (1,000 bar) [11]. The results show a large variation, from less than 1 Gpa to over 10 Gpa. Compressibility study by Gavish et al. [10] showed that under most conditions, the adiabatic compressibilities of lysozyme, hemoglobin, and myoglobin in solution were on the order of 10-20% of the compressibility of water, or 5-7 × 10−11 m2/N. Consequently, the bulk moduli of these proteins, which are the inverse of their compressibilities, are over 10 Gpa. (The bulk modulus or volume modulus should not be confused with Young’s modulus or tensile modulus, though they tend to have similar values for most solids.) Kundrot and Richards [11] found that the isothermal compressibility of lysozyme proteins was 4.7 10−3 kbar−1, or 20 Gpa in bulk modulus. On the other hand, the Young’s moduli obtained from protein crystals using techniques such as vibration, indentation, ultrasound, or Brillouin scattering are much smaller, from 0.3 Gpa to 5.5 Gpa (see Table 1). How to understand this discrepancy? It is possible that the larger values obtained from compressibility studies represent the elastic modulus of proteins themselves, while the elastic moduli of protein crystals probably represent the moduli of both proteins and protein interface regions, as well as other noncrystalline elements in the crystal cell such as intracrystalline liquid [4]. This would reconcile the seemingly conflicting results on elastic moduli of proteins in the literature.

Table 1. A summary of studies on the elasticity of proteins using protein crystals (top half), or compressibility measurements (bottom half).

Techniques Proteins Young’s modulus References
vibrating reed lysozyme 0.3-1.5 Gpa [1214]
indentation lysozyme 0.49-4.2 Gpa [15]
ultrasound lysozyme 4.87-5.5 Gpa [16, 17]
brillouin scattering lysozyme 5.5 Gpa [18]
bulk modulus
ultrasound in solution lysozyme, Mb, Hb >10 Gpa [10]
X-ray at high pressure lysozyme >20 Gpa [11]

The Young’s moduli of proteins measured experimentally as reviewed above depend on a number of factors, including conditions of the samples (such as pH in solvent [10] or water content in crystals: moistened or dried [15], etc.), and the frequency range at which a measurement was carried out [17], which varies from static to kilohertz [1214], to ultrasonic in the megahertz range [16, 17] and Brillouin light scattering method in the gigahertz range [18]. The elastic modulus of a protein is also temperature dependent. Morozov and Gevorkian analyzed that the mechanical properties of protein crystals at different temperatures and found that proteins became significantly more rigid below glass transition temperature, when the surface layer of proteins and their bound water became immobilized [4]. In this work, we also will look into the influence of temperature on a protein’s elastic modulus.

The elastic modulus of proteins was studied also through protein capsids by nano-indentation using atomic force microscopy (AFM). Nano-indentation allows one to measure the stiffness of viral capsids. The elastic modulus of the capsid material can then be deduced from the measured stiffness. However, in order to do so, in most cases a thin-shell model was assumed and thin-shell elasticity theory applied. The limitation of the thin shell model is that it neglects the molecular structure of the capsid and assumes a homogeneous material property throughout, as well as an idealized spherical geometry and uniform thickness [5]. Thick shell models combined with finite element analysis were also tried [19] and the effect of non-uniform geometry was investigated [20]. These work all represent a top-down approach since experimental determined stiffness values were used in all cases to fit the underlying elastic moduli of capsids. A review article in 2012 by Mateu [5] summarized the nano-indentation results of a dozen viral capsids: the stiffness of the viral capsids and the derived Young’s moduli. The review showed the Young’s moduli of capsid proteins varied over a span of more than one order of magnitude, from 0.14 GPa for CCMV to 1-3 GPa for MVM and Φ29 [5]. The range of variation in Young’s moduli seen in capsids is consistent with what is seen in protein crystals (Table 1, top half) but is much lower overall than what is obtained from compressibility studies (Table 1, bottom half), A possible explanation is that elastic moduli thus obtained represent the average moduli of both proteins and protein interface regions, while the elastic moduli obtained from compressibility studies represent those of individual proteins.

The Young’s moduli of globular and membrane proteins were also estimated theoretically or computationally [2, 6, 21, 22]. By approximating proteins as solid materials with sheets that interacted through van der Waals interactions, Howard estimated that the Young’s modulus of proteins to be around 4 Gpa (see Appendix 3.1 of [2]). In the case of F-actin, Bathe applied axial stretching of his elastic solid model for actin to match experimental stretching stiffness data [6] and found that the effective Young’s modulus of actin was 2.7 Gpa [21].

Contribution of this work. In this work, we develop a novel bottom-up approach for computing the elastic moduli of proteins from the underlying molecular interactions. Our approach utilizes both normal mode analysis (NMA) [2325], a well established technique for studying the fluctuating dynamics of macromolecules, and elastic solid models (ESM) [26] developed more recently that treat macromolecules as elastic solids with material properties such as Young’s modulus. Since both of these models can predict the magnitude of thermal vibrations of macromolecules, this commonality is used as a bridge to link material properties modeled in ESM [26] with molecular interactions used in NMA. To the best of our knowledge, This work represents the first attempt to determine the Young’s moduli of proteins and protein interface regions separately and systematically from all-atom molecular interactions. We find the Young’s modulus of proteins can be as high as 10 Gpa, while the Young’s modulus of protein interface regions is several times smaller. Our work reconciles the high modulus values found through compressibility studies and low values found in protein crystals or capsids. The large span of variations of elastic moduli at interface regions provides an explanation also for the similar extent of variations seen in protein crystals and protein capsids.

2 Methods

2.1 σESM, a molecular surface-based elastic solid model

Recently, we presented a novel elastic solid model called αESM[26] for macromolecules based on alpha shape [27]. The model has a parameter alpha which was chosen empirically.

Here we present a new extension of αESM whose major improvement over αESM is that the alpha value is now tuned individually for each protein so that the final elastic solid model of the protein has the same volume as its solvent excluded volume, such as that computed by the MSMS algorithm [28]. Another significant difference of the present ESM model from αESM is that in representing the solid, not only the atomic coordinates of the protein are used, which is the case for αESM, but also selected points on the protein’s molecular surface [21]. Consequently, the final solid model has a similar surface to the molecular surface of the protein. In this aspect the model is similar to Bathe’s elastic solid model [21]. We name this new variant of ESM σESM, since σ is a symbol commonly used for surface area or surface tension [29].

The procedural flow of σESM is given in Fig 1. The script takes as input xyz, which contains the coordinates of all the heavy atoms of a given protein structure that is available in its PDB file, and verts, faces, and volume, all three of which are outputs of the MSMS software [28] and represent respectively the vertices and faces of the solvent excluded surface mesh and its enclosed volume (line 1). The program first computes an initial alpha shape of the protein alone (line 11), which is used to set the number of faces that should be on the surface mesh (lines 12-13). It then simplifies the surface mesh produced by MSMS using MATLAB script reducepatch (line 15). The vertices on the reduced molecular surface are then appended to the coordinates of the atoms in the protein (line 16). Lastly, an alpha value is selected (line 18) so that the final alpha shape has the same volume as the the volume computed from MSMS [28]. Once the alpha value is determined, the script used for αESM[26] can be adopted to compute the stiffness matrix K and mass matrix M (line 19), from which eigenmodes and eigen frequencies are computed (Fig 1 in Ref. [26]).

Fig 1. The algorithmic flow of σESM as expressed in a MATLAB script.

Fig 1

A complete copy of the above MATLAB script is available at https://github.com/gsongISU/sigmaESMrelease. The scripts for computing stiffness matrix and mass matrix used in alphaESM are available at MATLAB file exchange (https://www.mathworks.com/matlabcentral/fileexchange/27826-fast-fem-assembly-nodal-elements), kindly contributed by Anjam and Valdman [30].

Fig 2 shows the αESM model of a protein (pdb-id: 1aqb) in red. The molecular surface, or solvent excluded surface, computed from the MSMS software [28] is shown in blue as a triangular surface mesh. The total volume of the αESM solid model (shown in red) is 16,600 Å3, while the solvent excluded volume is 23,720 Å3. Fig 2(B) shows the σESM model in cyan, which has an extra layer over the αESM model and resembles closely the molecular surface in Fig 2(A). Specifically, the volume of σESM model is 23,770 Å3, nearly the same as the protein’s solvent excluded volume.

Fig 2. The αESM model [26] (in red) using an alpha value of 3.6 Å.

Fig 2

The solvent excluded surface computed from MSMS [28] is shown in blue as a triangle mesh. (B) The σESM model: the difference between it and αESM (in red) is shown in cyan.

In summary, σESM is a more accurate representation than αESM in describing both molecular surface and and its enclosing volume.

The magnitude of thermal vibrations by σESM

Given a protein structure and its molecular surface mesh (which can be generated for instance by the MSMS software [28]), an application of the σESM script given in Fig 1 produces a stiffness (K) and a mass (M) matrix. The (global) stiffness matrix K is a 3M-by-3M matrix, where M is the number of nodes in the model, built with the element stiffness matrices of all the elements (tetrahedral in the current work). Each element stiffness matrix is a 12x12 matrix that can be written as:

k=[B]T[D][B]V, (1)

where B and D are standard matrices available in textbooks on finite element method [31] that depend solely on the coordinates of the four nodes of the tetrahedral element, and V is the volume of the tetrahedron. The unit of K is kcal/mol/Å2. For the mass matrix M used in finite element method, one has the option of a consistent mass matrix or a lump-sum mass matrix [31]. In this work, the lump-sum mass matrix is used, which is the same as the mass matrix used in NMA and is diagonal. (The consistent mass matrix, on the other hand, is not a diagonal matrix.) According to Ref. [26], the mean-square thermal vibrations of atom i by ESM is:

ΔRi2ESM=kBTEtrace([M-12Km-1M-12]ii), (2)

where the subscript ii represents the ith 3-by-3 diagonal block and Km is the mass-weighted stiffness matrix, i.e., Km = M−1/2 KM−1/2. E is the Young’s modulus of the protein solid.

The magnitude of thermal vibrations of a whole protein is defined as the mass-weighted average of ΔRi2ESM over all the nodes [32], i.e.,

ΔR2ESM=imiΔRi2ESMimi=kBTtrace(Km-1)Eimi, (3)

where i is the index of the nodes that include both protein atoms and added molecular surface nodes. mi are the mass of node i. Each surface node is given a mass of 1 atomic unit.

2.2 The sbNMA model

The sbNMA model developed by Na and Song [33] is an all-atom NMA model based on the CHARMM force field [34]. It was designed to maintain the high accuracy of classical NMA (cNMA) using all-atom force fields. By circumventing the cumbersome step of energy minimization, it can be applied directly to experimental structures. Extensive studies have demonstrated its accuracy [32, 33, 3537]. The entire sbNMA code is publicly available at https://github.com/htna/sbNMA-Matlab. Given a protein structure, an application of the sbNMA code (particularly the sbNMA_PSF.m script available at the above link) produces a Hessian matrix H and a mass matrix M that is diagonal (the sbNMA_PSF script actually produces the diagonal of the mass matrix as a vector, which can be easily converted to a mass matrix when needed). The Hessian matrix H is a 3N-by-3N matrix containing the second partial derivatives of the potential energy function with respect to the (x,y,z) coordinates of the N atoms in a protein. The unit of the Hessian matrix is kcal/mol/Å2.

Once the Hessian matrix H and the mass matrix M are obtained, the mean square fluctuation of atom i is [33]:

ΔRi2NMA=kBTtrace([M-12Hm-1M-12]ii), (4)

Thus,

ΔR2NMA=imiΔRi2NMAimi=kBTtrace(Hm-1)imi (5)

As will be shown later, the magnitude of thermal vibrations as computed from sbNMA in Eq (5) can be used to calibrate the Young’s moduli of proteins.

2.3 B-factors, static disorder, and glass transition

The Debye–Waller factor (DWF), or B-factor as it is called in protein X-ray crystallography, is a factor used to describe for each atom the degree to which electron density spreads out. The spread around the mean position of each atom i is generally modeled as a Gaussian function and the magnitude of the spread is characterized by ui2, the mean-square displacement from the mean position. The isotropic Debye-Waller factor, or B-factor, is related to ui2 as:

ui2=38π2Bi. (6)

The magnitude of mean-square displacement of a whole protein is defined as the mass-weighted averages of ui2 over all the atoms [32], i.e.,

u2=imiui2imi, (7)

where i is the atom index.

2.4 Determine the Young’s moduli of proteins based on molecular interactions

sbNMA is an accurate NMA model for predicting the magnitude of thermal vibrations. However, like all NMA models and elastic network models, one major limitation of sbNMA is that it assumes harmonicity. Thus the predicted magnitude by sbNMA represents only a lower bound of the actual magnitude of fluctuations of proteins that may contain also anharmonic motions, especially at above Tg, the temperature of glass transition [38].

The magnitude of thermal vibrations can be obtained also experimentally. X-ray crystallography that is widely used in protein structure determination produces such magnitude information in atomic displacement parameters that are commonly known as B-factors. Since B-factors contain also a significant amount of static disorder [32], the magnitude of mean-square displacement obtained from B-factors thus represents an upper bound of the actual magnitude of fluctuations of proteins.

A possible third alternative for obtaining the magnitude of thermal fluctuations of proteins is molecular dynamics (MD) simulation [39, 40], which is able to take into account anharmonic motions and is unaffected by static disorder. However, it suffers from the problem of insufficient sampling of the conformation space. Consequently, the actual magnitude of thermal fluctuations is difficult to obtain using MD.

The Young’s modulus represents a material property of a solid. By representing a protein using both a molecular model (as in NMA) and an elastic solid model as in σESM, we can link material properties with molecular interactions. Particularly, we can deduce a protein’s Young’s modulus by requiring the magnitude of thermal vibrations predicted by σESM in Eq (3) to be the same as the magnitude obtained from sbNMA in Eq (5) or the magnitude of mean-square displacement from B-factors in Eq (7). Since the magnitudes obtained from sbNMA and B-factors represent the lower and upper bounds of the actual magnitude respectively, the Young’s moduli thus deduced represent the upper and lower bounds of the actual Young’s moduli. Specifically, by requiring 〈ΔR2ESM = 〈ΔR2NMA (Eqs (3) and (5)), we have:

Eupper=trace(Km-1)/mtotESMtrace(Hm-1)/mtotNMA, (8)

where mtotESM and mtotNMA are the total masses in σESM and NMA, respectively. And similarly, by requiring 〈ΔR2ESM = 〈u2〉 (Eqs (3) and (7)), we have:

Elower=trace(Km-1)/mtotESM(imiui2)/mtotBfac, (9)

where mtotBfac is the total mass of all the atoms with B-factors, which usually are the heavy atoms.

2.5 Quaternary structures based on crystal contacts

For each protein chain p in the dataset, we determine the all the other chains in its unit cell based on the space group and symmetry matrices given in its PDB file. Additionally we find all the neighboring cells to the current cell in all directions: altogether 27-1 = 26 neighboring cells are considered. Of all the chains in these cells, we identify the one that is the closest to p and denote this chain as q. p and q form a crystal contact and a quaternary structure. (In the case when the selected q has a very small contact surface with p, a different chain q is used instead.) The quaternary structures are saved in PDB files as xxxx2.pdb, where xxxx is the pdb-id, and are available at https://github.com/gsongISU/sigmaESMrelease, under folder pdbDataset. In the following, we explain how to compute the Young’s modulus of the interface region of a quaternary structure.

2.6 σESM of the quaternary structures

σESM is readily applicable to the aforementioned quaternary structures in the same fashion as to regular tertiary structures, and as a result, an elastic solid model is produced for each quaternary structure (see Fig 3 of structure 2pwa2.pdb). The nodes in the model consists of heavy atoms of two protein chains (regions in red) in the quaternary structure and the surface nodes (in cyan) given by the MSMS software [28]. The elements in the solid model are tetrahedra generated by alpha shape [27] (see Fig 1). Given the elastic solid model of a quaternary structure, the interface region, which is in blue in Fig 3, is identified as follows.

Fig 3. Models of a quaternary structure.

Fig 3

(A) An elastic solid model of a quaternary structure (2pwa2.pdb). The pdb quaternary structure is available at https://github.com/gsongISU/sigmaESMrelease, under folder pdbDataset. (B) the all-atomic molecular model of the same structure. The image is generated by Rasmol [41].

First, all the tetrahedral elements formed by atoms from chain p or surface nodes but not from chain q are identified. Let all the nodes of these elements be nodes1. nodes1 consists of all the atoms of chain p and some surface nodes, but no atoms from chain q. Similarly, compute nodes2 for the second chain q. Next, remove any common surface nodes shared by nodes1 and nodes2 from them. As a result, the intersection of nodes1 and nodes2 is an empty set. Finally, identify all the tetrahedral elements formed solely by nodes1 or nodes2. The remaining elements are identified as the interface region (in blue).

2.7 Computing the Young’s modulus of the interface region

Next, we compute the Young’s modulus of the interface region of the quaternary structure. This allows us to assess how “soft” or “firm” the region is. Again, our method to determine the material property of the interface region is to link it with the underlying molecular interactions using σESM and sbNMA, by requiring the two models to give the same magnitude of thermal vibrations.

Fig 3(B) shows the molecular structure model of the same quaternary structure after protonation using psfgen from VMD [42]. We apply sbNMA to compute the magnitude of thermal vibration using Eq (5). To focus on the interface region and to speed up the computations, we set the two proteins as rigid, i.e., the internal degrees of freedom within each protein chain is left out, by employing a projection matrix as was done in the RTB model [43]. The magnitude of thermal vibration of the whole structure (Fig 3(B)) is thus dictated solely by inter-protein interactions. Similarly, when applying σESM to the solid model in Fig 3(A), the two protein regions, which are in red, are set as rigid and only the interface region (in blue) is flexible. By requiring the two models to give the same magnitude of thermal vibration, we can determine the Young’s modulus of the interface region, similar to what is done in Eq (8). The entire program for computing the Young’s modulus of the interface region is given at https://github.com/gsongISU/sigmaESMrelease. The program uses σESM and sbNMA and a matrix projection module. All of them are available at the above website.

It is worth noting that only non-bonded interactions (possibly also disulfide bonds) are present at the interface region. Therefore it is expected that the Young’s modulus of the interface region should be smaller than that of a protein itself, since the latter contains also bonded interactions. Also, quaternary structures constructed from crystal contacts as described above, especially after protonation, may contain steric clashing in the interface region. For atom pairs that are too close to each other (i.e., closer than their equilibrium separation distance which is the sum of their van der Waals radii), we use their equilibrium separation distances as their separation distances when computing the potential energy at the interface.

3 Results

3.1 The choice of protein dataset

In our recent work [32], we have shown that the percentage of thermal vibrations in B-factors is small in most proteins but is significantly large (50% or higher) for structures determined at both high resolution (1.2 Å or higher) and high temperature (that is, room temperature). Unfortunately such structures are rare in the PDB, as most structures in the PDB [44] are determined at cryogenic temperature to reduce the radiation damage [45].

Based on this premise, we select a group of 18 structures determined at high resolution (1.1 Å or high) and at room temperature. This set of structures are taken from a list of 1522 structures used in our previous work [32] that was originally generated by PDB_SELECT [46, 47].

3.2 Upper bound and lower bound of thermal fluctuations in proteins

There are a few universal properties of globular proteins that are intriguing. First, it was established that the vibrational spectra of globular proteins, once properly normalized, follow a universal curve [35, 48]. It was shown later that this universality in vibrational spectrum held true also for protein capsids [36]. Second, it was shown in one of our recent work that a direct corollary of the universality in vibrational spectrum among globular proteins is that the magnitudes of their thermal vibrations are nearly universal, having a narrow distribution that peaks at 0.093 Å2 at 100 K [32]. In obtaining this value, a harmonic potential is assumed. The harmonicity assumption is reasonable for thermal vibrations at or below 100 K. At higher temperature, particularly at above the glass transition temperature of proteins which is about 200-220 K, proteins undergo “glass transition” and become much more flexible [38]. As a result, the magnitude of mean-square displacements grows super-linearly. Therefore, the magnitude of thermal vibrations of globular proteins at 300 K is at least three times as high as 0.093 Å2, or 0.279 Å2. In other words, the magnitude of thermal vibrations computed from sbNMA (which assumes harmonicity) represents a lower bound of the actual magnitude at room temperature.

On the other hand, the magnitudes of thermal fluctuations of globular proteins are available in crystallographic B-factors, though in which unfortunately a large amount of static disorder co-resides. Because of static disorder, B-factors do not represent the actual magnitudes of thermal fluctuations but rather an upper bound.

The aforementioned upper and lower bounds define a range of the actual magnitudes of thermal vibrations of globular proteins. The range, though not as ideal as a single definite value, gives us a good sense of the magnitude.

Fig 4 shows the upper and lower bounds of the magnitudes of thermal fluctuations of the 18 structures in the dataset. For three out of the 18 structures, the lower bound obtained from the sbNMA computation is slightly higher than the upper bound obtained from B-factors. This could be due to the imperfection of sbNMA or uncertainties in B-factors, or both. However, for most proteins, the over trend is that the upper bound is about 1.6-1.7 times higher than the lower bound. The mean lower and upper bounds of the proteins shown in Fig 4 are 0.30 and 0.51 Å2, respectively.

Fig 4. The magnitudes of thermal fluctuations obtained from B-factors (in red) and from sbNMA computations (in blue), which form the upper and lower bounds of the magnitudes of thermal fluctuations of the 18 structures in the dataset, listed by their sizes in the ascending order.

Fig 4

For three out of the 18 structures, the lower bound is slightly higher than the upper bound.

3.3 Glass transition

It is known that proteins undergo glass transition at Tg, the glass transition temperature, which is about 200-220 K for proteins [38]. At above Tg, proteins become much more flexible. Past work indicated that “the glass transition in hydrated samples is located in the surface layer of proteins and related to the (im)mobilization of the protein groups and strongly bound water.” [4]. As a result, the magnitude of mean-square displacements grows super-linearly with temperature.

Fig 5 presents the magnitudes of thermal vibrations as computed from sbNMA [32, 33] of the 18 proteins in the dataset, as well as their mean-square displacements from crystallographic B-factors. The blue line represents the linear growth of mean-square displacements as a function of temperature should there be no glass transition. The red line in Fig 5 represents a fitting to 〈u2〉 (of B-factors). A clear picture of glass transition emerges.

Fig 5. The magnitude of thermal fluctuations as a function of temperature.

Fig 5

Tg denotes the temperature of glass transition that is 200-220K [38]. The magnitudes computed from sbNMA at 100K are marked by black crosses, while the magnitudes obtained from B-factors by black circles. The blue line represents the universal trend of the magnitude of harmonic vibrations by sbNMA with CHARMM force field, having a nearly universal value around 0.093 Å2 at 100 K for all proteins [32], and 0.279 Å2 at 300 K. At above Tg, anharmonic motions kick in and the magnitude of total thermal fluctuations increases superlinearly, as marked by the red line that is fitted to the black circles.

3.4 The range of Young’s modulus

Fig 6 shows the range of Young’s moduli computed from σESM and calibrated with sbNMA or B-factors. The median Young’s modulus when fitting to B-factors is 6.03 Gpa, and 10.6 Gpa when fitting to sbNMA. sbNMA does not consider glass transition but only harmonic motions. Thus, the Young’s moduli calibrated with sbNMA can be thought of as the Young’s moduli of proteins at temperatures below Tg, while the Young’s moduli fit to B-factors can be thought of as Young’s moduli at the room temperature.

Fig 6. A bar plot of the Young’s moduli (E) calibrated with sbNMA (in blue) and those with crystallographic B-factors (in red), for the 18 proteins in the dataset (listed by their sizes in the ascending order).

Fig 6

The blue bars can be interpreted also the Young’s moduli at temperatures below Tg, while the red bars Young’s moduli at the room temperature.

3.5 Estimating the Young’s modulus of the interface region using sbNMA and CHARMM force field

The interface region is kept together mostly by non-bonded electrostatic and van der Waals interactions. Electrostatic interactions are difficult to include in normal mode analysis as they introduce negative spring constants [33]. Our past studies showed that van der Waals interactions provided a greater contribution than the electrostatic interactions (Fig 1 in Ref. [33]) and a model without electrostatic interactions could still maintain most of the accuracy of the classical NMA [33]. For these reasons, our current model for the interactions at the interface includes only the van der Waals interactions. It is planned that a future release of the model should improve this deficiency. The van der Waals potential employed in the CHARMM force field is a 6-12 Leonard-Jones potential that takes the following form:

VvdW=ϵ((r0r)12-2(r0r)6). (10)

The spring constant due to the van der Waals potential is:

kvdW=2VvdWr2=12ϵr2(13(r0r)12-7(r0r)6)=72ϵr02, (11)

the last step of which assumes r = r0.

Fig 7 shows the histogram distribution of the van der Waals parameter ϵ of the 50 CHARMM atom types in par_all36_prot.prm and the histogram distribution of kvdW(r0), which falls mostly in the range of 0.1 to 1 Kcal/mol/Å2. It is thus clear that kvdW, the spring constant due to van der Waals interactions, is much weaker than the spring constants for covalent bonds that are in the order of several hundreds Kcal/mol/Å2.

Fig 7. Histogram distributions of (A) the van der Waals parameter ϵ used in CHARMM and (B) of kvdW(r0), the spring constant of van der Waal interactions.

Fig 7

Since the equilibrium distance between a pair of atoms are in the range of 2.5 to 4.5 Å, The Young’s modulus of the interface region, Einter, is estimated to be on the order of kvdWr0, or 0.02 to 0.4 Kcal/mol/Å3 (or 0.1 to 3 Gpa, the converting factor between the two units is 6.9). This is 2 to 3 orders smaller than that of covalent solids or ionic solids, which is expected since spring constants derived from van der Waals interactions are 2 to 3 orders smaller than those of covalent bonds or ionic bonds.

3.6 Young’s moduli of the protein interface regions

Using the approach described in the Methods, we first generate quaternary structures of all the 18 proteins in the dataset. σESM (see Methods section for details) is then applied to compute the Young’s moduli of the interface regions (the MATLAB scripts used for this computation are available at https://github.com/gsongISU/sigmaESMrelease). The results are given in Table 2. It is seen that while the Young’s moduli of the proteins are a few Gpa, the Young’s moduli of interface regions are about an order smaller. Such a significant difference in stiffness between proteins and protein interfaces must be due to the underlying chain connectivity of proteins and the associated bonded interactions. Proteins can be considered as van der Waals solids with “steel enforcement” [49]. Also listed in Table 2 are the buried surface areas of interface regions of the quaternary structures. The buried surface area is the difference between the total surface area of individual proteins and that of the complex, both of which are computed using MSMS [28].

Table 2. The lower and upper bounds of Young’s moduli of the 18 proteins and the Young’s moduli of the interface regions of the corresponding quaternary structures artificially generated based on crystal contacts.

pdb-id residue # buried area [Å2] Einter Eintra (lower bound) Eintra (upper bound)
1p9g 41 75 0.09 12.71 12.10
1rb9 52 263 0.58 7.76 11.19
1iro 53 388 0.56 4.56 10.19
2igd 61 171 0.40 8.04 7.56
1aho 64 493 1.17 8.75 7.70
5tog 75 541 0.56 6.88 7.95
1ctj 89 352 0.96 3.29 8.23
1lwb 122 251 0.58 7.10 9.88
1c7k 132 81 0.36 5.45 13.17
3r87 132 837 0.83 5.48 11.91
6gz8 133 293 1.13 5.75 10.95
5vg0 142 181 0.23 5.53 11.57
4b9g 146 264 1.02 6.69 9.91
4ann 176 527 0.57 6.27 11.41
4qa8 210 401 0.91 3.51 9.99
6h40 220 183 0.57 4.83 7.89
2pwa 279 203 0.44 7.45 16.32
6gy5 285 103 0.15 5.80 13.79
median 132 263 0.57 6.03 10.57

The lower bounds are obtained by fitting σESM results to B-factors, while the higher bounds are obtained by fitting the σESM results to sbNMA. Also listed in the table are the sizes of the 18 proteins and the buried surface areas of the interface regions. The unit for all Young’s moduli is Gpa.

The above 18 quaternary structures are artificially generated based on crystal contacts. An insightful reviewer commented that interfaces thus identified tend to be fortuitous and recommended that real protein-protein interfaces existing naturally should also be used and a comparison be made. For this reason, we selected another 18 high resolution structures (1.0 Å or higher) from the PDB that are in the form of homodimers in the asymmetric unit. Their pdb-ids and sizes are given in Table 3, as well as the buried surface areas and the Young’s moduli of the interface regions.

Table 3. The buried surface areas and the Young’s moduli (Einter) of the natural interfaces found in a list of 18 high resolution structures from the PDB that are in the form of homodimers in the asymmetric unit.

pdb-id residue # buried surface area [Å2] Einter [Gpa]
4ynh 58 1,964 2.06
3rq9 84 900 1.60
4nds 94 1,279 2.26
2nmz 99 2,381 3.12
4unu 109 597 0.53
6j64 115 2,695 2.69
2xr4 116 1,987 1.89
4egu 118 2,256 2.02
2gud 121 2,997 3.50
4axo 137 2,615 2.09
5nld 138 1,058 1.15
5idb 142 1,390 1.89
2wyt 153 729 1.71
4a7v 153 704 1.50
5sy4 195 1,802 2.14
6rk0 214 5,313 3.77
3noq 229 3,327 2.17
4ypo 325 7,924 2.50
median 129 1,976 2.08

The buried surface areas and the Young’s moduli of these natural interfaces are distinctly higher than those of the artificial interfaces seen in Table 2.

To compare the Young’s moduli of protein interfaces of the naturally existing dimers (Table 3) with those created by crystal contacts (Table 2), we show in Fig 8 a scatter plot between the buried surface areas and the Young’s moduli of the interface regions, for both natural interfaces (red dots) and artificial interfaces created by crystal contacts (blue crosses). First, a significant correlation is seen between the buried surface areas and the Young’s moduli (correlation coefficient: 0.77). Secondly, The natural interfaces (red dots) have a much higher buried surface areas, though the two sets of proteins have similar sizes, and are much stiffer, having a median Young’s modulus of about 2.0 Gpa that is about 4 times as high as that of artificial interfaces. Thirdly, even for the natural interfaces, the Young’s moduli are still a few times lower than those of proteins. Since a clear distinction is seen between artificial interfaces and natural interfaces in Fig 8, a potential application of the present study is to use the interface’s Young’s modulus proposed in this work to assess the quality of a predicted protein interface.

Fig 8. A scatter plot between the buried surface areas and the Young’s moduli of the interface regions, for both artificial (blue crosses, data points from Table 2) and natural (red solid circles, data points from Table 3) interfaces.

Fig 8

4 Discussion

In this work, we have studied the elastic moduli of proteins and protein interfaces using a bottom-up approach. The material properties of solids are determined by the underlying physical interactions. For proteins and protein interfaces in particular, their material properties such as elasticity are dictated by the underlying molecular interactions.

The innovation of this work is the development of a novel method that can bridge between material properties at the macroscopic level and molecular interactions at the microscopic level. To achieve this, we employ both an all-atom NMA model (here sbNMA [33]) and an elastic solid model (σESM) for proteins and protein interfaces. And the “bridge” between the two models is a common physical property predictable by both models: the magnitude of thermal vibrations (see Eq (8)), i.e.,

ΔR2ESM=ΔR2NMA (12)

This connection allows us to determine the Young’s moduli of proteins and protein interfaces using molecular interactions.

A similar bridge can be built between σESM and crystallographic B-factors (Eq 9) as well. A significant benefit of doing both is that the two constraints are complementary to each other and together they provide both an upper and lower bound for the Young’s moluli of proteins.

Atoms at the interface region interact primarily through non-bonded interactions. One key realization is that the spring constant due to the van der Waals interaction, or kvdW, is small and is mostly in the range of 0.1-1 kcal/mol/Å2 (Fig 7). Since the separation distances r0 between pairs of atoms at the interface region should be about the sum of their van der Waals radii, one may estimate the order of Young’s modulus at the interface region by simply taking the ratio of the two, which comes to be about 0.1 to 3 Gpa. Our actual computations are in agreement with this estimation. Covalent solids such as diamond have a Young’s modulus that is hundreds of Gpa or even over a thousand Gpa, which is 2-3 orders higher than that of proteins or protein interfaces. The difference has a simple physical explanation: the spring constants for covalent bonds (as in CHARMM force field) are 2-3 orders higher than the van der Waals spring constants while the separation distances between interacting atoms are similar. In a nutshell, the low Young’s modulus at protein interface is directly due to the weak non-bonded interactions.

Another interesting finding is that, compared with protein interfaces, the Young’s modulus of proteins is several times higher. After all, proteins are chains of amino acids linked together by covalent bonds. The covalent chain serves as a “steel enforcement” [49] that stiffens a protein solid. The abundance of secondary structures in proteins provides additional enforcement through hydrogen bonds. As a result, the internal of a protein is several times stiffer than protein interface.

As part of future work, we plan to apply the method to study the elastic properties of protein capsids. The stiffness of viral capsids has been measured for a large number of viruses using AFM technique [5]. The stiffness varies greatly among the viral capsids, with that of some capsids being an order of magnitude higher than that of others [5]. The physical cause of this large degree of variations in stiffness is not known. As mentioned in the Introduction, existing work, using with either the thin shell or thick shell models, all represents a top-down approach that uses experimental determined stiffness values to fit the underlying elastic moduli of capsids and were not able to provide a physical explanation for the observed stiffness. Our present approach has the potential to take into the account not only the structural details of protein capsids (instead of a shell with uniform spherical geometry and thickness), but also the striking difference in elastic properties between protein subunits and protein interface regions, and consequently may be used to predict capsid stiffness and provide an explanation for the observed differences in stiffness among viral capsids.

Data Availability

Data are available at: https://github.com/gsongISU/sigmaESMrelease.

Funding Statement

The author(s) received no specific funding for this work.

References

  • 1. Grodzicki M. The concept of the chemical bond in solids. In Maksić Z., editor, Theoretical Models of Chemical Bonding, Part 2: The Concept of the Chemical Bond, pages 417–452. Springer-Verlag, 1990. [Google Scholar]
  • 2. Howard J. Mechanics of motor proteins and the cytoskeleton. Sinauer Associates Publishers, Sunderland, Mass., 2001. [Google Scholar]
  • 3. Guthold M., Liu W., Sparks E. A., Jawerth L. M., Peng L., Falvo M. A comparison of the mechanical and structural properties of fibrin fibers with other protein fibers. Cell Biochem Biophys, 49(3):165–81, 2007. 10.1007/s12013-007-9001-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Morozov V. N. and Gevorkian S. G. Low-temperature glass-transition in proteins. Biopolymers, 24(9):1785–1799, 1985. 10.1002/bip.360240909 [DOI] [Google Scholar]
  • 5. Mateu M. G. Mechanical properties of viruses analyzed by atomic force microscopy: a virological perspective. Virus Res, 168(1-2):1–22, 2012. 10.1016/j.virusres.2012.06.008 [DOI] [PubMed] [Google Scholar]
  • 6. Kojima H., Ishijima A., and Yanagida T. Direct measurement of stiffness of single actin-filaments with and without tropomyosin by in-vitro nanomanipulation. Proceedings of the National Academy of Sciences of the United States of America, 91(26):12962–12966, 1994. 10.1073/pnas.91.26.12962 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Fraser R. D. and Macrae T. P. Molecular structure and mechanical properties of keratins. Symp Soc Exp Biol, 34:211–46, 1980. [PubMed] [Google Scholar]
  • 8. Gilbert J., Charnley M., Cheng C., Reynolds N. P., and Jones O. G. Quantifying young’s moduli of protein fibrils and particles with bimodal force spectroscopy. Biointerphases, 12(4):041001, 2017. 10.1116/1.4996447 [DOI] [PubMed] [Google Scholar]
  • 9. Guthold M., Liu W., Stephens B., Lord S. T., Hantgan R. R., Erie D. A. Visualization and mechanical manipulations of individual fibrin fibers suggest that fiber cross section has fractal dimension 1.3. Biophys J, 87(6):4226–36, 2004. 10.1529/biophysj.104.042333 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Gavish B., Gratton E., and Hardy C. J. Adiabatic compressibility of globular proteins. Proc Natl Acad Sci U S A, 80(3):750–4, 1983. 10.1073/pnas.80.3.750 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Kundrot C. E. and Richards F. M. Crystal-structure of hen egg-white lysozyme at a hydrostatic-pressure of 1000 atmospheres. Journal of Molecular Biology, 193(1):157–170, 1987. 10.1016/0022-2836(87)90634-6 [DOI] [PubMed] [Google Scholar]
  • 12. Morozov V. N. and Morozova T. Y. Viscoelastic properties of protein crystals—triclinic crystals of hen egg-white lysozyme in different conditions. Biopolymers, 20(3):451–467, 1981. 10.1002/bip.1981.360200304 [DOI] [PubMed] [Google Scholar]
  • 13. Morozov V. N., Morozova T. Y., Kachalova G. S., and Myachin E. T. Interpretation of water desorption isotherms of lysozyme. International Journal of Biological Macromolecules, 10(6):329–336, 1988. 10.1016/0141-8130(88)90024-4 [DOI] [Google Scholar]
  • 14. Zenchenko T. A., Pozharskii E. V., and Morozov V. N. A magnetic micromethod to measure young’s modulus of protein crystals and other polymer materials. J Biochem Biophys Methods, 33(3):207–15, 1996. 10.1016/S0165-022X(96)00027-9 [DOI] [PubMed] [Google Scholar]
  • 15. Tait S., White E. T., and Litster J. D. Mechanical characterization of protein crystals. Particle & Particle Systems Characterization, 25(3):266–276, 2008. 10.1002/ppsc.200701112 [DOI] [Google Scholar]
  • 16. Koizumi H., Tachibana M., and Kojima K. Elastic constants in tetragonal hen egg-white lysozyme crystals containing large amount of water. Phys Rev E Stat Nonlin Soft Matter Phys, 79(6 Pt 1):061917, 2009. 10.1103/PhysRevE.79.061917 [DOI] [PubMed] [Google Scholar]
  • 17. Kitajima N., Tsukashima S., Fujii D., Tachibana M., Koizumi H., Wako K. Elastic constants in orthorhombic hen egg-white lysozyme crystals. Phys Rev E Stat Nonlin Soft Matter Phys, 89(1):012714, 2014. 10.1103/PhysRevE.89.012714 [DOI] [PubMed] [Google Scholar]
  • 18. Speziale S., Jiang F., Caylor C. L., Kriminski S., Zha C. S., Thorne R. E. Sound velocity and elasticity of tetragonal lysozyme crystals by brillouin spectroscopy. Biophys J, 85(5):3202–13, 2003. 10.1016/S0006-3495(03)74738-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Michel J. P., Ivanovska I. L., Gibbons M. M., Klug W. S., Knobler C. M., Wuite G. J. Nanoindentation studies of full and empty viral capsids and the effects of capsid protein mutations on elasticity and strength. Proc Natl Acad Sci U S A, 103(16):6184–9, 2006. 10.1073/pnas.0601744103 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Gibbons M. M. and Klug W. S. Influence of nonuniform geometry on nanoindentation of viral capsids. Biophysical Journal, 95(8):3640–3649, 2008. 10.1529/biophysj.108.136176 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Bathe M. A finite element framework for computation of protein normal modes and mechanical response. Proteins, 70(4):1595–609, 2008. 10.1002/prot.21708 [DOI] [PubMed] [Google Scholar]
  • 22. Lessen H. J., Fleming P. J., Fleming K. G., and Sodt A. J. Building blocks of the outer membrane: Calculating a general elastic energy model for beta-barrel membrane proteins. J Chem Theory Comput, 14(8):4487–4497, 2018. 10.1021/acs.jctc.8b00377 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Go N., Noguti T., and Nishikawa T. Dynamics of a small globular protein in terms of low-frequency vibrational modes. Proc. Natl. Acad. Sci. USA, 80(12):3696–3700, June 1983. 10.1073/pnas.80.12.3696 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Levitt M., Sander C., and Stern P. S. The normal modes of a protein: Native bovine pancreatic trypsin inhibitor. Int. J. Quant. Chem., 10:181–199, 1983. [Google Scholar]
  • 25. Brooks B. and Karplus M. Harmonic dynamics of proteins: normal modes and fluctuations in bovine pancreatic trypsin inhibitor. Proc. Natl. Acad. Sci. USA, 80(21):6571–6575, November 1983. 10.1073/pnas.80.21.6571 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Song G. Shape-preserving elastic solid models of macromolecules. Plos Computational Biology, 16(5), 2020. 10.1371/journal.pcbi.1007855 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Edelsbrunner H. and Mucke E. P. Three-dimensional alpha shapes. ACM Transactions on Graphics, 13(1):43–72, 1994. 10.1145/174462.156635 [DOI] [Google Scholar]
  • 28. Sanner M. F., Olson A. J., and Spehner J. C. Reduced surface: an efficient way to compute molecular surfaces. Biopolymers, 38(3):305–20, 1996. [DOI] [PubMed] [Google Scholar]
  • 29. Inczédy J., Lengyel T., Ure A. M., and Gelencsér A. Compendium of analytical nomenclature: definitive rules 1997. Blackwell Science, Oxford, 3 edition, 1998. [Google Scholar]
  • 30. Rahman T. and Valdman J. Fast matlab assembly of fem matrices in 2d and 3d: Nodal elements. Applied Mathematics and Computation, 219(13):7151–7158, 2013. 10.1016/j.amc.2011.08.043 [DOI] [Google Scholar]
  • 31. Logan D. L. A first course in the finite element method. Thomson, Toronto, Canada, 4th edition, 2007. [Google Scholar]
  • 32. Na H., Hinsen K., and Song G. The amounts of thermal vibrations and static disorder in protein x-ray crystallographic b-factors. Authorea, 2020. [DOI] [PubMed] [Google Scholar]
  • 33. Na H. and Song G. Bridging between normal mode analysis and elastic network models. Proteins, 82:2157–2168, September 2014. 10.1002/prot.24571 [DOI] [PubMed] [Google Scholar]
  • 34. MacKerell A. D., Bashford D., Bellott, Dunbrack R. L., Evanseck J. D., Field M. J. All-atom empirical potential for molecular modeling and dynamics studies of proteins. J. Phys. Chem. B, 102(18):3586–3616, April 1998. 10.1021/jp973084f [DOI] [PubMed] [Google Scholar]
  • 35. Na H., Song G., and ben Avraham D. Universality of vibrational spectra of globular proteins. Phys Biol, 13(1):016008, 2016. 10.1088/1478-3975/13/1/016008 [DOI] [PubMed] [Google Scholar]
  • 36. Na H. and Song G. Fast normal mode computations of capsid dynamics inspired by resonance. Phys Biol, 15(4):046003, 2018. 10.1088/1478-3975/aab813 [DOI] [PubMed] [Google Scholar]
  • 37. Na H. and Song G. Conventional nma as a better standard for evaluating elastic network models. Proteins, 83(2):259–67, 2015. 10.1002/prot.24735 [DOI] [PubMed] [Google Scholar]
  • 38. Ringe D. and Petsko G. A. The’glass transition’ in protein dynamics: what it is, why it occurs, and how to exploit it. Biophys Chem, 105(2-3):667–80, 2003. 10.1016/S0301-4622(03)00096-6 [DOI] [PubMed] [Google Scholar]
  • 39. Fuglebakk E., Reuter N., and Hinsen K. Evaluation of protein elastic network models based on an analysis of collective motions. J Chem Theory Comput, 9(12):5618–28, 2013. 10.1021/ct400399x [DOI] [PubMed] [Google Scholar]
  • 40. Burden C. J. and Oakley A. J. Anisotropic atomic motions in high-resolution protein crystallography molecular dynamics simulations. Phys Biol, 4(2):79–90, 2007. 10.1088/1478-3975/4/2/002 [DOI] [PubMed] [Google Scholar]
  • 41. Sayle R. A. and Milnerwhite E. J. Rasmol—biomolecular graphics for all. Trends in Biochemical Sciences, 20(9):374–376, 1995. 10.1016/S0968-0004(00)89080-5 [DOI] [PubMed] [Google Scholar]
  • 42. Humphrey W., Dalke A., and Schulten K. VMD—Visual Molecular Dynamics. J Mol. Graphics, 14:33–38, 1996. 10.1016/0263-7855(96)00018-5 [DOI] [PubMed] [Google Scholar]
  • 43. Tama F., Gadea F.X., Marques O., and Sanejouand Y.H. Building-block approach for determining low-frequency normal modes of macromolecules. Proteins, 41(1):1–7, 2000. [DOI] [PubMed] [Google Scholar]
  • 44. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, et al. The Protein Data Bank. Nucleic Acids Res. 2000;28(1):235–42. 10.1093/nar/28.1.235 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Rodgers D. W. Cryocrystallography. Structure, 2(12):1135–1140, December 1994. 10.1016/S0969-2126(94)00116-2 [DOI] [PubMed] [Google Scholar]
  • 46. Hooft R. W. W., Sander C., and Vriend G. Verification of protein structures: Side-chain planarity. Journal of Applied Crystallography, 29:714–716, 1996. 10.1107/S0021889896008631 [DOI] [Google Scholar]
  • 47. Touw W. G., Baakman C., Black J., te Beek T. A. H., Krieger E., Joosten R. P. A series of pdb-related databanks for everyday needs. Nucleic Acids Research, 43(D1):D364–D368, 2015. 10.1093/nar/gku1028 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. ben-Avraham D. Vibrational normal-mode spectrum of globular proteins. Phys. Rev. B, 47(21):14559–60, 1993. 10.1103/PhysRevB.47.14559 [DOI] [PubMed] [Google Scholar]
  • 49.K. Hinsen. The stuff that proteins are made of: physical properties of folded peptide chains, 2017. A research presentation, available at https://www.youtube.com/watch?v=RbOu1Vjp3Mc.

Decision Letter 0

Claudio M Soares

12 Mar 2021

PONE-D-21-02708

Bridging Between Material Properties Of Proteins And The Underlying Molecular Interactions

PLOS ONE

Dear Dr. Song,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the all the points raised by both reviewers.

Please submit your revised manuscript by Apr 26 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Claudio M. Soares, Ph.D

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: No

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This article by Song reports the fluctuations in the interfacial region between proteins, interpreted as a material property. The author draws an interesting analogy between the mechanical properties of fibrils and the interfacial mechanics of the proteins. The key point appears to be justified: the fluctuation of the interface between proteins is softer than the internal fluctuations of a protein because of the different interactions and degrees of freedom.

The key point missing from this paper is that the mechanical model is not truly required for looking at the fluctuations of proteins; they really are not isotropic materials. Discussion of this really must be done to make the point of the article.

My comments below mostly complain about the naive analysis of the van der waals interactions between proteins which I think needs to be modified.

Major comments:

In my opinion normal mode analysis does not really qualify as predicting thermal vibrations because it is using the thermal vibrations to compute the modes. Casting it in terms of an elastic material does not change this. What the model can do is predict the strain in response to stress. Alternatively, the author might cast it as the way of predicting the thermal vibrations of large fibers from a local property of the dimer (abstract).

Proteins aren’t mostly held together by van der waals forces. Here you must know that this could only refer to the fold of a protein. The atoms of a protein are connected by covalent bonds. The hydrophobic effect and hydrogen bonding are mostly at play. The hydrophobic effect is not the van der waals force (Section 1). “The interface region is kept together mostly by van der Waals interactions” (Section 3.5).

Model peptides and water near the interface region experience relatively strong Coulomb interactions with the protein and are repelled by the vdW approximation to the Pauli repulsion at close contact (the 1/r12 potential). I believe there would still be a fluctuating interface even in the complete absence of an attractive 1/r^6 potential. Therefore, the estimate is unjustified.

The fluctuating interface shown in Figure 3 is interesting but because it is a novel result and appears challenging to validate a more sophisticated estimate would be interesting to see. Is this interface the result of hydrophobic contacts? Can the fluctuations of the hydrophobic contact volume be explained in terms of the hydrophobic-effect-driven tension? This would seem to be a more fruitful explanation than purely in terms of vdW contacts.

Figure 6 is unacceptable. Considering protein index is not important (x-axis) and the values per-protein are listed in a table, I suggest a plot of sbNMA Young’s modulus vs. the B-factor young’s modulus. For example, when the crystallographic Young’s modulus (red) is greater than the sbNMA analysis (blue) the blue value is not shown. An XY/scatter plot would indicate the correlation of differences in protein Young’s modulus.

Minor:

“Along with liquid, gas, and plasma, solid is one of the four states of matter.” Wikipedia lists quite a few more such as Bose-Einstein condensates. There is no reason to make such obvious statements in a scientific article. I urge the author to be less didactic (Section 1).

The bulk modulus is not the Young’s modulus and the author confuses them in the introduction. The Young’s modulus refers to strain along a direction with the two other directions unconstrained. Water is a fluid and as such has a zero Young’s modulus (Section 1).

What is the rationale for using the mass-weighted averages as opposed to anything else? Mass has nothing to do with any elastic modulus; it is independent of mass-scaling (2.3).

Lessen et al, JCTC 2018 is another computational study attempting to compute the Young’s modulus: https://dx.doi.org/10.1021/acs.jctc.8b00377

Reviewer #2: This manuscript discusses calculations and results on the elasticity of proteins and their interfaces. The underlying ideas is to equate the results of calculations based on solid elastic models with those based on atomic vibrations either from atomistic MD simulations or crystallographic temperature factors. The issue of elasticity of proteins and their interfaces is an interesting one, and the basic mathematical idea is sound and well executed. There is however a very important problem with the paper with regard to how the author has chosen to think about protein interfaces, perhaps reflecting some unfamiliarity with how structural biologists think about protein quaternary structure vs incidental crystal contacts. Basically, the underlying idea of the paper is interesting and perhaps informative, but to be meaningful the study would have to be repeated with a different set of classifications of what is considered a protein interface.

Major problem:

The author takes crystal structures from the PDB and generates protein interfaces by examining additional chains that are in contact based on crystallographic operations. By itself, this is not an appropriate procedure for defining protein interfaces. The study doesn’t make the critical distinction between real protein-protein interfaces (i.e. those that have arisen in nature by evolution) and fortuitous contacts (often very small) that occur incidentally during crystallization. The former category is of substantial interest. The latter category might also be of interest in a more specialized context. But it is vital that the two things not be confused or conflated. There is a long literature on both subjects. There are also dedicated databased set up to help disentangle the two cases, which is not always easy. The PISA protein database is one. The PQS protein quaternary structure database is another. Just to be more explicit, one could go to such a resource and ask for a list of PDB structures where the protein molecule is believed to be a monomer in its native form (comprised of just one protein chain); for those cases, contacts present in the expanded crystal state would (presumably) represent fortuitous protein-protein crystal contacts. For analyzing natural protein-protein interfaces, one can query the database for proteins believed to exist as dimers in their natural biological forms. Then one can extract the dimeric forms directly from that curated database – obtaining the correct biological forms is not always trivial to do by expanding from the coordinates in the PDB as the author has attempted to do based on closest contact. Analysis of these cases would then represent evolved protein-protein interfaces. And of course quaternary structures higher than dimers could be considered, with added complexity. Once the author does this, then the results, which are likely to differ substantially, can be analyzed in separate categories of natural and incidental crystal interfaces. It is a virtual certainty that the cases of natural interfaces will not be as flexible as the whole set of protein-protein contacts (including incidental crystal contacts) which the author has presently defined.

Minor points:

1) At the bottom of page 8, the form and units of the stiffness matrix K should be given. E.g. is it the collection of second partial derivatives of the conformational energy (normalized by kT) with respect to atomic coordinates (or finite elements)? Similarly for the Hessian at the top of page 10.

2) The legend to Figure 4 should clarify that the two kinds of experimental results are being interpreted as upper and lower bounds.

3) The second paragraph on page 11 should introduce, at least parenthetically, that the accepted technical term for ‘B-factors’ is ‘atomic displacement parameters’.

4) Protein-protein interfaces are customarily discussed/plotted with respect to buried surface area rather than volumes.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Decision Letter 1

Claudio M Soares

20 Apr 2021

Bridging Between Material Properties Of Proteins And The Underlying Molecular Interactions

PONE-D-21-02708R1

Dear Dr. Song,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Claudio M. Soares, Ph.D

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Thank you for addressing my comments. I look forward to seeing electrostatics addressed in the model.

Reviewer #2: The revised version is much improved and ready for publication in its present form. The inclusion of natural interfaces now makes the paper useful.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Acceptance letter

Claudio M Soares

23 Apr 2021

PONE-D-21-02708R1

Bridging Between Material Properties Of ProteinsAnd The Underlying Molecular Interactions

Dear Dr. Song:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Claudio M. Soares

Academic Editor

PLOS ONE


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES