Abstract
In allosteric proteins, binding a ligand can affect function at a distant location, for example, by changing the binding affinity of a substrate at the active site. The induced fit and population shift models, which differ by the assumed number of stable configurations, explain such cooperative binding from a thermodynamic viewpoint. Yet, understanding what mechanical principles constrain these models remains a challenge. Here, we provide an empirical study on 34 proteins supporting the idea that allosteric conformational change generally occurs along a soft elastic mode presenting extended regions of high shear. We argue, based on a detailed analysis of how the energy profile along such a mode depends on binding, that in the induced fit scenario, there is an optimal stiffness ∼ 1/N for cooperative binding, where N is the number of residues. We find that the population shift scenario is more robust to mutations affecting stiffness because binding becomes more and more cooperative with stiffness up to the same characteristic value , beyond which cooperativity saturates instead of decaying. We numerically confirm these findings in a nonlinear mechanical model. Dynamical considerations suggest that a stiffness of order is favorable in that scenario as well, supporting that for proper function, proteins must evolve a functional elastic mode that is softer as their size increases. In consistency with this view, we find a fair anticorrelation between the stiffness of the allosteric response and protein size in our data set.
Significance
Many proteins are allosteric: binding a ligand affects their activity at a distant site. Understanding the principles allowing for such an action at a distance is both of fundamental and practical importance. From the thermodynamic viewpoint, two models have been proposed, according to which binding creates a new stable configuration or instead shifts the thermal equilibrium between existing states. We perform a mechanical analysis of these models and show that they are not equally robust to mutations. We argue that in both cases, function can most properly occur along a soft elastic mode, whose stiffness decreases rapidly with protein size. We show data on 34 proteins substantiating this result, supporting a, to our knowledge, new principle for allosteric design.
Introduction
Many proteins are allosteric: binding a ligand at one or several allosteric sites can regulate function at a distant site, a long-range communication often accompanied by large conformational changes (1,2). There is a considerable interest in predicting the amino acids involved in this communication, or “allosteric pathway,” from structure or sequence data (3,4) because they can be used as targets for drug design (5). Yet, understanding the physical principles underlying such action at a distance in proteins remains a challenge (6,7). From a thermodynamic standpoint, two distinct views have been proposed. In the induced fit scenario, exemplified by the Koshland-Némethy-Filmer model (8), the protein essentially lies in one single state. The latter changes as binding occurs, leading to a conformational change. In an energy landscape picture such as that of Fig. 1 B, it corresponds to a displacement of the energy minimum upon binding. By contrast, in the population shift model exemplified by the Monod-Wyman-Changeux (MWC) model (9), two states are always present. Their relative stability can change sign upon binding, leading to an average conformational change. Although each of these models presumably applies to various proteins, they do not specify which designs allow for efficient action at a distance and how robust these designs are to mutations (10).
In several proteins—see below for a systematic study—it has been observed that the allosteric response induced by binding occurs predominantly along one or few vibrational modes (11, 12, 13, 14, 15). This result supports that in at least some proteins, elasticity—possibly nonlinear—is an appropriate language to describe allostery (in contrast to intrinsically disordered proteins that may be considered more as liquids than solids, for which the analysis proposed here would not hold). Very recently, there has been a considerable effort to use in silico evolution (16,17) to study how linear elastic materials can evolve to accomplish an allosteric task (18, 19, 20, 21, 22, 23, 24, 25, 26). In general, binding a ligand locally distorts the protein, which is modeled by imposing local displacements at some site, generating an extended elastic response that in turn determines fitness (chosen specifically to accomplish a given task). These models fall into the induced fit scenario because in the framework of linear elasticity, there is always a single minimum of energy. A particularly key allosteric function within proteins is the amount of cooperative binding, defined as the change of binding energy of a substrate at the active site caused by binding a ligand at the allosteric site. Materials optimized to display such a cooperativity over long distances develop a single extended “mechanism”—a soft elastic mode like the motion of closing scissors—connecting the two binding sites (21). It is found that the stiffness ka (i.e., the curvature of the energy) of this mode cannot be too large nor too small for cooperativity to occur and that optimal design corresponds to ∼ 1/N, where N is the number of particles in the system. If proteins are nearly optimal, mutations stiffening that mode should thus diminish cooperativity. Yet, these predictions are restricted to strictly linear elasticity, an approximation that presumably does not hold in the regime in which most proteins operate—certainly not in the population shift scenario.
In this work, we show that the population shift and the induced fit models are very different from a mechanical perspective. In the former case as well, function can be achieved by developing a mechanism or soft elastic mode, but cooperativity, instead of steadily decreasing, saturates to a constant value once the mode stiffness passes some characteristic value, whose scaling with N is ∼ 1/N. We confirm this prediction in a nonlinear elastic model of allostery. This result implies that cooperativity is more robust toward mutations increasing stiffness in the population shift scenario. Yet, displaying a stiffness much larger than implies a very long transition time between the two states and is presumably prohibited, suggesting the hypothesis that allosteric proteins function with modes presenting a stiffness near the characteristic value in both cases. We test this proposition systematically using x-ray crystallographic data of 34 allosteric proteins. We first confirm that one or a few vibrational modes contribute to the allosteric response and introduce a, to our knowledge, new observable establishing that this response presents unusually extended regions of large shear, as found previously for three proteins (27). Next, we confirm that the characteristic mode stiffness tends to decrease with the propagation length, as we expect from the predicted scaling of . Finally, we suggest systematic mutational studies to further test how mechanics constrains allostery.
Methods
Allosteric response
We identify a set of 34 allosteric proteins in the Protein Data Bank for which both the active (ligand bound at the allosteric site) and inactive (no ligand bound at the allosteric site) crystalline x-ray structures are available. Their Protein Data Bank identifiers are taken from (27,28) and reported in Supporting Materials and Methods. The set is diverse in functionality and includes enzymes (13), G-proteins (10), kinases (3), response regulators (3), DNA-binding proteins (4), and the human serum albumin, among which 12 protein complexes are present. We can thus estimate the allosteric response as the displacement field between the inactive and active structures (after having aligned them via the software Pymol 2.1.1 (29)). Here, we focus on the motion of the N amino acids, located by the position of their α-carbon. As an illustration, the allosteric response of a given protein (the elongation factor Tu) is shown in black arrows in Fig. 5 A. From the allosteric response , one can readily estimate 1) the magnitude of the displacement ||dRa||2; 2) the fraction of the protein involved in the response. For any displacement field, this fraction is usually estimated via the participation ratio (30),
(1) |
3) A measure of how much relative displacement takes place around atom i. Following (27), we consider the shear pseudoenergy Esh(i) quantifying the amount of strain—essentially a measure of the relative displacement between adjacent atoms—at residue i, whose precise definition is given in the Supporting Materials and Methods. Esh(i) = 0 indicates that the protein moves as a rigid body near atom i and, by contrast, is large when atoms slide rapidly past each other. Esh(i) is shown in color in Fig. 5 A for the protein Tu, illustrating that two parts of the protein are rigidly moving (and counter-rotate), whereas the central region displays significant pseudoenergy Esh(i), which is reminiscent of a hinge design.
Elastic networks analysis
To estimate protein elasticity, we use the elastic network model (ENM) (31), in which harmonic springs of identical stiffness are placed between all N α-carbons laying below a chosen cutoff radius Rc ∈ [8–12] Å. The ENM is obviously a crude approximation of real atomic interactions. Yet, its simplicity allows for the systematic study of various proteins, and it has been successful in capturing normal modes relevant for the function of some proteins. The dependence of the results on the value of Rc are indicated by error bars in Fig. 5 B and discussed in Supporting Materials and Methods for Fig. 6 B. This procedure defines an elastic energy from which the matrix of the second derivatives, i.e., the Hessian matrix H, can be computed. From H, one can readily estimate the stiffness ka of the allosteric response as the curvature of the elastic energy in that direction:
(2) |
Finally, the eigenvectors of H define the 3N vibrational modes of the protein . Following (11, 12, 13, 14, 15), the overlap qi between the allosteric response and mode i characterizes their similarity (qi = 1 implies that they are identical):
(3) |
Results and Discussion
Geometrical interpretation of mechanical constraints in induced fit allostery
As sketched in Fig. 1 A, a protein with two binding sites can be unbound (labeled “00”), bound to a single ligand (labeled “01” or “10”), or doubly bound (labeled “11”). We define by E00, E10, E01, E11 the energy of the protein in these four situations (corresponding to the minimal energy of their energy landscape) and choose E00 = 0 as the reference energy. Cooperativity is then defined as
(4) |
To simplify notation below, we assume a symmetry between the states 01 and 10, in particular E10 = E01. Our qualitative conclusions below, however, remain valid even if this symmetry does not hold.
Consider a protein displaying cooperativity thanks to the presence of a soft elastic mode. Let us denote by x a variable indicating the motion along that mode (see our numerical model below for a concrete example in the context of a shear design), which varies from zero to unity as the protein undergoes its allosteric response. The energy profile E00(x) of the unbound state follows
(5) |
where is the norm of the allosteric response and is the vector displacement of the amino acid i. We expect that if the protein has N residues, ||dRa||2 is on the order of Na2, where a is the interatomic distance. We will confirm empirically the dependence of ||dRa||2 with N below. ka is the mode stiffness and f(x) some function of order unity such that f″(x = 0) = 1. For a purely linear elastic material, f(x) = x2/2. More generally, for the induced fit scenario, f(x) presents a single minimum, as illustrated in Fig. 1 B. In (21), it was argued that if linear elasticity applies, then there is an optimal stiffness for cooperativity. We present a geometrical interpretation of this result that extends it to the induced fit scenario in general and will be useful to explain why the population shift scenario behaves differently. Local chemical interactions that lead to identical binding energies for the inactive and active states do not affect cooperativity and thus need not be considered here. For concreteness, we assume that after some motion x0 = 1 along that mode, the protein shape can accommodate perfectly both ligands without deforming. It implies that the energy profiles of the bound states E10(x) and E11(x) satisfy E10(x0) = E11(x0) = E00(x0), as pictured in Fig. 1 B. However as x departs from x0, the protein shape does not match the ligands, imposing an elastic strain at the binding sites and leading to an increase of elastic energy in the protein that will trigger motion along other elastic modes. Assuming that the ligands are rigid and that each binding site involves on the order of n0 atoms that move by a distance a as the protein undergoes its allosteric response, we have E01(x) − E00(x) = n0ka2g(x − x0), where k characterizes the stiffness of interatomic interactions and is large in comparison with the one of soft modes, i.e., k ka. Here, g is some dimensionless function vanishing quadratically in zero but possibly nonlinear at large arguments.
E11, E01 can be computed as the minimum of the curves E11(x) and E01(x), respectively, from which the cooperativity ΔΔE is readily computed. Two extreme cases occur, illustrated in Fig. 1 B:
-
1)
If kaN nok (Fig. 1 B.I), as x moves away from x0, the elastic energy induced by binding n0ka2g(x − x0) is very significant in comparison to the mode energy E00(x) ∼ kaNa2f(x). Thus, both E10(x) and E11(x) have a sharp minimum near x0, with E11 ≈ E10 ≈ ka||δRa||2f(1). Thus, ΔΔE = E10 + E01 − E11 ≈ E10 ≈ ka||δRa||2f(1), which vanishes as ka → 0.
-
2)
If kaN n0k is very large (Fig. 1 B.III), noka2g(x − x0) is small in comparison to E00(x): E00(x), E11(x), and E01(x) are very close to each other and must thus all present a minimum near x = 0. Thus, binding does not trigger motion along the soft mode, whose presence is useless. No extended modes couples the two binding sites, and E11 ≈ E01 + E10, leading to ΔΔE → 0 as ka → ∞.
-
3)
Optimal cooperativity is thus found at some intermediary ∼ nok/N, corresponding to Fig. 1 B.II. Note that this argument for an optimal does not require the energy profile f(x) to be an exact parabola as long as it is monotonically growing in both directions around its minimum.
Mechanical aspects of the population shift model
MWC model
We recall some aspects of the MWC model. To simplify notation, we consider that the protein displays two symmetric binding sites as illustrated in Fig. 1 A. The protein is assumed to lie in two possible distinct configurations, “inactive” (In) and “active” (Ac). In the absence of binding, we take the energy of the inactive state as our reference (i.e., ) and denote the energy of the active state = E0. We assume that the active configuration has a well-suited geometry to bind each ligand; thus, no elastic energy is spent for binding, and = E0. By contrast, we assume that in the inactive state, binding costs some energy ΔE, leading to = ΔE and = 2ΔE. The last assumption of additivity of binding energies within a given configuration, i.e., for a frozen mode amplitude, is expected to be accurate if the two binding sites are distant enough because no elastic coupling between them is expected in that case.
For each binding situation, the configuration (inactive or active) chosen is the one with the smallest energy, e.g., E01 = min. Computing the cooperativity, one finds three different cases:
-
1)
if E0 < ΔE, then the binding of one ligand is sufficient to drive the system in the active state, implying E10 = E11 = E0 and ΔΔE = E0;
-
2)
if ΔE < E0 < 2ΔE, then the binding of one ligand is not sufficient, and two ligands have to bind to drive the system in the active state. Consequently, ΔΔE = 2ΔE − E0;
-
3)
if E0 > 2ΔE, then ΔΔE = 0 because the system stays in the inactive state even if two ligands are bound. The sketch of this behavior is shown in Fig. 2, illustrating that the maximal cooperativity is found for E0 = ΔE.
Mechanical consideration in the MWC model
Our observations (see the empirical section below) indicate that a significant fraction of allosteric proteins operate mainly along one normal mode of the elastic energy, supporting the idea that in these cases, a favored path connects the inactive and active configurations. We expand the energy (in the absence of ligand) in terms of the motion x along that path (in this two-states case, it is more convenient to choose a coordinate x varying between −1 and +1 as the protein undergoes its allosteric response). Note that if nonlinearities are present, this path is not along a single linear mode but bends in phase space. Our analysis below holds independently of such bending. We keep the minimal number of nonlinear terms that allow displaying two states (a polynomial of four degrees has five parameters; three of them can be fixed by redefining the reference energy and changing the definition of x by both a multiplicative and additive constants):
(6) |
where b is a parameter reflecting how the energy profile is tilted toward the inactive state, ka characterizes the stiffness of the mode, and ||dRa||2 is the square norm of the allosteric response.
A typical profile following Eq. 6 is shown in Fig. 3 A. We denote by inactive the lowest of the two minima and the other one by active. Note that 1) in the case b = 0, Eq. 6 describes two identical minima at x = ±1 of stiffness ka, separated by an energy barrier Eb = ka ||δRa||2/8. 2) When the parameter b is positive, we have EAc > EIn up to b = bc = 1/ where the active state becomes unstable and only a single stable state is left. 3) At fixed ka, the energy difference between the two states E0 is maximal at b = bc, where one finds E0 = ka ||dRa||2. 4) For small b, E0 2ka ||dRa||2b. In what follows, we focus on the case b < bc, where the population shift model lies.
Next, we assume that the active configuration matches the shape of both ligands so that binding events in that state cost no energy. However, by moving away from the configuration, the shapes of the protein and ligands do not match anymore, and the protein needs to deform elastically near the binding site. Again, assuming that each binding site involves on the order of n0 atoms, which move by a distance of order a from the active to the inactive state, we have a binding energy E10(x) − E00(x) = n0ka2g(x − xAc), where xAc ≈ 1 is the location of the active state along the path. This energy is exemplified by the difference between the blue and black curves in Fig. 3 A. Thus, the binding energy in the inactive state follows ΔE = n0ka2g(xIn − xAc) ≈ 4n0ka2, which is independent of the protein size and mode stiffness ka. As explained before, if the two binding sites are distant enough, for a given mode amplitude, the elastic costs of binding will simply add up: E11(x) − E00(x) = 2n0ka2g(x − xAc), as shown in red in Fig. 3 A.
How the mode stiffness constrains cooperativity
To quantify this constraint we define the maximal cooperativity over all possible tilts b given ka: ΔΔE∗ maxb{ΔΔE(ka, b)}. We find two regimes:
-
1)
If Nka n0k, then the elastic costs associated with binding are very large compared to E0. Both E10(x) and E11(x) are peaked close to xAc, as illustrated in Fig. 3 B.a. Thus, E0 < ΔE, implying ΔΔE = E0 according to Fig. 2, which is maximized at b = bc leading to ΔΔE = ka||dRa||2 ∼ kaNa2. Thus, ΔΔE vanishes linearly at small ka, as illustrated in Fig. 3 B. This result is qualitatively similar to the induced fit case, for which ΔΔE also vanishes linearly as shown in Fig. 1 B.
-
2)
If Nka n0k, then the elastic cost is very small in comparison to E0: E00(x), E10(x), and E11(x) are almost identical as illustrated in Fig. 3 B.c. In that regime, cooperativity is optimized by choosing a small tilt fixing E0 to ΔE according to Fig. 2, implying ΔΔE = ΔE ∼ n0ka2, which is independent of ka. This plateau behavior is represented in Fig. 3 B and appears at ∼ n0k/N.
This result represents a fundamental difference with the induced fit case, for which a large stiffness destroys cooperativity. Indeed, in the induced fit scenario, a large stiffness implies that the minimal energy is always found for x ≈ 0 as illustrated in Fig. 1 B.III, implying that binding does not move the protein along that mode, which is thus useless. This state of affairs is ultimately a geometric necessity stemming from the fact that the three curves E00(x), E10(x), and E11(x) must be very close to each other in that regime, and each must present a single minimum. Consequently, the positions of these minima must be very similar in the three cases, leading to x ≈ 0 independently of binding ligands or not. This geometric necessity vanishes as soon as two minima are present.
Note that although ΔΔE asymptotes to a constant for ka , the barrier Eb between the inactive and active states grows linearly with ka in that limit. Large barriers would lead to undesirably slow transition rate between states; thus, we expect that in practice, ka lies reasonably close to .
A mechanical model for population shift allostery
We seek to model that 1) the allosteric response often takes place mainly along a single vibrational mode. 2) Various architectures can lead to allostery, including the well-known shear (27,32) or hinge (13) designs and others not falling in these categories (33,34). Such a diversity is also found in in silico evolution schemes (21). Yet, such synthetic architectures always present soft extended regions in which most of the strain (i.e., relative motion) is located. Such an observation was made in a few proteins (27) and will be generalized below. For proteins presenting two stable configurations, we expect these regions to present two possible ways of locally stacking amino acids well.
As an illustration, we consider the shear design in which the protein presents two three-dimensional rigid regions connected by a soft planar layer that can easily deform. The rigid regions consist of harmonic springs of stiffness k, shown in blue in Fig. 4 A. The soft layer consists of anharmonic springs (shown in red in Fig. 4 A), whose energy versus extension curve is nonlinear, chosen to have the same form as Eq. 6. These nonlinear springs have a characteristic stiffness kw and present two stable extensions at which they exert no force, whose relative energy is controlled by some bias bw. These stable extensions are chosen such that two states of the protein as a whole, the inactive and active states shown in Fig. 4 A, present no contact forces by construction and are thus local minima of the energy. Finally, the protein presents two binding sites at its top and bottom. At each site, binding imposes that the distance between two nodes (indicated by black lines in Fig. 4 A) equals the distance it naturally presents in the active state, thus favoring it. The details about the construction of the microscopic model are discussed in Supporting Materials and Methods.
We can now numerically compute the energy profiles E00(x), E01(x), and E11(x) as a function of the motion along the shear mode x by imposing a shear displacement (i.e., a value of x) and letting the entire elastic energy of the material relax except for that mode. The insets of Fig. 4 show our results. The value of the mode stiffness ka can be extracted from fitting Eq. 6 to E00(x) and measuring the displacement norm ||dRa|| and can be increased by increasing kw. From E00(x), one readily computes the energy difference E0 between the inactive and active states, which can be increased by monitoring the microscopic bias bw. From the minima of E00(x), E01(x), and E11(x), one immediately extracts the binding costs ΔE and ΔΔE.
Fig. 4 B shows ΔΔE for two values of ka as the energy difference E0 is increased. For large ka, ΔΔE passes through a maximal ΔΔE∗ = ΔE, whereas for small ka, ΔΔE is smaller, and its maximum is fixed by the maximal achievable energy difference E0. ΔΔE∗ as a function of ka is shown for different system sizes N in Fig. 4 C, confirming the presence of two regimes with a crossover at some ∼ 1/N as shown in Fig. 4 D. Overall, these observations validate our theoretical predictions on the dependence of with the number of residues and on how the stiffness ka qualitatively affects cooperativity.
Empirical study of 34 allosteric proteins
Geometry of the allosteric response
Fig. 5 B reports the maximal overlap q∗ maxiqi as a function of the participation ratio P. Our observation indicate that q∗ is in general large (in half of the cases, larger than 0.45), supporting further that allostery indeed occurs mainly along one mode (11, 12, 13, 14, 15). Interestingly, this effect is stronger when most sites of the protein are involved in the allosteric response (P large). Interestingly, some protein complexes (for example, ATCase; see the values of q∗ in Table S1) also display a large projection of their allosteric response on a single mode, supporting that mechanical considerations can be valuable in such cases as well.
We now provide systematic data supporting that the allosteric response presents extended regions of large shear energy (27). More specifically, we argue that while some vibrational modes can present significant shear (e.g., localized modes capturing the motion of dandling loops) and other can be extended (such as plane-wave-like modes), the allosteric response is unique in presenting both aspects, thus revealing a specific design principle. To quantify this effect, we introduce the quantity, which can be defined on any displacement field:
(7) |
where ||Esh|| is the total magnitude of the shear energy, i.e., ||Esh|| = . LogΓ is large if the displacement is extended and if the shear energy is large. The factor γ characterizes the trade-off between these two features. Here, we choose γ = 3.5, reflecting the fact that for vibrational modes, we find that P varies ∼3.5 times less in relative terms than ||Esh||, as shown in the Supporting Materials and Methods. Thus, for γ = 3.5, the spatial extension and the amount of shear equally affect logΓ. Fig. 5 C shows logΓ averaged over the 17 proteins with q∗ > 45% for the allosteric response (yellow line), the mode with maximal overlap (blue line), and the first 75 vibrational modes (having subtracted the one with largest overlap) as function of the mode rank. We find that Γ is typically 160 times larger for the allosteric response than for vibrational modes, a very significant difference underlying the specific geometry of the allosteric response.
Scaling of response stiffness ka with protein size
We can now test our conjecture that the allosteric stiffness ka is close to ∼ 1/N where cooperativity saturates, which implies in particular an anticorrelation between ka and protein size. In our theoretical estimate of , we have assumed that the allosteric response magnitude is linear in the protein size, i.e., ||dRa||2 ∼ N. It is a natural assumption because the larger the protein, the more likely its response involves many residues. The relationship between these two quantities is tested in 34 proteins in Fig. 6 A. We indeed find a strong correlation between the logarithms of ||dRa|| and N (Pearson coefficient r = 0.76 with p-value p = 1.64 × 10−7. Pearson coefficients are computed on the logarithmic values via the MATLAB R2017b (The MathWorks, Natick, MA) function corr). Overall, data are consistent with our assumption of proportionality.
Finally, we plot the allosteric response stiffness ka measured according to Eq. 2 in terms of N for all proteins in Fig. 6 B. A key finding is the fair anticorrelation between the logarithms of these two quantities (Pearson coefficient r = −0.64 with p-value p = 4.00 × 10−5), supporting the idea that larger allosteric proteins need to evolve a softer elastic mode to accomplish function, as expected from our analysis. The signal that we observe is encouraging, especially given the possible sources of noise in the analysis, among which, in particular, is the use of the ENM to build the Hessian matrix.
Conclusions
We have provided systematic evidence that the allosteric response occurs along one soft elastic mode, and we have introduced a, to our knowledge, novel observable Γ to establish that this response generally displays unusually extended regions of high shear strain. These observations support that for many proteins, elasticity is a useful starting point to describe allostery. We have revisited the two classical thermodynamic models of allostery from this perspective and provided a detailed study of how the energy profile along the soft mode evolves with binding. We find that induced fit and population shift models qualitatively differ. In the induced fit model, there is an optimal stiffness ∼ 1/N associated to that mode beyond which the cooperative binding energy eventually decreases to zero. The population shift model is more robust to mutations affecting stiffness, and ∼ 1/N simply marks a crossover beyond which cooperativity saturates and the transition time between configurations rapidly explodes. We introduced a, to our knowledge, novel nonlinear elastic model for allostery supporting these views. Our key result is that proper function is achieved if proteins evolve an elastic mode whose softness must rapidly decrease with size, a prediction supported by the anticorrelation observed between these quantities.
Systematic mutation scan on one single protein, in which binding assays to measure cooperativity are combined with single molecule experiments or ultrafast laser pulses to estimate the stiffness of the allosteric response, would be extremely useful to test the predicted relationship between these quantities. Molecular dynamics experiments could further test how the energy profile along the soft elastic mode evolves with binding. Elucidating such an interplay between thermodynamics and mechanics in proteins would be valuable in a variety of tasks, including de novo protein design and the discovery of novel allosteric pathways.
Author Contributions
R.R., C.B., L.Y., and M.W. conceived the project, and M.W. supervised research. R.R., S.M.F., and S.Z. developed the tools for the empirical data analysis. L.Y., C.B., and R.R. performed computations and developed the numerical methods for the network model. L.Y. and M.W. developed the theory. All authors analyzed the results and wrote the final manuscript.
Acknowledgments
We thank P. Barth, B. Bravi, P. De Los Rios, D. Malinverni, R. B. Phillips, and M. Popović for discussions.
R.R. is supported by the Swiss National Science Foundation under Grant No. 200021-165509/1. M.W. thanks the Swiss National Science Foundation (Grant No. 200021-165509) and the Simons Foundation (Grant #454953). This material is based upon work performed using computational resources supported by the “Center for Scientific Computing at UCSB” and National Science Foundation Grant CNS-0960316, by the “High Performance Computing at NYU” and by the “High Performance Computing at EPFL.”
Editor: Amedeo Caflisch.
Footnotes
Le Yan’s present address is Chan Zuckerberg Biohub, San Francisco, California 94158.
Supporting Material can be found online at https://doi.org/10.1016/j.bpj.2019.10.002.
Contributor Information
Riccardo Ravasio, Email: riccardo.ravasio@epfl.ch.
Matthieu Wyart, Email: matthieu.wyart@epfl.ch.
Supporting Material
References
- 1.Perutz M.F., Rossmann M.G., North A.C. Structure of haemoglobin: a three-dimensional Fourier synthesis at 5.5-A. resolution, obtained by X-ray analysis. Nature. 1960;185:416–422. doi: 10.1038/185416a0. [DOI] [PubMed] [Google Scholar]
- 2.Perutz M.F. Stereochemistry of cooperative effects in haemoglobin. Nature. 1970;228:726–739. doi: 10.1038/228726a0. [DOI] [PubMed] [Google Scholar]
- 3.Halabi N., Rivoire O., Ranganathan R. Protein sectors: evolutionary units of three-dimensional structure. Cell. 2009;138:774–786. doi: 10.1016/j.cell.2009.07.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Amor B.R., Schaub M.T., Barahona M. Prediction of allosteric sites and mediating interactions through bond-to-bond propensities. Nat. Commun. 2016;7:12477. doi: 10.1038/ncomms12477. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Nussinov R., Tsai C.J. Allostery in disease and in drug discovery. Cell. 2013;153:293–305. doi: 10.1016/j.cell.2013.03.034. [DOI] [PubMed] [Google Scholar]
- 6.Wodak S.J., Paci E., McLeish T. Allostery in its many disguises: from theory to applications. Structure. 2019;27:566–578. doi: 10.1016/j.str.2019.01.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Thirumalai D., Hyeon C., Lorimer G.H. Symmetry, rigidity, and allosteric signaling: from monomeric proteins to molecular machines. Chem. Rev. 2019;119:6788–6821. doi: 10.1021/acs.chemrev.8b00760. [DOI] [PubMed] [Google Scholar]
- 8.Koshland D.E., Jr., Némethy G., Filmer D. Comparison of experimental binding data and theoretical models in proteins containing subunits. Biochemistry. 1966;5:365–385. doi: 10.1021/bi00865a047. [DOI] [PubMed] [Google Scholar]
- 9.Monod J., Wyman J., Changeux J.P. On the nature of allosteric transitions: a plausible model. J. Mol. Biol. 1965;12:88–118. doi: 10.1016/s0022-2836(65)80285-6. [DOI] [PubMed] [Google Scholar]
- 10.Hopfield J.J. Relation between structure, co-operativity and spectra in a model of hemoglobin action. J. Mol. Biol. 1973;77:207–222. doi: 10.1016/0022-2836(73)90332-x. [DOI] [PubMed] [Google Scholar]
- 11.Kitao A., Go N. Investigating protein dynamics in collective coordinate space. Curr. Opin. Struct. Biol. 1999;9:164–169. doi: 10.1016/S0959-440X(99)80023-2. [DOI] [PubMed] [Google Scholar]
- 12.Bahar I., Erman B., Covell D.G. Collective motions in HIV-1 reverse transcriptase: examination of flexibility and enzyme function. J. Mol. Biol. 1999;285:1023–1037. doi: 10.1006/jmbi.1998.2371. [DOI] [PubMed] [Google Scholar]
- 13.Xu C., Tobi D., Bahar I. Allosteric changes in protein structure computed by a simple mechanical model: hemoglobin T<-->R2 transition. J. Mol. Biol. 2003;333:153–168. doi: 10.1016/j.jmb.2003.08.027. [DOI] [PubMed] [Google Scholar]
- 14.De Los Rios P., Cecconi F., Juanico B. Functional dynamics of PDZ binding domains: a normal-mode analysis. Biophys. J. 2005;89:14–21. doi: 10.1529/biophysj.104.055004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Zheng W., Brooks B.R., Thirumalai D. Low-frequency normal modes that describe allosteric transitions in biological nanomachines are robust to sequence variations. Proc. Natl. Acad. Sci. USA. 2006;103:7664–7669. doi: 10.1073/pnas.0510426103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Hemery M., Rivoire O. Evolution of sparsity and modularity in a model of protein allostery. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 2015;91:042704. doi: 10.1103/PhysRevE.91.042704. [DOI] [PubMed] [Google Scholar]
- 17.Tlusty T., Libchaber A., Eckmann J.-P. Physical model of the genotype-to-phenotype map of proteins. Phys. Rev. X. 2017;7:021037. [Google Scholar]
- 18.Yan L., Ravasio R., Wyart M. Architecture and coevolution of allosteric materials. Proc. Natl. Acad. Sci. USA. 2017;114:2526–2531. doi: 10.1073/pnas.1615536114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Rocks J.W., Pashine N., Nagel S.R. Designing allostery-inspired response in mechanical networks. Proc. Natl. Acad. Sci. USA. 2017;114:2520–2525. doi: 10.1073/pnas.1612139114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Flechsig H. Design of elastic networks with evolutionary optimized long-range communication as mechanical models of allosteric proteins. Biophys. J. 2017;113:558–571. doi: 10.1016/j.bpj.2017.06.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Yan L., Ravasio R., Wyart M. Principles for optimal cooperativity in allosteric materials. Biophys. J. 2018;114:2787–2798. doi: 10.1016/j.bpj.2018.05.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Dutta S., Eckmann J.P., Tlusty T. Green function of correlated genes in a minimal mechanical model of protein evolution. Proc. Natl. Acad. Sci. USA. 2018;115:E4559–E4568. doi: 10.1073/pnas.1716215115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Wang S.W., Bitbol A.F., Wingreen N.S. Revealing evolutionary constraints on proteins through sequence analysis. PLoS Comput. Biol. 2019;15:e1007010. doi: 10.1371/journal.pcbi.1007010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Flechsig H., Togashi Y. Designed elastic networks: models of complex protein machinery. Int. J. Mol. Sci. 2018;19 doi: 10.3390/ijms19103152. E3152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Bravi B., Ravasio R., Wyart M. Direct coupling analysis of epistasis in allosteric materials. arXiv. 2019 doi: 10.1371/journal.pcbi.1007630. http://arxiv.org/abs/1811.10480 arXiv:1811.10480. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Rocks J.W., Ronellenfitsch H., Katifori E. Limits of multifunctionality in tunable networks. Proc. Natl. Acad. Sci. USA. 2019;116:2506–2511. doi: 10.1073/pnas.1806790116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Mitchell M.R., Tlusty T., Leibler S. Strain analysis of protein structures and low dimensionality of mechanical allosteric couplings. Proc. Natl. Acad. Sci. USA. 2016;113:E5847–E5855. doi: 10.1073/pnas.1609462113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Daily M.D., Gray J.J. Local motions in a benchmark of allosteric proteins. Proteins. 2007;67:385–399. doi: 10.1002/prot.21300. [DOI] [PubMed] [Google Scholar]
- 29.Schrödinger L.L.C. The PyMOL Molecular Graphics System, Version 1.8. 2015. https://pymol.org/2/
- 30.Bell R.J., Dean P. Atomic vibrations in vitreous silica. Discuss. Faraday Soc. 1970;50:55–61. [Google Scholar]
- 31.Atilgan A.R., Durell S.R., Bahar I. Anisotropy of fluctuation dynamics of proteins with an elastic network model. Biophys. J. 2001;80:505–515. doi: 10.1016/S0006-3495(01)76033-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Gerstein M., Lesk A.M., Chothia C. Structural mechanisms for domain movements in proteins. Biochemistry. 1994;33:6739–6749. doi: 10.1021/bi00188a001. [DOI] [PubMed] [Google Scholar]
- 33.Goodey N.M., Benkovic S.J. Allosteric regulation and catalysis emerge via a common route. Nat. Chem. Biol. 2008;4:474–482. doi: 10.1038/nchembio.98. [DOI] [PubMed] [Google Scholar]
- 34.McLaughlin R.N., Jr., Poelwijk F.J., Ranganathan R. The spatial architecture of protein function and adaptation. Nature. 2012;491:138–142. doi: 10.1038/nature11500. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Kjeldgaard M., Nissen P., Nyborg J. The crystal structure of elongation factor EF-Tu from Thermus aquaticus in the GTP conformation. Structure. 1993;1:35–50. doi: 10.1016/0969-2126(93)90007-4. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.