Abstract
Using an all-atom representation, we exhaustively enumerate all sterically allowed conformations for short polyalanyl chains. Only intrachain interactions are considered, including one adjustable parameter, a favorable backbone energy (e.g., a peptide hydrogen bond). The counting is used to reevaluate Flory's isolated-pair hypothesis, the simplifying assumption that each φ,ψ pair is sterically independent. This hypothesis is a conceptual linchpin in helix–coil theories and protein folding. Contrary to the hypothesis, we find that systematic local steric effects can extend beyond nearest-chain neighbors and can restrict the size of accessible conformational space significantly. As a result, the entropy price that must be paid to adopt any specific conformation is far less than previously thought.
Keywords: helix–coil theory, Levinthal paradox
We will have to decide whether the assembly, when left to itself in the way already specified, tends to settle down mainly into one or other of a small preferred group of stationary states, whose properties are or control the equilibrium properties of the assembly; or whether it shows no such discrimination, but wanders apparently or effectively at random over the whole range of stationary states made accessible by the general conditions of the problem (1).
The central thermodynamic question in protein folding is: How can a polypeptide chain overcome conformational entropy and fold to its native state (2)? Typically, the unfolded state is depicted as a rugged energy landscape with an exorbitant number of local minima. Under suitable conditions, the protein negotiates this landscape spontaneously and finds its way to the global minimum—the native state.
This view of the unfolded state corresponds to the latter case referred to by Fowler and Guggenheim (1), in which the assembly wanders apparently at random over the whole range of conceivable stationary states. The view was placed on a rigorous foundation by the work of Flory (3), who showed that each φ,ψ pair in the peptide backbone is sterically insensitive to the values of its neighbors. Flory's simplifying conclusion is known as the isolated-pair hypothesis.
The isolated-pair hypothesis has influenced the development of helix–coil (4–6) and protein-folding theories (7). It follows from the hypothesis that local structural transitions are ruled out as a possible origin of cooperativity in protein folding (8); the entropic price is simply too high for short polypeptide backbones to preferentially populate a small set of highly similar conformations.
In contrast to these ideas, both experiments (9) and calculations (10) suggest the prevalence of biases in polypeptide chains, which motivated us to reevaluate the isolated-pair hypothesis. We find that the former case referred to by Fowler and Guggenheim—in which a small preferred group of states account for the equilibrium properties of the assembly—better describes the situation for polypeptide chains in both folded and unfolded forms. The validity of the isolated-pair hypothesis has also been questioned by Qian and Schellman (although for reasons other than sterics) (6).
We tested the isolated-pair hypothesis by simple enumeration. If the hypothesis holds, the distribution of allowed conformations for any φ,ψ pair in short polyalanine chains will be identical to those expected for an isolated alanine dipeptide. Otherwise, the number of allowed conformations will be abridged.
Exhaustive Enumeration of Allowed Conformations: A Device for Counting.
Our objective is to enumerate all possible conformations for blocked all-atom polyalanine chains: Ac-Alan-N′-methylamide. Chain conformation is specified by the backbone dihedral angles φ and ψ. To count, we use a device in which φ,ψ space for an individual φ,ψ pair is tiled into discrete bins called mesostates (Fig. 1). The 14 nonempty mesostates are: {A,G,M,R,L,F,E,K,Q,J,P,O,I,o} (Fig. 1). Not all regions within a mesostate are allowed.
For each mesostate, Γ independent conformations were generated, of which ΓA are free of steric clashes. Γ and ΓA define an acceptance ratio, Λ = ΓA/Γ, where 0 ≤ Λ ≤ 1. Values of Λ for each of the 14 mesostates are shown (Table 1, Fig. 1); only mesostate L, which includes φ,ψ values for canonical parallel and antiparallel β-strands, has unit weight.
Table 1.
Mesostate | Unnormalized mesostate weights Λi | Standard deviations Δi of unnormalized mesostate weights from their mean values Λi | Mesostate dependent τ angles used in all calculations | Observed φ, Ψ-dependent τ values (18) |
---|---|---|---|---|
A | 0.38 | 0.0016 | 107.6° | 110° |
G | 0.74 | 0.0010 | 108.0° | 110° |
M | 0.45 | 0.0012 | 110.5° | 110° |
R | 0.74 | 0.0011 | 110.5° | 110° |
L | 1.00 | 0 | 110.5° | 109° |
F | 0.52 | 0.0014 | 108.5° | 108° |
E | 0.50 | 0.0015 | 108.3° | 108° |
K | 0.99 | 0.0003 | 110.5° | 110° |
Q | 0.25 | 0.0016 | 110.5° | 110° |
J | 0.75 | 0.0010 | 113.8° | 113° |
P | 0.34 | 0.0013 | 113.8° | 113° |
O | 0.61 | 0.0014 | 111.5° | 112° |
I | 0.74 | 0.0014 | 111.8° | 112° |
o | 0.36 | 0.0013 | 111.5° | 113° |
Polypeptide chain conformations are described by a string of mesostates. Every mesostate string represents a collection of highly similar sterically allowed conformers, like the solution set of an NMR structure (Fig. 2). By using the mesostate description, exhaustive counting of allowed conformations becomes tractable.
A polyalanine chain of length n can be represented in 14n mesostate strings, spanning the complete set of conformational possibilities. The computed mesostate weights (Table 1) were used in biased Monte Carlo sampling (11) to estimate the number of allowed conformations in mesostate strings for chains of length two to seven. For a polyalanine chain of length n, enumeration of Γ conformations within a mesostate string should lead to at most ΓP sterically allowed conformers, where ΓP = Γ ∏i Λi, and each Λi is obtained from Table 1. If the isolated-pair hypothesis is valid, then ΓA = ΓP, and enumeration is not required. Otherwise, ΓA < ΓP, and it is sufficient to generate ΓP conformers, so long as individual mesostate φ,ψ values are sampled from allowed regions in an alanine dipeptide (Fig. 1).
Favorable intrachain interactions, such as backbone hydrogen bonds (12–14), stabilize chain conformations. The number of hydrogen bonds, ν, in every allowed conformation of a mesostate string is counted; each is assigned a value of ɛ ≤ 0 (in kcal/mol). The unnormalized Boltzmann weight can be written as: ∑v=0vmaxgvi exp(−βvɛ), where gvi is the number of conformations in mesostate string i with v hydrogen bonds; β = 1/(RT) is the temperature parameter and R the universal gas constant.
Methods
Hard-Sphere Radii and Contact Distances.
Values for hard-sphere contact distances in this work (Table 2) are similar to literature values (15, 16). A steric clash exists between two atoms when their contact distance is less than the hard-sphere contact distance. Distances between all pairs of atoms separated by four or more bonds were screened for violations, with allowance made for closer contact between hydrogen-bonded atoms.
Table 2.
N | C(sp3) | C(sp2) | O | HN | H | |
---|---|---|---|---|---|---|
N | 2.57 Å | 2.85 Å | 2.71 Å | 2.57 Å | 2.32 Å | 2.32 Å |
(2.33 Å) | ||||||
C(sp3) | 3.14 Å | 2.99 Å | 2.85 Å | 2.52 Å | 2.52 Å | |
C(sp2) | 2.85 Å | 2.71 Å | 2.38 Å | 2.38 Å | ||
O | 2.57 Å | 2.32 Å | 2.32 Å | |||
(1.71 Å) | ||||||
HN | 1.90 Å | 1.90 Å | ||||
H | 1.90 Å |
All contact distances were obtained from Hopfinger (16) and softened by a factor of 0.95. Values in parentheses were used when the atoms in question are in a hydrogen bond.
Mesostate-Dependent N–Cα–C′ Bond Angles.
All bond lengths, bond angles, and peptide torsion angles (ω) were held fixed at recommended values (Table 3). The recommended value for the N–Cα–C′ bond angle (τ) is 110.5° (17). However, the τ angle often deviates from this value, allowing proteins to populate φ,ψ regions that would be otherwise disallowed, as shown by Karplus (18). Moreover, it has also been shown that this deviation is coupled to the backbone conformation (18, 19). Therefore, we use φ,ψ-dependent τ values (Table 1), which were chosen to simultaneously optimize sterically allowed regions and continuity between adjacent mesostates (Fig. 1). Our optimized values (Table 1, column 4) are in good agreement with those observed in protein structures (Table 1, column 5).
Table 3.
Atom name | Atom description | Bond type | Bond length, Å | Angle type | Bond angle |
---|---|---|---|---|---|
C(sp3) | sp3 carbon atom | C(sp3)–C′ | 1.525 | H–C(sp3)–C′ | 109.6° |
C′ | Peptide group carbonyl carbon | H–C(sp3) | 1.08 | C(sp3)–C′–N | 116.2° |
O | Peptide group carbonyl oxygen | C′–O | 1.231 | O–C′–C(sp3) | 120.5° |
N | Peptide group amide nitrogen | N–C′ | 1.329 | O–C′–N | 123.3° |
Cα | Backbone α carbon | N–H | 1.008 | C′–N–H | 119.15° |
H | Hydrogen atom | N–Cα | 1.458 | C′–N–Cα | 121.7° |
Cα–C(sp3) | 1.521 | N–Cα–C(sp3) | 110.4° | ||
C–C′ | 1.525 | H–C(sp3)–Cα | 109.6° | ||
N–C(sp3) | 1.458 | C(sp3)–Cα–C′ | 110.5° | ||
Cα–C′–O | 120.5° | ||||
Cα–C′–N | 116.2° | ||||
C′–N–C(sp3) | 121.7° | ||||
N–C(sp3)–H | 110.0° | ||||
H–N–C(sp3) | 119.15° | ||||
H–C(sp3)–H | 109.6° |
Identifying Hydrogen Bonds.
The only possible hydrogen bonds in polyalanine are between donors and acceptors separated by at least one residue in sequence. Geometric criteria for hydrogen bond identification are identical to those used for protein structures (20). Each donor or acceptor atom is allowed to participate in only one hydrogen bond, and the maximum number of backbone hydrogen bonds in a chain of length n is n − 1.
Validating the Isolated-Pair Hypothesis.
Γ conformations were generated within each mesostate string, and the number allowed, ΓA, was counted. If the isolated-pair hypothesis holds, then ΓA ≈ Γexpected = Γ ∏i Λi, where the Λi are from Table 1. Conversely, if ΓA < Γexpected, then the isolated-pair hypothesis fails.
Consider the ratio ρ = ΓA/Γexpected. If ρ ≠ 1, then the isolated-pair hypothesis fails, unless the value of this ratio is confounded by sampling error. The latter possibility was tested by determining whether ρ falls within a suitable error interval, bounded above and below by the variances of mesostate weights, Δi (from Table 1). Two numbers were calculated: ΓU = Γ ∏i=1n (Λi + Δi) and ΓL = Γ ∏i=1n (Λi − Δi). ΓU > Γexpected > ΓL and ρU < ρexpected < ρL, where ρU = ΓA/ΓU and ρL = ΓA/ΓL. If ρL > 1, the isolated-pair hypothesis is valid.
Results
The isolated-pair hypothesis is valid for any combination of the nine mesostates surrounding the extended (i.e., β-strand) region of φ,ψ space—{A,G,M,R,L,F,E,K,Q}—as illustrated in Fig. 3. However, even in chains as short as five residues, the isolated-pair hypothesis fails for any combination of the remaining five mesostates, situated near contracted regions of φ,ψ space—{J,P,O,I,o}—as illustrated in Fig. 3. Mixed strings, comprised of mesostates taken from both sets, with at least two consecutive residues from the contracted region, also violate the hypothesis.
Length Scale of Local Effects.
This result raises questions about the conventional polymer definition of a local interaction for polypeptides. In polymer theory, contacts are classified as either local or nonlocal. Local contacts are limited to nearest-chain neighbors; all others are, by definition, nonlocal (7). The classification is a natural one if each φ,ψ pair is independent (3). However, it is inappropriate here because the isolated-pair hypothesis is not general, as shown above. Only nearest-neighbor contacts are possible in an alanine dipeptide. If systematic steric clashes extend between monomers situated at i ± 2, i ± 3, i ± 4, i ± 5, … , i ± x along the linear chain, then local interactions extend beyond nearest-neighbor boundaries. This is exactly the case for the mesostate strings that invalidate the isolated-pair hypothesis. Fig. 4 shows the fraction of non-nearest-neighbor steric clashes as a function of monomer separation in a 9-mer, for conformations in polyJ, polyP, polyO, polyI, and polyo mesostate strings. Almost all non-nearest-neighbor clashes are between monomers at [i, i + 3], [i, i + 4], [i, i + 5], and [i, i + 6]. As a general rule, when two or more mesostates are from the set {J,P,O,I,o}, there are fewer allowed conformations than predicted by the product of independent isolated pairs (i.e., by the product of weights in Table 1).
Steric Clashes in Helical Conformers.
Helical conformations were explored to illustrate the effect of systematic non-nearest-neighbor steric clashes. In detail, conformations accessible to the polyO (i.e., helical) mesostate string were enumerated in N-Ac-Ala9-N′-methylamide. In four separate experiments, values for the central residue of this peptide were varied uniformly over mesostate O (Fig. 1), while the remaining eight residues were held fixed within each quadrant of this mesostate: (i) (−78°, −67°); (ii) (−78°, −42°); (iii) (−53°, −67°), or (iv) (−63°, −45°). The values assigned in experiment iv are those of an ideal helix (21). If the central residue is independent of its chain neighbors, the distribution of allowed values for this φ,ψ pair resembles the alanine dipeptide map, Fig. 5a. In fact, no sterically allowed conformers were found in three of the four experiments, i–iii. In experiment (iv), steric constraints imposed by chain neighbors in an ideal helix limit the central residue to a small subset of the alanine dipeptide map φ,ψ values (Fig. 5b). A generalized version of these experiments (i–iv) was also performed, in which all φ,ψ values were varied independently for a 9-mer in a completely unrestricted polyO mesostate. Some of the previously proscribed φ,ψ values in the central residue—disallowed in a hydrogen-bonded helix with good geometry—are recovered by compensatory conformational changes in one or more of the other residues. Nevertheless, conformational restrictions still remain, as seen in Fig. 5c. For comparison, Fig. 5d shows the distribution of O mesostate φ,ψ values in 10,879 nonglycine, nonproline residues obtained from high-resolution protein structures. The observed distribution in Fig. 5d is qualitatively similar to that of a central residue in a well-formed helix (Fig. 5b), and unlike that of the allowed region for mesostate O in a Ramachandran map (Fig. 5a). Finally, we repeated Flory's original experiment (3), in which all residues in the 9-mer except the central one are fixed in trans [i.e., (φk,ψk) ≡ (−180°, 180°), ∀ k ≠ 5], and all allowed values for (φ5,ψ5) are sampled. Confirming Flory's result, the distribution is identical to the Ramachandran map of Fig. 1. Clearly, the isolated-pair hypothesis holds in the limited region of φ,ψ-space explored by Flory's experiment, but it is invalid for polypeptide chains in general.
All-or-None Behavior.
Mesostate string Boltzmann weights were used to address questions about conformational entropy and counterbalancing enthalpy. Specifically, we calculated the mean radius of gyration, <Rg>, as a function of intrachain hydrogen bond strength, ɛ. The radius of gyration is a useful measure that distinguishes between contracted and extended chains. Changing ɛ mimics changes in solvent conditions. As ɛ becomes increasingly negative, intrachain interactions are strengthened, akin to a move toward poor solvent (22, 23). Results are summarized in Fig. 6. Notably, even short polyalanine chains exhibit two-state behavior (Fig. 6); intermediate states are only marginally populated. The two predominant states are a set of extended conformations, favored when chain entropy dominates (weak hydrogen bonding), and a set of contracted conformations, favored when internal energy dominates (strong hydrogen bonding). This behavior resembles the cooperative all-or-none transitions seen in protein folding (24).
To annotate the all-or-none transition, structures were sampled at three representative points (ɛ = −4.0 kcal/mol, ɛ = −2.0 kcal/mol and ɛ = −1.0 kcal/mol) along the curve of <Rg> vs. ɛ for n = 7 (Fig. 6). The set of structures shown in Fig. 6 account for more than 90% of the Boltzmann-weighted population at the chosen values of ɛ. At ɛ = −4.0 kcal/mol, the chain is mostly 310 helix, with some α-helix. At the midpoint, ɛ = −2.0 kcal/mol, extended structures begin to contribute significantly, together with type II and type III turns.
Effective Size of Conformational Space.
As shown in Fig. 6, the landscape is dominated by two distinct sets of highly similar conformers. To further explore this phenomenon, the fraction of mesostate strings (total = 14n) required to account for at least 90% of the equilibrium population was calculated as a function of hydrogen bond strength (Fig. 7). For strong hydrogen bonds (small ɛ), the population is determined primarily by a very small number of helical mesostate strings, and the range of ɛ values for which these states dominate increases with chain length. When helical states are not favored (as ɛ→0), the number of states required to account for 90% of the equilibrium population increases sharply and then plateaus. The plateau value decreases as chain length increases. Notably, the effective size of conformational space is winnowed as chain length increases, even when the dominant contribution to equilibrium is entropic (as ɛ→0). This behavior is not anticipated by the isolated-pair hypothesis.
In summary, the thermodynamic properties of short polyalanine chains exhibit all-or-none behavior, like a two-state protein-folding transition (Fig. 6). Increased chain length leads to a significant reduction in the number of structured alternatives available to the chain, with a corresponding reduction in the effective size of conformational space (see Fig. 7 at ɛ = 0). In particular, several φ,ψ combinations from allowed regions of the Ramachandran map for a dipeptide become disallowed in longer chains. As a consequence, the entropic price required to constrain the backbone to helical values (within polyO) decreases with chain length, thereby reducing the hydrogen bond strength needed to populate helical conformers. This trend is evident in the family of curves in Fig. 7, where the transition midpoint shifts to the right as n = 3, 4, 5, 6. As chains approach the length of protein-sized helices, ≈12 residues (25), we estimate that the transition midpoint will be around −1.0 kcal/mol, approximating the experimental value of the peptide hydrogen bond in water (26).
Discussion
Using a dipeptide model, Ramachandran and coworkers (15, 19) described an effective upper limit on the conformational possibilities of a φ,ψ pair. Their model has been validated repeatedly in subsequent experimental work. Backbone dihedral angles in proteins of known structure lie well inside the allowed regions of a φ,ψ map, to the extent that the Ramachandran plot is now used routinely to assess the quality of x-ray structures (27).
Despite this success (28), hard-sphere models are seldom used in theoretical work on protein structure. The issue is one of scale. If each φ,ψ pair is independent (3), constraints that sterics impose on the dipeptide are insufficient to limit the conformations accessible to a peptide backbone, even a short one.
In contrast, our analysis of short polyalanyl chains shows that backbone conformations are limited by additional, systematic steric clashes, a superset of those seen in a dipeptide map. This conclusion is supported by results from accurate enumeration of accessible conformations. To count, we use a device in which φ,ψ space is tiled into discrete bins, called mesostates. Conformations of longer chains are described by a string of mesostates. Only two types of interactions are used: one repulsive, the other attractive. Each is constructed to map a continuous variable into a discrete range; a given distance between two atoms is either allowed or disallowed, and a potential hydrogen bond between a donor and acceptor is either made or broken. This approach enables conformations for short polyalanyl chains to be enumerated exhaustively.
There are many shortcomings in the weighting scheme used here. Repulsive interactions are well represented, but attractive interactions are lumped into a single crude approximation. The main issue pertaining to repulsive effects is the choice of hard-sphere radii. Our radii are among the most permissive, close to the lower limits set by Ijima et al. (29). Also, the N–Cα–C′ bond angles are strained slightly to accommodate experimentally confirmed conformations at the edge of allowed regions (18, 30).
Regarding attractive interactions, we ignore explicit peptide-solvent contributions, which are known to be important (31–34), our hydrogen-bonding criteria are incomplete (14, 35), and the mesostate description would be questionable if used in conjunction with a distance-dependent potential, because conformers within a given mesostate string can have similar intrachain contacts but dissimilar intramolecular energies.
Despite these shortcomings, the phenomenological model used here allows us to address the central thermodynamic question in protein folding: How can a polypeptide chain overcome conformational entropy and fold to its native state (2)? Conformational entropy is estimated by exhaustive enumeration of sterically allowed conformations. Further, the attractive interaction energy (ɛ), albeit crude, is sufficient to construct a phase diagram, akin to a folding transition.
A coherent picture emerges from these considerations. Polypeptide backbones form helices when intrachain interactions are sufficiently strong, as is the case in water (26) or water/trifluroethanol mixtures (36). Short chains fluctuate about canonical 310 and α helices, but fluctuations are reduced in longer chains, which are helical over a wider range of ɛ values. Outside the helical regime, the backbone populates extended conformations preferentially. Thermodynamic averages for these chains exhibit the characteristic two-state behavior seen in natural proteins (37, 38). The two states are ensembles of either energetically favored contracted hydrogen-bonded helical structures or entropically favored extended strand-like structures (Fig. 6). Intermediate constructs are only marginally populated.
Since its inception, the isolated-pair hypothesis has played a pivotal role in helix–coil theories and protein folding. In helix–coil theory, the entropic cost of helix formation increases as the volume of the accessible coil region increases. It is this entropic cost that sets the expected length of stable helix under given solvent conditions. In water, stable short helices would be strongly disfavored. Similarly in protein folding, as the number of conceivable states accessible to the unfolded protein escalates, so does the entropic cost of populating one region uniquely. In general, the notion that allowed conformers grow exponentially with chain length has fostered the popular view that accessible conformational space is vast, and the corresponding energy landscape is rugged.
An ongoing challenge to theorists is to explain the mechanism by which this entropic cost is paid. What energetic factors account for the surprisingly short length of an average protein-sized helix, ≈12 residues (25)? If conformational space is vast, as believed, how can a protein find its native pinhole in biological real time while avoiding metastable traps en route (1)? Yet protein-sized helices can be stable in water (9), and proteins do fold.
The foregoing analysis demonstrates these problems are more conceptual than actual. In helix–coil theory, the volume of the coil region is much smaller than predicted by the isolated pair hypothesis, lowering the entropic barrier for helix formation. In protein folding, most conceivable states are inaccessible, winnowing the effective size of conformational space and biasing the unfolded molecule toward organized structure (10). We anticipate that both theory and experiment are poised to provide further insight into the unfolded state of proteins.
Acknowledgments
We are grateful to Robert Baldwin, Trevor Creamer, S. Walter Englander, Alan Grossfield, P. Andrew Karplus, Venkatesh Murthy, Teresa Przytycka, and Bruno Zimm for much useful discussion. This work was supported by grants from the National Institutes of Health and the Mathers Foundation.
Footnotes
See commentary on page 12391.
References
- 1.Fowler R H, Guggenheim E A. Statistical Thermodynamics. London: Cambridge Univ. Press; 1939. p. 6. [Google Scholar]
- 2.Levinthal C. In: How to Fold Graciously. Debrunner P, Tsibris J C M, Münck E, editors. Urbana, IL: Univ. of Illinois Press; 1969. pp. 22–24. [Google Scholar]
- 3.Flory P J. Statistical Mechanics of Chain Molecules. New York: Wiley; 1969. p. 252. [Google Scholar]
- 4.Zimm B H, Bragg J K. J Chem Phys. 1959;31:526–535. [Google Scholar]
- 5.Lifson S, Roig A. J Chem Phys. 1961;34:1963–1974. [Google Scholar]
- 6.Qian H, Schellman J A. J Phys Chem. 1992;96:3987–3997. [Google Scholar]
- 7.Dill K A. Protein Sci. 1999;8:1166–1180. doi: 10.1110/ps.8.6.1166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Chan H S, Bromberg S, Dill K A. Philos Trans R Soc London. 1995;348:61–70. doi: 10.1098/rstb.1995.0046. [DOI] [PubMed] [Google Scholar]
- 9.Marqusee S, Robbins V H, Baldwin R L. Proc Natl Acad Sci USA. 1989;86:5286–5290. doi: 10.1073/pnas.86.14.5286. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Srinivasan R, Rose G D. Proc Natl Acad Sci USA. 1999;96:14258–14263. doi: 10.1073/pnas.96.25.14258. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Binder K, Heerman D W. Monte Carlo Simulation in Statistical Physics: An Introduction, Springer Series in Solid-State Sciences. New York: Springer; 1997. , Chap. 1. [Google Scholar]
- 12.Schellman J A. J Phys Chem. 1958;62:1485–1494. [Google Scholar]
- 13.Pauling L, Corey R B, Branson H R. Proc Natl Acad Sci USA. 1951;37:205–210. doi: 10.1073/pnas.37.4.205. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Jeffrey G A, Saenger W. Hydrogen Bonding in Biological Structures. Berlin: Springer; 1991. [Google Scholar]
- 15.Ramachandran G N, Ramakrishnan C, Sasisekharan V. J Mol Biol. 1963;7:95–99. doi: 10.1016/s0022-2836(63)80023-6. [DOI] [PubMed] [Google Scholar]
- 16.Hopfinger A J. Conformational Properties of Macromolecules. New York: Academic; 1973. p. 41. [Google Scholar]
- 17.Engh R A, Huber R. Acta Crystallogr. 1991;47:392–400. [Google Scholar]
- 18.Karplus P A. Protein Sci. 1996;5:1406–1420. doi: 10.1002/pro.5560050719. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Ramachandran G N, Sasisekharan V. Adv Protein Chem. 1968;23:283–438. doi: 10.1016/s0065-3233(08)60402-7. [DOI] [PubMed] [Google Scholar]
- 20.Stickle D F, Presta L G, Dill K A, Rose G D. J Mol Biol. 1992;226:1143–1159. doi: 10.1016/0022-2836(92)91058-w. [DOI] [PubMed] [Google Scholar]
- 21.Schulz G E, Schirmer R H. Principles of Protein Structure. New York: Springer; 1979. pp. 68–69. [Google Scholar]
- 22.Flory P J. Principles of Polymer Chemistry. Ithaca, NY: Cornell Univ. Press; 1953. [Google Scholar]
- 23.Chan H S, Dill K A. Annu Rev Biophys Biophys Chem. 1991;20:447–490. doi: 10.1146/annurev.bb.20.060191.002311. [DOI] [PubMed] [Google Scholar]
- 24.Ginsburg A, Carroll W R. Biochemistry. 1965;4:2159–2174. [Google Scholar]
- 25.Presta L G, Rose G D. Science. 1988;240:1632–1641. doi: 10.1126/science.2837824. [DOI] [PubMed] [Google Scholar]
- 26.Scholtz J M, Marqusee S, Baldwin R L, York E J, Stewart J M, Santoro M, Bolen D W. Proc Natl Acad Sci USA. 1991;88:2854–2858. doi: 10.1073/pnas.88.7.2854. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Morris A L, MacArthur M W, Hutchinson E G, Thornton J M. Proteins Struct Funct Genet. 1992;12:345–364. doi: 10.1002/prot.340120407. [DOI] [PubMed] [Google Scholar]
- 28.Richards F M. Annu Rev Biophys Bioeng. 1977;6:151–176. doi: 10.1146/annurev.bb.06.060177.001055. [DOI] [PubMed] [Google Scholar]
- 29.Ijima H, Dunbar J B J, Marshall G R. Proteins. 1987;2:330–339. doi: 10.1002/prot.340020408. [DOI] [PubMed] [Google Scholar]
- 30.Esposito L, Vitagliano L, Sica F, Sorrentino G, Zagari A, Mazzarella L. J Mol Biol. 2000;297:713–732. doi: 10.1006/jmbi.2000.3597. [DOI] [PubMed] [Google Scholar]
- 31.Lifson S, Oppenheim J Chem Phys. 1960;33:109–115. [Google Scholar]
- 32.Makhatadze G I, Privalov P L. In: Energetics of Protein Structure. Anfinsen C B, Edsall J T, Richards F M, Eisenberg D S, editors. Vol. 47. San Diego: Academic; 1995. pp. 307–425. [DOI] [PubMed] [Google Scholar]
- 33.Honig B, Yang A-S. Adv Protein Chem. 1995;46:27–58. doi: 10.1016/s0065-3233(08)60331-9. [DOI] [PubMed] [Google Scholar]
- 34.Luo P, Baldwin R L. Proc Natl Acad Sci USA. 1999;96:4930–4935. doi: 10.1073/pnas.96.9.4930. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Mitchell J B O, Price S L. J Comp Chem. 1990;11:1217–1233. [Google Scholar]
- 36.Luo Y, Baldwin R L. Biochemistry. 1997;27:8413–8421. doi: 10.1021/bi9707133. [DOI] [PubMed] [Google Scholar]
- 37.Brandts J F. J Am Chem Soc. 1964;86:4302–4314. [Google Scholar]
- 38.Brandts J F. J Am Chem Soc. 1964;86:4291–4301. [Google Scholar]
- 39.Berman H M, Westbrook J, Feng Z, Gilliland G, Bhat T N, Weissig H, Shindyalov I N, Bourne P E. Nucleic Acids Res. 2000;28:235–242. doi: 10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Hobohm U, Sander C. Protein Sci. 1994;3:522–524. doi: 10.1002/pro.5560030317. [DOI] [PMC free article] [PubMed] [Google Scholar]