Abstract
The efficient and accurate characterization of solvent effects is a key element in the theoretical and computational study of biological problems. Implicit solvent models, particularly generalized Born (GB) continuum electrostatics, have emerged as an attractive tool to study the structure and dynamics of biomolecules in various environments. Despite recent advances in this methodology, there remain limitations in the parameterization of many of these models. In the present work, we demonstrate that it is possible to achieve a balanced implicit solvent force field by further optimizing the input atomic radii in combination with adjusting the protein backbone torsional energetics. This parameter optimization is guided by the potentials of mean force (PMFs) between amino acid polar groups, calculated from explicit solvent free energy simulations, and by conformational equilibria of short peptides, obtained from extensive folding and unfolding replica exchange molecular dynamics (REX-MD) simulations. Through the application of this protocol, the delicate balance between the competing solvation forces and intramolecular forces appears to be better captured, and correct conformational equilibria for a range of both helical and β-hairpin peptides are obtained. The same optimized force field also successfully folds both beta-hairpin trpzip2 and mini-protein Trp-Cage, indicating that it is quite robust. Such a balanced, physics-based force field will be highly applicable to a range of biological problems including protein folding and protein structural dynamics.
Keywords: continuum electrostatics, hairpin, helix, molecular dynamics, Poisson-Boltzmann, replica exchange, structure prediction, trp-cage, trpzip
Introduction
Successful applications of molecular dynamics (MD) simulations to studying the structure and function of biomolecules hinge on the quality of the underlying molecular force field and the sampling efficacy of the simulation protocol. In particular, the solvent environment plays a critical role in the structure, dynamics and function of biomolecules. However, efficient and accurate treatment of solvation has been a perpetual problem in molecular modeling, despite its prime importance.1,2 Explicit inclusion of all solvent molecules arguably provides the most accurate and detailed description, but it significantly increases the computational cost and severely limits the simulation timescale and amount of sampling that are practically achievable. Furthermore, interesting quantities such as solvation energies converge very slowly because all solvent degrees of freedom need to be averaged out. Therefore, it is often desirable to describe the mean influence of solvent molecules around the solute without having to treat the solvent explicitly.This has motivated continual efforts in the development of various implicit solvent models.2-4 Implicit solvent models may yield considerable disagreement with explicit water simulations due to the absence of the granularity of solvent molecules, especially in short-range effects when the detailed interplay of a few water molecules (which are significantly distinct from the bulk water) is important.4,5 However, there are many biological problems for which implicit solvent models can provide insights that are very difficult to gain from explicit solvent models, such as protein-protein or protein-ligand binding thermodynamics, scoring of protein conformations in structure prediction, protein conformational changes upon binding and pH changes, and peptide and protein folding and unfolding studies.2,6-8
In the popular continuum electrostatics treatment of solvent, the solute interior and solvent region are described as featureless ”low” (solute) and ”high” (solvent) dielectric regions respectively.3 The electrostatic solvation energy of a solute with an arbitrary shape, including the solvent-screened charge-charge interactions, can be rigorously calculated from numerical solutions of the Poisson-Boltzmann (PB) equation using finite-difference methods.9-12 While particular successes in applications to complex biomolecular systems are evident,13,14 the computational cost of solving the PB equation remains a bottleneck to its application to protein folding and routine dynamics simulations of biomolecules, despite progress in fast PB computational methodologies.15,16
Based on the same underlying continuum representation, the generalized Born (GB) formalism approximates the PB electrostatic solvation energy as an efficient pairwise summation that allows analytical force calculations,17,18
(1) |
where rij is the distance between atoms i and j, qi is the atomic charge, is the so-called “effective Born radius” of atom i and ε is the solvent (high) dielectric constant. F is an empirical factor whose value may range from 2 to 10, with 4 being the most common value. Note that the (low) dielectric constant of the solute interior is assumed to be 1 (same as vacuum). ΔGelec then corresponds to the electrostatic free energy of transferring the solute from vacuum to a medium of dielectric constant ε. The effective Born radius, , is a key quantity in the GB formalism. It corresponds to the distance between a particular atom and its hypothetical spherical dielectric boundary, chosen such that the self (or atomic) electrostatic solvation energy, ΔGelec,i, satisfies the Born equation,19
(2) |
In principle, the “exact” effective Born radii can be calculated from Eqn. (2) using the self electrostatic solvation energy obtained through the PB theory. The principal assumption in the GB method is that the solvent-shielded charge-charge interactions can be reproduced by the cross-term summation in Eqn. (1) with the effective Born radii. Indeed, Eqn. (1) has been shown to closely reproduce the PB electrostatic solvation energy, provided that the effective Born radii are accurate.20,21 As such, most of the extensive literature on extensions of the GB theory has been focused on efficient and accurate evaluation of the Born radii, and ΔGelec,i or from PB calculations serve as standard benchmarks for assessing various GB approximates. Many modifications, extensions and improvements have been made over the last several years22-37 and various implementation are now available in virtually all major molecular modeling software packages. At present, the GB formalisms have reached a mature stage and the achievable accuracy can be essentially identical to the PB method.21 Successful applications to various biological problems have demonstrated the great potential of the GB implicit solvent models for studies of biomolecular structure and function.2,8 The main limitation of GB at present lies in its parameterization, manifested as several limitations observed previously indicating over-stabilized salt-bridges and distorted peptide and protein conformational equilibria.7,8
The successes and failures of various solvent models arise in principle from their ability to balance delicate energetics between sets of competing interactions, i.e., the solvation preference of sidechains and backbones in solution versus the strength of solvent-mediated interactions between these moieties in a complex protein environment. The intra-molecular Coulombic interaction energy in the protein is known to be strongly anti-correlated with the electrostatic solvation energy. Similarly, the intra-molecular van der Waals (vdW) dispersion interaction energy in the protein also strongly anti-correlates with the nonpolar solvation energy.8 These competing, opposing forces mostly cancel each other, and a shift in the balance, depending upon the extent of specific interactions in a given protein conformation and environment, can lead to a bias in conformational equilibria. To what extent a GB implicit solvent force field can capture this delicate balance is a key in the success of its applications. For example, as mentioned above, it has been noticed previously that many existing continuum electrostatics solvation models (GB as well as PB) over-stabilize salt-bridges,38-42 which can partially account for the observed discrepancies in the conformational equilibria and free energy surfaces for several peptides.40,43,44 This over-stabilization might be amplified even more in the low dielectric protein interior, which appears to be particularly problematic in applications such as protein design.45 Unfortunately, achieving sufficient balance of the competing interactions in a force field for complex heterogeneous systems is a challenging task. In addition to the general difficulty that force fields optimized with high-level quantum mechanics are not directly transferable to solvent environments, there is a severe lack of direct experimental data on solvation energies of proteins as well as the pairwise interactions between polar groups in solvent environments. As such, it appears that one has to resort to explicit water simulations and available (indirect) experimental observables (e.g., thermodynamic stability and conformation equilibria of peptides and proteins) in the implicit solvent force field optimization efforts.8
Toward this end, we first examine the solvent-mediated interactions between polar groups present in proteins and optimize the implicit solvent force field based on potentials of mean force (PMF) obtained from explicit solvent simulations. In continuum electrostatics, the extent of solvent exposure of each atom at the dielectric boundary dictates all of the electrostatic and most of nonpolar solvation energetics. Thus, it is physically appropriate to optimize the input radii, by which the low dielectric region and the high dielectric region are divided, not only based on total solvation free energy of individual sidechains but also in consideration of solvent-mediated interactions. We have demonstrated previously that by adjusting the GB input radii for the peptide backbone, it is possible to reproduce the solvent mediated backbone H-bond strength given by TIP3P explicit water model46 and thus improve the agreement with experimentally measured conformational equilibria of small helical peptides.8 In principle, the partial charges, Lennard-Jones parameters, and torsional energetics in the underlying molecular mechanics force field may also need to be adjusted for a specific implicit solvent model to achieve sufficient balance. However, given that current force fields have been extensively calibrated over the past decades to achieve proper solvent-solute and solute-solute interactions in explicit solvent,47 at this stage, it is reasonable to focus primarily on optimizing the input radii in the GB implicit solvent. Furthermore, analogous to previous efforts to improve the treatment of the peptide backbone in the context of the TIP3P explicit solvent,48 we also empirically adjust backbone dihedral energetics self-consistently with the GB input radii optimization to achieve proper conformation equilibria. Note that adjustment of backbone torsion energetics has also been previously applied to fine tune the Amber force fields.49
The atomic input radii for continuum electrostatics have been previously optimized based on the radial solvent charge distribution to reproduce the electrostatic solvation energy obtained from explicit solvent charging free energy calculations for both proteins50,51 (hereinafter referred to as the Nina’s radii) and nucleic acids.52 The Nina’s radii set has been shown to work well in several applications including peptide folding53 and protein NMR structure refinement.54,55 Therefore it offers a good starting point for further optimization. The input radii are further optimized here to explicitly balance the interactions between amino acid polar groups in a GB implicit solvent, guided by PMFs obtained from explicit solvent free energy simulations. The optimized radii are then assessed by extensive folding and unfolding simulations of a range of peptides and mini-proteins. Correct prediction of conformational equilibria for both helical peptides and β-hairpins has been considered as an important aspect of a molecular force field. It has been the focus of many force field development and parameterization efforts.37,49,56-60 The simulated conformational equilibria also provide a key feedback for the parameterization of both the input radii and backbone dihedral energetics. An important limitation of such a recursive approach is the slow convergence of conformational equilibria even for small peptides. In this study, an advanced sampling technique, namely, the replica exchange molecular dynamics (REX-MD) method61-64 has been used extensively to speed up the conformational sampling.
Methods
Force Field
The CHARMM22/CMAP all-atom force field65-67 with a GBSW implicit solvent model33 is optimized in this work. GBSW employs a vdW-based surface with a smooth dielectric boundary. Born radii are calculated by a rapid volume integration scheme that includes a higher-order correction term to the Coulomb field approximation, as introduced previously for a closely related GBMV implementation.31 Default GBSW parameters were used with a 0.6 Å smoothing length (i.e., w=0.3 Å) along with 50 Lebedev angular integration points and 24 radial integration points up to 20 Å for each atom.33 The nonpolar solvation energy was estimated from the solvent-exposed surface area (SA) using a phenomenological surface tension coefficient of 0.005 kcal/mol/Å2.
Model Peptides and Proteins
A primary focus of the current optimization efforts is to achieve proper balance of secondary structure preferences. As such, we have chosen a range of α and β peptides and a designed mini-protein. The systems are listed in Table 1, with a summary of representative folding simulations previously reported in the literature. Note that there is not yet a single force field that can provide proper conformational equilibria for all these peptides, while some of these force fields did successfully fold subsets of both α and β peptides as well as other sequences not listed. A similar deficiency of the current protein force fields in balancing the secondary structure preferences was also recently observed using C-peptide of RNase A and GB1p β-hairpin.68 Consistent with the experimental conditions, both termini of (AAQAA)3 peptide were blocked with Ace and NH2 respectively; the C-terminal of trpzip2 was blocked with NH2; and all the other peptides were simulated with unblocked termini. The sequences of the β-hairpins are as following: GEWTYDDATKTFTVTE (GB1p); GEWTYDDATKTATVTE (GB1m1); KKYTWNPATGKATVQE (HP5A); KKWTYNPATGKFTVQE (GB1m3); SWTWENGKWTWK-NH2 (trpzip2). Note that GB1m1, HP5A and GB1m3 are derived from the native sequence of the C-terminal β-hairpin (residues 41-56) of the B1 domain of protein G (GB1p) but display reduced or enhanced stability: (unfolded) GB1m1 < HP5A < GB1p < GB1m3 (most folded).69 Therefore, these peptide sequences provide a particularly useful control for the optimization.
Table 1.
Peptide | PDB ID | Structure | Length | Force Fields | References |
---|---|---|---|---|---|
(AAQAA)3 | N/A | α (50%a) | 15 | CHARMM19/SAS CHARMM22/SCP-ISM Amber84/m-GBSAd |
56 70 71 |
GB1p | 3gb1 (41-56) | β (30-80%b) | 16 | OPLS/GBSA OPLS-AA/SGB OPLS-AA/AGBNP CHARMM19/cGBSTILL |
72 73 57 74 |
GB1m1 | N/A | β (6%c) | 16 | Irbäck and Mohanty | 60 |
HP5A | N/A | β (21%c) | 16 | N/A | N/A |
GB1m3 | N/A | β (86%c) | 16 | Irbäck and Mohanty | 60 |
trpzip2 | 1le1 | β (90%d) | 12 | Amber99-m1e/GBSA OPLS-AA/GBSA Amber96/GBSA |
49 58 75 |
Trp-cage | 1l2y | α/coil | 20 | Amber99-m2f/GBSA Amber94/GBSA Amber-m3g/GBSA CHARMM19/(ACE,EEF1,SASA) CHARMM22/ACE CHARMM22/CMAP/ACE PFF01 Irbäck and Mohanty |
76 77,78 79 80 80 80 81 60 |
helicity measured by NMR chemical shifts at 270K.82
population estimated from multiple NMR chemical shift probes (∼30% at 298 K69 or 42% at 278 K83) and from the tryptophan fluorescence experiment (∼80% at 273 K).84
folded population estimated by NMR chemical shifts at 298K;69
population estimated from thermal unfolding analysis.85
with modified backbone dihedral energetics.
with modified backbone dihedral energetics.
with a new but unreferenced Amber force field.
Interaction Models and PMF Calculations
Figure 1 shows a list of the polar amino acid sidechain models for which pair-wise interactions were examined. A alanine dipeptide (Ace-Ala-Nme) was used to model the peptide backbone. A modified alanine dipeptide dimer was used to mimic backbone hydrogen bonding interaction, which was described elsewhere.8 All molecules were described by the CHARMM22 all atom force field. A total of 23 hydrogen bonding pairs between the sidechains as well as between the sidechains and backbone in various configurations (side, head-to-head, or stacking approaches) were studied. Some examples of the dimer configurations are given in Figure 2 and the rest is shown in the Supporting Material.
In the explicit solvent simulations the dimers were constrained to move along a reaction coordinate, i.e., a straight line in specific dimer orientations (see Figure 2) using the MMFP module in CHARMM.86 The system was solvated by about 750 TIP3P46 water molecules in a rectangular box with periodic boundary conditions. To remove the artifacts associated with truncation of electrostatic forces, the Particle-Mesh Ewald method (PME)87 was used to calculate the long-range electrostatic interactions. The vdW energy was smoothly switched off over the range of 10-12 Å by use of a switching function.88,89 Biased sampling along the reaction coordinate was carried out using the umbrella sampling technique90 and the final PMF was calculated using the weighted histogram analysis method (WHAM).91,92 For each window, equilibration simulations of 60 picoseconds (ps) at constant pressure and temperature (NPT) were followed by 1.0 nanosecond (ns) of production sampling at constant volume and temperature (NVT). The SHAKE algorithm93 was applied to fix lengths of all bonds involving hydrogen atoms and a time-step of 2 femtoseconds (fs) was used. Corresponding PMFs in implicit solvent were computed directly by translating the molecules away from each other along the reaction coordinate. Note that the resulting PMFs do not include the contribution of solute conformational entropy. However, this contribution is assumed to be similar in both explicit and implicit solvent models and thus omitting it in both cases should not affect the optimization results.
Backbone Dihedral Energetics
Modification of the backbone dihedral energetics was made possible by the ϕ/ψ CMAP torsion crossterm recently introduced in CHARMM.48,66,67 As proper balance of secondary structure preference is one of the primary goals, the modifications were focused on the extended (β) and helical regions of the ϕ/ψ space. Stabilization (or de-stabilization) of particular conformations was achieved by adding cosine shaped “valleys” (or “humps”) centered at the appropriate ϕ/ψ coordinates. For example, stabilization of the extended (β) conformation was achieved by the following modification,
(3) |
with , l = 1, 2, where the radius r = 45° and the centers (ϕ1, ψ1) = (-120°1, 125°) and (ϕ2, ψ2) = (-150°, 160°).
The input radii were first systematically optimized to reproduce the pairwise interaction strengths between the polar groups, as shown in Fig. 2. An iterative procedure was adopted to empirically tune the radii as well as the backbone dihedral energetics, guided and judged by extensive folding and unfolding simulations. The basic strategy is as follows. Whenever the backbone energetics is changed, the backbone input radii are adjusted such that helicity of (AAQAA)3 is close to the experimental value (∼ 50% at 270 K). Folding and unfolding simulations of the GB1p series β-hairpins (GB1m1, GB1p, HP5A, GB1m3) are followed to examine whether the correct folding thermodynamics is obtained. The final parameters are then further examined by folding simulations of trpzip2 and Trp-cage as well as control simulations of a range of other proteins (see Results and Discussion). Note that due to the large parameter space and slow convergence of simulations, popular semi-automatic optimizations procedures such as the z-score optimization59,94-96 are too expensive to be used here.
Folding and Unfolding Simulations
The folding (starting from fully extended structures) and unfolding/control (starting from the native structures) were carried out using the REX-MD facility available in the MMTSB Tool Set63,64 (available from http://mmtsb.scripps.edu) together with the CHARMM program.86 Briefly, multiple copies (replicas) of the system are simulated at different temperatures independently and simultaneously. Exchanges of simulation temperatures are periodically attempted according to a Metropolis type algorithm. In the course of an REX-MD simulation, replicas can travel up and down the temperature space automatically in a self-regularized fashion, which, in turn, induces a nontrivial walk in temperature space and reduces the probability of being trapped in states of local energy minima. 16 replicas in a temperature range of 270 K to 550 K were used in all simulations unless otherwise noted. The temperatures were distributed exponentially within the specified ranges. SHAKE was applied to fix the lengths of all bonds with hydrogen atoms and a time-step of 2 femtoseconds (fs) was used. Exchanges of simulation temperatures were attempted every 2.0 picoseconds (ps) of MD. The total simulation lengths range from 20 nanoseconds (ns) for (AAQAA)3 to 50 ns for the β-hairpins. The overall exchange ratios of these simulations range from 0.3 to 0.5.
Structural Analysis
The post-analysis was done largely with CHARMM and the MMTSB Tool Set. The helicity was computed from the average 1-4 hydrogen bond frequency defined by the criteria dOi··HNi+4 ≤ 2.6 Å, where dOi··HNi+4 is the distance between the carbonyl oxygen of residue i, Oi, and the amide hydrogen of residue i + 4, HNi+4. Note that using backbone dihedral criteria resulted in similar but shifted helicity curves for the (AAQAA)3 peptide (data not shown). Similar distance criteria were used to count the backbone hydrogen bonds in the β-hairpins. Sidechains are considered to be in contact if the shortest distance among heavy atoms is no greater than 4.2 Å.
Results and Discussion
Input Radii Optimization
The backbone input radii were optimized first to reproduce the backbone hydrogen bonding strength (using a modified alanine dipeptide dimer8) in TIP3P explicit solvent and the experimental helicity of (AAQAA)3. It turned out that only the amide nitrogen (CHARMM atom type: NH1) needed to be adjusted in the original Nina’s set. Note that adjustment of backbone input radii is strongly coupled with the backbone dihedral modification (described in the next section). Once the backbone input radii were chosen, input radii of polar sidechains were optimized to maximally reproduce the strengths of pair-wise interactions between the polar groups. The final modifications to the Nina’s radii set are summarized in Table 2. Only a few atom types need to be adjusted and the modifications mainly involve heavily charged nitrogen atoms. Even though most changes are small, they often dramatically improve the agreement of interaction strength with explicit solvent results. For example, Figure 2 compares the PMFs of several dimers in the GBSW implicit solvent before and after optimization with the explicit solvent interaction curves. It shows that many interactions are indeed significantly over-stabilized with the original Nina’s radii and such over-stabilization can be effectively eliminated or reduced by the input radii optimization. Note that the solvation peaks (oscillations in the TIP3P PMFs) are mostly absent in the GBSW implicit solvent, which is due to the lack of solvent granularity and adoption of a vdW-like surface. The first solvation peak can be effectively reproduced by incorporating the solvent reentrant surface in the dielectric boundary such as in the GBMV model,31 without having to include any explicit water molecules in the implicit solvent. However, it is not clear whether there is any significant consequence in capturing such fine details in the interactions. While the folding kinetics might be altered, the absence of large solvation peaks might actually speed up the conformational sampling without introducing any thermodynamic bias.
Table 2.
Residue | Atom | Nina (Å) | New (Å) |
---|---|---|---|
backbone | NH1 | 2.30 | 2.03 |
Lys | NZ | 2.13 | 1.80 |
Arg | N* | 2.13 | 1.70 |
CZ | 2.80 | 2.20 | |
Gln/Asn | O* | 1.42 | 1.60 |
N | 2.15 | 2.00 | |
Hse | ND | 1.80 | 1.90 |
Hsp | N* | 2.30 | 1.90 |
Trp | NE | 2.40 | 1.85 |
C* | 1.78 | 2.00 |
refers to a wild card character.
Backbone Dihedral Crossterm Modifications
While modifications to both helical and extended regions in the ϕ/ψ space were explored during the iterative empirical optimization, the final adjustment only involves stabilization of the extended region. The analytical expression of Eqn. 3 was used with kmax = 1.5 kcal/mol. Figure 3 shows a contour plot of the final adjustment to the original CMAP based on high-level quantum mechanical (QM) calculations.66 It is interesting to note that stabilization of the extended (β) region with respect to the helical region (∼ 1.5 kcal/mol) happens to agree well with that in the latest empirically adjusted CMAP for the TIP3P water based on explicit solvent simulations of several proteins in both crystal and aqueous environments.48 However, changes to the QM CMAP surface are more extensive in the CMAP for the TIP3P water and both helical and extended ϕ/ψ regions are further stablized. A stabilization of extended conformations is also in line with observations that helical conformations seem to be over-stabilized in the CHARMM22/QM CMAP force field with the TIP3P water. For example, it predicts (AAQAA)3 to be over 90% helical at 270 K, in contrast to an experimental value of about 50%.82
Conformational Equilibria of (AAQAA)3 and GB1p Series Peptides
Here we only present the results of simulations using the final optimized implicit solvent force field, i.e., CHARMM22 with the modified implicit solvent CMAP (denoted CMAPGBSW) plus the GBSW implicit solvent with the optimized input radii as described above. Figure 4 shows the simulated helicity of(AAQAA)3 computed from a 20 ns REX-MD folding simulation in comparison with the experimental results.82 Many folding and unfolding events were observed during the course of the simulation. The values of overall helicity computed using different time intervals indicate that the simulation converges well (see Figure 4 a). The computed helicity of ∼65% at 270 K is in reasonable agreement with the experimental value of ∼ 50%. Furthermore, the simulated and experimental distributions of residue helicity also agree well with a high correlation coefficient of R = 0.78.
In Figure 5, we compare the probability distributions of the number of native hydrogen bonds at 270 K for the β-hairpin series derived from residues 41-56 in the fragment of the protein G B1 domain. Since the folding timescale of β-hairpins is significantly longer than that of helix-coil transition, convergence of simulations of limited total length (tens of ns) is not guaranteed even if certain properties converge to some plateau values. As such, the REX-MD simulations were initiated from both fully extended conformations (folding) and folded hairpin conformations (control) to further examine the degree of convergence. The folded hairpin conformations were built from an NMR structure of protein G B1 domain (PDB ID: 3gb1).97 The total simulation lengths ranged from 30 ns for the control simulations up to 50 ns for the folding simulations. Multiple folding and unfolding events were observed in most simulations. Conformations during the last 10 ns were used to computed the probability distributions shown in Figure 5. The distributions for GB1m3 and GB1m1 appear to converge more readily, reflected by the good agreement between results from the control and folding simulations. However, convergence was not achieved for GB1p even with 50-ns REX-MD simulations. This might be related to the possible difference in the folding/unfolding rates of the three sequences. A recent study demonstrates that a stronger turn-promoting sequence (such as the D47P mutation in GB1m3) increases the hairpin stability primarily by increasing the folding rate, whereas a stronger hydrophobic cluster stabilizes the hairpin by decreasing the unfolding rate.98 Furthermore, the poor convergence for GB1p folding simulation might be related to the fact that the REX setup is suboptimal for GB1p. The melting temperature of GB1p was shown to be Tm ∼ 300 K, compared to Tm ∼ 330 K for Gb1m3.69 With only 16 replicas spanning 270 K to 550 K, only three replicas are actually simulated under the Tm of GB1p. An additional REX-MD simulation with the temperature range reduced to 270-400 K (with 6 temperature windows below Tm ∼ 300 K) appears to converge faster, yielding a native hydrogen bond probability distribution very similar to that of the control simulation (see Figure 5 b). Note that both folding and control simulations of HP5A were carried out with 16 replicas spanning 270-400 K based on the same reasoning. These folding and unfolding simulations seem to correctly reproduce the experimental results that GB1m3 is the most folded and GB1m1 is largely unfolded. Furthermore, assuming conformations with as native, the native populations turn out to be about 88% for GB1m3, 61% (fold_2) for GB1p, 43% for HP5A and 0% for GB1m1 from the folding simulations, which are also in very good agreement with the experimental data (see Table 1). Representative structures from the folded structures of HP5A, GB1p and GB1m3 are shown in Figure 5 e, in comparison with the fragment structure from the protein G B1 domain. All stand-alone hairpins show a characteristic twist observed in other stable hairpins such as the trpzip series.85 Close packing of hydrophobic sidechains are present in most folded structures. In particular, residue Phe52 in GB1m3 packs with both Tyr45 and Trp43 and thus contributes significantly to the observed stability. Mutation of Phe52 to alanine thus dramatically destabilizes the hairpin (such as in GB1m1).
Folding of Trpzip2 and Trp-Cage
To further examine the quality of the optimized implicit solvent force field, both control and folding REX-MD simulations of a range of proteins of various size and topology were carried out. REX control simulations of 5 to 10 ns in length were performed for: a designed ββα motif FSD-1 (PDB:1FSV); helical bundle proteins, the villin headpiece (PDB:1VII) and B domain of protein A (PDB:1BDC); two β sheet motifs, betanova and a WW domain (PDB:1E0L); and two α/β proteins, protein G B1 domain (PDB:3GB1) and a dihydrofolate reductase complex (PDB:1RX7). The results demonstrate that most proteins are stable with native secondary structures and tertiary packing well conserved. The only two exceptions are betanova and FSD-1, where the tertiary structure and part of secondary structures are lost at the end of the simulations. However, these results seem to agree with experimental measurements For example, the folded population in aqueous solution was estimated to be about 8% for betanova99 and the beta-hairpin in FSD-1 appears to be less stable than the helix.100 Thus we conclude that the consistent GBSW implicit solvent optimized force field well produces the conformations of natively folded proteins.
Folding proteins using a first-principles approach is much more costly and is typically limited to small proteins. Here we present the results of folding two sequences, trpzip2 and Trp-cage. Trpzip2, a 12-residue tryptophan zipper, is a designed β-hairpin with a type I’ turn and contains a characteristic structural motif of tryptophan-tryptophan cross-strand pairs .85 It is the smallest peptide to adopt an unique tertiary β-fold with exceptional stability. Trp-cage is a 20-residue designed mini-protein with a stable compact folded state.101 The native structure contains a short α-helix, a single turn of 310-helix and a rigidified poly-proline C-terminal tail. The well-structured hydrophobic core consists of the indole sidechain of Trp6 buried between rings of Pro12 and Pro18. The structure is further stabilized by two tertiary hydrogen bonds, one between the sidechain of Trp6 (Nε1H) and the backbone of Arg16 (CO) and the other one between the backbone groups of Trp6 (CO) and Gly11 (NH). The small sizes and extraordinary stabilities of these two peptides make them ideal for computer simulation studies of protein folding (e.g., see Table 1). Here we are mainly interested in examining the ability of the optimized implicit solvent force field to fold these peptides correctly. Accurate estimate of thermodynamic properties requires even more extensive simulations and therefore was not attempted here.
Two 40-ns REX-MD simulations were carried out to fold trpzip2 from a fully extended conformation. Two folding events were observed in one of the simulations while none was observed in the other. Figure 6a shows the correlation of Cα RMSD to the NMR structure (PDB ID: 1le1) with the total potential energy at the the lowest temperature (270K) from both simulations. Clearly, the folded conformations (with RMSD∼ 1.0Å) have lower average energies. The fact that no folding event was observed in one of the REX-MD simulation probably reflects a sampling limitation, constrained by the total simulation length as well as the sampling protocol. Note that trpzip2 is mainly stabilized by reducing the unfolding rate through the hydrophobic sidechain contacts. Forming the type I’ turn requires the backbone of Asn6 to adopt an αL conformation, possibly further limiting the folding rate of trpzip2 (compared to the GB1p series hairpins described above). The predicted folded structure, obtained by clustering the last 10 ns of the successful folding simulation (run1 of Figure 6 a), contains all native structure characteristics, including the type I’ turn, two tryptophan sidechain contacts and all five native backbone hydrogen bonds. However, the configuration of the tryptophan sidechains appears to be versatile and the most populated packing, particularly, Trp4 and Trp11, is slightly different from the NMR structures.
The results of the Trp-cage folding simulation are shown in Figure 7. The energy versus RMSD plot of the lowest temperature ensemble (see Figure 7 a) shows a strong correlation and the native states (low RMSD values) are significantly stabilized with respect to other conformations. Multiple folding and unfolding events were observed during a total of 30 ns REX-MD simulation, three of which are shown in Figure 7 b. Detailed structural analysis reveals that all secondary structure elements and the major tertiary contacts (see above) are correctly formed in the dominant clusters, with two representative structures shown in Figure 7 c. The indole sidechain of Trp6 forms native contacts with the proline rings as well as the backbone of Arg6 in most low RMSD (low energy) structures. Near native conformations (e.g., with RMSD values around 2 Å) often contain a less compact hydrophobic core and distorted 310 helical turn. Consequently, the poly-proline tail is less ordered with respect the N-terminal helix. Most misfolded structures observed involve the Trp6 sidechain either trapped in wrong orientation in the cage or completely blocked outside of the cage. One of the frequently observed misfolded structures is shown in Figure 7 c.
Conclusion
An implicit solvent force field has been optimized in the context of the GBSW model33 in CHARMM. The input radii, by which the solute-solvent boundary is defined, can be further optimized by directly examining the underlying pair-wise interaction between amino acid polar groups. Due to a paucity in direct experimental data, such an optimization is guided by explicit solvent free energy simulations and by the conformational equilibria of several short peptides. The peptide backbone torsion energetics also needed to be adjusted self-consistently with the GB input radii optimization. Advanced sampling techniques such as the REX-MD method can be effectively used to speed up the convergence of equilibrium thermodynamic properties, facilitating direct comparison with the experimental results. The final optimized implicit solvent force field appears to be properly balanced, correctly reproducing the conformational equilibria of both the helical (AAQAA)3 peptide and the GB1p series β-hairpins. In particular, the force field successfully predicts changes in stability of several sequentially similar hairpins, GB1m1, HP5A, GB1p and GBm3, that were revealed by NMR experiments.69 Successful folding simulations of several nontrivial stable peptides and proteins including trpzip2 and Trp-cage further demonstrate that the optimized force field is quite robust and might be applicable to study the folding of proteins in general. Such a first-principle approach can be a very powerful tool for structural biology as it is based on basic physical principles and free of empirical assumptions. However, it may also often be limited by the extensive sampling required for convergence. For example, difficulties in sufficient sampling and convergence were encountered even with short peptides such as GB1p and trpzip2. As such, continued efforts should be invested in developing more efficient sampling schemes, such as our recent investigation on using torsion angle molecular dynamics to speed up sampling of protein conformations.102 Together with the ever increasing computational power, one might be able to fold more complex proteins, especially those with significant β contents, using an all-atom first-principles approach in the near future.
Supplementary Material
Acknowledgments
CJH thanks a La Jolla Interface of Science postdoctoral fellowship for partial financial support. The authors are grateful to Michael Feig for his efforts in developing the MMTSB Tool Set (http://mmtsb.scripps.edu). We also acknowledge helpful discussions with Drs. Dan Price, Olgun Guvench and Jana Khandogin. This work was supported by the National Institutes of Health (RR12255 and GM48807) and by the Center for Theoretical Biological Physics (CTBP) through funding from the National Science Foundation (PHY0216576).
References
- 1.Brooks CL, III, Karplus M, Pettitt BM. John Wiley and Sons; New York: 1987. [Google Scholar]
- 2.Feig M, Brooks CL., III Curr. Opin. Struct. Biol. 2004;14:217–224. doi: 10.1016/j.sbi.2004.03.009. [DOI] [PubMed] [Google Scholar]
- 3.Roux B, Simonson T. Biophys. Chem. 1999;78:1–20. doi: 10.1016/s0301-4622(98)00226-9. [DOI] [PubMed] [Google Scholar]
- 4.Bashford D, Case DA. Annu. Rev. Phys. Chem. 2000;51:129–152. doi: 10.1146/annurev.physchem.51.1.129. [DOI] [PubMed] [Google Scholar]
- 5.Cramer CJ, Truhlar DG. Chem. Rev. 1999;99:2161–2200. doi: 10.1021/cr960149m. [DOI] [PubMed] [Google Scholar]
- 6.Lazaridis T, Karplus M. Curr. Opin. Struct. Biol. 2000;10:139–145. doi: 10.1016/s0959-440x(00)00063-4. [DOI] [PubMed] [Google Scholar]
- 7.Simonson T. Curr. Opin. Struct. Biol. 2001;11:243–252. doi: 10.1016/s0959-440x(00)00197-4. [DOI] [PubMed] [Google Scholar]
- 8.Im W, Chen J, Brooks CL., III Adv. Prot. Chem. 2006;72:171–195. [Google Scholar]
- 9.Warwicker J, Watson HC. J. Mol. Biol. 1982;157:671–679. doi: 10.1016/0022-2836(82)90505-8. [DOI] [PubMed] [Google Scholar]
- 10.Klapper I, Hagstrom R, Fine R, Sharp K, Honig B. Proteins. 1986;1:47–59. doi: 10.1002/prot.340010109. [DOI] [PubMed] [Google Scholar]
- 11.Nicholls A, Honig B. J. Comput. Chem. 1991;12:435–445. [Google Scholar]
- 12.Im W, Beglov D, Roux B. Comput. Phys. Comm. 1998;111:59–75. [Google Scholar]
- 13.Honig B, Nicholls A. Science. 1995;268:1144–1149. doi: 10.1126/science.7761829. [DOI] [PubMed] [Google Scholar]
- 14.Roux B, Bernèche S, Im W. Biochemistry. 2000;39:13295–13306. doi: 10.1021/bi001567v. [DOI] [PubMed] [Google Scholar]
- 15.Luo R, David L, Gilson MK. J. Comput. Chem. 2002;23:1244–1253. doi: 10.1002/jcc.10120. [DOI] [PubMed] [Google Scholar]
- 16.Prabhu NV, Zhu P, Sharp KA. J. Comput. Chem. 2004;25:2049–2064. doi: 10.1002/jcc.20138. [DOI] [PubMed] [Google Scholar]
- 17.Still WC, Tempczyk A, Hawley RC, Hendrickson T. J. Am. Chem. Soc. 1990;112:6127–6129. [Google Scholar]
- 18.Constanciel R, Contreras R. Theo. Chim. Acta. 1984;65:1–11. [Google Scholar]
- 19.Born M. Z. Phys. 1920;1:45–48. [Google Scholar]
- 20.Onufriev A, Case DA, Bashford D. J. Comput. Chem. 2002;23:1297–1304. doi: 10.1002/jcc.10126. [DOI] [PubMed] [Google Scholar]
- 21.Feig M, Onufriev A, Lee MS, Im W, Case DA, Brooks CL., III J. Comput. Chem. 2004;25:265–284. doi: 10.1002/jcc.10378. [DOI] [PubMed] [Google Scholar]
- 22.Hawkins GD, Cramer CJ, Truhlar DG. J. Phys. Chem. 1996;100:19824–19839. [Google Scholar]
- 23.Schaefer M, Karplus M. J. Phys. Chem. 1996;100:1578–1599. [Google Scholar]
- 24.Qiu D, Shenkin PS, Hollinger FP, Still WC. J. Phys. Chem. A. 1997;101:3005–3014. [Google Scholar]
- 25.Scarsi M, Apostolakis J, Caflisch A. J. Phys. Chem. A. 1997;101:8098–8106. [Google Scholar]
- 26.Ghosh A, Rapp CS, Friesner RA. J. Phys. Chem. B. 1998;102:10983–10990. [Google Scholar]
- 27.Dominy BN, Brooks CL., III J. Phys. Chem. B. 1999;103:3765–3773. [Google Scholar]
- 28.Srinivasan J, Trevathan MW, Beroza P, Case DA. Theor. Chem. Acc. 1999;101:426–434. [Google Scholar]
- 29.Tsui V, Case DA. J. Am. Chem. Soc. 2000;11:2489–2498. [Google Scholar]
- 30.Onufriev A, Bashford D, Case DA. J. Phys. Chem. B. 2000;104:3712–3720. [Google Scholar]
- 31.Lee MS, Salsbury FR, Jr., Brooks CL., III J. Chem. Phys. 2002;116:10606–10614. [Google Scholar]
- 32.Spassov VZ, Yan L, Szalma S. J. Phys. Chem. B. 2002;106:8726–8738. [Google Scholar]
- 33.Im W, Lee MS, Brooks CL., III J. Comput. Chem. 2003;24:1691–1702. doi: 10.1002/jcc.10321. [DOI] [PubMed] [Google Scholar]
- 34.Im W, Feig M, Brooks CL., III Biophys. J. 2003;85:2900–2918. doi: 10.1016/S0006-3495(03)74712-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Gallicchio E, Levy RM. J. Comput. Chem. 2004;25:479–499. doi: 10.1002/jcc.10400. [DOI] [PubMed] [Google Scholar]
- 36.Feig M, Im W, Brooks CL., III J. Chem. Phys. 2004;120:903–910. doi: 10.1063/1.1631258. [DOI] [PubMed] [Google Scholar]
- 37.Zhu J, Alexov E, Honig B. J. Phys. Chem. B. 2005;109:3008–3022. doi: 10.1021/jp046307s. [DOI] [PubMed] [Google Scholar]
- 38.Luo R, David L, Hung H, Devaney J, Gilson MK. J. Phys. Chem. B. 1999;103:727–736. [Google Scholar]
- 39.Gallicchio E, Zhang LY, Levy RM. J. Comput. Chem. 2002;5:517–529. doi: 10.1002/jcc.10045. [DOI] [PubMed] [Google Scholar]
- 40.Zhou R. Proteins. 2003;53:148–161. doi: 10.1002/prot.10483. [DOI] [PubMed] [Google Scholar]
- 41.Masunov A, Lazaridis T. J. Am. Chem. Soc. 2003;125:1722–1730. doi: 10.1021/ja025521w. [DOI] [PubMed] [Google Scholar]
- 42.Khandogin J, Brooks CL., III Biophys. J. 2005;89:141–157. doi: 10.1529/biophysj.105.061341. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Zhou R, Berne BJ. Proc. Natl. Acad. Sci. USA. 2002;99:12777–12782. doi: 10.1073/pnas.142430099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Nymeyer H, Garcia AE. Proc. Natl. Acad. Sci. USA. 2003;100:13934–13939. doi: 10.1073/pnas.2232868100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Jaramillo A, Wodak SJ. Biophys. J. 2005;88:156–171. doi: 10.1529/biophysj.104.042044. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Joregensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. J. Chem. Phys. 1983;79:926–35. [Google Scholar]
- 47.MacKerell AD., Jr. J. Comput. Chem. 2004;25:1584–1604. doi: 10.1002/jcc.20082. [DOI] [PubMed] [Google Scholar]
- 48.MacKerell AD, Jr., Feig M, Brooks CL., III J. Comput. Chem. 2004;25:1400–1415. doi: 10.1002/jcc.20065. [DOI] [PubMed] [Google Scholar]
- 49.Okur A, Strockbine B, Hornak V, Simmerling C. J. Comput. Chem. 2003;24:21–31. doi: 10.1002/jcc.10184. [DOI] [PubMed] [Google Scholar]
- 50.Nina M, Beglov D, Roux B. J. Phys. Chem. B. 1997;101:5239–5248. [Google Scholar]
- 51.Nina M, Im W, Roux B. Biophys. Chem. 1999;78:89–96. doi: 10.1016/s0301-4622(98)00236-1. [DOI] [PubMed] [Google Scholar]
- 52.Banavali NK, Roux B. J. Phys. Chem. B. 2002;106:11026–11035. [Google Scholar]
- 53.Im W, Brooks CL. J. Mol. Biol. 2004;337:513–519. doi: 10.1016/j.jmb.2004.01.045. [DOI] [PubMed] [Google Scholar]
- 54.Chen J, Im W, Brooks CL., III J. Am. Chem. Soc. 2004;126:16038–16047. doi: 10.1021/ja047624f. [DOI] [PubMed] [Google Scholar]
- 55.Chen J, Won H-S, Im W, Dyson HJ, Brooks CL. J. Biomol. NMR. 2004;31:59–64. doi: 10.1007/s10858-004-6056-z. [DOI] [PubMed] [Google Scholar]
- 56.Ferrara P, Apostolakis J, Caflisch A. J. Phys. Chem. B. 2000;104:5000–5010. [Google Scholar]
- 57.Felts AK, Harano Y, Gallicchio E, Levy RM. Proteins. 2004;56:310–321. doi: 10.1002/prot.20104. [DOI] [PubMed] [Google Scholar]
- 58.Ulmschneider JP, Jorgensen WL. J. Am. Chem. Soc. 2004;126:1849–1857. doi: 10.1021/ja0378862. [DOI] [PubMed] [Google Scholar]
- 59.Liwo A, Khalili M, Scheraga HA. Proc. Natl. Acad. Sci. USA. 2005;102:2362–2367. doi: 10.1073/pnas.0408885102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Irbäck A, Mohanty S. Biophys. J. 2005;88:1560–1569. doi: 10.1529/biophysj.104.050427. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Sugita Y, Okamoto Y. Chem. Phys. Lett. 1999;314:141–151. [Google Scholar]
- 62.Hansmann UHE, Okamoto Y. Curr. Opin. Struct. Biol. 1999;9:177–183. doi: 10.1016/S0959-440X(99)80025-6. [DOI] [PubMed] [Google Scholar]
- 63.Feig M, Karanicolas J, Brooks CL., III . MMTSB Tool Set, MMTSB NIH Research Resource. The Scripps Research Institute; 2001. [Google Scholar]
- 64.Feig M, Karanicolas J, Brooks CL., III J. Comp. Graph and Modl. 2004;22:337–395. doi: 10.1016/j.jmgm.2003.12.005. [DOI] [PubMed] [Google Scholar]
- 65.MacKerell AD, Jr., et al. J. Phys. Chem. B. 1998;102:3586–3616. doi: 10.1021/jp973084f. [DOI] [PubMed] [Google Scholar]
- 66.Feig M, MacKerell AD, Jr., Brooks CL., III J. Phys. Chem. 2003;107:2831–2836. [Google Scholar]
- 67.MacKerell AD, Jr., Feig M, Brooks CL., III J. Am. Chem. Soc. 2004;126:698–699. doi: 10.1021/ja036959e. [DOI] [PubMed] [Google Scholar]
- 68.Yoda T, Sugita Y, Okamoto Y. Chem. Phys. 2004;307:269–283. [Google Scholar]
- 69.Fesinmeyer RM, Hudson FM, Andersen NH. J. Am. Chem. Soc. 2004;126:7238–7243. doi: 10.1021/ja0379520. [DOI] [PubMed] [Google Scholar]
- 70.Hassan SA, Mehler EL. Int. J. Quant. Chem. 2001;83:193–202. [Google Scholar]
- 71.Liu Y, Beveridge DL. Proteins. 2002;46:128–146. doi: 10.1002/prot.10020. [DOI] [PubMed] [Google Scholar]
- 72.Zagrovic B, Sorin EJ, Pande V. J. Mol. Biol. 2001;313:151–169. doi: 10.1006/jmbi.2001.5033. [DOI] [PubMed] [Google Scholar]
- 73.Zhou RH, Berne BJ. Proc. Natl. Acad. Sci. USA. 2002;99:12777–12782. doi: 10.1073/pnas.142430099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Jang S, Shin S, Pak Y. J. Am. Chem. Soc. 2002;124:4976–4977. doi: 10.1021/ja025675b. [DOI] [PubMed] [Google Scholar]
- 75.Yang WY, Pitera JW, Swope WC, Gruebele M. J. Mol. Biol. 2004;336:241–251. doi: 10.1016/j.jmb.2003.11.033. [DOI] [PubMed] [Google Scholar]
- 76.Simmerling C, Strockbine B, Roitberg AE. J. Am. Chem. Soc. 2002;124:11258–11259. doi: 10.1021/ja0273851. [DOI] [PubMed] [Google Scholar]
- 77.Pitera JW, Swope W. Proc. Natl. Acad. Sci. USA. 2003;100:7587–7592. doi: 10.1073/pnas.1330954100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Carnevali P, Toth G, Toubassi G, Meshkat SN. J. Am. Chem. Soc. 2003;125:14244–14245. doi: 10.1021/ja036647b. [DOI] [PubMed] [Google Scholar]
- 79.Chowdhury S, Lee MC, Xiong G, Duan Y. J. Mol. Biol. 2003;327:711–717. doi: 10.1016/s0022-2836(03)00177-3. [DOI] [PubMed] [Google Scholar]
- 80.Steinbach PJ. Proteins. 2004;57:665–677. doi: 10.1002/prot.20247. [DOI] [PubMed] [Google Scholar]
- 81.Schug A, Herges T, Wenzel W. Phys. Rev. Lett. 2003;91:158102. doi: 10.1103/PhysRevLett.91.158102. [DOI] [PubMed] [Google Scholar]
- 82.Shalongo W, Dugad L, Stellwagen E. J. Am. Chem. Soc. 1994;116:8288–8293. [Google Scholar]
- 83.Blanco G, J. F, Rivas, Serrano L. Nature Struct. Biol. 1994;1:584–590. doi: 10.1038/nsb0994-584. [DOI] [PubMed] [Google Scholar]
- 84.Munoz V, Thompson PA, Hofrichter J, Eaton WA. Nature. 1997;390:196–199. doi: 10.1038/36626. [DOI] [PubMed] [Google Scholar]
- 85.Cochran AG, Skelton NJ, Starovasnik MA. Proc. Natl. Acad. Sci. USA. 2001;98:5578–5583. doi: 10.1073/pnas.091100898. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Brooks BR, Bruccoleri RE, Olafson BD, States DJ, Swaminathan S, Karplus M. J. Comput. Chem. 1983;4:187–217. [Google Scholar]
- 87.Essmann U, Perera L, Berkowitz ML, Darden T, Lee H, Pedersen LG. J. Chem. Phys. 1995;103:8577–8593. [Google Scholar]
- 88.Brooks CL, III, Pettitt BM, Karplus M. J. Chem. Phys. 1985;83:5897–5908. [Google Scholar]
- 89.Steinbach PJ, Brooks BR. J. Comput. Chem. 1994;15:667–683. [Google Scholar]
- 90.Torrie GM, Valleau JP. J. Comp. Phys. 1977;23:187–199. [Google Scholar]
- 91.Kumar S, Bouzida D, Swendsen RH, Kollman PA, Rosenberg JM. J. Comput. Chem. 1992;13:1011–1021. [Google Scholar]
- 92.Roux B. Comput. Phys. Comm. 1995;91:275–282. [Google Scholar]
- 93.Ryckaert JP, Ciccotti G, Berendsen HJC. J. Comput. Phys. 1977;23:327–341. [Google Scholar]
- 94.Goldstein RA, Luthey-Schulten ZA, Wolynes PG. Proc. Natl. Acad. Sci., U.S.A. 1992;89:9029–9033. doi: 10.1073/pnas.89.19.9029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Kortemme T, Morozov AV, Baker D. J. Mol. Biol. 2003;326:1239–1259. doi: 10.1016/s0022-2836(03)00021-4. [DOI] [PubMed] [Google Scholar]
- 96.Guvench O, Brooks CL., III J. Chem. Phys. 2006 submitted for publication. [Google Scholar]
- 97.Tjandra N, Garrett DS, Gronenborn AM, Bax A, Clore GM. Nature Struct. Biol. 1997;4:443–449. doi: 10.1038/nsb0697-443. [DOI] [PubMed] [Google Scholar]
- 98.Du D, Zhu Y, Huang CY, Gai F. Proc. Natl. Acad. Sci. USA. 2004;101:15915–15920. doi: 10.1073/pnas.0405904101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.de la Paz ML, Lacroix E, Ramirez-Alvarado M, Serrano L. J. Mol. Biol. 2001;312:229–246. doi: 10.1006/jmbi.2001.4918. [DOI] [PubMed] [Google Scholar]
- 100.Dahiyat BI, Mayo SL. Science. 1997;278:82–87. doi: 10.1126/science.278.5335.82. [DOI] [PubMed] [Google Scholar]
- 101.Neidigh JW, Fesinmeyer RM, Andersen NH. Nature Struct. Biol. 2002;9:425–430. doi: 10.1038/nsb798. [DOI] [PubMed] [Google Scholar]
- 102.Chen J, Im W, Brooks CL., III J. Comput. Chem. 2005;26:1565–1578. doi: 10.1002/jcc.20293. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.