Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Mar 4.
Published in final edited form as: J Chem Theory Comput. 2008 Sep 6;4(10):1733–1744. doi: 10.1021/ct8001656

Accurate Calculations of Binding, Folding, and Transfer Free Energies by a Scaled Generalized Born Method

Harianto Tjong 1, Huan-Xiang Zhou 1,*
PMCID: PMC3587178  NIHMSID: NIHMS428715  PMID: 23468599

Abstract

The Poisson-Boltzmann (PB) equation is widely used for modeling solvation effects. The computational cost of PB has restricted its applications largely to single-conformation calculations. The generalized Born (GB) model provides an approximation at substantially reduced cost. Currently the best GB methods reproduce PB results for electrostatic solvation energies with errors at ~5 kcal/mol. When two proteins form a complex, the net electrostatic contributions to the binding free energy are typically of the order of 5 to 10 kcal/mol. Similarly, the net contributions of individual residues to protein folding free energy are < 5 kcal/mol. Clearly in these applications the accuracy of current GB methods is insufficient. Here we present a simple scaling scheme that allows our GB method, GBr6, to reproduce PB results for binding, folding, and transfer free energies with high accuracy. From an ensemble of conformations sampled from molecular dynamics simulations, five were judiciously selected for PB calculations. These PB results were used for scaling GBr6. Tests on the binding free energies of the barnase-barstar, GTPase-WASp, and U1A-U1hpII complexes and on the folding free energy of FKBP show that the effects of point mutations calculated by scaled GBr6 are accurate to within 0.3 kcal/mol of PB results. Similar accuracy was also achieved for the free energies of transfer for ribonuclease Sa and insulin from the crystalline phase to the solution phase at various pH’s. This method makes it possible to thoroughly sample the transient-complex ensemble in predicting protein binding rate constants and to incorporate conformational sampling in electrostatic modeling (such as done in the MM-GBSA approach) without loss of accuracy.

I. Introduction

Electrostatic interactions make important contributions to fundamental properties such as protein binding free energy, protein folding stability, and protein solubility. Point mutations and variations of salt concentration and pH are often used to probe such contributions. Computational methods that reliably predict those energetic effects are highly desirable, both for elucidating the underlying physical principles and for protein design. Developing such methods is a formidable task, since the free energy of binding, for example, is a small difference between two large quantities, namely the free energy due to interactions within the complex in the solvent environment and the solvation energy of the subunits in the unbound state. The effects of point mutations on the free energy of binding or folding are even smaller. The Poisson-Boltzmann (PB) equation18 has found great success in modeling electrostatic contributions to mutational effects on protein binding free energy912 and protein folding stability1315. We have also been able to use the PB equation model the effects of salt and pH on protein solubility.16, 17 However, the computational cost of solving the PB equation presents a major stumbling block.

There has been some efforts devoted to the development of fast PB methods.4, 6 An alternative that holds great promise is the generalized Born (GB) model,18 which approximates the PB equation at substantially reduced computational cost. Considerable efforts have been invested on developing GB methods that would hold down the computational cost but have great accuracy in reproducing the PB results.1926

The reduced computational cost of the GB model opens new avenues for more realistic modeling of electrostatic effects, such as the inclusion of protein conformational sampling.27 However, the small magnitudes of electrostatic contributions to binding and folding free energies pose a challenging demand on the accuracy of calculations. Up to date, the errors of the best GB methods in reproducing PB solvation energies are about 0.5% in relative terms, or roughly several kcal/mol in absolute terms.26, 28 The scatter of these errors among different proteins also appears to be random.

The electrostatic contributions to protein-protein binding free energies typically have an order of magnitude of 10 kcal/mol.9, 10 Therefore even the current most accurate GB methods (when benchmarked against PB results) may be inadequate for predicting binding free energy. One idea for improving accuracy is to reparameterize GB methods against PB results. One such attempt was made in calculating protein-ligand binding free energy29, using PB results for a subset of ligands as training data, but the error (as measured against PB results) in that work was still relatively large, about 5 kcal/mol. In fact, our recently developed GB method26, GBr6, has even smaller errors on protein-protein binding free energies (~1 kcal/mol for protein complexes studied here) without any reparameterization.

Here we take a different approach to GB reparameterization. The aim of the reparameterization is to allow for conformational sampling in electrostatic modeling. For each protein or protein complex, we generate an ensemble of conformations from molecular dynamics simulations. The raw GB result for the solvation energy of each conformation in the ensemble is calculated, and five conformations are selected to be representative of the variations in solvation energy of the ensemble. PB results for the solvation energies of the five representative conformations are then calculated and used for scaling the corresponding raw GB results. The scaling factor is finally applied to the rest of the conformation ensemble. Accuracy of the scaling method is assessed by comparing the scaled GB results and PB results for the whole ensemble. While the scaling method is applicable to any GB method, we report here results for GBr6. We refer to the reincarnation scaled GBr6, or sGBr6 in short.

We test scaled GBr6 on a number of applications. Mutational and salt effects on the binding free energies are calculated for four protein-protein and protein-RNA complexes. To assess the accuracy of scaled GBr6 for calculating mutational effects on folding free energy, 26 mutants of the FK506-binding protein (FKBP) are studied. In addition, scaled GBr6 is used to calculate the pH dependence of solubility of two proteins, ribonuclease Sa and insulin.

As another important application, we use scaled GBr6 to calculate the electrostatic interaction energy of the transient complex along the protein-protein association pathway, which predicts the electrostatic enhancement of the protein binding rate.3033 The transient complex consists of an ensemble of configurations, each of which presents a slightly different relative separation and relative orientation between the two subunits in a complex. Four pairs of binding proteins are studied.

In all these applications, we show that the relevant small differences in electrostatic solvation energy obtained by scaled GBr6 are accurate to within 0.3 kcal/mol of the corresponding PB results. These diverse applications illustrate the wide utility of our GB scaling approach.

II. Theory

In previous studies, we have described the theoretical models for calculating electrostatic contributions to free energies of protein binding9, 10 and folding,13, 14 for calculating the electrostatic rate enhancement of protein binding,3033 and for calculating the pH dependence of protein solubility.17 Here we give brief outlines of these models.

2.1 Binding free energy

When two proteins, A and B, bind to form a complex C, the electrostatic contribution to the binding free energy is calculated as

ΔGb=Gel(C)Gel(A)Gel(B), (1)

where Gel(X), X = A, B, or C, is the electrostatic free energy of molecule X. Gel(X) can be decomposed into a Coulomb term and a solvation term:

Gel(X)=GCoul(X)+Gsolv(X). (2)

By this decomposition, ΔGb has a Coulombic component and a solvation component. These will be denoted as ΔGb; Coul and ΔGb; solv, respectively. When a point mutation is introduced, ΔGb can be calculated for the wild-type (wt) pair of proteins and for the mutant (mt) pair. The change in ΔGb by the mutation is

ΔΔGb=ΔGb(mt)ΔGb(wt). (3)

2.2 Folding free energy

For protein folding, we are interested in mutational effects on the electrostatic contribution to the folding free energy. It is assumed that, in the unfolded state, individual residues do not interact and hence the contributions of residues other than the one under mutation are the same in the wild-type protein and the mutant. (Residual charge-charge interctions in the unfolded state have been modeled previously.34) Neglecting the electrostatic free energy of the “other” residues in the unfoled state (which does not affect the change in folding free energy by the mutation), the electrostatic contribution to the folding free energy is

ΔGf=Gel(protein)Gel(residue), (4)

where the two terms on the right-hand side represent the electrostatic free energies of the protein in the folded state and the residue under mutation, respectively. The solvation component of ΔGf will be denoted as ΔGf; solv. The mutational effect on ΔGf is given by

ΔΔGf=ΔGf(mt)ΔGf(wt). (5)

Notice that even ΔGf(wt) is mutant-specific; we will come back to this point in Subsection 3.4.

2.3. pH dependence of solubility

The effect of pH on protein solubility can be calculated from the pH dependence of the transfer free energy of a protein from the condensed phase to the solution phase. Both the solution phase and the condensed phase are modeled as continuum dielectrics, with the dielectric constants taking a value appropriate for water (equal to 78.5 at room temperature) for the former and a value (~55) intermediate between those for water and for the solute protein. The electrostatic component of the transfer free energy, which is what is relevant for determining the pH dependence of solubility, is

ΔGt=Gsolv(εs=78.5)Gsolv(εs=55), (6)

where Gsolvs) is the electrostatic solvation energy of the solute protein in a continuum solvent with dielectric constant εs. This result uses the fact that the Coulomb term is the same in both phases. Gsolvs) at each pH was calculated by averaging over conformations sampled from constant-pH molecular dynamics simulations.

2.4 Electrostatic rate enhancement

According to the transient-complex theory,30, 32, 33 the rate constant for the association of two proteins under diffusion control can be predicted as

ka=ka0eΔGel*/kBT, (7)

where ka0 is a basal rate constant and ΔGel* is the electrostatic interaction energy of the proteins in the transient-complex ensemble. ΔGel* is calculated as the average over configurations representing the transient complex. For each configuration, the electrostatic interaction energy is as defined by eq (1), except that C now represents the transient-complex configuration.

III. Computation Details

3.1 Generation of conformational ensembles

For calculating electrostatic contributions to protein binding and folding free energies by scaled GBr6, conformations were generated by explicit-solvent molecular dynamics (MD) simulations. Four protein-protein and protein-RNA complexes were studied: an enzyme-inhibitor complex formed by barnase and barstar; complexes formed the Wiskott-Aldrich Syndrome protein (WASp) with two homologous Rho GTPases, Cdc42 and TC10; and a protein-RNA complex formed by the U1A protein and stem/loop II of the U1 small nuclear RNA (U1SLII). The first and last complexes were previously studied by PB calculations.9, 11 The two GTPase-WASp complexes have been studied experimentally.35 These and other systems studied here are shown in Figure 1.

Figure 1.

Figure 1

Systems studied in the present work. (a)–(c) Complexes of barnase and barstar, WASp and Cdc42, U1A and U1SLII. The first and second subunits are shown in blue and red, respectively. (d) FKBP. (e) Ribonuclease Sa. (f) Insulin hexamer. Mutated residues in (a)–(d) and titrated residues in (e) and (f) are labeled.

The simulation of the barnase-barstar complex was carried out as follows. Starting from the X-ray structure of the complex (Protein Data Bank entry 1brs36), hydrogens, four Na+ ions as neutralizing counter ions, and 9,558 TIP3P water molecules were added by the LEAP program in the Amber pakage.37 The side chains of arginine and lysine residues were positively charged, side chains of aspartate and glutamate residues were negatively charged, and side chains of histidine residues were neutral (appropriate for a nominal pH of 7). The force field was ff99SB.38 The solvent (water molecules plus counter ions) was relaxed first by 200 steps of energy minimization and then by 100 ps of MD simulation at a constant pressure MD, all the while the protein complex was fixed. Next, the whole system was energy minimized with decreasing harmonic constraints applied to the protein complex, from 50 kcal/mol·A2 to 0, for a total of 2500 steps. The cutoff for the non-bonded interactions was 9 Å, and the particle mesh Ewald method was used to treat long-range electrostatic interactions. The whole system was then heated for 40 ps at constant volume to a final temperature of 298 K. Finally the simulation was continued at constant pressure and temperature. Bond lengths involving hydrogen atoms were restrained by the SHAKE algorithm39 throughout the simulation, allowing a time step of 2 fs. The first 2 ns of the constant pressure and temperature simulation was discarded; thereafter conformations were sampled at every 10 ps, and a total of 548 conformations were collected.

For the Cdc42-WASp complex, an NMR structure with 20 models was available (Protein Data Bank entry 1cee).40 We randomly picked five (models 1, 5, 9, 13, and 17) of the 20 to generate conformational ensembles. No structure for the TC10-WASp complex was available. We modeled its structure by aligning the structure of unbound TC10 (Protein Data Bank entry 2atx35) to Cdc42 in the five NMR models of the Cdc42-WASp complex. From these 10 starting structures (five for each complex), MD simulations were carried out following the protocol for the barnase-barstar complex. For each of the 10 simulations, 100 conformations were sampled at every 10 ps after discarding the first 1 ns of the constant pressure and temperature simulation.

The starting structure for the U1A-U1SLII complex was as prepared in our previous study.11 100 conformations were sampled at every 20 ps after discarding the first 2 ns of the constant pressure and temperature simulation.

FKBP was chosen as a model system for folding because it is a protein being studied experimentally in our laboratory.41 We have accumulated experimental data for the effects of a large number of charge mutations on the folding stability (J. Batra, HT, and HXZ, to be published). The same mutations are the targets of the present study. Starting from the X-ray structure (Protein Data Bank entry 1fkb42), the MD simulation of FKBP followed the protocol for the barnase-barstar complex. 200 conformations were sampled at every 15 ps after discarding the first 1 ns of the constant pressure and temperature simulation.

Previously we have used PB calculations to predict the pH dependences of the solubility of two proteins, ribonuclease Sa and insulin.17 The transfer free energy from the condensed phase to the solution phase was calculated by averaging over conformations sampled from constant-pH MD simulations, which were based on the GB model.43 For testing scaled GBr6, here we simply took the conformations generated in our previous work and used the PB results calculated there as the benchmark. We just point out that 100 conformations were collected at each of nine pH values (2.3, 2.9, 3.6, 4.0, 4.5, 4.8, 5.0, 5.2, and 5.4) for ribonuclease Sa and at each of eight pH values (4.0, 4.5, 5.0, 5.5, 5.75, 6.25, 6.75, and 7.0) for insulin.

We have obtained PB results for the electrostatic interaction energies of the transient complexes of four protein pairs, formed by barnase and barstar, interleukin-4 (IL4) and interleukin-4 binding protein (IL4BP), colicin E9 and immunity protein 9 (Im9), and acetylcholinesterase (AChE) and fasciculin (fas).3032 Each transient complex was represented by 100 configurations, generated by sampling in the six-dimensional space of relative translation and rotation (the conformations of the two subunits were held fixed). Here we used the same configurations and the PB results to test scaled GBr6.

3.2 Mutation protocol

In the present work mutational effects on binding free energy were calculated for the barnase-barstar complex, the two GTPase-WASp complexes and the U1A-U1SLII complex. Similarly mutational effects on folding free energy were calculated for FKBP. Mutations were modeled on each of the conformations collected from MD simulations of a “wild-type” protein or protein complex, and the protocol for mutation was the same as developed previously in single-conformation studies.9, 11, 13, 14 Briefly, for each single mutation, the LEAP program was used to replace the wild-type side chain with the mutant side chain. The new side chain was then energy minimized in vacuum (while holding the rest of protein or protein complex fixed) up to 50,000 steps. Multiple mutations were decomposed into a series of single mutations.

The 17 mutations on the barnase-barstar complex studied previously9 were also the target of the present work. Of these, 11 are single mutations (bnK27A, bnD54A, bnR59A, bnE60A, bnR83Q, bnR87A, bnH102A, bsD35A, bsD39A, bsE76A, and bsE80A; bn and bs refer to barnase and barstar, respectively). The remaining 6 are double mutations (bnK27A/bsD39A, bnR59A/bsD35A, bnR59A/bsE76A, bnR59A/bsE80A, bnR83Q/bsD39A, and bnR87A/bsD39A). Similarly, seven single mutations (R7Q, K20Q, K22Q, K23Q, K50A, K60Q, and R70Q) on the U1A in its complex with U1SLII, studied previously,11 were the target of the present work.

For the Cdc42-WASp complex, the present study covered of a total of 10 mutations. Of these, 7 were single mutations (E49K on Cdc42, K230A, K230E, K231A, K231E, K232A, and K232E on WASp), 1 was a double mutation (E49K/E178K on Cdc42), and 2 were triple mutations (K230A/K231A/K232A and K230E/K231E/K232E on WASp). Two mutations were studied on the TC10-WASp complex: K63E and K63E/T192E; both are on TC10. For the purpose of making error assessment on scaled GBr6, we group together the 12 mutations on the two related GTPase-WASp complexes.

We studied 26 mutations on FKBP to find their effects on the folding free energy. These include 17 single, 5 double, 2 triple, 1 quadruple, and 1 quintuple mutation (listed on Figure 6 below).

Figure 6.

Figure 6

Comparison of scaled GBr6 and PB results for ΔΔGf of 26 mutations on FKBP. Error bars indicate standard deviations within the conformational ensemble.

3.3 PB and GB calculations

The PB results for benchmarking scaled GBr6 on pH dependence of protein solubility and on electrostatic interaction energy of transient complex were available from previous studies.17, 31 PB results for electrostatic contributions to binding and folding free energies were newly calculated here on the conformational ensembles described above. The calculations were done by the UHBD program2 following a previously established protocol.9, 11, 13, 14 In particular, the dielectric boundary between the protein or RNA low dielectric and the solvent high dielectric was specified by the van der Waals surface instead of the more popularly used molecular surface. We have had most success with the former surface in comparing PB calculations with experimental results for mutational effects on protein folding and binding free energies911, 13, 14 and for electrostatic enhancement of protein binding rates.32, 33 The van der Waals surface is also what is modeled into our GB method, GBr6.26

Mutational effects on the barnase-barstar binding free energy were calculated for an ionic strength of 25 mM. In addition, PB and GB calculations were also carried out for the wild-type complex at ionic strengths of 50, 125, 225, 325, 525, and 1000 mM to test scaled GBr6 on salt effects. The ionic strengths for PB and GB calculations were 100 mM for the GTPase-WASp complexes, 160 mM for the U1A-U1SLII complex, and 150 mM for FKBP. All calculations used the Amber94 atomic partial charges44 and Bondi radii (H, 1.2 Å; O, 1.5 Å; N, 1.55 Å; C, 1.7 Å; S and P, 1.8 Å).45 Except for the U1A-U1SLII complex, all PB results were obtained by solving the linearized PB equation. For the protein-RNA complex, the full PB equation was solved due to the high charges on the system. We have developed a version of GBr6 called GBr6NL,46 which mimics the full nonlinear PB equation. However, since the nonlinear PB equation has not been modeled by other GB methods and our scaling-based reparameterization was intended for GB methods in general, even for the U1A-U1SLII complex we used raw GBr6 results for scaling. (We did use the nonlinear PB results for the U1A-U1SLII complex to test the scaling method on GBr6NL. As expected, the raw GBr6NL results had slightly less deviations from the PB targets than the raw GBr6 results, but the scaling method worked equally well for the two GB versions.)

3.4 GB scaling protocol

Rather than refine GBr6 internally, our strategy for reparameterization was to take raw GBr6 results and post-process them by scaling. The scaling was based on PB results for a small number (five) of conformations judiciously selected as representatives of the conformation ensemble. The selection varied somewhat from application to application, but followed the same overall design, as illustrated in Figure 2.

Figure 2.

Figure 2

Illustration of the scaling protocol.

Let us illustrate the scaling protocol using mutational effects on the binding free energy of a protein complex. First, for each conformation in the ensemble, the raw GBr6 result for ΔGb(wt) was calculated. Then, five conformations were selected for scaling purpose, based on the mean (m) and standard deviation (σ) of the raw GBr6 results within the conformational ensemble. The representative conformations had raw GBr6 results closest to the following target values: m, m ± 0.5σ, m ∓ 0.75σ, m ± 1.0σ, and m ∓ 1.5σ. The first or second set of signs was chosen depending on whether the actual value of the first conformation was above or below m. Other target values were experimented, but the above values resulted in the best overall performance. We now refer to the raw GBr6 results for the solvation component of ΔGb(wt) as ΔGGB(i), i = 1 to 5, of the five conformations

The corresponding PB results, ΔGPB(i), i = 1 to 5, for the five representative conformations were also obtained. The average of the five individual ratios, ΔGPB(i)/ΔGGB(i), was finally taken as the factor (λ) for scaling the raw GBr6 results of the whole conformational ensemble. The same five representative conformations were also used for calculating the scaling factors of all mutants of the same protein complex. For each mutant, calculating the scaling factor involved again obtaining PB results on the five representative conformations. The ratio ΔGPB(i)/ΔGGB(i) could spuriously deviate significantly from 1 when the magnitudes of ΔGb; solv calculated by GBr6 or PB on the representative conformations were too small. To prevent the scaling factor from being biased by such spurious ratios, we filtered out ratios outside the range of 0.75 to 1.25. This filtering was triggered only rarely in the present study.

After the scaling factors were separately found for the wild-type complex and each mutant, the mutational effect on the electrostatic contribution to the binding free energy was calculated as

ΔΔGb=[λ(mt)ΔGGB(mt)+ΔGb;Coul(mt)][λ(wt)ΔGGB(wt)+ΔGb;Coul(wt)]. (8)

This equation was applied to the individual conformations in the ensemble, all using the same scaling factors λ(mt) and λ(wt). The final, ensemble-averaged, prediction for the mutational effect was taken as average of the results for ΔΔGb calculated on the individual conformations. The same procedure was also adopted for studying salt effects on the binding free energy. In that case ‘mt’ and ‘wt’ refer to the salt concentration of interest and a reference salt concentration, respectively.

The scaling protocol for mutational effects on the folding free energy was similar. In this case, ΔGf, as defined in eq (4), was used for scaling. As noted in Subsection 2.2, even ΔGf(wt) is mutant-specific; therefore, unlike in the case of binding free energy, it is not possible to have one set of five representative conformations that is applicable to all mutants. Instead, for each mutant, the five representative conformations were selected based on GBr6 results for ΔGf(wt). The scaling factors for the wild-type protein and the mutant were then found by obtaining PB results for the solvation components of ΔGf(wt) ΔGf(mt) on these five conformations.

Extensions of the scaling protocol to studies of pH dependence of protein solubility and electrostatic rate enhancement were straightforward. For the former we just mention that ΔGt, the electrostatic component of the transfer free energy, was used for scaling. For the latter study, we mention that the interaction energy ΔGel* itself, as opposed to its changes by mutation or by salt, was of direct interest.

IV. Results and Discussion

4.1 Importance of conformational sampling

The premise of our method for GB reparameterization is that results calculated from an extensive conformational ensemble are much more reliable than those from a single conformation. It can be argued, from both fundamental and practical points of view, that a conformational ensemble is superior to a single conformation for predicting binding and folding free energies. In mimicking experimental measurements, an ensemble of conformations is obviously more realistic than a single conformation; experimental results are averages over a conformational ensemble under certain solvent conditions. In a statistical sense, the errors of calculation results decrease with increasing size of the conformational ensemble.

We use PB results for the effects of 17 mutations on the binding free energy of the barnase-barstar complex to illustrate the comparison between ensemble and single-conformation predictions. For each mutation, we take the mean value of the PB results for ΔΔGb over the 548 sampled conformations as the ensemble prediction. For all the mutants, the ranges of variation of ΔΔGb within the conformational ensemble were found to be comparable to the magnitudes of the mean ΔΔGb values (Table 1). For example, for the barnase R59A mutation, PB results for ΔΔGb among the 548 conformations varied from 3.1 to 6.5 kcal, spanning a range of 3.4 kcal/mol. In comparison, the mean ΔΔGb is 4.8 kcal/mol and the standard deviation is 0.6 kcal/mol. The fact that, relative to the mean value, the range of variation of ΔΔGb is large whereas the standard deviation is small provides direct evidence that the ensemble prediction is much more reliable than single-conformation predictions.

Table 1.

PB, sGBr6, and experimental results (in kcal/mol) for ΔΔGb of 17 mutations on the barnase-barstar complexa

Mutants PB range PB m ± sd sGBr6 m ± sd Expt
bnK27A 3.97 to 6.77 5.28 ± 0.50 5.34 ± 0.43 5.4
bnD54A −2.93 to −1.36 −2.05 ± 0.26 −1.91 ± 0.17 −0.9
bnR59A 3.11 to 6.52 4.82 ± 0.57 4.89 ± 0.53 5.2
bnE60A −1.40 to 0.65 −0.52 ± 0.35 −0.31 ± 0.36 −0.3
bnR83Q 4.08 to 8.82 6.72 ± 0.61 6.69 ± 0.57 5.4
bnR87A 3.89 to 6.20 4.91 ± 0.43 4.84 ± 0.39 5.5
bnH102A 0.28 to 2.39 1.50 ± 0.34 1.55 ± 0.36 6.1
bsD35A 2.17 to 6.13 3.82 ± 0.72 3.92 ± 0.61 4.5
bsD39A 5.56 to 10.32 7.97 ± 0.83 8.05 ± 0.79 7.7
bsE76A 1.92 to 4.83 3.35 ± 0.54 3.37 ± 0.52 1.4
bsE80A −0.16 to 0.95 0.38 ± 0.18 0.39 ± 0.13 0.5
bnK27A/bsD39A 6.93 to 11.25 9.24 ± 0.83 9.25 ± 0.75 8.2
bnR59A/bsD35A 4.27 to 9.71 7.02 ± 0.89 7.03 ± 0.80 6.3
bnR59A/bsE76A 2.97 to 6.39 4.63 ± 0.58 4.68 ± 0.53 4.9
bnR59A/bsE80A 2.99 to 6.47 4.74 ± 0.60 4.88 ± 0.54 5.1
bnR83Q/bsD39A 6.93 to 11.68 9.36 ± 0.92 9.33 ± 0.91 6.4
bnR87A/bsD39A 5.88 to 10.52 8.17 ± 0.86 8.20 ± 0.80 7.1
a

PB and sGBr6 results were calculated on 548 conformations sampled from an MD simulation. Range, m, and sd refer to the range of variation, the mean, and the standard deviation, respectively, of ΔΔGb among the conformations.

We further assess the ensemble and single-conformation predictions against experimental binding data for the 17 mutations (Table 1).4749 For each single conformation, we measure prediction error by the root-mean-square-deviation (RMSD) between PB results for ΔΔGb and experimental data for the 17 mutations. The errors calculated over the 548 sampled conformations are presented in Figure 3 as a histogram. They range from 1.25 to 2.27 kcal/mol. In comparison, the ensemble prediction has an RMSD of 1.55 kcal/mol from experiment. The latter value places the ensemble prediction in the 31st percentile of the single-conformation predictions. In other words, the probability of a randomly chosen single conformation performing worse than the ensemble is 69%. We thus show that, relative to single-conformation predictions, the ensemble prediction is not only more precise (as measured by standard deviation of the ensemble) but also more accurate (as measured by RMSD from experiment).

Figure 3.

Figure 3

Distribution of PB-experiment difference among 548 conformations of the barnase-barstar complex. On each conformation, PB results for ΔΔGb of 17 mutations were obtained and their RMSD from experiment was used as the measure of PB-experiment difference. A vertical arrow, at 1.55 kcal/mol, indicates the RMSD of the ensemble prediction.

4.2 Performance of raw GBr6

To establish a baseline for assessing the performance of scaled GBr6, we first benchmark the original GBr6 against PB. For each ΔΔGb, ΔΔGf, ΔGt, or ΔGel* value, we take the ensemble-averaged PB result as the target and measure error as the deviation of the ensemble-averaged GBr6 result from the PB target. When multiple mutants on the same protein or protein complex are studied (for ΔΔGb and ΔΔGf results), or when one protein is studied at different pH (for ΔGt results), we report a single error value, which is given by the RMSD from the PB targets among all the mutants or all the pH values.

The errors of the original GBr6 for the various types of calculations are summarized in Table 2. For the barnase-barstar complex, ΔΔGb errors calculated by GBr6 for the 17 individual mutations ranged from 0.1 to 1.9 kcal/mol, resulting in an overall RMSD of 0.85 kcal/mol. Similar ΔΔGb errors are also seen for mutations on the GTPase-WASp and U1A-U1SLII complexes.

Table 2.

Errors of raw GBr6 and sGBr6 as measured by deviations from PB results (all in kcal/mol)

System # of conf. # of mt or pHa PB results GBr6 error sGBr6 error
ΔΔGb
Bn-Bs 548 17 −2.1 to 9.4 0.85 0.09
GTPase-WASp #1 100 12 −1.2 to 7.9 1.12 0.14
GTPase-WASp #5 100 12 −1.9 to 7.2 1.28 0.19
GTPase-WASp #9 100 12 −1.8 to 8.6 0.95 0.20
GTPase-WASp #13 100 12 −1.5 to 8.3 1.30 0.32
GTPase-WASp #17 100 12 −3.1 to 6.7 1.30 0.32
U1A-U1SLII 100 12 0.1 to 3.1 0.69 0.05
ΔΔGf
FKBP 200 26 −3.5 to 1.4 1.11 0.14
ΔGt
RNase Sa 100 9 −13.8 to −12.5 0.46 0.07
Insulin 100 8 −63.2 to −60.9 6.83 0.34
ΔGel*
Bn-Bs 100 −3.30 2.85 0.07
IL4-IL4BP 100 −3.05 2.94 0.22
E9-Im9 100 −4.33 2.32 0.07
AChE-fas 100 −4.04 4.61 0.06
a

In ΔΔGb and ΔΔGf calculations, multiple mutations on a given protein or protein complex were studied. Error was measured by RMSD among the mutations. In ΔGt calculations, a given protein was studied at multiple pH values; again RMSD was used as the measure of error.

It is interesting to note that the ΔΔGb errors seen on these complexes are suppressed by error cancellation between calculations on a complex and calculations on the separate subunits. For example, for the barnase R59A mutation, GBr6 overestimated the magnitude of the solvation energy of the complex by 2.22 kcal/mol, and at the same time overestimated the magnitudes of the solvation energies of the two subunits by 2.61 and 0.05 kcal/mol, respectively. Taken together, the net error on ΔGb(mt) was only 0.44 kcal/mol. While this error cancellation was widely seen on the protein complexes studied here, there is no guarantee that it will always occur. It is possible that, for a given protein complex, GBr6 overestimates the solvation energy of the complex but at the same time underestimates the solvation energies of the subunits. In that situation, the errors accumulate rather than cancel. Errors from calculations on a mutant and those on the wild-type complex may also either cancel or accumulate. For the barnase R59A mutation, GBr6 errors on ΔGb(mt) and ΔGb(wt) occurred in opposite directions, resulting in an accumulative error of 0.64 kcal/mol on ΔΔGb. On the other hand, for the U1A-U1SLII complex, GBr6 errors on ΔGb(mt) and ΔGb(wt) occurred in the same direction; otherwise the differences in ΔΔGb between GBr6 and PB, which was from solving the nonlinear PB equation for this particular complex, would have been much greater.

The ΔΔGf errors calculated by GBr6 for the 26 individual mutations on FKBP ranged reached 2.4 kcal/mol. Even though the overall RMSD of 1.1 kcal/mol of all the 26 mutations is only slightly more than what was found above for the mutational effects on the binding free energy of the barnase-barstar complex, the ΔΔGf errors on FKBP are much more significant. According to PB calculations (see Figure 6 below), of the 26 mutations, 9 changed the folding free energy of FKBP by less than 0.5 kcal/mol, and another 8 by 0.5 to 1.0 kcal/mol; the largest effect of an individual mutation was 3.5 kcal/mol. In contrast, as shown in Table 1 under the “PB m ± sd” heading, only 2 of the 17 mutations on the barnase-barstar complex affected the binding free energy by less than 1.0 kcal/mol; the largest effect of an individual mutation was 9.4 kcal/mol.

The ΔGt results calculated by GBr6 for ribonuclease Sa at nine pH values deviated from their PB targeted by 0.2 to 0.9 kcal/mol, resulting in an overall RMSD of 0.46 kcal/mol. Deviations of such magnitudes are significant because the PB results at the nine pH values differed at most by only 1.3 kcal/mol, and it is these small differences that predict the pH dependence of the solubility.17 For insulin, the deviations of GBr6 results for ΔGt results at eight pH values were quite large, ranging from 2.5 to 9.3 kcal/mol (with an RMSD of 6.8 kcal/mol). The larger deviations most likely were due to the large size of the insulin hexamer (with a total of 306 residues); our GBr6 method was benchmarked on a set of proteins with up to 250 residues.26

In our previous study,31 the PB results for ΔGel* calculated on the transient-complex ensembles of the barnase-barstar, IL4-IL4BP, E9-Im9, and AChE-fas protein pairs were −3.3, −4.3, −3.1, and −4.0 kcal/mol, respectively. The corresponding GBr6 results obtained here were −0.45, −1.4, −0.73, and 0.57 kcal/mol, respectively. The raw GBr6 results thus had large errors.

While the ensemble-averaged values of ΔΔGb, ΔΔGf, ΔGt, and ΔGel* obtained by GBr6 had relatively large deviations from the PB targets, in each application the GBr6 and PB sets of values calculated on the individual conformations did show strong correlations. In Figure 4 we illustrate the correlations on the E49K/E178K mutation of Cdc42 for binding and on the T21K mutation of FKBP for folding. From linear regression analyses with the requirement of a zero intercept, the correlation R2 values were ~0.99. The strong correlations between raw GBr6 results and their PB counterparts are the foundation of our scaling-based GB reparameterization. The errors of raw GBr6 are manifested by a non-unity slope of the GBr6-PB correlation, which, as will be seen shortly, is rectified by the scaling.

Figure 4.

Figure 4

Figure 4

Correlation of GBr6 and PB results for ΔGb; solvGf; solv), the solvation component of the electrostatic contribution to the binding (folding) free energy. Both sets of results were calculated on the same conformational ensemble. (a)–(b) ΔGb; solv of wild-type Cdc42-WASp complex and the Cdc42 E49K/E178K mutant. (c)–(d) ΔGf; solv of wild-type FKBP and the T21K mutant.

4.3 Performance of scaled GBr6

Upon applying scaling factors calculated on a small subset of five conformations, the errors of GBr6 on ΔΔGb, ΔΔGf, ΔGt, and ΔGel* all dropped to within 0.3 kcal/mol (Table 2). Just like in the previous subsection, errors refer to deviations from the PB targets. For the barnase-barstar complex, the errors of scaled GBr6 for 13 of the 17 mutations was less than 0.1 kcal/mol and the errors of the remaining 4 mutations were less than 0.2 kcal/mol; the overall RMSD was 0.09 kcal/mol. Notice that the latter value is much smaller than the fluctuations of the PB results for ΔΔGb within the conformational ensemble (see Table 1). The final predictions of scaled GBr6 for ΔΔGb are given in Table 1 for comparison against experimental and PB results. The RMSD of scaled GBr6 results from experiment was 1.53 kcal/mol, virtually the same as the 1.55-kcal/mol RMSD of PB results from experiment. We further tested scaled GBr6 for predicting salt effects on the binding free energy of the barnase-barstar complex. According to PB calculations, increasing ionic strength from 25 mM to 1 M reduces the magnitude of the binding free energy by 2.2 kcal/mol. As Figure 5 shows, scaled GBr6 remarkably well reproduces the salt dependence of the binding free energy obtained by PB.

Figure 5.

Figure 5

Comparison of scaled GBr6 and PB results for the salt dependence of the barnase-barstar binding free energy. Changes in ΔGb from the respective results at an ionic strength of 25 mM are displayed.

Very good agreement between scaled GBr6 and PB were also observed for the GTPase-WASp complexes. As summarized in Table 2, the RMSDs of scaled GBr6 from PB, calculated over 12 mutations on the two related complexes, ranged from 0.14 to 0.32 kcal/mol among conformational ensembles sampled using five different NMR models as starting structures. The average RMSD over the five conformational ensembles was 0.23 kcal/mol.

The highly charged protein-RNA complex between U1A and U1SLII did not present an obstacle for the scaling method. The RMSD between scaled GBr6 and PB was just 0.05 kcal/mol among seven mutations. Note that for this system the PB results were obtained from solving the nonlinear PB equation, so the test on this system demonstrates that scaled GBr6 can work well even for highly charged systems such as protein-nucleic acid complexes, for which the nonlinear PB equation would otherwise be required. Solving the full nonlinear PB equation takes much longer CPU time than solving the linearized version. For highly charged systems, scaled GBr6 affords especially significant gain in computational speed and yet can still be highly accurate for calculating small electrostatic effects such as those caused by mutations on binding free energy.

The success of scaled GBr6 with predicting binding free energy was repeated on predicting folding free energy. For 26 mutations on FKBP, the RMSD from PB targets was just 0.14 kcal/mol. A comparison between ΔΔGf results obtained by scaled GBr6 and PB for the 26 mutations is displayed in Figure 6. Note that scaled GBr6 performed equally well for single mutations and for multiple mutations. Once again the deviations of GBr6 from PB are much smaller than the fluctuations of the PB results for ΔΔGf within the conformational ensemble.

Let us use a binding example and a folding example to illustrate how scaled GBr6 achieved its accuracy. The binding example is the E49K/E178K mutation of Cdc42 and the folding example is the T21K mutation of FKBP. The correlations between raw GBr6 and PB for these two examples have been shown in Figure 4. For binding, the slope of the linear repression between raw GBr6 and PB was 1.0306 for the wild-type complex and 1.0427 for the mutant. The optimal scaling would be to use these slopes as scaling factors. However, obtaining these slopes would require PB results for the whole conformational ensemble, which defeats the purpose of designing GB methods. By using PB results for a small subset of five conformations, we obtained scaling factors of 1.0306 for the wild-type complex and 1.0403 for the mutant. These scaling factors are very close to the slopes from linear regression, hence explaining why scaled GBr6 were so accurate. Similarly, for the folding examples, the slopes from linear regression were 1.0004 for wild-type FKBP and 0.9972 for the mutant. For the subset of five conformations, the scaling factors were found to be 1.0016 for the wild-type protein and 0.9982 for the mutant. The latter pairs of values again are very close to the respective slopes.

Raw GBr6 already did a good job in reproducing PB for ΔGt of ribonuclease Sa. Scaling was able to further reduce the RMSD, from 0.46 to 0.07 kcal/mol, among nine pH values. For insulin, scaled GBr6 dramatically reduced the error on ΔGt, from 6.8 kcal/mol to 0.34 kcal/mol. For ΔGel* of the four protein pairs, scaling was able to totally eliminate the large errors of raw GBr6. Scaled GBr6 results differed from the PB targets by 0.2 kcal/mol or less for all the four protein pairs.

We carried out some experimentation with the number of representative conformations used for calculating scaling factors. Our conclusion is that five is the optimal compromise between accuracy (demanding more conformations) and computational cost (demanding less conformations). However, it seems that even with fewer numbers of conformations, the performance of scaled GBr6 will not deteriorate significantly. In our test on the barnase-barstar complex, using only three representative conformations (the first, second, and third, or the first, third, and last of the original five), the RMSD between scaled GBr6 and PB increased only minutely, from 0.09 to 0.14 kcal/mol. Our original GBr6 is already quite accurate. It is expected that, when the scaling scheme developed here is applied to less accurate GB methods, improvement in accuracy even more dramatic than presented here will be achieved.

4.4 Cross validations

It is interesting to know how well the scaling scheme works when scaling factors obtained on five conformations selected from one sub-ensemble are applied to another sub-ensemble. As an example, an early portion of a MD trajectory may be used to generate the conformations for producing the scaling factors, which may then be applied to conformations sampled from the continuation of the MD simulation. To address the question posed above, we designed two types of cross-validation tests, depending on how the conformations were divided into sub-ensembles. The barnase-barstar complex was chosen for this purpose simply because it happened to have the most conformations (548) saved and PB results calculated for benchmarking. In the first cross-validation test, the 548 conformations were divided into four equal sub-ensembles according to time sequence. In the second cross-validation test, every fourth conformation was collected into a sub-ensemble, and each of the fist four conformations of the whole ensemble started a different sub-ensemble.

We refer to the sub-ensemble from which the scaling factors were obtained as the training sub-ensemble, and examine whether the scaling factors when applied to a different sub-ensemble would lead to larger errors than when applied to the training sub-ensemble itself. As the results in Table 3 show, for both types of cross-validation tests, applying the scaling factors to the training or a different sub-ensemble produces very similar errors.

Table 3.

Cross validation of sGBr6 on ΔΔGb results of 17 mutations on the barnase-barstar complexa

Training Type-1 test Type-2 test
Sub-ensemble 1 0.15 (0.08) 0.06 (0.06)
Sub-ensemble 2 0.09 (0.09) 0.15 (0.15)
Sub-ensemble 3 0.14 (0.16) 0.14 (0.14)
Sub-ensemble 4 0.14 (0.12) 0.12 (0.12)
Average 0.13 (0.11) 0.12 (0.12)
a

Under each type of test, error in kcal/mol, as measured by RMSD from PB targets, is given. The number in parentheses represents the error when the scaling factors were applied to the training sub-ensemble itself; the number outside represents the error obtained on the other three sub-ensembles.

A caveat about the cross-validation tests is that the different sub-ensembles must all be part of the conformational fluctuations within a single deep energy well. If the protein or protein complex undergoes a conformational transition and different sub-ensembles are collected before and after the transition, cross validation clearly will fail. It is important that, in designing scaling schemes, such conformational transitions are recognized and one set of scaling factors is used for each unique deep energy well.

V. Conclusion

We have demonstrated that a scaled GB method accurately reproduces PB results for binding and folding free energies, transfer energies between crystalline and solution phases, and electrostatic interaction energies in transient complexes. The scaled GB method thus opens the door to incorporate conformational sampling in robust and accurate modeling of small electrostatic effects, which fall within the error range of current GB methods.

The deviations of scaled GBr6 from PB are much smaller than the fluctuations of PB results within conformational ensembles. Our scaling scheme thus seems to have pushed GB methods to their accuracy limit. In our opinion, reparameterization without reference to any PB results will never be able to reach such a level of accuracy. Combining one of the most accurate raw GB method with a PB-guided scaling method, scaled GBr6 promises to be a prototype for a new generation of fast continuum solvation models for incorporating conformational sampling in binding and folding free energy and other related calculations.

Acknowledgment

This work was supported in part by NIH grant GM058187.

References

  • 1.Gilson MK, Honig B. Calculation of the total electrostatic energy of a macromolecular system: Solvation energies, binding energies, and conformational analysis. Proteins. 1988;4:7–18. doi: 10.1002/prot.340040104. [DOI] [PubMed] [Google Scholar]
  • 2.Madura JD, Briggs JM, Wade R, Davis ME, Luty BA, Ilin A, Antosiewicz J, Gilson MK, Bagheri B, Scott LR, McCammon JA. Electrostatic and diffusion of molecules in solution: simulations with the University of Houston Brownian Dynamics program. Comput. Phys. Commun. 1995;91:57–95. [Google Scholar]
  • 3.Baker NA, Sept D, Joseph S, Holst MJ, McCammon JA. Electrostatics of nanosystems: application to microtubules and the ribosome. Proc. Natl. Acad. Sci. USA. 2001;98:10037–10041. doi: 10.1073/pnas.181342398. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Grant JA, Pickup BT, Nicholls A. A smooth permittivity function for Poisson-Boltzmann solvation methods. J. Comput. Chem. 2001;22:608–640. [Google Scholar]
  • 5.Fogolari F, Brigo A, Molinari H. The Poisson-Boltzmann equation for biomolecular electrostatics: a tool for structural biology. J. Mol. Recognit. 2002;15:377–392. doi: 10.1002/jmr.577. [DOI] [PubMed] [Google Scholar]
  • 6.Luo R, David L, Gilson MK. Accelerated Poisson-Boltzmann calculations for static and dynamic systems. J. Comput. Chem. 2002;23:1244–1253. doi: 10.1002/jcc.10120. [DOI] [PubMed] [Google Scholar]
  • 7.Lu Q, Luo R. A Poisson--Boltzmann dynamics method with nonperiodic boundary condition. J. Chem. Phys. 2003;119:11035–11047. [Google Scholar]
  • 8.Baker NA. Improving implicit solvent simulations: a Poisson-centric view. Curr. Opin. Struct. Biol. 2005;15:137–143. doi: 10.1016/j.sbi.2005.02.001. [DOI] [PubMed] [Google Scholar]
  • 9.Dong F, Vijayakumar M, Zhou H-X. Comparison of calculation and experiment implicates significant electrostatic contributions to the binding stability of barnase and barstar. Biophys. J. 2003;85:49–60. doi: 10.1016/S0006-3495(03)74453-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Dong F, Zhou H-X. Electrostatic contribution to the binding stability of protein-protein complexes. Proteins. 2006;65:87–102. doi: 10.1002/prot.21070. [DOI] [PubMed] [Google Scholar]
  • 11.Qin S, Zhou H-X. Do electrostatic interactions destabilize protein-nucleic acid binding? Biopolymers. 2007;86:112–118. doi: 10.1002/bip.20708. [DOI] [PubMed] [Google Scholar]
  • 12.Bertonati C, Honig B, Alexov E. Poisson-Boltzmann calculations of nonspecific salt effects on protein-protein binding free energies. Biophys. J. 2007;92:1891–1899. doi: 10.1529/biophysj.106.092122. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Vijayakumar M, Zhou H-X. Salt bridges stabilize the folded structure of barnase. J. Phys. Chem. B. 2001;105:7334–7340. [Google Scholar]
  • 14.Dong F, Zhou H-X. Electrostatic contributions to T4 lysozyme stability: solvent-exposed charges versus semi-buried salt bridges. Biophys. J. 2002;83:1341–1347. doi: 10.1016/S0006-3495(02)73904-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Tan YH, Luo R. Protein stability prediction: a Poisson-Boltzmann approach. J. Phys. Chem. B. 2008;112:1875–1883. doi: 10.1021/jp709660v. [DOI] [PubMed] [Google Scholar]
  • 16.Zhou H-X. Interactions of macromolecules with salt ions: an electrostatic theory for the Hofmeister effect. Proteins. 2005;61:69–78. doi: 10.1002/prot.20500. [DOI] [PubMed] [Google Scholar]
  • 17.Tjong H, Zhou H-X. Prediction of protein solubility from calculation of transfer free energy. Biophys. J. 2008 doi: 10.1529/biophysj.107.127746. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Still WC, Tempczyk A, Hawley RC, Hendrickson T. Semianalytical treatment of solvation for molecular mechanics and dynamics. J. Am. Chem. Soc. 1990;112:6127–6129. [Google Scholar]
  • 19.Hawkins GD, Cramer CJ, Truhlar DG. Parametrized models of aqueous free energies of solvation based on pairwise descreening of solute atomic charges from a dielectric medium. J. Phys. Chem. 1996;100:19824–19839. [Google Scholar]
  • 20.Ghosh A, Rapp CS, Friesner RA. Generalized Born model based on a surface integral formulation. J. Phys. Chem. B. 1998;102:10983–10990. [Google Scholar]
  • 21.Bashford D, Case DA. Generalized Born models of macromolecular solvation effects. Annu. Rev. Phys. Chem. 2000;51:129–152. doi: 10.1146/annurev.physchem.51.1.129. [DOI] [PubMed] [Google Scholar]
  • 22.Lee MS, Feig M, Salsbury FR, Jr, Brooks CL., III New analytic approximation to the standard molecular volume definition and its application to generalized Born calculations. J. Comput. Chem. 2003;24:1348–1356. doi: 10.1002/jcc.10272. [DOI] [PubMed] [Google Scholar]
  • 23.Onufriev A, Bashford D, Case DA. Exploring protein native states and large-scale conformational changes with a modified generalized Born model. Proteins. 2004;55:383–394. doi: 10.1002/prot.20033. [DOI] [PubMed] [Google Scholar]
  • 24.Feig M, Brooks CL. Recent advances in the development and application of implicit solvent models in biomolecule simulations. Curr. Opin. Struct. Biol. 2004;14:217–224. doi: 10.1016/j.sbi.2004.03.009. [DOI] [PubMed] [Google Scholar]
  • 25.Gallicchio E, Levy RM. AGBNP: an analytic implicit solvent model suitable for molecular dynamics simulations and high-resolution modeling. J. Comput. Chem. 2004;25:479–499. doi: 10.1002/jcc.10400. [DOI] [PubMed] [Google Scholar]
  • 26.Tjong H, Zhou H-X. GBr6: a parameterization-free, accurate, analytical generalized Born method. J. Phys. Chem. B. 2007;111:3055–3061. doi: 10.1021/jp066284c. [DOI] [PubMed] [Google Scholar]
  • 27.Gohlke H, Kiel C, Case DA. Insights into protein-protein binding by binding free energy calculation and free energy decomposition for the Ras-Raf and Ras-RalGDS complexes. J. Mol. Biol. 2003;330:891–913. doi: 10.1016/s0022-2836(03)00610-7. [DOI] [PubMed] [Google Scholar]
  • 28.Feig M, Onufriev A, Lee MS, Im W, Case DA, Charles L. Brooks, I. Performance comparison of generalized Born and Poisson methods in the calculation of electrostatic solvation energies for protein structures. J. Comput. Chem. 2004;25:265–284. doi: 10.1002/jcc.10378. [DOI] [PubMed] [Google Scholar]
  • 29.Liu HY, Zou X. Electrostatics of ligand binding: parametrization of the generalized Born model and comparison with the Poisson-Boltzmann approach. J. Phys. Chem. B. 2006;110:9304–9313. doi: 10.1021/jp060334w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Alsallaq R, Zhou H-X. Energy landscape and transition state of protein-protein association. Biophys. J. 2007;92:1486–1502. doi: 10.1529/biophysj.106.096024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Alsallaq R, Zhou H-X. Prediction of protein-protein association rates from a transition-state theory. Structure. 2007;15:215–224. doi: 10.1016/j.str.2007.01.005. [DOI] [PubMed] [Google Scholar]
  • 32.Alsallaq R, Zhou H-X. Electrostatic rate enhancement and transient complex of protein-protein association. Proteins. 2008;71:320–335. doi: 10.1002/prot.21679. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Qin S, Zhou H-X. Prediction of salt and mutational effects on the association rate of U1A protein and U1 small nuclear RNA stem/loop II. J. Phys. Chem. B. 2008;112:5955–5960. doi: 10.1021/jp075919k. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Zhou H-X. A Gaussian-chain model for treating residual charge-charge interactions in the unfolded state of proteins. Proc. Natl. Acad. Sci. USA. 2002;99:3569–3574. doi: 10.1073/pnas.052030599. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Hemsath L, Dvorsky R, Fiegen D, Carlier MF, Ahmadian MR. An electrostatic steering mechanism of Cdc42 recognition by Wiskott-Aldrich syndrome proteins. Mol. Cell. 2005;20:313–324. doi: 10.1016/j.molcel.2005.08.036. [DOI] [PubMed] [Google Scholar]
  • 36.Buckle AM, Schreiber G, Fersht AR. Protein-protein recognition: crystal structural analysis of a barnase-barstar complex at 2.0-Å resolution. Biochemistry. 1994;33:8878–8889. doi: 10.1021/bi00196a004. [DOI] [PubMed] [Google Scholar]
  • 37.Case DA, Darden TA, Cheatham TEI, Simmerling CL, Wang J, Duke RE, Luo R, Merz KM, Pearlman DA, Crowley M, Walker RC, Zhang W, Wang B, Hayik S, Roitberg A, Seabra G, Wong KF, Paesani F, Wu X, Brozell S, Tsui V, Gohlke H, Yang L, Tan C, Mogan J, Hornak V, Cui G, Beroza P, Mathews DH, Schafmeister JW, Ross WS, Kollman PA. AMBER 9. San Francisco: University of California; 2006. [Google Scholar]
  • 38.Hornak V, Abel R, Okur A, Strockbine B, Roitberg A, Simmerling C. Comparison of multiple Amber force fields and development of improved protein backbone parameters. Proteins. 2006;65:712–725. doi: 10.1002/prot.21123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Ryckaert J-P, Ciccotti G, Berendsen HJC. Numerical integration of the Cartesian equations of motion of a system with constraints: molecular dynamics of n-alkanes. J. Comput. Phys. 1977;23:327–341. [Google Scholar]
  • 40.Abdul-Manan N, Aghazadeh B, Liu GA, Majumdar A, Ouerfelli O, Siminovitch KA, Rosen MK. Structure of Cdc42 in complex with the GTPase-binding domain of the 'Wiskott-Aldrich syndrome' protein. Nature. 1999;399:379–383. doi: 10.1038/20726. [DOI] [PubMed] [Google Scholar]
  • 41.Spencer DS, Xu K, Logan TM, Zhou H-X. Effects of pH, salt, and macromolecular crowding on the stability of FK506-binding protein: An integrated experimental and theoretical study. J. Mol. Biol. 2005;351:219–232. doi: 10.1016/j.jmb.2005.05.029. [DOI] [PubMed] [Google Scholar]
  • 42.Van Duyne GD, Standaert RF, Schreiber SL, Clardy J. Atomic structure of the rapamycin human immunophilin FKBP-12 complex. J. Am. Chem. Soc. 1991;113:7433–7434. [Google Scholar]
  • 43.Mongan J, Case DA, McCammon JA. Constant pH molecular dynamics in generalized Born implicit solvent. J. Comput. Chem. 2004;25:2038–2048. doi: 10.1002/jcc.20139. [DOI] [PubMed] [Google Scholar]
  • 44.Cornell WD, Cieplak P, Bayly CI, Gould IR, Merz KM, Ferguson DM, Spellmeyer DC, Fox T, Caldwell JW, Kollman PA. A second generation force field for the simulation of proteins, nucleic acids, and organic molecules. J. Am. Chem. Soc. 1995;117:5179–5197. [Google Scholar]
  • 45.Bondi A. van der Waals volumes and radii. J. Phys. Chem. 1964;68:441–451. [Google Scholar]
  • 46.Tjong H, Zhou H-X. GBr6NL: a generalized Born method for accurately reproducing solvation energy of the nonlinear Poisson-Boltzmann equation. J. Chem. Phys. 2007;126 doi: 10.1063/1.2735322. 195102. [DOI] [PubMed] [Google Scholar]
  • 47.Schreiber G, Fersht AR. Interaction of barnase with its polypeptide inhibitor barstar studied by protein engineering. Biochemistry. 1993;32:5145–5150. doi: 10.1021/bi00070a025. [DOI] [PubMed] [Google Scholar]
  • 48.Schreiber G, Fersht AR. Energetics of protein-protein interactions: analysis of the barnase-barstar interface by single mutations and double mutant cycles. J. Mol. Biol. 1995;248:478–486. doi: 10.1016/s0022-2836(95)80064-6. [DOI] [PubMed] [Google Scholar]
  • 49.Frisch C, Schreiber G, Johnson CM, Fersht AR. Thermodynamics of the interaction of barnase and barstar: changes in free energy versus changes in enthalpy on mutation. J. Mol. Biol. 1997;267:696–706. doi: 10.1006/jmbi.1997.0892. [DOI] [PubMed] [Google Scholar]

RESOURCES