Rapid Calculation of Protein pKa Values Using Rosetta

Krishna Praneeth Kilambi; Jeffrey J Gray

doi:10.1016/j.bpj.2012.06.044

. 2012 Aug 8;103(3):587–595. doi: 10.1016/j.bpj.2012.06.044

Rapid Calculation of Protein pK_a Values Using Rosetta

Krishna Praneeth Kilambi ^†, Jeffrey J Gray ^†,^‡,^∗

PMCID: PMC3414879 PMID: 22947875

Abstract

We developed a Rosetta-based Monte Carlo method to calculate the pK_a values of protein residues that commonly exhibit variable protonation states (Asp, Glu, Lys, His, and Tyr). We tested the technique by calculating pK_a values for 264 residues from 34 proteins. The standard Rosetta score function, which is independent of any environmental conditions, failed to capture pK_a shifts. After incorporating a Coulomb electrostatic potential and optimizing the solvation reference energies for pK_a calculations, we employed a method that allowed side-chain flexibility and achieved a root mean-square deviation (RMSD) of 0.83 from experimental values (0.68 after discounting 11 predictions with an error over 2 pH units). Additional degrees of side-chain conformational freedom for the proximal residues facilitated the capture of charge-charge interactions in a few cases, resulting in an overall RMSD of 0.85 pH units. The addition of backbone flexibility increased the overall RMSD to 0.93 pH units but improved relative pK_a predictions for proximal catalytic residues. The method also captures large pK_a shifts of lysine and some glutamate point mutations in staphylococcal nuclease. Thus, a simple and fast method based on the Rosetta score function and limited conformational sampling produces pK_a values that will be useful when rapid estimation is essential, such as in docking, design, and folding.

Introduction

Biological processes are profoundly influenced by the pH of their local cellular environment. For example, protein folding, enzyme catalysis, and protein-protein interactions are all pH dependent (1–3). Variations in pH can affect protein-protein binding energies by as much as 50% (4), and variable amino acid protonation states due to pH can result in significantly different complex conformations during small-molecule docking calculations (5). Rosetta, a highly successful biomolecular modeling and design package, does not have any dependency on pH. In this work, we added a pH dependency to Rosetta and calibrated relevant parts of the energy function based on our ability to calculate residue pK_a values.

The ability to evaluate residue pK_a values rapidly and accurately will help investigators design better drugs and more robust industrial enzymes that are stable and active over a range of pH values. The calculation of pK_a values can identify the strengths and deficiencies of the energy function, particularly with regard to the electrostatic components (6). Also, the availability of a large number of experimentally determined pK_a values allows benchmarking of energy functions to capture the effects of pH (7). We are especially interested in improving Rosetta’s ability to dock protein or small-molecule ligands to proteins, which is currently limited in the case of binding sites that contain charged atoms, because Rosetta’s algorithms do not account for alternate residue charge states.

Existing computational pK_a prediction algorithms can be broadly classified into four major branches and hybrid approaches. The first branch solves or approximates the Poisson-Boltzmann equation using grid-based continuum electrostatic models (8–13) with diverse dielectric constants (typically in the range of four to 20) to represent the protein interior. These methods have sometimes been found to overestimate the effects of charge-charge interactions (14), and often require several minutes to hours to estimate the pK_a value for a single residue. The second branch employs all-atom molecular dynamics (MD) simulations (15–20) using either explicit (AMBER/CHARMM force fields) or implicit solvent models (generalized Born potential). The third branch comprises quantum mechanics/molecular mechanics (QM/MM)-based methods, which treat the part of protein containing the titratable residue using ab initio QM and the rest of the protein environment using MM (21,22). The fourth branch uses a variety of empirical approaches, some of which employ geometric-based dielectric constants and empirical approximations for solvation, electrostatic, and hydrogen-bonding models (23–28). Each of these branches offers a distinct set of advantages, with the empirical approaches generally being computationally less expensive and the more-rigorous approaches benefiting from fundamental explanations of the underlying physical interactions. Studies have typically reported a prediction root mean-square deviation (RMSD) of <1 pH unit from experimental pK_a values.

Although the diversity in the array of current pK_a prediction algorithms is encouraging, developing a method that can incorporate extensive conformational flexibility while retaining a relatively small computational resource footprint remains a significant challenge. Complex phenomena, such as local conformational changes and coupled ionization pairs, continue to make computational pK_a predictions difficult to achieve. Some approaches tackle conformational flexibility through the use of MD snapshots, explicit side-chain sampling, or NMR structures (9,11,29–32). However, capturing conformational flexibility is expensive. Fast computations are needed to enable the use of pK_a calculation methods in protocols such as protein-protein docking and design, which themselves require generation of a large set of target conformations.

The fast and efficient Rosetta framework can thus be used for rapid estimation of pK_a values that can be useful for further calculations in structure prediction and design. Rosetta’s physics-based all-atom energy function (with terms for van der Waals, solvation, hydrogen bonding, etc.) is pairwise-additive and has used successfully in a wide range of applications (33–41). Nevertheless, Rosetta’s energy function does not include any explicit dependence on environmental conditions such as temperature, salt, and pH. Instead, investigators have assembled the energy function by combining physical and statistical potentials, and calibrating by the ability to identify native-like structures and sequences (33,42–44) and to predict free-energy changes upon mutation (45,46). Therefore, in this work, we begin to incorporate environmental dependence into the Rosetta energy function by using pK_a shifts to calibrate the Rosetta score function’s dependence on environmental pH.

In the remainder of this article, we describe a fast and simple physics-based method, termed Rosetta-pH, to estimate the pK_a values of five types of residues with varied protonation states (Asp, Glu, His, Tyr, and Lys). After showing that the standard Rosetta energy function cannot alone predict pK_a shifts, we add electrostatic terms and modify the solvation potential to create an improved score function. We then explore varying levels of conformational flexibility and test the performance of the method on a set of large pK_a shifts used in recent blind predictions (47).

Methods

In Rosetta pK_a calculations, the pH is titrated from 1 to 14 with Monte Carlo (MC) sampling of protonated and deprotonated side chains with χ-angles sampled from a backbone-dependent rotamer library (48), irrespective of the protonation state. We used an expanded rotamer library that includes additional rotamers that are one and two standard deviations away from the base rotamers, because we noted slight improvements in pK_a prediction accuracy with extra side-chain sampling (data not shown). Each rotamer configuration is accepted or rejected using the Metropolis criterion and the Rosetta score function. The conformational degeneracy in the protonated variants of Asp and Glu (with H atoms on either of the terminal O_δ and O_ε atoms, respectively) is explicitly incorporated by accommodating both possible protonated versions for the residues during sampling. The χ-angles for protons in the protonated variants are sampled at their canonical angles (−60, 60, 180) and ±20°. For neutral His, both possible tautomers (with proton on either N_δ1 or N_ε2 atoms) are sampled.

The protonation states of the lowest-energy conformers sampled during the side-chain conformational search were recorded during every pH step of the calculation (Fig. S1 in the Supporting Material shows the flowchart for the algorithm). An initial titration was carried out in intervals of 1 pH unit, starting with pH = 1 until a change in protonation state was observed, and subsequently a finer sampling interval of 0.1 pH unit was employed in the appropriate coarse interval. The pK_a was identified as the pH value at which the lowest-energy conformer of the residue shifted from the protonated to the deprotonated state. In the case of NMR structures, the entire ensemble of available structural models was used, and the mean of the calculated pK_a values over the structural dataset was used as a representative pK_a for the residue.

The Supporting Material includes a complete description of the experimental pK_a dataset, the Rosetta score function, and the command-line syntax for performing pK_a predictions with Rosetta.

Results and Discussion

Preliminary pK_a predictions

The standard all-atom Rosetta score function comprises several terms, including a Lennard-Jones potential, an implicit solvation potential, and an orientation-dependent hydrogen-bonding term. However, none of Rosetta’s standard score terms depend on the pH of the environment. Consequently, Rosetta assumes constant, standard side-chain protonation states for the ionizable amino acids irrespective of pH: Asp and Glu are assumed to be deprotonated, His and Tyr are assumed to be neutral, and Lys and Arg are assumed to be protonated. To resolve this issue, we created a means to treat the variable protonation states of amino acids.

Protonation potential

First, we added a protonation potential based on the probability of protonation of individual amino acid residues at a given pH. We used a simplified version of the potential described by Onufriev et al. (49) that has been used in pK_a estimations (11,12) and prediction of binding affinities (50). The probability of protonation $(f_{prot})$ of an amino acid is

f_{prot} = \frac{1}{10^{pH - Ip K_{a}} + 1},

and the protonation potential $(E_{pH})$ is

E_{pH} = {\begin{matrix} - k_{B} T \ln f_{prot} & if protonated \\ - k_{B} T \ln (1 - f_{prot}) & if deprotonated \end{matrix},

where pH is defined by the environment, and $Ip K_{a}$ is the unperturbed intrinsic pK_a value of the model compound in solution (4.0 for Asp, 4.4 for Glu, 6.3 for His, 10.0 for Tyr, and 10.4 for Lys). $k_{B} T$ was assigned a value of 0.59 kcal/mol, corresponding to a temperature of 298 K. The free-energy gap of protonation between protonated and deprotonated variants of a residue, $Δ E_{pH}$ , is $2.3 k_{B} T (pH - Ip K_{a})$ . The Supporting Material includes a derivation of pK_a values from the scores of the protonated and deprotonated variants.

Updating residue atom types

Second, to accumulate nonstandard charge-state amino acid variants, we updated the residue atom types to reflect the changes in the protonation state (see Table S1). For example, in the case of protonated Asp, the definition of terminal O_δ atom was modified from a carboxyl oxygen (OOC) in the standard residue to a hydroxyl oxygen (OH) in the protonated variant. The atom types determine the solvation and van der Waals parameters, and the ability to donate or accept hydrogen bonds.

pK_a predictions fail using the standard Rosetta score function

We initially evaluated the pK_a values for the assembled dataset of 264 residues using the protonation potential and the standard Rosetta score function. We calculated the pK_a values by using a single backbone conformation for x-ray crystal structures and averaging over the ensemble of models for NMR structures. The predicted pK_a values plotted against experimental pK_a values in Fig. 1 a are flat, that is, the calculated pK_a values demonstrate negligible shifts from the reference values. Only a few residues, such as H43 from Streptomyces subtilisin inhibitor (3SSI), show shifts (with the pK_a shifting to 2.0 from the reference value of 6.3). Most shifts can be attributed to steric hindrance from the neighboring residues, which renders the protonated variants highly unfavorable. The RMSD of predicted pK_a values relative to experimental pK_a values is 1.0 pH units. In comparison, the null model, in which all pK_a values are assumed to be unshifted from the reference intrinsic pK_a values, produces an RMSD of 0.94 pH units.

(a and b) Correlation between predicted and experimental pK_a values calculated using the standard Rosetta score function (a) without an explicit electrostatic potential and (b) with a distance-dependent Coulomb potential and calibrated solvation reference energies. In panel a, the prediction plots are flat (no shifts from the intrinsic pK_a values, denoted by + symbols) whereas in b, 78% of the pK_a predictions are within 1 pH unit from the experimental values (*dashed lines*) and only 4% of the predictions have errors > 2 pH units. Absolute pK_a values are plotted for clarity; see Fig. S2 for ΔpK_a correlation plots. Note that the RMSD values based on pK_a and ΔpK_a values are equivalent.

These results are not surprising, considering that Rosetta’s standard score function does not include a term for Coulomb electrostatics. Instead, Rosetta relies on a combination of an implicit solvation potential to capture the Born energy of ion burial (51), a statistical residue pair term to account for ion-ion pair interactions (52) and an orientation-dependent, hydrogen-bonding term (53) that rewards salt bridges. For the nonstandard residue protonation variants, the solvation and hydrogen-bonding scores are affected by the changed atom types. However, the residue pair-energy term (E_pair) is based on the probability of proximity of two amino acids in the PDB (normalized by the frequencies of residue-residue pairs in a given burial environment). Because the protonation states of residues in PDB crystal structures are not known, the pair statistics do not differentiate the protonated from the deprotonated forms.

Optimization of the Rosetta-pH score function for pK_a prediction

The shortcomings of the standard Rosetta energy function led us to modify it in two ways. First, to explicitly evaluate electrostatic effects, we added a simple form of a Coulomb electrostatic potential with a distance-dependent dielectric for gradual shielding at increasing interatomic distances (54). It includes a cap at short range and a shift to become zero at a long-range cutoff. The energy between atoms i and j is

E_{elec}^{i j} = {\begin{array}{l} 322 q_{i} q_{j} (\frac{1}{ε_{min} r_{min}} - \frac{1}{ε_{max} r_{max}}) & r_{i j} \leq r_{min} \\ 322 q_{i} q_{j} (\frac{1}{ε r_{i j}} - \frac{1}{ε_{max} r_{max}}) & r_{min} < r_{i j} < r_{max} \\ 0 & r_{i j} \geq r_{max} \end{array},

where q₁ and q₂ are atomic charges, r_ij is the distance between atoms i and j, and ε is the dielectric constant, which is estimated as ε = 10r. The short- and long-range cutoff distances, r_min and r_max, are 1.5 Å and 5.5 Å, respectively, with ε_min = 10r_min and ε_max = 10r_max. The partial charges (q_i) for the side-chain atoms in standard amino acid variants and nonstandard protonation states were obtained from CHARMM27 (55). For deprotonated Tyr, the parameters were obtained from the quantum chemical calculations by Rabenstein et al. (56). The total Coulomb energy, E_elec, is evaluated by summing $E_{elec}^{i j}$ over all pairs of atoms in the protein.

Our second modification to the Rosetta score function arose from the solvation term. Rosetta uses the Lazaridis-Karplus implicit model for solvation (51), where the score for an atom i is evaluated as

E_{solv}^{i} = Δ G_{i}^{ref} - \sum_{j \neq i} f (r_{i j}) V_{j},

where r_ij is the distance between atoms i and j, V_j is the volume of neighbor atom j, $Δ G_{i}^{ref}$ is reference solvation energy based on a fully solvated atom, and $f (r_{i j})$ is the solvation free-energy density estimated by the Gaussian exclusion function (see Lazaridis and Karplus (51) for details). The total solvation energy for a residue, E_solv, is evaluated by summing $E_{solv}^{i}$ over all its atoms.

To test the effects of solvation on the side-chain protonation state, we plotted the difference in solvation energies between the protonated and deprotonated variants (ΔE_solv) against the degree of burial from solvent for all residues from the pK_a prediction dataset (Fig. S3). Negligible differences were observed in solvation scores between protonated and deprotonated variants in residues exposed to the solvent (<15 neighboring atoms), indicating equal preference for either variant. Solvation scores in Rosetta are zero for atoms exposed to solvent, because the Lazaridis-Karplus reference energy $(Δ G_{i}^{ref})$ has been displaced by a separate reference score (aa_ref) derived using sequence recovery over a dataset of proteins (42). However, for solvent-exposed residues, considering alternate protonation states, the potential should favor the charged variant. Therefore, we derived reference scores for the five nonstandard protonation variants using a linear regression that minimizes the predicted pK_a RMSD over a subset of randomly selected 200 pK_a values (∼3/4 of the total dataset). For example, for Asp the pK_a predictions are optimal if the reference score for the protonated state shifts up by 0.59 score units relative to the deprotonated variant. Table S2 details the resulting reference scores.

Testing the effects of varying degrees of protein flexibility

Flexible target side chain

We employed the new energy function and evaluated pK_a values by sampling the χ-angles and protonation states of the target residue with the remaining protein held rigid. The method resulted in an RMSD of 0.83 pH units over the entire dataset (Fig. 1 b). Excluding 11 outliers that have prediction errors > 2 pH units, the remaining 253 residues have an RMSD of 0.68. We found that 92% of the predictions differed by <1.5 pH units from the experimental values, and more than half of them were accurate to within 0.5 pH units (Table S3).

The overall accuracy of the pK_a predictions usually decreased as the number of residue side-chain degrees of freedom increased (Table 1). The predictions were more accurate in the case of Asp, His, and Tyr (two χ-angles), with overall prediction RMSD values of 0.81, 0.82, and 0.77, respectively, compared with 0.92 in the case of Glu (three χ-angles). Although the RMSD for Lys residues (four χ-angles) is 0.67, it is still higher than the null-model RMSD of 0.59.

Table 1.

pK_a prediction RMSD by residue type and prediction method

	Number of χ angles	Null model	Standard Rosetta	Rosetta-pH score function
	Number of χ angles	Null model	Standard Rosetta	Site repack	Neighbor repack	Ensemble average
Asp	2	0.96	1.1	0.81	0.87	0.83
Glu	3	0.83	0.88	0.92	0.88	0.92
His	2	1.0	1.2	0.82	0.86	1.0
Tyr	2	1.2	0.96	0.77	0.84	1.2
Lys	4	0.59	0.65	0.67	0.62	0.71
All	–	0.94	1.0	0.83	0.85	0.93

Open in a new tab

To improve the accuracy of predictions and understand the limitations of the method, we inspected residues with large pK_a errors. In the case of E17 and E26 from calbindin D9k (4ICB), calculating the pK_a values for each residue separately resulted in pK_a predictions of 5.3 and 5.2, compared with experimental values of 3.6 and 4.1, respectively (57). These two glutamate residues are structurally proximal (Fig. 2 a) and are involved in strong charge-charge interaction; hence, the large pK_a prediction errors can be attributed to the unrealistic assumption that the neighboring Glu residue is deprotonated across the entire range of pH values during titration. Therefore, the pK_a predictions are upshifted due to the high energetic penalty resulting from electrostatic repulsion.

Conformational flexibility. (a) Interacting neighbor residues E17 and E26 from calbindin D9k (4ICB) are more accurately predicted when neighbor side-chain flexibility and protonation are allowed. (b) A structural model of RNase H (2RN2) generated using RosettaRelax (*blue*) compared with the native structure (*cyan*). D10 is closer to D70 in the model, resulting in a shift in the predicted pK_a value of the D10 from 4.0 using the native structure to 5.5 using the relaxed model. The experimental pK_a value for D10 is 6.1. (c) The structural model of H227 from phospholipase C (1GYM) has no space for a proton and thus highly favors the default (deprotonated) variant, resulting in a predicted pK_a of 2.7.

We also examined whether the pK_a predictions could identify the correct proton donor among the catalytic residues in hen egg white lysozyme (2LZT) and Bacillus circulans xylanase (1XNB). The catalytic sites of both lysozyme and xylanase have a Glu residue with a large upshifted pK_a value that acts as a proton donor, and a proximal carboxylic acid residue with a lower pK_a value that serves as a nucleophile (58,59). The Rosetta-pH method predicted pK_a values of 3.8 and 2.9 for the E35-D52 pair in lysozyme, and 3.0 and 4.1 for the E172-E78 pair in xylanase. The experimental pK_a values for the E35-D52 pair in lysozyme are 6.2 and 3.7, respectively, and those for the E172-E78 pair in xylanase are 6.7 and 4.6, respectively. Thus, a ranking of residues by their calculated pK_a values identified the correct proton donor in the case of lysozyme (pK_a^E35 > pK_a^D52) but failed in the case of xylanase. However, the method was unable to predict the large upward shifts in the pK_a values of Glu residues in either case. The upward shifts were likely absent because in this formulation, neighbor residues did not sample alternate protonation states. Thus, charge-charge interactions of proximal residues capable of adapting variable protonation states were not captured.

Flexible and protonatable neighbor side chains

In an effort to improve the treatment of local charge-charge interactions, we added conformational sampling of standard and alternate-charge-state side-chain rotamers of all the residues within 6 Å of the target residue. This approach allowed simultaneous sampling of variable protonation states for the E17-E26 pair from calbindin D9k (Fig. 2 a) and improved the predicted pK_a values to 3.7 and 4.1, respectively (experimental pK_a values are 3.6 and 4.1, respectively). The pK_a prediction accuracy for the E17-E26 pair is encouraging, because it is difficult to determine the optimal titration order in an ionizable pair when the difference between the respective residue pK_a values is small relative to the interaction energy involved (60).

However, the method still could not capture the large upward shift in the pK_a values of Glu residues in the catalytic sites of lysozyme and xylanase. The method predicted pK_a values of 3.7 and 2.7 for the E35-D52 pair in lysozyme, and 2.7 and 4.7 for the E172-E78 pair in xylanase. For xylanase, the method predicted a higher pK_a value for E78 relative to E172, thereby incorrectly identifying E78 as the proton donor and E172 as the nucleophile during catalysis. Thus, in this case, additional side-chain flexibility did not improve the accuracy of the pK_a predictions.

Over the complete dataset, adding neighbor side-chain sampling decreased the number of outliers from 11 to 7. However, in a few cases, sampling the extra side-chain conformations resulted in additional deviations (up to 0.3 pH units). Thus, the RMSD for predicted pK_a values (Fig. 3 a) over the complete dataset increased to 0.85 (0.73 excluding the outliers). The percentage of pK_a predictions within an error of 1 pH unit dropped slightly from 78% to 76%.

(*a–c*) Correlation between predicted and experimental pK_a values using (a) neighbor repacking and (b and c) a structural ensemble of 50 backbone conformations. (b) pK_a distribution for each generated structure in the ensemble. (c) Average pK_a value over the ensemble. The pK_a predictions are highly sensitive to local side-chain and backbone conformational changes.

Flexible backbone

For a second level of protein flexibility, we extended conformational sampling to the protein backbone. To explore the effects of a flexible backbone in x-ray crystal structures, it is first necessary to create an ensemble of backbone structures. To that end, we generated a conformational ensemble for each protein using RosettaRelax (33,61). RosettaRelax uses MC sampling employing small backbone dihedral angle (φ, ψ) perturbations followed by side-chain packing and minimization of the score function along the gradient in torsion space. We generated 50 structural models for each protein starting from its x-ray crystal structure. To enable comparisons with rigid backbone methods over the complete dataset, we also generated ensembles for NMR structures by choosing NMR Model 1 as the starting structure. The conformers generated using this protocol form a dense cluster with most models <1 Å C_α RMSD from the native structure (62).

For each residue, we evaluated the pK_a values separately using each of the 50 generated backbone models in the ensemble. As illustrated in Fig. 3 b, the predicted pK_a values for each residue were distributed over a wide range, demonstrating the sensitivity of pK_a predictions to even small changes in backbone conformation. Using the average pK_a value for each residue over the ensemble resulted in a distribution similar to that observed using a single representative structure (Fig. 3 c). The RMSD values for Asp, Glu, and Lys were comparable to those obtained without accounting for backbone flexibility (0.83, 0.92, and 0.71, respectively, compared with 0.81, 0.92, and 0.67), but the RMSD values for His and Tyr were far less accurate (1.0 and 1.2, respectively, compared with 0.82 and 0.77). The average pK_a values had an RMSD of 0.94 pH units over the complete dataset.

Fig. 2 b shows D10 from ribonuclease H (2RN2) as a representative example demonstrating the effect of backbone variation. A small backbone perturbation in one of the models generated with the use of backbone relaxation reduced the distance between D10 and D70, resulting in an increase in the predicted pK_a value from 5.0 with the crystal structure to 5.5, which is closer to the experimental pK_a value of 6.1. In contrast, in the case of H227 from phospholipase C (1GYM; Fig. 2 c), whose experimental pK_a value is 6.9, the predicted pK_a dropped dramatically from 6.6 with the crystal structure to 2.7 with the backbone ensemble. The pK_a shifted down because the backbone relaxation created a tightly packed core that fit only the deprotonated version of His, i.e., because the RosettaRelax protocol does not use alternate protonation states of residues in conformational sampling, it used the deprotonated His and compacted the structure too tightly to fit a hydrogen atom. Thus, the use of a backbone conformational ensemble improved the accuracy of some pK_a predictions but also resulted in significant downshifts in pK_a values for a few residues with stabilized deprotonated variants.

The increase in conformational degrees of freedom did not help in recovering the large upward shift in pK_a values of catalytic Glu residues in lysozyme and xylanase. Although the predicted pK_a value of 3.3 for D52 in lysozyme (experimental pK_a 3.7) was more accurate than predictions of 2.9 and 2.7 when sampling the target residue and neighboring side chains, respectively, the method predicted a pK_a value of 3.6 for E35, compared with the experimental pK_a value of 6.2. However, the additional backbone flexibility allowed the method to identify the correct proton donor among catalytic residues in xylanase, as the predicted pK_a value of 3.5 for E172 was higher than the predicted pK_a value of 2.2 for E78.

Extreme pK_a shifts—the staphylococcal nuclease set

In our main dataset, the majority of the residues had pK_a shifts of <1.5 pH units, and a few residues shifted up to 3.5 pH units. Members of the García-Moreno laboratory recently acquired experimental data for a large number of pK_a values for mutants at various positions in the highly stable Δ+PHS variant of staphylococcal nuclease (SNase). Some ionizable groups in hydrophobic regions shifted their pK_a values by as much as 5 pH units from their intrinsic pK_a values (63–70). The dataset of pK_a values was recently used in a large-scale, community-wide blind pK_a prediction challenge, called the pK_a Cooperative, for assessment of published pK_a prediction methods (47).

To test the efficacy of our method for predicting large pK_a shifts, we applied the optimized Rosetta-pH method to the SNase model system, with conformational flexibility limited to the target residue. Fig. 4 shows the correlation between predicted and experimental pK_a values represented relative to their shift from reference pK_a values. Encouragingly, the pK_a prediction RMSD for the set of Lys residues was 1.3 compared with a null-model RMSD of 3.4. The RMSDs for predictions in the case of Glu and Asp residue sets were 2.1 and 3 pH units respectively (the corresponding null-model RMSDs were 2.4 and 2.9). Although the large pK_a shift of the K66 mutant was predicted accurately (predicted pK_a = 5.3; experimental pK_a = 5.6), the method was unable to predict the large upshifts in the pK_a values of Asp and Glu mutants at the same position (predicted pK_a = 3.6 and 4.4; experimental pK_a = 8.7 and 8.5, respectively). Experimental studies revealed a network of internal water molecules in the case of E66 mutant (71) and a local conformational transition in the case of D66 mutant (72), and sensitivity of the pK_a values to the SNase global stability (66). These effects are difficult to capture. For other pK_a prediction methods using continuum electrostatics, investigators have resorted to the use of high dielectric constants (ε ≈ 10) to reproduce the magnitude of the pK_a shifts (68).

Correlation between predicted and experimental pK_a values for SNase mutants with large pK_a shifts.

Conclusion

We have developed a novel (to our knowledge) method, termed Rosetta-pH, that rapidly calculates the pK_a values of a diverse dataset. We find that 78% of the predictions are within 1 pH unit of the experimental pK_a values. The score function used for predictions includes a simple distance-based dielectric Coulomb potential that substantially increases the accuracy of pK_a predictions compared with the standard Rosetta score function. Coulomb interactions play a significant role in stabilizing the charged variants of buried residues to balance the effects of desolvation (47). However, it is perhaps surprising that employing a simple Coulomb potential generates pK_a predictions with an RMSD of 0.83 pH units using a single flexible target side chain. In comparison, the latest published versions of the widely used pK_a prediction algorithms MCCE2 (13) and PROPKA3 (25) report RMSDs of 0.90 and 0.79, respectively, over their corresponding datasets comprising 305 and 293 residues. (Note that using RMSD values to guide optimization of pK_a calculation algorithms does not guarantee physically realistic models, and RMSD values are highly dependent on the pK_a datasets used for calibration (7).) The successful prediction of large pK_a shifts in SNase point mutants also supports the efficacy of the fast and simple Coulomb electrostatic treatment.

In addition to the score function, pK_a prediction is influenced by the extent of conformational sampling. Other investigators have explored the effects of side-chain (11,29,32) and backbone (15,16,20) flexibility. For example, significant improvements in pK_a prediction accuracy were reported by Gunner et al. (73), Witham et al. (31), and Song (74), who employed Gromacs relaxation, MD snapshots, and Rosetta refinement, respectively, to incorporate backbone effects in MCCE pK_a calculations. Similarly, in Rosetta-pH, additional side-chain flexibility resulted in more accurate estimation of charge-charge interactions, and additional backbone flexibility improved relative pK_a predictions in catalytic residues. However, over the complete dataset, the prediction RMSD values increased from 0.83 to 0.85, and finally reached 0.94 pH units as we extended the sampling from a single side chain to multiple neighboring side chains and finally to the protein backbone (Table 1). One explanation for the reduced accuracy is simply the noise in the energy introduced by additional motions throughout the large protein. A second possible explanation for the dip is that the diversity of the generated backbone ensemble inaccurately represents the solution-state flexibility when alternate protonation states are possible. For example, during ensemble generation, the RosettaRelax protocol does not dynamically alter residue protonation states, thus biasing the structures toward standard protonation variants. Considering variable protonation states and calibrating RosettaRelax to generate an ensemble of thermodynamic states that can be used in Boltzmann weighting for pK_a estimations might improve the accuracy of the pK_a predictions.

Our treatment of the pK_a calculations is also very simple in that it defines the pK_a as the pH at which the lowest-energy state changes, instead of calculating the average over an ensemble of different protonated and deprotonated states at each pH. A more rigorous thermodynamic treatment would need to capture the titration dynamics by collecting side-chain rotamer frequencies during titration, and reproduce the titration curves to estimate the pK_a values. Such a treatment could better capture cases in which multiple structures make important contributions to the thermodynamic ensemble, or in which ionization effects are coupled.

The standard Rosetta standard score function has been applied very successfully to protein folding, design, and docking. However, before this work, the score function had not been developed to handle pK_a calculations. pK_a predictions are extremely sensitive to electrostatic treatments, so our hope is that these protocols will aid in improving and fine-tuning the Rosetta score function. Further studies employing the pK_a protocol can be used to test alternate electrostatic and solvation potentials in Rosetta, such as Poisson-Boltzmann treatments using matched interface- and boundary-method-based solvers (75) and semi-explicit solvation potentials (76). Ultimately, such calculations may also help introduce other environmental factors, such as temperature and salt concentration (77,78), into the Rosetta score function, but work is needed to reconcile the new score terms with the standard score function that is finely tuned for structure prediction and design applications.

The major goal of our study was to develop a fast method for identifying the most favorable protonation state at a given pH. Whereas a full titration requires ∼15–30 CPU seconds depending on the extent of conformational flexibility, Rosetta-pH can evaluate the most favorable protonation state at a given pH in less than a second. The meager computational requirements make Rosetta-pH sufficiently fast to dynamically sample, predict, and alter various probable protonation states on the fly during other calculations. Combined with the object-oriented design of the Rosetta modeling suite (79), our method makes it possible to integrate alternate protonation states and pH contexts into other protocols, such as protein folding, docking, and design.

Acknowledgments

Ryan Harrison incorporated the pH effects into earlier versions of Rosetta. We thank Danielle Hein, Jianqing Xu, and Robert Schleif for reviewing the manuscript, and Bertrand García-Moreno for critical insights and thoughtful suggestions.

This work was funded by National Institutes of Health grant GM078221.

Supporting Material

Document S1. A complete description of the experimental pK_a dataset, the Rosetta score function, and the command-line syntax for performing pK_a predictions with Rosetta

mmc1.pdf^{(317.9KB, pdf)}

References

1.Sheinerman F.B., Norel R., Honig B. Electrostatic aspects of protein-protein interactions. Curr. Opin. Struct. Biol. 2000;10:153–159. doi: 10.1016/s0959-440x(00)00065-8. [DOI] [PubMed] [Google Scholar]
2.Whitten S.T., García-Moreno E B., Hilser V.J. Local conformational fluctuations can modulate the coupling between proton binding and global structural transitions in proteins. Proc. Natl. Acad. Sci. USA. 2005;102:4282–4287. doi: 10.1073/pnas.0407499102. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Warshel A., Dryga A. Simulating electrostatic energies in proteins: perspectives and some recent studies of pKas, redox, and other crucial functional properties. Proteins. 2011;79:3469–3484. doi: 10.1002/prot.23125. [DOI] [PubMed] [Google Scholar]
4.Mitra R.C., Zhang Z., Alexov E. In silico modeling of pH-optimum of protein-protein binding. Proteins. 2011;79:925–936. doi: 10.1002/prot.22931. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Warren G.L., Andrews C.W., Head M.S. A critical assessment of docking programs and scoring functions. J. Med. Chem. 2006;49:5912–5931. doi: 10.1021/jm050362n. [DOI] [PubMed] [Google Scholar]
6.Dong F., Olsen B., Baker N.A. Computational methods for biomolecular electrostatics. Methods Cell Biol. 2008;84:843–870. doi: 10.1016/S0091-679X(07)84026-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Carstensen T., Farrell D., Nielsen J.E. On the development of protein pKa calculation algorithms. Proteins. 2011;79:3287–3298. doi: 10.1002/prot.23091. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Yang A.-S., Gunner M.R., Honig B. On the calculation of pKas in proteins. Proteins. 1993;15:252–265. doi: 10.1002/prot.340150304. [DOI] [PubMed] [Google Scholar]
9.Antosiewicz J., McCammon J.A., Gilson M.K. The determinants of pKas in proteins. Biochemistry. 1996;35:7819–7833. doi: 10.1021/bi9601565. [DOI] [PubMed] [Google Scholar]
10.Havranek J.J., Harbury P.B. Tanford-Kirkwood electrostatics for protein modeling. Proc. Natl. Acad. Sci. USA. 1999;96:11145–11150. doi: 10.1073/pnas.96.20.11145. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Georgescu R.E., Alexov E.G., Gunner M.R. Combining conformational flexibility and continuum electrostatics for calculating pK(a)s in proteins. Biophys. J. 2002;83:1731–1748. doi: 10.1016/S0006-3495(02)73940-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Fitch C.A., Karp D.A., García-Moreno E B. Experimental pK(a) values of buried residues: analysis with continuum methods and role of water penetration. Biophys. J. 2002;82:3289–3304. doi: 10.1016/s0006-3495(02)75670-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Song Y., Mao J., Gunner M.R. MCCE2: improving protein pKa calculations with extensive side chain rotamer sampling. J. Comput. Chem. 2009;30:2231–2247. doi: 10.1002/jcc.21222. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Forsyth W.R., Robertson A.D. Insensitivity of perturbed carboxyl pK(a) values in the ovomucoid third domain to charge replacement at a neighboring residue. Biochemistry. 2000;39:8067–8072. doi: 10.1021/bi992967p. [DOI] [PubMed] [Google Scholar]
15.Simonson T., Carlsson J., Case D.A. Proton binding to proteins: pK(a) calculations with explicit and implicit solvent models. J. Am. Chem. Soc. 2004;126:4167–4180. doi: 10.1021/ja039788m. [DOI] [PubMed] [Google Scholar]
16.Kuhn B., Kollman P.A., Stahl M. Prediction of pKa shifts in proteins using a combination of molecular mechanical and continuum solvent calculations. J. Comput. Chem. 2004;25:1865–1872. doi: 10.1002/jcc.20111. [DOI] [PubMed] [Google Scholar]
17.Khandogin J., Brooks C.L., 3rd Toward the accurate first-principles prediction of ionization equilibria in proteins. Biochemistry. 2006;45:9363–9373. doi: 10.1021/bi060706r. [DOI] [PubMed] [Google Scholar]
18.Wallace J.A., Shen J.K. Continuous constant pH molecular dynamics in explicit solvent with pH-based replica exchange. J. Chem. Theory Comput. 2011;7:2617–2629. doi: 10.1021/ct200146j. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Itoh S.G., Damjanović A., Brooks B.R. pH replica-exchange method based on discrete protonation states. Proteins. 2011;79:3420–3436. doi: 10.1002/prot.23176. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Williams S.L., Blachly P.G., McCammon J.A. Measuring the successes and deficiencies of constant pH molecular dynamics: a blind prediction study. Proteins. 2011;79:3381–3388. doi: 10.1002/prot.23136. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Li H., Robertson A.D., Jensen J.H. The determinants of carboxyl pKa values in turkey ovomucoid third domain. Proteins. 2004;55:689–704. doi: 10.1002/prot.20032. [DOI] [PubMed] [Google Scholar]
22.Schaefer P., Riccardi D., Cui Q. Reliable treatment of electrostatics in combined QM/MM simulation of macromolecules. J. Chem. Phys. 2005;123:014905. doi: 10.1063/1.1940047. [DOI] [PubMed] [Google Scholar]
23.Wisz M.S., Hellinga H.W. An empirical model for electrostatic interactions in proteins incorporating multiple geometry-dependent dielectric constants. Proteins. 2003;51:360–377. doi: 10.1002/prot.10332. [DOI] [PubMed] [Google Scholar]
24.Li H., Robertson A.D., Jensen J.H. Very fast empirical prediction and rationalization of protein pKa values. Proteins. 2005;61:704–721. doi: 10.1002/prot.20660. [DOI] [PubMed] [Google Scholar]
25.Olsson M.H.M., Søndergaard C.R., Jensen J.H. PROPKA3: consistent treatment of internal and surface residues in empirical pKa predictions. J. Chem. Theory Comput. 2011;7:525–537. doi: 10.1021/ct100578z. [DOI] [PubMed] [Google Scholar]
26.Krieger E., Nielsen J.E., Vriend G. Fast empirical pKa prediction by Ewald summation. J. Mol. Graph. Model. 2006;25:481–486. doi: 10.1016/j.jmgm.2006.02.009. [DOI] [PubMed] [Google Scholar]
27.Milletti F., Storchi L., Cruciani G. Predicting protein pK(a) by environment similarity. Proteins. 2009;76:484–495. doi: 10.1002/prot.22363. [DOI] [PubMed] [Google Scholar]
28.Huang R.-B., Du Q.-S., Chou K.C. A fast and accurate method for predicting pKa of residues in proteins. Protein Eng. Des. Sel. 2010;23:35–42. doi: 10.1093/protein/gzp067. [DOI] [PubMed] [Google Scholar]
29.You T.J., Bashford D. Conformation and hydrogen ion titration of proteins: a continuum electrostatic model with conformational flexibility. Biophys. J. 1995;69:1721–1733. doi: 10.1016/S0006-3495(95)80042-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.van Vlijmen H.W.T., Schaefer M., Karplus M. Improving the accuracy of protein pKa calculations: conformational averaging versus the average structure. Proteins. 1998;33:145–158. doi: 10.1002/(sici)1097-0134(19981101)33:2<145::aid-prot1>3.0.co;2-i. [DOI] [PubMed] [Google Scholar]
31.Witham S., Talley K., Alexov E. Developing hybrid approaches to predict pKa values of ionizable groups. Proteins. 2011;79:3389–3399. doi: 10.1002/prot.23097. [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Beroza P., Case D.A. Including side chain flexibility in continuum electrostatic calculations of protein titration. J. Phys. Chem. 1996;100:20156–20163. [Google Scholar]
33.Bradley P., Misura K.M.S., Baker D. Toward high-resolution de novo structure prediction for small proteins. Science. 2005;309:1868–1871. doi: 10.1126/science.1113801. [DOI] [PubMed] [Google Scholar]
34.Sircar A., Chaudhury S., Gray J.J. A generalized approach to sampling backbone conformations with RosettaDock for CAPRI rounds 13-19. Proteins. 2010;78:3115–3123. doi: 10.1002/prot.22765. [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Chaudhury S., Berrondo M., Gray J.J. Benchmarking and analysis of protein docking performance in Rosetta v3.2. PLoS ONE. 2011;6:e22477. doi: 10.1371/journal.pone.0022477. [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Jiang L., Althoff E.A., Baker D. De novo computational design of retro-aldol enzymes. Science. 2008;319:1387–1391. doi: 10.1126/science.1152692. [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Röthlisberger D., Khersonsky O., Baker D. Kemp elimination catalysts by computational enzyme design. Nature. 2008;453:190–195. doi: 10.1038/nature06879. [DOI] [PubMed] [Google Scholar]
38.Makrodimitris K., Masica D.L., Gray J.J. Structure prediction of protein-solid surface interactions reveals a molecular recognition motif of statherin for hydroxyapatite. J. Am. Chem. Soc. 2007;129:13713–13722. doi: 10.1021/ja074602v. [DOI] [PubMed] [Google Scholar]
39.Masica D.L., Schrier S.B., Gray J.J. De novo design of peptide-calcite biomineralization systems. J. Am. Chem. Soc. 2010;132:12252–12262. doi: 10.1021/ja1001086. [DOI] [PubMed] [Google Scholar]
40.Kortemme T., Joachimiak L.A., Baker D. Computational redesign of protein-protein interaction specificity. Nat. Struct. Mol. Biol. 2004;11:371–379. doi: 10.1038/nsmb749. [DOI] [PubMed] [Google Scholar]
41.Kuhlman B., Dantas G., Baker D. Design of a novel globular protein fold with atomic-level accuracy. Science. 2003;302:1364–1368. doi: 10.1126/science.1089427. [DOI] [PubMed] [Google Scholar]
42.Kuhlman B., Baker D. Native protein sequences are close to optimal for their structures. Proc. Natl. Acad. Sci. USA. 2000;97:10383–10388. doi: 10.1073/pnas.97.19.10383. [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Gray J.J., Moughon S., Baker D. Protein-protein docking with simultaneous optimization of rigid-body displacement and side-chain conformations. J. Mol. Biol. 2003;331:281–299. doi: 10.1016/s0022-2836(03)00670-3. [DOI] [PubMed] [Google Scholar]
44.Song Y., Tyka M., Baker D. Structure-guided forcefield optimization. Proteins. 2011;79:1898–1909. doi: 10.1002/prot.23013. [DOI] [PMC free article] [PubMed] [Google Scholar]
45.Kortemme T., Baker D. A simple physical model for binding energy hot spots in protein-protein complexes. Proc. Natl. Acad. Sci. USA. 2002;99:14116–14121. doi: 10.1073/pnas.202485799. [DOI] [PMC free article] [PubMed] [Google Scholar]
46.Kellogg E.H., Leaver-Fay A., Baker D. Role of conformational sampling in computing mutation-induced changes in protein structure and stability. Proteins. 2011;79:830–838. doi: 10.1002/prot.22921. [DOI] [PMC free article] [PubMed] [Google Scholar]
47.Nielsen J.E., Gunner M.R., García-Moreno B.E. The pKa Cooperative: a collaborative effort to advance structure-based calculations of pKa values and electrostatic effects in proteins. Proteins. 2011;79:3249–3259. doi: 10.1002/prot.23194. [DOI] [PMC free article] [PubMed] [Google Scholar]
48.Dunbrack R.L., Jr., Cohen F.E. Bayesian statistical analysis of protein side-chain rotamer preferences. Protein Sci. 1997;6:1661–1681. doi: 10.1002/pro.5560060807. [DOI] [PMC free article] [PubMed] [Google Scholar]
49.Onufriev A., Case D.A., Ullmann G.M. A novel view of pH titration in biomolecules. Biochemistry. 2001;40:3413–3419. doi: 10.1021/bi002740q. [DOI] [PubMed] [Google Scholar]
50.Park M.-S., Gao C., Stern H.A. Estimating binding affinities by docking/scoring methods using variable protonation states. Proteins. 2011;79:304–314. doi: 10.1002/prot.22883. [DOI] [PubMed] [Google Scholar]
51.Lazaridis T., Karplus M. Effective energy function for proteins in solution. Proteins. 1999;35:133–152. doi: 10.1002/(sici)1097-0134(19990501)35:2<133::aid-prot1>3.0.co;2-n. [DOI] [PubMed] [Google Scholar]
52.Simons K.T., Ruczinski I., Baker D. Improved recognition of native-like protein structures using a combination of sequence-dependent and sequence-independent features of proteins. Proteins. 1999;34:82–95. doi: 10.1002/(sici)1097-0134(19990101)34:1<82::aid-prot7>3.0.co;2-a. [DOI] [PubMed] [Google Scholar]
53.Kortemme T., Morozov A.V., Baker D. An orientation-dependent hydrogen bonding potential improves prediction of specificity and structure for proteins and protein-protein complexes. J. Mol. Biol. 2003;326:1239–1259. doi: 10.1016/s0022-2836(03)00021-4. [DOI] [PubMed] [Google Scholar]
54.Brooks B.R., Bruccoleri R.E., Olafson B.D., States D.J., Swaminathan S. CHARMM: a program for macromolecular energy, minimization, and dynamics calculations. J. Comput. Chem. 1983;4:187–217. [Google Scholar]
55.Brooks B.R., Brooks C.L., 3rd, Karplus M. CHARMM: the biomolecular simulation program. J. Comput. Chem. 2009;30:1545–1614. doi: 10.1002/jcc.21287. [DOI] [PMC free article] [PubMed] [Google Scholar]
56.Rabenstein B., Ullmann G.M., Knapp E.-W. Energetics of electron-transfer and protonation reactions of the quinones in the photosynthetic reaction center of Rhodopseudomonas viridis. Biochemistry. 1998;37:2488–2495. doi: 10.1021/bi971921y. [DOI] [PubMed] [Google Scholar]
57.Kesvatera T., Jönsson B., Linse S. Focusing of the electrostatic potential at EF-hands of calbindin D(9k): titration of acidic residues. Proteins. 2001;45:129–135. doi: 10.1002/prot.1132. [DOI] [PubMed] [Google Scholar]
58.Vocadlo D.J., Davies G.J., Withers S.G. Catalysis by hen egg-white lysozyme proceeds via a covalent intermediate. Nature. 2001;412:835–838. doi: 10.1038/35090602. [DOI] [PubMed] [Google Scholar]
59.McIntosh L.P., Hand G., Withers S.G. The pKa of the general acid/base carboxyl group of a glycosidase cycles during catalysis: a 13C-NMR study of Bacillus circulans xylanase. Biochemistry. 1996;35:9958–9966. doi: 10.1021/bi9613234. [DOI] [PubMed] [Google Scholar]
60.Olsson M.H.M. Protein electrostatics and pKa blind predictions; contribution from empirical predictions of internal ionizable residues. Proteins. 2011;79:3333–3345. doi: 10.1002/prot.23113. [DOI] [PubMed] [Google Scholar]
61.Misura K.M.S., Baker D. Progress and challenges in high-resolution refinement of protein structure models. Proteins. 2005;59:15–29. doi: 10.1002/prot.20376. [DOI] [PubMed] [Google Scholar]
62.Chaudhury S., Gray J.J. Conformer selection and induced fit in flexible backbone protein-protein docking using computational and NMR ensembles. J. Mol. Biol. 2008;381:1068–1087. doi: 10.1016/j.jmb.2008.05.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
63.Isom D.G., Castañeda C.A., García-Moreno B. Large shifts in pKa values of lysine residues buried inside a protein. Proc. Natl. Acad. Sci. USA. 2011;108:5260–5265. doi: 10.1073/pnas.1010750108. [DOI] [PMC free article] [PubMed] [Google Scholar]
64.Harms M.J., Schlessman J.L., García-Moreno B. A buried lysine that titrates with a normal pKa: role of conformational flexibility at the protein-water interface as a determinant of pKa values. Protein Sci. 2008;17:833–845. doi: 10.1110/ps.073397708. [DOI] [PMC free article] [PubMed] [Google Scholar]
65.Isom D.G., Castañeda C.A., García-Moreno E B. Charges in the hydrophobic interior of proteins. Proc. Natl. Acad. Sci. USA. 2010;107:16096–16100. doi: 10.1073/pnas.1004213107. [DOI] [PMC free article] [PubMed] [Google Scholar]
66.Karp D.A., Stahley M.R., García-Moreno B. Conformational consequences of ionization of Lys, Asp, and Glu buried at position 66 in staphylococcal nuclease. Biochemistry. 2010;49:4138–4146. doi: 10.1021/bi902114m. [DOI] [PMC free article] [PubMed] [Google Scholar]
67.Takayama Y., Castañeda C.A., Iwahara J. Direct evidence for deprotonation of a lysine side chain buried in the hydrophobic core of a protein. J. Am. Chem. Soc. 2008;130:6714–6715. doi: 10.1021/ja801731g. [DOI] [PubMed] [Google Scholar]
68.Castañeda C.A., Fitch C.A., García-Moreno B.E. Molecular determinants of the pKa values of Asp and Glu residues in staphylococcal nuclease. Proteins. 2009;77:570–588. doi: 10.1002/prot.22470. [DOI] [PubMed] [Google Scholar]
69.Denisov V.P., Schlessman J.L., Halle B. Stabilization of internal charges in a protein: water penetration or conformational change? Biophys. J. 2004;87:3982–3994. doi: 10.1529/biophysj.104.048454. [DOI] [PMC free article] [PubMed] [Google Scholar]
70.Harms M.J., Castañeda C.A., García-Moreno E B. The pK(a) values of acidic and basic residues buried at the same internal location in a protein are governed by different factors. J. Mol. Biol. 2009;389:34–47. doi: 10.1016/j.jmb.2009.03.039. [DOI] [PMC free article] [PubMed] [Google Scholar]
71.Dwyer J.J., Gittis A.G., García-Moreno E B. High apparent dielectric constants in the interior of a protein reflect water penetration. Biophys. J. 2000;79:1610–1620. doi: 10.1016/S0006-3495(00)76411-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
72.Karp D.A., Gittis A.G., García-Moreno E B. High apparent dielectric constant inside a protein reflects structural reorganization coupled to the ionization of an internal Asp. Biophys. J. 2007;92:2041–2053. doi: 10.1529/biophysj.106.090266. [DOI] [PMC free article] [PubMed] [Google Scholar]
73.Gunner M.R., Zhu X., Klein M.C. MCCE analysis of the pKas of introduced buried acids and bases in staphylococcal nuclease. Proteins. 2011;79:3306–3319. doi: 10.1002/prot.23124. [DOI] [PubMed] [Google Scholar]
74.Song Y. Exploring conformational changes coupled to ionization states using a hybrid Rosetta-MCCE protocol. Proteins. 2011;79:3356–3363. doi: 10.1002/prot.23146. [DOI] [PubMed] [Google Scholar]
75.Zhou Y.C., Feig M., Wei G.W. Highly accurate biomolecular electrostatics in continuum dielectric environments. J. Comput. Chem. 2008;29:87–97. doi: 10.1002/jcc.20769. [DOI] [PubMed] [Google Scholar]
76.Fennell C.J., Kehoe C.W., Dill K.A. Modeling aqueous solvation with semi-explicit assembly. Proc. Natl. Acad. Sci. USA. 2011;108:3234–3239. doi: 10.1073/pnas.1017130108. [DOI] [PMC free article] [PubMed] [Google Scholar]
77.Kao Y.-H., Fitch C.A., García-Moreno E B. Salt effects on ionization equilibria of histidines in myoglobin. Biophys. J. 2000;79:1637–1654. doi: 10.1016/S0006-3495(00)76414-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
78.Lee K.K., Fitch C.A., García-Moreno E B. Distance dependence and salt sensitivity of pairwise, coulombic interactions in a protein. Protein Sci. 2002;11:1004–1016. doi: 10.1110/ps.4700102. [DOI] [PMC free article] [PubMed] [Google Scholar]
79.Leaver-Fay A., Tyka M., Bradley P. ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. Methods Enzymol. 2011;487:545–574. doi: 10.1016/B978-0-12-381270-4.00019-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
80.Forsyth W.R., Antosiewicz J.M., Robertson A.D. Empirical relationships between protein structure and carboxyl pKa values in proteins. Proteins. 2002;48:388–403. doi: 10.1002/prot.10174. [DOI] [PubMed] [Google Scholar]
81.Edgcomb S.P., Murphy K.P. Variability in the pKa of histidine side-chains correlates with burial within proteins. Proteins. 2002;49:1–6. doi: 10.1002/prot.10177. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. A complete description of the experimental pK_a dataset, the Rosetta score function, and the command-line syntax for performing pK_a predictions with Rosetta

mmc1.pdf^{(317.9KB, pdf)}

[bib1] 1.Sheinerman F.B., Norel R., Honig B. Electrostatic aspects of protein-protein interactions. Curr. Opin. Struct. Biol. 2000;10:153–159. doi: 10.1016/s0959-440x(00)00065-8. [DOI] [PubMed] [Google Scholar]

[bib2] 2.Whitten S.T., García-Moreno E B., Hilser V.J. Local conformational fluctuations can modulate the coupling between proton binding and global structural transitions in proteins. Proc. Natl. Acad. Sci. USA. 2005;102:4282–4287. doi: 10.1073/pnas.0407499102. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib3] 3.Warshel A., Dryga A. Simulating electrostatic energies in proteins: perspectives and some recent studies of pKas, redox, and other crucial functional properties. Proteins. 2011;79:3469–3484. doi: 10.1002/prot.23125. [DOI] [PubMed] [Google Scholar]

[bib4] 4.Mitra R.C., Zhang Z., Alexov E. In silico modeling of pH-optimum of protein-protein binding. Proteins. 2011;79:925–936. doi: 10.1002/prot.22931. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib5] 5.Warren G.L., Andrews C.W., Head M.S. A critical assessment of docking programs and scoring functions. J. Med. Chem. 2006;49:5912–5931. doi: 10.1021/jm050362n. [DOI] [PubMed] [Google Scholar]

[bib6] 6.Dong F., Olsen B., Baker N.A. Computational methods for biomolecular electrostatics. Methods Cell Biol. 2008;84:843–870. doi: 10.1016/S0091-679X(07)84026-X. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib7] 7.Carstensen T., Farrell D., Nielsen J.E. On the development of protein pKa calculation algorithms. Proteins. 2011;79:3287–3298. doi: 10.1002/prot.23091. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib8] 8.Yang A.-S., Gunner M.R., Honig B. On the calculation of pKas in proteins. Proteins. 1993;15:252–265. doi: 10.1002/prot.340150304. [DOI] [PubMed] [Google Scholar]

[bib9] 9.Antosiewicz J., McCammon J.A., Gilson M.K. The determinants of pKas in proteins. Biochemistry. 1996;35:7819–7833. doi: 10.1021/bi9601565. [DOI] [PubMed] [Google Scholar]

[bib10] 10.Havranek J.J., Harbury P.B. Tanford-Kirkwood electrostatics for protein modeling. Proc. Natl. Acad. Sci. USA. 1999;96:11145–11150. doi: 10.1073/pnas.96.20.11145. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib11] 11.Georgescu R.E., Alexov E.G., Gunner M.R. Combining conformational flexibility and continuum electrostatics for calculating pK(a)s in proteins. Biophys. J. 2002;83:1731–1748. doi: 10.1016/S0006-3495(02)73940-4. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib12] 12.Fitch C.A., Karp D.A., García-Moreno E B. Experimental pK(a) values of buried residues: analysis with continuum methods and role of water penetration. Biophys. J. 2002;82:3289–3304. doi: 10.1016/s0006-3495(02)75670-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib13] 13.Song Y., Mao J., Gunner M.R. MCCE2: improving protein pKa calculations with extensive side chain rotamer sampling. J. Comput. Chem. 2009;30:2231–2247. doi: 10.1002/jcc.21222. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib14] 14.Forsyth W.R., Robertson A.D. Insensitivity of perturbed carboxyl pK(a) values in the ovomucoid third domain to charge replacement at a neighboring residue. Biochemistry. 2000;39:8067–8072. doi: 10.1021/bi992967p. [DOI] [PubMed] [Google Scholar]

[bib15] 15.Simonson T., Carlsson J., Case D.A. Proton binding to proteins: pK(a) calculations with explicit and implicit solvent models. J. Am. Chem. Soc. 2004;126:4167–4180. doi: 10.1021/ja039788m. [DOI] [PubMed] [Google Scholar]

[bib16] 16.Kuhn B., Kollman P.A., Stahl M. Prediction of pKa shifts in proteins using a combination of molecular mechanical and continuum solvent calculations. J. Comput. Chem. 2004;25:1865–1872. doi: 10.1002/jcc.20111. [DOI] [PubMed] [Google Scholar]

[bib17] 17.Khandogin J., Brooks C.L., 3rd Toward the accurate first-principles prediction of ionization equilibria in proteins. Biochemistry. 2006;45:9363–9373. doi: 10.1021/bi060706r. [DOI] [PubMed] [Google Scholar]

[bib18] 18.Wallace J.A., Shen J.K. Continuous constant pH molecular dynamics in explicit solvent with pH-based replica exchange. J. Chem. Theory Comput. 2011;7:2617–2629. doi: 10.1021/ct200146j. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib19] 19.Itoh S.G., Damjanović A., Brooks B.R. pH replica-exchange method based on discrete protonation states. Proteins. 2011;79:3420–3436. doi: 10.1002/prot.23176. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib20] 20.Williams S.L., Blachly P.G., McCammon J.A. Measuring the successes and deficiencies of constant pH molecular dynamics: a blind prediction study. Proteins. 2011;79:3381–3388. doi: 10.1002/prot.23136. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib21] 21.Li H., Robertson A.D., Jensen J.H. The determinants of carboxyl pKa values in turkey ovomucoid third domain. Proteins. 2004;55:689–704. doi: 10.1002/prot.20032. [DOI] [PubMed] [Google Scholar]

[bib22] 22.Schaefer P., Riccardi D., Cui Q. Reliable treatment of electrostatics in combined QM/MM simulation of macromolecules. J. Chem. Phys. 2005;123:014905. doi: 10.1063/1.1940047. [DOI] [PubMed] [Google Scholar]

[bib23] 23.Wisz M.S., Hellinga H.W. An empirical model for electrostatic interactions in proteins incorporating multiple geometry-dependent dielectric constants. Proteins. 2003;51:360–377. doi: 10.1002/prot.10332. [DOI] [PubMed] [Google Scholar]

[bib24] 24.Li H., Robertson A.D., Jensen J.H. Very fast empirical prediction and rationalization of protein pKa values. Proteins. 2005;61:704–721. doi: 10.1002/prot.20660. [DOI] [PubMed] [Google Scholar]

[bib25] 25.Olsson M.H.M., Søndergaard C.R., Jensen J.H. PROPKA3: consistent treatment of internal and surface residues in empirical pKa predictions. J. Chem. Theory Comput. 2011;7:525–537. doi: 10.1021/ct100578z. [DOI] [PubMed] [Google Scholar]

[bib26] 26.Krieger E., Nielsen J.E., Vriend G. Fast empirical pKa prediction by Ewald summation. J. Mol. Graph. Model. 2006;25:481–486. doi: 10.1016/j.jmgm.2006.02.009. [DOI] [PubMed] [Google Scholar]

[bib27] 27.Milletti F., Storchi L., Cruciani G. Predicting protein pK(a) by environment similarity. Proteins. 2009;76:484–495. doi: 10.1002/prot.22363. [DOI] [PubMed] [Google Scholar]

[bib28] 28.Huang R.-B., Du Q.-S., Chou K.C. A fast and accurate method for predicting pKa of residues in proteins. Protein Eng. Des. Sel. 2010;23:35–42. doi: 10.1093/protein/gzp067. [DOI] [PubMed] [Google Scholar]

[bib29] 29.You T.J., Bashford D. Conformation and hydrogen ion titration of proteins: a continuum electrostatic model with conformational flexibility. Biophys. J. 1995;69:1721–1733. doi: 10.1016/S0006-3495(95)80042-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib30] 30.van Vlijmen H.W.T., Schaefer M., Karplus M. Improving the accuracy of protein pKa calculations: conformational averaging versus the average structure. Proteins. 1998;33:145–158. doi: 10.1002/(sici)1097-0134(19981101)33:2<145::aid-prot1>3.0.co;2-i. [DOI] [PubMed] [Google Scholar]

[bib31] 31.Witham S., Talley K., Alexov E. Developing hybrid approaches to predict pKa values of ionizable groups. Proteins. 2011;79:3389–3399. doi: 10.1002/prot.23097. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib32] 32.Beroza P., Case D.A. Including side chain flexibility in continuum electrostatic calculations of protein titration. J. Phys. Chem. 1996;100:20156–20163. [Google Scholar]

[bib33] 33.Bradley P., Misura K.M.S., Baker D. Toward high-resolution de novo structure prediction for small proteins. Science. 2005;309:1868–1871. doi: 10.1126/science.1113801. [DOI] [PubMed] [Google Scholar]

[bib34] 34.Sircar A., Chaudhury S., Gray J.J. A generalized approach to sampling backbone conformations with RosettaDock for CAPRI rounds 13-19. Proteins. 2010;78:3115–3123. doi: 10.1002/prot.22765. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib35] 35.Chaudhury S., Berrondo M., Gray J.J. Benchmarking and analysis of protein docking performance in Rosetta v3.2. PLoS ONE. 2011;6:e22477. doi: 10.1371/journal.pone.0022477. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib36] 36.Jiang L., Althoff E.A., Baker D. De novo computational design of retro-aldol enzymes. Science. 2008;319:1387–1391. doi: 10.1126/science.1152692. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib37] 37.Röthlisberger D., Khersonsky O., Baker D. Kemp elimination catalysts by computational enzyme design. Nature. 2008;453:190–195. doi: 10.1038/nature06879. [DOI] [PubMed] [Google Scholar]

[bib38] 38.Makrodimitris K., Masica D.L., Gray J.J. Structure prediction of protein-solid surface interactions reveals a molecular recognition motif of statherin for hydroxyapatite. J. Am. Chem. Soc. 2007;129:13713–13722. doi: 10.1021/ja074602v. [DOI] [PubMed] [Google Scholar]

[bib39] 39.Masica D.L., Schrier S.B., Gray J.J. De novo design of peptide-calcite biomineralization systems. J. Am. Chem. Soc. 2010;132:12252–12262. doi: 10.1021/ja1001086. [DOI] [PubMed] [Google Scholar]

[bib40] 40.Kortemme T., Joachimiak L.A., Baker D. Computational redesign of protein-protein interaction specificity. Nat. Struct. Mol. Biol. 2004;11:371–379. doi: 10.1038/nsmb749. [DOI] [PubMed] [Google Scholar]

[bib41] 41.Kuhlman B., Dantas G., Baker D. Design of a novel globular protein fold with atomic-level accuracy. Science. 2003;302:1364–1368. doi: 10.1126/science.1089427. [DOI] [PubMed] [Google Scholar]

[bib42] 42.Kuhlman B., Baker D. Native protein sequences are close to optimal for their structures. Proc. Natl. Acad. Sci. USA. 2000;97:10383–10388. doi: 10.1073/pnas.97.19.10383. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib43] 43.Gray J.J., Moughon S., Baker D. Protein-protein docking with simultaneous optimization of rigid-body displacement and side-chain conformations. J. Mol. Biol. 2003;331:281–299. doi: 10.1016/s0022-2836(03)00670-3. [DOI] [PubMed] [Google Scholar]

[bib44] 44.Song Y., Tyka M., Baker D. Structure-guided forcefield optimization. Proteins. 2011;79:1898–1909. doi: 10.1002/prot.23013. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib45] 45.Kortemme T., Baker D. A simple physical model for binding energy hot spots in protein-protein complexes. Proc. Natl. Acad. Sci. USA. 2002;99:14116–14121. doi: 10.1073/pnas.202485799. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib46] 46.Kellogg E.H., Leaver-Fay A., Baker D. Role of conformational sampling in computing mutation-induced changes in protein structure and stability. Proteins. 2011;79:830–838. doi: 10.1002/prot.22921. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib47] 47.Nielsen J.E., Gunner M.R., García-Moreno B.E. The pKa Cooperative: a collaborative effort to advance structure-based calculations of pKa values and electrostatic effects in proteins. Proteins. 2011;79:3249–3259. doi: 10.1002/prot.23194. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib48] 48.Dunbrack R.L., Jr., Cohen F.E. Bayesian statistical analysis of protein side-chain rotamer preferences. Protein Sci. 1997;6:1661–1681. doi: 10.1002/pro.5560060807. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib49] 49.Onufriev A., Case D.A., Ullmann G.M. A novel view of pH titration in biomolecules. Biochemistry. 2001;40:3413–3419. doi: 10.1021/bi002740q. [DOI] [PubMed] [Google Scholar]

[bib50] 50.Park M.-S., Gao C., Stern H.A. Estimating binding affinities by docking/scoring methods using variable protonation states. Proteins. 2011;79:304–314. doi: 10.1002/prot.22883. [DOI] [PubMed] [Google Scholar]

[bib51] 51.Lazaridis T., Karplus M. Effective energy function for proteins in solution. Proteins. 1999;35:133–152. doi: 10.1002/(sici)1097-0134(19990501)35:2<133::aid-prot1>3.0.co;2-n. [DOI] [PubMed] [Google Scholar]

[bib52] 52.Simons K.T., Ruczinski I., Baker D. Improved recognition of native-like protein structures using a combination of sequence-dependent and sequence-independent features of proteins. Proteins. 1999;34:82–95. doi: 10.1002/(sici)1097-0134(19990101)34:1<82::aid-prot7>3.0.co;2-a. [DOI] [PubMed] [Google Scholar]

[bib53] 53.Kortemme T., Morozov A.V., Baker D. An orientation-dependent hydrogen bonding potential improves prediction of specificity and structure for proteins and protein-protein complexes. J. Mol. Biol. 2003;326:1239–1259. doi: 10.1016/s0022-2836(03)00021-4. [DOI] [PubMed] [Google Scholar]

[bib54] 54.Brooks B.R., Bruccoleri R.E., Olafson B.D., States D.J., Swaminathan S. CHARMM: a program for macromolecular energy, minimization, and dynamics calculations. J. Comput. Chem. 1983;4:187–217. [Google Scholar]

[bib55] 55.Brooks B.R., Brooks C.L., 3rd, Karplus M. CHARMM: the biomolecular simulation program. J. Comput. Chem. 2009;30:1545–1614. doi: 10.1002/jcc.21287. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib56] 56.Rabenstein B., Ullmann G.M., Knapp E.-W. Energetics of electron-transfer and protonation reactions of the quinones in the photosynthetic reaction center of Rhodopseudomonas viridis. Biochemistry. 1998;37:2488–2495. doi: 10.1021/bi971921y. [DOI] [PubMed] [Google Scholar]

[bib57] 57.Kesvatera T., Jönsson B., Linse S. Focusing of the electrostatic potential at EF-hands of calbindin D(9k): titration of acidic residues. Proteins. 2001;45:129–135. doi: 10.1002/prot.1132. [DOI] [PubMed] [Google Scholar]

[bib58] 58.Vocadlo D.J., Davies G.J., Withers S.G. Catalysis by hen egg-white lysozyme proceeds via a covalent intermediate. Nature. 2001;412:835–838. doi: 10.1038/35090602. [DOI] [PubMed] [Google Scholar]

[bib59] 59.McIntosh L.P., Hand G., Withers S.G. The pKa of the general acid/base carboxyl group of a glycosidase cycles during catalysis: a 13C-NMR study of Bacillus circulans xylanase. Biochemistry. 1996;35:9958–9966. doi: 10.1021/bi9613234. [DOI] [PubMed] [Google Scholar]

[bib60] 60.Olsson M.H.M. Protein electrostatics and pKa blind predictions; contribution from empirical predictions of internal ionizable residues. Proteins. 2011;79:3333–3345. doi: 10.1002/prot.23113. [DOI] [PubMed] [Google Scholar]

[bib61] 61.Misura K.M.S., Baker D. Progress and challenges in high-resolution refinement of protein structure models. Proteins. 2005;59:15–29. doi: 10.1002/prot.20376. [DOI] [PubMed] [Google Scholar]

[bib62] 62.Chaudhury S., Gray J.J. Conformer selection and induced fit in flexible backbone protein-protein docking using computational and NMR ensembles. J. Mol. Biol. 2008;381:1068–1087. doi: 10.1016/j.jmb.2008.05.042. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib63] 63.Isom D.G., Castañeda C.A., García-Moreno B. Large shifts in pKa values of lysine residues buried inside a protein. Proc. Natl. Acad. Sci. USA. 2011;108:5260–5265. doi: 10.1073/pnas.1010750108. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib64] 64.Harms M.J., Schlessman J.L., García-Moreno B. A buried lysine that titrates with a normal pKa: role of conformational flexibility at the protein-water interface as a determinant of pKa values. Protein Sci. 2008;17:833–845. doi: 10.1110/ps.073397708. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib65] 65.Isom D.G., Castañeda C.A., García-Moreno E B. Charges in the hydrophobic interior of proteins. Proc. Natl. Acad. Sci. USA. 2010;107:16096–16100. doi: 10.1073/pnas.1004213107. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib66] 66.Karp D.A., Stahley M.R., García-Moreno B. Conformational consequences of ionization of Lys, Asp, and Glu buried at position 66 in staphylococcal nuclease. Biochemistry. 2010;49:4138–4146. doi: 10.1021/bi902114m. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib67] 67.Takayama Y., Castañeda C.A., Iwahara J. Direct evidence for deprotonation of a lysine side chain buried in the hydrophobic core of a protein. J. Am. Chem. Soc. 2008;130:6714–6715. doi: 10.1021/ja801731g. [DOI] [PubMed] [Google Scholar]

[bib68] 68.Castañeda C.A., Fitch C.A., García-Moreno B.E. Molecular determinants of the pKa values of Asp and Glu residues in staphylococcal nuclease. Proteins. 2009;77:570–588. doi: 10.1002/prot.22470. [DOI] [PubMed] [Google Scholar]

[bib69] 69.Denisov V.P., Schlessman J.L., Halle B. Stabilization of internal charges in a protein: water penetration or conformational change? Biophys. J. 2004;87:3982–3994. doi: 10.1529/biophysj.104.048454. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib70] 70.Harms M.J., Castañeda C.A., García-Moreno E B. The pK(a) values of acidic and basic residues buried at the same internal location in a protein are governed by different factors. J. Mol. Biol. 2009;389:34–47. doi: 10.1016/j.jmb.2009.03.039. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib71] 71.Dwyer J.J., Gittis A.G., García-Moreno E B. High apparent dielectric constants in the interior of a protein reflect water penetration. Biophys. J. 2000;79:1610–1620. doi: 10.1016/S0006-3495(00)76411-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib72] 72.Karp D.A., Gittis A.G., García-Moreno E B. High apparent dielectric constant inside a protein reflects structural reorganization coupled to the ionization of an internal Asp. Biophys. J. 2007;92:2041–2053. doi: 10.1529/biophysj.106.090266. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib73] 73.Gunner M.R., Zhu X., Klein M.C. MCCE analysis of the pKas of introduced buried acids and bases in staphylococcal nuclease. Proteins. 2011;79:3306–3319. doi: 10.1002/prot.23124. [DOI] [PubMed] [Google Scholar]

[bib74] 74.Song Y. Exploring conformational changes coupled to ionization states using a hybrid Rosetta-MCCE protocol. Proteins. 2011;79:3356–3363. doi: 10.1002/prot.23146. [DOI] [PubMed] [Google Scholar]

[bib75] 75.Zhou Y.C., Feig M., Wei G.W. Highly accurate biomolecular electrostatics in continuum dielectric environments. J. Comput. Chem. 2008;29:87–97. doi: 10.1002/jcc.20769. [DOI] [PubMed] [Google Scholar]

[bib76] 76.Fennell C.J., Kehoe C.W., Dill K.A. Modeling aqueous solvation with semi-explicit assembly. Proc. Natl. Acad. Sci. USA. 2011;108:3234–3239. doi: 10.1073/pnas.1017130108. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib77] 77.Kao Y.-H., Fitch C.A., García-Moreno E B. Salt effects on ionization equilibria of histidines in myoglobin. Biophys. J. 2000;79:1637–1654. doi: 10.1016/S0006-3495(00)76414-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib78] 78.Lee K.K., Fitch C.A., García-Moreno E B. Distance dependence and salt sensitivity of pairwise, coulombic interactions in a protein. Protein Sci. 2002;11:1004–1016. doi: 10.1110/ps.4700102. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib79] 79.Leaver-Fay A., Tyka M., Bradley P. ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. Methods Enzymol. 2011;487:545–574. doi: 10.1016/B978-0-12-381270-4.00019-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib80] 80.Forsyth W.R., Antosiewicz J.M., Robertson A.D. Empirical relationships between protein structure and carboxyl pKa values in proteins. Proteins. 2002;48:388–403. doi: 10.1002/prot.10174. [DOI] [PubMed] [Google Scholar]

[bib81] 81.Edgcomb S.P., Murphy K.P. Variability in the pKa of histidine side-chains correlates with burial within proteins. Proteins. 2002;49:1–6. doi: 10.1002/prot.10177. [DOI] [PubMed] [Google Scholar]

PERMALINK

Rapid Calculation of Protein pK_a Values Using Rosetta

Krishna Praneeth Kilambi

Jeffrey J Gray

Abstract

Introduction

Methods