Abstract
The role of pH in regulating biological activity is ubiquitous, and understanding pH-mediated activity has traditionally relied on analyzing static biomolecular structures of highly populated ground states solved near physiological pH. However, recent advances have shown the increasing importance of transiently populated states, the characterization of which is extremely challenging but made plausible with the development of techniques such as relaxation dispersion NMR spectroscopy. To unlock the pH dependence of these transient states with atomistic-level details, we applied the recently developed explicit solvent constant pH molecular dynamics (CPHMDMSλD) framework to a series of staphylococcal nuclease (SNase) mutants with buried ionizable residues and probed their dynamics in different pH environments. Among our key findings is the existence of open states in all SNase mutants containing “buried” residues with highly shifted pKa’s, where local solvation around the protonation site was observed. The calculated pKa demonstrated good agreement with experimental pKa’s, with a low average unsigned error of 1.3 pKa units and correlation coefficient R2 = 0.78. Sampling both open and closed states in their respective pH range, where they are expected to be dominant, was necessary to reproduce experimental pKa’s, and in the most extreme examples of pKa shifts measured, it can be interpreted that the open-state structures are transient at physiological pH, contributing a small population of 1–2%. This suggests that buried ionizable residues can trigger conformational fluctuations that may be observed as transient-state structures at physiological pH. Furthermore, the coupled relationship of both open and closed states and their role in recapitulating macroscopic experimental observables suggest that structural analysis of buried residues may benefit from looking at structural pairs, as opposed to the conventional approach of looking at a single static ground-state conformation.
One of the critical regulators of biological activity is pH, and misregulated pH is a hallmark of cancer-related physiology.5 Ionizable groups that change their protonation states as a function of pH and their local electrostatic environment are often the key residues that govern pH-dependent activity. Due to their charged nature, the majority of them are expressed near the protein surface; however, in some systems, such as membrane proteins, they can be found in more hydrophobic environments. Until recently, the dominant approach to probe the mechanism of such pH-dependent activity has been focused on the analysis of static structures typically solved at physiological pH. However, recent studies have demonstrated the increasingly important role of biomolecular excited-state structures, typically referring to populations that are 0.1% to a few percent of the total population under physiological conditions6 (hereon referred to as “transient states” to avoid confusion with electronically excited states), in protein folding7 and ligand binding.8 While the list of examples of pH-dependent transient states is not that long, there is precedence for their importance, such as in membrane fusion involving the influenza hemagglutinin HA2 subunit.10
The characterization of transient states is beyond the detection limits of conventional experimental techniques, but recent developments in relaxation dispersion NMR spectroscopy11 and room-temperature X-ray crystallography12 have made this endeavor feasible. Probing pH-dependent transient states will undoubtedly be similarly challenging. In this context, we report on the application of novel computational techniques to model pH-dependent dynamics of proteins and to probe their transiently populated minor states, which can serve as complementary tools in the field since simulations are not subject to the same detection limits as experiments. One major approach to model accurate pH-dependent dynamics is the constant-pH molecular dynamics (CPHMD) framework, where MD simulation is coupled to the protonation state of the titrating residue.13 In the continuous variant of CPHMD used in this study, the titration coordinate represents an instantaneous microstate, and it is propagated continuously between the protonated and unprotonated states using the λ dynamics approach.16 We focus our investigation on modeling the pH-dependent dynamics of a series of engineered mutants of staphylococcal nuclease (SNase) with Lys residues buried in the hydrophobic core18 using the recently developed explicit solvent CPHMDMSλD framework for proteins.17 While the effectiveness of CPHMD has been demonstrated on numerous systems,19 almost all work reported to date was based on an implicit solvent model.13 Applications of explicit solvent CPHMD on biomolecules thus far have only a few reported successes.17,24 More importantly, none of the existing works have attempted a comprehensive investigation of buried ionizable residues, where, we hypothesize, the contrast in electrostatic environment between a hydrophobic pocket and a solvent-exposed environment will provide a key driving force for pH-mediated conformational fluctuations and, possibly, transient states formation.
As shown in Figure 1a, the mutants for this study comprise a set of diverse mutation sites, with residues varying in the magnitudes of pKa shifts. In addition, experiments have demonstrated that, for the mutants that we selected, the titration of the internal Lys is decoupled from the ionization of Asp and Glu residues.18 Thus, to facilitate convergence within a reasonable time, we titrated only the buried Lys and performed an initial check of the accuracy of our simulations by calculating its pKa value, using the crystallographic structures as the starting models (see Supporting Information (SI) for details). As summarized in Figure 1b and Table S1, most of the buried residues have a predicted pKa < 1, which is shifted by nearly 10 pKa units from the reference value of 10.4. pKa calculations on the V66D and V66E mutants produced a predicted pKa > 14, which is dramatically shifted from their reference values of 4.0 and 4.4, respectively. In both cases, the pKa shifted toward the direction stabilizing the neutral state of each titrating species (upward for Asp/Glu; downward for Lys). The dramatic pKa shifts observed in our simulations mirror the experimental spectrophotometric measurements of Brønsted acids and bases, where a similar pKa shift of 10–20 units was recorded when moving from an aqueous to an organic environment.28 For example, acetic acid, which has the same functional group as Asp and Glu, has its pKa value shifted from 4.8 to ∼23 in water (ε = 80) vs acetonitrile (ε = 37).29 Even though the dielectric constant of a typical hydrophobic pocket in SNase, which ranges from 4 to 20,30 is lower than that of acetonitrile, such dramatic pKa shifts of more than 10 units have never been observed in the protein. In fact, some of the largest pKa shifts for Asp, Glu, and Lys reported in the literature are perturbed by a mere 5 units from their reference values.18,33 Thus, there is an inconsistency in the magnitude of experimental pKa shifts observed in SNase compared to historical experimental pKa shifts observed in organic solvents, even though both environments have similarly low dielectric values, which indicates that additional factors have to account for the apparent inconsistency.
Strong coupling between pKa and conformational sampling has been previously reported,34 and, while the importance of conformational sampling has been observed in some SNase mutants, no connection to experimental pKa values has been reported.38 Based on these observations, we performed an additional series of extended-run explicit solvent pH-REX CPHMDMSλD simulations on the structures that were pre-equilibrated for 50 ns at high and low pH (see SI for details). The findings, summarized in Figure 1c and Table S1, demonstrate significantly improved results, with the averaged unsigned error (AUE) reduced from 3.9 to 1.2 pKa units. We observed that the longer equilibration allowed the sampling of “open” solvated structures that were critical for reproducing the experimental pKa values reported for all Lys mutants with highly shifted pKa values. This is in contrast to the “closed” crystallographic-like structure where there is little to no water present within 3 Å of the protonation site (Figure 2a). Using the V66K mutant as an example, we demonstrate that SNase conformations sampled in our simulations interconvert between closed and open states (Figure 2b,c) when the external pH is close to the calculated pKa value. While the coupling between pKa and conformational sampling has been reported for a few isolated examples,36,37 and the importance of sampling such alternative conformational states has been previously postulated by experiments,39 there has been no comprehensive proof for their role. Our work presents the first interpretation that pH-dependent transient states exist and may be of general importance for proteins with buried ionizable groups. For all mutants with highly shifted pKa values, we observed that protonation of the internal Lys was concomitant with an increase in local solvation around the protonation site, as illustrated in Figure 2a. The backbone RMSD of the entire structure between both closed and open states in these mutants is small, ranging from 1.2 to 1.9 Å (also see Figure S1), which suggests that the conformational relaxation to accommodate a buried charged residue does not require significant structural rearrangement. Our observations are consistent with experimental measurements indicating that buried ionizable residues in SNase are readily accommodated without any special structural adaptation or distortion to the overall protein conformation.18,33
Having established that buried ionizable residues can trigger pH-dependent structural fluctuations, we extend our analysis to determine the relevance of these alternative states at pH 7. Using a two-state model that assumes a conversion between one dominant open state and one dominant closed state, we derive an equation (see SI for derivation) describing the ratio of open to closed states, ROC:
This function can be decomposed into pH-dependent (KpH) and pH-independent (K0) terms. The KpH term depends on the microscopic pKa of each state (pKclosed, pKopen) and external pH. The K0 term can be physically related to the free energy difference of the open and closed states in their protonated form (SI). However, as each form favors opposing protonation states, this free energy is usually not measured by experiments. Thus, it may be advantageous to express K0 as a function of the system’s readily measured macroscopic or apparent pKa (pKapp) and the microscopic pKa of each state (pKclosed, pKopen). Based on the ROC equation, one may also derive the fraction of the open state (Fopen), a more intuitive metric for open states at a specific pH:
Due to the rapid dynamics and/or low population of the minor state, it is often beyond the detection limits of experiments to establish the pKa values of each microscopic state.39 However, because the open state is solvent-exposed, pKopen may be approximated as the reference pKa of the free amino acid. Using the range of pKa shifts of 10–20 units recorded from spectrophotometric measurements of organic acids and bases in a low dielectric solvent,28 we have conservatively assigned pKclosed to be shifted by 10. As shown in Figure 3, one can use the ROC equation to derive the pH-dependent fraction of the open state for a series of hypothetical pKapp for buried Lys or Glu.
Our analysis indicates that pH-dependent transient open states may contribute as much as 2% of the total population at pH 7 when the apparent pKa of Lys is shifted by as much as 5 units, which appears to be the upper limit of pKa shift as recorded in current literature.18,33 For Asp and Glu, a pKapp shifted by 5 pKa units represents a 1% contribution of the transient state at pH 7. Thus, for the residues with highly shifted pKa values, the low-population transient states are likely to contribute significantly to the apparent pKa and, thus, need to be elucidated to correctly compute the apparent pKa. Although the existence of transient states involving buried ionizable groups does not necessarily imply a functional relevance, there is increasing precedence that the inclusion of transient states is needed to fully account for biological properties.6−8 In the context of our work, we suggest that the effect of such pH-dependent transient states will be pronounced when an ionizable group transitions between hydrophilic and hydrophobic environments, such as in membrane fusion processes, where activated/transient states have been postulated to play a crucial role.10 In addition, traditional studies of catalytic mechanisms have always assumed that crystallographic structures correlate to measured pKa, but, as we have shown, that may not be true for buried residues with highly shifted pKa values. Moreover, the coupled relationship between open and closed states and their role in recapitulating macroscopic experimental observables suggests that structural analysis of buried residues should be performed from the perspective of looking at structural pairs, as opposed to the conventional approach of looking at a single static ground-state conformation. For such analyses, the equations we have provided will prove useful for a quick, “back of the envelope” estimation of the population of proposed transient states. For example, one could use the experimentally measured pKapp to select the appropriate curve in Figure 3, and use it to estimate the fraction of the open state at a given pH, as a means to evaluate the plausibility of experimental characterization within the detection limits of the methods employed. Alternatively, it could also be used to estimate the pH range where the population of proposed transient states will enter the detection range of experiments.
Lastly, we investigate the various computational models, specifically previous CPHMD implementations, to ascertain the robustness of predictions obtained from simulations. One insightful observation is that our calculated pKa for the N100K mutant of SNase is 6.6, which is close to the calculated pKa of 7.0 reported by Shen and co-workers,37 despite the differences in the solvation model and cutoff schemes utilized. This suggests that the CPHMD framework is not overly sensitive to specifics of the simulation setup. This is because CPHMD calculates the free energy of protonation relative to a reference compound, and, as long as the simulations of the protein and the reference are performed under identical conditions, the differences originating from the simulation setup approximately cancel out. To test this hypothesis, we compare our pKa predictions from explicit solvent pH-REX CPHMDMSλD simulations with those obtained from the GBSW14 implicit solvent pH-REX CPHMD framework, using the refined protocol presented in this paper. Our results (Figure 4a,b and Table S2) show excellent correlation of R2 = 0.89 between the pKa’s predicted from both explicit and implicit models. However, unlike the explicit solvent CPHMDMSλD simulations, which resulted in a pKa shift of more than 10 units when only the closed state was used, the implicit solvent CPHMD simulations produced pKa shifts that were smaller in magnitude. The van der Waals surface representation used to define the solute–solvent dielectric boundary in GBSW is known to form small crevices of high dielectric region between atoms, which results in the underestimation of the Born radii and overestimation of the solvation free energy.40 Therefore, despite the actual hydrophobicity of the environment near the protonating site, the GBSW model may be too “wet”, causing a smaller pKa shift. Alternatively, the same effect can be achieved as a result of the faster conformational dynamics in the GBSW model, which may sample both open and closed states more frequently.
To distinguish between these two possibilities, additional simulations, with the structures of the V66K, V99K, L125K, and I92K mutants rigidified by applying harmonic restraints to all heavy atoms, were performed, and the resulting pKa values were similar to those calculated without restraints (Table S2). These results suggest that smaller pKa shift in the implicit solvent primarily stems from the “wetness” of the GBSW model, rather than its faster conformational dynamics. Finally, we investigated the effect of using the recently released CHARMM36 all-atom force field for proteins on our predictions, as it has been previously shown to yield superior reproduction of experimental dynamic data.41 We recalculated the biasing potentials for the common titrating residues of proteins using the CHARMM36 force field (Table S4), and revised pKa values, as shown Figure 4c, indicate an improvement in the predictive performance over the older CHARMM22/CMAP force field, with R2 increasing from 0.52 to 0.78, and the slope of the regression moving from 0.52 to 1.09, while maintaining the same level of accuracy with an average unsigned error of 1.3 pKa units.
In conclusion, we have applied the recently developed explicit solvent constant pH molecular dynamics framework to simulate the pH-dependent dynamics of a comprehensive set of SNase mutants with buried ionizable residues that have varying degrees of pKa shifts. Among our key findings are that a buried charged residue cannot be accommodated inside a purely hydrophobic pocket, and that an open-state structure for these “buried” residues, characterized by local solvation around the protonating site, was observed in all SNase mutants with highly shifted pKa’s. At physiological pH, buried ionizable groups with large pKa shifts have transiently populated open states, where they contribute a small but non-zero population of 1–2%. Nevertheless, sampling these open states is a necessary condition for accurately reproducing experimental pKa measurements, with which calculated pKa from our explicit solvent CPHMDMSλD simulations demonstrated good agreement, with a low average unsigned error of 1.3 pKa units and correlation coefficient R2 = 0.78. This work provides the first validation that buried ionizable residues can readily trigger pH-mediated conformational fluctuations that may be observed as transient-state structures at physiological pH. Lastly, the discovery of a coupled relationship of both open and closed states and their role in recapitulating macroscopic experimental observables suggests that structural analysis of buried residues may benefit from looking at structural pairs, as opposed to the conventional approach of looking at a single static ground-state conformation.
Acknowledgments
This work was supported by grants from NIH (GM037554 and GM057513) and Howard Hughes Medical Institute (HHMI International Student Research Fellowship).
Supporting Information Available
Details of computational protocol, derivation of ROC equation, and additional pKa calculations. This material is available free of charge via the Internet at http://pubs.acs.org.
The authors declare no competing financial interest.
Funding Statement
National Institutes of Health, United States
Supplementary Material
References
- Webb B. A.; Chimenti M.; Jacobson M. P.; Barber D. L. Nat. Rev. Cancer 2011, 11, 671. [DOI] [PubMed] [Google Scholar]
- Sekhar A.; Kay L. E. Proc. Natl. Acad. Sci. U.S.A. 2013, 110, 12867. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Korzhnev D. M.; Religa T. L.; Banachewicz W.; Fersht A. R.; Kay L. E. Science 2010, 329, 1312. [DOI] [PubMed] [Google Scholar]
- Tzeng S. R.; Kalodimos C. G. Nat. Chem. Biol. 2013, 9, 462. [DOI] [PubMed] [Google Scholar]
- Lorieau J. L.; Louis J. M.; Schwieters C. D.; Bax A. Proc. Natl. Acad. Sci. U.S.A. 2012, 109, 19994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vallurupalli P.; Hansen D. F.; Stollar E.; Meirovitch E.; Kay L. E. Proc. Natl. Acad. Sci. U.S.A. 2007, 104, 18473. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fraser J. S.; van den Bedem H.; Samelson A. J.; Lang P. T.; Holton J. M.; Echols N.; Alber T. Proc. Natl. Acad. Sci. U.S.A. 2011, 108, 16247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- a Baptista A. M.; Teixeira V. H.; Soares C. M. J. Chem. Phys. 2002, 117, 4184. [Google Scholar]; b Lee M. S.; Salsbury F. R.; Brooks C. L. III. Proteins: Struct., Funct., Bioinf. 2004, 56, 738. [DOI] [PubMed] [Google Scholar]; c Meng Y.; Roitberg A. E. J. Chem. Theory Comput. 2010, 6, 1401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Knight J. L.; Brooks C. L. III. J. Comput. Chem. 2009, 30, 1692. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Isom D. G.; Castaneda C. A.; Cannon B. R.; Garcia-Moreno B. Proc. Natl. Acad. Sci. U.S.A. 2011, 108, 5260. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goh G. B.; Hulbert B. S.; Zhou H.; Brooks C. L. III. Proteins: Struct., Funct., Bioinf. 2014, 10.1002/prot.24499. [DOI] [PMC free article] [PubMed] [Google Scholar]
- a Khandogin J.; Brooks C. L. III. Proc. Natl. Acad. Sci. U.S.A. 2007, 104, 16880. [DOI] [PMC free article] [PubMed] [Google Scholar]; b Zhang B. W.; Brunetti L.; Brooks C. L. III. J. Am. Chem. Soc. 2011, 133, 19393. [DOI] [PMC free article] [PubMed] [Google Scholar]; c Wallace J. A.; Shen J. K. J. Phys. Chem. Lett. 2012, 3, 658. [DOI] [PMC free article] [PubMed] [Google Scholar]; d Law S. M.; Zhang B. W.; Brooks C. L. III. Protein Sci. 2013, 22, 595. [DOI] [PMC free article] [PubMed] [Google Scholar]; e Laricheva E. N.; Arora K.; Knight J. L.; Brooks C. L. III. J. Am. Chem. Soc. 2013, 135, 10906. [DOI] [PMC free article] [PubMed] [Google Scholar]
- a Goh G. B.; Knight J. L.; Brooks C. L. III. J. Chem. Theory Comput. 2013, 9, 935. [DOI] [PMC free article] [PubMed] [Google Scholar]; b Goh G. B.; Knight J. L.; Brooks C. L. III. J. Phys. Chem. Lett. 2013, 4, 760. [DOI] [PMC free article] [PubMed] [Google Scholar]; c Nikolova E. N.; Goh G. B.; Brooks C. L. III; Al-Hashimi H. M. J. Am. Chem. Soc. 2013, 135, 6766. [DOI] [PMC free article] [PubMed] [Google Scholar]; d Chen W.; Wallace J. A.; Yue Z.; Shen J. K. Biophys. J. 2013, 105, 15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- a Kaljurand I.; Kutt A.; Soovali L.; Rodima T.; Maemets V.; Leito I.; Koppel I. A. J. Org. Chem. 2005, 70, 1019. [DOI] [PubMed] [Google Scholar]; b Kutt A.; Leito I.; Kaljurand I.; Soovali L.; Vlasov V. M.; Yagupolskii L. M.; Koppel I. A. J. Org. Chem. 2006, 71, 2829. [DOI] [PubMed] [Google Scholar]
- a Goh G. B.; Garcia-Moreno B.; Brooks C. L. III. J. Am. Chem. Soc. 2011, 133, 20072. [DOI] [PMC free article] [PubMed] [Google Scholar]; b Simonson T. J. Chem. Theory Comput. 2013, 9, 4603. [DOI] [PubMed] [Google Scholar]
- Isom D. G.; Castaneda C. A.; Velu P. D.; Garcia-Moreno B. Proc. Natl. Acad. Sci. U.S.A. 2010, 107, 16096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- a Kato M.; Warshel A. J. Phys. Chem. B 2006, 110, 11566. [DOI] [PubMed] [Google Scholar]; b Ghosh N.; Cui Q. J. Phys. Chem. B 2008, 112, 8387. [DOI] [PMC free article] [PubMed] [Google Scholar]; c Di Russo N. V.; Estrin D. A.; Marti M. A.; Roitberg A. E. PLoS Comput. Biol. 2012, 8, e1002761. [DOI] [PMC free article] [PubMed] [Google Scholar]; d Shi C. Y.; Wallace J. A.; Shen J. K. Biophys. J. 2012, 102, 1590. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Damjanovic A.; Brooks B. R.; Garcia-Moreno E. B. J. Phys. Chem. A 2011, 115, 4042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chimenti M. S.; Khangulov V. S.; Robinson A. C.; Heroux A.; Majumdar A.; Schlessman J. L.; Garcia-Moreno B. Structure 2012, 20, 1071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Khandogin J.; Brooks C. L. III. Biochemistry 2006, 45, 9363. [DOI] [PubMed] [Google Scholar]
- Best R. B.; Zhu X.; Shim J.; Lopes P. E. M.; Mittal J.; Feig M.; MacKerell A. D. J. Chem. Theory Comput. 2012, 8, 3257. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.