Abstract
Tautomeric identification by direct observation of 15N chemical shifts of the imidazole ring of histidine (His) has become a common practice in NMR spectroscopy. However, such applications require knowledge of the “canonical” limiting values of the 15N chemical shift of the imidazole ring of His in which each form of His, namely the protonated (H+) and the tautomeric Nε2-H and Nδ1-H forms, respectively, is present to the extent of 100%. So far, the adopted canonical limiting values of the 15N chemical shift have been those available from model compounds. As to whether these canonical values reflect those of the individual pure forms of His is investigated here by carrying out an analysis of the second-order shielding differences, ΔΔ = |Δε − Δδ, with Δξ(ξ = ε or δ) being the DFT-computed average shielding differences between the two nitrogens of the imidazole ring of His in each pure tautomeric form. In the high-pH limit the results indicate that the (i) ΔΔ values from the DFT-computed shielding, but not from the commonly-used canonical limiting values, are in closer agreement with those obtained with experimental chemical shift data from model compounds in solution and solid-state NMR; and (ii) commonly-used canonical limiting values of the 15N chemical shifts lead to an average tautomeric equilibrium constant that differs by a factor of ~2.6 from the one computed by using DFT-based 15N limiting values, raising concern about the practice of using canonical limiting 15N values; this can be avoided by reporting tautomeric equilibrium constants computed by using only limiting 15N values for the Nε2-H tautomer.
Introduction
Since chemical shifts were first observed by Arnold et al., in 1951 (1), using Nuclear Magnetic Resonance (NMR) spectroscopy, Mandel (2), in a pioneering NMR experiment, in 1965, first detected the imidazole (C2) protons of histidine (His) residues in Ribonuclease A and, soon after that in 1966, Bradbury & Scheraga (3) in a seminal NMR spectroscopic work were able to distinguish between the histidine residues of Ribonuclease A; namely, they resolved the peaks of three out of four histidines of this enzyme. Afterward, use of NMR spectroscopy, X-ray crystallography and theoretical studies, based on quantum-chemistry calculations, continuously evolved in the ability to determine the properties of the histidine residues in solution and in the solid state (4–27). The reason for this persistent interest in His is due to the fact that this residue is unique among all 20 naturally occurring amino acids, among other reasons, because ~50% of all enzymes use His in their active sites (19). This is, mainly, because of the versatility of its imidazole His ring, which includes two neutral, chemically-distinct forms (Nδ1−H and Nε2−H tautomers) and a charged H+ form, with one form favored over the other by the protein environment and pH. In addition, His with a pKo of 6.6 (28) is the only ionizable residue that titrates around neutral pH, allowing the nonprotonated nitrogen of its imidazole ring to serve as an effective ligand for metal binding (23), or to play a crucial role in the proton-transfer process (26).
As noted by Markley (6), there are mainly three factors that contribute to the observed chemical shifts of the nuclei imidazole ring of His, namely: (a) the “intrinsic” chemical shifts, which can be determined theoretically, at the DFT-level of theory, from model peptides, e.g., by using a model tripeptide with the sequence: Ac-GHG-NMe (29). The conformational-average intrinsic values of the 15N chemical shifts, for a given His form, are averages over all conformations, i.e., ~35,000, for which the shieldings were computed here at the DFT level of theory, namely for each nitrogen in the imidazole ring of His; (b) he “local environment” of each His residue in the protein, which can be very different for different His residues in the sequence; with the tautomeric coexistence being a function of the “local environment” of each His residue (27); and (c) the “proton binding/release equilibrium” of the imidazole His ring which, at a given fixed pH, is influenced by the presence of nearby positively or negatively charged side chains that will lower or raise the pK´ of a given His; overall, this factor determines the fraction of the protonated His form and, hence, those of the neutral form (27). Here, focus is directed to determining the conformational-average intrinsic values of the 15N chemical shifts for each of the two tautomeric forms of the imidazole ring of His as representative of the canonical values at high-pH, in other words, a theoretically determined set with which to represent the limiting values of the 15N chemical shift. Hereafter, the computed conformational-average values of the intrinsic 15N chemical shift will be referred to us as DFT-computed limiting 15N values.
The fact, that NMR spectroscopy is a powerful tool with which to investigate the role of histidine residues located in active sites of proteins, is an observation that goes back to pioneering NMR experiments on histidine by Mandel (2), who concluded “…This provides us with a window to observe the active site under various conditions…” Since this prediction was made, several methods are currently used to identify the protonation states of the imidazole ring of His as well as to distinguish between different tautomers (13, 22). Most of these methods rely on the observation of 15N chemical shifts of the imidazole ring of His, although an alternative to the use of 15N chemical shifts has recently been presented (27), namely a new method based on the observed 13Cγ and 13Cδ2 chemical shifts for the imidazole ring of His. Nevertheless, for a quantitative analysis of the tautomeric structure of the imidazole ring of His, use of 15N chemical-shift data is very desirable for a number of reasons, among others because 15N chemical shifts of the imidazole His ring are spread over a wide range and, even more important, are very sensitive to the presence or absence of a directly bonded proton (13). However, such applications require an accurate determination of the canonical limiting value of the chemical shift of each nitrogen in the imidazole ring of His, in each of the pure tautomers (10). Such determination, based on model compounds is not an easy task since the difference between the chemical shifts of the nitrogens of the imidazole ring of His may vary significantly among different solvents (12). In fact, for the neutral imidazole ring of Nδ1-methylimidazole at 25 °C, values of 83, 96 and 104 ppm (8) have been observed in water, chloroform and benzene, respectively. In addition, it is not yet clear which solvent represents the histidine environment better in a protein. Together, such evidence raises the question about the accuracy of the canonical limiting values of the 15N chemical shift obtained from model compounds, e.g., such as those listed by Pelton et al. (13) as commonly used in the literature (24).
Consequently, it is our interest here to explore whether the gas-phase DFT-computed 15N values, from ~35,000 conformations of each tautomeric form of His (27) at high-pH, could provide a standard with which to compare the canonical limiting values of the 15N chemical shift obtained from model compounds (from here on referred to us as: canonical 15N values). The existence of possible differences between DFT-computed and canonical 15N values together with existing experimental evidence will enable us to determine the magnitude of changes of some properties of His residues, such as the tautomeric distribution at high pH, as a function of the adopted 15N limiting values of the chemical shift.
Method
Computation of the 15N shielding of the imidazole ring of His at high-pH
Each form of His is represented as a terminally-blocked model tripeptide with the sequence: Ac-GHξG-NMe (27), with Hξ in the Nδ1-H or the Nε2-H tautomer form, respectively. The analysis of the charged form, H+, was omitted because this form does not exist at high pH. For each tautomer a set of more than ~35,000 conformations, each representing a uniform sampling of the whole Ramachandran map as a function of the ϕ, ψ, ω, χ1 and χ2 torsional angles, was generated. For each of the ~35,000 conformations, the gas-phase, isotropic shielding value of the His residue was calculated with the Gaussian 03 package (30) following the identical procedure that was already used for the computation of the 13Cα shielding data-base (27). Finally, the distribution of the computed shielding for each 15N of the imidazole ring of His was analyzed. For each 15N nucleus, namely, 15Nδ1 and15Nε2, the histogram of the shielding distribution, among all ~35,000 conformations, is shown in Figure 1a–b, together with the average values and standard deviations.
Analysis of the computed 15N shieldings for the tautomeric forms
For each tautomeric form, the average-shielding value, and its standard deviation, for each nitrogen of the imidazole ring of His were computed from the generated ~ 35,000 conformations, and the results are shown in Figures 1a–b. These average values were used to compute the shielding differences between nitrogens of the imidazole ring of His (Δξ values shown in Table 1), which is a common way to analyze 15N chemical shifts (11). Chemical shift differences are equivalent to shielding differences because of cancelation of the reference value, thereby avoiding any source of error in the reference value (21). Consequently, the nitrogen shielding differences, Δξ = ‖σ15Nε2| − |σ15Nδ1‖, with σ denoting the average-shielding and ξ = ε or δ, denoting each of the tautomeric forms of the imidazole ring of His, namely the Nε2-H and Nδ1-H form, respectively.
Table 1.
Δξ = ‖ζ15Nδ1|−|ζ15Nε2‖ with ζ denoting the average-shielding (σ) DFT-computed value, shown in the Figure Caption of Figure 1a–b, or alternatively, the observed chemical shift (δ) for each of the imidazole ring nitrogens (13); and ξ = δ or ε denoting each of the tautomeric forms of the imidazole ring of His, namely, the Nδ1-H and Nε2-H form, respectively.
Using 15N chemical shift limiting values listed by Pelton et al. (13).
From average DFT-computed 15N shielding values (see Figures 1a–b);
From solid-state NMR, at pH 8.5, from Hu et al. (26);
From Farr-Jones et al.(12) for histidine at −55 °C at pH 11.2 in Ethanol;
From Farr-Jones et al.(12) for Nξ-methylhistidine at 25 °C in H2O.
Results
Analysis of the differences between nitrogens in the imidazole ring of His: Δξ
From Table 1, we conclude that the following inequality holds: Δε > Δδ, if DFT-computed data, or data from model compounds or solid-state NMR, are used. On the other hand, the use of standard (12) canonical 15N values (see Table 1) gives the following equality: Δε = Δδ.
We have also computed the second-order difference as ΔΔ = |Δε − Δδ|. In general, use of Δξ values, with ξ = ε or δ, calculated with DFT-computed shieldings agree qualitatively with data from model compounds and solid-state NMR (see Table 1), i.e., all of them predicting ΔΔ = |Δε − Δδ| > 0. In particular, DFT-computed and solid-state NMR-data lead to close quantitative agreement, namely ΔΔ = 11 and 13 ppm, respectively. Since observed isotropic 15N chemical shifts for histidine and imidazole in the solid state do not differ appreciably from those measured in solution (10) the latter result is significant. By contrast, a similar analysis carried out using canonical 15N values (see Table 1), gives ΔΔ = 0.
Overall, the experimental evidence and DFT-based calculations, but not canonical 15N values, predict Δε ≠ Δδ. The consequences of this finding will be discussed in the next sections.
Determining DFT-computed limiting values of 15N chemical shifts in the high-pH limit
A question of central importance, that goes beyond the existence of the differences in terms of Δξ or ΔΔ mentioned above, is related to the relevance of such discrepancies; for example, in the prediction of the tautomeric fractions at high-pH. In order to answer this question, we first need to determine a set of DFT-computed limiting values of 15N chemical shift differences.
The chemical shifts (δ) can be calculated by employing the equation δ = σref – |σsubst|, where σ denotes the shielding of the reference substance (σref) or the substance of interest (σsubst), respectively. Use of this definition enables us to find an effective-reference value (σref) with which all DFT-computed average-shielding values can be converted into chemical shifts and, hence, a set of DFT-computed 15N limiting values of chemical shifts can be determined.
Therefore, in order to compute an effective reference, the following assumption was adopted: the DFT-computed chemical shift for the 15Nε2 nucleus (δ15Nε2), in the Nε2-H tautomeric form of the imidazole ring of His, is assumed to be the canonical limiting value reported for this nucleus (13), namely δ15Nε2 = 167.5 ppm (see Table 2). We choose this chemical shift as a starting point because the 15Nε2 nucleus shows the lowest standard deviation (3 ppm) among all four DFT-computed 15N average shielding values (see Figure Caption of Figure 1a–b). Consequently, the effective reference is σref = 288.5 ppm, i.e., computed as σref = (δ15Nε2 + σ15Nε2), with σ15Nε2 = 121 ppm (see Figure Caption of Figure 1a) representing the DFT-computed average-shielding value for the 15Nε2 nucleus. Hence, the chemical shift of the 15Nδ1 nucleus in the Nδ1-H tautomeric form is δ15Nδ1 = 183.5 ppm; this value was obtained by using the computed average-shielding value (σ15Nδ1 =105 ppm, see Figure Caption of Figure 1b) together with the above determined effective reference. Up to this point, we have been able to determine the chemical shifts for the protonated 15N nucleus of the imidazole ring of His in the Nε2-H and the Nδ1-H tautomeric form, respectively.
Table 2.
Tautomer | Canonical (ppm) |
DFT-Computed (ppm) |
||
---|---|---|---|---|
15Nε2 | 15Nδ1 | 15Nε2 | 15Nδ1 | |
Nδ1-H | 249.5 | 167.5 | 266.5 | 183.5 |
Nε2-H | 167.5 | 249.5 | 167.5 | 261.5 |
The canonical limiting values are those listed by Pelton et al. (13). See the text for details about the derivation of the DFT-computed limiting values.
The chemical shift for the nonprotonated 15N nucleus in each tautomeric form can be computed straightforwardly by using the definition of Δξ together with the constraints given in Table 1, i.e., by using the following relations: Δδ = |δ15Nε2 − 183.5| = 83 ppm, and Δε = |δ15Nδ1 − 167.5| = 94 ppm. As a result, the following chemical-shift values were obtained: δ15Nε2 = 266.5 ppm and δ15Nδ1 = 261.5 ppm, i.e., for the nonprotonated nucleus in the Nδ1-H and Nε2-H tautomeric forms, respectively. Table 2 lists all DFT-computed and the canonical 15N limiting values, for comparison.
Calculation of the tautomeric fraction, at high-pH, from 15N chemical shifts
The significance of the use of the DFT-computed rather than the canonical 15N value is illustrated by calculating the tautomeric fractions for His61 in plastocyanin (24) at high-pH as the average ratio between the tautomeric populations (24), namely as: (Pδ/Pε) = (δε – δ0)/(δ0 – δδ) where Pδ and Pε are the populations of the Nε2-H and Nδ1-H tautomeric form, respectively; δε and δδ are the corresponding limiting values of the 15N chemical shift, and δ0 is the observed 15N chemical shift at high-pH. It is worth noting that, at high-pH, computation of the ratio (Pδ/Pε) is equivalent to determining the tautomeric equilibrium constant KT (22), i.e., with KT = (Nδ1-H tautomer) / (Nε2-H tautomer), which is 0.25 in aqueous solution (11). Use of the canonical 15N value led Hass et al (24) to obtain KT,canonical ~ 0.102, as an average of the 0.096 and 0.108 values obtained for the (Pδ/Pε) ratio computed from the fits of the 15Nε2 and 15Nδ1 titration curves, respectively. Thus, the computed average KT,canonical ~ 0.102 would correspond to a fraction of ~0.91 and ~0.09 for the Nε2-H and Nδ1-H tautomers of His61, respectively. This result was obtained by using chemical shift data, at high pH, from the titration curve of the imidazole ring of His61 (24); namely, by using δ0 observed values of ~241 and ~175 ppm for the 15Nδ1 and 15Nε2 chemical shifts values, respectively.
This set of δ0 observed values (~241 and ~175 ppm) obtained at high-pH can be used to re-compute the average tautomeric equilibrium constant for His61 in plastocyanin by using DFT-computed rather than canonical limiting 15N values (see Table 2). In this case, we obtain KT,DFT ~ 0.265, as an average of the 0.086 and 0.443 values obtained for the (Pδ/Pε) ratio computed by using 15N limiting values for the Nε2-H and Nδ1-H tautomeric form, respectively. Hence, the computed average KT,DFT ~0.265 corresponds to a fraction of ~0.79 and ~0.21 for the Nε2-H and Nδ1-H tautomer, respectively.
Therefore, the commonly-used canonical limiting values of the 15N chemical shifts lead to an average tautomeric equilibrium constant (~0.102) that differs, by a factor of ~2.6, from the average one computed by using DFT-computed 15N limiting values (~0.265). The origin of this difference lies in the fact that DFT-computed 15N limiting values, but not the canonical ones, lead to obtain a very different ratio between the tautomeric populations (Pδ/Pε) depending on the chosen tautomer. In other words, the DFT-computed, but not the canonical ones, limiting 15N values are asymmetrical between tautomers (see Table2). As a result, the difference in the equilibrium constants computed by using canonical and DFT-computed limiting 15N values shows a minimum when only data from the Nε2-H tautomer are used to estimate the (Pδ/Pε) ratio. At this minimum, as shown above, KT,canonical is ~0.096 and KT,DFT is ~0.086; hence, these fractions lead to a close estimation of the population of the Nε2-H and Nδ1-H tautomer in solution, namely ~0.92 and 0.08, for the Nε2-H and Nδ1-H tautomer, respectively.
Overall, considering that (i) the 15Nε2 nucleus shows the lowest standard deviation among all DFT-computed 15N average shielding values (see Figure Caption of Figure 1a–b), (ii) the second-order differences values, ΔΔ, from the DFT-computed shielding, but not from the commonly-used canonical limiting values, are in closer agreement with those obtained with the experimental chemical shift data from model compounds in solution and solid-state NMR; and (iii) the tautomeric predictions computed using canonical and DFT-computed 15N values for the 15Nε2, but not for the 15Nδ1, nucleus are in close agreement, it is recommended that the equilibrium constant, KT, computed by using the 15N limiting values for only the Nε2-H tautomer be reported. In other words, this option would help to reduce inaccuracies associated with the existing uncertainty in the set of limiting 15N values (22). As to whether this conclusion depends on the Nδ1-H tautomeric fraction, remains to be proved.
Conclusions
Overall, our results suggest that a considerable difference for the average tautomeric equilibrium constant, KT, can be obtained if DFT-computed 15N limiting values rather than canonical values are used. Furthermore, since the DFT-computed, but not the canonical, second-order shielding differences, ΔΔ, are in agreement with those obtained with experimental chemical shift data from model compounds in solution and solid-state NMR, our results also raise concerns about the magnitude of the uncertainty associated with the usual predictions of the tautomeric fractions and, hence, their impact in studies of the mechanism of action of enzymes in which His dynamics may play a central role. To minimize the magnitude of such uncertainty, it is recommended here that the equilibrium constant, and then the tautomeric fractions, computed by using only limiting 15N canonical or DFT-computed values for the Nε2-H tautomer be reported.
Acknowledgment
We thank all spectroscopists and crystallographers who have deposited their coordinates at the PDB, and chemical shifts at the BMRB; without their effort, this work would not be possible. This research was supported by grants from the National Institutes of Health (GM-14312 and GM-24893), and the National Science Foundation (MCB05-41633). Support was also received from CONICET and from the UNSL (P-328402), Argentina. The research was conducted by using the resources of Pople and Blacklight, facilities of the National Science Foundation Terascale Computing System at the Pittsburgh Supercomputer Center.
Footnotes
Part of the “Harold A. Scheraga Festschrift”
References
- 1.Arnold JT, Dharmatti SS, Packard ME. J. Chem. Phys. 1951;19:507. [Google Scholar]
- 2.Mandel MJ. Biol. Chem. 1965;240:1586–1592. [PubMed] [Google Scholar]
- 3.Bradbury JH, Scheraga HA. J. Am. Chem. Soc. 1966;88:4240–4246. [Google Scholar]
- 4.Meadows DH, Jardetzky O, Epand RM, Ruterjans HH, Scheraga HA. Proc. Natl. Acad. Sci. USA. 1968;60:766–772. doi: 10.1073/pnas.60.3.766. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Reynolds WF, Peat IR, Freedman MH, Lyerla JR., Jr J. Am. Chem. Soc. 1973;95:328–331. doi: 10.1021/ja00783a006. [DOI] [PubMed] [Google Scholar]
- 6.Markley JL. Acc. Chem. Res. 1974;8:70–80. [Google Scholar]
- 7.Wüthrich K. NMR in Biological Research: Peptides and Proteins. North-Holland: Amsterdam; 1976. [Google Scholar]
- 8.Schuster II, Roberts JD. J. Org. Chem. 1979;44:3864–3867. [Google Scholar]
- 9.Harbison G, Herzfeld J, Griffin RGJ. J. Am. Chem. Soc. 1981;103:4752–4754. [Google Scholar]
- 10.Munowitz M, Bachovchin WW, Herzeld J, Dobson CM, Griffin RG. J. Am. Chem. Soc. 1982;104:1192–1196. [Google Scholar]
- 11.Bachovchin WW. Biochem. 1986;25:7751–7759. doi: 10.1021/bi00371a070. [DOI] [PubMed] [Google Scholar]
- 12.Farr-Jones S, Wong WYL, Gutheil WG, Bachovchin WW. J. Am. Chem. Soc. 1993;115:6813–6819. [Google Scholar]
- 13.Pelton JG, Torchia DA, Meadow ND, Roseman S. Prot. Sci. 1993;2:543–558. doi: 10.1002/pro.5560020406. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Steiner T. Acta Cryst. 1996;C52:2554–2556. [Google Scholar]
- 15.Steiner T, Koellner G. Chem. Commun. 1997;13:1207–1208. [Google Scholar]
- 16.Shimba N, Takahashi H, Sakakura M, Fuji I, Shimada I. J. Am. Chem. Soc. 1998;120:10988–10989. [Google Scholar]
- 17.Sudmeier JL, Bradshaw EM, Haddad KEC, Day RM, Thalhauser CJ, Bullock PA, Bachovchin WW. J. Am. Chem. Soc. 2003;125:8430–8431. doi: 10.1021/ja034072c. [DOI] [PubMed] [Google Scholar]
- 18.Shimba N, Serber Z, Ledwidge R, Miller SM, Craik CS, Dotsch V. Biochem. 2003;42:9227–9234. doi: 10.1021/bi0344679. [DOI] [PubMed] [Google Scholar]
- 19.Strohmeier M, Stueber D, Grant DM. J. Phys. Chem. A. 2003;107:7629–7642. doi: 10.1021/jp0350114. [DOI] [PubMed] [Google Scholar]
- 20.Chen X, Zhan CG. THEOCHEM. 2004;682:73–82. [Google Scholar]
- 21.Cheng F, Sun H, Zhang Y, Mukkamala D, Oldfield E. J. Am. Chem. Soc. 2005;127:12544–12554. doi: 10.1021/ja051528c. [DOI] [PubMed] [Google Scholar]
- 22.Shimahara H, Yoshida T, Shibata Y, Shimizu M, Kyogoku Y, Sakiyama F, Kakazawa T, Tate S-i, Kato T, Moriyama H, et al. J. Biol. Chem. 2007;282:9646–9656. doi: 10.1074/jbc.M609679200. [DOI] [PubMed] [Google Scholar]
- 23.Jensen MR, Has MAS, Hansen DF, Led JJ. Cell Mol. Life Sci. 2007;64:1085–1104. doi: 10.1007/s00018-007-6447-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Hass MAS, Hansen DF, Christensen HEM, Led JJ, Kay LE. J. Am. Chem. Soc. 2008;130:8460–8470. doi: 10.1021/ja801330h. [DOI] [PubMed] [Google Scholar]
- 25.Hass MAS, Yilmaz A, Christensen HEM, Led JJ. J. Biomol. NMR. 2009;44:225–233. doi: 10.1007/s10858-009-9332-0. [DOI] [PubMed] [Google Scholar]
- 26.Hu F, Wenbin L, Hong M. Science. 2010;330:505–508. doi: 10.1126/science.1191714. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Vila JA, Arnautova YA, Vorobjev Y, Scheraga HA. Proc. Natl. Acad. Sci. USA. 2011;108:5602–5607. doi: 10.1073/pnas.1102373108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Demchuk E, Wade RC. J. Phys. Chem. 1996;100:17373–17387. [Google Scholar]
- 29.Vila JA, Scheraga HA. Acc. Chem. Res. 2009;42:1545–1553. doi: 10.1021/ar900068s. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Frisch MJ, Trucks GW, Schlegel HB, Scuseria GE, Robb MA, Cheeseman JR, Zakrzewski VG, Montgomery JA, Stratmann RE, Burant JC, et al. Gaussian 03, Revision E.01. Wallingford CT: Gaussian Inc; 2004. [Google Scholar]