Unusual 1H NMR chemical shifts support (His) Cepsilon 1---H{middle dot}{middle dot}{middle dot}O==C H-bond: Proposal for reaction-driven ring flip mechanism in serine protease catalysis -- Data Supplement - Supplemental Data -- Proceedings of the National Academy of Sciences

Supplementary material for Ash et al. (2000) Proc. Natl. Acad. Sci. USA 97 (19), 10371-10376.

Results and Discussion
Histidine C^e1-H Proton Linewidths.
Fig. 1 A and C in our main published paper clearly shows the often-observed greater linewidth of the C^e1-H proton resonance compared to that of the C^d2-H resonance. This phenomenon has sometimes prevented detection of the C^e1-H proton in 2D and 3D experiments (1). It has been found that H-bonding contributes to increased ¹H^N CSA (2). Whereas analogous ¹H^C CSA contributions for His ¹³C-H-donated H-bonding protons cannot be ruled out, the factor that dominates C-H proton linewidths is fast-exchange imidazole acid-base reactions (3). Because fast-exchange broadening increases as the square of spectrometer frequency, its relative contribution to overall linewidths increases dramatically with the use of increasingly higher magnetic fields. The greater linewidth of the C^e1-H proton over that of the C^d2-H proton under most circumstances is the result of the normally greater chemical shift difference upon protonation of the former, manifested through fast-exchange broadening.

In a study of ¹³C linewidth vs. pH for a sample of 90% C^e1-enriched a-lytic protease at 33ºC, rate and equilibrium constants were determined (4) for the protonation of imidazole (Im).

Im + H₃O⁺ = ImH⁺ + H₂O, [1]

where the forward and backward reactions are described by rate constants k_on^H+ and k_off^H+, respectively. A recently refined analysis of these data yielded values of k_off^H+ = 3.4 ´ 10³ s^-1 and a pK_a value of 6.87. Using the following equality, we get a value for k_on^H+:

k
_on^H+ = k_off^H+/K_a = 3.4 ´ 10³/10^-6.87

= 2.5 ´ 10¹⁰ M^-1s^-1. [2]

Such a diffusion-controlled proton "on" rate underscores the free access of solvent to the active site (5). Assuming the continued absence of buffers and dominance of k_off^H+ over k_off^OH-, we can calculate DW, the extra linewidth due to fast exchange as a function of Dn, the "kinetic window" or chemical shift change in Hz upon protonation at various magnetic fields, p_a, the fraction of the protonated or imidazolium species at various pH values, and K_a as follows (3):

W = 4p (Dn)²p_a(1 - p_a)²/(K_ak_on^H+). [3]

Fig. 4 A-C shows the calculated linewidth of the His C^e1-H proton of true a-lytic protease during a pH titration at various spectrometer fields, corresponding to 250, 500, and 360 MHz, respectively, assuming a "natural" linewidth of 25 Hz. The increased linewidths are evident, particularly in Fig. 4B at 500 MHz, which is similar to observed data points in Fig. 2, explaining the disappearance of the C^e1-H proton resonance in Fig. 2B and its reappearance (most downfield peak) in Fig. 2C. Fig. 4 C-E shows the calculated differences in linewidth vs. pH of true, "fresh," and "aged" a-lytic protease, all at 360 MHz, a spectrometer frequency employed for many of these early studies. The inactive forms are inherently easier to detect than the true form simply because of smaller Dn, the "kinetic window," and smaller pK_a values of the former. Though all forms, active and inactive, exhibit sharp lines at the high and low pH extremes, there could be a tendency to dismiss as spurious or impurity peaks any lines that disappear for a portion of the pH range.

Contribution to Histidine C^e ¹-H Proton Chemical Shifts of Neighboring Peptide Group Anisotropic Shielding.
Three models that have been applied to calculation of both C^a-H and amide N-H protons shielding in proteins are those of Ösapay & Case (6), Herranz et al. (7), and Williamson & Asakura (8) [also Asakura & Williamson (9)]. All carbonyl or peptide group anisotropic shielding models employ the McConnell equation (10), a summation of terms of the form s = (D c /3 r³)(1 - 3cos²q), where s is the shielding in ppm, r is the distance in Å between the calculated nucleus and the origin of the shielding cone, q is the angle between the nucleus and the cone axis, and D c is a difference between a pair of the principal axis magnetic susceptibilities. The coordinate system and nomenclature presented here are closest to those described in Williamson & Asakura (8), as shown in Fig. 5, employing polar angles q_X, q_Y, and q_Z and the general equation for each cone:

= (1/3 r³) [A(1 - 3cos²q_Y) + B(1 - 3cos²q_X)

+ C(1 - 3cos²q_Z)] [4]

If the shielding cone is axially symmetric, only one of the three terms in Eq. 4 is required, otherwise any two (given two of the coefficients A, B, and C the third can be obtained by difference), with contributions from more than one shielding cone being summed. Table 3 lists the values of constants A, B, and C, along with the coordinates used herein for origins of the various shielding cones, which distinguish the models under consideration.

For all 29 serine hydrolases in the database of Derewenda et al. (12), the C^e1-H proton lies within or very close to the amide group O-C'-N plane, and in close proximity to the lone electron pair anti to the N atom. In the Ösapay & Case model (6), a single, axially symmetric shielding cone lies along the y axis, with its origin on the O-C'-N angle bisector at a distance of 0.7 Å from the C' atom (Table 3). As shown in Table 4, the Ösapay and Case model was derived from 17 protein structures, fitting over 5,000 carbon-bound protons with an R value of 0.88, but correlating poorly (R = 0.58) some 1,600 amide N-H protons.

Subsequent models have attempted to make better fits to both C-H and amide N-H protons. However, amide N-H proton chemical shifts are known to be dominated by H-bonding, and by ignoring completely any peptide anisotropic shielding contribution, various researchers have obtained remarkably good correlations of chemical shift vs. various functions of r_OH, the O^...H distance [Wagner et al. (15), using r_OH^-3; Wishart et al. (16), using r_OH^-1; and Tjandra & Bax (17), using 1.3 - r_OH^-2]. Thus, the fact that the Ösapay & Case (6) or any other model fits N-H protons poorly may be more justification than criticism of the model. Instead of lumping anisotropic shielding and H-bonding together in the same treatment, we would argue that it is more useful to try to identify separately the most significant structural element, i.e. H-bonding, distinguished by its associated molecular forces. "Electric field" contributions to chemical shift [Buckingham (18)], included in some modeling efforts, yield an effect analogous to H-bonding, as will be shown later.

The Herranz model (7) employs two axially symmetric shielding cones, the first with its axis lying along the y axis "in the coordinate center of the OCN atoms" (which we interpret to mean the centroid or average of all coordinates), and the second lying along the z axis, with its origin 70% of the interval from the C' toward the O atom. The purpose of the second shielding cone is to improve the fit to amide N-H protons. With regard to this cone, Herranz et al. state "it gives rise to the largest proton shift values for hydrogen bond geometries." It does indeed fit some 239 N-H protons fairly well (R = 0.76), with somewhat different coefficients, A and C. The fit to 265 C^a -H protons, however, is sacrificed compared to Ösapay & Case (6) with an R value of 0.79 (Table 4).

The model of Williamson & Asakura (8) employs two non-axially symmetric shielding cones, the first along the C'-O axis centered at 1.1 Å from C'. The second cone lies along the C'-N axis, and for this term we reoriented the molecule in Fig. 5 with the C-N bond along the z axis. This model, employing more adjustable parameters, typically fits the somewhat smaller collection of C^a -H protons about as well as the Ösapay & Case (1991) model (Table 4), but Asakura & Williamson (9) were able to fit some 1,500 N-H protons to a degree only intermediate between Ösapay & Case (6) and Herranz (7) (Table 4).

Correlation of observed NMR chemical shifts of His C^e1-H protons to those calculated with the anisotropic shielding models from structural data is hampered, as shown in Table 5, by the fact that we have corresponding NMR and structural data for only six enzymes. These structures were obtained from Derewenda et al.'s (12) database of 29 protein x-ray crystal structures (named in our legend, Fig. 6), augmented by no. 30, chymotrypsinogenA [Wang et al.(24); PDB:2CGA] and no. 31, trypsinogen [Kossiakoff et al.(25); PDB:1TGN]. The anisotropic shielding contributions were calculated using the three models under discussion (Table 5), ranging from -0.05 to -0.44, and no correlation vs. observed shift was obtained with any model. Possible factors for the lack of correlation include: (i) the fact that five of the six chemical shifts arise from different enzymes, which may have intrinsically different starting or baseline values; (ii) insufficiently precise x-ray coordinates; and (iii) chemical shift contributions due to H-bonding.

The three anisotropic shielding models under consideration were examined for self-consistency by plotting their calculated shielding values vs. various functions of r_OH, to determine the order of magnitude of C^e ¹-H proton anisotropic shielding by the single neighboring peptide group identified as its H-bonding acceptor by Derewenda et al. (12). The calculations were carried out using the distances and angles supplied by Derewenda et al. (12). The results of the regression analysis are summarized in Table 5 and depicted in Fig. 6. Fig. 6A shows a plot of calculated anisotropic shielding of the 31 structures in our combined database vs r_OH^-1 using the model of Ösapay & Case (6). The fit is excellent, yielding an R value of 0.88 and standard error of 0.01 (Table 5). The magnitudes of the calculated shieldings are small, less than 0.15 ppm in absolute value. Fig. 6 B and C shows comparable plot for the Herranz et al. (7) and Williamson & Asakura (8) models, differing in that they are plotted vs. r_OH^-3, which gave better correlations (Table 5). Although predicting larger deshielding (the Herranz model as much as -0.47 ppm), none of these models fits the structural data as well as the Ösapay and Case model. The Herranz model gives an overall R value of 0.77, with standard error of 0.07 (Table 5). Recall that the second shielding term arising from the cone that lies along the C-O bond was intended to help account for H-bonded protons. Table 5 shows how poorly this C-O term serves the model, making a substantial contribution that does not fit the data (R = 0.46 for the C-O term). The Williamson & Asakura (8) model gives an overall R value of 0.75 and standard error of 0.07, similar to the Herranz model both in shielding magnitude (as large as -0.42) and the quality of the correlation. The shielding cone along the C-N bond in the Williamson model fits the data better (R = 0.86) than the one along the C-O bond (R = 0.72), but generally contributes less.

The anisotropic shielding calculations shown in Table 5 and Fig. 6 are not corrected for ring currents because some structures [no. 18, no. 19, and no. 22 from Derewenda et al. (12)] are not available from the Protein Data Bank. However, from a subset of the 31 proteins considered in this paper, namely the six for which NMR data exist for His C^e ¹-H protons, we calculated the contributions of all surrounding aromatic groups using software available from both Ösapay & Case (6) and Williamson & Asakura (8). The data shown in Table 6 include all His residues, with the active site His residues in boldface type. The ring current contributions are generally of small absolute magnitude (<0.1), and there is no indication of any systematic deshielding (negative s ) trend upon the His C^e ¹-H protons.

Table 6 also shows the magnitudes of the "electric field" term [Buckingham (18)] included in some chemical shift models. Ösapay & Case (6) and Williamson & Asakura (8) found that inclusion of the electric field term yielded a modest but systematic improvement in the quality of their data fitting. However, the model of Herranz et al. (7) ignored the electric field term, as did subsequent work on N-H proton shifts by Asakura et al. (9). As shown in Table 6, electric field contributions as large in magnitude as -0.25 ppm may be expected. Described as a "polarization of the C-H bond by the electric field of neighboring atoms" (8), this term seems to us barely, if at all, distinguishable from H-bonding. If the electric field term is indeed synonymous with H-bonding, its magnitude has been underestimated in all models, and by a significant factor in the model of Ösapay & Case (6) favored by our analysis.

1. Sudmeier, J. L., Ash, E. L., Gunther, U. L., Luo, X., Bullock, P. A. & Bachovchin, W. W. (1996) J. Magn. Reson. B 113, 236–247.

2. Tjandra, N. & Bax, A. (1997) J. Am. Chem. Soc. 119, 9576-9577.

3. Sudmeier, J. L., Evelhoch, J. L. & Jonsson, N. B. H. (1980) J. Magn. Reson. 40, 377-390.

4. Bachovchin, W. W. & Switzman, S. (1983) Spectrosc. Int. J. 2, 219-226.

5. Ash, E. L., Sudmeier, J. L., De Fabo, E. C. & Bachovchin, W. W. (1997) Science 278, 1128-1132.

6. Ösapay, K. & Case, D. A. (1991) J. Am. Chem. Soc. 113, 9436-9444.

7. Herranz, J., Gonzalez, C., Rico, M., Nieto, J. L., Santoro, J., Jimenez, M. A., Bruix, M., Neira, J. L. & Blanco, F. J. (1992) Magn. Reson. Chem. 30, 1012-1018.

8. Williamson, M. P. & Asakura, T. (1993) J. Magn. Reson. B 101, 63-71.

9. Asakura, T., Taoka, K., Demura, M. & Williamson, M. P. (1995) J. Biomol. NMR 6, 227-236.

10. McConnell, H. M. (1957) J. Chem. Phys. 27, 226-229.

11. Dickerson, R. E. & Geis, I. (1969) The Structure and Action of Proteins (W. A. Benjamin, Menlo Park, CA), p. 13.

12. Derewenda, Z. S., Derewenda, U. & Kobos, P. M. (1994) J. Mol. Biol. 241, 83-93.

13. Haigh, C. W. & Mallion, R. B. (1980) Progr. NMR Spectr. 13, 303-344.

14. Johnson, C. E. & Bovey, F. A. (1958) J. Chem. Phys. 29, 1012-1014.

15. Wagner, G., Pardi, A. & Wuthrich, K. (1983) J. Am. Chem. Soc. 105, 5948-5949.

16. Wishart, D. S., Sykes, B. D. & Richards, F. M. (1991) J. Mol. Biol. 222, 311-333.

17. Tjandra, N. & Bax, A. (1997) J. Am. Chem. Soc. 119, 8076-8082.

18. Buckingham, A. D. (1960) Can. J. Chem. 38, 300-307.

19. Bernstein, F. C., Koetzle, T. F., Williams, G. J., Meyer, E. E., Jr., Brice, M. D., Rodgers, J. R., Kennard, O., Shimanouchi, T. & Tasumi, M. (1977) J. Mol. Biol. 112, 535-542.

20. Abola, E. E., Bernstein, F. C., Bryant, S. H., Koetzle, T. F. & Weng, J. (1987) in Crystallographic Databases: Information Content, Software Systems, Scientific Applications, eds. Allen, F. H., Bergerhoff, G. & Sievers, R. (Data Commission of the International Union of Crystallography, Bonn), pp. 107-132.

21. Bao, D., Huskey, W. P., Kettner, C. A. & Jordan, F. (1999) J. Am. Chem. Soc. 121, 4684-4689.

22. Markley, J. L. & Ibanez, I. B. (1978) Biochemistry 17, 4627-4640.

23. Porubcan, M. A., Neves, D. E., Rausch, S. K. & Markley, J. L. (1978) Biochemistry 17, 4640-4647.

24. Wang, D., Bode, W. & Huber, R. (1985) J. Mol. Biol. 185, 595-624.

25. Kossiakoff, A. A., Chambers, J. L., Kay, L. M. & Stroud, R. M. (1977) Biochemistry 16, 654-664.

26. Case, D. A. (1995) J. Biomol. NMR 6, 341-346.

27. Markley, J. L., Neves, D. E., Westler, W. M., Ibanez, I. B., Porubcan, M. A. & Baillargeon, M. W. (1980) Front. Protein Chem. 10, 31-61.