Skip to main content
ACS AuthorChoice logoLink to ACS AuthorChoice
. 2021 Oct 14;125(42):11606–11616. doi: 10.1021/acs.jpcb.1c05256

Catalytic Fields as a Tool to Analyze Enzyme Reaction Mechanism Variants and Reaction Steps

Paweł Kędzierski 1, Martyna Moskal 1, W Andrzej Sokalski 1,*
PMCID: PMC8558854  PMID: 34648705

Abstract

graphic file with name jp1c05256_0008.jpg

Catalytic fields representing the topology of the optimal molecular environment charge distribution that reduces the activation barrier have been used to examine alternative reaction variants and to determine the role of conserved catalytic residues for two consecutive reactions catalyzed by the same enzyme. Until now, most experimental and conventional top-down theoretical studies employing QM/MM or ONIOM methods have focused on the role of enzyme electric fields acting on broken bonds of reactants. In contrast, our bottom-up approach dealing with a small reactant and transition-state model allows the analysis of the opposite effects: how the catalytic field resulting from the charge redistribution during the enzyme reaction acts on conserved amino acid residues and contributes to the reduction of the activation barrier. This approach has been applied to the family of histidyl tRNA synthetases involved in the translation of the genetic code into the protein amino acid sequence. Activation energy changes related to conserved charged amino acid residues for 12 histidyl tRNA synthetases from different biological species allowed to compare on equal footing the catalytic residues involved in ATP aminoacylation and tRNA charging reactions and to analyze different reaction mechanisms proposed in the literature. A scan of the library of atomic multipoles for amino acid side-chain rotamers within the catalytic field pointed out the change in the Glu83 conformation as the critical catalytic effect, providing, at low computational cost, insight into the electrostatic preorganization of the enzyme catalytic site at a level of detail that has not yet been accessible in conventional experimental or theoretical methods. This opens the way for rational reverse biocatalyst design at a very limited computational cost without resorting to empirical methods.

Introduction

Detailed knowledge of enzyme reaction mechanisms, the role of amino acid residues essential for catalytic activity, and the structure of corresponding transition states at atomic resolution could provide invaluable information on the process of the rational catalyst1,2 or drug3 design, as many enzyme inhibitors are transition-state analogs. However, the available experimental structural data for enzymes typically determined in X-ray diffraction studies are most frequently of low resolution and lack accurate hydrogen atom positions.4 They are usually the starting point for tedious trial-and-error searches to locate stationary points on the vast multidimensional energy hypersurface using hybrid quantum mechanics/molecular mechanics (QM/MM)5 or ONIOM (our own n-layered integrated molecular orbital and molecular mechanics)6 techniques.

In this Article, we present an alternative way to analyze enzyme reaction variants based on perturbational intermolecular interaction theory, allowing the partitioning of activation energy changes into well-defined physical components, the analytical representation of which could be the subject of gradual approximations leading to simpler nonempirical models applicable to large molecular systems. Our simple approach could be applied to the preliminary screening of possible mechanisms and the evaluation of the role of charged residues in catalysis in the absence of experimental kinetics data for enzyme mutants. In the case in which such data were available, more comprehensive methods integrating structural and kinetic data could be applied.7

Warshel postulated the dominant role of electrostatic interactions in enzyme catalysis based on the empirical valence bond approach,8 which was later confirmed by various nonempirical methods. For example, the decomposition of the interaction energies using the hybrid variation-perturbation theory illustrate that short-range nonelectrostatic components could cancel each other to a significant degree and that the electrostatic contribution to the lowering of the activation barrier correlates well (R = 0.85) with rigorous MP2 results for chorismate mutase residues.9 (See Table 1 in ref (9).) This results from the short-range nature of other major components of the interaction energy, like the exchange decaying exponentially with distance and their canceling each other to a significant degree, leaving the electrostatic energy as the dominant and only additive term, which allows partitioning of the entire system into individual residues. The electrostatic nature of the catalytic activity in various enzymes was independently confirmed later by a nonempirical analysis of electric fields obtained from proton quantum dynamics,10 transition path sampling,11 full quantum-mechanical electric field,12 and electron density13 calculations and QM/MM results for proton-coupled electron transfer reactions.14 The dominant role of electric fields in catalysis has been summarized in recent reviews.15,16 The electrostatic nature of catalytic activity has been recently confirmed by experimental17 and advanced theoretical studies,1014,1820 demonstrating how external electric fields generated by enzyme residues act on bonds broken in reactants. However, in this work, we study the inverse effect, that is, how the catalytic field resulting from charge redistribution during an enzyme reaction would maximize interactions with specific amino acid residues. This allows us to determine, at low computational cost, the optimal amino acid side-chain conformations exerting extreme catalytic activity, constituting a preorganized environment. To evaluate various alternative reaction mechanisms, we employ in this study catalytic fields (CFs) defining the charge distribution of the optimal catalytic environment.9,21 The CFs derived from the corresponding transition-state and substrate wave functions have been used here to locate amino acid residues exerting the highest catalytic activity for both enzyme reaction steps and all proposed alternative mechanism variants in histidyl-tRNA synthetase (HisRS).22,23 HisRS has been selected on purpose because it represents one of the oldest enzyme families with little similarity among various biological species, except for a set of conserved charged residues, which will be matched here with different CF variants.

Every amino acid tRNA (transfer ribonucleic acid) synthetase (aaRS) catalyzes two consecutive reactions: the aminoacylation of adenosine triphosphate (ATP) and then the transfer of the residue from amino acid-AMP (adenosine monophosphate) to tRNA. Once the most probable mechanism is determined for each half-reaction, it would be possible to analyze the specific roles of conserved residues for each step, or, vice versa, the observed conservative catalytic residues could be used to determine the most probable reaction mechanism. Aminoacyl tRNA synthetases, which are involved in the translation of a DNA nucleotide to a protein amino acid sequence, are divided into two classes possibly originating from different ancestors.24 Because of their evolutional age and despite catalyzing essentially the same reaction, aaRSs originating from different organisms possess little sequence similarity, even within each of more than 20 aaRS types, like HisRS. Therefore, the topologies of conserved residues could be regarded as their only common denominator, and they constitute an excellent case to confront this with CFs corresponding to various alternative reaction mechanisms catalyzed by aaRSs. Because of the almost one order of magnitude stronger interactions of charged molecules versus neutral molecule, only conserved charged residues have been considered in this study. To keep our model general for the entire HisRS family, we did not consider the nonconserved residues in this work.

The first goal of this study is to determine the spatial distribution of conserved charged amino acid residues in class II histidyl tRNA synthetases, representing a wide set of 12 different biological species. The topology of such residues will be confronted with CFs related to both reaction stages 1 to 2 and alternative reaction mechanisms discussed in the literature.22,23 Catalytic fields for each reaction variant involving HisRS will be generated to examine activation barrier changes resulting from the presence of conserved charged residues. This would help to point out the most probable mechanisms among several proposed mechanisms. Another goal is to compare the CFs for both reactions, as shown in Figure 1, and proceeding in the HisRS active site to obtain more general knowledge of the catalytic role of conserved residues for each separate reaction step. In addition, the application of the MULTISCAN procedure21 will deliver optimal conformations of charged amino acid side chains interacting with each other, permitting the inspection of features of the electrostatic active-site preorganization that are otherwise not easily available from conventional experimental or theoretical techniques.

Figure 1.

Figure 1

Partial sequence comparison based on the structure alignment of HisRSs originating from 12 different biological organisms. Only the neighborhoods of conservative charged residues and known class II aaRS motifs are presented for brevity. Hidden parts are marked with vertical blue lines.

Methods

The catalytic activity could be expressed as the difference in the transition state (TS) and substrate (RS) interaction energies, ΔE, with the catalytic environment (C) constituting the differential transition state stabilization (DTSS) energy approximating the lowering of the activation barrier.9,21

graphic file with name jp1c05256_m001.jpg 1

The final part of eq 1 indicate, that in the extreme approximation, the DTSS contributions could be estimated with point charges representing, for example, formal charges of amino acid side chains interacting with the differential electrostatic potential calculated from the quantum models of the reactants and the transition state, represented by ΔS. However, in this work, we rely on a much more precise approximation of the electrostatic term of interaction energies, including all side-chain atoms represented by atomic multipole moments up to the hexadecapole using eq 2. A detailed analysis of all of the interaction energy constituents for enzymes9,21 indicated the dominant role of electrostatic term, EEL, which can be further approximated by cumulative atomic multipole moment (CAMM) expansion25

graphic file with name jp1c05256_m002.jpg 2

where Inline graphicand Inline graphicare α and β components of atom-centered multipole tensors of rank ka and kb for interacting molecules A and B, respectively, and Inline graphic is the αβ element of the Cartesian interaction tensor containing the partial derivatives of |Rab|–1 of rank ka + kb.

The DTSS could be alternatively estimated (eq 1) as the sum of the products of catalyst atomic charges, qi, and the difference in the molecular electrostatic potentials of the transition state, VTS, and the substrate, VRS. The static CF Δs (eq 3) corresponds to the lowering of the activation barrier by a unit point charge +1 located at any point i.

graphic file with name jp1c05256_m006.jpg 3

The highest catalytic activity is achieved when charged elements of the molecular environment coincide with extreme −ΔS values. Because of the additive nature of electrostatic interactions, −ΔS represents an inverse solution to the optimal catalyst problem. This constitutes a bottom-up approach to catalyst design based on the charge redistribution of reactants at a given reaction step.21 It is fundamentally different from conventional top-down methods requiring consideration of the entire protein and numerous assumptions related to the QM/MM boundaries, the selection of protonation states, and the use of empirical force fields.

The activation energy decrease resulting from the presence of specific amino acid residues could be estimated from the eq 1, where molecular electrostatic potentials, V, could be obtained either as expectation values from corresponding quantum-chemical wave functions or from a much more compact CAMM representation.25 Such distributed moments constitute a natural extension of Mulliken’s population analysis, and the inclusion of higher moments reduces the basis set dependency of atomic charges25 and considerably improves the description of the significant anisotropy of charge distribution at the atomic level for reacting systems, where bonds are broken or formed. A more extensive review of the multipolar electrostatics, the errors associated with the use of atomic charges, and its use in force fields can be found in the Popelier review.26 In the case of CAMM expansion (eq 2) truncated at the R–5 term, the typical error at the equilibrium distance amounts to 5%. It has to be mentioned that there are several nonempirical force-field techniques based on atomic multipole expansion like AMOEBA27 or sum of interactions between fragments ab initio (SIBFA).28

The use of CAMM atomic multipole expansion was already sufficient to reproduce relative catalytic activities in a series of ketosteroid isomerase29 and Kemp eliminase21 mutants. Because of the known flexibility of charged amino acid side chains upon ligand binding,3032 corresponding rotamers33,34 have been scanned for all conserved residues considering the mutual interactions of charged side chains within the MULTISCAN procedure, previously described in detail.21 This allowed us to determine the optimal catalytic amino acid side-chain orientation in the respective CF, which, until recently, has escaped experimental or theoretical observation in conventional studies.21,32,35

The calculations included the following steps:

(1) Reaction models were adapted from coordinates published in ref (22) for the aminoacylation step and from ref (23) for the His-tRNA charging step. The aminoacylation reactants and transition state (TS0) were originally optimized using the hybrid ONIOM HF/6-31G**:PM3 theory level.22 This model was stripped down to the essential subset of atoms, as presented in thicker representation in Figure 3. Hydrogen atoms were added in place of cut bonds using PyMOL (The PyMOL Molecular Graphics System, version 1.7, Schrödinger). PyMOL was also used to dock the models in the enzyme active site, employing the superposition of their corresponding non-hydrogen atoms onto their counterparts in the His-AMP ligand in the 1KMM Protein Data Bank (PDB) structure. The CAMM electrostatic charge distribution for this step was evaluated for up to rank 4 moments (hexadecapoles) at the HF/6-31G* (5d) level using the GAMESS program (2010-10-01 release or any later version) using the option “$ELMOM IAMM=4 CUM=.T. $END”, generating CAMMs up to hexadecapole. The models of reactants and transition states for the second step of charging the tRNA with the histidyl moiety were originally optimized in the gas phase at the B3LYP/6-31G** level,23 including five amino acid residues (Glu83, Arg113, Gln127, Arg259, and Glu270) from the active site. In this work, these models were stripped down to the essential part of the reactants, as shown by thicker representations in Figure 4 for the transition states, capped with hydrogen atoms, and docked with PyMOL, as described above. CAMM was evaluated for them at the HF/6-31G** (5d) level up to hexadecapole moments.

Figure 3.

Figure 3

Transition state TS0 for the ATP aminoacylation reaction mechanism (Scheme 1a), as proposed by Banik and Nandi,22 where the thicker representation corresponds to the smaller model used to obtain the corresponding catalytic field ΔS.

Figure 4.

Figure 4

Transition states TS1, TS2, TS3, and TS4 for the His-AMP tRNA charging reaction mechanisms (Scheme 1b), as proposed by Liu and Gauld,23 where the thicker lines correspond to the smaller model used here to obtain the corresponding catalytic field ΔS.

(2) The Python MULTISCAN procedure21 calculating multipole electrostatic interaction energies between reactants and amino acid residues, ΔEEL,MTP, according to eq 2 was used to automatically scan all possible orientations of side chains for the selected set of nearby located conserved charged amino acids in the CF in the search of the lowest DTSS, that is, the minimal activation energy for the considered amino acid set. The CAMM approximation of the interaction energies was exponent-truncated at rank 4.

(3) Atomic multipole values for the amino acid side-chain configuration were interpolated using the CAMM database of amino acid side-chain rotamers calculated at the HF/6-31G* level, available at http://camm.pwr.edu.pl/CAMM.34

Results and Discussion

Multiple Sequence Alignment of the HisRS Family

The crucial long-range electrostatic interactions of HisRS reactants are with charged amino acids. To determine the location of such conserved HisRS charged residues, we have performed multistructure alignment using 3DCOMB36 for 12 histidyl tRNA synthetases from different biological species (see the Supporting Information), for which sufficiently accurate and complete structures were available in the PDB. Among multiple PDB structures of the same protein, only those with the best resolution and a docked ligand with the phosphate group (preferably transition state analog) have been considered, unless a ligand-less structure was the only one available.

To obtain a more general picture valid for the entire family of HisRSs, our study is limited to residues where the charge is conserved in 100% of HisRSs. The results illustrated in Figure 1 indicate nine charged residues conserved in all considered HisRSs, that is, Asp13, Glu47, Glu83, Arg113, Glu131, Asp141, Arg259, Glu270, and Arg287. (The numbering corresponds to PDB 1KMN and 1KMM structures.) This extends the previous analysis,37 where only three bacteria species were considered. More details are presented in the Supporting Information. Figure 2 illustrates the topology of the conserved residues of all 12 structures superimposed using the Cα carbon coordinates of conserved residues. To estimate the flexibility of the side chains of these residues, we have compared the root-mean-square deviations (rmsd’s) of Cα carbon coordinates with the rmsd’s of Ct calculated between corresponding coordinates of carbon atoms representing the side-chain terminal charged guanidinium and carboxyl groups. The Ct rmsd’s are always larger than the rmsd’s of Cα atoms, in some instances by even more than three times. (See the Supporting Information.) This indicates the significant flexibility of charged side chains already reported in other related studies,3032 which will also be extensively explored in this work.

Figure 2.

Figure 2

Conserved charged residues for 12 members of the HisRS family superimposed on the 1KMM chain C using the corresponding Cα carbon atoms of 9 conserved charged residues, shown as black balls. Blue or red balls indicate terminal carbon atoms Ct of negatively or positively charged side chains, respectively. Golden spheres indicate the positions of phosphorus atoms in ligands, if present.

Information about conserved residues has sometimes been helpful in the search for enzyme reaction mechanisms;38,39 however, to our knowledge, there has not yet been an attempt in the literature to verify alternative enzyme reaction mechanism variants by the systematic matching of all conserved amino acids with CFs.

Stationary Points for Reactions Catalyzed by HisRS

In the first part, we analyze the CF for the ATP aminoacylation reaction mechanism (Scheme 1a) proposed by Banik and Nandi.22 For the second reaction (Scheme 1b) catalyzed by HisRS, three possible mechanism variants of transfer of the aminoacyl moiety to tRNA have been proposed by Liu and Gauld.23

Scheme 1. Schematic Representation of Both Consecutive Reactions Considered in This Study.

Scheme 1

PPi denotes pyrophosphate, and A76 denotes the adenine of the tRNA terminal nucleotide.

Analysis of Catalytic Activity

To estimate changes in the activation barrier DTSS due to every conserved residue for each reaction step or variant, we have generated CFs using the CAMM derived directly from wave functions of the transition state and substrate smaller models illustrated in Figures 3 and 4. Such simple models are composed of the CAMM describing the charge redistribution on the reaction path from the substrate to the transition state without including any amino acid residue present in the higher level part of the ONIOM or QM/MM original calculations.22,23

To consider possible conformation changes of side chains, we have scanned the library of CAMMs for amino acid rotamers34 using the MULTISCAN procedure described in more detail elsewhere.21 This procedure, besides considering the DTSS, also takes into account the relatively strong interactions between charged residues responsible for the nonadditive effects of amino acid multiple mutations.21 The inclusion of interactions between charged side chains in the case of the ATP aminoacylation reaction (Scheme 1a) resulted in a total DTSS change reaching up to 2 kcal/mol, and it resulted in a total DTSS change reaching up to 21.5 kcal/mol for tRNA charging reaction (Scheme 1b) when compared with the DTSS values obtained with side-chain conformations from the crystal structure excluding interactions between charged amino acid residues. The corresponding side-chain conformation changes are shown in Figure 5a–e.

Figure 5.

Figure 5

Catalytic fields for all considered reaction mechanisms with conserved residues exerting the most pronounced catalytic activity. 3D view of the surface colored with the ΔS value. The blue color indicates the region where a positively charged catalyst will be beneficial. Red denotes locations beneficial for negatively charged residues. Numerical values in kilocalories per mole denote the lowering of activation energy (DTSS) obtained from the MULTISCAN procedure (conformation denoted by solid color), and the values in parentheses represent the DTSS changes resulting from the use of crystallographic side-chain conformations (faded) and neglecting interactions with remaining charged side chains.

The TS structure obtained from the quantum-chemical calculation is definite for a particular protein structure and could be regarded as rigid, so it is computationally much less demanding to determine AA rotamers interacting strongly with the TS and each other. This way, one may single out specific AA side-chain deformations accessible within zero-point vibrations, which could contribute to the lowering of the activation barrier and precisely define the active-site preorganization at the atomic level. Using conventional QM or ONIOM methods, one would have to consider a much larger number of degrees of freedom to obtain such structural data.

It is remarkable that the lowering of the activation barrier for TS3, reaching −14.7 kcal/mol in the case of the Glu83 side chain, seems to play a much more important role in the active-site preorganization than it does for other residues. The inclusion of interactions between charged residues with the MULTISCAN approach has been crucial to achieving agreement with experimental catalytic activities for a series of mutated Kemp eliminases obtained by direct evolution experiments, improving the catalytic activity of the corresponding theozymes.21

DTSS contributions arising from the presence of each conserved amino acid (aa) contribute to the crude estimation of the activation energy (eq 4)

graphic file with name jp1c05256_m007.jpg 4

where ETS and ERS denote the energies of the models of the transition state (TS) and the substrate (RS), respectively. B0 has been calculated as the difference in TS and RS energies for the smaller models of reactants shown in Figures 3 and 4. As in Table 1, only the conserved amino acids have been considered, so the B0 + ∑DTSS does not pretend to be close to the accurate activation energy but certainly indicates major catalytic residues among the nine considered conserved charged residues. The activation barrier change DTSS values presented in Table 1 indicate that for the ATP aminoacylation reaction (Scheme 1a), five residues, Glu47, Glu83, Arg113, Glu131, and Asp141, exert catalytic activity out of all nine conserved charged amino acids. Of course, many more neutral or charged nonconserved residues may contribute to the DTSS, but because of the little similarity between HisRSs, their contributions will be more random in contrast with those of conserved residues. In the case of the second reaction (Scheme 1b), where three different mechanisms23 have been considered, the lowest activation barrier, B0 + ∑DTSS, seems to be associated with the pathway involving TS3, in agreement with Liu and Gauld,23 although according to the ∑DTSS value, the path involving TS1 seems to be considerably reduced by interactions with the same conserved residues, as in the case of TS3. The barrier related to TS4 following TS3 via intermediate 3 practically disappears, making TS3 the lowest reaction path. In the TS3 and TS1 cases, tRNA charging is catalyzed by four residues, Glu83, Glu131, Asp141, and Glu270. Although the mechanism involving TS1 has the largest activation barrier reduction, ∑DTSS of 11.48 kcal/mol, the corresponding barrier, B0, for bare reactants of 84.00 kcal/mol still remains significant.

Table 1. Contributions of Conserved Residues from the HisRS Family to Differential Transition State Stabilization (DTSS) (kcal/mol) from the MULTISCAN Procedurea.

  His + ATP (Scheme 1a) His-AMP + tRNA (Scheme 1b)
residueb RS → TS0 1 → TS1 1 → TS2 1 → TS3 3 → TS4
Asp13 (R ≈ 22 Å) +0.57 (+0.30) +0.83 (+0.52) –0.68 (+0.51) +0.65 (−0.10) +0.09 (+0.04)
Glu47 (R ≈ 18 Å) –2.25 (−1.38) +0.09 (−1.62) +1.39 (−0.15) +1.89 (+1.06) –0.45 (+0.92)
Glu83 (R ≈ 8 Å) –8.02 (−9.46) –10.28 (−18.18) –4.38 (−5.83) –10.27 (+4.43) –2.58 (+2.38)
Arg113 (R ≈ 10 Å) –5.43 (−6.09) +2.82 (−6.21) +0.63 (+5.38) +1.48 (−8.71) +1.49 (−8.11)
Glu131 (R ≈ 14 Å) –4.60 (−5.53) –6.54 (−6.69) +0.21 (−0.56) –2.98 (−1.80) –1.14 (−1.35)
Asp141 (R ≈ 18 Å) –2.97 (−1.57) –1.07 (−1.47) +1.28 (−0.45) –1.01 (−1.35) –0.06 (−1.18)
Arg259 (R ≈ 7 Å) +5.26 (+8.44) +3.40 (+2.37) +0.15 (+14.42) +7.03 (+0.18) –0.42 (−1.34)
Glu270 (R ≈ 10 Å) +0.39 (+1.12) –2.41 (−0.64) –0.32 (−4.43) –4.72 (−7.27) +048 (−7.24)
Arg287 (R ≈ 13 Å) +3.26 (+1.77) +1.67 (−1.13) –1.96 (+0.03) +1.35 (−1.00) +0.21 (−0.93)
DTSS –14.79 –11.48 –4.96 –6.58 –2.38
B 32.88c 45.81d 34.85d 26.10d 0.26d
B0 = ETS – ERS 37.28 84.00 103.41 18.95 –0.04
B0 + ∑DTSS 22.49 72.52 98.45 12.37 –2.42
a

Numbers in parentheses correspond to side-chain conformation identical as in crystal structure without considering interactions with other residues.

b

R indicates the distance between Cα and phosphorus atom in angstroms.

c

Activation barriers, B, reported in the literature: ONIOM study22 including Arg113 and Arg259.

d

Activation barriers, B, reported in the literature: QM/MM study23 including Glu83, Arg113, Gln127, and Arg259.

This study allowed us to analyze on equal footing both half-reactions shown in Scheme 1 catalyzed by HisRS. Glu83, Glu131, and Asp141 exert catalytic activity for both half-reactions, whereas Glu47 and Arg113 are specific for ATP aminoacylation, proceeding over TS0 and Glu270 for tRNA charging involving TS3.

Figure 6 collects the activation barrier changes for all considered reactions, demonstrating the significant sensitivity of the DTSS to a specific unique molecular charge redistribution for each reaction step, reflected by the CF derived from the corresponding wave function via CAMMs. This makes the static CF derived from the minimal reactant and transition state wave functions via CAMMs a sensitive tool for pointing out amino acid residues exerting catalytic activity for every reaction step and mechanism variant.

Figure 6.

Figure 6

Changes in activation barrier DTSS yielded by the MULTISCAN procedure for conserved charged residues for both reactions and mechanisms catalyzed by HisRS involving transition states TS0, TS1, TS2, TS3, and TS4.

For class II aminoacyl-tRNA synthetases, available experimental data allowing a comparison with the present results are much more scarce, in contrast with class I aaRS’s for which systematic site-directed mutagenesis results are available.40 In the case of HisRS, the catalytic roles of E83 and R113 have been proven by the loss of activity when these residues are replaced by alanine in mutated proteins.41,42 Experimental data indicate that other residues like R259 also stabilize transition states. This can be illustrated in the TS3 case, where the transition state stabilization amounts to −6.54 kcal/mol, but when substrate destabilization is also taken into account in the DTSS, then the net DTSS result, +7.03 kcal/mol, is positive. In this particular case, this effect is partially compensated by Glu270 contributing DTSS = −4.72 kcal/mol. These two residues bonded by the salt bridge strongly interact with each other by changing side-chain conformations. This can be illustrated by the significant differences between the crystal structure and the conformations optimal for the extreme lowering of the activation barrier, reaching up to 7 kcal/mol. Such structural details could be essential for the complete understanding of enzyme reactions and rational biocatalyst design. Activation barriers for the critical steps for both half-reactions RS → TS0 and I → TS3 estimated from B0 + ∑DTSS are only in qualitative agreement with previous reports22,23 due to the approximations involved in the present study, probably mainly from the neglect of nonconserved residues near the active site and solvent effects.

There are several limitations of the present model based on static CFs:

  • (1)

    Only conserved charged residues are considered, which is necessary due to the little similarity between His-tRNA synthetases.

  • (2)

    The applicability of the method was only tested for a charged system, where the electrostatic term could be dominant and the contribution of entropy term could be minimal with respect to the enthalpy term. This is not necessarily a limitation that excludes nonpolar reagents. There are experimentally characterized examples of catalysis driven by the electric field for the Diels–Alder reaction43 and hydrogen abstraction from methane.44 Supplementing the electrostatic model by dispersion functions35 could extend the current model’s applicability to noncharged and nonpolar systems.

  • (3)

    Solvent effects are neglected.

  • (4)

    The use of the smallest possible models of reactants and transition states is another limitation of the method, however the charge redistribution represented by CAMM is only significant for the immediate neighborhood of broken or formed bonds.The validity of the small model has already been tested against a large model including Mg2+ ions, water, and four amino acid residues from the active site by Banik and Nandi.37 Using as small as possible of a model of the reacting system has the advantage of minimal errors resulting from superimposing SC atomic multipoles on the TS structure. This is supported by the fact that near the attack, the substrate conformation is usually close to the much more rigid transition-state structure.45

  • (5)

    The model is unable to deal with tunneling effects. However, in the case of transition states bound to the enzyme via hydrogen -bond chains, one may employ dynamic CFs, which may indicate favorable directions of cooperative proton transfer increasing the catalytic activity.46

  • (6)

    Because of the approximate (but still nonempirical) character of the DTSS estimates using the CAMM atomic multipole expansion, results should only be treated qualitatively. However, the DTSS-CAMM rankings of the catalytic activity for Kemp eliminase theozyme mutants29 or chorismate mutase active-site individual residues were in excellent agreement with the experimental free energies of activation and the accurate MP2 reference results,9 respectively. Analogous conclusions have been obtained for inhibitors binding several proteins.35

Conclusions

Catalytic fields derived from substrate and transition-state wave functions enable us to determine the optimal charge distribution of the catalytic environment, which may be naturally confronted with the topology of all conserved charged enzyme amino acids. This could be used to point out the most essential catalytic residues for a given reaction step or mechanism variant. The available library of CAMMs for amino acid side-chain rotamers allows for the rapid scanning of the entire conformational space in the search for active-site structures with optimal catalytic activity, allowing us to inspect in atomic detail the electrostatic preorganization supplementing the previously proposed alternative methods.47,48 This is in line with recent independent experimental17,43,44 and theoretical8,9,1120,49,50 studies indicating the essential role of electric fields and side-chain motions in enzyme catalysis. Because the substrate, transition state, and side-chain atomic multipole representations were derived from corresponding ab initio wave functions, the presented approach is entirely nonempirical (although approximate). In addition, the perturbational methodology applied in our approach allows for the partitioning of the DTSS catalytic activity into well-defined physical components, in contrast with variational ONIOM or QM/MM methods. The approach proposed in this work, while neglecting entropy, could be useful for screening possible reaction variants, which could later be followed by more computationally demanding free-energy calculations for the most promising cases. For this purpose, the molecular mechanics Poisson–Boltzmann surface area (MMPBSA) approach51 could be applied. The DTSS results obtained for the Kemp eliminase theozyme mutants using the CAMM and MMPBSA approaches similarly correlate with the experimental activation free energies (correlation coefficient for CAMM of 0.80 and MMPBSA of 0.79).21 The DTSS-CAMM results for ketosteroid isomerase mutants also yielded catalytic activities that were well correlated with the experimental free energies of activation (figure 4d in ref (29)). This could indicate considerable enthalpy–entropy compensation observed for protein binding with transition states.52

In addition, supplementing this with an analysis of the dynamic CF, defined as the derivative of the static CF, allows us to determine the role of the possible proton dislocations in hydrogen-bond chains46 that enhance the catalytic activity, opening the way for rational reverse biocatalyst design at very limited computational cost without resorting to empirical methods.

Acknowledgments

This research has been supported by the National Centre of Science grant NCN OPUS 2017/27/B/ST4/01327. The calculations were performed at the WCSS, ICM, and PCSS supercomputing centers. We are grateful to Dr. Wiktor Beker and Dr. Edyta Dyguda-Kazimierowicz for technical advice.

Supporting Information Available

The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jpcb.1c05256.

  • List of 12 selected His tRNA synthetase structures, their complete multisequence alignment results, and rmsd’s for superimposed charged terminal groups (PDF)

The authors declare no competing financial interest.

Special Issue

Published as part of The Journal of Physical Chemistry virtual special issue “Computational Advances in Protein Engineering and Enzyme Design”.

Supplementary Material

jp1c05256_si_001.pdf (764.2KB, pdf)

References

  1. Ribeiro A. J. M.; Yang L. F.; Ramos M. J.; Fernandes P. A.; Liang Z. X.; Hirao H. Insight into Enzymatic Nitrile Reduction: QM/MM Study of the Catalytic Mechanism of QueF Nitrile Reductase. ACS Catal. 2015, 5, 3740–3751. 10.1021/acscatal.5b00528. [DOI] [Google Scholar]
  2. Biggs G. S.; Klein O. J.; Boss S. R.; Barker P. D. Unlocking the Full Evolutionary Potential of Artificial Metalloenzymes Through Direct Metal-Protein Coordination A review of recent advances for catalyst development. Johnson Matthey Technol. Rev. 2020, 64, 407–418. 10.1595/205651320X15928204097766. [DOI] [Google Scholar]
  3. Shokhen M.; Hirsch M.; Khazanov N.; Ozeri R.; Perlman N.; Traube T.; Vijayakumar S.; Albeck A. From Catalytic Mechanism to Rational Design of Reversible Covalent Inhibitors of Serine and Cysteine Hydrolases. Isr. J. Chem. 2014, 54, 1137–1151. 10.1002/ijch.201300144. [DOI] [Google Scholar]
  4. Sandalova T.; Schneider G.; Kack H.; Lindqvist Y. Structure of dethiobiotin synthetase at 0.97-angstrom resolution. Acta Crystallogr., Sect. D: Biol. Crystallogr. 1999, 55, 610–624. 10.1107/S090744499801381X. [DOI] [PubMed] [Google Scholar]
  5. Warshel A.; Levitt M. Theoretical studies of enzymic reactions: dielectric, electrostatic and steric stabilization of the carbonium ion in the reaction of lysozyme. J. Mol. Biol. 1976, 103, 227–249. 10.1016/0022-2836(76)90311-9. [DOI] [PubMed] [Google Scholar]
  6. Chung L. W.; Sameera W. M. C.; Ramozzi R.; Page A. J.; Hatanaka M.; Petrova G.; Harris T. V.; li X.; Ke Z.; Liu F.; et al. The ONIOM method and its applications. Chem. Rev. 2015, 115, 5678–5796. 10.1021/cr5004419. [DOI] [PubMed] [Google Scholar]
  7. Gaines C. S.; Giese T. J.; York D. M. Cleaning Up Mechanistic Debris Generated by Twister Ribozymes Using Computational RNA Enzymology. ACS Catal. 2019, 9, 5803–5815. 10.1021/acscatal.9b01155. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Warshel A.; Sharma P. K.; Kato M.; Xiang Y.; Liu H.; Olsson M. H. M. Electrostatic basis for enzyme catalysis. Chem. Rev. 2006, 106, 3210–3235. 10.1021/cr0503106. [DOI] [PubMed] [Google Scholar]
  9. Szefczyk B.; Mulholland A. J.; Ranaghan K. E.; Sokalski W. A. Differential transition-state stabilization in enzyme catalysis: Quantum chemical analysis of interactions in the chorismate mutase reaction and prediction of the optimal catalytic field. J. Am. Chem. Soc. 2004, 126, 16148–16159. 10.1021/ja049376t. [DOI] [PubMed] [Google Scholar]
  10. Wang L.; Fried S. D.; Boxer S. G.; Markland T. E. Quantum delocalization of protons in the hydrogen-bond network of an enzyme active site. Proc. Natl. Acad. Sci. U. S. A. 2014, 111, 18454–18459. 10.1073/pnas.1417923111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Zoi I.; Antoniou D.; Schwartz S. D. Electric Fields and Fast Protein Dynamics in Enzymes. J. Phys. Chem. Lett. 2017, 8, 6165–6170. 10.1021/acs.jpclett.7b02989. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Hennefarth M. R.; Alexandrova A. N. Direct Look at the Electric Field in Ketosteroid Isomerase and Its Variants, ACS Catal. ACS Catal. 2020, 10, 9915–9924. 10.1021/acscatal.0c02795. [DOI] [Google Scholar]
  13. Morgenstern A.; Jaszai M.; Eberhart M. E.; Alexandrova A. N. Quantified electrostatic preorganization in enzymes using the geometry of the electron charge density. Chem. Sci. 2017, 8, 5010–5018. 10.1039/C7SC01301A. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Li P. F.; Soudackov A. V.; Hammes-Schiffer S. Fundamental Insights into Proton-Coupled Electron Transfer in Soybean Lipoxygenase from Quantum Mechanical/Molecular Mechanical Free Energy Simulations. J. Am. Chem. Soc. 2018, 140, 3068–3076. 10.1021/jacs.7b13642. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Léonard N. G.; Dhaoui R.; Chantarojsiri T.; Yang J. Y. Electric Fields in Catalysis: From Enzymes to Molecular Catalysts. ACS Catal. 2021, 11, 10923–10932. 10.1021/acscatal.1c02084. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Welborn V. V.; Pestana L. R.; Head-Gordon T. Computational optimization of electric fields for better catalysis design. Nat. Catal. 2018, 1, 805–805. 10.1038/s41929-018-0161-y. [DOI] [Google Scholar]
  17. Fried S. D.; Bagchi S.; Boxer S. G. Extreme electric fields power catalysis in the active site of ketosteroid isomerase. Science 2014, 346, 1510–1514. 10.1126/science.1259802. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Kamerlin S. C. L.; Sharma P. K.; Chu Z. T.; Warshel A. Ketosteroid isomerase provides further support for the idea that enzymes work by electrostatic preorganization. Proc. Natl. Acad. Sci. U. S. A. 2010, 107, 4075–4080. 10.1073/pnas.0914579107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Vaissier V.; Sharma S. C.; Schaettle C.; Zhang T.; Head-Gordon T. Computational Optimization of Electric Fields for Improving Catalysis of a Designed Kemp Eliminase. ACS Catal. 2018, 8, 219–227. 10.1021/acscatal.7b03151. [DOI] [Google Scholar]
  20. Chen X.; Schwartz S. D. Examining the Origin of Catalytic Power of Catechol O-Methyltransferase. ACS Catal. 2019, 9, 9870–9879. 10.1021/acscatal.9b02657. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Beker W.; Sokalski W. A. Bottom-Up Nonempirical Approach To Reducing Search Space in Enzyme Design Guided by Catalytic Fields. J. Chem. Theory Comput. 2020, 16, 3420–3429. 10.1021/acs.jctc.0c00139. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Banik S. D.; Nandi N. Aminoacylation Reaction in the Histidyl-tRNA Synthetase: Fidelity Mechanism of the Activation Step. J. Phys. Chem. B 2010, 114, 2301–2311. 10.1021/jp910730s. [DOI] [PubMed] [Google Scholar]
  23. Liu H.; Gauld J. W. Substrate-assisted Catalysis in the Aminoacyl Transfer Mechanism of Histidyl-tRNA Synthetase: A Density Functional Theory Study. J. Phys. Chem. B 2008, 112, 16874–16882. 10.1021/jp807104b. [DOI] [PubMed] [Google Scholar]
  24. Eriani G.; Delarue M.; Poch O.; Gangloff J.; Moras D. Partition of Transfer-RNA Synthetases into 2 Classes Based on Mutually Exclusive Sets of Sequence Motifs. Nature 1990, 347, 203–206. 10.1038/347203a0. [DOI] [PubMed] [Google Scholar]
  25. Sokalski W. A.; Poirier R. A. Cumulative Atomic Multipole Representation of the Molecular Charge Distribution and Its Basis Set Dependence. Chem. Phys. Lett. 1983, 98, 86–92. 10.1016/0009-2614(83)80208-5. [DOI] [Google Scholar]
  26. Cardamone S.; Hughes T. J.; Popelier P. L. A. Multipolar electrostatics. Phys. Chem. Chem. Phys. 2014, 16, 10367–10387. 10.1039/c3cp54829e. [DOI] [PubMed] [Google Scholar]
  27. Ponder J. W.; Wu C.; Ren P.; Pande V. S.; Chodera J. D.; Schnieders M. J.; Haque I.; Mobley D. L.; Lambrecht D. S.; DiStasio R. A.; Head-Gordon M.; Clark G. N. I.; Johnson M. E.; Head-Gordon T. Current Status of the AMOEBA Polarizable Force Field. J. Phys. Chem. B 2010, 114, 2549–2564. 10.1021/jp910674d. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Gresh N.; Cisneros G. A.; Darden T. A.; Piquemal J.-P. Anisotropic, polarizable molecular mechanics studies of inter- and intramolecular interactions and ligand-macromolecule complexes. A bottom-up strategy. J. Chem. Theory Comput. 2007, 3, 1960–1986. 10.1021/ct700134r. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Beker B.; van der Kamp M. W.; Mulholland A. J.; Sokalski W. A. Rapid estimation of catalytic efficiency by cumulative atomic multipole moments: Application to ketosteroid isomerase mutants. J. Chem. Theory Comput. 2017, 13, 945–955. 10.1021/acs.jctc.6b01131. [DOI] [PubMed] [Google Scholar]
  30. Najmanovich R.; Kuttner J.; Sobolev V.; Edelman M. Sidechain flexibility in proteins upon ligand binding. Proteins: Struct., Funct., Genet. 2000, 39, 261–268. . [DOI] [PubMed] [Google Scholar]
  31. Gaudreault F.; Chartier M.; Najmanovich R. Side-chain rotamer changes upon ligand binding: common, crucial, correlate with entropy and rearrange hydrogen bonding. Bioinformatics 2012, 28, i423–i430. 10.1093/bioinformatics/bts395. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Bhowmick A.; Sharma S. C.; Honma H.; Head-Gordon T. The role of side-chain entropy and mutual information for improving the de novo design of Kemp eliminases KE07 and KE70. Phys. Chem. Chem. Phys. 2016, 18, 19386. 10.1039/C6CP03622H. [DOI] [PubMed] [Google Scholar]
  33. Scouras A. D.; Daggett V. The dynameonics rotamer library: Amino acid side chain conformations and dynamics from comprehensive molecular simulations in water. Protein Sci. 2011, 20, 341–352. 10.1002/pro.565. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Advanced Materials Engineering and Modelling Group, Wrocław University of Science and Technology. http://camm.pwr.edu.pl/CAMM (accessed 2021-09-16).
  35. Jedwabny W.; Dyguda-Kazimierowicz E.; Pernal K.; Szalewicz K.; Patkowski K. Extension of an atom-atom dispersion function to halogen bonds and its use for rational design of drugs and biocatalysts. J. Phys. Chem. A 2021, 125, 1787–1799. 10.1021/acs.jpca.0c11347. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Wang S.; Peng J.; Xu J. B. Alignment of distantly related protein structures: algorithm, bound and implications to homology modeling. Bioinformatics 2011, 27, 2537–2545. 10.1093/bioinformatics/btr432. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Banik S. D.; Nandi N. Influence of the conserved active site residues of histidyl tRNA synthetase on the mechanism of aminoacylation reaction. Biophys. Chem. 2011, 158, 61–72. 10.1016/j.bpc.2011.05.006. [DOI] [PubMed] [Google Scholar]
  38. Florián J.; Goodman M. F.; Warshel A. Computer Simulation of the Chemical Catalysis of DNA Polymerases: Discriminating between Alternative Nucleotide Insertion Mechanisms for T7 DNA Polymerase. J. Am. Chem. Soc. 2003, 125, 8163–8177. 10.1021/ja028997o. [DOI] [PubMed] [Google Scholar]
  39. Durr S. L.; Bohuszewicz O.; Berta D.; Suardiaz R.; Jambrina P. G.; Peter C.; Shao Y.; Rosta E. The Role of Conserved Residues in the DEDDh Motif: the Proton-Transfer Mechanism of HIV-1 RNase. ACS Catal. 2021, 11, 7915–7927. 10.1021/acscatal.1c01493. [DOI] [Google Scholar]
  40. Avis J. M.; Fersht A. R. Use of binding-energy in catalysis – optimization of rate in a multistep reaction. Biochemistry 1993, 32, 5321–5326. 10.1021/bi00071a006. [DOI] [PubMed] [Google Scholar]
  41. Guth E.; Connolly S. H.; Bovee M.; Francklyn C. S. A Substrate-Assisted Concerted Mechanism for Aminoacylation by a Class II Aminoacyl-tRNA Synthetase. Biochemistry 2005, 44, 3785–3794. 10.1021/bi047923h. [DOI] [PubMed] [Google Scholar]
  42. Li L.; Weinreb V.; Francklyn C.; Carter C. W. Histidyl-tRNA Synthetase Urzymes: Class I and II aminoacyl tRNA synthetase urzymes have comparable catalytic activities for cognate amino acid activation. J. Biol. Chem. 2011, 286, 10387–10395. 10.1074/jbc.M110.198929. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Aragonès A. C.; Haworth N. L.; Darwish N.; Ciampi S.; Bloomfield N. J.; Wallace G. G.; Diez-Perez I.; Coote M. L. Electrostatic catalysis of a Diels–Alder reaction. Nature 2016, 531, 88–91. 10.1038/nature16989. [DOI] [PubMed] [Google Scholar]
  44. Stuyver T.; Ramanan R.; Mallick D.; Shaik S. Oriented (Local) Electric Fields Drive the Millionfold Enhancement of the H-Abstraction Catalysis Observed for Synthetic Metalloenzyme Analogues. Angew. Chem., Int. Ed. 2020, 59, 7915–7920. 10.1002/anie.201916592. [DOI] [PubMed] [Google Scholar]
  45. Fedorov A.; Shi W.; Kicska G.; Fedorov E.; Tyler P. C.; Furneaux R. H.; Hanson J. C.; Gainsford G. J.; Larese J. Z.; Schramm V. L.; et al. Transition State Structure of Purine Nucleoside Phosphorylase and Principles of Atomic Motion in Enzymatic Catalysis. Biochemistry 2001, 40, 853–860. 10.1021/bi002499f. [DOI] [PubMed] [Google Scholar]
  46. Kędzierski P.; Zaczkowska M.; Sokalski W. A. Catalytic Power of Ketosteroid Isomerase Related to Reversal of Proton Dislocations in Hydrogen-Bond Networks. J. Phys. Chem. B 2020, 124, 3661–3666. 10.1021/acs.jpcb.0c01489. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Cui Q.; Karplus M. Quantum Mechanical/Molecular Mechanical Studies of the Triosephosphate Isomerase-Catalyzed Reaction: Verification of Methodology and Analysis of Reaction Mechanisms. J. Phys. Chem. B 2002, 106, 1768–1798. 10.1021/jp012659c. [DOI] [Google Scholar]
  48. Acevedo O.; Jorgensen W. L. Advances in Quantum and Molecular Mechanical (QM/MM) Simulations for Organic and Enzymatic Reactions. Acc. Chem. Res. 2010, 43, 142–152. 10.1021/ar900171c. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Świderek K. I. T.; Tuñón I.; Moliner V.; Bertran J. Protein Flexibility and Preorganization in the Design of Enzymes. The Kemp Elimination Catalyzed by HG3.17. ACS Catal. 2015, 5, 2587–2595. 10.1021/cs501904w. [DOI] [Google Scholar]
  50. Kulkarni Y. S.; Amyes T. L.; Richard J. P.; Kamerlin S. C. L. Uncovering the Role of Key Active-Site Side Chains in Catalysis: An Extended Brønsted Relationship for Substrate Deprotonation Catalyzed by Wild-Type and Variants of Triosephosphate Isomerase. J. Am. Chem. Soc. 2019, 141, 16139–16150. 10.1021/jacs.9b08713. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Kumari R.; Kumar R.; Lynn A. g_mmpbsa - A GROMACS tool for high-throughput MM-PBSA calculations. J. Chem. Inf. Model. 2014, 54, 1951–1962. 10.1021/ci500020m. [DOI] [PubMed] [Google Scholar]
  52. Houk K. N.; Leach A. G.; Kim S. P.; Zhang X. Y. Binding affinities of host-guest, protein-ligand, and protein-transition-state complexes. Angew. Chem., Int. Ed. 2003, 42, 4872–4897. 10.1002/anie.200200565. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

jp1c05256_si_001.pdf (764.2KB, pdf)

Articles from The Journal of Physical Chemistry. B are provided here courtesy of American Chemical Society

RESOURCES