Abstract
Leucyl-tRNA synthetase (LeuRS) is a Class I aminoacyl-tRNA synthetase (aaRS) that synthesizes leucyl-tRNAleu for codon-directed protein synthesis. Two signature sequences, HxGH and KMSKS help stabilize transition-states for amino acid activation and tRNA aminoacylation by all Class I aaRS. Separate alanine mutants of each signature, together with the double mutant, behave in opposite ways in Pyrococcus horikoshii LeuRS and the 129-residue urzyme ancestral model generated from it (LeuAC). Free energy coupling terms, Δ(ΔG‡), for both reactions are large and favourable for LeuRS, but unfavourable for LeuAC. Single turnover assays with 32Pα-ATP show correspondingly different internal products. These results implicate domain motion in catalysis by full-length LeuRS. The distributed thermodynamic cycle of mutational changes authenticates LeuAC urzyme catalysis far more convincingly than do single point mutations. Most importantly, the evolutionary gain of function induced by acquiring the anticodon-binding (ABD) and multiple insertion modules in the catalytic domain appears to be to coordinate the catalytic function of the HxGH and KMSKS signature sequences. The implication that backbone elements of secondary structures achieve a major portion of the overall transition-state stabilization by LeuAC is also consistent with coevolution of the genetic code and metabolic pathways necessary to produce histidine and lysine sidechains.
Graphical Abstract
INTRODUCTION
Class I aminoacyl-tRNA synthetases (aaRS) are one of two enzyme superfamilies that, together with their cognate tRNAs, translate the genetic code (1). They perform that function by activating cognate amino acids at the expense of the two high-energy bonds in ATP and transferring the activated aminoacyl group from the 5′ position of AMP to the tRNA 3′-terminal ribose. Eleven contemporary Class I aaRS families are responsible for the hydrophobic amino acids (isoleucine, valine, leucine, methionine), together with the larger homologs of the acidic (glutamate), amide (glutamine), and aromatic (tryptophan, tyrosine) amino acids, as well as arginine and cysteine. Some archaea and bacteria also possess a Class I lysyl-tRNA synthetase (2), which in most organisms is a Class II enzyme. All Class I aaRS share the same three domains (Figure 1): a Rossmann dinucleotide-binding catalytic domain, variable-length insertions between the two crossover connections of the Rossmann fold (CP) (3,4) including an embedded editing domain in larger Class I enzymes, and an idiosyncratic anticodon-binding domain, ABD. The active site within the Rossmann fold is highly conserved across the superfamily (5). Moreover, that domain has been re-constructed without any of the other domains, and it retains all three functions—amino acid activation, retention of the activated amino acid, and transfer to cognate tRNA—necessary to implement a rudimentary form of coding. For these reasons, we have argued that such constructs, called ‘urzymes’(6), are excellent experimental models for studying the emergence and early evolutionary history of ancestral aaRS (7–10) and genetic coding generally.
Here, we utilize the leucyl-tRNA synthetase from Pyrococcus horikoshii, LeuRS (11) and its urzyme, LeuAC (12), to examine the evolutionary gain of function associated with acquiring the CP and ABD domains. We note that as P. horikoshii expresses the archaeal-like LeuRS, which is monomeric, LeuAC is also very likely monomeric. The location of its post-transfer editing domain within the extended connecting peptide (CP) insertion differs from that in eubacterial LeuRS (13). Construction of the LeuAC urzyme entailed removing the entire extended CP domain, which includes many acquired insertion modules (Douglas, J., Bouckaert, R., Carter, C.W.J. and Wills, P. (2023) Enzymic recognition of amino acids drove the evolution of primordial genetic codes. Research Square DOI 10.21203/rs.3.rs-2924681/v1) as well as the ABD and C-terminal domains (Figure 1). LeuAC is thus homologous in that respect to the corresponding urzyme excerpted from Geobacillus stearothermophilus, TrpAC (6,14). However, the mass deleted from the catalytic domain (377 amino acids) is ∼ five times more than were deleted (74 amino acids) from the TrpRS catalytic domain. That missing mass includes insertion modules that others have suggested were essential to aminoacylation (15). LeuAC required a modest number of sequence changes on the surface to stabilize the removal of the Connecting Peptide. Supplementary Figure S1 in the Supplementary Information for (12) summarizes these sequence changes.
Two consensus sequence motifs present in the Rossmann fold (HIGH and KMSKS) have been implicated multiple ways in catalysis of both amino acid activation and acyl-transfer to tRNA in many Class I aaRS. Their conservation across the superfamily has been an evolutionary curiosity, suggesting that they are central to catalysis. However, both histidine and lysine require complex biosynthetic pathways, and thus are unlikely to have been available for the earliest stages of codon-dependent translation (16–19).
The unexpectedly modest impact of mutating lysine residues to alanine in the LeuAC urzyme KMSKS signature (12) motivated us to examine the catalytic roles of both signatures in both LeuRS and LeuAC. An obvious way to do this is with a factorial design in which both histidines in the HxGH signature are also mutated to alanine separately and in combination. That combinatorial mutagenesis measures the energetic coupling between the two signatures. To our knowledge, and despite a landmark combinatorial mutagenesis of the KFGKT sequence in TyrRS (20–23), no one has examined the coordinated behavior of the two Class I catalytic signatures, even in a full-length Class I aaRS. Comparing that coupling in urzyme and full-length enzyme maps changes in the evolutionary time domain, hence measures the catalytic gain of function from integrating the domains deleted in the urzyme.
The comparison reinforces previous suggestions (24–27) that full-length Class I aaRS benefit from a significant intramolecular cooperativity that is absent from the corresponding urzymes. We identify side chain packing interactions between the hydrophobic side chains in each catalytic signature and a conserved cluster of nonpolar side chains in the anticodon-binding domain. Embedding the valine and methionine side chains of the LeuRS HVGH and KMSKS sequences into that cluster provides a structural rationale for their coordination during the catalytic cycle, consistent with their catalytic coupling in the full-length enzyme.
The anti-coupling between HVGH and KMSKS signatures in the urzyme has important implications for the emergence and early evolution of polypeptide catalysts. Surprisingly, the AVGA variant of the LeuAC urzyme benefits from both the wild type KMSKS signature and unfavorable coupling energy; it is thus the most active LeuAC variant and is actually more active at aminoacylation than the corresponding mutant of full-length LeuRS. Catalysis by aaRS urzymes does not require specific amino acid side chains that are highly conserved in the active sites of all Class I aaRS, thus significantly broadening the sequence space for primordial catalysts and creating substantive experimental support for Wong's proposal that the genetic code coevolved with the amino acid biosynthetic pathways (16,17).
Finally, the consistent response to combinatorial mutational changes evident from the thermodynamic cycle analysis, and provision of a structural rationale for differential intramolecular coupling in the full-length enzyme both also decisively confirm the authenticity of catalysis by aaRS urzymes.
MATERIALS AND METHODS
Expression and purification of LeuRS and LeuAC
The gene for Pyrococcus horikoshii (Ph) LeuRS was synthesized by Gene Universal and expressed from pET-11a in BL21-CodonPlus (DE3)-RIPL (Agilent). Cells were grown at 37°C and induced with 300 μM IPTG for 4 h then harvested and stored overnight at –20°C. The cell pellet was resuspended in 1× Ni-NTA buffer (20 mM Tris, pH 8.0, 300 mM NaCl, 10 mM imidazole, 5 mM β-ME) plus cOmplete protease inhibitor (Roche) and lysed by three 15k psi passes on an Avestin Emulsiflex. Cell debris was pelleted at 4°C 30 min 20k rpm. The soluble fraction was heated at 80°C for 30 min to denature native Escherichia coli proteins. The heated cell extract was then pelleted, and the soluble material was loaded on to an equilibrated Ni-NTA column. The column was washed with three volumes 1× Ni-NTA buffer, then protein was eluted in a stepwise fashion with imidazole concentrations of 40, 80, 100, 200, and 500 mM imidazole. The fractions containing the protein of interest were pooled and dialyzed overnight against 200 mM HEPES, pH 7.4, 450 mM NaCl, 100 mM KCl, 10 mM β-ME. The following day the dialyzed protein was concentrated, brought to 50% glycerol and stored at –20°C.
LeuAC was expressed as an MBP fusion from pMAL-c2x in BL21Star(DE3) (Invitrogen). Cells were grown, induced, harvested, and lysed similarly to Ph LeuRS with the distinct difference of being resuspended in buffer (20 mM Tris, pH 7.4, 1 mM EDTA, 5 mM β-ME, 17.5% glycerol, 0.1% NP40, 33 mM (NH4)2SO4, 1.25% glycine, 300 mM guanidine hydrochloride) plus cOmplete protease inhibitor (Roche). LeuAC crude extract was then pelleted at 4°C 30 min 15k rpm to remove insoluble material. The extract was then diluted 1:4 with Optimal Buffer and loaded onto equilibrated Amylose FF resin (Cytiva). The resin was washed with five column volumes of buffer and the protein was eluted with 10 mM maltose in Optimal Buffer. Fractions containing protein were concentrated, brought to 50% glycerol and stored at –20°C. All protein concentrations were determined using the Pierce™ Detergent-Compatible Bradford Assay Kit (Thermo Scientific). Experimental assays were performed either with the intact MBP-LeuAC fusion protein or with samples cleaved by tobacco etch virus (TEV) protease, purified as described (28). Purity and cleavage efficiency was determined by running samples on PROTEAN® TGX (Bio-RAD) gels.
Single turnover active-site titration assays
Active-site titration assays were performed as described (29,30). 3 μM of protein was added to 1x reaction mix (50 mM HEPES, pH 7.5, 10 mM MgCl2, 5 μM ATP, 50 mM amino acid, 1 mM DTT, inorganic pyrophosphatase, and α-labeled [32P] ATP) to start the reaction at 37°C for LeuAC and on ice, or slightly above 0°C for LeuRS. The α labeling position allowed us to follow time courses of ADP and AMP production, as well as for ATP consumption. Timepoints were quenched in 0.4 M sodium acetate 0.1% SDS and kept on ice until all points had been collected. Quenched samples were spotted on TLC plates, developed in 850 mM Tris, pH 8.0, dried and then exposed for varying amounts of time to a phosphor image screen and visualized with a Typhoon Scanner (Cytiva). ImageJ (31) was used to quantitate intensities of each nucleotide. The time-dependent of loss (ATP) or de novo appearance (ADP, AMP) of the three adenine nucleotide phosphates were fitted using the nonlinear regression module of JMP16PRO™ (SAS Institute, Cary NC) to equation (1) (29):
(1) |
where kchem is the first-order rate constant, kcat is the rate of turnover, A is the amplitude of the first-order process or the size of the ‘burst’, and C is an offset. The burst size is proportional to the number of active molecules responsible for producing a product in the first round of catalysis in the added catalyst. Eq. (1) was also used for convenience to fit the appearance of nucleotide products, AMP and ADP. In that case the roles of A and C are inverted, C being the burst size and A the offset and both rate constants have the opposite sign.
Burst size estimation from fitted parameters in equation (1)
For [ATP] decay curves, the fitted A value estimates n directly as n = A*[ATP]/[Enzyme]. C gives n = (1 – C)*[ATP]/[enzyme]. For exponentially increasing concentrations of product, the situation reverses, and n = (1 – A) *[ATP]/[enzyme] and n = C*[ATP]/[enzyme]. The very small variance of multiple estimates and the internal self-consistency discussed in (RESULTS:Single turnover…) justify these approximations.
tRNALeu aminoacylation assays
A plasmid encoding the P. horikoshii tRNALeu (TAG codon) was synthesized by Integrated DNA Technologies and used as template for PCR amplification of the tRNA and upstream T7 promoter and downstream Hepatitis Delta Virus (HDV) ribozyme. The PCR product was used directly as template for T7 transcription. Following a 4-hour transcription at 37°C the RNA was cycled five times (90°C for 1 min, 60°C for 2 min, 25°C for 2 min) to increase the cleavage by HDV. The tRNA was purified by urea PAGE and crush and soak extraction. The tRNA 2′–3′ cyclic phosphate was removed by treatment with T4 PNK (New England Biolabs) following the manufacturer's protocol. The tRNA was then phenol chloroform isoamyl alcohol extracted, filter concentrated, aliquoted, and stored at –20°C.
We determined the active fraction of tRNALeu by following extended acylation assays using full-length P. horikoshii LeuRS until they reached a plateau. That plateau value was 0.38, which we used to compute tRNALeu concentrations in assays with both LeuRS and LeuAC.
Aminoacylations were performed in 50 mM HEPES, pH 7.5, 10 mM MgCl2, 20 mM KCl, 5 mM DTT with indicated amounts of ATP and amino acids. Desired amounts of unlabeled tRNA—mixed with [α32P] A76-labeled tRNA for assays by LeuAC—were heated in 30 mM HEPES, pH 7.5, 30 mM KCl to 90°C for 2 min. The tRNA was then cooled linearly (drop 1°C/30 s) until it reached 80°C when MgCl2 was added to a final concentration of 10 mM. The tRNA continued to cool linearly until it reached 20°C.
Data processing and statistical analysis
Phosphorimaging screens of TLC plates were densitometered using ImageJ (31). Data were transferred to JMP16PRO™ via Microsoft Excel (version 16.49), after intermediate calculations. We fitted active-site titration curves to Eq. (1) using the JMP16PRO™ nonlinear fitting module. R2 values were all >0.97 and most were >0.99.
Factorial design matrices in Supplementary Tables S1 and S2 were processed using the Fit model multiple regression analysis module of JMP16PRO™ Pro, using an appropriate form of equation (2) (32)
(2) |
where Yobs is a dependent variable, usually an experimental observation, β0 is a constant derived from the average value of Yobs, βi and βij are coefficients to be fitted, Pi,j are independent predictor variables from the design matrix, and ϵ is a residual to be minimized. All rates were converted to free energies of activation, ΔG‡ = –RTln(k), before regression analysis because free energies are additive, whereas rates are multiplicative. For example, the activation free energy for the first-order decay rate in single-turnover experiments is ΔG‡kchem.
Multiple regression analyses of factorial designs exploit the replication inherent in the full collection of experiments to estimate experimental variances on the basis of t-test P-values, in contrast to the presenting error bars showing the variance of individual datapoints. Analyses reported here also entail triplet experimental replicates, which improve the associated analysis of variance.
RESULTS
The experimental design matrix for this work is the 23 factorial design that tests all possible combinations of HVGH → AVGA and KMSKS → AMSAS mutations in both full-length LeuRS and the LeuAC urzyme. The eight variants were constructed, expressed, and purified as described previously (12) and in MATERIALS AND METHODS. We assayed all variants for leucine activation by single turnover active-site titrations (Supplementary Table S1) and for aminoacylation of tRNALeu in triplicate (Supplementary Table S3). Active-site titration experiments provided suitable estimates of burst sizes for all variants of both LeuAC and LeuRS. However, first-order rate constants were too fast at room temperature to fit reliably for LeuRS variants. However, reactions on ice (at ∼0°C) were sufficiently slow to permit corresponding estimation of the mutational impact on leucine activation by LeuRS variants. The first section describes those results.
We then describe aminoacylation rates measured at saturating concentrations of leucine (12) and the highest achievable tRNALeu concentrations. The next section compares mutational impacts on leucine activation and tRNALeu aminoacylation by both LeuAC and LeuRS. A subsequent section places those results into the context of the evolutionary gain of function in evolving LeuRS from LeuAC. Finally, we provide a structural rationale for the enhanced energetic coupling between the two signatures in the fully-evolved enzyme.
Single turnover experiments give consistent first-order rates for changes in ATP, ADP, and AMP for both LeuAC and LeuRS
Single turnover kinetic experiments, often called active-site titrations (AST) use substrate level amounts of enzyme. They measure the size of the burst corresponding to the first round of catalysis and are therefore the method of choice to estimate fractional activities of enzyme variants (29,33) and normalizing enzyme concentrations when computing kcat values.
AST assays performed here to estimate the active fractions in all variants provide supplemental evidence that mutational impacts are comparable in both leucine activation and aminoacylation by LeuAC. We uncovered an unexpected phenomenon when we followed the time courses for all three adenine nucleotides in AST assays of aaRS urzymes (Figure 2). ATP consumption is accompanied by ADP production (12), in addition to the expected products, aminoacyl-5′AMP and AMP, which is produced by product release and hydrolysis. Prior to analyzing kinetic data for the thermodynamic cycles presented here, we studied these curves in detail to assure a consistent interpretation.
Quantitation of the LeuAC active-site titration curves for the three adenine nucleotides (Figure 2A) confirms first that they are additive; n-values (Figure 2B) from ATP consumption are quantitative sums of those for ADP and AMP produced (Figure 2C). LeuAC variant n-values from fitting C and A in Eq. (1) agree with a fractional error of ∼0.09. ATP consumption <n_T> values range from 0.64 to 0.69 for the different combinatorial variants. <n_D > values (ADP) range from 0.49 to 0.57. <n_M> values (AMP) range from 0.13 to 0.17. In each case, n_T = n_D + n_M within 2% (i.e. intercepts are ∼0; slopes are both ∼1). These data furnish a self-consistent picture of multiple catalyzed reactions occurring in the first round of catalysis (Supplementary Table S2).
LeuAC variant-specific column vectors in Figure 2B are quite strictly parallel; all six correlation coefficients have R2 values > 0.99. Thus, although combinatorial mutagenesis changes the active fractions, it has no effect on the relative magnitudes of the four different n-values. Strict linear dependence means that, for the purpose of normalizing the active fractions of different variants in analyzing aminoacylation rates, any of the n-values would be equally suitable. For the present purpose, we chose to normalize acylation rates using the n_T values, to estimate the minimum rate enhancements of acylation by full length LeuRS, relative to LeuAC.
The preceding analysis provides additional circumstantial evidence supporting our earlier proposal that ADP production arises from successive phosphorylations of bound leucyl-5′AMP. That proposal was originally based on two observations (12). (i) AMP production exhibits a first order rate early in the titration, followed by a sharp increase in the steady state rate (see Supplementary Figure S2(c) in (12). (ii) The 3D structure of the LeuAC active-site configuration provides alternative ATP-binding sites that place the γ-phosphate group into suitable positions to phosphorylate the α-phosphate of the bound adenylate first to ADP and then to ATP (see Figure 9 of (12)). If those two phosphorylation reactions regenerated ATP, it would promote bound Leu-5′AMP to a transition-state-like configuration. That would be consistent with the relatively independent burst sizes and relative proportions of ADP and AMP.
Combinatorial mutants of HIGH and KMSKS catalytic signatures in LeuRS reduce its catalytic proficiency for tRNALeu acylation to that of LeuAC and its mutational variants
Supplementary Table S4 summarizes the experimental data for aminoacylation of tRNALeu. A histogram of transition-state stabilization free energies for tRNALeu aminoacylation, ΔG‡kacyl, for all eight variants (Figure 3) reveals that the two catalytic signatures make profound contributions to the catalytic proficiency of the full-length enzyme. The distributions in the upper right of Figure 3B illustrate that ΔG‡ measurements for all LeuRS mutational variants have values comparable to those of the wild type Urzyme LeuAC and its corresponding mutations. Rather than forming a separate distribution in the neighborhood of the WT LeuRS, as observed for LeuAC variants, any mutation compromises all LeuRS variants to levels of the LeuAC urzyme and its combinatorial mutants. Active-site mutations therefore have severe effects on the full-length enzyme, but substantially smaller effects on the urzyme. Thus, neither wild-type urzyme active site nor any full-length LeuRS variant can achieve a comparable rate enhancement chemistry to that of the full-length enzyme.
AST assays (Figure 2) furnish values of the internal first-order rate constant for first-round catalysis of leucine activation. Using these values, we can compare the impacts of mutation on the activation reaction. Figure 3A compares transition-state stabilization free energies of the four LeuAC variants in the two successive steps of tRNALeu biosynthesis. Qualitatively, the main difference between them is that all variants have lower ΔG‡ for activation than for aminoacylation. The AVGA mutant is markedly faster than any of the others, and the AMSAS variant is slowest by a small margin over the double mutant. These differences are easier to appreciate when transformed by estimating parameters for the corresponding thermodynamic cycle [Eq. (2)].
Thermodynamic cycle analysis emphasizes the gain of function resulting from ABD and CP domain acquisition
The factorial design matrices (Supplementary Tables S1, S2) enable us to attribute a magnitude and statistical significance of free energy contributions to transition-state stabilization for intrinsic effects of each signature sequence and the synergy between them. For this purpose we use a construct known as a thermodynamic cycle, advocated by Jencks (34) and popularized by Horovitz and Fersht (35). As originally formulated, construction of a thermodynamic cycle entails computing differences observed in activation free energies, Δ(ΔG‡), at each corner of the implicit polygon of the factorial design. For a double mutant cycle, that involves a square with WT enzyme, the two single mutants, and the double mutant at the corners of a square (see Supplementary §II and Supplementary Figure S1).
We showed (26) that in practice it is more convenient to use multiple regression methods to estimate the coefficients in Eq. (2). When values of the experimental error, ϵ, are small and the regression model explains a high (≥0.9) portion of the variation in experimental data points, a thermodynamic cycle is equivalent to changing the coordinate system of the experimental free energies in Figure 3. Supplementary §II examines this equivalence in more detail, and Supplementary §III outlines how regression coefficients for main effects and lower-order interactions depend on where along the highest-order interaction they are evaluated.
The regression model for the thermodynamic cycle for ΔG‡kchem from first-order rates, for all three nucleotides either consumed or produced in amino acid activation (Figure 4) reveals several notable details. First, the distribution of Studentized residuals associated with the contributing data points (the prediction error, ϵ, divided by its adjusted standard error) lies between –4 ≤ 4, so there are no outliers. Second, there are five significant predictors listed in the table. Their Student t-test probabilities are all ≤0.02, hence are statistically significant. Third, because neither ATP nor ADP are significant predictors of the first-order rate, there is no significant difference between rates for ATP consumption and ADP formation for the four variants. AMP production, on the other hand, is significantly different, and the AMP*HVGH β-coefficient means that it depends on whether the HxGH sequence is native or mutant. We return to this point after considering mutational effects on tRNALeu acylation.
First-order rate constants from AST, kchem, are independent of enzyme concentrations, and so do not require normalization for active fraction before converting to free energies for multiple regression analysis to construct thermodynamic cycles. Figure 4 highlights the statistical quality of the regression model (Figure 5 omits the corresponding models). Figure 5 displays histograms of the β-coefficients for the acylation reaction in kcal/mol, with error bars denoting the 95% confidence levels. Arrows in Figure 5B form cycles showing that the net free energy change of the successive steps total zero. The difference between left and right vertical arrows must equal that between top and bottom arrows, and is equal to the free energy, Δ(ΔG‡) for the HVGH*KMSKS interaction.
The most striking feature in Figure 4 is that samples at the extremes (the black dots = WT; the blue dots = AVGA) are opposite of what might be expected. The AVGA mutant is, by ∼ 1 kcal/mol, the most active catalyst in leucine activation and the WT variant is least active. Moreover, although the sign of the βKMSKS coefficient is negative, signifying that the wild-type lysine residues favor catalysis, the synergy between it and the HIGH sequence, βHVGH*KMSKS, is positive. Thus, when it comes to stabilizing the transition state for the internally catalyzed reaction, the two signatures actually work against one another. That, and the positive βHIGH value, account for the superiority of the AVGA variant, which is ∼5 times more active that wild-type LeuAC.
Figure 5 compares thermodynamic cycles (Figure 5B) for leucine activation (Figure 5A) and tRNALeu aminoacylation (Figure 5C) by LeuAC and LeuRS. ΔG‡0 is the mean activation free energy for all four variants of the respective catalysts; the remaining bars represent the additional transition-state stabilization (β < 0) or destabilization (β > 0) attributable to wild type HVGH, KMSKS, sequences and their two-way interaction or synergy. β coefficients for the main effects of the two mutations and their interaction tell us much about how the two catalytic signatures impact LeuAC catalysis. Wild-type residues in each location have opposite effects. Wild-type lysine residues in the KMSKS signature enhance catalysis modestly (ΔG‡KMSKS = –0.32 kcal/mole) compared to alanine mutants. Wild-type histidine residues in the HVGH signature actually reduce catalysis by a comparable amount (positive ΔG‡HVGH = 0.45 kcal/mol). The interaction term is also positive (ΔG‡HVGH*KMSKS = 0.36 kcal/mol), so the wild-type lysine residues in the KMSKS signature intensify the negative impact of the native HVGH sequence, and seemingly paradoxically, the presence of wild type KMSKS intensifies catalysis by mutant AVGA sequence.Combinatorial mutagenesis has comparable impacts on leucine activation and tRNALeu aminoacylation by LeuAC.
The KMSKS and HVGH*KMSKS effects behave very differently in full-length LeuRS. The inversion of these effects from left to right represents catalytic consequences of adding the anticodon binding domain and CP (as well as CP2) to the LeuAC urzyme. That consequence is a substantial increase in transition-state stabilization for both reactions in LeuRS by enforcing coupled behavior on the two catalytic signatures, as discussed further in the next section and the DISCUSSION.
Thermodynamic cycle parameters built from ΔG‡kchem values from time courses of the three nucleotides plus acylation illustrate how mutations affect all four rate constants (Figure 6). The main effects of wild-type HVGH and KMSKS signatures and their two way interaction are essentially identical in the LeuAC urzyme (Figure 6A, C). Mutational changes also impact both steps of the reaction by full-length LeuRS consistently (Figure 6B, C). Those for AMP production and tRNALeu aminoacylation by LeuRS are nearly colinear (Figure 6B). However, an important distinction for LeuRS emerges in Figure 6C. Only AMP production by full-length LeuRS has the same sensitivity to mutations as does aminoacylation. In contrast, ATP consumption and ADP production exhibit similar mutational profiles that both respond favorably to the AMSAS mutation (i.e. ΔG‡kchem > 0).
Products of the internal first-order reactions in the first round of catalysis also distribute differently in full-length LeuRS and the LeuAC urzyme. The ratio, M/D, of AMP to ADP produced (Supplementary Figure S4A; R2 = 0.93; P ≤ 0.0001) is inversely proportional to the presence of the native KMSKS sequence. Thus, increased ATP consumption associated with LeuAC variants lacking the wild-type lysine residues is also associated with lower amounts of the expected product, AMP. In contrast, the M/D ratio in full-length LeuRS variants depends fully on both main effects (HVGH and KMSKS) and on their two-way interaction (Supplementary Figure S4B; R2 = 0.99; P < 0.0001), in a manner similar to the dependence of the activation free energies, ΔG‡kchem.
Figure 6C shows that contributions of the HVGH and KMSKS signatures and their two-way interactions to ATP consumption and ADP production by LeuRS are distinctly different from those contributing to AMP production and tRNA aminoacylation. In fact, their mutational profiles are closely anticorrelated to those for AMP production and acylation, as well as those for all four reactions by LeuAC (<R2> = 0.85 ± 0.1 see Supplementary Figure S5). Together with thermodynamic cycles for rates of AMP production by LeuAC and LeuRS (Supplementary Figure S3), these observations suggest an overriding distinction between catalysis of aminoacylation by full-length LeuRS and all 7 remaining variants illustrated in Figure 3B. Deletion of any contributor to the wild-type full-length LeuRS mechanism—either of the various domains missing in the urzyme or of either or both signature sequence—changes the catalytic mechanism in similar underlying manner (see again the distribution in the upper right of Figure 3B).
Nonpolar sidechains V51 and M651 anchor both HVGH and KMSKS signatures securely into the ABD
It is worth noting that although only minimal structural data are available for any aaRS urzymes (36), neither of the hydrophobic residues in the two signature sequences appears from crystal structures of full-length Class I aaRS to be engaged in any significant nonpolar packing interactions within in LeuAC itself (Figure 7B). By contrast, a well-developed packing network of nonpolar amino acids including the two hydrophobic side chains together with residues from the ABD in full-length LeuRS anchors the valine and methionine residues within that domain (Figure 7). Delaunay tessellation and likelihood scoring (37,38) identified a homologous network in TrpRS (39), and was used here to identify residues detailed in Figure 7C. Similar networks can be identified in all full-length Class I aaRS. They appear to be both necessary and sufficient to coordinate the behavior of the two catalytic signatures, as shown schematically in Figure 7D.
DISCUSSION
The role of cooperative interactions between active-site residues in enzyme catalysis is a recurring question that is seldom addressed directly by experimentation (40), despite well-established protocols for thermodynamic cycle analysis of combinatorial mutants (35). The thermodynamic cycle is a revealing linear transformation of the experimental transition state stabilization free energies, ΔG‡, which as noted elsewhere are additive. The alternative coordinate system highlights the magnitude and sign of individual and energetically coupled contributions to catalysis.
Use of 32pα-ATP in active site titrations reveals hidden mechanistic microheterogeneity
Introducing the use of 32Pα-ATP in active site titration assays has expanded our appreciation of the complexity of first-order events in the first round of catalysis. The ability to track all three adenine nucleotides reveals unexpected production of small amounts of ADP even by full length LeuRS and especially in LeuRS active-site variants (Supplementary Table S2). Indeed, as discussed in RESULTS in other contexts, loss either of the additional domains or the intact catalytic apparatus in the active site is sufficient to dramatically alter the flow of chemical free energy during amino acid activation.
The equivalence between ATP consumption and the sum of ADP and AMP burst sizes, combined with the similarity of first-order rates for the three processes by LeuAC implicate similar catalytic machinery in both ADP and AMP production. The KMSKS sequence in full-length Class I aaRS is involved with transient binding of the PPi leaving group of ATP (see, for example, the effects of removing PPi from the free energy surface traversed during catalysis by TrpRS (41)). A more extensive discussion of this question is in Supplementary §IV.
Emergent coupling deepens evidence for the authenticity of urzyme catalysis
We cannot overlook the obvious conclusion that the contrast between the uncoupled behavior of the two catalytic signatures in LeuAC and their strong coupling by the ABD and CP in full length LeuRS reinforces the authenticity of both LeuAC (12) and TrpAC (25) catalysis. The evolution of intramolecular coupling shown here therefore materially strengthens the argument that aaRS urzymes represent valid experimental models for the ancestral assignment catalysts that originally enabled nature to mine the immense functional diversity represented by proteins (7).
The LeuAC studies complement and extend earlier thermodynamic cycle analyses of TyrRS and TrpRS mechanisms
In the present case the coupling between catalytic signatures might, in principle, arise solely from the sequence differences far from the active site on the newly created molecular surface of LeuAC, rather than from the removal of the CP and anticodon-binding domains. We cannot envision any practical way to rule this possibility out, as the native sequences lead to insoluble aggregates of LeuAC. Prior studies comparing TrpAC with TrpRS substantially strengthen confidence in the interpretation outlined here, which implicate domain movement and intramolecular communication in the enhanced function of the full-length enzyme.
Class I aminoacyl-tRNA synthetase enzymes (aaRS) afford several examples of how thermodynamic cycle analysis probes structure-function relationships. First and Fersht showed that the effects of mutating residues in the KMSKS loop were localized quite specifically along the reaction profile to destabilize the ground state pre-transition state and stabilize the transition state for tyrosine activation by TyrRS (20). The KMSKS signature breaks the synergistic binding of amino acid and ATP in the pre-transition state (20–23).
Destabilization by the KMSKS signature of the transition states for both activation and aminoacylation in LeuRS appears to be inconsistent with that observation. A plausible rationale for this apparent discrepancy is that the behavior in TyrRS also depends on interaction between the two signatures. That interpretation is especially appealing because LeuAC exhibits the opposite dependence: the wild-type KMSKS sequence itself destabilizes both transition states (Figure 5). Its catalytic contribution to both reactions becomes favorable only by virtue of the two-way interaction.
A packing motif (42) called the D1 switch (39) links the two β-strands and α-helix of the first crossover connection to the N-terminal β-strand of the second crossover connection of the Rossmann dinucleotide-binding domain, creating a conformational transition state that mediates shear during relative domain motion of the ABD and CP and imposing discrete pre-transition and products conformations in TrpRS (41,43,44) and likely in other Class I aaRS. Combinatorial mutagenesis of four residues from the D1 switch, assayed with both Mg2+ and Mn2+ established a five-dimensional thermodynamic cycle. That cycle revealed that the packing motif, which is ∼20 Å from the active-site metal, is nevertheless coupled by –6.2 kcal/mol to the catalytic function of the metal (26). That contribution nearly equals the full catalytic contribution, –6.4 kcal/mol, of the metal measured by assaying metal-free TrpRS (45). A complementary, modular thermodynamic cycle consisting of TrpRS, its urzyme, TrpAC and TrpAC plus either the CP insert (i.e. the intact catalytic domain) or the ABD showed that both CP and the ABD alone actually reduced the activity of TrpAC in aminoacylation of tRNATrp, but that both together restored full activity, a contribution of ∼5 kcal/mol—nearly the entire contribution of Mg2+ (25). Finally, both the combinatorial mutagenesis (24) and modular cycles (25) also implicate the corresponding energetic coupling in amino acid specificity.
Coupling energies between D1 switch residues and Mg2+ in transition-state stabilization are comparable to those between the TrpRS ABD and CP domains (24–27). The minimum action path connecting successive structures along the TrpRS structural reaction profile implies that repacking D1 switch residues in the conformational transition state is rate-limiting between pre-transition state and products domain configurations (41,43,44). These fragmentary experimental vignettes consistently suggest that a central function of the ABD and CP domains is to impose coordinated motion on components of the active site, such that they all assemble simultaneously into a catalytically active, discriminating configuration (24,25).
Experimental exploration of the time domain
The thermodynamic cycles comparing LeuRS to its putative ancestral urzyme form in Figures 3–5 extend experimentation into the evolutionary time domain (46–53). In this manner we identify quantitatively the explicit gain of function induced in Class I leucyl-tRNA synthetase by the acquisition of the connecting peptide 1 (CP) insertion and anticodon-binding (ABD) domains. The new data reported here reinforce the picture outlined in previous paragraphs. In LeuAC, the absence of the two larger domains leaves the two catalytic signatures uncoordinated, contributing in unexpected ways to transition state stabilization. In full length LeuRS, on the other hand, although neither signature alone can enhance transition-state stabilization, their combination, in the context of the two additional domains, furnishes substantial catalytically productive synergy. That we now have identified much the same phenomena we observed for TrpRS, the smallest Class I aaRS in one of the largest as well as the smallest, Class I aaRS suggests generality in the superfamily. The superfamily-wide conservation of the ‘enforcer’ packing motif illustrated in Figure 7C, in which high SNAPP potential (37,38) hydrophobic clusters anchor the sole hydrophobic side chains of each catalytic signature, coordinating their motion reinforces its generality.
Successively acquired CP and anticodon-binding domains may have had distinct, less obvious selective advantages
We have argued that aaRS of both Classes began as 46-residue polypeptides called ‘protozymes’ that accelerated the rate of amino acid activation ∼106-fold (54). That hypothesis gained support from validation by Tamura's group (55). Thus, it is possible that the HxGH signature has roots in the protozyme. Class I aaRS urzymes, however, are more than twice as big (∼130 residues), and the KMSKS signature must have roots in that subsequent stage. The selective advantage of the KMSKS signature likely contributed to the acquisition of the second half of urzymes, which accelerated the rate of activation by 103-fold over that for the protozyme.
As noted previously (25), neither CP nor the ABD enhanced either specificity or catalytic rate enhancement of the TrpRS urzyme. This prompts the question of how nature selected either domain except in the unlikely case that the urzyme acquired both simultaneously. The AST assays performed to assess the fraction of active catalysts in each sample suggest a possible resolution of that conundrum. The proportion of total ATP usefully converted to AMP production by LeuAC (0.21) is roughly 25% that (0.8; Figure 6D) observed for LeuRS. If the CP domain stabilizes closed, productive forms of the urzyme so that it used ATP more efficiently, that would certainly have given it a selective advantage, even without imposing the coupling between the two signatures. This hypothesis is experimentally testable.
Significant rate acceleration by early polypeptide catalysts did not require sophisticated amino acid side chains
The LeuAC AVGA variant (<ΔG‡> = 1.9 kcal/mol) is quantitatively more active than the same variant of the full-length LeuRS (<ΔG‡> = 2.1 kcal/mol). That brings us to what is perhaps the most consequential implication from the data in Figures 2–4. The overall rate enhancements observed for amino acid activation by aaRS urzymes excerpted from two Class I (TrpRS (6,7,14), LeuRS (12)) and one Class II aaRS (HisRS (10,25)) represent ∼60% of the overall transition-state stabilization free energy observed for the corresponding full length aaRS. How much of that must we attribute to the precise orientation of specific amino acid side chains in the active site?
Our data suggest that the surprising answer to that question is: apparently very little, if any of it. Indeed, the superiority of the AVGA LeuAC mutant means that the two histidine residues in that highly conserved signature actually reduce the rate of aminoacylation unless they are coupled to the KMSKS sequence by the additional domains. Moreover, the AMSAS mutant is almost as active as the WT LeuAC. These observations are consistent with the previous mutational analysis of the TrpAC urzyme, where mutation of an aspartate at the N-terminus of the C-terminal α-helix of the second crossover connection in the Rossmann fold reduces activity by ∼200-fold in TyrRS but increases activity ∼25-fold in the TrpRS urzyme.
We previously noted that secondary structural; i.e. backbone interactions in Class I and II urzymes could account for tRNA (56,57) and amino acid (58) substrate selection (7). Together with our previous evidence that the TrpAC urzyme may be a catalytically active molten globule (36), these results substantiate the possibility that substantial catalytic rate enhancement does not require precise orientation of specialized amino acid side chains. It now seems that primordial protein catalysts achieved high rate enhancements and a modicum of substrate specificity largely via physical properties of the active-site pocket that depend largely on backbone interactions that do not require specific amino acid side chains.
That conclusion lends a strong experimental basis for Wong's coevolution theory for the emergence of the genetic code and its coevolution with metabolic pathways necessary to synthesize amino acids that were not produced in abundance by prebiotic geochemistry (16,17).
Supplementary Material
ACKNOWLEDGEMENTS
P. Wills provided continual constructive discussions. E. First and L. Betts made helpful comments on the manuscript.
Author contributions: J.J.H.E. and Q.C.T. designed the combinatorial LeuAC mutants and performed all assays. Q.C.T. constructed and assayed the combinatorial LeuRS mutants. C.W.C., J.J.H.E., and G.C.T. all performed data analysis. J.D. provided phylogenetic insight for Figure 1s as well as the evolutionary history of the signature sequences. C.W.C. wrote the manuscript, and all authors contributed to and approved the final figures and text.
Notes
Present address: Jessica J. H. Elder, National Institute of Environmental Health Sciences, Research Triangle Park, NC 27709, USA.
Contributor Information
Guo Qing Tang, Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA.
Jessica J H Elder, Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA.
Jordan Douglas, Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA; Department of Physics, The University of Auckland, New Zealand.
Charles W Carter, Jr, Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599-7260, USA.
Data Availability
JMP is a product of the SAS corporation, Research Triangle, NC, USA. All rate data are provided in Supplementary Tables, are accessible from The Protein Databank, or upon request from the author.
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
FUNDING
Alfred P. Sloan Foundation Matter-to-Life program [G-2021-16944]. Funding for open access charge: Alfred P. Sloan Foundation [G-2021-16944].
Conflict of interest statement. None declared.
REFERENCES
- 1. Rubio Gomez M.A., Ibba M.. Aminoacyl-tRNA synthetases. RNA. 2020; 26:910–936. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Terada T., Nureki O., Ishitani R., Ambrogelly A., Ibba M., Söll D., Yokoyama S.. Functional convergence of two lysyl-tRNA synthetases with unrelated topologies. Nat. Str. Biol. 2002; 9:257–262. [DOI] [PubMed] [Google Scholar]
- 3. Burbaum J.J., Schimmel P.. Assembly of a class I tRNA synthetase from products of an artificially split gene. Biochem. 1991; 30:319–324. [DOI] [PubMed] [Google Scholar]
- 4. Burbaum J.J., Starzyk R.M., Schimmel P.. Understanding structural relationships in proteins of unsolved three-dimensional structure. PROTEINS: Struct. Funct. Gen. 1990; 7:99–111. [DOI] [PubMed] [Google Scholar]
- 5. Eriani G., Delarue M., Poch O., Gangloff J., Moras D.. Partition of tRNA synthetases into two classes based on mutually exclusive sets of sequence motifs. Nature. 1990; 347:203–206. [DOI] [PubMed] [Google Scholar]
- 6. Pham Y., Kuhlman B., Butterfoss G.L., Hu H., Weinreb V., Carter C.W. Jr. Tryptophanyl-tRNA synthetase urzyme: a model to recapitulate molecular evolution and investigate intramolecular complementation. J. Biol. Chem. 2010; 285:38590–38601. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Carter C.W. Jr, Wills P.R. The roots of genetic coding in aminoacyl-tRNA synthetase duality. Annu. Rev. Biochem. 2021; 90:349–373. [DOI] [PubMed] [Google Scholar]
- 8. Carter C.W. Jr Coding of class I and II aminoacyl-tRNA synthetases. Adv. Exp. Med. Biol. 2017; 18:103–148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Carter C.W. Jr Urzymology: experimental access to a key transition in the appearance of enzymes. J. Biol. Chem. 2014; 289:30213–30220. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Li L., Weinreb V., Francklyn C., Carter C.W. Jr. Histidyl-tRNA synthetase urzymes: class I and II aminoacyl-tRNA synthetase urzymes have comparable catalytic activities for cognate amino acid activation. J. Biol. Chem. 2011; 286:10387–10395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Fukunaga R., Yokoyama S.. Crystal structure of leucyl-tRNA synthetase from the archaeon pyrococcus horikoshii reveals a novel editing domain orientation. J. Mol. Biol. 2005; 346:57–71. [DOI] [PubMed] [Google Scholar]
- 12. Hobson J.J., Li Z., Carter C.W. Jr. A leucyl-tRNA synthetase urzyme: authenticity of tRNA synthetase urzyme catalytic activities and production of a non-canonical product. Int. J. Mol. Sci. 2022; 23:4229. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Cusack S., Yaremchuk A., Tukalo M.. The 2 Å crystal structure of leucyl-tRNA synthetase and its complex with a leucyl-adenylate analogue. EMBO J. 2000; 19:2351–2361. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Pham Y., Li L., Kim A., Erdogan O., Weinreb V., Butterfoss G., Kuhlman B., Carter C.W. Jr. A minimal TrpRS catalytic domain supports sense/antisense ancestry of class I and II aminoacyl-tRNA synthetases. Mol. Cell. 2007; 25:851–862. [DOI] [PubMed] [Google Scholar]
- 15. Zhou X.L., Zhu B., Wang E.D.. The cp2 domain of leucyl-trna synthetase is crucial for amino acid activation and post-transfer editing. J. Biol. Chem. 2008; 283:36608–36616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Wong J.T.-F., Ng S.-K., Mat W.-K., Hu T., Xue H.. Coevolution theory of the genetic code at age forty: pathway to translation and synthetic life. Life. 2016; 6:12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Wong J.T.-F. Coevolution theory of the genetic code at age thirty. Bioessays. 2005; 27:416–425. [DOI] [PubMed] [Google Scholar]
- 18. Bada J.L. New insights into prebiotic chemistry from Stanley Miller's spark discharge experiments. Chem. Soc. Rev. 2013; 42:2186–2196. [DOI] [PubMed] [Google Scholar]
- 19. Miller S.L. A production of amino acids under possible primitive earth conditions. Science. 1953; 117:528–529. [DOI] [PubMed] [Google Scholar]
- 20. First E.A., Fersht A.R.. Analysis of the role of the KMSKS loop in the catalytic mechanism of the tyrosyl-tRNA synthetase using multimutant cycles. Biochem. 1995; 34:5030–5043. [DOI] [PubMed] [Google Scholar]
- 21. First E.A., Fersht A.R.. Involvement of threonine 234 in catalysis of tyrosyl adenylate formation by tyrosyl-tRNA synthetase. Biochem. 1993; 32:13644–13650. [DOI] [PubMed] [Google Scholar]
- 22. First E.A., Fersht A.R.. Mutation of lysine 233 to alanine introduces positive cooperativity into tyrosyl-tRNA synthetase. Biochem. 1993; 32:13651–13657. [DOI] [PubMed] [Google Scholar]
- 23. First E.A., Fersht A.R.. Mutational and kinetic analysis of a mobile loop in tyrosyl-tRNA synthetase. Biochem. 1993; 32:13658–13663. [DOI] [PubMed] [Google Scholar]
- 24. Weinreb V., Li L., Chandrasekaran S.N., Koehl P., Delarue M., Carter C.W. Jr. Enhanced amino acid selection in fully-evolved tryptophanyl-tRNA synthetase, relative to its urzyme, requires domain movement sensed by the D1 switch, a remote, dynamic packing motif. J. Biol. Chem. 2014; 289:4367–4376. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Li L., Carter C.W. Jr. Full implementation of the genetic code by tryptophanyl-tRNA synthetase requires intermodular coupling. J. Biol. Chem. 2013; 288:34736–34745. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Weinreb V., Li L., Carter C.W. Jr. A master switch couples Mg2+-assisted catalysis to domain motion in B. stearothermophilus tryptophanyl-tRNA synthetase. Structure. 2012; 20:128–138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Budiman M., Knaggs M.H., Fetrow J.S., Alexander R.W.. Using molecular dynamics to map interaction networks in an aminoacyl-tRNA synthetase. Proteins Struct. Funct. Bioinform. 2007; 68:670–689. [DOI] [PubMed] [Google Scholar]
- 28. Parks T.D., Leuther K.K., Howard E.D., Johnston S.A., Dougherty W.. Release of proteins and peptides from fusion proteins using a recombinant plant virus proteinase. Anal. Biochem. 1994; 216:413–417. [DOI] [PubMed] [Google Scholar]
- 29. Fersht A.R., Ashford J.S., Bruton C.J., Jakes R., Koch G.L.E., Hartley B.S.. Active site titration and aminoacyl adenylate binding stoichiometry of amionacyl-tRNA synthetases. Biochem. 1975; 14:1–4. [DOI] [PubMed] [Google Scholar]
- 30. Francklyn C.S., First E.A., Perona J.J., Hou Y.-M.. Methods for kinetic and thermodynamic analysis of aminoacyl-tRNA synthetases. Methods. 2008; 44:100–118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Schneider C.A., Rasband W.S., Eliceiri K.W.. NIH image to ImageJ: 25 years of image analysis. Nat. Methods. 2012; 9:671–675. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Box G.E.P., Hunter W.G., Hunter J.S.. Statistics for Experimenters. 1978; NY: Wiley Interscience. [Google Scholar]
- 33. Li L., Francklyn C., Carter C.W. Jr. Aminoacylating urzymes challenge the RNA world hypothesis. J. Biol. Chem. 2013; 288:26856–26863. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Jencks W.P. Catalysis in Chemistry and Enzymology. 1987; NY: Dover Publications. [Google Scholar]
- 35. Horovitz A., Fersht A.R.. Strategy for analysing the co-operativity of intramolecular interactions in peptides and proteins. J. Mol. Biol. 1990; 214:613–617. [DOI] [PubMed] [Google Scholar]
- 36. Sapienza P.J., Li L., Williams T., Lee A.L., Carter C.W. Jr. An ancestral tryptophanyl-tRNA synthetase precursor achieves high catalytic rate enhancement without ordered ground-State tertiary structures. ACS Chem. Biol. 2016; 11:1661–1668. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Tropsha A., Carter C.W. Jr, Cammer S., Vaisman I.I. Simplicial neighborhood analysis of protein packing (SNAPP): a computational geometry approach to studying proteins. Meth. Enz. 2003; 374:509–544. [DOI] [PubMed] [Google Scholar]
- 38. Carter C.W. Jr, LeFebvre B.C., Cammer S.A., Tropsha A., Edgell M.H. Four-body potentials reveal protein-specific correlations to stability changes caused by hydrophobic core mutations. J. Mol. Biol. 2001; 311:625–638. [DOI] [PubMed] [Google Scholar]
- 39. Kapustina M., Weinreb V., Li L., Kuhlman B., Carter C.W. Jr. A conformational transition state accompanies tryptophan activation by B.stearothermphilus tryptophanyl-tRNA synthetase. Structure. 2007; 15:1272–1284. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Sunden F., Peck A., Salzman J., Ressl S., Herschlag D.. Extensive site-directed mutagenesis reveals interconnected functional units in the alkaline phosphatase active site. Elife. 2015; 4:e06181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Chandrasekaran S.N., Das J., Dokholyan N.V., Carter C.W. Jr. A modified PATH algorithm rapidly generates transition states comparable to those found by other well established algorithms. Struct. Dyn. 2016; 3:012101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Cammer S., Carter C.W. Jr. Six rossmannoid folds, including the class I aminoacyl-tRNA synthetases, share a partial core with the anticodon-binding domain of a class II aminoacyl-tRNA synthetase. Bioinformatics. 2010; 26:709–714. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Carter C.W. Jr, Chandrasekaran S.N., Weinreb V., Li L., Williams T.. Combining multi-mutant and modular thermodynamic cycles to measure energetic coupling networks in enzyme catalysis. Struct. Dyn. 2017; 4:032101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Chandrasekaran S.N., Carter C.W. Jr. Adding torsional interaction terms to the Anisotropic Network Model improves the PATH performance, enabling detailed comparison with experimental rate data. Struct. Dyn. 2017; 4:032103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Weinreb V., Carter C.W. Jr. Mg2+-free B.stearothermophilus tryptophanyl-tRNA synthetase activates tryptophan with a major fraction of the overall rate enhancement. J. Am. Chem. Soc. 2008; 130:1488–1494. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Benner S.A., Sassi S.O., Gaucher E.A.. Molecular Paleoscience: systems biology from the past. Adv. Enzymol. Relat. Areas Mol. Biol. 2007; 75:9–140. [DOI] [PubMed] [Google Scholar]
- 47. Benner S.A. Enzyme kinetics and molecular evolution. Chem. Rev. 1989; 89:789–806. [Google Scholar]
- 48. Liberles D.A., Teichmann S.A., Bahar I., Bastolla U., Bloom J., Bornberg-Bauer E., Colwell L.J., Koning A.P.J.d., Dokholyan N.V., Echave J.et al.. The interface of protein structure, protein biophysics, and molecular evolution. Prot. Sci. 2012; 21:769–785. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Bridgham J.T., Ortlund E.A., Thornton J.W.. An epistatic ratchet constrains the direction of glucocorticoid receptor evolution. Nature. 2009; 461:515–519. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Ortlund E.A., Bridgham J.T., Redinbo M.R., Thornton J.W.. Crystal structure of an ancient protein: evolution by conformational epistasis. Science. 2007; 317:1544–1548. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Bridgham J.T., Carroll S.M., Thornton J.W.. Evolution of hormone-receptor complexity by molecular exploitation. Science. 2006; 312:97–101. [DOI] [PubMed] [Google Scholar]
- 52. Thornton J.W. Resurrecting ancient genes: experimental analysis of extinct molecules. Nat. Rev. Genet. 2004; 5:366–375. [DOI] [PubMed] [Google Scholar]
- 53. Thornton J.W., E. N., D. C.. Resurrecting the ancestral steroid receptor: ancient origin of estrogen signaling. Science. 2003; 301:714–1717. [DOI] [PubMed] [Google Scholar]
- 54. Martinez-Rodriguez L., Jimenez-Rodriguez M., Gonzalez-Rivera K., Williams T., Li L., Weinreb V., Chandrasekaran S.N., Collier M., Ambroggio X., Kuhlman B.et al.. Functional class I and II amino acid activating enzymes can Be coded by opposite strands of the same gene. J. Biol. Chem. 2015; 290:19710–19725. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Onodera K., Suganuma N., Takano H., Sugita Y., Shoji T., Minobe A., Yamaki N., Otsuka R., Mutsuro-Aoki H., Umehara T.et al.. Amino acid activation analysis of primitive aminoacyl-tRNA synthetases encoded by both strands of a single gene using the malachite green assay. BioSystems. 2021; 208:104481. [DOI] [PubMed] [Google Scholar]
- 56. Carter C.W. Jr, Wills P.R.. Class I and II aminoacyl-tRNA synthetase tRNA groove discrimination created the first synthetase•tRNA cognate pairs and was therefore essential to the origin of genetic coding. IUBMB Life. 2019; 71:1088–1098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Carter C.W. Jr, Wills P.R.. Hierarchical groove discrimination by class I and II aminoacyl-tRNA synthetases reveals a palimpsest of the operational RNA code in the tRNA acceptor-stem bases. Nucleic Acids Res. 2018; 46:9667–9683. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58. Carter C.W. Jr, Wills P.R.. Experimental solutions to problems defining the origin of codon-directed protein synthesis. Biosystems. 2019; 183:103979. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Carter C.W. Jr, Li L., Weinreb V., Collier M., Gonzales-Rivera K., Jimenez-Rodriguez M., Erdogan O., Chandrasekharan S.N.. The Rodin-Ohno hypothesis that two enzyme superfamilies descended from one ancestral gene: an unlikely scenario for the origins of translation that will not Be dismissed. Biol. Direct. 2014; 9:11. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
JMP is a product of the SAS corporation, Research Triangle, NC, USA. All rate data are provided in Supplementary Tables, are accessible from The Protein Databank, or upon request from the author.