Significance
Biological processes are unique in showing a remarkable level of accuracy in discriminating between similar molecules. This characteristic is attributed to an error-correcting mechanism known as kinetic proofreading. It is widely believed that the enhancement of the accuracy in biological processes always slows them down. By analyzing the fundamental processes of DNA replication and protein translation, we established that these systems maximize speed rather than accuracy with additional energetic constraints. Our theoretical study further indicates that both speed and accuracy can be enhanced in certain parameter regimes. The resulting findings provide a microscopic picture of how complex biological processes can be accomplished so quickly with minimal errors.
Keywords: kinetic proofreading mechanisms, stochastic models, first-passage processes
Abstract
One of the most fascinating features of biological systems is the ability to sustain high accuracy of all major cellular processes despite the stochastic nature of underlying chemical processes. It is widely believed that such low error values are the result of the error-correcting mechanism known as kinetic proofreading. However, it is usually argued that enhancing the accuracy should result in slowing down the process, leading to the so-called speed–accuracy trade-off. We developed a discrete-state stochastic framework that allowed us to investigate the mechanisms of the proofreading using the method of first-passage processes. With this framework, we simultaneously analyzed the speed and accuracy of the two fundamental biological processes, DNA replication and tRNA selection during the translation. The results indicate that these systems tend to optimize speed rather than accuracy, as long as the error level is tolerable. Interestingly, for these processes, certain kinetic parameters lay in the suboptimal region where their perturbations can improve both speed and accuracy. Additional constraints due to the energetic cost of proofreading also play a role in the error correcting process. Our theoretical findings provide a microscopic picture of how complex biological processes are able to function so fast with high accuracy.
Biological systems exhibit remarkable accuracy in selecting the right substrate from the pool of chemically similar molecules. This property is common to all fundamental biological processes such as DNA replication, RNA transcription, and protein translation (1). The level of fidelity in various stages of genetic information flow depends on their relative importance in sustaining system stability. DNA replication is thought to be the most accurate process, with an error rate (2, 3); i.e., only out of incorporated nucleotides is mismatched. RNA transcription () and protein translation () processes are also quite accurate, but to a somewhat lower degree (4, 5). Failure to maintain such accuracy adversely affects cell viability and survival. For example, mutations affecting the fidelity in translation increase the amount of unfolded proteins leading to apoptosis (6) and to erroneous replication of genetic material (7).
Initially, it was unclear how the small differences in equilibrium binding stability of structurally similar substrates can allow such a high degree of discrimination (8). Then, an explanation was provided independently by Hopfield (9) and Ninio (10), who proposed an error-correction mechanism called kinetic proofreading (KPR). KPR allows enzymes to use the free energy difference between right and wrong substrates multiple times using additional steps (9); this amplifies the small energetic discrimination and results in a lower error compared with that in chemical equilibrium. However, such processes require significant energy consumption (9). To this end, enzymes use some energy-rich molecules, like ATP, to provide for the necessary driving (11, 12). The mechanism was experimentally verified later in different biological systems (13–17). Several recent studies generalized it to more complex networks and found analogies between proofreading and other phenomena such as microtubule growth (18) or bacterial chemotaxis (19). These results broaden the concept of KPR and show that such chemically driven regulatory mechanisms are widely present.
Cells must process genetic information not just accurately but also sufficiently rapidly. Proofreading enhances the accuracy by resetting the system to its initial configuration without progressing to product state (9). The completion time of the reaction is, thus, expected to increase. Hence, there could be a compromise, or trade-off, between accuracy and speed of the process (20). The understanding on this trade-off is mainly based on the Michaelis–Menten (MM) description of specificity (21, 22). These studies indicate that the minimum possible error is achieved at vanishingly low catalytic rate, i.e., when the process is the slowest (9, 21). In contrast, biological polymerization reactions must occur reasonably fast (15, 23). A recent study demonstrated a new speed–accuracy regime in the KPR model by modifying the catalytic rate (18). In this regime, a large gain in speed comes with a relatively small loss in accuracy. The authors suggested that biological systems may use this regime (18). For example, in the tRNA selection process, a fast GTP hydrolysis step speeds up protein synthesis but prevents maximal possible selectivity of the initial tRNA–ribosome binding step (21, 24).
Despite the number of studies, a clear quantitative picture of how the balance between speed and accuracy is tuned is lacking. Several current models of proofreading still mainly focus on the initial stages of substrate selection (22, 25, 26) or assume disparity of rate constants of only a few types of steps (18, 19). In contrast, experimental data show that biological systems have different rates for the right and wrong substrates for each step of the network (4, 14, 15). It is not clear how such distributed discrimination of the reaction rates affects the trade-off. Moreover, proofreading steps come with an extra energy cost to gain higher accuracy (11), but the role of this cost in the trade-off is not apparent. Therefore, to understand the fundamental mechanisms of proofreading in real biological systems, one needs to answer the following questions: (i) How does the system set its priorities when choosing between accuracy and speed, two seemingly opposite objectives? (ii) Can speed and accuracy change in the same direction; in other words, can perturbations of a kinetic parameter from its naturally selected value improve both speed and accuracy? (iii) How does the extra energy expenditure due to KPR affect the speed–accuracy optimization?
Here we focus on the role of reaction kinetics in governing the speed–accuracy trade-off. To this end, we develop a generalized framework to study one-loop KPR networks, assuming distinct rate constants for every step of the right (R) and wrong (W) pathways. Based on this approach, we model the overall selection of the correct substrate over the incorrect one as a first-passage problem, to obtain a full dynamic description of the process (27, 28). This general framework is applied to two important examples, namely, DNA replication by T7 DNA polymerase (DNAP) (3, 14) and protein synthesis by Escherichia coli ribosome (22, 29) (Fig. 1 A and B)). Starting from the experimentally measured rate constants for each system, we vary their values to analyze the resulting changes in speed and accuracy and to assess the trade-off. The role played by the extra energy consumption or cost of proofreading (11) is also investigated. By comparing the behavior of the two systems, we search for general properties of biological error correction.
Methods
Proofreading Networks of Replication and Translation.
DNA replication as well as protein synthesis use nucleotide complementarity to select the cognate substrate over other near/noncognate substrates. During replication, dNTP molecules complementary to the DNA template are chosen. Similarly, during protein synthesis, aminoacyl(aa)-tRNAs are picked by ribosome based on the complementarity of their anticodon to the mRNA codon. Wrong substrates that bind initially can be removed by error-correction proofreading mechanisms. Kinetic experiments coupled with modeling revealed a lot of mechanistic details about both of the processes (3, 14, 15, 24). The schemes depicted in Fig. 1 represent the key steps to understand the KPR in these networks.
The schemes in Fig. 1 A and C are for DNA replication (3, 14). E denotes the T7 DNAP enzyme in complex with a DNA primer template. The R and W substrates are correct and incorrect base-paired dNTP molecules, respectively. Step 1 generates enzyme–DNA complexes ER(or EW) with the primer elongated by one nucleotide. Addition of another correct nucleotide to ER (EW) gives rise to (). and complexes denote the primer shifted to the exonuclease site (Exo) from the polymerase site (Pol) of DNAP. This commences proofreading in step 2. Excision of the nucleotide in step 3 resets the system to its initial state.
The schemes in Fig. 1 B and D show the aa-tRNA selection process by ribosome during translation (29). Here, E denotes the E. coli ribosome with mRNA. Cognate (near-cognate) aa-tRNAs in ternary complex with elongation factor Tu (EFTu) and GTP bind with ribosome in step 1 to form . GTP hydrolysis in step 2 results in the complex . The latter can take one of two routes. It can progress to the product with the elongation of the peptide chain by one amino acid. Alternatively, it can dissociate in the proofreading step (step 3), rejecting the aa-tRNA.
In both schemes, we take the rate constants of the W cycle to be related to those of the R cycle through and similarly, for the catalytic step, . The set of rate constants are effectively first order containing the substrate concentrations. The factors provide the energetic discrimination between the R and W pathways. Completion of one cycle (returning to the starting state E) effectively amounts to hydrolysis of one dNTP molecule for DNA replication and one GTP molecule for aa-tRNA selection. The chemical potential difference, (in units of ) is equal for both the cycles (19).
[1] |
This leads to the condition
[2] |
Accuracy and Speed from First-Passage Description.
We determine the error and speed of the substrate selection kinetics from the first-passage probability density (27, 28). With this method, we can analyze an arbitrary catalytic reaction scheme and focus on the transitions starting from the initial state that lead to the final state . The description allows us to get analytical expression for both speed and accuracy for an arbitrary set of kinetic parameters. Therefore, we allow different rates for the R and W substrates for each step of the network as experimentally observed. Furthermore, we do not assume any step to be completely irreversible. Otherwise, the chemical potential difference over the cycle would diverge (Eq. 1). This difference is linked to the hydrolysis of some energy-rich molecules supplying large but finite free energy.
Let us denote as the probability density to reach state at time for the first time before reaching state if the system is in state E at time . The corresponding probability density is specified in the same manner. The evolution equations of are known as the backward master equation (27, 30). It is more convenient to solve them in Laplace space (Supporting Information). We define the error, as the ratio of the probabilities to reach the end states [also called the splitting probability (27)] given by
[3] |
It is important to note that this definition is equivalent to the traditional one (9, 21) defined as the ratio of the wrong product formation rate to the right one (see Supporting Information).
The speed of a reaction is naturally quantified by the net rate of the product formation. As with any chemical reaction rate, it can be defined as the inverse of the mean first-passage time (MFPT), i.e., the mean time it takes to cross the energy barrier that separates reactants and products for the first time. For example, a well-known application of this approach for single-molecule MM kinetics results in the traditional expression for the rate as the inverse of the MFPT (31). We note that the speed toward the correct product can nevertheless be affected by the presence of the incorrect substrate. Thus, it is important to consider them together in contrast to the prevalent measure of the speed in literature neglecting the presence of the W pathway (21). In our case, the expression of the MFPT to reach each product state is given by the first moment of the corresponding probability density (27).
[4] |
In the Results and Discussion, we focus on as the measure of speed and denote it simply by .
Results
Although our formalism can be applied to an arbitrary KPR scheme, we’ve chosen to study two fundamentally important biologically processes: DNA replication and translation. These processes are best characterized in terms of underlying kinetic parameters, and we can study the speed–accuracy trade-off in the biologically relevant parameter region. Notably, despite differences in parameters and KPR mechanisms for the two case studies, we reach similar conclusions for both.
Importance of Speed over Accuracy in DNA Replication by T7 DNAP.
The T7 DNAP enzyme catalyzes the polymerization of a DNA primer over a template strand (14). Wrongly incorporated dNTP is removed by the proofreading mechanism that involves the exonuclease site of DNAP (23). The model parameters of the corresponding reaction network (Fig. 1A) are listed in Table S1. They are based on the experimental data of Wong et al. (23). We do not consider dissociation of the DNA from the enzyme in our model. This approach is justified due to the faster polymerization rates in the R path and the higher exonucleolytic sliding rate in the W path (Table S1).
Table S1.
Parameter | Value |
250 s−1 | |
1 s−1 | |
0.2 s−1 | |
700 s−1 | |
900 s−1 | |
250 s−1 | |
1 | |
1 | |
The error, , varies among three limits as a function of the polymerization rate constant, (23) with fixed . All of them are lower bounds obtained in the limit (), (), and an intermediate case with (). Explicit expressions follow from the general one for (see Supporting Information). Here, we give suitable ratios of these limits to understand the error variation pattern
[5] |
Here, . From the experimental parameter values (Table S1) and Eq. 5, we expect . In other words, the system has a minimum error at some intermediate polymerization rate. This is indeed the case as shown in Fig. 2A. On the other hand, the MFPT, , decreases, and hence the speed increases monotonously with increase in (Fig. 2B). The range of also spans several orders of magnitude. The – curve is shown in Fig. 2C. Negative slope of this curve indicates speed–accuracy trade-off, i.e., higher accuracy (lower ) corresponds to lower speed (higher ) and vice versa. It is evident from Fig. 2C that there is a trade-off only when the polymerization rate constant becomes greater than the value corresponding to the minimum error. We call this branch with negative slope the trade-off branch. For lower values of , error and MFPT change in the same direction. This branch with positive slope of the – curve is denoted as the non-trade-off branch. Intuitively, the lack of trade-off for low polymerization rates arises due to different magnitudes of these rates between the R and W pathways; the latter has much smaller rates. When the polymerization rate is sufficiently smaller than the Pol–Exo sliding rate, correct substrate incorporation must undergo lots of unnecessary proofreading cycles. These futile cycles adversely affect the R pathway more, thereby compromising both speed and accuracy. Thus, at low polymerization rates, there is suboptimal regime allowing for improvement in both speed and accuracy.
The actual system (green circle) is situated on the trade-off branch of the – curve in Fig. 2C. It lies far away from the minimum error point (red triangle). In particular, the minimum error is -fold lower than that of the actual system. However, to achieve this minimum error, the system’s speed would drop by -fold. Thus, the polymerization rate constant is selected to achieve high-enough speed. Significant amount of accuracy is lost in the process. The system can further lower the MFPT by moving down the slope of the – curve. However, that means giving up more accuracy. Thus, of course, there is also a tolerable upper level of .
The Pol–Exo sliding is an important step in error correction. The – curve for this step is plotted in Fig. 2D. The minimum error value is approached in an asymptotic fashion at very large . In contrast, the global minimum of MFPT is obtained in the limit. The MFPT also has a local minimum (yellow square) and a local maximum at finite . Interestingly, the actual system lies pretty close to this (local) minimum. In particular, the system’s value is almost identical to the minimum (within a less than 0.01%). On the other hand, the corresponding error is -fold higher than that corresponding to the minimum MFPT. The speed–accuracy trade-off appears after crosses the value corresponding to the local minimum in (and also before the local maximum). Thus the system is positioned on the non-trade-off branch of the – curve. As one moves in either direction from the minimum point, error can change greatly with slight alteration in until is too low. Therefore, speed appears to be more important as long as the system remains reasonably accurate. However, the system can gain lower errors at similar speeds by moving left to the trade-off branch of the – curve. Then, what is the reason for not taking that route? We note that the proofreading pathway resets the system to the starting condition without progressing to product formation. Therefore, speedup in proofreading rate can increase the associated extra energy cost (11); this may restrict the system to go to the more advantageous regime that has greater KPR rate and so, somewhat larger cost. We will further elaborate on this point in our next case study.
tRNA Selection by E. coli Ribosome Is Optimized for Speed Rather than Accuracy with a Cost Constraint.
During translation, the ribosome decodes the mRNA sequences by selecting aa-tRNAs in ternary complex with EFTu and GTP (4, 15). Noncognate aa-tRNAs are removed by proofreading dissociation of the complex from the ribosome A site after GTP hydrolysis (24, 29). The model parameters of the network (Fig. 1B) for WT E. coli ribosome are listed in Table S2. They are based on the experimental data of Zaher and Green (29). We chose to ensure that both step 2 and step 3 are nearly irreversible (29). There remains one free parameter (as gets fixed from Eq. 2). We assumed , but our main conclusions are independent of this choice (Fig. S1).
Table S2.
Parameter | Value |
40 s−1 | |
0.5 s−1 | |
25 s−1 | |
s−1 | |
8.415 s−1 | |
0.675 | |
94 | |
7.9 | |
We show the – curves for GTP hydrolysis and ternary complex binding steps in Fig. 3 A and B, respectively, for three varieties of E. coli ribosome. One is the WT, and the other two are mutants. One mutant, rpsL141, is hyperaccurate (HYP), and the other mutant, rpsD12, is more error-prone (ERR) than WT (29). Variation of the hydrolysis rate constant, (keeping fixed) results in quite large changes in error and MFPT. The trends are similar for all three systems (see Tables S3 and S4 for parameter sets of mutants). As for the polymerization steps in DNA replication, error varies among three bounds for the GTP hydrolysis step. They are obtained from the general expression of (see Supporting Information) for low, intermediate, and high values of ,
[6] |
where . For the parameter set of the system (Table S2, ), one gets . Thus, the error vs. hydrolysis rate curve passes through a minimum. Interestingly, there is also a minimum in as shown in Fig. 3A. The two minima are at different values, however. The speed–accuracy trade-off occurs between the minimum (red triangle, WT) and the minimum (yellow square, WT) points. As is evident from Fig. 3A, all of the systems are positioned close to the minimum and far away from minimum . For example, the WT ribosome would become -fold slower to achieve the minimum error, although the latter is -fold lower than the actual value. Hence, speed is preferred to accuracy. We tested the generality of this claim against multiple parameter variations (Fig. S2), and it appears that speed is indeed more important. The robustness of this result is also tested successfully against fluctuations of the rate constants (Fig. S3). Interestingly, the WT ribosome is faster and, hence, better optimized for speed than both the mutants. It is important to note that the more accurate mutant HYP was not chosen by the natural selection. This point further emphasizes the importance of speed over accuracy in translation.
Table S3.
Parameter | Value |
27 s−1 | |
0.41 s−1 | |
14 s−1 | |
s−1 | |
4.752 s−1 | |
0.926 | |
112.2 | |
10.34 | |
Table S4.
Parameter | Value |
37 s−1 | |
0.43 s−1 | |
31 s−1 | |
s−1 | |
7.623 s−1 | |
0.973 | |
9.3 | |
0.126 | |
7.65 | |
The change in the ternary complex binding rate constant, (with fixed) also affects both the error and the MFPT significantly, but on a smaller scale than hydrolysis (Fig. 3B). There is always a trade-off between speed and accuracy, unlike the cases studied so far. The minimum in error is obtained in the limit, whereas the maximum speed is achieved for very large . With increase in , falls several orders in magnitude. Interestingly, all of the systems have values almost identical to their respective saturation limits. To attain that state, they sacrifice an order of magnitude in terms of accuracy. Therefore, regarding speed–accuracy trade-off, the system is inclined to be faster with higher but tolerable error.
Next, we explore the effects of variation of the proofreading step rate constant, (keeping fixed) on system performance. The resulting – diagram is plotted in Fig. 4A for the WT ribosome. The global minimum of is obtained in the limit, whereas the minimum of lies in the large limit. There is a local minimum (yellow square) of at some intermediate along with a local maximum. Mutant ribosomes have similar trends (not shown in the figure). The nature of the – curve is qualitatively similar to that obtained for the proofreading Pol–Exo sliding step in DNA replication (Fig. 2D) with two speed–accuracy trade-off branches. The actual system (green circle) is located on the non-trade-off branch that links the two trade-off branches of the – curve. It has an MFPT close to (-fold higher than) the (local) minimum value. However, the minimum point also has an approximately fivefold lower . More important is the fact that the system can attain much lower errors with similar, even slightly higher, speeds if it moves left up to a certain level (the dashed line in Fig. 4A). What prevents the system from gaining in both speed and accuracy?
Because correction by proofreading resets the system without a product formation, it has a cost associated with futile cycles where the correct substrate was inserted and then removed. The cost of proofreading, , is defined as the ratio of the resetting flux to the product formation flux including both R and W pathways (11) (see Supporting Information); this gives a measure of the amount of extra energy-rich molecules consumed due to the presence of the proofreading step. Specifically, the cost C can quantify the moles of dNTP (or GTP) used for proofreading per mole of product (11). This quantity can be easily computed from our formalism and investigated as a function of kinetic parameters. In particular, we quantify how the cost of proofreading changes with the increase in near local minimum of the – curve in Fig. 4A. The results, shown in Fig. 4B, demonstrate that the cost associated with the (local) minimum point is approximately threefold higher than that of the actual system. This cost disadvantage (threefold higher GTP consumption per amino acid!) may restrict the system from gaining the available advantage in both speed and accuracy. A similar consideration may also be responsible for the nature of trade-off exhibited in the DNA replication case (Fig. 2D).
Discussion
Evolution has optimized the kinetic parameters of biological enzymes to achieve the desired levels of accuracy and speed at various stages of biological information flow. In this study, by examining how the balance between speed and accuracy changes with variation of the underlying kinetic parameters, we gain insights into the important priorities for this optimization. To this end, we focus on two fundamental examples of biological proofreading networks: DNA replication and protein translation. In both cases, the systems tend to achieve maximum speed by losing significant accuracy. However, the speed–accuracy trade-off only occurs in the limited region of the parameter space, e.g., after the polymerization rate in replication passes the minimum error point. In the case of translation, the trade-off appears between the minima in error and the MFPT for the GTP hydrolysis step. A similar conclusion about the importance of speed over accuracy is reached by varying the rates of the proofreading steps in both systems. Although higher proofreading rates can further improve the accuracy without losing much speed, the associated energy cost of proofreading may restrict further improvements on an already acceptable speed and accuracy.
An important insight from the above analyses is that the speed–accuracy trade-off is not universally present, and its occurrence depends on the specific values of kinetic rates. Biologically, this result implies that mutations or application of drugs that reduce the enzyme’s accuracy do not necessarily increase its speed, and vice versa. The widespread view of a compromise between accuracy and speed is mainly based on their dependence on the effective catalytic rate of the process (9, 21). Indeed, the larger the catalytic rate, the higher the speed and the lower the accuracy. However, the role of other steps, like hydrolysis and proofreading, are not as straightforward. Our study reveals that, for these steps, trade-offs are present only over a certain range of rates, and both accuracy and speed can improve with variation of certain kinetic parameters. The partitioning of the error–time curves into trade-off and non-trade-off branches clarifies the distinct roles of various transitions and the molecular mechanisms of the speed–accuracy optimization. Our conclusions are also supported by a more advanced analysis of the maximum speed vs. accuracy curves using Pareto fronts, as explained in detail in Supporting Information.
The analysis of speed–accuracy trade-offs for different mutant varieties of E. coli ribosome further confirms the importance of speed over accuracy. The WT and two mutants (HYP and ERR) lie close to the minimum MFPT point on the error–time curves (Fig. 3A). However, the WT and HYP ribosomes are on the trade-off branch, whereas the ERR mutant is on the non-trade-off branch. Thus, movement down the slope toward the trade-off branch would raise both accuracy and speed for the ERR ribosome. That is how the WT ribosome may have evolved from the more erroneous ERR type. However, any further movement upward along the trade-off branch means a slowdown with a lower error; this leads to the more accurate (HYP) mutant. Rejection of the latter as the natural choice implies that optimization of speed is critical. We note that comparison of E. coli growth rates with WT and mutant ribosomes already indicates such an optimization (21, 32). However, according to the prevailing notion on the ever-present compromise between error and speed, the more erroneous (ERR) ribosome should be faster. Hence, the hindered growth for ERR mutant was ascribed presumably to less-active proteins (33). Our results indicate that not only the accuracy but also the speed of peptide chain elongation can be smaller for the ERR mutant.
Despite different schemes and parameter values of the replication and translation networks, there appears to be a general mechanism of error correction; this becomes apparent from the trade-off diagrams for the proofreading step. A rate constant of the proofreading step in both the cases is selected such that speed of the system is close to the maximum possible one. The actual systems reside on the non-trade-off branch of their respective error–time curves. Biologically, that implies that mutation that slightly speeds up the proofreading step would lead to an increase in both speed and accuracy of the enzymes. However, we show that such mutation would also increase energetic costs of proofreading. This extra cost does not allow the systems to further reduce the error and MFPT. Furthermore, the most interesting feature for both of the systems is the proximity of the MFPT value to the local minimum, which is similar in magnitude to the global minimum. Hence, for both case studies, the KPR rate is fine-tuned so that the loss in speed is insignificant compared with the improvement in accuracy.
Our results on the accuracy–speed trade-off in two important biological networks reveal similar strategies to optimize these two vital quantities. Rates of the steps like substrate binding, hydrolysis (of intermediates), and catalysis seem to be chosen to enhance speed at the cost of accuracy. On the other hand, proofreading or error-correction steps seem to be selected to have such rates that the error is reduced sufficiently with almost no loss in speed. Therefore, between the maximization of accuracy and speed, biological systems appear to give precedence to the latter. Tolerable levels of error and cost of error correction act as constraints to tailor the speed. It is interesting to note here that experimentally observed distribution of discriminatory steps is not optimal from the point of view of minimizing error (34). For example, for ribosome, the rates of the catalytic step are significantly different between the incorporation of the R and W amino acid in the polypeptide chain. Although this may be suboptimal in terms of error minimization (34), it allows for the proofreading rate to be much faster than the catalytic rate for a W substrate and much slower than the catalytic rate for the R substrate. As a result, ribosome avoids futile cycles (correcting the errors it did not make), improving speed and energy cost. This observation gives additional support to our arguments that biological systems distribute discrimination to better optimize speed and not accuracy (see Supporting Information). Our study thus presents a coherent quantitative picture of how the ultimate balance between accuracy and speed is achieved by adjusting various rates in distinct ways. It will be important to test our predictions in other systems and organisms. We believe such testing will further help to elucidate the fundamental mechanisms of proofreading processes in biological systems.
SI Text
First-Passage Probability Density: Evolution Equations.
We study the kinetics of the proofreading networks in terms of the first-passage process. The key quantity to characterize the dynamic properties of the system is the first-passage probability density. Let us denote as the first-passage probability density to reach the R end state at time forming product for the first time before reaching the W end state (product ), starting from state at time (Fig. 1). The corresponding probability density is defined in the same manner. The equations describing the time evolution of the first-passage probability densities are generally known as the backward master equations. To solve them, we introduce . is constructed in similar fashion.
For the DNA replication network (Fig. 1C), we can write
[S1] |
Here, and , being the Dirac delta function. The transition matrix
[S2] |
where , , , , and .
It is more convenient to solve Eq. S1 in Laplace space. The Laplace transform of is defined as
[S3] |
Under this transformation, with the initial condition , Eq. S1 becomes
[S4] |
Here, ( is similarly defined). and . From the algebraic set of equations, Eq. S4, one gets the desired quantities and , i.e., the first-passage probability density to reach one of the two end states for the first time before reaching the other, starting from state E. The corresponding splitting probabilities to reach either of the end states are expressed as
[S5] |
We use the same methodology for the translation network in Fig. 1D. Note that now the end states are linked with the states instead of ES as in replication. The transition matrix has a similar structure but now with , , , and . We also get and . The corresponding Laplace transforms are and .
Error in Replication and Translation.
The error is defined as the ratio of the splitting probabilities
[S6] |
The exact expression of error for the DNA replication network (Fig. 1C) is given by
[S7] |
The experimental data suggest the following limits: (S = R or W). Under these limits, one gets
[S8] |
It is easy to see that the translation network (Fig. 1D) is related to the replication network by the following transformation of rate constant indices: . Then, from Eq. S7, the general expression of error for the translation network comes out as
[S9] |
If one considers no discrimination in step 2 and the catalysis step, i.e., sets , then Eq. S9 reduces to the form as originally derived by Hopfield (9). This result clearly shows the validity of our approach. According to experimental data, GTP hydrolysis (step 2) and proofreading (step 3) steps are mainly unidirectional. Then, taking the limits , we get the approximate form of error as
[S10] |
Dependence of Minimum Error on the Ratio of Proofreading Rate to Catalysis Rate.
The minimum in error is obtained when the system is away from equilibrium but the initial enzyme–substrate binding equilibrium is minimally disturbed; this means (S = R or W) with the magnitude of being high enough to provide sufficient driving chemical potential difference over the cycles. From Eq. S10, we get the minimum error as
[S11] |
We now consider three different scenarios of .
Case I.
In Case I, , i.e., the catalysis rates for both R and W pathways are small compared with proofreading. From Eq. S11, one gets
[S12] |
It is satisfactory to see that, if one sets and , then one achieves the Hopfield limit of minimum error .
Case II.
In Case II, , i.e., the catalysis rate of the R pathway dominates proofreading but the opposite is true for the W pathway. The WT ribosome as well as the mutants are examples of such cases (Tables S3–S5). In this case, we have
[S13] |
Table S5.
Point | |||
I | |||
II | 0.51 | ||
III | 8.7 |
All of the rate constants are per second
Case III.
In Case III, , i.e., the catalysis rates for both R and W pathways are large compared with proofreading. Then, from Eq. S11 it follows that
[S14] |
Comparing Eqs. S12–S14, we have
[S15] |
For the sake of completeness, we also determine the minimum error in the DNA replication case. From Eq. S8, large gives this limit as
[S16] |
We take (Table S2) and then . A high excision (by hydrolysis) rate compared to the Exo–Pol back-sliding rate, i.e., , gives the same minimum error. On the other hand, gives .
The Conditional MFPTs.
The conditional MFPTs to reach the respective end states (in the presence of the other) are defined in terms of the first-passage probability densities in Laplace space as
[S17] |
For our purpose, the quantity of interest is , the MFPT (”conditional” term implied) to reach the R end. We denote it simply by . One can obtain analytical expressions for the MFPT for both the networks using Eq. S17. However, the exact forms come out to be quite unwieldy. To get some theoretical insights, we provide approximate expressions here; these are valid over the relevant parameter ranges, tested for both of the networks. We emphasize that all of the plots of the MFPTs are generated using the exact expressions.
In case of DNA replication, experimental data indicate . Then, with the limits , we get
[S18] |
Here, , , ), and is given by Eq. S8.
Kinetic data for the translation network show that . Then, taking the limit, one has
[S19] |
Here, , and is given by Eq. S10.
Cost of Proofreading.
The enhanced accuracy due to proofreading comes at a price. The KPR step resets the system and prevents it from taking the catalytic path. Thus, hydrolysis energy of triphosphate molecules is consumed without product formation. This extra cost or cost of proofreading, C, is defined as the ratio of the resetting or proofreading flux to the product formation flux. Thus,
[S20] |
Here, (S = R or W) is the flux associated with the proofreading step (step 3), and is the flux of the catalytic step. denotes the steady-state probability to find the system in state j. For the DNA replication network, .
For the aa-tRNA selection network, taking the limit yields
[S21] |
Here, is defined as the ratio of the catalytic fluxes of the W and R product. A similar simplification, however, does not produce such a compact form for the DNA replication network. We use Eq. S20 to generate the plot in Fig. 4B.
Maximum Speed vs. Accuracy Curves and Pareto Front.
In this section, we use multiple parameter variation to investigate the question, What is the maximum speed for a given accuracy value for some choice of parameters? To start with, one must note that the global maximum speed available to the system is always the catalytic rate, , irrespective of the value of other parameters. In other words, if one chooses to vary , then one can always get a higher speed at higher . Hence, it is reasonable to first fix the catalytic rate constant. As all of the trade-off curves in the Results are determined for fixed energetic discrimination ratios , here we also keep them fixed. The reverse rate constants for hydrolysis and proofreading are fixed to low values to ensure the nearly unidirectional nature of these steps.
We determine maximum speed for a given accuracy value by varying different sets of parameters. Some representative curves are shown in Fig. S2 for DNA replication and aa-tRNA selection. In Fig. S2A, the maximum speed vs. accuracy curve is generated in the following manner: Rate constants are varied freely over a range of while is evaluated at each accuracy value as a function of and other fixed parameters. The red part of the curve depicts what is known as a Pareto front. By definition, over the Pareto front, there is always a trade-off between maximum speed and accuracy. All of the points over the rest of the curve (and inside) have either lower accuracy or lower speed or both compared with those comprising the Pareto front. It is evident from Fig. S2A that the actual system lies very close to the maximum speed boundary as well as the Pareto front. The system can increase its accuracy without compromising much in speed by moving onto the Pareto front. However, the cost factor restricts such a strategy, similar to the case shown in Fig. 4. In Fig. S2B, the blue curve is obtained by increasing from to ; the violet curve is the same as in Fig. S2A and is redrawn for comparison. As expected, the maximum speed for the blue curve is lower, but it is still fairly close to the violet one. On the other hand, decreasing has very little effect on the maximal speed vs. accuracy curve and the Pareto front. Hence, the proximity of the system to the maximum speed and the Pareto front is a robust feature.
The maximum speed vs. accuracy curves for WT ribosome are shown in Fig. S2 C and D. They are generated in the following manner: Rate constants are varied freely over a range of while is evaluated at each accuracy value as a function of and other fixed parameters. The actual system lies inside the Pareto front and is relatively close to the maximum speed but not to the maximum accuracy. In Fig. S2D, similar cases of parameter variation are depicted (for details, see the figure caption). The data in Fig. 3A are also included for comparison. The maximum speed curve goes very close to the upper boundary when we set the rate constant to a large value (, orange curve in Fig. S2D). It is expected, as an increase in helps to increase the speed, as already shown in Fig. 3B. Thus, even if we were to include in our multiparameter variation, the maximum speed is expected to arise at the upper boundary of the range of . Now, it is crucial to figure out what the actual parameter values are for some suitably chosen points on the maximum speed vs. accuracy curves. For this purpose, we consider the curve in Fig. S2C and pick three points on the curve as follows: (point I) with accuracy equal to that of the system, (point II) the maximum of the maximum speed vs. accuracy curve, and (point III) with maximum speed equal to that of the system. The corresponding parameter values are listed in Table S5. It is evident from the data that all three positions are unrealistic for the system to reside on, particularly in light of experimentally measured kinetic constants (Table S2). It is important to note that, as the system lies inside the Pareto front, one cannot rule out the possibility of improving both speed and accuracy with affordable cost. However, it is not quite clear what additional physical and/or biochemical constraints could keep the system away from the Pareto front.
Acknowledgments
This work is supported by Center for Theoretical Biological Physics National Science Foundation (NSF) Grant PHY-1427654. A.B.K. also acknowledges support from Welch Foundation Grant C-1559 and from NSF Grant CHE-1360979.
Footnotes
The authors declare no conflict of interest.
This article is a PNAS Direct Submission.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1614838114/-/DCSupplemental.
References
- 1.Alberts B, et al. Essential Cell Biology. 4th Ed Garland Sci; New York: 2013. [Google Scholar]
- 2.Watson JD. Molecular Biology of the Gene. WA Benjamin; New York: 1976. [Google Scholar]
- 3.Kunkel TA, Bebenek K. DNA replication fidelity. Annu Rev Biochem. 2000;69:497–529. doi: 10.1146/annurev.biochem.69.1.497. [DOI] [PubMed] [Google Scholar]
- 4.Zaher HS, Green R. Fidelity at the molecular level: Lessons from protein synthesis. Cell. 2009;136:746–762. doi: 10.1016/j.cell.2009.01.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Reynolds NM, Lazazzera BA, Ibba M. Cellular mechanisms that control mistranslation. Nat Rev Microbiol. 2010;8:849–856. doi: 10.1038/nrmicro2472. [DOI] [PubMed] [Google Scholar]
- 6.Nangle LA, de Crecy Lagard V, Doring V, Schimmel P. Genetic code ambiguity cell viability related to severity of editing defects in mutant tRNA synthetases. J Biol Chem. 2002;277:45729–45733. doi: 10.1074/jbc.M208093200. [DOI] [PubMed] [Google Scholar]
- 7.Schimmel P. Development of tRNA synthetases and connection to genetic code and disease. Protein Sci. 2008;17:1643–1652. doi: 10.1110/ps.037242.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Pauling L. Festschrift Arthur Stoll. Birkhauser; Basel: 1957. The probability of errors in the process of synthesis of protein molecules; pp. 597–602. [Google Scholar]
- 9.Hopfield JJ. Kinetic proofreading: A new mechanism for reducing errors in biosynthetic processes requiring high specificity. Proc Natl Acad Sci USA. 1974;71:4135–4139. doi: 10.1073/pnas.71.10.4135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Ninio J. Kinetic amplification of enzyme discrimination. Biochimie. 1975;57:587–595. doi: 10.1016/s0300-9084(75)80139-8. [DOI] [PubMed] [Google Scholar]
- 11.Freter RR, Savageau MA. Proofreading systems of multiple stages for improved accuracy of biological discrimination. J Theor Biol. 1980;85:99–123. doi: 10.1016/0022-5193(80)90284-2. [DOI] [PubMed] [Google Scholar]
- 12.Ehrenberg M, Blomberg C. Thermodynamic constraints on kinetic proofreading in biosynthetic pathways. Biophys J. 1980;31:333–358. doi: 10.1016/S0006-3495(80)85063-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Hopfield JJ, Yamane T, Yue V, Coutts SM. Direct experimental evidence for kinetic proofreading in amino acylation of tRNAIle. Proc Natl Acad Sci USA. 1976;73:1164–1168. doi: 10.1073/pnas.73.4.1164. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Johnson KA. Conformational coupling in DNA polymerase fidelity. Annu Rev Biochem. 1993;62:685–713. doi: 10.1146/annurev.bi.62.070193.003345. [DOI] [PubMed] [Google Scholar]
- 15.Gromadski KB, Rodnina MV. Kinetic determinants of high-fidelity tRNA discrimination on the ribosome. Mol Cell. 2004;13:191–200. doi: 10.1016/s1097-2765(04)00005-x. [DOI] [PubMed] [Google Scholar]
- 16.Blanchard SC, Gonzalez RL, Kim HD, Chu S, Puglisi JD. tRNA selection and kinetic proofreading in translation. Nat Struct Mol Biol. 2004;11:1008–1014. doi: 10.1038/nsmb831. [DOI] [PubMed] [Google Scholar]
- 17.Kunkel TA. DNA replication fidelity. J Biol Chem. 2004;279:16895–16898. doi: 10.1074/jbc.R400006200. [DOI] [PubMed] [Google Scholar]
- 18.Murugan A, Huse DA, Leibler S. Speed, dissipation, and error in kinetic proofreading. Proc Natl Acad Sci USA. 2012;109:12034–12039. doi: 10.1073/pnas.1119911109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Hartich D, Barato AC, Seifert U. Nonequilibrium sensing and its analogy to kinetic proofreading. New J Phys. 2015;17:055026. [Google Scholar]
- 20.Thompson RC, Karim AM. The accuracy of protein biosynthesis is limited by its speed: High fidelity selection by ribosomes of aminoacyl-tRNA ternary complexes containing GTP [gamma S] Proc Natl Acad Sci USA. 1982;79:4922–4926. doi: 10.1073/pnas.79.16.4922. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Johansson M, Lovmar M, Ehrenberg M. Rate and accuracy of bacterial protein synthesis revisited. Curr Opin Microbiol. 2008;11:141–147. doi: 10.1016/j.mib.2008.02.015. [DOI] [PubMed] [Google Scholar]
- 22.Johansson M, Zhang J, Ehrenberg M. Genetic code translation displays a linear trade-off between efficiency and accuracy of tRNA selection. Proc Natl Acad Sci USA. 2012;109:131–136. doi: 10.1073/pnas.1116480109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Wong I, Patel SS, Johnson KA. An induced-fit kinetic mechanism for DNA replication fidelity: Direct measurement by single-turnover kinetics. Biochemistry. 1991;30:526–537. doi: 10.1021/bi00216a030. [DOI] [PubMed] [Google Scholar]
- 24.Wohlgemuth I, Pohl C, Mittelstaet J, Konevega AL, Rodnina MV. Evolutionary optimization of speed and accuracy of decoding on the ribosome. Phil Trans R Soc B. 2011;366:2979–2986. doi: 10.1098/rstb.2011.0138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Sartori P, Pigolotti S. Kinetic versus energetic discrimination in biological copying. Phys Rev Lett. 2013;110:188101. doi: 10.1103/PhysRevLett.110.188101. [DOI] [PubMed] [Google Scholar]
- 26.Savir Y, Tlusty T. The ribosome as an optimal decoder: A lesson in molecular recognition. Cell. 2013;153:471–479. doi: 10.1016/j.cell.2013.03.032. [DOI] [PubMed] [Google Scholar]
- 27.van Kampen NG. Stochastic Processes in Physics and Chemistry. 3rd Ed North Holland; Amsterdam: 2007. [Google Scholar]
- 28.Bel G, Munsky B, Nemenman I. The simplicity of completion time distributions for common complex biochemical processes. Phys Biol. 2010;7:016003. doi: 10.1088/1478-3975/7/1/016003. [DOI] [PubMed] [Google Scholar]
- 29.Zaher HS, Green R. Hyperaccurate and error-prone ribosomes exploit distinct mechanisms during tRNA selection. Mol Cell. 2010;39:110–120. doi: 10.1016/j.molcel.2010.06.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Kolomeisky AB. Motor Proteins and Molecular Motors. CRC; Boca Raton, FL: 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Kou SC, Cherayil BJ, Min W, English BP, Xie XS. Single-molecule Michaelis - Menten equations. J Phys Chem B. 2005;109:19068–19081. doi: 10.1021/jp051490q. [DOI] [PubMed] [Google Scholar]
- 32.Tubulekas I, Hughes D. Suppression of rpsL phenotypes by tuf mutations reveals a unique relationship between translation elongation and growth rate. Mol Microbiol. 1993;7:275–284. doi: 10.1111/j.1365-2958.1993.tb01118.x. [DOI] [PubMed] [Google Scholar]
- 33.Lovmar M, Ehrenberg M. Rate, accuracy and cost of ribosomes in bacterial cells. Biochimie. 2006;88:951–961. doi: 10.1016/j.biochi.2006.04.019. [DOI] [PubMed] [Google Scholar]
- 34.Murugan A, Huse DA, Leibler S. Discriminatory proofreading regimes in nonequilibrium systems. Phys Rev X. 2014;4:021016. [Google Scholar]