Elucidating interplay of speed and accuracy in biological error correction
See allHide authors and affiliations
Edited by William Bialek, Princeton University, Princeton, NJ, and approved April 7, 2017 (received for review September 4, 2016)

Significance
Biological processes are unique in showing a remarkable level of accuracy in discriminating between similar molecules. This characteristic is attributed to an error-correcting mechanism known as kinetic proofreading. It is widely believed that the enhancement of the accuracy in biological processes always slows them down. By analyzing the fundamental processes of DNA replication and protein translation, we established that these systems maximize speed rather than accuracy with additional energetic constraints. Our theoretical study further indicates that both speed and accuracy can be enhanced in certain parameter regimes. The resulting findings provide a microscopic picture of how complex biological processes can be accomplished so quickly with minimal errors.
Abstract
One of the most fascinating features of biological systems is the ability to sustain high accuracy of all major cellular processes despite the stochastic nature of underlying chemical processes. It is widely believed that such low error values are the result of the error-correcting mechanism known as kinetic proofreading. However, it is usually argued that enhancing the accuracy should result in slowing down the process, leading to the so-called speed–accuracy trade-off. We developed a discrete-state stochastic framework that allowed us to investigate the mechanisms of the proofreading using the method of first-passage processes. With this framework, we simultaneously analyzed the speed and accuracy of the two fundamental biological processes, DNA replication and tRNA selection during the translation. The results indicate that these systems tend to optimize speed rather than accuracy, as long as the error level is tolerable. Interestingly, for these processes, certain kinetic parameters lay in the suboptimal region where their perturbations can improve both speed and accuracy. Additional constraints due to the energetic cost of proofreading also play a role in the error correcting process. Our theoretical findings provide a microscopic picture of how complex biological processes are able to function so fast with high accuracy.
Biological systems exhibit remarkable accuracy in selecting the right substrate from the pool of chemically similar molecules. This property is common to all fundamental biological processes such as DNA replication, RNA transcription, and protein translation (1). The level of fidelity in various stages of genetic information flow depends on their relative importance in sustaining system stability. DNA replication is thought to be the most accurate process, with an error rate
Initially, it was unclear how the small differences in equilibrium binding stability of structurally similar substrates can allow such a high degree of discrimination (8). Then, an explanation was provided independently by Hopfield (9) and Ninio (10), who proposed an error-correction mechanism called kinetic proofreading (KPR). KPR allows enzymes to use the free energy difference between right and wrong substrates multiple times using additional steps (9); this amplifies the small energetic discrimination and results in a lower error compared with that in chemical equilibrium. However, such processes require significant energy consumption (9). To this end, enzymes use some energy-rich molecules, like ATP, to provide for the necessary driving (11, 12). The mechanism was experimentally verified later in different biological systems (13⇓⇓⇓–17). Several recent studies generalized it to more complex networks and found analogies between proofreading and other phenomena such as microtubule growth (18) or bacterial chemotaxis (19). These results broaden the concept of KPR and show that such chemically driven regulatory mechanisms are widely present.
Cells must process genetic information not just accurately but also sufficiently rapidly. Proofreading enhances the accuracy by resetting the system to its initial configuration without progressing to product state (9). The completion time of the reaction is, thus, expected to increase. Hence, there could be a compromise, or trade-off, between accuracy and speed of the process (20). The understanding on this trade-off is mainly based on the Michaelis–Menten (MM) description of specificity (21, 22). These studies indicate that the minimum possible error is achieved at vanishingly low catalytic rate, i.e., when the process is the slowest (9, 21). In contrast, biological polymerization reactions must occur reasonably fast (15, 23). A recent study demonstrated a new speed–accuracy regime in the KPR model by modifying the catalytic rate (18). In this regime, a large gain in speed comes with a relatively small loss in accuracy. The authors suggested that biological systems may use this regime (18). For example, in the tRNA selection process, a fast GTP hydrolysis step speeds up protein synthesis but prevents maximal possible selectivity of the initial tRNA–ribosome binding step (21, 24).
Despite the number of studies, a clear quantitative picture of how the balance between speed and accuracy is tuned is lacking. Several current models of proofreading still mainly focus on the initial stages of substrate selection (22, 25, 26) or assume disparity of rate constants of only a few types of steps (18, 19). In contrast, experimental data show that biological systems have different rates for the right and wrong substrates for each step of the network (4, 14, 15). It is not clear how such distributed discrimination of the reaction rates affects the trade-off. Moreover, proofreading steps come with an extra energy cost to gain higher accuracy (11), but the role of this cost in the trade-off is not apparent. Therefore, to understand the fundamental mechanisms of proofreading in real biological systems, one needs to answer the following questions: (i) How does the system set its priorities when choosing between accuracy and speed, two seemingly opposite objectives? (ii) Can speed and accuracy change in the same direction; in other words, can perturbations of a kinetic parameter from its naturally selected value improve both speed and accuracy? (iii) How does the extra energy expenditure due to KPR affect the speed–accuracy optimization?
Here we focus on the role of reaction kinetics in governing the speed–accuracy trade-off. To this end, we develop a generalized framework to study one-loop KPR networks, assuming distinct rate constants for every step of the right (R) and wrong (W) pathways. Based on this approach, we model the overall selection of the correct substrate over the incorrect one as a first-passage problem, to obtain a full dynamic description of the process (27, 28). This general framework is applied to two important examples, namely, DNA replication by T7 DNA polymerase (DNAP) (3, 14) and protein synthesis by Escherichia coli ribosome (22, 29) (Fig. 1 A and B)). Starting from the experimentally measured rate constants for each system, we vary their values to analyze the resulting changes in speed and accuracy and to assess the trade-off. The role played by the extra energy consumption or cost of proofreading (11) is also investigated. By comparing the behavior of the two systems, we search for general properties of biological error correction.
Schematic representation of proofreading networks for (A) DNA replication by T7 DNAP enzyme and (B) aminoacyl(aa)-tRNA selection by E. coli ribosome during translation. Corresponding chemical networks are shown for (C) replication and (D) translation. Reaction steps comprising the cycles are labeled 1 to 3. Rate constants of each step are denoted by
Methods
Proofreading Networks of Replication and Translation.
DNA replication as well as protein synthesis use nucleotide complementarity to select the cognate substrate over other near/noncognate substrates. During replication, dNTP molecules complementary to the DNA template are chosen. Similarly, during protein synthesis, aminoacyl(aa)-tRNAs are picked by ribosome based on the complementarity of their anticodon to the mRNA codon. Wrong substrates that bind initially can be removed by error-correction proofreading mechanisms. Kinetic experiments coupled with modeling revealed a lot of mechanistic details about both of the processes (3, 14, 15, 24). The schemes depicted in Fig. 1 represent the key steps to understand the KPR in these networks.
The schemes in Fig. 1 A and C are for DNA replication (3, 14). E denotes the T7 DNAP enzyme in complex with a DNA primer template. The R and W substrates are correct and incorrect base-paired dNTP molecules, respectively. Step 1 generates enzyme–DNA complexes ER(or EW) with the primer elongated by one nucleotide. Addition of another correct nucleotide to ER (EW) gives rise to
The schemes in Fig. 1 B and D show the aa-tRNA selection process by ribosome during translation (29). Here, E denotes the E. coli ribosome with mRNA. Cognate (near-cognate) aa-tRNAs in ternary complex with elongation factor Tu (EFTu) and GTP bind with ribosome in step 1 to form
In both schemes, we take the rate constants of the W cycle to be related to those of the R cycle through
Accuracy and Speed from First-Passage Description.
We determine the error and speed of the substrate selection kinetics from the first-passage probability density (27, 28). With this method, we can analyze an arbitrary catalytic reaction scheme and focus on the transitions starting from the initial state
Let us denote
The speed of a reaction is naturally quantified by the net rate of the product formation. As with any chemical reaction rate, it can be defined as the inverse of the mean first-passage time (MFPT), i.e., the mean time it takes to cross the energy barrier that separates reactants and products for the first time. For example, a well-known application of this approach for single-molecule MM kinetics results in the traditional expression for the rate as the inverse of the MFPT (31). We note that the speed toward the correct product can nevertheless be affected by the presence of the incorrect substrate. Thus, it is important to consider them together in contrast to the prevalent measure of the speed in literature neglecting the presence of the W pathway (21). In our case, the expression of the MFPT to reach each product state is given by the first moment of the corresponding probability density (27).
Results
Although our formalism can be applied to an arbitrary KPR scheme, we’ve chosen to study two fundamentally important biologically processes: DNA replication and translation. These processes are best characterized in terms of underlying kinetic parameters, and we can study the speed–accuracy trade-off in the biologically relevant parameter region. Notably, despite differences in parameters and KPR mechanisms for the two case studies, we reach similar conclusions for both.
Importance of Speed over Accuracy in DNA Replication by T7 DNAP.
The T7 DNAP enzyme catalyzes the polymerization of a DNA primer over a template strand (14). Wrongly incorporated dNTP is removed by the proofreading mechanism that involves the exonuclease site of DNAP (23). The model parameters of the corresponding reaction network (Fig. 1A) are listed in Table S1. They are based on the experimental data of Wong et al. (23). We do not consider dissociation of the DNA from the enzyme in our model. This approach is justified due to the faster polymerization rates in the R path and the higher exonucleolytic sliding rate in the W path (Table S1).
Model parameters for DNA replication by T7 DNAP
The error,
Speed–accuracy trade-off for T7 DNAP. (A) The change in error,
The actual system (green circle) is situated on the trade-off branch of the
The Pol–Exo sliding is an important step in error correction. The
tRNA Selection by E. coli Ribosome Is Optimized for Speed Rather than Accuracy with a Cost Constraint.
During translation, the ribosome decodes the mRNA sequences by selecting aa-tRNAs in ternary complex with EFTu and GTP (4, 15). Noncognate aa-tRNAs are removed by proofreading dissociation of the complex from the ribosome A site after GTP hydrolysis (24, 29). The model parameters of the network (Fig. 1B) for WT E. coli ribosome are listed in Table S2. They are based on the experimental data of Zaher and Green (29). We chose
Model parameters for aa-tRNA selection by WT E. coli ribosome
Robustness of error–MFPT trade-off for aa-tRNA selection against change in the parameter
We show the
Speed–accuracy trade-off in aa-tRNA selection by three varieties of E. coli ribosome. One is the wild-type (WT). The other two are hyperaccurate (HYP) and more error-prone (ERR) mutants. (A)
Model parameters for aa-tRNA selection by hyperaccurate (HYP, rpsL141) mutant E. coli ribosome
Model parameters for aa-tRNA selection by more-erroneous (ERR, rpsD12) mutant E. coli ribosome
(A) The curve of maximum speed (
Robustness of the error–MFPT trade-off for the GTP hydrolysis step with respect to random variation of various rate constants. The factors
The change in the ternary complex binding rate constant,
Next, we explore the effects of variation of the proofreading step rate constant,
Because correction by proofreading resets the system without a product formation, it has a cost associated with futile cycles where the correct substrate was inserted and then removed. The cost of proofreading,
(A) An
Discussion
Evolution has optimized the kinetic parameters of biological enzymes to achieve the desired levels of accuracy and speed at various stages of biological information flow. In this study, by examining how the balance between speed and accuracy changes with variation of the underlying kinetic parameters, we gain insights into the important priorities for this optimization. To this end, we focus on two fundamental examples of biological proofreading networks: DNA replication and protein translation. In both cases, the systems tend to achieve maximum speed by losing significant accuracy. However, the speed–accuracy trade-off only occurs in the limited region of the parameter space, e.g., after the polymerization rate in replication passes the minimum error point. In the case of translation, the trade-off appears between the minima in error and the MFPT for the GTP hydrolysis step. A similar conclusion about the importance of speed over accuracy is reached by varying the rates of the proofreading steps in both systems. Although higher proofreading rates can further improve the accuracy without losing much speed, the associated energy cost of proofreading may restrict further improvements on an already acceptable speed and accuracy.
An important insight from the above analyses is that the speed–accuracy trade-off is not universally present, and its occurrence depends on the specific values of kinetic rates. Biologically, this result implies that mutations or application of drugs that reduce the enzyme’s accuracy do not necessarily increase its speed, and vice versa. The widespread view of a compromise between accuracy and speed is mainly based on their dependence on the effective catalytic rate of the process (9, 21). Indeed, the larger the catalytic rate, the higher the speed and the lower the accuracy. However, the role of other steps, like hydrolysis and proofreading, are not as straightforward. Our study reveals that, for these steps, trade-offs are present only over a certain range of rates, and both accuracy and speed can improve with variation of certain kinetic parameters. The partitioning of the error–time curves into trade-off and non-trade-off branches clarifies the distinct roles of various transitions and the molecular mechanisms of the speed–accuracy optimization. Our conclusions are also supported by a more advanced analysis of the maximum speed vs. accuracy curves using Pareto fronts, as explained in detail in Supporting Information.
The analysis of speed–accuracy trade-offs for different mutant varieties of E. coli ribosome further confirms the importance of speed over accuracy. The WT and two mutants (HYP and ERR) lie close to the minimum MFPT point on the error–time curves (Fig. 3A). However, the WT and HYP ribosomes are on the trade-off branch, whereas the ERR mutant is on the non-trade-off branch. Thus, movement down the slope toward the trade-off branch would raise both accuracy and speed for the ERR ribosome. That is how the WT ribosome may have evolved from the more erroneous ERR type. However, any further movement upward along the trade-off branch means a slowdown with a lower error; this leads to the more accurate (HYP) mutant. Rejection of the latter as the natural choice implies that optimization of speed is critical. We note that comparison of E. coli growth rates with WT and mutant ribosomes already indicates such an optimization (21, 32). However, according to the prevailing notion on the ever-present compromise between error and speed, the more erroneous (ERR) ribosome should be faster. Hence, the hindered growth for ERR mutant was ascribed presumably to less-active proteins (33). Our results indicate that not only the accuracy but also the speed of peptide chain elongation can be smaller for the ERR mutant.
Despite different schemes and parameter values of the replication and translation networks, there appears to be a general mechanism of error correction; this becomes apparent from the trade-off diagrams for the proofreading step. A rate constant of the proofreading step in both the cases is selected such that speed of the system is close to the maximum possible one. The actual systems reside on the non-trade-off branch of their respective error–time curves. Biologically, that implies that mutation that slightly speeds up the proofreading step would lead to an increase in both speed and accuracy of the enzymes. However, we show that such mutation would also increase energetic costs of proofreading. This extra cost does not allow the systems to further reduce the error and MFPT. Furthermore, the most interesting feature for both of the systems is the proximity of the MFPT value to the local minimum, which is similar in magnitude to the global minimum. Hence, for both case studies, the KPR rate is fine-tuned so that the loss in speed is insignificant compared with the improvement in accuracy.
Our results on the accuracy–speed trade-off in two important biological networks reveal similar strategies to optimize these two vital quantities. Rates of the steps like substrate binding, hydrolysis (of intermediates), and catalysis seem to be chosen to enhance speed at the cost of accuracy. On the other hand, proofreading or error-correction steps seem to be selected to have such rates that the error is reduced sufficiently with almost no loss in speed. Therefore, between the maximization of accuracy and speed, biological systems appear to give precedence to the latter. Tolerable levels of error and cost of error correction act as constraints to tailor the speed. It is interesting to note here that experimentally observed distribution of discriminatory steps is not optimal from the point of view of minimizing error (34). For example, for ribosome, the rates of the catalytic step are significantly different between the incorporation of the R and W amino acid in the polypeptide chain. Although this may be suboptimal in terms of error minimization (34), it allows for the proofreading rate to be much faster than the catalytic rate for a W substrate and much slower than the catalytic rate for the R substrate. As a result, ribosome avoids futile cycles (correcting the errors it did not make), improving speed and energy cost. This observation gives additional support to our arguments that biological systems distribute discrimination to better optimize speed and not accuracy (see Supporting Information). Our study thus presents a coherent quantitative picture of how the ultimate balance between accuracy and speed is achieved by adjusting various rates in distinct ways. It will be important to test our predictions in other systems and organisms. We believe such testing will further help to elucidate the fundamental mechanisms of proofreading processes in biological systems.
SI Text
First-Passage Probability Density: Evolution Equations.
We study the kinetics of the proofreading networks in terms of the first-passage process. The key quantity to characterize the dynamic properties of the system is the first-passage probability density. Let us denote
For the DNA replication network (Fig. 1C), we can write
It is more convenient to solve Eq. S1 in Laplace space. The Laplace transform of
Error in Replication and Translation.
The error is defined as the ratio of the splitting probabilities
The exact expression of error for the DNA replication network (Fig. 1C) is given by
Dependence of Minimum Error on the Ratio of Proofreading Rate to Catalysis Rate.
The minimum in error is obtained when the system is away from equilibrium but the initial enzyme–substrate binding equilibrium is minimally disturbed; this means
Case I.
In Case I,
Case II.
In Case II,
Parameter values corresponding to the three points on the maximum speed vs. accuracy curve in Fig. S2A for WT E. coli ribosome
Case III.
In Case III,
The Conditional MFPTs.
The conditional MFPTs to reach the respective end states (in the presence of the other) are defined in terms of the first-passage probability densities in Laplace space as
For our purpose, the quantity of interest is
In case of DNA replication, experimental data indicate
Kinetic data for the translation network show that
Cost of Proofreading.
The enhanced accuracy due to proofreading comes at a price. The KPR step resets the system and prevents it from taking the catalytic path. Thus, hydrolysis energy of triphosphate molecules is consumed without product formation. This extra cost or cost of proofreading, C, is defined as the ratio of the resetting or proofreading flux to the product formation flux. Thus,
For the aa-tRNA selection network, taking the
Here,
Maximum Speed vs. Accuracy Curves and Pareto Front.
In this section, we use multiple parameter variation to investigate the question, What is the maximum speed for a given accuracy value for some choice of parameters? To start with, one must note that the global maximum speed available to the system is always the catalytic rate,
We determine maximum speed for a given accuracy value by varying different sets of parameters. Some representative curves are shown in Fig. S2 for DNA replication and aa-tRNA selection. In Fig. S2A, the maximum speed vs. accuracy curve is generated in the following manner: Rate constants
The maximum speed vs. accuracy curves for WT ribosome are shown in Fig. S2 C and D. They are generated in the following manner: Rate constants
Acknowledgments
This work is supported by Center for Theoretical Biological Physics National Science Foundation (NSF) Grant PHY-1427654. A.B.K. also acknowledges support from Welch Foundation Grant C-1559 and from NSF Grant CHE-1360979.
Footnotes
↵1O.A.I. and A.B.K. contributed equally to this work.
- ↵2To whom correspondence may be addressed. Email: tolya{at}rice.edu or igoshin{at}rice.edu.
Author contributions: A.B.K. and O.A.I. designed research; K.B., A.B.K., and O.A.I. performed research; and K.B., A.B.K., and O.A.I. wrote the paper.
The authors declare no conflict of interest.
This article is a PNAS Direct Submission.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1614838114/-/DCSupplemental.
References
- ↵.
- Alberts B, et al.
- ↵.
- Watson JD
- ↵
- ↵
- ↵
- ↵.
- Nangle LA,
- de Crecy Lagard V,
- Doring V,
- Schimmel P
- ↵
- ↵.
- Pauling L
- ↵.
- Hopfield JJ
- ↵
- ↵
- ↵
- ↵.
- Hopfield JJ,
- Yamane T,
- Yue V,
- Coutts SM
- ↵
- ↵
- ↵
- ↵.
- Kunkel TA
- ↵.
- Murugan A,
- Huse DA,
- Leibler S
- ↵.
- Hartich D,
- Barato AC,
- Seifert U
- ↵.
- Thompson RC,
- Karim AM
- ↵
- ↵.
- Johansson M,
- Zhang J,
- Ehrenberg M
- ↵
- ↵.
- Wohlgemuth I,
- Pohl C,
- Mittelstaet J,
- Konevega AL,
- Rodnina MV
- ↵.
- Sartori P,
- Pigolotti S
- ↵
- ↵.
- van Kampen NG
- ↵
- ↵
- ↵.
- Kolomeisky AB
- ↵
- ↵
- ↵
- ↵.
- Murugan A,
- Huse DA,
- Leibler S