Abstract
High fidelity polymerases are efficient catalysts of phosphodiester bond formation during DNA replication/repair. We interpret molecular dynamics simulations of a polymerase bound to its substrate DNA and incoming nucleotide using a quasi-harmonic model to study the effect of external forces applied to the bound DNA on the kinetics of phosphoryl transfer. The origin of the force dependence is shown to be an intriguing coupling between slow, delocalized polymerase-DNA modes and fast catalytic site motions. Using non-cognate DNA substrates we show that the force dependence is context specific.
The tremendous catalytic power of high fidelity DNA polymerases, which enables them to perform DNA base pair synthesis (BPS) with high efficiency as well as high specificity (error rate of 1 per 106–108 base pairs synthesized) is a subject of intense experimental1 and theoretical2, 3 investigations. During BPS a complementary nucleotide triphosphate (dNTP) is incorporated into one of the strands (denoted as primer) of a double stranded DNA molecule, opposite an unpaired base in the other strand (denoted as template). The basic reaction (chemical step) is the formation of a covalent phosphodiester bond between the incoming dNTP and the terminal primer base (Fig. 1), prior to which the enzyme has to switch to an “active” state through a large scale conformational rearrangement of the fingers (or thumb) domain1. Subsequently, the enzyme switches back to an “inactive” state and the cycle repeats. Based on a large body of structural and biochemical data, it is known that the identity of the rate-limiting step in the BPS cycle depends not only on the type of polymerase, but also the base pairs involved in the synthesis1. During correct incorporation, either the conformational change preceding the chemical step (E. Coli pol I) or the chemical step itself (pol β) can be rate-limiting, while for incorrect incorporation indirect evidence suggests that for several polymerases the chemical step is rate-limiting. Using novel tools for single molecule experimentation4–7 pioneering studies of polymerases replicating DNA stretched under differing tensions8, 9 show that the replication rate is highly sensitive to forces exerted on the template strand (replication rate increases with forces less than 4–7 pN and decreases for higher values of the force); this implies that the applied force affects the rate limiting step. Theoretical models parameterized by experimental data8, 9 and atomistic simulations10 suggest that the rate limiting step in this case involves the rearrangement of more than one base pair. While such models are instructive the force dependence is explained purely in terms of DNA elasticity neglecting the coupling between the complex cooperative motion of polymerase and DNA degrees of freedom at the catalytic site. In this letter, we show that such a coupling leads to a new paradigm of force-dependence of the kinetics of the chemical step in these systems. This is particularly useful for extending the experimental studies to systems with mis-paired DNA substrates where the chemical step is indeed believed to be rate limiting. Through a correlation analysis of molecular dynamics (MD) simulation data, we suggest that the external force on the DNA template strand impacts the chemical step due to a coupling between the primary reactive distance for phosphoryl transfer (the O3′-Pα distance in Fig. 1) and slow delocalized enzyme-DNA motions11. The presence of such a coupling adds to accumulating evidence of a pre-chemistry phase during catalysis, in which delocalized (global), and local motions of the enzyme-substrate complex driven by thermal fluctuations synergistically orchestrate the assembly of an optimal catalytic site12, 13. We also show that the coupling is disrupted to varying extents when the inserted nucleotide is not complementary to the template base or when the template base is oxidatively damaged. We combine our correlation analysis with a quasi-harmonic approximation to derive the context-specific force dependence of the phosphoryl transfer step for four polymerase-DNA complexes.
Figure 1.

(Right) Solvated BF pol/DNA/dNTP ternary complex. (Left) Catalytic site showing key elements of the phosphoryl transfer reaction. A covalent bond is formed between α-phosphorous of the incoming nucleotide and the O3′ oxygen of the terminal primer base.
In this study, we have chosen to work with the large fragment from Bacillus stearothermophilus (Bacillus fragment or BF) even though previous single molecule data is available for the highly homologous (49% sequence identity) Klenow fragment (or KF) because BF allows us to explore four systems: correct dCTP/incorrect dATP insertion opposite an undamaged/oxidatively damaged guanine (G) template base. We denote the systems as (1) G:C, (2) G:A, (3) 8oxoG:C, and (4) 8oxoG:A. For KF, (3) and (4) are not physical because unlike BF, KF is not known to bypass the 8oxoG lesion, see section S.1 and Fig. S1 in supplementary material (SM) for a sequence and structural comparison of BF with KF and T7 polymerases. Four fully solvated model complexes of BF pol14 with DNA+dNTP substrate are prepared for G:C, G:A, 8oxoG:C, and 8oxoG:A systems. In the 8oxoG models G is replaced with 8-hydroxoguanine (8oxoG), one of the most prominent lesion generated by oxidative damage. We perform all atom MD simulations of fully solvated complexes using the CHARMM force field15 and the NAMD16 simulation package. Following standard equilibration protocol, see (SM) for details on methods and analyses, 10 ns constant volume and temperature (NVT) MD trajectories are obtained for the four model systems and data from the last 5 ns are utilized in the subsequent analysis.
In the single molecule experiments, a force applied to the template strand of the DNA fragment alters the replication rate. Here, using a minimal model, we specifically discuss the effect of the force on the chemical step during BPS. If A and B denote the ground and transition states for catalysis, then according to transition state theory the “equilibrium constant” for the system to switch from state A to B in the absence of any external force is (see Fig. 2): Keq(0) ∝ exp(−ΔG/kBT), where, ΔG is the free energy difference between A and B17. A comparison of equilibrated structures (Fig. 1) with “ideal” transition state structures in similar systems18, 19 shows that the primary reactive distance, X (signifying nucleophilic attack) for catalysis is the distance between the terminal primer adenine (A) O3′ atom and the Pα atom of the incoming dNTP. We assume that the passage from state A to B is approximated purely by a change in the coordinate X, which is harmonic (to a first approximation) close to the catalytic ground state with a spring constant kx and equilibrium value XA (Fig. 2). Then, in the absence of an external force the total (free) energy cost to drive the system from state A to B is: W(0) = 0.5kx(XB(0) − XA(0))2. Within the linear response limit, the applied force will shift the ground state equilibrium position XA(F) and the position of the transition state XB(F)17. Thus, the change in the total (free) energy cost to transition from state A to state B, i.e. ΔW=W(0)−W(F), under an applied force F acting along X is: ΔW(F) = W(0) − 0.5kx(XB(F) − XA(F))2 and the Keq under force F can be written as17: Keq(F) ∝ exp(−[ΔG − ΔW(F)]/kBT).
Figure 2.

The potential surface along the catalytic reactive distance X is given by V0=0.5kx[X−XA(0)]2. An external force F shifts the equilibrium position of the ground state A from XA(0) to XA(F) and that of the transition state B from XB(0) to XB(F) to alter the barrier for catalysis. The new potential surface for X is VF =0.5kx[X−XA(F)]2−[Fx/(2kx)], where Fx is the force along X.
A force applied on the template strand (as in the single molecule studies) will change X only if a coupling exists between X and the DNA template strand. We explore this coupling by carrying out a principal component analysis (PCA)20 on an “active site region”, including the catalytic site (Fig. 1) and comprising of non-hydrogen atoms of the incoming dNTP, six residues of the DNA template strand, four residues of the DNA primer strand, the two Mg2+ ions and two polymerase aspartate residues D830 and D653 critical for catalysis, and bound waters at the catalytic site, (see section S.2 in SM for a discussion on this choice for the subset). The plot of the variance-covariance matrix (σ, see section S.1.4 in SM) quantifies the correlations in atomic motions in the active-site, (see Fig 3 and Fig S8–S11) and the PCA of σ results in ξ=(ξ1,ξ2,…,ξ3N-6) orthogonal eigenvectors (PCs) with eigenvalues λ=(λ1, λ2,…,λ3N-6) sorted in descending order i.e. λ1>λ2…,λ3N-7>λ3N-6; visualization of the PCs suggests how the polymerase utilizes the dynamical coupling to possibly drive the reaction (see Movie-S6 in SM). Intriguingly, we find that the correlations in σ and hence the dynamical coupling of the PCs with X are both context-specific, see Fig 3, Fig S8–S11 and Table I. For the G:C system, we find that PCs that significantly influence X (Table I) are also among the top (10) PCs which account for most (>70%) of the atomic fluctuations in the MD trajectories. Such a strong coupling is disrupted to varying degrees for the three non-cognate systems (Table 1). The contrast in correlations in Fig 3 (between G:C and G:A) and more generally in Figs S8–S11 (among the four systems) determines the context-specific dynamical coupling in Table 1, wherein the catalytic site reactive distances are coupled to global motions of the BF/DNA/dNTP complex.
Figure 3.

Correlations between vector displacements (r-〈r〉) of atoms for the G:C and G:A systems; vector r is from origin to the atom of interest, and 〈r〉 its average. Enlarged views for all 4 systems are provided in Figs. S8–S11 in SM.
Table I.
The 5 modes showing the strongest correlation with X, measured by the coefficient η, defined for two independent variables x,y as: with −1 < ηxy < 1.
| G:C | G:A | 8oxoG:C | 8oxoG:A | ||||
|---|---|---|---|---|---|---|---|
| Mode | η | Mode | η | Mode | η | Mode | η |
| 1 | 0.34 | 105 | 0.27 | 10 | 0.33 | 1 | −0.23 |
| 3 | −0.32 | 112 | −0.22 | 9 | 0.32 | 20 | 0.21 |
| 2 | 0.31 | 54 | 0.21 | 4 | 0.27 | 47 | 0.18 |
| 8 | 0.24 | 115 | 0.19 | 6 | −0.22 | 23 | −0.17 |
| 5 | −0.18 | 106 | −0.19 | 15 | 0.22 | 71 | −0.16 |
Next, we discuss how the context-specific dynamic coupling relates to the force applied to the DNA template strand to influence catalysis. Each eigenvector ξm represents a collective mode of motion in the active site region of the ternary complex, with spring constant km=kBT/λm, (the equality stems from the quasi-harmonic approximation20). A force F applied along the DNA helix axis as in the case of the single molecule experiments will displace each ξm by, am =|F| cosθm/km, where, θm is the projection angle of F along ξm. Since the force is applied at the ends of the DNA template strand we project only the motions of template strand backbone (phosphate) atoms along the helix axis in calculating θm for each PC, see section S.3 in SM for further discussion. The resultant change in the active site geometry due to the applied force F along the DNA duplex axis is then: , where, R = (x1,x2‥‥x3N) is the vector representing the geometry of the active site region of N atoms. Here, R(0) is the ground state active site geometry at zero force (MD simulation average), for which X=XA(0), and R(F) is the new active state geometry due to F for which X=XA(F). The force along X which causes a displacement, ΔXA(F)=XA(F)−XA(0) is: Fx(F) = −kxΔXA(F). The spring constants kx can be obtained (Table II) from the distribution (histogram) of X values Px in our ground state MD trajectories by fitting a harmonic function to the energy of the distribution E(X)=(1/2)kxX2=−kBTln(Px). Since the free energy surface projected along the reaction coordinate has a maximum at the transition state, its the negative curvature is approximated as kts = −μ(ωts)2 = −2.77×103 pN/Å, where μ is the reduced mass of the O3′-Pα virtual bond = 16*32/48 AMU and ωts is the passage time in transition state theory, ωts=−kBT/ħ. Assuming that the same force acts on X throughout the systems’ passage from ground to transition state (the dynamic coupling is unaltered), the displacement of the transition state value of X, ΔXB(F)=XB(F)−XB(0) is given as:
| (1) |
Note that the displacements ΔXA(F) and ΔXB(F) have opposite signs due to positive/negative curvatures of the free energy for ground/transition states. In our calculations XB(0)=1.7 Å, the average value of the O3′-Pα distance seen in structures of post-catalytic (product) complexes. We have used the full set of 3N-6 PC modes to calculate the displacements ΔXA(F) and ΔXB(F) for all four models and obtained the ratio Keq(F)/Keq(0)=exp(ΔW(F)/kBT) of phosphoryl transfer rates in the presence of an external force relative to that in the zero force limit. The phosphoryl transfer rate increases with force (ratio>1) if ΔW is positive, i.e., if the force reduces X from its average value and decreases (ratio<1) for negative ΔW. In Fig. 4, we plot Keq(F)/Keq(0) versus force in the range |F|=1–6 pN for all four models. We consider two cases (that represent extreme limits and the correct scenario is likely in-between): (a) no TSS: F does not change XB(0) i.e. that the dynamic coupling present in state A is absent (completely disrupted) in state B and (b) with TSS: F changes XB(0) to XB(F) and the dynamical coupling is retained throughout the systems passage from state A to B. Based on Fig. 4, our predictions are that for three systems (G:C†, 8oxoG:C, and 8oxoG:A) the applied force induces a net reduction in X, and hence an increase in the phosphoryl transfer rate with increasing applied force. For the G:A system, the opposite is true, i.e. with force, X increases causing a net decrease in phosphoryl transfer rate. Thus, the increase or decrease of the relative replication rate with force is context-specific and is governed both by the dynamic coupling of the PCs with X (Table I) and the projection of the PCs along F (see also section S.3). These predictions, especially for the non-cognate substrates where phosphoryl transfer is indeed rate limiting (and hence governs the overall replication rate), are new and can be directly tested by carrying-out single molecule studies of DNA replication in response to weak applied forces. The dynamic coupling itself is strongest for the G:C system, i.e. the slow modes have a strong correlation with X (Table I). For the non-cognate substrates (G:A, 8oxoG:C and 8oxoG:A models) the dynamic coupling appears to be disrupted to differing extents and the most significant correlations of the O3′-Pα distance fluctuations shift to be with higher frequency (stiffer) modes (Table I). The stiffer modes by definition tend to induce smaller displacements of X and smaller shifts of the transition state (Eq. 1); in our PCA, the stiffer modes also show weaker projections along the applied force. The cumulative effect is a weaker (G:A, 8oxoG:A) or insignificant (8oxoG:C) dependence of the relative phosphoryl transfer rate on the applied force for the systems with non-cognate substrates (Fig. 4). PCA calculations performed using extended active site fragments show identical phosphoryl transfer rate versus force curves as reported in Fig. 4 (section S.2 SM), which establishes that our predictions are insensitive to our choice of the active-site.
Table II.
Spring constants (× 103 pN/Å) for all four models estimated from MD simulation data.
| Model | G:C | G:A | 8oxoG:C | 8oxoG:A |
|---|---|---|---|---|
| kx | 2.24 | 0.62 | 0.21 | 2.55 |
Figure 4.

Relative catalytic rate vs. force on the DNA template strand. TSS: Transition state shift.
To summarize, subject well-appreciated approximations‡, we have identified a dynamic coupling between slow delocalized polymerase-DNA modes and the O3′-Pα reactive distance for catalysis which is optimal only for the G:C system and is disrupted in systems with non-cognate DNA+dNTP substrates. The emerging theme from our studies is that the dynamical coupling alters the free energy landscape for phosphoryl transfer, i.e., its context-specific manifestation impacts the kinetics of phosphoryl transfer differently for different substrates. Thus the optimization of the dynamic coupling machinery through evolution can be a possible additional mechanism in the polymerase to achieve error control during DNA replication.
Supplementary Material
Acknowledgments
Computational resources were provided in part by the National Partnership for Advanced Computational Infrastructure under Grant MCB060006.
Footnotes
For G:C incorporation in KF this trend matches the experiments although experimental evidence also shows that the chemical step is not rate limiting for G:C.
These include imperfections in biomolecular force-fields, limited sampling in MD simulations, the linear response as well as quasiharmonic assumptions. In particular, our analysis would break down for strong forces due to anharmonicities, force induced mode mixing, and the coupling of the force to reactive distances other than X.
References
- 1.Joyce CM, Benkovic SJ. Biochemistry. 2004;43:14317. doi: 10.1021/bi048422z. [DOI] [PubMed] [Google Scholar]
- 2.Florian J, Goodman MF, Warshel A. Proc Natl Acad Sci USA. 2005;102:6819. doi: 10.1073/pnas.0408173102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Radhakrishnan R, Schlick T. Proc Natl Acad Sci USA. 2004;101:5970. doi: 10.1073/pnas.0308585101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Bustamante C, Bryant Z, Smith SB. Nature. 2003;421:423. doi: 10.1038/nature01405. [DOI] [PubMed] [Google Scholar]
- 5.Lavery R, Lebrun A, Allemand JF, et al. Journal of Physics-Condensed Matter. 2002;14:R383. [Google Scholar]
- 6.Schnitzer MJ, Visscher K, Block SM. Nature Cell Biology. 2000;2:718. doi: 10.1038/35036345. [DOI] [PubMed] [Google Scholar]
- 7.Guo Q, Sousa R. Journal of Molecular Biology. 2006;358:241. doi: 10.1016/j.jmb.2006.02.001. [DOI] [PubMed] [Google Scholar]
- 8.Wuite GJL, Smith SB, Young M, et al. Nature. 2000;404:103. doi: 10.1038/35003614. [DOI] [PubMed] [Google Scholar]
- 9.Maier B, Bensimon D, Croquette V. Proc Natl Acad Sci USA. 2000;97:12002. doi: 10.1073/pnas.97.22.12002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Andricioaei I, Goel A, Herschbach D, et al. Biophys J. 2004;87:1478. doi: 10.1529/biophysj.103.039313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Venkatramani R, Radhakrishnan R. Proteins Struct Funct Bioinf. 2007 In press. [Google Scholar]
- 12.Hammes-Schiffer S, Benkovic SJ. Annu Rev Biochem. 2006;75:519. doi: 10.1146/annurev.biochem.75.103004.142800. [DOI] [PubMed] [Google Scholar]
- 13.Radhakrishnan R, Arora K, Wang Y, et al. Biochemistry. 2006;45:15142. doi: 10.1021/bi061353z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Johnson SJ, Taylor JS, Beese LS. Proc Natl Acad Sci USA. 2003;100:3895. doi: 10.1073/pnas.0630532100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.MacKerell AD, Bashford D, Bellott M, et al. J Phys Chem B. 1998;102:3586. doi: 10.1021/jp973084f. [DOI] [PubMed] [Google Scholar]
- 16.Phillips JC, Braun R, Wang W, et al. J Comput Chem. 2005;26:1781. doi: 10.1002/jcc.20289. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Bustamante C, Chemla YR, Forde NR, et al. Annual Review of Biochemistry. 2004;73:705. doi: 10.1146/annurev.biochem.72.121801.161542. [DOI] [PubMed] [Google Scholar]
- 18.Lahiri SD, Zhang GF, Dunaway-Mariano D, et al. Science. 2003;299:2067. doi: 10.1126/science.1082710. [DOI] [PubMed] [Google Scholar]
- 19.Beese LS, Steitz TA. EMBO J. 1991;10:25. doi: 10.1002/j.1460-2075.1991.tb07917.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Amadei A, Linssen ABM, deGroot BL, et al. J Biomol Struct Dyn. 1996;13:615. doi: 10.1080/07391102.1996.10508874. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
