Abstract
Controversy surrounds the perceived absence of a relationship between DNA polymerase fidelity (kinetic discrimination) and free energy changes determined from DNA melting studies (thermodynamic discrimination). Thermodynamic discrimination together with aqueous solvent effects can account for kinetic fidelities on the order of those observed experimentally.
DNA polymerases are enzymes that are responsible for rapidly and accurately copying the genome. When immersed in an aqueous sea of A, T, G and C deoxyribonucleoside triphosphates (dNTPs), polymerases catalyse template-directed synthesis of Watson–Crick (W–C) base pairs in preference to mispairs. For many polymerases, this proceeds with very high selectivity, which is remarkable given that the choice to accept or reject a dNTP substrate must be made in about 10−3 s. Identifying the source of free energy that enables high-fidelity DNA replication would further our fundamental understanding of biology.
The thermal stabilities of AT and GC base-pair matches are greater than those of mismatches, such as AC and GT, on account of stronger hydrogen-bonding and base-stacking interactions in double-stranded DNA (dsDNA) comprising matched pairs1. How can DNA polymerases take advantage of the differences in standard free energy change (ΔΔG° = ΔG°(mismatch) − ΔG°(match)) between matched and mismatched base pairs to copy DNA accurately? This is a fundamental unresolved question. Replicative polymerases incorporate mismatches very rarely (f = e−ΔΔG°/RT; where f is the error frequency, T is temperature and R is the universal gas constant), with errors occuring only once every 103–106 base pairs. Under physiological conditions, this low error rate of high-fidelity polymerases necessitates ΔΔG° to be 4–8 kcal mol−1 (REF. 1). Determining the melting temperatures of DNA enables standard ΔG° values for individual stacked base pairs to be calculated; results are obtained by comparing matched versus mismatched pairs that are located internally in dsDNA2 or at primer–template DNA termini3. However, these data yield ΔΔG° values of about 2 kcal mol−1 or less, which alone would afford polymerase error frequencies of greater than 10−2. We have suggested that polymerase active sites may need additional sources of ΔΔG° — a supplement to the apparently small differences in ΔG° indicated by thermodynamic studies of DNA strand association2 or dissociation4 — to favour incorporation of matched pairs over mismatched ones. This Comment explores how and to what extent DNA thermodynamics in aqueous media can serve as a source of the ΔΔG° required for the high fidelities exhibited by certain DNA polymerases.
Stalling with pyrophosphate
The impetus for us to investigate anew the relationship between the thermodynamic stabilities of base pairs and polymerase kinetic base-pair selection came from an insightful recent study on DNA polymerase fidelity5. In that paper, Olson et al. questioned whether variations in the fidelity of DNA polymerases could arise from differences between the thermodynamic stabilities of base pairs and mispairs alone, or instead whether amplification of these differences in ΔG° are required in the form of specific discriminatory interactions in the active site. It was proposed that by examining match and mismatch incorporation rates in the presence of high concentrations of pyrophosphate (PPi) — used to equalize the rates of dNTP insertion and release — it becomes possible to calculate the equilibrium constant (Keq) for both match and mismatch incorporation; thus, the corresponding standard free energy changes, using ΔG° = −RTlnKeq, can be obtained.
The Vent and Pfu polymerases — thermally stable enzymes from Thermococcus litoralis and Pyrococcus furiosus, respectively — have proven particularly useful in studies of DNAn elongation to DNAn + 1. For example, Vent polymerase has been applied to the incorporation of dNTP (N = A or G), present in varying amounts, opposite to a T template. At high [PPi], the net conversion rate of DNAn to DNAn + 1 approaches zero over time, such that a constant concentration quotient [DNAn + 1]/[DNAn], interpreted as equilibrium, was attained after about 1 h in the case of the match reaction (A opposite to T) and about 20 h for the mismatch (G opposite to T)5. The equilibrium constant Keq = [DNAn + 1][PPi]/[DNAn][dNTP] was used to compute ΔG° in each case, and the resulting ΔΔG° values of 3.5–7 kcal mol−1 suggest that the imposition of additional kinetic constraints on the active site would be unnecessary. This conclusion was derived solely from match and mismatch incorporation data; pyrophosphorolysis — necessary for dNTP release from DNAn + 1 — was inferred but not measured.
The observation of constant [DNAn + 1]/[DNAn] values in the case of both match and mismatch incorporation reactions is a necessary but insufficient equilibrium condition. To be sure that a system has reached equilibrium, one must obtain the same steady-state ratios for [DNAn + 1]/[DNAn] when studying the reverse of both incorporation reactions by initiating them from the products. Therefore, we measured the rates of both the forward and reverse reactions3.The key result was that the reverse reaction was undetectable for mismatches; thus, pyrophosphorolysis of G at the primer-3′ end mispaired opposite to T did not occur3. Mismatch incorporation at high [PPi] is far from equilibrium despite showing an approximate levelling off3. Thus, the data of Olson et al.5 do not support the proposal that large ΔΔG° values obtained from polymerase-catalysed steady-state incorporation of matches and mismatches provide a direct measure of polymerase fidelity without having to consider active site constraints.
Although pyrophosphorolysis was not detected for the mismatch (G opposite to T), it was readily observed for the match (A opposite to T). The removal of A opposite to T reached steady state after ~30–60 min — the same timescale observed for incorporation of A opposite to T (REF. 3). The ΔG° values observed for the incorporation reaction and reverse were opposite in sign and of almost identical magnitude, which suggests that match incorporation at high [PPi] was indeed at or near equilibrium. Overall, it is noted that polymerase-catalysed match incorporation reaches steady state close to equilibrium when subjected to reversal with PPi, whereas Vent polymerase-catalysed mismatch incorporation seems to be irreversible even upon treatment with excess PPi. These results prompted us to re-examine here what DNA melting studies for matched and mismatched base pairs in aqueous solution can reveal about DNA polymerase fidelity when the environmental influence of H2O hydrogen bonding is considered.
The influence of solvent on ΔG°
The influence of the solvent environment on DNA stability can be determined by plotting ΔS° versus ΔH° for melting of each kind of DNA doublet (d) (FIG. 1), with values obtained by measuring the melting temperature (Tm) of dsDNA in 1 M aqueous NaCl (REF. 2). The data in the upper range of and correspond to the ten W–C nearest-neighbour doublets, whereas those in the lower range are for a subset of doublets that contain a single mispair2. The GA–CT doublet is of the type used in our polymerase fidelity experiment, in which correct incorporation of A opposite to T takes place next to 5′-nearest-neighbour G (on primer) bound to C (on template)3. Match incorporation reaches equilibrium with respect to pyrophosphorolysis at high [PPi], yielding (REF. 3), a value much larger in magnitude than calculated using and for melting of doublet GA–CT2. The value of is small because the large and positive value of is almost cancelled by the entropic contribution . This approximate cancellation has been referred to as “enthalpy–entropy compensation” (REF. 4). For W–C doublets at 37 °C, is largest in the case of CG–GC (2.24 kcal mol−1) and smallest for TA–AT (0.56 kcal mol−1).
Figure 1. Correlation between ΔS° and ΔH° associated with melting nearest-neighbour doublets in DNA.
For the ten Watson–Crick (W–C) doublets, the values of the standard enthalpy change and the standard entropy change represent the average of measurements reported in the literature2. The values (±0.3 kcal mol−1) range from 11.0 (for CG–GC) to 6.6 (for TA–AT). versus is fit to a rectangular hyperbola. Also shown are values (±0.6 kcal mol−1) for doublets with single mismatches, with the values for CG–GT, CC–GT, CA–GC and TG–AA, ranging from 4.1 to −3.0 kcal mol−1 (REF. 2). All data were acquired in aqueous NaCl (1 M) at pH 7.
The above method for determining the of melting for an individual doublet does not take into account the environmental (e) influence of H2O, with doublet melting being accompanied by significant rearrangement of the hydrogen bonding in the solvation shell. The contribution of the solvent to can be obtained by deriving an analytical curve fit for the observed relationship between and (FIG. 1). Clearly, is not constant, and is not directly proportional to as has previously been suggested6. Instead, we have proposed that Tm is linearly related to according to the relationship , where T0 is a constant corresponding to the value of Tm when , and a is an entropy constant with a value greater than any (REF. 4). Upon substituting in place of Tm, we obtain the hyperbolic function (REF. 4). This hyperbolic equation for melting of nearest-neighbour doublets in dsDNA has been confirmed by a statistical thermodynamic treatment of dsDNA melting7. The constants a and T0 define the shape of the curve and were evaluated by a least-squares fit to the data for the ten W–C doublets (FIG. 1), which gives a = 80 ± 2 cal mol−1 K−1. When , the slope of the curve is 1/T0, and Tm for the doublet becomes T0 = 273 ± 5 K. This value of T0 is close to the melting temperature of H2O (REF. 4), which implies that hydrogen bonding between H2O molecules within the solvation shell contributes to the stability of each doublet. The aqueous solvent effects on doublet melting can now be described as without changing the Tm value for each doublet, such that . Accordingly, the environmentally adjusted is the sum of and the constant aT0 (21.8 kcal mol−1), whereas remains constant at the value of a = 80 cal mol−1 K−1.
When the contribution from an aqueous medium is described in this manner for melting of doublet GA–CT, then , which agrees in magnitude with the polymerase result, , determined from Keq between polymerization and phosphorolysis3. Although the entropic parameter a is surprisingly large, the ‘proof of the pudding is in the eating’, as shown by the quantitative accord between determined for the polymerase and the thermodynamic value for the same nearest-neighbour stacking partners. We suggest that the thermodynamic stability of matched dsDNA, reinforced by hydrogen bonding in the immediate aqueous environment, provides a sufficiently negative ΔG° value for DNA polymerases to preferentially mediate formation of matched W–C base pairs.
“the thermodynamic stability of matched dsDNA, reinforced by hydrogen bonding in the immediate aqueous environment, provides a sufficiently negative ΔG° value”
Relating to DNA polymerase fidelity
When cognate and non-cognate dNTPs are present in equimolar concentrations, these substrates compete for incorporation by polymerases. As noted earlier, under conditions that promote optimal forward rates for DNA synthesis, ΔΔG° is in the range 4–8 kcal mol−1 (REF. 1). With respect to the influence of solvent on the ΔΔG° between doublet formation, is predicted to be constant for all doublets — that is, matches and mismatches — such that . In addition, , and as aT0 is the same for all doublets, . Consequently, .
For the ten W–C doublets in 1 M aqueous NaCl (FIG. 1), ranges from 11.0 (for CG–GC) to 6.6 kcal mol−1 (for TA–AT). For doublets with single mismatches, is between 4.1 and −3.0 kcal mol−1. After the inclusion of solvent effects, ranges between about 7 and 13 kcal mol−1, so the magnitudes of are either roughly coincident with or far exceed the ΔΔG° values needed to account for DNA polymerase fidelity.
Linking solvent, geometry and fidelity
The reversibility and irreversibility with which Vent polymerase incorporates matches and mismatches, respectively, offers fresh insight into the geometric constraints of DNA polymerase active sites in relation to fidelity3. Several kinetic studies have delineated individual steps in the polymerase catalytic cycle: from the binding of cognate and non-cognate substrates, to polymerase conformational changes and phosphodiester bond chemistry8. Each step may serve as a kinetic fidelity ‘checkpoint’ that discriminates between matches and mismatches in a polymerase-specific manner9.
Each step in the match and mismatch incorporation pathways must adhere to microscopic reversibility. Given the many steps involved, it might seem surprising that match incorporation is overall reversible. Yet, when Vent and Pfu polymerases process AT pairs, the similar magnitudes of and imply that the substrates and products remain properly aligned throughout the forward and reverse pathways3. The overriding importance of maintaining optimal geometric alignment for catalysis in the active site of DNA polymerases is discussed in a concurrent Review article in Nature Reviews Chemistry9. Although binding of Vent or Pfu to a terminal G (primer)⋯T (template) mismatch is strong enough to allow 100% extension by incorporation of a next correct (TA) base pair, misalignments of the GT mismatch in subsequent reverse checkpoint steps are apparently too severe to allow nucleophilic attack by PPi to yield and release detectable dGTP after a two-day incubation with Vent polymerse3. In the case of Pfu polymerase, some dGTP release is observed (<2%), although the system nevertheless remains far from equilibrium3.
We have shown here that DNA hydrogen-bonding and base-stacking interactions can provide an ample source of ΔΔG° to account for high-fidelity DNA synthesis as long as the stabilizing hydrogen-bonding effects of the solvent are also accounted for. The relevance of the values (obtained from doublet melting thermodynamics) to the values (derived from polymerase kinetics) originates from the retention of hydrogen-bonding and stacking primer– template interactions within the DNA polymerase active site. Geometric selection1 can be imposed on the active site through induced fit mechanisms that properly align cognate dNTP, which suppress entropic losses associated with the necessary conformational changes and chemical step. In this way, the enzymes take full advantage of the different ΔH° values associated with installing matches and mismatches.
From a ‘DNA-centric’ perspective, ΔΔG° values for match and mismatch pairings are insufficient to account for high polymerase fidelity. Our Comment shows that a ‘solvent + DNA’ perspective affords much larger differences in ΔG°, the magnitudes of which are easily sufficient to account for high polymerase fidelities. Different DNA polymerases impose active site constraints that can strongly modify incorporation specificities. One striking example is DNA polymerase η, the X-ray structure of which reveals an active site that favours a wobble GT base mispair at 3′-AT template motifs — a hallmark of somatic hypermutation of immunoglobin variable DNA regions that can enable an immune system to adapt to new threats10. Although the difference in ΔG° from solvent reinforced hydrogen bonding and base stacking is sufficient for discrimination between substrates, nuances in the polymerase active site structures may regulate selection. As such, the linkage between solvent, geometry and fidelity remains in the hands of the polymerase.
Acknowledgments
The authors thank C. H. Mak and K. Oertell for insightful discussions, and J. G. Bertram for collating the data. This work was supported by the US National Institutes of Health (Grant Nos ES012259, U19CA177547 and GM214422 to M.F.G.).
Footnotes
Competing interests statement
The authors declare no competing interests.
References
- 1.Echols H, Goodman MF. Fidelity mechanisms in DNA replication. Annu. Rev. Biochem. 1991;60:477–511. doi: 10.1146/annurev.bi.60.070191.002401. [DOI] [PubMed] [Google Scholar]
- 2.SantaLucia J, Hicks D. The thermodynamics of DNA structural motifs. Annu. Rev. Biophys. Biomol. Struct. 2004;33:415–440. doi: 10.1146/annurev.biophys.32.110601.141800. [DOI] [PubMed] [Google Scholar]
- 3.Oertell K, et al. Kinetic selection versus free energy of DNA base pairing in control of polymerase fidelity. Proc. Natl Acad. Sci. USA. 2016;113:E2277–E2285. doi: 10.1073/pnas.1600279113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Petruska J, Goodman MF. Enthalpy–entropy compensation in DNA melting thermodynamics. J. Biol. Chem. 1995;270:746–750. doi: 10.1074/jbc.270.2.746. [DOI] [PubMed] [Google Scholar]
- 5.Olson AC, Patro JN, Urban M, Kuchta RD. The energetic difference between synthesis of correct and incorrect base pairs accounts for highly accurate DNA replication. J. Am. Chem. Soc. 2013;135:1205–1208. doi: 10.1021/ja309866m. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Gotoh O, Tagashira Y. Locations of frequently opening regions on natural DNAs and their relation to functional loci. Biopolymers. 1981;20:1043–1058. doi: 10.1002/bip.1981.360200514. [DOI] [PubMed] [Google Scholar]
- 7.Starikov EB, Nordén B. Physical rationale behind the nonlinear enthalpy–entropy compensation in DNA duplex stability. J. Phys. Chem. B. 2009;113:4698–4707. doi: 10.1021/jp8089424. [DOI] [PubMed] [Google Scholar]
- 8.Joyce CM, Benkovic SJ. DNA polymerase fidelity: kinetics, structure, and checkpoints. Biochemistry. 2004;43:14317–14324. doi: 10.1021/bi048422z. [DOI] [PubMed] [Google Scholar]
- 9.Wu W-J, Yang W, Tsai M-D. How DNA polymerases catalyse replication and repair with contrasting fidelity. Nat. Rev. Chem. 2017;1:0068. [Google Scholar]
- 10.Zhao Y, et al. Mechanism of somatic hypermutation at the WA motif by human DNA polymerase η. Proc. Natl Acad. Sci. USA. 2013;110:8146–8151. doi: 10.1073/pnas.1303126110. [DOI] [PMC free article] [PubMed] [Google Scholar]

