Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2023 Mar 8;13:3915. doi: 10.1038/s41598-023-30620-0

Uracil-DNA glycosylase efficiency is modulated by substrate rigidity

Paul B Orndorff 1,#, Souvik Poddar 2,3,#, Aerial M Owens 2,4,#, Nikita Kumari 2,3, Bryan T Ugaz 2,3, Samrat Amin 5, Wade D Van Horn 2,4,, Arjan van der Vaart 1,, Marcia Levitus 2,3,
PMCID: PMC9995336  PMID: 36890276

Abstract

Uracil DNA-glycosylase (UNG) is a DNA repair enzyme that removes the highly mutagenic uracil lesion from DNA using a base flipping mechanism. Although this enzyme has evolved to remove uracil from diverse sequence contexts, UNG excision efficiency depends on DNA sequence. To provide the molecular basis for rationalizing UNG substrate preferences, we used time-resolved fluorescence spectroscopy, NMR imino proton exchange measurements, and molecular dynamics simulations to measure UNG specificity constants (kcat/KM) and DNA flexibilities for DNA substrates containing central AUT, TUA, AUA, and TUT motifs. Our study shows that UNG efficiency is dictated by the intrinsic deformability around the lesion, establishes a direct relationship between substrate flexibility modes and UNG efficiency, and shows that bases immediately adjacent to the uracil are allosterically coupled and have the greatest impact on substrate flexibility and UNG activity. The finding that substrate flexibility controls UNG efficiency is likely significant for other repair enzymes and has major implications for the understanding of mutation hotspot genesis, molecular evolution, and base editing.

Subject terms: DNA, Biophysical chemistry, DNA repair enzymes

Introduction

Cellular repair pathways maintain genetic integrity against thousands of daily DNA lesions. The repair of small, non-helix distorting lesions, is initiated by specialized DNA glycosylases that catalyze the excision of the damaged base1. DNA sequence effects have been identified for many glycosylases and alkyl-transferases that repair base mismatches, uracil bases, alkylated bases, etc27. However, the molecular features that give rise to preferences for particular DNA sequences have not yet been resolved. Here, we focus on understanding how DNA sequence impacts the repair of uracil by uracil-DNA glycosylase (UNG). Uracil is a highly mutagenic and common lesion in DNA that arises from dUTP misincorporation or from spontaneous cytosine deamination8. Unrepaired cytosine deamination results in U:G mismatches that give rise to G:C to A:T transition mutations9.

The crystal structure of the human UNG-DNA complex shows the uracil extruded from the duplex into the enzyme active site10. The first step in uracil recognition by UNG relies on trapping spontaneous extrahelical excursions of uracil1113, which occur at enhanced rates compared to thymine in the same sequence context14. Consistent with this observation, a thermodynamic study of the effect of thymine to uracil substitutions found reduced base stacking in U:A base pairs compared to T:A15. While these studies provide a molecular framework for understanding how UNG distinguishes between uracil and thymine, the molecular principles that determine UNG substrate preferences are still poorly understood. In E. coli, the efficiency of UNG varies more than 15-fold depending on the DNA context4,5. A vicinal thymine 3′ of uracil generally results in poor excision, and substrates with high local GC content are generally poor substrates16,17. Yet, the underlying principles that dictate UNG substrate preferences still remain elusive.

Preferential excision of uracil does not correlate with DNA melting temperatures; for instance, substrates containing uracil in TUA contexts are better UNG substrates than sequences with AUT contexts4,5. The differences between TUA and AUT sequences are remarkable and are reminiscent of the asymmetries observed in the mechanical properties of undamaged DNA. TA steps are particularly flexible with regard to roll, slide, and twist, while AT steps are more rigid18,19. DNA flexibility is indeed suspected to dictate the binding preferences of various non-specific DNA enzymes and proteins. For instance, DNA flexibility explains sequence preferences in phosphodiester backbone cleavage by endonuclease DNase 120, and the role of DNA sequence in nucleosome stability and dynamics21. In this work, we test the hypothesis that UNG activity is dictated by the intrinsic local deformability of the DNA sequence around the uracil. Structural studies provide a rationale for this hypothesis; the formation of the catalytically active UNG-DNA complex requires DNA hydrogen bond breakage and loss of stabilizing base-stacking interactions10,22, and because these interactions are determined by the mechanical properties of DNA, the intrinsic deformability of the region surrounding the lesion is likely an important variable in understanding catalytic efficiency. A previous study that compared substrates containing uracil in AUA and GUG contexts hinted at a connection between substrate flexibility and UNG activity23, but the small scale of the study (2 substrates) was not sufficient to establish a correlation between the two variables, nor to determine the molecular basis for substrate preference.

To illuminate the link between UNG repair efficiency and substrate flexibility, we determined UNG specificity constants (kcat/KM) for a variety of designed substrates and correlated the outcomes with the results of biophysical experiments and MD simulations that probe DNA substrate flexibility. Our results not only establish a clear link between the fundamental nature of UNG substrate flexibility, the contributions of distinct substrate dynamics, and the hierarchy of these motions in regulating the resulting UNG catalytic efficiency, but also identify and quantify allosteric coupling of bases flanking the uracil.

Materials and methods

Enzymes, oligonucleotides, and reagents

Uracil-DNA Glycosylase (UNG, MW = 25.7 kDa) was purchased from New England BioLabs (Catalog # M0280L) at a concentration of 5000 U/mL. Free 2-aminopurine riboside was purchased from AstaTech (PA, USA). Oligonucleotides containing canonical bases, uracil, or 2-aminopurine (2AP) were purchased from IDT (IA, USA) as desalted oligonucleotides. All DNA sequences are listed in Table 1 and are represented by a number and two letters. The number refers to the different DNA series (1–4) and the letters refer to the bases adjacent to the uracil. For instance, the four substrates in series 1 (1TA, 1TT, 1AA and 1AT) share the same overall sequence but differ in the bases adjacent to the uracil. Oligonucleotides for fluorescence experiments (kinetic assays, time-resolved fluorescence, and fluorescence quantum yields) were 39 nt in length (Table 1). Oligonucleotides for NMR experiments were shortened to the central 13 nucleotides to facilitate resonance assignments while still ensuring duplex formation. All oligonucleotides were solubilized in 1 × PBS buffer (20 mM sodium phosphate, 100 mM NaCl, 50 µM EDTA, pH 7.0). Concentrations were determined from measured absorbances at 260 nm using extinction coefficients provided by IDT.

Table 1.

DNA sequences.

Name Motifa Fluorescence/Kineticsb NMR Simulation
1TA 5′ T1G2C3A4T5U6A7A8T9G10A11C12 3′ 5′ L1 Motif R1 3′ 5′ G Motif 3′ 5′ CG Motif G 3′
1TT 5′ T1G2C3A4T5U6T7A8T9G10A11C12 3′ 5′ L1 Motif R1 3′ 5′ G Motif 3′ 5′ CG Motif G 3′
1AT 5′ T1G2C3A4A5U6T7A8T9G10A11C12 3′ 5′ L2 Motif R1 3′ 5′ G Motif 3′ 5′ CG Motif G 3′
1AA 5′ T1G2C3A4A5U6A7A8T9G10A11C12 3′ 5′ L2 Motif R1 3′ 5′ G Motif 3′ 5′ CG Motif G 3′
2TA 5′ T1G2A3A4T5U6A7G8T9T10A11C12 3′ 5′ L3 Motif R2 3′ 5′ G Motif 3′ 5′ CG Motif G 3′
2TT 5′ T1G2A3A4T5U6T7G8T9T10A11C12 3′ 5′ L3 Motif R1 3′ 5′ G Motif 3′ 5′ CG Motif G 3′
2AT 5′ T1G2A3A4A5U6T7G8T9T10A11C12 3′ 5′ L3 Motif R1 3′ 5′ G Motif 3′ 5′ CG Motif G 3′
2AA 5′ T1G2A3A4A5U6A7G8T9T10A11C12 3′ 5′ L3 Motif R2 3′ 5′ G Motif 3′ 5′ CG Motif G 3′
3AT 5′ C1T2G3C4A5U6T7A8A9T10G11C12 3′ 5′ L1 Motif R1 3′ N/A 5′ CG Motif G 3′
4TA 5′ T1A2A3C4T5U6A7C8A9T10T11C12 3′ 5′ L4 Motif R3 3′ N/A 5′ CG Motif G 3′

aAll substrates pair adenine (NMR, simulations) or 2AP (fluorescence, kinetics) opposite uracil in the complementary strand. bL1 = 5′ CTATATTGGAAGCT 3′, L2 = 5′ CTATGTTGGAAGCT 3′, L3 = 5′ CTAGAATGGAAGCT 3′, L4 = 5′ CTATATTCGCAGCT 3′, R1 = 5′ TCTGTACACGAAG 3′, R2 = 5′ TCAGTACACGAAG 3′, R3 = 5′ TCTCAACACGCTT 3′

Sample preparation

Duplex DNA substrates used for the fluorescence-based experiments were prepared by annealing uracil-containing oligonucleotides with complementary strands containing 2AP opposite to the uracil. For the kinetic assays, a small excess of the 2AP-strand is preferable to an excess of the uracil-containing strand because the latter is also a substrate of UNG, and therefore its presence may affect the measured kinetic rates. For the kinetic assays, DNA substrates were prepared by annealing the strands at room temperature while monitoring the fluorescence intensity of 2AP in real-time. The uracil-containing strand was added to a known concentration of 2AP-containing strand, and the reduction in fluorescence intensity due to the formation of the duplex was measured until a small addition of the uracil strand did not result in a further decrease. The concentration of the resulting dsDNA substrate was calculated from the absorbance of the initial 2AP-containing strand and the volumes before and after adding the uracil strand. Traditional native polyacrylamide gel electrophoresis was used to confirm that the annealing procedure at room temperature was highly efficient and did not result in any measurable 2AP- or uracil-containing single strands. For the time-resolved and fluorescence quantum yield experiments, a slight excess of the U-strand is preferable to an excess of the 2AP-strand because 2AP in ssDNA is significantly brighter than 2AP in a duplex. Samples for these experiments were prepared as before and followed by the addition of a ~ 20% excess of the uracil-containing strand. In each case, we verified that further addition of the uracil-containing strand did not change the measured lifetimes or quantum yields. Duplexes for NMR experiments were prepared from complementary oligonucleotides mixed at 1:1 molar ratios that were heated to ~ 80 °C and annealed by cooling to room temperature for ~ 2 h.

UNG kinetic assay and data analysis

A continuous fluorescence kinetic assay was used to measure UNG activity on DNA substrates containing 2AP opposite to the uracil24. 2AP is highly fluorescent when exposed to water but is highly quenched in dsDNA. The cleavage of the dU glycosidic bond by UNG results in an aldehydic abasic site opposite to 2AP, and this environmental change leads to a large fluorescence increase. The increase in fluorescence intensity can be used to calculate the reaction rate in a continuous kinetic assay. The kinetic parameters measured in this way are indistinguishable from the values obtained using a traditional radioactivity-based electrophoresis assay with non-fluorescent substrates24. In addition, the perturbation introduced by the 2AP probe has been shown to have a negligible effect on both the measured dissociation and association kinetic constants of the UNG-DNA complex25. 600 μL of duplex DNA was placed in a quartz micro cuvette (optical path length 10 × 2 mm) and fluorescence intensity (λex = 310 nm, λem = 370 nm) was monitored continuously before and after addition of UNG. Fluorescence intensities were corrected in real-time for potential fluctuations in the incident intensity over the long measurement times. A small amount of stock enzyme (5,000 U/mL) was diluted 40-fold in 1 × PBS buffer containing 1 mg/mL BSA, and this dilution was stored at 4 °C for no longer than six hours and used for all kinetic experiments performed the same day. For the kinetic experiments, 2 μL of this dilution were added to the cuvette containing the DNA substrate for a final concentration of UNG in the assay mixture of 0.16 nM. The initial velocity (V0, units of M·s-1) was obtained from the initial slope of the measured intensity (F(t)) as V0=[S]0ΔFdFdt0, where [S]0 is the concentration of DNA substrate (0.075–6 μM) and ΔF is the total change in fluorescence. A representative sample run and a sample calculation are shown in Fig. S1. Error bars in Figs. 1 and S7 represent 95% confidence intervals. To rule out systematic sources of error, for 1AT, 23 trials were performed at 0.3 μM concentration involving (1) at least four different purchased UNG stock solutions, (2) three different experimentalists, and (3) DNA substrates prepared from oligos purchased at different times. The V0 versus [DNA] curves were fitted using the Michaelis–Menten equation in Origin Pro (Northampton, MA) using the Lavenberg Marquardt iteration algorithm and using the reciprocal of the variances as weights.

Figure 1.

Figure 1

Experimental initial rates (V0) against substrate concentration for substrates 1TA (circles) and 2AT (squares). Results for all other samples are shown in Fig. S7. Each data point is the mean of at least three independent experiments. Error bars represent 95% confidence intervals. The data have been fitted to the Michaelis–Menten equation (see Table S4 for all Michalis-Menten parameters). Inset: Values of kcat/KM for all substrates (see Table S4). Bar colors match the colors of the symbols used in other figures for the same sequences.

Time-resolved fluorescence

Time-resolved fluorescence intensity measurements were performed using the time-correlated single-photon counting (TCSPC) technique. A mode-locked Ti:Sapphire laser (Mira 900, Coherent) pumped by a frequency-doubled Nd:YVO4 laser (44% from an 18 W Verdi, Coherent) was used as the excitation source. The 130 fs light pulses (at 800 nm with a repetition rate of 250 kHz) were generated by a regeneratively amplified Ti:S laser system (RegA 9000, Coherent Laser). The pulses were sent to an optical parametric amplifier (OPA) to generate the excitation light at 620 nm and then frequency-doubled to obtain excitation pulses at 310 nm. Fluorescence emission was collected at a 90° geometry setting and detected using a double-grating monochromator (Oriel Instruments) and a microchannel plate photomultiplier tube (Hamamatsu R3809U-51). Decays were measured at three emission wavelengths (380, 390, and 400 nm) for global analysis as described below. The polarization of the emission was 54.7° relative to that of the excitation (magic angle). A single-photon counting card (Becker-Hickel, SPC-830) using two time windows (3.3 ns and 25 ns) was used for data acquisition. Instrumental response functions (IRF) were determined for both time resolutions. The typical IRF had a FWHM of 40 ps, measured from the scattering of Ludox sample at 310 nm. The three decays obtained at different emission wavelengths for each sample were fitted globally keeping the lifetimes as common parameters among the three data sets. This approach minimizes the problem of correlation between pre-exponential factors and lifetimes, which is common when fitting multi-exponential decays26. The fitting parameters were obtained through iterative reconvolution of the model function Fλ,t=F0λi=1nαiλe-t/τi with the measured IRF using an in-home written software package (ASUFIT). Here, λ represents the emission wavelength, and i=1nαiλ=1. Lifetimes as long as ~ 8 ns and as short as ~ 30 ps are expected26, and in our experience, results are more robust and reproducible if the shorter components are determined from data measured with the highest time resolution achievable with the single-photon counting card (814 fs/channel, 212 channels = 3.3 ns total), while the longer lifetimes are obtained from data measured with a wider time window. The decays measured using 3.3 ns acquisition windows were fitted using three exponential terms. A fourth term did not improve the quality of the fit. A new fit was then conducted using the decays measured with a 25 ns acquisition window (6.1 ps/channel) using four exponential terms but fixing the two shortest lifetimes (τ1 and τ2) to the values obtained in the previous fit. In this way, the fitting parameters in the second fit were τ3, τ4 and α1-4λ. This procedure allows a more accurate determination of τ1 and τ2 (both below 0.5 ns) from measurements using 814 fs/channel resolution, and τ3 and τ4 (both over 1 ns) from measurements using 6.1 ps/channel resolution. Mean lifetimes were calculated for each sample as τ=i=14αiτi using the αi values obtained in the second fit.

Fluorescence quantum yields

Fluorescence quantum yields (ϕ) were determined relative to a reference as ϕs=ϕRIsIR×1-10-AR1-10-AS, where the subscripts “S” and “R” refer to the sample and reference, respectively, I is the integrated emission intensity measured over the entire fluorescence band, and A is the absorbance at the excitation wavelength (315 nm)27. Absorbances were kept below 0.05 to avoid inner filter effects. Experiments replacing the 2AP-containing strand with an adenine-containing strand were used to verify that the absorbance of the canonical bases at 315 nm was negligible compared to the absorbance measured with the 2AP-containing DNA. Steady-state emission fluorescence spectra were acquired on a PTI Quantamaster 4/2005SE spectrofluorimeter. Fluorescence spectra showed clear contributions from Raman scattering at 352 nm, and to account for these contributions a buffer sample was used as a blank and subtracted from the measurements containing 2AP-containing DNA. The free 2AP riboside is commonly used as a reference (ϕR=0.68 in water)28, but fluorescence intensities in duplex DNA are reduced 100-fold or more due to quenching, and therefore the determination of ϕ in DNA involves measuring very small fluorescence intensities. To improve accuracy, we performed five independent determinations of ϕ for the sample with the highest quantum yield (1AT) using free 2AP riboside in water as a reference. The average of five determinations was ϕ1AT = 0.0103 (standard deviation = 6.5 × 10–4, 95% confidence interval = 8.1 × 10–4), and this value was subsequently used as a reference for the ϕ determinations of all other 2AP-DNA samples. Values listed in Table S1 are averages of 4 independent determinations. All standard deviations are 3% or lower.

Fractional population of highly stacked species (α0)

The fractional population of dark (highly stacked) 2AP probes in the duplex DNA substrates was calculated from the sample mean lifetime (τ) and fluorescence quantum yield (ϕ) as α0=1-τ2APτϕϕ2AP, with ϕ2AP=0.68 and τ2AP=10.2 ns28,29. Errors were estimated as Δα0=τ2APϕ2AP1τΔϕ2+τ2APϕ2APϕτ2Δτ21/2. Values of Δϕ were taken as the standard deviation of the quantum yield measurements for each sample. Δτ values were estimated from the two repeats performed for each sample (Table S2). Because performing large numbers of repeats for individual samples in unfeasible due to time and cost, we calculated the percent deviation of each τ determination (two values for each of the six measured samples) from the respective average for the same sample. We determined that, on average, τ values are measured with a 2.5% precision. This value was then used to estimate Δτ for each sample.

NMR-detected imino proton exchange rate measurements

NMR samples were prepared in 3 mm tubes with DNA concentrations ranging from 2 to 4 mM with 5% v/v deuterium oxide. NMR experiments were performed on a Bruker 850 MHz Avance III HD spectrometer equipped with a 5 mm TCI CryoProbe, and a Bruker 600 MHz Avance III HD spectrometer equipped with a Prodigy probe. NMR spectra were processed and analyzed using Bruker TopSpin 4.1, MestReNova 14.2, and Matlab 2019b.

Resonance assignments

Two-dimensional 1H, 1H—Nuclear Overhauser Effect Spectroscopy (NOESY) experiments were recorded at 20 °C utilizing water suppression by excitation sculpting. The resulting 2D spectra were used to assign imino protons for each duplex using traditional methods (Figs. S2, S3)30,31. All assignments and subsequent experiments were collected at 20 °C.

Water inversion efficiency factor (E)

The water inversion efficiency factor was measured as previously described32, with a relaxation delay of 30 s, and is further described in the supplementary information (Fig. S4). Data processing and fitting were completed in Bruker TopSpin 4.1 and MestReNova 14.2.

Longitudinal relaxation rate of water (R1w)

The relaxation rate of water (R1w) was measured utilizing a previously described saturation-recovery method that is compatible with high Q cryoprobes32. Variable time delays ranged from 1 ms to 18 s. Data processing and fitting were completed in Bruker TopSpin 4.1 and Matlab 2019b for verification (Fig. S5). Determination of R1w was completed using the TopSpin T1/T2 Module.

Imino proton longitudinal relaxation (R1n) and exchange rates (kex)

The sum of the longitudinal imino proton relaxation rates (R1) and the imino proton solvent exchange rate (kex) can be used to determine the longitudinal relaxation rate of each imino proton (R1n). A pseudo-two-dimensional experiment was implemented to determine the longitudinal relaxation of imino protons (R1n) and the imino proton exchange rates (kex) following established methods33 utilizing a 24-point variable delay sequence ranging from 1 ms to 15 s. The respective spectra were processed in TopSpin 4.1, and the data were fit with MATLAB R2019b (MathWorks) using nonlinear least-squares fit, first fitting for the longitudinal relaxation of imino protons (R1n) followed by the exchange rate of imino protons (kex). Representative fits of R1n and kex are shown in Fig. S6. The kex was determined by fitting the individual peak areas to the equation:

AtA0=1-E·kex/R1w-R1n·e-R1n·t-e-R1w·t 1

where A(t) is the area of the peak at exchange time point t, A0 is the peak intensity at equilibrium, E (determined experimentally) is the water inversion efficiency factor, R1w (determined experimentally) is the water longitudinal relaxation rate, and R1n (determined experimentally) is the sum of imino proton longitudinal relaxation rates and its solvent exchange rate. The reported errors were estimated from the fitting of kex (Table S3).

Double mutant cycles and base-pair coupling analyses

The energetic impact of base substitutions was quantified from thermodynamic cycles that depict differences in transition state free energies: ΔΔG12=ΔG2-ΔG1 , where indices 1 and 2 indicate two different DNA sequences34. Two cycles were constructed. The first was based on NMR data, where ΔG represents the barrier for imino exchange, and ΔΔG12=-RTlnkex2kex1 . Differences between the 2TA, 2TT, 2AA, and 2AT sequences were assessed. The second cycle was based on kcat/KM measurements, where ΔG represents the barrier for enzymatic uracil excision, and ΔΔG12=-RTlnkcat/KM2kcat/KM1. In this cycle, differences between 2AT, 2TT, 2AA, 2TA, and 1AT, 1TT, 1AA, 1TA (see Table 1 for substrate nomenclature) were calculated. From these cycles, coupling free energies

ΔGcoup=ΔΔGATTT-ΔΔGAATA=ΔΔGATAA-ΔΔGTTTA 2

between the base pairs directly adjacent to uracil were assessed. ΔGcoup is zero when these base pairs are independent of each other, and nonzero when they are coupled and influence each other. Since the transition state free energies of the two cycles correspond to different processes, free energy differences and coupling free energies are expected to differ between the thermodynamic cycles.

MD simulations

The dsDNA sequences of Table 1 were built in the unbent BII conformation using 3DNA35; U6 was base-paired to A19. Each strand was solvated in a rectangular TIP3P water box36 of 100 mM NaCl with a solvent layer of 15 Å in each direction. After energy minimizations, each system was heated from 100 to 300 K over 2.5 ns with a 1 kcal/(mol·Å2) harmonic restraint on all DNA atoms. During heating, flat bottom distance restraints with a force constant of 1 kcal/(mol·Å2) were added to the hydrogen bonds between the bases. After heating, the harmonic restraints on the DNA atoms were gradually removed over 1.2 ns, while restraints on the hydrogen bonds remained in effect. The latter were subsequently removed over an additional 3 ns. The unrestrained systems were then equilibrated for 400 ns, followed by at least 600 ns of production simulations. Heating and restraints removal were performed with Langevin dynamics in AMBER37, while the production runs were done with Langevin dynamics in OPENMM38. The simulations were performed in NPT, periodic boundary conditions were in effect, SHAKE39 was applied to all covalently bonded hydrogen atoms, and long-range electrostatic interactions were handled using the particle-mesh Ewald method40. All simulations used the AMBER OL15 DNA force field41; deoxyribose parameters for U were taken from T deoxyribose. Convergence was assessed by monitoring cumulative averages of DNA bending and total winding angles. In addition, all trajectories were decorrelated using pymbar42, and all properties were calculated from 100 decorrelated frames per trajectory. If needed, simulations were extended for additional blocks of 500 ns until convergence. Simulations were run in triplicate for each strand.

Geometric analyses of the DNA base and step parameters were performed with 3DNA35. DNA bending angles (ɸ) were calculated from tilt, roll, and twist base step angles using the MADBEND procedure43,44. Bending persistence lengths (BPLs) were calculated from: BPL=-LlnPϕ1-cosϕ45, where L is the contour length, and P(ɸ) the probability of observing a particular bending angle. Contour lengths were calculated from the sum of the helical rise; to account for fraying, the terminal base steps were excluded from the ɸ and BPL analyses. Torsional persistence lengths (TPLs) were calculated from the variance of the total winding angle (ΔΩ2): TPL=L/ΔΩ246. The total winding angle was calculated as the sum of the individual twist steps for each sequence. To account for fraying and base flipping, the two terminal base steps and those neighboring U6 were excluded from the sum.

Extrahelical flipping of uracil was assessed by monitoring the hydrogen bonding distance between HN3 of U6 and N1 of its complementary A19, and the flipping angle. This flipping angle was taken as the pseudo-dihedral angle between the center of mass (COM) of the base ring of U6, the COM of the U6 backbone, the COM of the backbone of residue 8, and the COM of the backbone of the base complementary to U645,45. Based on visual inspections of the trajectories, U6 was considered flipped out when the U6 − A19 hydrogen bond distance exceeded 4.5 Å and the pseudo-dihedral angle was greater than 40 or less than − 40°. Negative pseudo-dihedral angles correspond to major groove flipping, while positive angles correspond to minor groove flipping.

Results

DNA substrates

Relative excision efficiencies for uracils embedded within a long (> 6 kbp) viral dsDNA have previously been reported5. Inspection of those results suggests that AUT sequences are generally poorer UNG substrates than TUA sequences despite similar overall AT/GC content. Based on these published data, we designed dsDNA substrates containing uracil in TUA or AUT contexts, and adenine opposite uracil (Table 1). An initial set of substrates (1TA, 2AT, 3AT and 4TA) were designed to span a wide range of uracil excision efficiencies. For uracils embedded in a viral dsDNA genome, substrates 1TA and 4TA represent the high and low end of removal efficiencies in TUA contexts (100% and 50%, respectively), while substrates 3AT and 2AT represent the high and low end in AUT contexts (35% and 10%, respectively)5. Substrates 1AT and 2TA were then designed to evaluate the impact of swapping the flanking A and T bases of the substrates with the best (1TA) and worst (2AT) removal efficiency in the original set. Lastly, substrates containing uracil in a AUA (1AA and 2AA) or a TUT (1TT and 2TT) context were designed to evaluate the contributions of the uracil-flanking nucleotides.

Kinetics of uracil removal by UNG

UNG activity was measured as described in Materials and Methods; an example is given in Fig. S1. Although 2AP introduces slight local conformational perturbations48,49, its effects on enzyme binding and kinetics are marginal25,26,50,51. Calculated Michaelis–Menten parameters (KM and kcat, Figs. 1 and S7) and specificity constants (kcat/KM) are listed in Table S4. KM values fall within the range of previous measurements23,5254. For substrates 1TA, 2AT, 3AT, and 4TA, kcat/KM ratios parallel the percent uracil removal efficiencies reported previously5 in longer and more complex viral DNA: 1TA > 4TA > 3AT > 2 AT (Fig. 1, inset). For substrates 1AT and 2TA, whose core motifs are identical to 1TA and 2AT except for the swapped flanking bases, kcat/KM ratios are higher in TUA than AUT contexts (i.e. 1TA > 1AT and 2TA > 2AT, Fig. 1 and Table S4). Competition experiments (Supporting Information) using UNG acting on two different uracil-containing substrates present in the same reaction mixture confirm that UNG’s specificity is greater in TUA than AUT contexts, and that substrate specificity is not solely determined by the flanking bases (see also Fig. 1, inset).

Fluorescence quantum yields and time-resolved fluorescence

Unlike the canonical DNA bases, 2-aminopurine (2AP) is highly fluorescent when free in solution and strongly quenched when incorporated into DNA26. Inter-base interactions with its neighboring bases in duplex DNA give rise to a multiexponential decay that reflects the highly heterogeneous environment sensed by the probe. We used steady-state and time-resolved 2AP fluorescence to probe DNA dynamics around the uracil lesion. Consistent with previous reports5558, four exponential terms with lifetimes ranging from tens of picoseconds to nanoseconds were needed to fit the time-resolved TCSPC data (Table S2):

It=I0i=14αie-t/τi,i=14αi=1 3

Lifetimes in the picosecond timescale have been reported for 2AP in dsDNA using ultrafast methods59, indicating that a fraction of the emitting 2AP fluorophores has lifetimes shorter than the TCSPC resolution (~ 40 ps). The fractional population of the highly stacked probes that give rise to these ultrafast lifetimes (denoted by α0) was determined from the average lifetimes (Table S2) and the measured fluorescence quantum yields (Table S1)29. Results are shown in Table S5 and Fig. 2.

Figure 2.

Figure 2

(A) Inverse correlation between the measured specificity constants (kcat/KM) and α0, a fluorescence-derived experimental observable that measures the degree of base stacking in the region surrounding the uracil. (B) Correlation between kcat/KM and the NMR-measured exchange rate constants (kex) for the uracil imino proton. Inline graphic

Seibert et al. reported fluorescence quantum yields and lifetimes of 2AP opposite a uracil in two different 19 bp DNA substrates: one containing uracil flanked by two As (AUA, high UNG efficiency) and one containing uracil flanked by two Gs (GUG, low UNG efficiency)23. The AUA sequence had a shorter average lifetime (0.32 ns) than GUG (2.48 ns), which was interpreted as AUA being more flexible and therefore leading to more efficient dynamic quenching. Guanine, however, is an efficient quencher of 2AP, and 2AP lifetimes as low as 400 fs, were measured for 2AP in the vicinity of G59. The surprisingly long average lifetime reported for GUG (2.48 ns), therefore, likely reflects the fact that most of the 2AP population is quenched dynamically with lifetimes shorter than the resolution of the experiment (the shorter lifetimes reported were ~ 200 ps while the typical resolution of TCSPC measurements is about 40 ps). Had the authors measured the short lifetimes with high relative amplitudes expected for 2AP in the vicinity of G, the calculated 〈τ❭ would have been significantly shorter and likely below the value measured for AUA. These arguments illustrate the problems with interpreting relative 2AP average lifetimes in terms of substrate flexibility. Because measured average lifetimes are sensitive to instrumental resolution, we favor the currently accepted view that the relative contributions of each lifetime (αi), but not the lifetimes themselves, are useful measures of the degree of base stacking, and therefore reflect substrate flexibility26.

The excited 2AP population is expected to be partitioned between several different local environments that lead to different quenching efficiencies. The normalized amplitudes in Eq. (3) (αi ), measure the fractional population of each conformation detectable by TCSPC, from more stacked (α1), to more exposed to the solvated environment (α4)26. Here, we focus on the remaining population of 2AP molecules that cannot be detected by TCSPC (α0). This population emits with lifetimes shorter than the resolution of the experiment (~ 40 ps) due to ultrafast quenching. All calculated α0 values are quite high (α0 > 0.4 for all sequences, Table S5), but values are higher for substrates containing uracil in a AUT context compared to TUA. A higher α0 value indicates a higher fraction of highly stacked 2AP fluorophores, which we interpret as an indication of a less deformable substrate.

Substrates 1AT and 2TA were designed from parent substrates with high (1TA) and low (2AT) UNG efficiency to test the hypothesis that differences in substrate deformability determine preferential repair efficiencies. Swapping the flanking A and T while keeping all other bases constant affects stacking interactions in the vicinity of the uracil without changing the melting temperature. This swap resulted in a higher fraction of highly stacked 2AP probes for substrates with AUT contexts, i.e. α0 (1AT) > α0 (1TA) and α0 (2AT) > α0 (2TA). As noted above, kcat/KM values follow the opposite trend, suggesting a correlation between substrate deformability and uracil removal efficiency by UNG. A strongly negative correlation between kcat/KM and α0 was indeed observed for all sequences (Fig. 2A), indicating that more flexible sequences have higher repair rates.

Molecular Dynamics (MD) Simulations

The flexibility of the various DNA duplexes (Table 1) was quantified by MD. Calculated bending persistence lengths showed that all TA sequences and 3AT were more flexible than undamaged DNA (which has a persistence length of ~ 500 Ȧ)60,61, while the 1AT, 2AT, 1TT, 2TT, 1AA, and 2AA sequences were similar to undamaged DNA in bending rigidity (Table S6). We observed a clear distinction between the TA and AT sequences, with the former having lower bending persistence lengths than the latter. The AA and TT sequences were least bendable. The higher flexibility of the TA sequences was largely echoed in calculated torsional persistence lengths: the AT sequences generally had larger torsional persistence lengths than the TA sequences. The only exception was the 3AT sequence, which was torsionally more flexible than the stiffest TA sequence (2TA). Torsional persistence lengths of the AA and TT sequences were similar to the AT sequences. The standard deviation of the bending angle was highest for the TA sequences, indicating large bending flexibility of these sequences, and lowest for AA/TT, indicating higher rigidity (Table S6). Overall, calculated properties indicated the highest flexibility for the TA sequences and the lowest flexibility for the AA and TT sequences. Figure 3 shows that these three properties are strongly correlated with the α0 values obtained from fluorescence measurements. The correlation coefficient (r) was 0.918 for the bending persistence length, 0.869 for the torsional persistence length, and − 0.939 for the standard deviation of the bending angle. Moreover, ranking of the sequences by flexibility was largely similar for these MD measures and α0.

Figure 3.

Figure 3

Correlation of MD properties with α0. (A) Bending persistence length (in Å), (B) torsional persistence length (in Å), (C) standard deviation of the bending angle (in degrees). Averages and standard deviations over 3 MD replicas. Red lines show linear regression of the data; correlation coefficients are 0.981 (A), 0.869 (B), and − 0.939 (C). Inline graphic

The sequences displayed markedly different local dynamics around the lesion. Figure S8 shows the shift, slide, and rise translational step parameters, and tilt, roll, and twist rotational parameters for the central A5U6A7, A5U6T7, T5U6A7, and T5U6T7 steps. These values were averaged over all trajectories and all sequences; reported standard deviations are a measure of the base step flexibilities. Interestingly, differences in UA and AU step flexibilities depend on context and do not completely mirror the behavior of TA and AT steps of undamaged DNA18,19,21. For example, UA is particularly flexible in the TUA motif, but rigid in AUA. The TUA motif was by far most flexible, displaying large flexibilities in all step parameters of both steps. Some asymmetries in the UA and TU steps of this motif were observed. The roll angle of its UA step was more flexible than its TU step, its TU step was more flexible than UA in slide and twist, while both steps had similar flexibilities for the other parameters. Second most flexible was AUT. Its UT step was more flexible in roll, twist, shift, and rise, its AU step was more flexible in slide, and its tilt flexibility was similar for both steps. In contrast, all steps of the AUA and TUT motifs displayed low flexibilities.

The main reason for the high flexibility of the central base steps of the TUA sequences was extra-helical base flipping of U6 (Table 2), which was observed in all TUA sequences. Flipping was reversible and would occur throughout the simulations, but in 2TA and 4TA U6 remained extra-helical for nearly the entirety of the simulations. Base flipping was also observed in the AUT sequences, particularly in the 3AT and 1AT sequences, but this was by far not as prominent as when uracil was in a TUA context. 1AA and 2AA displayed even less flipping than the AUT sequences, and flipping did not occur in 1TT and 2TT.

Table 2.

Average and standard deviation of the time U6 is intra-helical over three MD replicas.

Sequence % Time U6 is intra-helical
2TA 0.9 ± 0.9
4TA 6.3 ± 6.6
1TA 15 ± 17
3AT 57 ± 35
1AT 71 ± 28
2AT 96.3 ± 1.9
1AA 96.2 ± 1.0
2AA 98.9 ± 0.4
1TT 100.0 ± 0.0
2TT 100.0 ± 0.0

Flipping of U6 was highly correlated to bending motions. The standard deviation of the flipping angle was highly correlated to the bending persistence length and the standard deviation of the bending angle, with correlation coefficients of -0.972 and 0.897, respectively (Fig. 4). The correlation to the torsional motion was weaker, with a correlation coefficient of − 0.763 for the torsional persistence length. Flipping would start toward the major groove (negative flipping angles). The fully flipped U would either remain in the major groove or start interacting with the minor groove, thereby changing the flipping angle to positive values. Since flipping was prevalent in the TA sequences, this behavior led to large standard deviations over the TA replicas (Fig. 4).

Figure 4.

Figure 4

Correlation of the standard deviation of the flipping angle with the bending (A) and torsional persistence length (B), the standard deviation of the bending angle (C), α0 (D) and kex of U6 (E). Averages and standard deviations over 3 MD replicas. Red lines show linear regression of all data with correlation coefficients of − 0.972 (A), − 0.763 (B), 0.897 (C), − 0.694 (D), and 0.972 (E), blue line the regression excluding 2TA with a correlation coefficient of 0.799 (E). Inline graphic

NMR-detected imino proton exchange rates

We probed individual base pair imino proton exchange rates of UNG substrates with solution NMR spectroscopy. These rates are directly related to nucleic acid breathing motions, whereby the imino base pair protons in transiently open states can exchange with water (Fig. 5A)33,62,63. The imino proton exchange rate (kex) provides insight into the base pair stability and duplex dynamics at the single base-pair level. As indicated by the MD simulations, we hypothesized that the kex of the target uracil (U6, Fig. 5B) will be sequence-dependent, and that a higher kex for central base pairs reflects more suitable UNG substrates. The imino protons for each NMR duplex listed in Table 1 were assigned (Fig. S3), and individual base pair kex values were measured. The 1TA exchange rates were not characterized due to significant resonance overlap of uracil (U6) with T18 (Fig. S3) which impeded accurate deconvolution and analysis of the U6 and T18 resonances in R1n and kex experiments. The UNG substrates from series one were therefore not included in the NMR analyses. The NMR data used for fitting the kex are deposited under BMRB Entry ID 51612.

Figure 5.

Figure 5

Imino proton exchange rates (kex) of individual base pairs in each of the two-series substrates in isolation confirm MD and fluorescence dynamics studies. (A) Schematic representation of the two-state model for base pair opening and imino exchange kinetics in nucleic acids. When the kex is not the rate-limiting step, the kex is directly proportional to the stability of the base pair and provides insight into DNA duplex dynamics. (B) Series two UNG substrate sequences, with the central uracil (U6) highlighted in magenta. (C) Imino proton exchange rates (kex) of individual base pairs in 2AT, 2TA, 2TT, and 2AA substrates. The uracil exchange rate corresponds to substrate efficiency. Error bars represent the propagated fitting error.

Imino proton NMR spectra are only observed in the presence of base-pairing64,65; due to fraying, terminal duplex base-pairs were therefore not observed. Though most of the surrounding exchange rates were comparable between sequences, the central uracil exhibited strikingly distinct kex rates that varied by more than ten-fold across sequences (Fig. 5C). Differences in uracil imino exchange rates demonstrate a strong dependence on adjacent base pairs. In 2AT and 2TA, the kex significantly increases as the result of a conserved change in sequence order of the surrounding base pairs. Given the significant differences in dynamics observed by fluorescence, MD simulations, and NMR when swapping between AT and TA bases flanking the central uracil, we measured imino proton exchange rates for 2TT and 2AA to elucidate whether the 5′ T or 3′ A relative to uracil dictates substrate flexibility and UNG efficiency. Interestingly, the sequences with the highest uracil imino exchange rates are those with adenine 3′ to U6 (A7). The 2TA duplex measures the highest U6 kex of 10.24 ± 0.75 s−1 while 2AA is intermediate with a kex of 3.32 ± 0.31 s−1 (Fig. 5C). Our experiments identify that the base pair at the 3′ side of uracil is most influential on the kex of U6, demonstrated by the nearly 15-fold difference in kex between 2TA and 2TT (10.24 ± 0.75 s−1 and 0.70 ± 0.08 s−1, respectively). This could suggest a structural and/or energetic hindrance to UNG efficiency in repairing such sequences. A positive correlation between kex of U6 and kcat/KM was observed (Fig. 2B), indicating that more flexible sequences have higher repair efficiencies.

Thermodynamic cycle analysis

Given that the base identity 3′ to uracil most significantly contributes to substrate flexibility, we probed allosteric coupling between the 5′ and 3′ uracil flanking positions. Thermodynamic cycle analysis of the NMR data (kex) measured for substrate series two identifies that there is coupling between these positions with a ΔGcoup of 3.9 ± 0.4 kJ/mol (Eq. 2 and Fig. 6). Based on the finding that substrate dynamics govern UNG activity, we expected coupling between the 5′ and 3′ flanking uracil positions in enzyme kinetics as well. This was validated by a thermodynamic cycle of substrate series two using ΔG values from kcat/KM measurements, yielding a coupling energy of 1.8 ± 0.6 kJ/mol (Fig. 6). Thermodynamic theory dictates that this allosteric coupling should be preserved independent of the surrounding sequence, and to confirm this, a thermodynamic cycle was carried out with substrate series one. As expected, we obtained the same coupling energies in both cases (horizontal lines in Fig. 6).

Figure 6.

Figure 6

Double mutant cycle analysis for substrate series one (back face, red arrows) and substrate series two (front face, black arrows). All values represent ΔΔG12 in kJ/mol. Black values for substrate series one and two were calculated from kcat/Km values of Table S4, magenta values for substrate series two from kex values of U6 as reported Fig. 5.

Discussion

The first uracil DNA glycosylase (UNG) was identified almost 50 years ago in E. coli66, and since, has been found across mammals, plants, bacteria, and viruses. While UNG is well characterized functionally and structurally, the observation that DNA substrate sequence significantly impacts its enzymatic efficiency has not been fully explained. To deconvolute the physical DNA substrate properties that impact UNG function, we employed a set of dsDNA UNG substrates that encompasses variable enzymatic efficiencies. UNG activity was correlated to substrate uracil dynamics by fluorescence, NMR, and MD. These studies identified both proximal and distal substrate features whose contributions impact UNG activity.

Uracil recognition by human and E. coli UNG relies on trapping extrahelical uracils that are spontaneously exposed by thermally induced base pair opening motions12,13. NMR imino proton exchange measurements have shown that U:A base pairs open with rate constants that are about one order of magnitude faster than T:A base pairs in the same sequence context14, and this difference has been proposed to contribute to the mechanism by which UNG discriminates between thymine and uracil. Here, we report uracil imino exchange rates that vary nearly 15-fold across sequences. These results support the notion that the sequence context surrounding the uracil modulate its opening dynamics and ultimately regulate the rate of uracil removal. To establish a correlation between substrate flexibility and UNG activity, we determined the UNG Michaelis–Menten constants for ten designed DNA substrates containing uracil flanked by either A or T bases (Figs. 1, S7 and Table S4). This experimental design allowed us to focus on the effects of substrate flexibility without affecting the melting temperature and stability of the duplex. UNG specificity constants (kcat/KM) were calculated and used to evaluate relative UNG substrate preferences. We identify that the UNG specificity constant (kcat/KM) is generally smaller for substrates containing a thymine 3′ to the uracil. Replacing the 3′ thymine in substrates 1TT and 2TT with an adenine (TUT → TUA), results in a c.a. threefold increase in kcat/KM. Swapping A and T around the uracil in substrates 1AT and 2TA (AUT → TUA) also results in an increase in kcat/KM.

The fact that UNG recognizes spontaneously exposed uracil suggests that sequence effects in kcat/KM are due to inherent differences in the flexibility of the DNA helix around the uracil. Accordingly, we observe an inverse correlation between kcat/KM and α0, a fluorescence-derived observable that measures the degree of base stacking around the uracil (Fig. 2A). Values of α0 are smaller for TUA sequences than for AUT, indicating that uracil is less dynamic in the second group. Similarly, a thymine 3′ to U results in lower exchange rate constants for the uracil imino proton (Table S3 and Fig. 2B), and we observed a correlation between kcat/KM and kex for the five substrates for which we were able to obtain kex constants by NMR (Fig. 2B). This further supports the notion that kcat/KM is greater for substrates containing the uracil in more flexible contexts. We note that there is no clear correlation between kcat and α0 (Fig. S9), indicating that the values of KM, but not kcat, are dictated by the mechanical properties of the substrate. This is consistent with current mechanistic knowledge that UNG kcat is limited by product release67,68.

MD data suggests that the fluorescence spectroscopy-derived α0 values and the NMR-determined U6-kex values measure different aspects of substrate motion. Values of α0 correlate more strongly with DNA bending than uracil flipping (Figs. 3, 4), while U6-kex values correlate more strongly to base flipping than DNA bending (Figs. 4, S10). Nevertheless, these motions are coupled (Fig. 4), and given the correlation of α0 and U6kex with kcat/KM (Fig. 2), both motions contribute to UNG substrate recognition and uracil excision. The necessity of both motions is consistent with crystal structures, in which DNA in complex with UNG is both flipped and bent (by an average of 33°)10,13,6973. The stronger correlation of α0 with kcat/KM (Fig. 2) is intriguing since it would indicate that excision is more strongly correlated to DNA bending than base flipping. The observation that KM, but not kcat, is governed by the mechanical properties of the substrate, indicates that DNA bending favors the formation of the enzyme–substrate complex. Our MD data shows that DNA bending correlates with base flipping, which is consistent with previous studies that show that DNA bending facilitates base flipping by pushing the system up in energy74,75.

For a given substrate, changing one of the bases flanking the uracil affects kcat/KM in a way that depends on the identity of the other. For example, for substrate series one (i.e., 1AT, 1AA, 1TT, and 1TA) and two (i.e., 2AT, 2AA, 2TT, and 2TA), the effect of replacing an adenine 3′ to uracil with thymine is much greater when the base 5′ to uracil is T than A. Similarly, the effect of substituting an adenine 5′ to uracil by thymine is much greater when the base 3′ to uracil is T than A. The thermodynamic coupling between the bases flanking the uracil was analyzed by two types of double mutant cycles, one using data from solution NMR on the isolated substrates, and one from UNG activity (Fig. 6)34. It is evident from both cycles that ΔΔGATAAΔΔGTTTA and ΔΔGATTTΔΔGAATA, indicating that substituting one of the bases adjacent to uracil affects the energy of the transition state in a manner that depends on the identity of the other one. In other words, the sites directly 5′ and 3′ of uracil are allosterically coupled. Substituting a thymine 3′ to uracil by adenine (vertical edges of the cube in Fig. 6) stabilizes the transition states, but in both cycles, effects are more significant when the base 5′ to uracil is thymine. Similarly, substituting a thymine 5′ to uracil by adenine stabilizes the transition state when the base 3′ to uracil is thymine, but destabilizes it when adenine is in this position. This allosteric coupling is quantified by the coupling energy, with a value of 3.9 ± 0.4 kJ/mol for the kex-based cycle with substrate series two, and values of 1.8 ± 0.6 kJ/mol and 1.8 ± 0.5 kJ/mol for the kcat/KM -based cycle with substrate series two and one, respectively. While magnitudes of coupling necessarily differ between the two cycles, the coupling has the same sign in both cases, which supports the conclusion that UNG catalysis correlates with substrate dynamics. Coupling energies are higher for the imino proton exchange-based cycle compared to the kcat/KM-based cycle. Considering that the former measures coupling in the isolated substrates, these results point to the role of the enzyme in reducing the coupling energy, consistent with the fact that UNG must be capable of removing uracil in diverse sequence contexts.

For the kcat/KM-based cycle, coupling energies for substrate series one and two are identical, within error, indicating that the effect of a single change in a base adjacent to uracil is independent of the identity of the bases that differ between substrate series one from series two (i.e., ΔΔG1XY1XX=ΔΔG2XY2XX, where X and Y are A or T). This indicates that all the combined differences between substrates one and two have a constant effect in kcat/KM that is independent of the identity of the bases flanking the uracil. Consistent with this, all values of ΔG2XY1XY are the same for any combination of X and Y (lines connecting the thermodynamic squares of substrates 1 and 2 in Fig. 6). Although results show that sequence effects are not limited to the bases immediately surrounding the uracil lesion (kcat/KM values are 1.4-fold greater for substrate series one), for these sequences, effects appear to be additive to the effects of the bases flanking the uracil. Based on this analysis, we conclude that the bases adjacent to the uracil have the greatest impact in both substrate flexibility and UNG activity, and that the bases not immediately surrounding the uracil provide a secondary level of modulation.

While our study focused on E. coli UNG, we anticipate the results to be broadly applicable because the catalytic cores of different UNGs are closely related. For example, the root-mean-square deviation between all Cα positions of human and E. coli UNG enzymes is just 0.9 Å76,77. UNGs are also structurally similar (despite low sequence identity) to bacterial mismatch-specific uracil-DNA glycosylases (MUGs) and to eukaryotic thymine-DNA glycosylates (TDGs), which use a base-flipping mechanism for the recognition of uracil and thymine78,79. We speculate that the MUG and TDG efficiencies will also be dictated in large part by substrate lesion DNA dynamics, though future studies are needed to test this hypothesis. Given the fundamental nature of genomic integrity, the implications of our studies are significant. That repair efficiencies, at least for UNG, are in large part dictated by DNA sequence deformability and flexibility could help explain the molecular mechanisms that underly fundamental observations in the fields of oncogenetics and cancer hotspots80, and evolutionary adaptation81. One particularly relevant context extends to the rapidly expanding field of base editing, where UNG and other glycosylases have been tethered to Cas9 nickases enabling precision DNA alterations with great potential for therapeutic intervention82,83. Ultimately, our data show a clear correlation between UNG activity and substrate flexibility that can be used to make predictions about the functional attributes of substrates, and may help rationalize sequence effects in base editing and other fields.

Supplementary Information

Acknowledgements

M.L. acknowledges use of the Ultrafast Laser Spectroscopy Facility at Arizona State University. W.V.H. acknowledges use of the Magnetic Resonance Research Center at Arizona State University. A.v.d.V. acknowledges use of Research Computing at the University of South Florida. This work was supported by the National Science Foundation [# 1918716 to ML and WVH, # 1919096 to AvdV]. Research reported in this publication was also supported by the National Institute of General Medical Sciences of the National Institutes of Health under Award Number R35GM141933 [WVH]. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Author contributions

M.L., A.v.d.V, W.V.H. designed and supervised the research, and analyzed data; S.P., A.M.O., N.K., B.T.U. performed experiments and analyzed data, P.B.O. carried out simulations and analyzed M.D. trajectories, W.V.H., A.M.O., S.A. analyzed NMR data, M.L., A.v.d.V., W.V.H., S.P., A.M.O., P.B.O. wrote and edited the paper. All authors have read and agreed to the published version of the manuscript.

Data availability

The data that support the findings of this paper are available from the corresponding authors upon reasonable request. The NMR data used for fitting the kex are deposited under BMRB Entry ID 51612.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Paul B. Orndorff, Souvik Poddar and Aerial M. Owens.

Contributor Information

Wade D. Van Horn, Email: wade.van.horn@asu.edu

Arjan van der Vaart, Email: avandervaart@usf.edu.

Marcia Levitus, Email: marcia.levitus@asu.edu.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-023-30620-0.

References

  • 1.McCullough AK, Dodson ML, Lloyd RS. Initiation of base excision repair: Glycosylase mechanisms and structures. Annu. Rev. Biochem. 1999;68:255–285. doi: 10.1146/annurev.biochem.68.1.255. [DOI] [PubMed] [Google Scholar]
  • 2.Jones M, Wagner R, Radman M. Repair of a mismatch is influenced by the base composition of the surrounding nucleotide-sequence. Genetics. 1987;115:605–610. doi: 10.1093/genetics/115.4.605. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.SibghatUllah ‡, et al. Base analog and neighboring base effects on substrate specificity of recombinant human G: T mismatch-specific thymine DNA-glycosylase. Biochemistry. 1996;35:12926–12932. doi: 10.1021/bi961022u. [DOI] [PubMed] [Google Scholar]
  • 4.Eftedal I, Guddal PH, Slupphaug G, Volden G, Krokan HE. Consensus sequences for good and poor removal of uracil from double-stranded DNA by uracil-DNA glycosylase. Nucl. Acids Res. 1993;21:2095–2101. doi: 10.1093/nar/21.9.2095. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Nilsen H, Yazdankhah SP, Eftedal I, Krokan HE. Sequence specificity for removal of uracil from U-center-dot-a pairs and U-center-dot-G mismatches by uracil-DNA glycosylase from Escherichia-coli, and correlation with mutational hotspots. Febs Lett. 1995;362:205–209. doi: 10.1016/0014-5793(95)00244-4. [DOI] [PubMed] [Google Scholar]
  • 6.Dolan ME, Oplinger M, Pegg AE. Sequence specificity of guanine alkylation and repair. Carcinogenesis. 1988;9:2139–2143. doi: 10.1093/carcin/9.11.2139. [DOI] [PubMed] [Google Scholar]
  • 7.Haseltine WA, et al. Cleavage of pyrimidine dimers in specific DNA-sequences by a pyrimidine dimer DNA-glycosylase of M-luteus. Nature. 1980;285:634–641. doi: 10.1038/285634a0. [DOI] [PubMed] [Google Scholar]
  • 8.Krokan HE, Drablos F, Slupphaug G. Uracil in DNA–occurrence, consequences and repair. Oncogene. 2002;21:8935–8948. doi: 10.1038/sj.onc.1205996. [DOI] [PubMed] [Google Scholar]
  • 9.Visnes T, et al. Uracil in DNA and its processing by different DNA glycosylases. Philos. Trans. R. Soc. B. 2009;364:563–568. doi: 10.1098/rstb.2008.0186. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Parikh SS, et al. Base excision repair initiation revealed by crystal structures and binding kinetics of human uracil-DNA glycosylase with DNA. Embo J. 1998;17:5214–5226. doi: 10.1093/emboj/17.17.5214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Fuxreiter M, Luo N, Jedlovszky P, Simon I, Osman R. Role of base flipping in specific recognition of damaged DNA by repair enzymes. J. Mol. Biol. 2002;323:823–834. doi: 10.1016/S0022-2836(02)00999-3. [DOI] [PubMed] [Google Scholar]
  • 12.Cao CY, Jiang YL, Stivers JT, Song FH. Dynamic opening of DNA during the enzymatic search for a damaged base. Nat. Struct. Mol. Biol. 2004;11:1230–1236. doi: 10.1038/nsmb864. [DOI] [PubMed] [Google Scholar]
  • 13.Parker JB, et al. Enzymatic capture of an extrahelical thymine in the search for uracil in DNA. Nature. 2007;449:433–437. doi: 10.1038/nature06131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Parker JB, Stivers JT. Dynamics of uracil and 5-fluorouracil in DNA. Biochemistry. 2011;50:612–617. doi: 10.1021/bi101536k. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Carr CE, Khutsishvili I, Gold B, Marky LA. Thermodynamic stability of DNA duplexes comprising the simplest T –> dU substitutions. Biochemistry. 2018;57:5666–5671. doi: 10.1021/acs.biochem.8b00676. [DOI] [PubMed] [Google Scholar]
  • 16.Slupphaug G, et al. Properties of a recombinant human uracil-DNA glycosylase from the UNG gene and evidence that UNG encodes the major uracil-DNA glycosylase. Biochemistry. 1995;34:128–138. doi: 10.1021/bi00001a016. [DOI] [PubMed] [Google Scholar]
  • 17.Holz K, Pavlic A, Lietard J, Somoza MM. Specificity and efficiency of the uracil DNA glycosylase-mediated strand cleavage surveyed on large sequence libraries. Sci. Rep. 2019;9:17822. doi: 10.1038/s41598-019-54044-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.ElHassan MA, Calladine CR. Conformational characteristics of DNA: Empirical classifications and a hypothesis for the conformational behaviour of dinucleotide steps. Philos. Trans. R. Soc. A. 1997;355:43–100. doi: 10.1098/rsta.1997.0002. [DOI] [Google Scholar]
  • 19.Packer MJ, Dauncey MP, Hunter CA. Sequence-dependent DNA structure: dinucleotide conformational maps. J. Mol. Biol. 2000;295:71–83. doi: 10.1006/jmbi.1999.3236. [DOI] [PubMed] [Google Scholar]
  • 20.Heddi B, Abi-Ghanem J, Lavigne M, Hartmann B. Sequence-dependent DNA flexibility mediates DNase I cleavage. J. Mol. Biol. 2010;395:123–133. doi: 10.1016/j.jmb.2009.10.023. [DOI] [PubMed] [Google Scholar]
  • 21.Widom J. Role of DNA sequence in nucleosome stability and dynamics. Q. Rev. Biophys. 2001;34:269–324. doi: 10.1017/S0033583501003699. [DOI] [PubMed] [Google Scholar]
  • 22.Stivers JT, Jiang YL. a mechanistic perspective on the chemistry of DNA repair glycosylases. Chem. Rev. 2003;103:2729–2760. doi: 10.1021/cr010219b. [DOI] [PubMed] [Google Scholar]
  • 23.Seibert E, Ross JB, Osman R. Role of DNA flexibility in sequence-dependent activity of uracil DNA glycosylase. Biochemistry. 2002;41:10976–10984. doi: 10.1021/bi026121o. [DOI] [PubMed] [Google Scholar]
  • 24.Stivers JT. 2-Aminopurine fluorescence studies of base stacking interactions at abasic sites in DNA: Metal-ion and base sequence effects. Nucl. Acids Res. 1998;26:3837–3844. doi: 10.1093/nar/26.16.3837. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Stivers JT, Pankiewicz KW, Watanabe KA. Kinetic mechanism of damage site recognition and uracil flipping by Escherichia coli uracil DNA glycosylase. Biochemistry. 1999;38:952–963. doi: 10.1021/bi9818669. [DOI] [PubMed] [Google Scholar]
  • 26.Jones AC, Neely RK. 2-Aminopurine as a fluorescent probe of DNA conformation and the DNA-enzyme interface. Q. Rev. Biophys. 2015;48:244–279. doi: 10.1017/S0033583514000158. [DOI] [PubMed] [Google Scholar]
  • 27.Levitus M. Tutorial: Measurement of fluorescence spectra and determination of relative fluorescence quantum yields of transparent samples. Methods Appl. Fluores. 2020;8:033001. doi: 10.1088/2050-6120/ab7e10. [DOI] [PubMed] [Google Scholar]
  • 28.Ward DC, Reich E, Stryer L. Fluorescence studies of nucleotides and polynucleotides .I. Formycin 2-aminopurine riboside 2,6-diaminopurine riboside and their derivatives. J. Biol. Chem. 1969;244:1228. doi: 10.1016/S0021-9258(18)91833-8. [DOI] [PubMed] [Google Scholar]
  • 29.Avilov SV, Piemont E, Shvadchak V, de Rocquigny H, Mely Y. Probing dynamics of HIV-1 nucleocapsid protein/target hexanucleotide complexes by 2-aminopurine. Nucl. Acids Res. 2008;36:885–896. doi: 10.1093/nar/gkm1109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Fürtig B, Richter C, Wöhnert J, Schwalbe H. NMR spectroscopy of RNA. ChemBioChem. 2003;4:936–962. doi: 10.1002/cbic.200300700. [DOI] [PubMed] [Google Scholar]
  • 31.Weiss MA, Patel DJ, Sauer RT, Karplus M. Two-dimensional 1H NMR study of the lambda operator site OL1: A sequential assignment strategy and its application. Proc. Natl. Acad. Sci. U. S. A. 1984;81:130–134. doi: 10.1073/pnas.81.1.130. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Szulik MW, Voehler M, Stone MP. NMR analysis of base-pair opening kinetics in DNA. Curr. Protoc Nucl. Acid Chem. 2014;59:72021–272018. doi: 10.1002/0471142700.nc0720s59. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Anosova I, et al. Structural insights into conformation differences between DNA/TNA and RNA/TNA chimeric duplexes. ChemBioChem. 2016;17:1705–1708. doi: 10.1002/cbic.201600349. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Carter PJ, Winter G, Wilkinson AJ, Fersht AR. The use of double mutants to detect structural changes in the active site of the tyrosyl-tRNA synthetase (Bacillus stearothermophilus) Cell. 1984;38:835–840. doi: 10.1016/0092-8674(84)90278-2. [DOI] [PubMed] [Google Scholar]
  • 35.Lu X-J, Olson WK. 3DNA: a versatile, integrated software system for the analysis, rebuilding and visualization of three-dimensional nucleic-acid structures. Nat. Protoc. 2008;3:1213–1227. doi: 10.1038/nprot.2008.104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 1983;79:926–935. doi: 10.1063/1.445869. [DOI] [Google Scholar]
  • 37.AMBER 2018 (University of California, San Francisco, 2018).
  • 38.Eastman P, et al. OpenMM 7: Rapid development of high performance algorithms for molecular dynamics. PLOS Comp. Biol. 2017;13:e1005659. doi: 10.1371/journal.pcbi.1005659. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Ryckaert J-P, Ciccotti G, Berendsen H. Numerical-integration of cartesian equations of motion of a system with constraints – molecular-dynamics of N-alkanes. J. Comput. Phys. 1977;23:327–341. doi: 10.1016/0021-9991(77)90098-5. [DOI] [Google Scholar]
  • 40.Darden T, York D, Pedersen L. Particle mesh Ewald: An N⋅log( N ) method for Ewald sums in large systems. J. Chem. Phys. 1993;98:10089–10092. doi: 10.1063/1.464397. [DOI] [Google Scholar]
  • 41.Galindo-Murillo R, et al. Assessing the Current State Of Amber Force Field Modifications for DNA. J. Chem. Theory Comput. 2016;12:4114–4127. doi: 10.1021/acs.jctc.6b00186. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Shirts MR, Chodera JD. Statistically optimal analysis of samples from multiple equilibrium states. J. Chem. Phys. 2008;129:124105–124105. doi: 10.1063/1.2978177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Strahs D, Schlick T. A-tract bending: Insights into experimental structures by computational models. J. Mol. Biol. 2000;301:643–663. doi: 10.1006/jmbi.2000.3863. [DOI] [PubMed] [Google Scholar]
  • 44.Ma N, van der Vaart A. Anisotropy of B-DNA groove bending. J. Am. Chem. Soc. 2016;138:9951–9958. doi: 10.1021/jacs.6b05136. [DOI] [PubMed] [Google Scholar]
  • 45.Mazur AK. Wormlike chain theory and bending of short DNA. Phys Rev. Lett. 2007;98:218102. doi: 10.1103/PhysRevLett.98.218102. [DOI] [PubMed] [Google Scholar]
  • 46.Mazur AK. Evaluation of elastic properties of atomistic DNA models. Biophys. J. 2006;91:4507–4518. doi: 10.1529/biophysj.106.091280. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Banavali NK, MacKerell AD. Free energy and structural pathways of base flipping in a DNA GCGC containing sequence. J. Mol. Biol. 2002;319:141–160. doi: 10.1016/S0022-2836(02)00194-8. [DOI] [PubMed] [Google Scholar]
  • 48.Dallmann A, et al. 2-aminopurine incorporation perturbs the dynamics and structure of DNA. Angew. Chem. Int. Ed. 2010;49:5989–5992. doi: 10.1002/anie.201001312. [DOI] [PubMed] [Google Scholar]
  • 49.Lycksell PO, et al. Base pair opening dynamics of a 2-aminopurine substituted Eco RI restriction sequence and its unsubstituted counterpart in oligonucleotides. Nucl. Acids Res. 1987;15:9011–9025. doi: 10.1093/nar/15.21.9011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Bellamy SR, Baldwin GS. A kinetic analysis of substrate recognition by uracil-DNA glycosylase from herpes simplex virus type 1. Nucl. Acids Res. 2001;29:3857–3863. doi: 10.1093/nar/29.18.3857. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.McCullough AK, Dodson ML, Scharer OD, Lloyd RS. The role of base flipping in damage recognition and catalysis by T4 endonuclease V. J. Biol. Chem. 1997;272:27210–27217. doi: 10.1074/jbc.272.43.27210. [DOI] [PubMed] [Google Scholar]
  • 52.Slupphaug G, et al. A nucleotide-flipping mechanism from the structure of human uracil-DNA glycosylase bound to DNA. Nature. 1996;384:87–92. doi: 10.1038/384087a0. [DOI] [PubMed] [Google Scholar]
  • 53.Krusong K, Carpenter EP, Bellamy SR, Savva R, Baldwin GS. A comparative study of uracil-DNA glycosylases from human and herpes simplex virus type 1. J. Biol. Chem. 2006;281:4983–4992. doi: 10.1074/jbc.M509137200. [DOI] [PubMed] [Google Scholar]
  • 54.Kavli B, et al. Excision of cytosine and thymine from DNA by mutants of human uracil-DNA glycosylase. Embo J. 1996;15:3442–3447. doi: 10.1002/j.1460-2075.1996.tb00710.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Neely RK, et al. Time-resolved fluorescence of 2-aminopurine as a probe of base flipping in M.HhaI-DNA complexes. Nucl. Acids Res. 2005;33:6953–6960. doi: 10.1093/nar/gki995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Guest CR, Hochstrasser RA, Sowers LC, Millar DP. Dynamics of mismatched base pairs in DNA. Biochemistry. 1991;30:3271–3279. doi: 10.1021/bi00227a015. [DOI] [PubMed] [Google Scholar]
  • 57.Rachofsky EL, Seibert E, Stivers JT, Osman R, Ross JB. Conformation and dynamics of abasic sites in DNA investigated by time-resolved fluorescence of 2-aminopurine. Biochemistry. 2001;40:957–967. doi: 10.1021/bi001665g. [DOI] [PubMed] [Google Scholar]
  • 58.Nordlund TM, et al. Structure and dynamics of a fluorescent DNA oligomer containing the EcoRI recognition sequence: Fluorescence, molecular dynamics, and NMR studies. Biochemistry. 1989;28:9095–9103. doi: 10.1021/bi00449a021. [DOI] [PubMed] [Google Scholar]
  • 59.Manoj P, Min C-K, Aravindakumar CT, Joo T. Ultrafast charge transfer dynamics in 2-aminopurine modified double helical DNA. Chem. Phys. 2008;352:333–338. doi: 10.1016/j.chemphys.2008.07.005. [DOI] [Google Scholar]
  • 60.Baumann CG, Smith SB, Bloomfield VA, Bustamante C. Ionic effects on the elasticity of single DNA molecules. Proc. Natl. Acad. Sci. U. S. A. 1997;94:6185–6190. doi: 10.1073/pnas.94.12.6185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Ma N, van der Vaart A. KCI dependence of B-DNA groove bending anisotropy. J. Phys. Chem. B. 2017;121:5322–5330. doi: 10.1021/acs.jpcb.7b01957. [DOI] [PubMed] [Google Scholar]
  • 62.Guéron M, Leroy JL. Methods in Enzymology. Academic Press; 1995. [16] Studies of base pair kinetics by NMR measurement of proton exchange; pp. 383–413. [DOI] [PubMed] [Google Scholar]
  • 63.Guéron M, Kochoyan M, Leroy J-L. A single mode of DNA base-pair opening drives imino proton exchange. Nature. 1987;328:89–92. doi: 10.1038/328089a0. [DOI] [PubMed] [Google Scholar]
  • 64.Wemmer DE, Wand AJ, Seeman NC, Kallenbach NR. NMR analysis of DNA junctions: Imino proton NMR studies of individual arms and intact junction. Biochemistry. 1985;24:5745–5749. doi: 10.1021/bi00342a009. [DOI] [PubMed] [Google Scholar]
  • 65.Patel DJ, Pardi A, Itakura K. DNA conformation, dynamics, and interactions in solution. Science. 1982;216:581–590. doi: 10.1126/science.6280281. [DOI] [PubMed] [Google Scholar]
  • 66.Lindahl T. An N-glycosidase from Escherichia coli that releases free uracil from DNA containing deaminated cytosine residues. Proc. Natl. Acad. Sci. U. S. A. 1974;71:3649–3653. doi: 10.1073/pnas.71.9.3649. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Fitzgerald ME, Drohat AC. Coordinating the Initial Steps of Base Excision Repair: apurinic/apyrimidinic endonuclease 1 actively stimulates thymine DNA glycosylase by disrupting the product complex. J. Biol. Chem. 2008;283:32680–32690. doi: 10.1074/jbc.M805504200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Wong I, Lundquist AJ, Bernards AS, Mosbaugh DW. Presteady-state analysis of a single catalytic turnover by Escherichia coli uracil-DNA glycosylase reveals a “pinch-pull-push” mechanism. J. Biol. Chem. 2002;277:19424–19432. doi: 10.1074/jbc.M201198200. [DOI] [PubMed] [Google Scholar]
  • 69.Slupphaug G, et al. A nucleotide-flipping mechanism from the structure of human uracil–DNA glycosylase bound to DNA. Nature. 1996;384:87–92. doi: 10.1038/384087a0. [DOI] [PubMed] [Google Scholar]
  • 70.Parikh Sudip S, et al. Uracil-DNA glycosylase–DNA substrate and product structures: Conformational strain promotes catalytic efficiency by coupled stereoelectronic effects. Proc. Natl. Acad. Sci. U. S. A. 2000;97:5083–5088. doi: 10.1073/pnas.97.10.5083. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Bianchet MA, et al. Electrostatic guidance of glycosyl cation migration along the reaction coordinate of uracil DNA glycosylase. Biochemistry. 2003;42:12455–12460. doi: 10.1021/bi035372+. [DOI] [PubMed] [Google Scholar]
  • 72.Kosaka H, Hoseki J, Nakagawa N, Kuramitsu S, Masui R. Crystal Structure of family 5 uracil-DNA glycosylase bound to DNA. J. Mol. Biol. 2007;373:839–850. doi: 10.1016/j.jmb.2007.08.022. [DOI] [PubMed] [Google Scholar]
  • 73.Pedersen H, Johnson K, McVey C, Leiros I, Moe E. Structure determination of uracil-DNA N-glycosylase from Deinococcus radiodurans in complex with DNA. Acta Crystallogr. D. 2015;71:2137–2149. doi: 10.1107/S1399004715014157. [DOI] [PubMed] [Google Scholar]
  • 74.Ramstein J, Lavery R. Energetic coupling between DNA bending and base pair opening. Proc. Natl. Acad. Sci. U. S. A. 1988;85:7231–7235. doi: 10.1073/pnas.85.19.7231. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Ma N, van der Vaart A. Free energy coupling between DNA bending and base flipping. J. Chem. Inf. Model. 2017;57:2020–2026. doi: 10.1021/acs.jcim.7b00215. [DOI] [PubMed] [Google Scholar]
  • 76.Mol CD, et al. Crystal structure and mutational analysis of human uracil-DNA glycosylase: structural basis for specificity and catalysis. Cell. 1995;80:869–878. doi: 10.1016/0092-8674(95)90290-2. [DOI] [PubMed] [Google Scholar]
  • 77.Xiao GY, et al. Crystal structure of Escherichia coli uracil DNA glycosylase and its complexes with uracil and glycerol: Structure and glycosylase mechanism revisited. Proteins. 1999;35:13–24. doi: 10.1002/(SICI)1097-0134(19990401)35:1<13::AID-PROT2>3.0.CO;2-2. [DOI] [PubMed] [Google Scholar]
  • 78.Gallinari P, Jiricny J. A new class of uracil-DNA glycosylases related to human thymine-DNA glycosylase. Nature. 1996;383:735–738. doi: 10.1038/383735a0. [DOI] [PubMed] [Google Scholar]
  • 79.Schormann N, Ricciardi R, Chattopadhyay D. Uracil-DNA glycosylases-structural and functional perspectives on an essential family of DNA repair enzymes. Protein Sci. 2014;23:1667–1685. doi: 10.1002/pro.2554. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Krokan HE, Drabløs F, Slupphaug G. Uracil in DNA – occurrence, consequences and repair. Oncogene. 2002;21:8935–8948. doi: 10.1038/sj.onc.1205996. [DOI] [PubMed] [Google Scholar]
  • 81.Lewis CA, Crayle J, Zhou S, Swanstrom R, Wolfenden R. Cytosine deamination and the precipitous decline of spontaneous mutation during Earth’s history. Proc. Natl. Acad. Sci. USA. 2016;113:8194–8199. doi: 10.1073/pnas.1607580113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Zhao D, et al. Glycosylase base editors enable C-to-A and C-to-G base changes. Nat. Biotechnol. 2021;39:35–40. doi: 10.1038/s41587-020-0592-2. [DOI] [PubMed] [Google Scholar]
  • 83.Kurt IC, et al. CRISPR C-to-G base editors for inducing targeted DNA transversions in human cells. Nat. Biotechnol. 2021;39:41–46. doi: 10.1038/s41587-020-0609-x. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

The data that support the findings of this paper are available from the corresponding authors upon reasonable request. The NMR data used for fitting the kex are deposited under BMRB Entry ID 51612.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES