Significance
The most prominent epigenetic modification in mammalian genomes is cytosine methylation at position 5 on the pyrimidine ring. Thymine DNA glycosylase (TDG) plays a central role in the pathways for 5-methyl cytosine removal and thus influences gene silencing, stem cell differentiation, and alterations in normal development. Additionally, methylation abnormalities in DNA are often observed in diseases, specifically cancer. Here we examine the mechanisms by which TDG detects, extrudes, and excises modified bases in DNA. Using path-sampling methodologies, we compute minimum free energy paths for TDG base extrusion. The computed paths reveal a unique mechanism underpinning TDG selectivity for DNA lesions or modified bases, which involves DNA sculpting, global protein dynamics, conformational gating, and specific protein–nucleic acid interactions.
Keywords: molecular dynamics, Markov state models, genome maintenance, epigenetics, DNA glycosylase
Abstract
Thymine DNA glycosylase (TDG) is a pivotal enzyme with dual roles in both genome maintenance and epigenetic regulation. TDG is involved in cytosine demethylation at CpG sites in DNA. Here we have used molecular modeling to delineate the lesion search and DNA base interrogation mechanisms of TDG. First, we examined the capacity of TDG to interrogate not only DNA substrates with 5-carboxyl cytosine modifications but also G:T mismatches and nonmismatched (A:T) base pairs using classical and accelerated molecular dynamics. To determine the kinetics, we constructed Markov state models. Base interrogation was found to be highly stochastic and proceeded through insertion of an arginine-containing loop into the DNA minor groove to transiently disrupt Watson–Crick pairing. Next, we employed chain-of-replicas path-sampling methodologies to compute minimum free energy paths for TDG base extrusion. We identified the key intermediates imparting selectivity and determined effective free energy profiles for the lesion search and base extrusion into the TDG active site. Our results show that DNA sculpting, dynamic glycosylase interactions, and stabilizing contacts collectively provide a powerful mechanism for the detection and discrimination of modified bases and epigenetic marks in DNA.
Genome maintenance occurs in the context of chromatin and it is becoming increasingly apparent that epigenetic regulation is intricately intertwined with the DNA damage response in ensuring genome stability. Understanding how epigenetic marks are recognized, distinguished from exogenous or endogenous DNA lesions, and processed by the canonical DNA repair machinery is a topic of great current interest. Here our focus is on the base excision repair (BER) pathway, which in addition to an established role in genome maintenance, is associated with many other cellular processes (1), including a recently discovered critical role in epigenetic regulation (2–5). The most prominent epigenetic modification in mammalian genomes is cytosine methylation, which typically occurs at CpG islands and enhances chromatin packing to promote gene silencing (6). Consequently, 5-methylcytosine (5mC) demethylation is crucial for resuming the transcription of silenced genes. Notably, unbalanced cytosine methylation is a hallmark of cancer (7–9). In cancer, predominantly demethylated regions of the genome could become hypermethylated leading to the silencing of tumor suppressor genes. Furthermore, 5-methylcytosine deamination results in G:T mismatches that could cause C-to-T transition mutations during DNA replication. It is estimated that nearly a third of cancer mutations found in coding regions of the genome arise from C and 5mC deamination at CpG sites (2). There is also a clear link between aging and methylation levels in GpG islands (10). The importance of maintaining the methylation state of the genome requires tight regulation of pathways controlling the levels of 5mC. Removal of 5mC bases (Fig. 1) is known to proceed through successive steps of oxidation by enzymes from the ten-eleven-translocase (TET) family, producing 5-formylcytosine (5fC) and 5-carboxylcytosine (5caC) intermediates (5, 11, 12). Unlike methylcytosine, these intermediates are substrates for thymine DNA glycosylase (5, 13, 14), a classic DNA repair enzyme. While thymine DNA glycosylase (TDG) is important for the repair of mutagenic DNA lesions, it has an even more prominent role in ensuring epigenetic stability. In this capacity, TDG activity is vital during embryonic development (15). TDG also interacts with numerous protein partners engaged in epigenetic regulation [e.g., DNMT3a (16) and CBP/p300 acetylase (17)] and transcription [transcription factors, nuclear receptors (18)] and is intricately involved in the regulation of gene expression.
A second pathway to process 5mC is through deamination followed by the action of MBD1–4 glycosylases (19, 20). The resulting abasic DNA is then channeled through the BER pathway. BER efficiency relies on a remarkably discriminating search for modified bases among an enormous background of normal DNA. The search is followed by damage-specific base extrusion into the enzyme’s active site, removal of the damaged bases, and handoff of the product DNA to downstream pathway participants.
Here we establish a basis to understand the key principles underpinning the extraordinary power of the TDG glycosylase to discriminate in favor of modified bases against a backdrop of normal genomic DNA. We further elucidate the protein–nucleic acid interactions ensuring specificity for lesions or epigenetic marks. Key to selectivity is nucleotide extrusion, a process involving a nucleotide swinging out of the DNA helix and being accommodated in the catalytic pocket of TDG. Nucleotide extrusion (21–23) is a major determinant of glycosylase selectivity, with potential for selection or rejection of substrates at each intermediate along the base eversion path. Glycosylases (22, 24) also employ DNA sculpting strategies (e.g., DNA bending and loop insertion) to lower the energetic barrier of base extrusion and thus increase the efficiency of the dynamic lesion search. Whether glycosylases employ active or passive strategies in this search process has been a topic of considerable debate. NMR evidence has suggested glycosylases could act as passive kinetic traps for spontaneously exposed extrahelical bases (25, 26). Conversely, evidence from molecular crystallography (MX) has pointed to active base extrusion mechanisms. Numerous glycosylase structures (20, 22, 27–29) have shown that DNA binding is accompanied by a multitude of conformational changes preceding active site chemistry: (i) DNA sculpting through interactions with the enzyme DNA-binding groove; (ii) DNA bending, minor groove compression, and backbone distortion at the lesion site; (iii) residue insertion into the DNA stack to expel the lesion base and stabilize the orphaned base; and (iv) base flipping into lesion-specific recognition pockets that sterically exclude nonlesion bases. Cross-linking strategies have, in rare instances, captured crystallographic snapshots of base extrusion intermediates (30) and could, in principle, provide insight into short-lived species along base extrusion paths. Nonetheless, base flipping is inherently dynamic and therefore not easily construed from static crystallographic snapshots. Therefore, molecular modeling studies have been extensively used to complement structural biology approaches and have proven enormously valuable in unraveling detailed dynamics of glycosylase enzymes and the origins of selectivity in BER (23, 29, 31).
Results and Discussion
To explore whether TDG employs an active or passive mechanism, we carried out simulations on TDG/5caC-DNA complexes. As starting points for computational modeling, we utilized existing structures of TDG/5caC-DNA in a preextrusion and postexcision state [Protein Data Bank (PDB) ID codes: 2RBA and 5HF7] (32, 33). The following initial models were created: (i) preextrusion state (5caC accommodated in the DNA base stack); (ii) fully extruded state (5caC inserted in the TDG active site); and (iii) an initial interrogation complex. To address base interrogation, we started from systems with initially separated TDG and 5caC-DNA and simulated complex formation. We also simulated TDG in the presence of a G:T mismatch and with normal DNA.
Our first goal was to delineate the accessible conformational space for TDG/5caC-DNA and to assess the capacity of TDG to interrogate not only 5caC but also G:T mismatches and nonmismatched (A:T) base pairs. To this end, we carried out accelerated molecular dynamics (aMD) (34) runs on the TDG interrogation complexes. The aMD method enhances sampling of the torsional degrees of freedom to accelerate phase space exploration and facilitate transitions over high energy barriers. Surprisingly, we observed that the presence of TDG induces multiple transient base-opening events that occur within 200 ns of aMD sampling (SI Appendix, Fig. S1A). Base interrogation was found to be highly stochastic and appeared to proceed through insertion of an arginine-containing loop (Arg275) into the DNA minor groove to transiently disrupt Watson–Crick pairing. To ascertain that these events are also detectable in unbiased MD (SI Appendix, Fig. S1B), we carried out multiple trajectory regular MD runs for an aggregate simulation time of 8 μs. Analogous runs were performed on the G:T mismatch and unmodified DNA systems. DNA backbone torsion angles for the interrogated base pair and distances between the two bases and the guanidinium group of Arg275 were selected as coordinates for time-lagged independent component analysis (TICA) (35, 36). The combined trajectories were projected onto the first two independent components (ICs). Different energy minima (metastable states) are present and clearly separated in the TICA projections (Fig. 2 A–C). All trajectory frames were then clustered in the projected space of the two ICs using the k-means algorithm, producing 800 clusters (i.e., microstates). From this data we constructed Markov state models (MSMs) (37, 38) and evaluated the kinetics of TDG base interrogation using transition path theory (39). Results are presented in Fig. 2 D–F. Several conclusions are immediately apparent from our analysis. First, TDG probes DNA bases nonspecifically, interrogating not only 5caC-modified bases but also G:T mismatches and normal base pairs. Among the identified kinetically distinct macrostates we distinguish two low-populated extrahelical states with Arg275 inserted into the DNA stack in two different orientations (Fig. 3 and SI Appendix, Fig. S3). These states are accessed through two kinetic intermediates: (i) an intermediate with TDG-induced local torsional shift and intact Watson–Crick pairing; and (ii) an intermediate with partially broken Watson–Crick pairing and Arg275 inserted between the extruded and the orphaned base (Fig. 3 C and D). The second key observation is that the extrahelical states are extremely short lived, and thus not readily detectable by NMR (Fig. 3A and SI Appendix, Figs. S2A and S3A). The free energy landscapes in Fig. 2 reflect this, with barriers to the extrahelical states not exceeding 4 kcal/mol. Third, the H bonding between Arg275 and 5caC is variable and differs from the pattern for normal DNA (Fig. 3 and SI Appendix, Fig. S3). This observation rationalizes the differences between the free energy landscapes in Fig. 2 A and C with a lower barrier for a 5caC modified base to access the extrahelical state. We also analyzed DNA structural parameters using the Curves+ code (40) to determine whether TDG exploits local DNA deformation to facilitate lesion interrogation and selection. We found very little difference between the interbase parameters as well as the total bend of the 5caC and the A:T intrahelical states (SI Appendix, Fig. S4A and Table S5). However, analysis of the transient extrahelical states revealed changes to both the shift and tilt values (∼2 Å and ∼6.5◦) at the interrogation site and flanking base of 5caC (SI Appendix, Fig. S4B). Importantly, the negative charge on 5caC provides a convenient handle for TDG to stabilize an extrahelical intermediate using an opportunely positioned Lys201 (Fig. 4E).
G:T mismatches could also rapidly transition between nonextruded and extruded states (Fig. 2E), exhibiting two well-defined metastable states with disrupted base pairing. The H-bonding pattern of Arg275 in the observed extrahelical states also differs from the pattern for normal DNA (SI Appendix, Figs. S2 and S3). This can be rationalized by the fact that G:T mismatches form “wobble” hydrogen-bonding pairs, which require a sideways shift of one base relative to Watson–Crick positioning. Our structural analysis of the interrogation site confirms this, with intrahelical shift and twist values differing significantly (by ∼2 Å and ∼5◦, respectively) from the ones measured for the intrahelical states of the 5caC and A:T (SI Appendix, Fig. S4A). This leads to increased propensity for bending and base pair disruption at the G:T site (41–44), which TDG takes advantage of through backbone distortion alone. Thus, TDG has shown stronger G:T mismatch repair activity, in vitro, compared with modified substrates (45). Interestingly, Curves+ analysis of the interrogation states for the G:T mismatch system indicates that base stacking is disrupted not only for the transient extrahelical macrostates but also for the intrahelical basin that contains the majority of conformers from the unbiased MD trajectories. Thus, unlike the regular (A:T) DNA or 5caC, the G:T mismatch has a local structural distortion at the very outset of interrogation. This leads to an energetically destabilized initial state and lower energetic cost for base extrusion in the G:T mismatch system. These results substantiate the differences in the transition timescales, with G:T base extrusion occurring on a faster timescale than both the 5caC and A:T substrates (Fig. 3 and SI Appendix, Figs. S2A and S3A).
The results also highlight the role of the TDG insertion loop (Ala270–Pro280) and particularly Arg275, which stabilizes the extrahelical intermediates by stepwise replacement of the Watson–Crick hydrogen bonds. Several glycosylases have been proposed to utilize similar mechanisms in which both DNA sculpting and loop insertion is exploited. Structures of hOGG1, its bacterial homolog MutM, and MBD4 all exhibit Arg-loop insertion (20, 29, 46, 47). Similarly, single molecule experiments have shown the Escherichia coli repair enzymes Fpg, Nei, and Nth to utilize DNA sculpting and intercalating loop strategies to interrogate and extrude damage bases (48, 49).
As the next step in our analysis, we determined the complete base eversion path for TDG/5caC-DNA, starting with the interrogation complexes and ending with the fully extruded state. We identified the key intermediates imparting selectivity and also computed an effective free energy landscape for this transition. Our results show that base eversion in TDG is a gated process that involves motions of several flexible loops and a gating helix (Fig. 4). Therefore, intuitive reaction coordinates (e.g., pseudo torsions) are, in this case, not practical. Recently, there has been considerable progress in methods (50–52) to optimize minimum energy paths (MEPs) when the initial and final states are known. We leveraged two of these methods, the partial nudged elastic band (PNEB) (53, 54) and finite temperature string method (55, 56), to investigate recognition of modified bases by TDG. Both methods define the MEP as a chain of replicas of the system connecting the initial and final configurations. First, we optimized a MEP between the preextrusion and the fully extruded states using PNEB. Gradually spreading the replicas from these two states allowed the optimization process to discover the path in an unbiased way. The PNEB optimized path (Movie S1) served as a starting point for further optimization with the finite temperature string method, which could provide more extensive path sampling. Since the string method works in projected collective variable (CV) space, a preliminary PNEB step was necessary to provide an unbiased initial path and to select CVs for the string method. Using this protocol for TDG/5caC-DNA, we completed 25 ns of PNEB optimization (with 28 replicas) and 224 ns (200 iterations) of the string method (Fig. 4). After MEP convergence, we released each replica and sampled an aggregate of 11.2 μs of unrestrained MD trajectories along the base eversion path, which we further analyzed to construct an MSM.
Our results reveal an intricate network of protein–DNA contacts necessary to accommodate the 5caC base during its passage from the DNA base stack into the TDG active site. Importantly, these contacts significantly lower the free energy barriers for base extrusion to ∼4 kcal/mol (Fig. 5A). By comparison, umbrella sampling simulations of base eversion in the absence of the glycosylase result in barriers of at least 12 kcal/mol (SI Appendix, Fig. S5), which is consistent with previously published values for base extrusion barriers in DNA (21–23). From the MSM analysis we identify six kinetically distinct states along the 5caC eversion path (Figs. 5 B and C and 6). State S1 has the modified base accommodated in the stack; state S2 is an early intermediate wherein 5caC is inserted between the intercalating Arg275 and Lys201. The positive charge on the lysine stabilizes the negative charge on the 5caC carboxyl group. States S3–S5 correspond to configurations wherein 5caC interacts with residues of the Pro198 loop of TDG. Indeed, access to these three relatively rapidly interconverting states is gated by the motion of the Pro198 loop and the adjacent helix (Ser205–Lys221). The TDG gating helix (Leu143–Lys148) serves as a secondary gate by closing over the active site after the 5caC base is inserted. Thus, base eversion by TDG is a global conformational transition and conformational gating is necessary for the 5caC base to access the active site.
Conclusions
Our computational modeling reveals a mechanism underpinning TDG selectivity for DNA lesions (G:T mismatches) or modified bases (e.g., 5caC), which involves DNA sculpting, global protein dynamics, conformational gating, and specific protein–nucleic acid interactions that stabilize the extruded base along the path from the DNA stack to the TDG active site. Our model for base extrusion by TDG bears certain similarities to an earlier proposed mechanism (“pinch-push-pull”) for human UNG (57–60). In this model, the pinch involved compression of the DNA backbone such that the distances between the phosphates flanking the uracil base were reduced by ∼4 Å. Three static enzyme loops were proposed to mediate DNA recognition: the minor groove reading loop (His268–Ser273), the Prorich loop (Pro165–Pro168) and the Gly–Ser loop (Gly246–Ser247). Nucleotide flipping was proposed to be facilitated by the intercalation (push) of Leu272 into the DNA base stack. The final step was the pulling of the uracil base and ribose ring deep into the uracil recognition pocket, resulting in hydrogen bonding to every polar atom of the uracil and in face-to-face π-stacking with Phe158 and Tyr147. In a similar scenario for TDG, the pinch step is achieved by DNA sculpting via protein–DNA interactions and dynamic Arg275-loop insertion. This leads to kinking of the DNA substrate and compression of the distance between the flanking phosphates above and below the extrusion site by up to 3 Å in the extrusion intermediates (SI Appendix, Table S1). The push involves insertion of an interrogation loop into the DNA minor groove, intercalation of an arginine (Arg275) from the tip of the interrogation loop into the DNA stack, and stepwise replacement of Watson–Crick H bonds to lower the energetic barrier for base flipping. Finally, the pull step is achieved by accommodation of the extrahelical base via specific residue interactions in four stable intermediates along the extrusion path (Fig. 6). However, there are also important differences with the previous model. First, unlike Leu272 in UNG, which plays the role of a steric plug, Arg275 has the capacity to actively disrupt Watson–Crick hydrogen bonding. Stepwise replacement of Watson–Crick H bonds between the extruded and orphaned base lowers the barrier to reach the most populated extrahelical state during interrogation. Second, the pinch-push-pull model originated from molecular crystallography and emphasized the role of static enzyme loops. By contrast, we show that protein dynamics and global gating motions of TDG are essential. Specifically, transitioning the 5caC base into the active site requires gating motions of the Pro198 loop and the adjacent helix (Ser205–Lys221) as well as motions of the TDG gating helix (Leu143–Lys148). These motions cannot be easily construed from static crystal structures. Collectively, our results shed light on the key determinants of glycosylase selectivity and uncover universal rules governing this class of enzymes.
Materials and Methods
Models for the pre- and postextrusion states were constructed from two TDG/DNA crystal structures (PDB ID codes: 5HF7 and 2RBA) (32, 33). To examine base interrogation, we built systems of initially separated TDG and DNA for 5caC:G, G:T, and A:T base pairs. All systems were then minimized, heated in the NVT ensemble, and then equilibrated in the NPT ensemble (1 atm and 300 K). One hundred nanoseconds of MD equilibration was followed by 200 ns of accelerated MD. Snapshots of the aMD trajectories were then replicated and released for an aggregate simulation time of 8 μs for each system. Detailed simulation protocols can be found in SI Appendix, Supplementary Methods. The base eversion path for TDG/5caC was represented by 28 replicas. For path optimization, all heavy atoms of the TDG/5caC complex were included. We first performed PNEB optimization of the replicas for 25 ns with a simulated annealing protocol. We then further optimized the path using the finite-temperature string method with swarms of trajectories. PNEB/string method protocols and the definition of collective variables are provided as SI Appendix. After string convergence, each image underwent 200 ns of unrestrained MD for an aggregate simulation time of 11.2 μs. The combined trajectories were used for MSM analysis and computing the effective free energy profiles. TICA calculations were then performed on the systems and the trajectory frames were then clustered in projected space using the k-means algorithm. From the resultant clusters, MSMs were constructed and used to estimate probability fluxes, as well as transition timescales in and out of each PCCA+ macrostate. For the DNA structure analysis 1,000 frames were selected from each macrostate and analyzed using the Curves+. All TICA and MSM analysis details are provided in SI Appendix, Supplementary Methods.
Supplementary Material
Acknowledgments
This work was supported by National Institutes of Health Grant GM110387 and National Science Foundation Grant MCB-1149521. Computational resources were provided in part by allocations from the National Science Foundation’s Extreme Science and Engineering Discovery Environment program CHE110042 and the National Energy Research Scientific Computing Center supported by the Department of Energy Office of Science Contract DE-AC02-05CH11231.
Footnotes
The authors declare no conflict of interest.
This article is a PNAS Direct Submission.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1803323115/-/DCSupplemental.
References
- 1.Zharkov DO. Base excision DNA repair. Cell Mol Life Sci. 2008;65:1544–1565. doi: 10.1007/s00018-008-7543-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Bellacosa A, Drohat AC. Role of base excision repair in maintaining the genetic and epigenetic integrity of CpG sites. DNA Repair (Amst) 2015;32:33–42. doi: 10.1016/j.dnarep.2015.04.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Liu MY, DeNizio JE, Schutsky EK, Kohli RM. The expanding scope and impact of epigenetic cytosine modifications. Curr Opin Chem Biol. 2016;33:67–73. doi: 10.1016/j.cbpa.2016.05.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Breiling A, Lyko F. Epigenetic regulatory functions of DNA modifications: 5-methylcytosine and beyond. Epigenetics Chromatin. 2015;8:24. doi: 10.1186/s13072-015-0016-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Kohli RM, Zhang Y. TET enzymes, TDG and the dynamics of DNA demethylation. Nature. 2013;502:472–479. doi: 10.1038/nature12750. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Eden S, Hashimshony T, Keshet I, Cedar H, Thorne AW. DNA methylation models histone acetylation. Nature. 1998;394:842. doi: 10.1038/29680. [DOI] [PubMed] [Google Scholar]
- 7.Jones PA, Laird PW. Cancer epigenetics comes of age. Nat Genet. 1999;21:163–167. doi: 10.1038/5947. [DOI] [PubMed] [Google Scholar]
- 8.Baylin SB, Jones PA. A decade of exploring the cancer epigenome–Biological and translational implications. Nat Rev Cancer. 2011;11:726–734. doi: 10.1038/nrc3130. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Issa JP. CpG island methylator phenotype in cancer. Nat Rev Cancer. 2004;4:988–993. doi: 10.1038/nrc1507. [DOI] [PubMed] [Google Scholar]
- 10.Issa JP. Aging and epigenetic drift: A vicious cycle. J Clin Invest. 2014;124:24–29. doi: 10.1172/JCI69735. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Ito S, et al. Tet proteins can convert 5-methylcytosine to 5-formylcytosine and 5-carboxylcytosine. Science. 2011;333:1300–1303. doi: 10.1126/science.1210597. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Pastor WA, Aravind L, Rao A. TETonic shift: Biological roles of TET proteins in DNA demethylation and transcription. Nat Rev Mol Cell Biol. 2013;14:341–356. doi: 10.1038/nrm3589. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Maiti A, Drohat AC. Thymine DNA glycosylase can rapidly excise 5-formylcytosine and 5-carboxylcytosine: Potential implications for active demethylation of CpG sites. J Biol Chem. 2011;286:35334–35338. doi: 10.1074/jbc.C111.284620. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.He YF, et al. Tet-mediated formation of 5-carboxylcytosine and its excision by TDG in mammalian DNA. Science. 2011;333:1303–1307. doi: 10.1126/science.1210944. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Cortázar D, et al. Embryonic lethal phenotype reveals a function of TDG in maintaining epigenetic stability. Nature. 2011;470:419–423. doi: 10.1038/nature09672. [DOI] [PubMed] [Google Scholar]
- 16.Li YQ, Zhou PZ, Zheng XD, Walsh CP, Xu GL. Association of Dnmt3a and thymine DNA glycosylase links DNA methylation with base-excision repair. Nucleic Acids Res. 2007;35:390–400. doi: 10.1093/nar/gkl1052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Tini M, et al. Association of CBP/p300 acetylase and thymine DNA glycosylase links DNA repair and transcription. Mol Cell. 2002;9:265–277. doi: 10.1016/s1097-2765(02)00453-7. [DOI] [PubMed] [Google Scholar]
- 18.Um S, et al. Retinoic acid receptors interact physically and functionally with the T:G mismatch-specific thymine-DNA glycosylase. J Biol Chem. 1998;273:20728–20736. doi: 10.1074/jbc.273.33.20728. [DOI] [PubMed] [Google Scholar]
- 19.Hashimoto H. Structural and mutation studies of two DNA demethylation related glycosylases: MBD4 and TDG. Biophysics (Nagoya-shi) 2014;10:63–68. doi: 10.2142/biophysics.10.63. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Hashimoto H, Zhang X, Cheng X. Excision of thymine and 5-hydroxymethyluracil by the MBD4 DNA glycosylase domain: Structural basis and implications for active DNA demethylation. Nucleic Acids Res. 2012;40:8276–8284. doi: 10.1093/nar/gks628. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Banavali NK, MacKerell AD., Jr Free energy and structural pathways of base flipping in a DNA GCGC containing sequence. J Mol Biol. 2002;319:141–160. doi: 10.1016/S0022-2836(02)00194-8. [DOI] [PubMed] [Google Scholar]
- 22.Hitomi K, Iwai S, Tainer JA. The intricate structural chemistry of base excision repair machinery: Implications for DNA damage recognition, removal, and repair. DNA Repair (Amst) 2007;6:410–428. doi: 10.1016/j.dnarep.2006.10.004. [DOI] [PubMed] [Google Scholar]
- 23.Priyakumar UD, MacKerell AD., Jr Computational approaches for investigating base flipping in oligonucleotides. Chem Rev. 2006;106:489–505. doi: 10.1021/cr040475z. [DOI] [PubMed] [Google Scholar]
- 24.Jacobs AL, Schär P. DNA glycosylases: In DNA repair and beyond. Chromosoma. 2012;121:1–20. doi: 10.1007/s00412-011-0347-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Cao C, Jiang YL, Krosky DJ, Stivers JT. The catalytic power of uracil DNA glycosylase in the opening of thymine base pairs. J Am Chem Soc. 2006;128:13034–13035. doi: 10.1021/ja062978n. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Cao C, Jiang YL, Stivers JT, Song F. Dynamic opening of DNA during the enzymatic search for a damaged base. Nat Struct Mol Biol. 2004;11:1230–1236. doi: 10.1038/nsmb864. [DOI] [PubMed] [Google Scholar]
- 27.Bruner SD, Norman DP, Verdine GL. Structural basis for recognition and repair of the endogenous mutagen 8-oxoguanine in DNA. Nature. 2000;403:859–866. doi: 10.1038/35002510. [DOI] [PubMed] [Google Scholar]
- 28.Fromme JC, Verdine GL. DNA lesion recognition by the bacterial repair enzyme MutM. J Biol Chem. 2003;278:51543–51548. doi: 10.1074/jbc.M307768200. [DOI] [PubMed] [Google Scholar]
- 29.Qi Y, et al. Encounter and extrusion of an intrahelical lesion by a DNA repair enzyme. Nature. 2009;462:762–766. doi: 10.1038/nature08561. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Parker JB, et al. Enzymatic capture of an extrahelical thymine in the search for uracil in DNA. Nature. 2007;449:433–437. doi: 10.1038/nature06131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Knips A, Zacharias M. Both DNA global deformation and repair enzyme contacts mediate flipping of thymine dimer damage. Sci Rep. 2017;7:41324. doi: 10.1038/srep41324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Maiti A, Morgan MT, Pozharski E, Drohat AC. Crystal structure of human thymine DNA glycosylase bound to DNA elucidates sequence-specific mismatch recognition. Proc Natl Acad Sci USA. 2008;105:8890–8895. doi: 10.1073/pnas.0711061105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Coey CT, et al. Structural basis of damage recognition by thymine DNA glycosylase: Key roles for N-terminal residues. Nucleic Acids Res. 2016;44:10248–10258. doi: 10.1093/nar/gkw768. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Hamelberg D, de Oliveira CA, McCammon JA. Sampling of slow diffusive conformational transitions with accelerated molecular dynamics. J Chem Phys. 2007;127:155102. doi: 10.1063/1.2789432. [DOI] [PubMed] [Google Scholar]
- 35.Schwantes CR, Pande VS. Improvements in Markov state model construction reveal many non-native interactions in the folding of NTL9. J Chem Theory Comput. 2013;9:2000–2009. doi: 10.1021/ct300878a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Pérez-Hernández G, Paul F, Giorgino T, De Fabritiis G, Noé F. Identification of slow molecular order parameters for Markov model construction. J Chem Phys. 2013;139:015102. doi: 10.1063/1.4811489. [DOI] [PubMed] [Google Scholar]
- 37.Noé F, Schütte C, Vanden-Eijnden E, Reich L, Weikl TR. Constructing the equilibrium ensemble of folding pathways from short off-equilibrium simulations. Proc Natl Acad Sci USA. 2009;106:19011–19016. doi: 10.1073/pnas.0905466106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Scherer MK, et al. PyEMMA 2: A software package for estimation, validation, and analysis of Markov models. J Chem Theory Comput. 2015;11:5525–5542. doi: 10.1021/acs.jctc.5b00743. [DOI] [PubMed] [Google Scholar]
- 39.Metzner P, Schütte C, Vanden-Eijnden E. Illustration of transition path theory on a collection of simple examples. J Chem Phys. 2006;125:084110. doi: 10.1063/1.2335447. [DOI] [PubMed] [Google Scholar]
- 40.Lavery R, Moakher M, Maddocks JH, Petkeviciute D, Zakrzewska K. Conformational analysis of nucleic acids revisited: Curves+ Nucleic Acids Res. 2009;37:5917–5929. doi: 10.1093/nar/gkp608. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Hunter WN, et al. The structure of guanosine-thymidine mismatches in B-DNA at 2.5-A resolution. J Biol Chem. 1987;262:9962–9970. doi: 10.2210/pdb113d/pdb. [DOI] [PubMed] [Google Scholar]
- 42.Tsutakawa SE, Jingami H, Morikawa K. Recognition of a TG mismatch: The crystal structure of very short patch repair endonuclease in complex with a DNA duplex. Cell. 1999;99:615–623. doi: 10.1016/s0092-8674(00)81550-0. [DOI] [PubMed] [Google Scholar]
- 43.Lamers MH, et al. The crystal structure of DNA mismatch repair protein MutS binding to a G x T mismatch. Nature. 2000;407:711–717. doi: 10.1038/35037523. [DOI] [PubMed] [Google Scholar]
- 44.Obmolova G, Ban C, Hsieh P, Yang W. Crystal structures of mismatch repair protein MutS and its complex with a substrate DNA. Nature. 2000;407:703–710. doi: 10.1038/35037509. [DOI] [PubMed] [Google Scholar]
- 45.Zhu B, et al. 5-methylcytosine DNA glycosylase activity is also present in the human MBD4 (G/T mismatch glycosylase) and in a related avian sequence. Nucleic Acids Res. 2000;28:4157–4165. doi: 10.1093/nar/28.21.4157. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Qi Y, Spong MC, Nam K, Karplus M, Verdine GL. Entrapment and structure of an extrahelical guanine attempting to enter the active site of a bacterial DNA glycosylase, MutM. J Biol Chem. 2010;285:1468–1478. doi: 10.1074/jbc.M109.069799. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Chung SJ, Verdine GL. Structures of end products resulting from lesion processing by a DNA glycosylase/lyase. Chem Biol. 2004;11:1643–1649. doi: 10.1016/j.chembiol.2004.09.014. [DOI] [PubMed] [Google Scholar]
- 48.Nelson SR, Dunn AR, Kathe SD, Warshaw DM, Wallace SS. Two glycosylase families diffusively scan DNA using a wedge residue to probe for and identify oxidatively damaged bases. Proc Natl Acad Sci USA. 2014;111:E2091–E2099. doi: 10.1073/pnas.1400386111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Dunn AR, Kad NM, Nelson SR, Warshaw DM, Wallace SS. Single Qdot-labeled glycosylase molecules use a wedge amino acid to probe for lesions while scanning along DNA. Nucleic Acids Res. 2011;39:7487–7498. doi: 10.1093/nar/gkr459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Vanden-Eijnden E, Venturoli M. Revisiting the finite temperature string method for the calculation of reaction tubes and free energies. J Chem Phys. 2009;130:194103. doi: 10.1063/1.3130083. [DOI] [PubMed] [Google Scholar]
- 51.Májek P, Elber R. Milestoning without a reaction coordinate. J Chem Theory Comput. 2010;6:1805–1817. doi: 10.1021/ct100114j. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Mathews DH, Case DA. Nudged elastic band calculation of minimal energy paths for the conformational change of a GG non-canonical pair. J Mol Biol. 2006;357:1683–1693. doi: 10.1016/j.jmb.2006.01.054. [DOI] [PubMed] [Google Scholar]
- 53.Kossmann B, Ivanov I. Alkylpurine glycosylase D employs DNA sculpting as a strategy to extrude and excise damaged bases. PLoS Comput Biol. 2014;10:e1003704. doi: 10.1371/journal.pcbi.1003704. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Bergonzo C, Campbell AJ, Walker RC, Simmerling C. A partial nudged elastic band implementation for use with large or explicitly solvated systems. Int J Quantum Chem. 2009;109:3781–3790. doi: 10.1002/qua.22405. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Pan AC, Sezer D, Roux B. Finding transition pathways using the string method with swarms of trajectories. J Phys Chem B. 2008;112:3432–3440. doi: 10.1021/jp0777059. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Maragliano L, Fischer A, Vanden-Eijnden E, Ciccotti G. String method in collective variables: Minimum free energy paths and isocommittor surfaces. J Chem Phys. 2006;125:24106. doi: 10.1063/1.2212942. [DOI] [PubMed] [Google Scholar]
- 57.Slupphaug G, et al. A nucleotide-flipping mechanism from the structure of human uracil-DNA glycosylase bound to DNA. Nature. 1996;384:87–92. doi: 10.1038/384087a0. [DOI] [PubMed] [Google Scholar]
- 58.Parikh SS, et al. Uracil-dna glycosylase-DNA substrate and product structures: Conformational strain promotes catalytic efficiency by coupled stereoelectronic effects. Proc Natl Acad Sci USA. 2000;97:5083–5088. doi: 10.1073/pnas.97.10.5083. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Parikh SS, et al. Base excision repair initiation revealed by crystal structures and binding kinetics of human uracil-DNA glycosylase with DNA. EMBO J. 1998;17:5214–5226. doi: 10.1093/emboj/17.17.5214. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Mol CD, et al. Crystal structure of human uracil-DNA glycosylase in complex with a protein inhibitor: Protein mimicry of DNA. Cell. 1995;82:701–708. doi: 10.1016/0092-8674(95)90467-0. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.