A two-step nucleotide-flipping mechanism enables kinetic discrimination of DNA lesions by AGT

Jie Hu; Ao Ma; Aaron R Dinner

doi:10.1073/pnas.0708058105

. 2008 Mar 19;105(12):4615–4620. doi: 10.1073/pnas.0708058105

A two-step nucleotide-flipping mechanism enables kinetic discrimination of DNA lesions by AGT

Jie Hu ¹, Ao Ma ^1,^*, Aaron R Dinner ^1,^†

PMCID: PMC2290773 PMID: 18353991

Abstract

O⁶-alkylguanine-DNA alkyltransferase (AGT) repairs damage to the human genome by flipping guanine and thymine bases into its active site for irreversible transfer of alkyl lesions to Cys-145, but how the protein identifies its targets has remained unknown. Understanding molecular recognition in this system, which can serve as a paradigm for the many nucleotide-flipping proteins that regulate genes and repair DNA in all kingdoms of life, is particularly important given that inhibitors are in clinical trials as anticancer therapeutics. Computational approaches introduced recently for harvesting and statistically characterizing trajectories of molecularly rare events now enable us to elucidate a pathway for nucleotide flipping by AGT and the forces that promote it. In contrast to previously proposed flipping mechanisms, we observe a two-step process that promotes a kinetic, rather than a thermodynamic, gate-keeping strategy for lesion discrimination. Connection is made to recent single-molecule studies of DNA-repair proteins sliding on DNA to understand how they sense subtle chemical differences between bases efficiently.

Keywords: commitment probabilities, DNA repair, reaction coordinates, transition path sampling

Many proteins flip nucleotides from the DNA base stack into their active sites to remove or chemically modify bases for gene regulation and repair (1–4). Based on an x-ray crystallographic structure of uracil-DNA glycosylase (UDG) (2), it was proposed that, as the protein compresses the DNA backbone, the nucleotide that flips is “pushed” by a finger residue and “pulled” by specific interactions in the active site (5, 6). Alternatively, more passive mechanisms in which the protein shifts the equilibrium for base pair opening have been suggested (7, 8). Evaluating these mechanisms for different proteins in physiologically relevant contexts is important for understanding how nucleotide-flipping proteins identify their targets (4). Here, we study O⁶-alkylguanine-DNA alkyltransferase (AGT), which repairs alkylated guanine and thymine DNA bases by transferring the alkyl lesions irreversibly to an active-site cysteine (Cys-145; Fig. 1a). This uncommon direct damage reversal mechanism makes AGT of fundamental interest, although understanding its behavior is also of medical significance because inhibitors are in clinical trials as anticancer therapeutics (9).

Fig. 1. — Structure of AGT. (a) Chemical mechanism (21). (b) Comparison of active-site structures: blue, unmodified wt (PDB entry 1EH6) (17); red, wt with methylated Cys-145 (1EH7) (17); gray, wt with benzylated Cys-145 (1EH8) (17); orange, C145S mutant with DNA containing mGua (1T38) (21); yellow, wt with DNA containing N¹,O⁶-ethanoxanthine (1T39) (21); tan, wt bound to N4-alkylcytosine (1YFH structure C) (22). (c) Wild-type–mGua complex constructed for the simulations; water molecules and counterions are not shown. The extrahelical mGua nucleotide (orange; with 5′ and 3′ phosphate groups marked) binds within the enzyme active site buried by an active-site loop [Gly-153–Gly-160 (mauve)]. The “arginine finger” [Arg-128 (red)], at the beginning of the recognition helix [Ala-127–Gly-136 (royal blue)], is positioned inside the DNA duplex (silver) where it intercalates via the minor groove; Tyr-114 is shown in green.

In principle, molecular dynamics simulations can provide atomic-resolution pathways for nucleotide flipping and the forces that promote them, but they are limited to tens of nanoseconds at present. Because instances of spontaneous base flipping are estimated to be separated by milliseconds even in the presence of a protein (4), it is not possible to harvest a statistically meaningful number of such events in simulations without restraints. Free energy projections along selected coordinates can be obtained from simulations in which harmonic “umbrella” potentials are used to enhance sampling of low probability states; such simulations showed that the cytosine-5-methyltransferase of the HhaI bacterium stabilizes a fully flipped state and a major groove pathway that leads to it (4, 10). Nevertheless, the information obtained from such simulations is thermodynamic, not kinetic, and different projections can lead to qualitatively different conclusions.

Here, we exploit recent advances in computational approaches (11–14) to obtain explicit dynamic trajectories for protein-catalyzed nucleotide flipping and translate them into a physically understandable mechanism. The simulations of AGT reveal a two-step mechanism consistent with existing data for mutants. The finger residue (Arg-128) captures spontaneous base pair fluctuations to stabilize an intermediate in which the lesion is extrahelical; a loop (residues Gly-153 to Gly-160) gates the active site, and these motions determine the kinetics of the second step. Comparison of free energies for flipping guanine (Gua) and O⁶-methylguanine (mGua) suggests that the extrahelical intermediate identified enables kinetic discrimination between damaged and undamaged bases. We relate this mechanism to molecular parameters estimated in recent experimental studies that track single DNA-repair proteins sliding on DNA (15) (Y. Lin, T. Zhao, Z. Farooqui, X. Qu, C. He, A.R.D., and N. F. Scherer, unpublished data).

Results

Path Sampling Simulations Reveal a Two-Step Mechanism.

Because of its medical relevance (9), AGT has been the focus of many biochemical (16–19) and structural (17, 20–22) studies, making it an attractive system for computational investigation. For the calculations, we constructed a complex between the wild type (wt) and a DNA duplex containing mGua from the structure of the catalytically inactive C145S mutant bound to that ligand (21). Comparison of structures for the protein pre- and postrepair and in complex with various oligomers (Fig. 1b) shows that the active-site geometry and thus the mode of recognition is the same in all cases, which suggests that our results do not depend on the choice of starting structure. The protein binds the DNA through a helix–turn–helix motif with the second (“recognition”) helix (Ala-127–Gly-136) buried deeply within the minor groove (Fig. 1c). The “arginine finger” (Arg-128) is located at the N-terminal end of this helix, and its side chain intercalates in the DNA base stack to make hydrogen bonds with the orphaned cytosine.

Although lesion detection cannot be studied by straightforward molecular dynamics simulations, it can be simulated with transition path sampling (TPS), a Monte Carlo procedure that harvests trajectories that are constrained to go from one stable state (e.g., the base in the stack) to another (e.g., the base in the active site) but are otherwise unbiased (11, 12). In practice, such simulations consist of repeatedly “shooting”: selecting a phase space point from an existing trajectory with a flipping event, perturbing it, integrating the equations of motion to the stable states (basins), and accepting the resulting trajectory if and only if it still contains a flipping event. By eliminating the time required to wait for spontaneous fluctuations that give rise to flipping from a stable state, TPS enables the efficient study of the dynamics of interest. However, at least one trajectory with a flipping event is needed to initiate the procedure.

To overcome this last problem, we recently introduced bias annealing (14), in which a path for a molecularly rare event is initially generated with targeted molecular dynamics (23) and the resulting bias is removed by shooting with progressively lower force constants for the harmonic restraints used to drive the transition of interest, while enforcing the TPS basin constraints. This method was shown to be an order of magnitude faster than gradually shifting the path ensemble by changing the TPS basin definitions for a two-dimensional model potential (14). We believe that the computational savings is even greater for larger systems, such as that studied here, because the simulations are accelerated exponentially with the bias strength. In addition to the advantage that fewer total integration steps are required to obtain an unbiased path of full length, this feature permits an investigator to explore different possible basin definitions rapidly during the early phase of a project to select those that result in the greatest shooting efficiency. We discuss the strengths and weaknesses of the bias annealing procedure in more detail in ref. 14.

Bias annealing now makes possible the study of the kinetics [rather than simply the thermodynamics (4, 10)] of protein-promoted nucleotide flipping with TPS. Our application of the bias annealing method to the AGT-DNA system (Fig. 1c) revealed that mGua flips through the major groove via a metastable state outside both the DNA base stack and the protein active site. We then used TPS to harvest >100 unbiased trajectories for each of the two transitions: to this extrahelical intermediate from the base stack and from the intermediate to the active site. This number of trajectories is statistically significant given that paths were found to decorrelate within approximately three shooting moves, as estimated by a procedure adapted from ref. 24 [supporting information (SI) Fig. 5].

Dynamics of Flipping from the DNA Base Stack.

Objective description of the paths requires the identification of physically meaningful coordinates that are capable of distinguishing (meta)stable states from transition states defined dynamically by their probabilities to commit to a basin (p). Because of the large number of degrees of freedom in complex systems, relating p to structural and energetic properties of the system by trial-and-error is very costly in terms of both human and computational resources and thus has been achieved for only a handful of relatively simple systems (12, 25). We previously introduced the idea that informatic methods can be used to efficiently search an otherwise prohibitively large number of candidate physical variables for those combinations that can predict p (13), and we apply these methods here to achieve a level of description unprecedented for a large biomolecular system.

For the unstacking step, the informatic method paired the distance between the carbonyl oxygen and C^ζ atoms of Arg-128 (d₁) with the distance between the H1′ atom of mGua and the H3′ atom of its Watson–Crick partner (d₂). Projection of the dynamics onto these coordinates (Fig. 2 and SI Fig. 6a) shows that they are indeed capable of separating the unflipped state (cyan) from the extrahelical intermediate (green) and the transition states that connect them (red). This choice of coordinates was validated by comparison of the dynamics with an independent umbrella sampling calculation restricted to the predicted transition state region (Fig. 2b). The stable and transition states fall on well defined basins and saddlepoints, respectively, in this projection of the free energy. Such a high correspondence is not expected for arbitrary coordinates (see ref. 25 for a discussion), and we thus feel confident that these coordinates adequately describe the dynamics.

Fig. 2. — Flipping from the base stack to the extrahelical intermediate. (a) Range studied in terms of the pseudodihedral angle. (b) Free energy as a function of the coordinates identified by the informatic approach (contours are spaced every 0.2 kcal·mol⁻¹). Colored points indicate structures for which commitment probabilities (p) were calculated: cyan, mGua intrahelical; green, extrahelical; red, at a transition state. (c) Representative structures for the transition (see SI Movie 1 for animation).

The coordinates d₁ and d₂ measure the extension of the arginine finger and the flipping of the base (as the backbone sugars become closer). The transition states for the process occur with d₁ ≈ 4.6 Å and d₂ ≈ 15.1 Å compared with 3 < d₁ < 4 Å and 11.5 < d₂ < 16.5 Å for the intrahelical state and 5 < d₁ < 7 Å and 12 < d₂ < 16 Å for the extrahelical state (SI Fig. 6a). The fact that d₂ must fluctuate to the top of its range before d₁ can increase indicates that the insertion of Arg-128 follows the loss of base pairing. This analysis, together with visual inspection of trajectories, suggests the following sequence of molecular events (Fig. 2c). The mGua and Cyt bases fluctuate apart by shifting their pattern of hydrogen bonds while maintaining the orientation of the planes containing their aromatic rings. Arg-128 captures this transient motion to form a structure in which it interacts comparably with both bases. It then shifts its interaction to Cyt, which enables mGua to diffuse into solution to form the extrahelical intermediate. This mechanism explains the experimental observation that side chain length at this position is directly proportional to the rate of DNA repair (17) more readily than does one in which Arg-128 pushes (longer side chains are floppier). Other highly ranked combinations of coordinates for this step yield qualitatively similar conclusions.

Dynamics of Insertion into the Active Site.

The coordinates selected to describe the transition from the extrahelical intermediate to the active site were the distance between the methyl carbon of mGua and the C^α atom of Ala-154 (d₃) and the distance between the H3′ atom in the deoxyribose ring of the nucleotide sequentially 3′ to the flipping base and the C^γ atom of Asn-157 (d₄) (Fig. 3 and SI Fig. 6b). These coordinates were validated as above by comparison of the predicted commitment probabilities with an independently calculated free energy projection (Fig. 3b). Physically, these coordinates reflect the position of the flipping base and DNA backbone relative to a loop (Gly-153 to Gly-160) that gates the active site. For the base to enter the active site, this loop must undergo fluctuations in position (e.g., the nonhydrogen atoms of Gly-156 move ≈3 Å compared with 1–2 Å for other atoms in the protein); ≈30 ps are required for the system to relax to the fully flipped state from the transition state for this step because of minor local free energy minima, but the base is committed to that stable state once past the loop.

Fig. 3. — Same as Fig. 2 for flipping from the extrahelical intermediate into the active site. In b, colors indicate the extrahelical intermediate (green), transition states (red), and the base in the active site (cyan). *Upper* and *Lower* in c are two views of the same structures (see SI Movies 2 and 3 for animation).

Tyr-114 Acts Electrostatically Rather than Sterically.

It is important to emphasize that the coordinates selected by the informatic procedure are not wholly unique and concern only the base-flipping dynamics. Others are likely to be important for repair, which involves protein binding and catalysis as well. Given these facts, we examined the behavior of Tyr-114, mutations of which impact the overall kinetics of repair to some degree (16). In the simulations, Tyr-114 hydrogen-bonds to the phosphate group 3′ of mGua in the intrahelical conformation and then sequentially transfers its interactions to the N7 and the N3 atoms of the base during flipping (Fig. 3c). The first two of these interactions occur in transient structures and thus were not observed crystallographically (21, 22). In contrast to the suggestion that Tyr-114 interacts sterically with the 3′ phosphate group during flipping (21), we believe that it affects repair in three ways: by stabilizing binding before flipping, by providing hydrogen bonds during flipping, and by influencing the reactivity of the damaged base. All of these are likely to be modest electrostatic effects. That the interactions are of this nature rather than steric is consistent with the mutant data because hydrophobic residues still have substantial partial charges. In structures from the simulations in which we replaced Tyr-114 with phenylalanine (Phe), even without any relaxation, the interaction of Phe with the base was −3.2 kcal·mol⁻¹ compared with −5.6 kcal·mol⁻¹ for Tyr and that with the 3′ phosphate group was −1.2 kcal·mol⁻¹ compared with −1.8 kcal·mol⁻¹ for Tyr.

Comparison of the Energetics for Flipping Gua and mGua Suggests a Kinetic Gate-Keeping Strategy for Lesion Detection.

To determine the functional significance of the base-flipping mechanism elucidated, we compared the free energies for flipping Gua and mGua (Fig. 4 and SI Fig. 7). The free energies projected onto d₁ and d₂ (the unstacking step) were essentially the same for the two bases, with the extrahelical conformation somewhat lower in free energy than the intrahelical conformation. That the protein significantly stabilizes a base separated state (compare with SI Fig. 8) is consistent with findings for other DNA-repair proteins (4, 7, 8, 10, 26). For the active-site entry step, it was necessary to substitute the distance between the O⁶ atom of the flipping base and the C^α atom of Ala-154 (d₅) for d₃ because Gua lacks the methyl carbon. In that case, the free energies differed significantly: the barrier to Gua flipping is much higher (Fig. 4).

Fig. 4. — Comparison of free energies for flipping Gua (solid green) and mGua (solid red) from the extrahelical intermediate into the active site. Curves shown are linear cuts through the saddlepoint from two-dimensional free energy surfaces for d₄ and d₅ (SI Fig. 7): d₄ = −0.0857d₅ + 10.0 Å for Gua and d₄ = −0.0857d₅ + 10.2 Å for mGua. The dashed curves connect the highest and lowest free energy estimates for each (d₄, d₅) pair obtained from partitioning the data for each umbrella sampling window into three separate 20-ps intervals and re-performing the weighted histogram analysis; similar procedures were used to estimate the uncertainty in refs. 10 and 24.

These data suggest a “gate-keeping” strategy for lesion discrimination. Both Gua and mGua would flip to the extrahelical state, but only the latter would have a significant chance to continue on to the active site before flipping back to the intrahelical state. In contrast to that proposed for hOGG1 (3), this gate-keeping mechanism is kinetic rather than thermodynamic in nature: there is only a 3-fold preference for mGua over Gua once in the active site of AGT (18). A gate-keeping mechanism was also suggested for AGT based on a secondary binding site observed in one of the crystal structures (22), but the extrahelical nucleotide in the simulations differs significantly in structure from the partially flipped one observed crystallographically.

Discussion

Transition path sampling (TPS) has transformed the computational study of rare events in molecular systems over the last decade (11, 12), but its application to complex, biological systems has been limited (24, 25, 27–29). Efficient means for generating initial paths (14) together with informatic methods (13) that we introduced now enable us to harvest a statistically significant number of trajectories of a biomedically important stochastic process in its entirety and identify the features that characterize the ensemble of transition states. In contrast to the “push–pull” mechanism once proposed for nucleotide flipping (2, 5), we observe a two-step process that promotes a kinetic, rather than a thermodynamic (3, 7, 8), gate-keeping strategy for lesion discrimination.

Comparison of the simulation results with experimental data is needed to validate this recognition mechanism and refine it as needed. As already discussed, biochemical data for mutants of Tyr-114 (16) and Arg-128 (17) are consistent with the observed dynamics. The simulations also point to the importance of the active-site loop (Gly-153 to Gly-160). In support of this idea, mutants of many of these residues were recently obtained during directed evolution of AGT to maximize activity toward O⁶-benzylguanine derivatives (bGua) (30). In particular, A154T by itself was found to give a 4-fold increase in activity; this is of interest because Ala-154 interacts directly with the methyl group of mGua in the extrahelical intermediate in the simulations. At the same time, care must be used in interpreting these data. Although our simulations concern only nucleotide flipping, changes to the protein sequence are likely to affect both binding and catalysis, and the available data do not enable one to distinguish between these processes. Moreover, the binding mechanism for bGua in the absence of DNA, as in ref. 30, could be very different from that in its presence.

Perhaps the strongest prediction of our simulations is the existence of the extrahelical intermediate. As noted above, there is precedent for extrahelical intermediates in other DNA-repair systems. Base analogs have been used to obtain crystal structures of complexes of UDG and hOGG1 with partially flipped bases (3, 8). Although it is possible for base analogs to stabilize off-pathway intermediates, complementary data from NMR imino proton exchange experiments on UDG with a variety of DNA duplexes (7) suggest that these structures are meaningful representations of the unperturbed systems. Although a terminal overhanging thymine was fortuitously observed to be partially flipped in a structure of two AGT molecules with one bound to a modified cytosine (22), it is not clear that such a structure could be accessed by a nonterminal nucleotide; therefore, additional studies of AGT with alternate DNA constructs are needed. Because AGT can accommodate fluorescently active groups (31), it may also be possible to determine the number of states a base analog populates and the rates for transition between them from single-molecule fluorescence resonance energy transfer experiments with labeled wt and mutant AGTs.

Using the methods in ref. 32, we estimate (SI Table 1) the rate for partial flipping for mGua and Gua to be ≈10⁻³ to 10⁻² ps⁻¹; mGua proceeds rapidly to the active site (≈10⁻¹ ps⁻¹), whereas Gua is much slower (≈10⁻⁶ ps⁻¹). Assuming a diffusive step size of 1 bp, single-molecule tracking studies of the C-terminal domain of Ada, a bacterial homolog of AGT, yield a (helical) sliding rate of 2.6 × 10⁻⁴ steps ps⁻¹ (Y. Lin, T. Zhao, Z. Farooqui, X. Qu, C. He, A.R.D., and N. F. Scherer, unpublished data), which is comparable with that measured previously for hOGG1 (15). Comparison of the rates for sliding with those for flipping suggests that the kinetic gate-keeping mechanism of lesion discrimination would be operative and would allow the protein to scan the sequence rapidly compared with one requiring full flipping. At higher concentrations and in physiological contexts, it is likely that the DNA will be crowded with proteins (18, 19, 21, 22). Then, there will be more chance that undamaged bases enter the active site, but this possibility is not dangerous because the protein directly removes alkyl groups rather than excising bases. Overall, these considerations illustrate how a dialogue between theory and experiment can provide a quantitative understanding of how DNA-repair proteins can search DNA efficiently to maintain genomic integrity.

Methods

Obtaining Reactive Trajectories.

The molecules were represented by the CHARMM all-hydrogen force field (33–36); parameters for the mGua base were generated in a consistent fashion (see SI Methods, SI Fig. 9, and SI Tables 2 and 3). Simulations were performed with the TPS and bias annealing implementations in CHARMM (14, 25). Because TPS and bias annealing both construct new trial trajectories from existing ones (i.e., make local Monte Carlo moves in the space of paths), it is impossible to guarantee that all possible mechanisms are represented in the sampled paths; this issue is discussed further for this system in ref. 14. Measures of the quality of sampling (SI Figs. 5 and 10) suggest that the simulations explore the space efficiently and converge.

Attempts to drive flipping through the minor groove were unsuccessful because of steric clashes, and that pathway was not considered further. For steering the base from the active site through the major groove, a single harmonic restraint based on a pseudodihedral angle (θ) was used. The angle is defined by the centers-of-mass of nonhydrogen atoms in the flipping base, its sugar, the sugar of the 3′ nucleotide, and the base of the 3′ nucleotide together with its Watson–Crick partner (SI Fig. 11). During bias annealing, the target value of θ was advanced following natural fluctuations in the “forward” direction in steps of −0.06 rad from θ = −2.6 rad (the active site) through θ = −π rad to θ = 1.4 rad (the extrahelical intermediate) (SI Table 4). The trajectories harvested with TPS for this step were 80 ps in length and spanned the basins −2.7 ≤ θ ≤ −2.5 rad and 1.2 ≤ θ ≤ 1.6 rad. We believe that this length is more than adequate for sampling the reaction of interest because the actual transition times observed were typically <4 ps (SI Fig. 10a).

To obtain trajectories for forming an intrahelical conformation with Watson–Crick base pairing (θ = −0.2 rad), alternative harmonic restraints were needed. We used three separate restraints (SI Fig. 12): a pseudodihedral angle based on the C^ζ atom of Arg-128 and the centers-of-mass of heavy atoms in the Cyt base, the base 5′ of Cyt, and its Watson–Crick partner (a₁); a pseudodihedral angle based on the C^ζ atom of Arg-128 and the centers-of-mass of heavy atoms in the Cyt base, the base 3′ of Cyt, and its Watson–Crick partner (a₂); and the average of the distance between the N1 atom of mGua and the N4 atom of Cyt and the distance between the N2 atom of mGua and the N3 atom of Cyt (d₀). Bias annealing was performed with the target value of a₁ advanced by −0.02 rad from 0.0 rad to −0.5 rad, a₂ by 0.02 rad from −0.8 rad to −0.1 rad, and d₀ by 0.1 Å from 3.0 Å to 10.0 Å (SI Table 5). The final trajectories harvested with TPS were 60 ps; one basin was defined as −0.2 ≤ a₁ ≤ 0.2 rad, −1.0 ≤ a₂ ≤ −0.6 rad, and 2.8 ≤ s ≤ 3.2 Å, and the other basin was defined as −0.6 ≤ a₁ ≤ −0.4 rad, −0.3 ≤ a₂ ≤ 0.1 rad, and 8.0 ≤ s ≤ 12.0 Å. Again, the time spent between basins was typically a few picoseconds, which suggests that the choice of trajectory length did not restrict the paths sampled (SI Fig. 10b).

Identification of Reaction Coordinates.

To define the stable and transition states, we used p, the likelihood that additional trajectories initiated from a configuration with momenta drawn isotropically from a Maxwell–Boltzmann distribution commit to the forward basin before the backward basin (see SI Methods for details). The combinations of physical variables that predict p were identified by a genetic neural network (GNN) approach (13, 37). For both steps of the reaction of interest (unstacking and active-site entry), four classes of variables were considered: (i) intra- and intermolecular pairwise distances for the DNA and protein molecules, (ii) interaction energies between protein residues and DNA fragments surrounding the flipping base, (iii) accessible surface areas of protein residues at the protein–DNA binding interface, and (iv) fragment center-of-mass-based pseudodihedral angles that measure the relative positions of the flipping base. In total, we tested 3,558 coordinates for the unstacking step and 8,693 coordinates for the active-site entry step. The respective databases comprised the values of these variables and the commitment probabilities for 53 and 75 representative structures. These databases were found to be sufficient to obtain two-descriptor models that predicted the commitment probabilities in a jackknife cross-validated fashion with root-mean-square (rms) errors of 0.1 (compared with a range of 0–1 for p; SI Fig. 13). Because there is some confusion in the literature (38), it is important to emphasize that the structures in the databases were sampled independently of the reaction coordinates, p is calculated only once for these structures, and the GNN procedure does not require the performance of the standard histogram test. These features result in substantial computational savings, as described in ref. 13. Analyses conducted after the completion of this work suggest that there is some advantage to using an alternate scoring function for evaluating the fit and computational resources are better allocated such as to sample more structures with fewer trajectories (39), but these variations were not explored because adequate coordinates had already been identified. Free energy calculations and rate estimations are discussed in SI Methods.

Supplementary Material

Supporting Information

pnas_0708058105_index.html^{(7.4KB, html)}

Acknowledgments.

We thank Chuan He, Martin Karplus, Yihan Lin, Norbert Scherer, and Tong Zhao for helpful discussions and Sung-Sau So for providing the HIPPO program. The work was supported by National Science Foundation Award MCB-0547854. Some simulations were performed in collaboration with Stuart Rice on the Laboratory Computing Resource Center Jazz Cluster at Argonne National Laboratory.

Footnotes

The authors declare no conflict of interest.

This article is a PNAS Direct Submission.

This article contains supporting information online at www.pnas.org/cgi/content/full/0708058105/DC1.

References

1.Klimasauskas S, Kumar S, Roberts RJ, Cheng X. HhaI methyltransferase flips its target base out of the DNA helix. Cell. 1994;76:357–369. doi: 10.1016/0092-8674(94)90342-5. [DOI] [PubMed] [Google Scholar]
2.Slupphaug G, et al. A nucleotide-flipping mechanism from the structure of human uracil-DNA glycosylase bound to DNA. Nature. 1996;384:87–92. doi: 10.1038/384087a0. [DOI] [PubMed] [Google Scholar]
3.Banerjee A, Yang W, Karplus M, Verdine GL. Structure of a repair enzyme interrogating undamaged DNA elucidates recognition of damaged DNA. Nature. 2005;434:612–618. doi: 10.1038/nature03458. [DOI] [PubMed] [Google Scholar]
4.Priyamkumar UD, MacKerell AD. Computational approaches for investigating base flipping in oligonucleotides. Chem Rev. 2006;106:489–505. doi: 10.1021/cr040475z. [DOI] [PubMed] [Google Scholar]
5.Kunkel TA, Wilson SH. Push and pull of base flipping. Nature. 1996;384:25–26. doi: 10.1038/384025a0. [DOI] [PubMed] [Google Scholar]
6.Stivers JT, Pankiewicz KW, Watanabe KA. Kinetic mechanism of damage site recognition and uracil flipping by Escherichia coli uracil DNA glycosylase. Biochemistry. 1999;38:952–963. doi: 10.1021/bi9818669. [DOI] [PubMed] [Google Scholar]
7.Cao C, Jiang YL, Stivers JT, Song F. Dynamic opening of DNA during the enzymatic search for a damaged base. Nat Struct Biol. 2004;11:1230–1236. doi: 10.1038/nsmb864. [DOI] [PubMed] [Google Scholar]
8.Parker JB, et al. Enzymatic capture of an extrahelical thymine in the search for uracil in DNA. Nature. 2007;449:433–437. doi: 10.1038/nature06131. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Liu LL, Gerson SL. Targeted modulation of MGMT: Clinical implications. Clin Cancer Res. 2006;12:328–331. doi: 10.1158/1078-0432.CCR-05-2543. [DOI] [PubMed] [Google Scholar]
10.Huang N, Banavali NK, MacKerell AD. Protein-facilitated base flipping in DNA by cytosine-5-methyltransferase. Proc Natl Acad Sci USA. 2003;100:68–73. doi: 10.1073/pnas.0135427100. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Dellago C, Bolhuis PG, Csajka FS, Chandler D. Transition path sampling and the calculation of rate constants. J Chem Phys. 1998;108:1964–1977. [Google Scholar]
12.Bolhuis PG, Chandler D, Dellago C, Geissler PL. Transition path sampling: Throwing ropes over rough mountain passes, in the dark. Annu Rev Phys Chem. 2002;53:291–318. doi: 10.1146/annurev.physchem.53.082301.113146. [DOI] [PubMed] [Google Scholar]
13.Ma A, Dinner AR. Automatic method for identifying reaction coordinates in complex systems. J Phys Chem B. 2005;109:6769–6779. doi: 10.1021/jp045546c. [DOI] [PubMed] [Google Scholar]
14.Hu J, Ma A, Dinner AR. Bias annealing: A method for obtaining transition paths de novo. J Chem Phys. 2006;125:114101. doi: 10.1063/1.2335640. [DOI] [PubMed] [Google Scholar]
15.Blainey PC, vanOijen AM, Banerjee A, Verdine GL, Xie XS. A base-excision DNA-repair protein finds intrahelical lesion bases by fast sliding in contact with DNA. Proc Natl Acad Sci USA. 2006;103:5752–5757. doi: 10.1073/pnas.0509723103. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Goodtzova K, Kanugula S, Edara S, Pegg AE. Investigation of the role of tyrosine-114 in the activity of human O6-alkylguanine-DNA alkyltransferase. Biochemistry. 1998;37:12489–12495. doi: 10.1021/bi9811718. [DOI] [PubMed] [Google Scholar]
17.Daniels DS, et al. Active and alkylated human AGT structures: A novel zinc site, inhibitor and extrahelical base binding. EMBO J. 2000;19:1719–1730. doi: 10.1093/emboj/19.7.1719. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Rasimas JJ, Pegg AE, Fried MG. DNA-binding mechanism of O6-alkylguanine-DNA alkyltransferase: Effects of protein and DNA alkylation on complex stability. J Biol Chem. 2003;278:7973–7980. doi: 10.1074/jbc.M211854200. [DOI] [PubMed] [Google Scholar]
19.Rasimas JJ, Kar SR, Pegg AE, Fried MG. Interactions of human O6-alkylguanine-DNA alkyltransferase (AGT) with short single-stranded DNAs. J Biol Chem. 2007;282:3357–3366. doi: 10.1074/jbc.M608876200. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Wibley JEA, Pegg A, Moody PCE. Crystal structure of the human O6-alkylguanine-DNA alkyltransferase. Nucleic Acids Res. 2000;28:393–401. doi: 10.1093/nar/28.2.393. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Daniels DS, et al. DNA binding and nucleotide flipping by the human DNA repair protein AGT. Nat Struct Mol Biol. 2004;11:714–720. doi: 10.1038/nsmb791. [DOI] [PubMed] [Google Scholar]
22.Duguid EM, Rice PA, He C. The structure of the human AGT protein bound to DNA, its implications for damage detection. J Mol Biol. 2005;350:657–666. doi: 10.1016/j.jmb.2005.05.028. [DOI] [PubMed] [Google Scholar]
23.Schlitter J, Engels M, Kruger P, Jacoby E, Wollmer A. Targeted molecular-dynamics simulation of conformational change—Application to the T-R transition in insulin. Mol Simul. 1993;10:291–309. [Google Scholar]
24.Ma L, Cui Q. Activation mechanism of a signaling protein at atomic resolution from advanced computations. J Am Chem Soc. 2007;129:10261–10268. doi: 10.1021/ja073059f. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Hagan MF, Dinner AR, Chandler D, Chakraborty AK. Atomistic understanding of kinetic pathways for single base-pair binding and unbinding in DNA. Proc Natl Acad Sci USA. 2003;100:13922–13927. doi: 10.1073/pnas.2036378100. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Krosky DJ, Song F, Stivers JT. The origins of high-affinity enzyme binding to an extrahelical DNA base. Biochemistry. 2005;44:5949–5959. doi: 10.1021/bi050084u. [DOI] [PubMed] [Google Scholar]
27.Bolhuis PG. Transition-path sampling of hairpin folding. Proc Natl Acad Sci USA. 2003;100:12129–12134. doi: 10.1073/pnas.1534924100. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Radhakrishnan R, Schlick T. Orchestration of cooperative events in DNA synthesis and repair mechanism unraveled by transition path sampling of DNA polymerase's closing. Proc Natl Acad Sci USA. 2004;101:5970–5975. doi: 10.1073/pnas.0308585101. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Quaytman SL, Schwartz SD. Reaction coordinate of an enzymatic reaction revealed by transition path sampling. Proc Natl Acad Sci USA. 2007;104:12253–12258. doi: 10.1073/pnas.0704304104. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Gronemeyer T, Chidley C, Juillerat A, Heinis C, Johnsson K. Directed evolution of O6-alkylguanine-DNA alkyltransferase for applications in protein labeling. Prot Eng Des Sel. 2006;19:309–316. doi: 10.1093/protein/gzl014. [DOI] [PubMed] [Google Scholar]
31.Keppler A, et al. A general method for the covalent labeling of fusion proteins with small molecules in vivo. Nat Biotechnol. 2003;21:86–89. doi: 10.1038/nbt765. [DOI] [PubMed] [Google Scholar]
32.Ma A, Nag A, Dinner AR. Dynamic coupling between coordinates in a model for biomolecular isomerization. J Chem Phys. 2006;124:144911. doi: 10.1063/1.2183768. [DOI] [PubMed] [Google Scholar]
33.Brooks BR, et al. CHARMM: A program for macromolecular energy, minimization, and dynamics calculations. J Comput Chem. 1983;4:187–217. [Google Scholar]
34.MacKerell AD, et al. All-atom empirical potential for molecular modeling and dynamics studies of proteins. J Phys Chem B. 1998;102:3586–3616. doi: 10.1021/jp973084f. [DOI] [PubMed] [Google Scholar]
35.Foloppe N, MacKerell AD. All-atom empirical force field for nucleic acids: 1. Parameter optimization based on small molecule and condensed phase macromolecular target data. J Comput Chem. 2000;21:86–104. [Google Scholar]
36.MacKerell AD, Banavali NK. All-atom empirical force field for nucleic acids: 2. Application to molecular dynamics simulations of DNA, RNA in solution. J Comput Chem. 2000;21:105–120. [Google Scholar]
37.So S-S, Karplus M. Genetic neural networks for quantitative structure-activity relationships: Improvements and application of benzodiazepine affinity for benzodiazepine/GABAA receptors. J Med Chem. 1996;39:5246–5256. doi: 10.1021/jm960536o. [DOI] [PubMed] [Google Scholar]
38.Peters B, Trout BL. Obtaining reaction coordinates by likelihood maximization. J Chem Phys. 2006;125 doi: 10.1063/1.2234477. 054108. [DOI] [PubMed] [Google Scholar]
39.Peters B, Beckham GT, Trout BL. Extensions to the likelihood maximization approach for finding reaction coordinates. J Chem Phys. 2007;127 doi: 10.1063/1.2748396. 034109. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information

pnas_0708058105_index.html^{(7.4KB, html)}

pnas_0708058105_11.pdf^{(55.9KB, pdf)}

pnas_0708058105_10.pdf^{(88.1KB, pdf)}

pnas_0708058105_12.pdf^{(39.9KB, pdf)}

pnas_0708058105_13.pdf^{(35.7KB, pdf)}

pnas_0708058105_14.pdf^{(42.5KB, pdf)}

pnas_0708058105_15.pdf^{(31.7KB, pdf)}

pnas_0708058105_1.pdf^{(49.2KB, pdf)}

pnas_0708058105_2.pdf^{(441.5KB, pdf)}

pnas_0708058105_3.pdf^{(396.7KB, pdf)}

pnas_0708058105_4.pdf^{(56.3KB, pdf)}

pnas_0708058105_5.pdf^{(58.9KB, pdf)}

pnas_0708058105_6.pdf^{(51.1KB, pdf)}

pnas_0708058105_7.pdf^{(66.8KB, pdf)}

pnas_0708058105_8.pdf^{(70.9KB, pdf)}

pnas_0708058105_9.pdf^{(56.5KB, pdf)}

Download video file^{(4.2MB, mpg)}

Download video file^{(7.9MB, mpg)}

Download video file^{(10.2MB, mpg)}

[B1] 1.Klimasauskas S, Kumar S, Roberts RJ, Cheng X. HhaI methyltransferase flips its target base out of the DNA helix. Cell. 1994;76:357–369. doi: 10.1016/0092-8674(94)90342-5. [DOI] [PubMed] [Google Scholar]

[B2] 2.Slupphaug G, et al. A nucleotide-flipping mechanism from the structure of human uracil-DNA glycosylase bound to DNA. Nature. 1996;384:87–92. doi: 10.1038/384087a0. [DOI] [PubMed] [Google Scholar]

[B3] 3.Banerjee A, Yang W, Karplus M, Verdine GL. Structure of a repair enzyme interrogating undamaged DNA elucidates recognition of damaged DNA. Nature. 2005;434:612–618. doi: 10.1038/nature03458. [DOI] [PubMed] [Google Scholar]

[B4] 4.Priyamkumar UD, MacKerell AD. Computational approaches for investigating base flipping in oligonucleotides. Chem Rev. 2006;106:489–505. doi: 10.1021/cr040475z. [DOI] [PubMed] [Google Scholar]

[B5] 5.Kunkel TA, Wilson SH. Push and pull of base flipping. Nature. 1996;384:25–26. doi: 10.1038/384025a0. [DOI] [PubMed] [Google Scholar]

[B6] 6.Stivers JT, Pankiewicz KW, Watanabe KA. Kinetic mechanism of damage site recognition and uracil flipping by Escherichia coli uracil DNA glycosylase. Biochemistry. 1999;38:952–963. doi: 10.1021/bi9818669. [DOI] [PubMed] [Google Scholar]

[B7] 7.Cao C, Jiang YL, Stivers JT, Song F. Dynamic opening of DNA during the enzymatic search for a damaged base. Nat Struct Biol. 2004;11:1230–1236. doi: 10.1038/nsmb864. [DOI] [PubMed] [Google Scholar]

[B8] 8.Parker JB, et al. Enzymatic capture of an extrahelical thymine in the search for uracil in DNA. Nature. 2007;449:433–437. doi: 10.1038/nature06131. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B9] 9.Liu LL, Gerson SL. Targeted modulation of MGMT: Clinical implications. Clin Cancer Res. 2006;12:328–331. doi: 10.1158/1078-0432.CCR-05-2543. [DOI] [PubMed] [Google Scholar]

[B10] 10.Huang N, Banavali NK, MacKerell AD. Protein-facilitated base flipping in DNA by cytosine-5-methyltransferase. Proc Natl Acad Sci USA. 2003;100:68–73. doi: 10.1073/pnas.0135427100. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B11] 11.Dellago C, Bolhuis PG, Csajka FS, Chandler D. Transition path sampling and the calculation of rate constants. J Chem Phys. 1998;108:1964–1977. [Google Scholar]

[B12] 12.Bolhuis PG, Chandler D, Dellago C, Geissler PL. Transition path sampling: Throwing ropes over rough mountain passes, in the dark. Annu Rev Phys Chem. 2002;53:291–318. doi: 10.1146/annurev.physchem.53.082301.113146. [DOI] [PubMed] [Google Scholar]

[B13] 13.Ma A, Dinner AR. Automatic method for identifying reaction coordinates in complex systems. J Phys Chem B. 2005;109:6769–6779. doi: 10.1021/jp045546c. [DOI] [PubMed] [Google Scholar]

[B14] 14.Hu J, Ma A, Dinner AR. Bias annealing: A method for obtaining transition paths de novo. J Chem Phys. 2006;125:114101. doi: 10.1063/1.2335640. [DOI] [PubMed] [Google Scholar]

[B15] 15.Blainey PC, vanOijen AM, Banerjee A, Verdine GL, Xie XS. A base-excision DNA-repair protein finds intrahelical lesion bases by fast sliding in contact with DNA. Proc Natl Acad Sci USA. 2006;103:5752–5757. doi: 10.1073/pnas.0509723103. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B16] 16.Goodtzova K, Kanugula S, Edara S, Pegg AE. Investigation of the role of tyrosine-114 in the activity of human O6-alkylguanine-DNA alkyltransferase. Biochemistry. 1998;37:12489–12495. doi: 10.1021/bi9811718. [DOI] [PubMed] [Google Scholar]

[B17] 17.Daniels DS, et al. Active and alkylated human AGT structures: A novel zinc site, inhibitor and extrahelical base binding. EMBO J. 2000;19:1719–1730. doi: 10.1093/emboj/19.7.1719. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B18] 18.Rasimas JJ, Pegg AE, Fried MG. DNA-binding mechanism of O6-alkylguanine-DNA alkyltransferase: Effects of protein and DNA alkylation on complex stability. J Biol Chem. 2003;278:7973–7980. doi: 10.1074/jbc.M211854200. [DOI] [PubMed] [Google Scholar]

[B19] 19.Rasimas JJ, Kar SR, Pegg AE, Fried MG. Interactions of human O6-alkylguanine-DNA alkyltransferase (AGT) with short single-stranded DNAs. J Biol Chem. 2007;282:3357–3366. doi: 10.1074/jbc.M608876200. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B20] 20.Wibley JEA, Pegg A, Moody PCE. Crystal structure of the human O6-alkylguanine-DNA alkyltransferase. Nucleic Acids Res. 2000;28:393–401. doi: 10.1093/nar/28.2.393. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B21] 21.Daniels DS, et al. DNA binding and nucleotide flipping by the human DNA repair protein AGT. Nat Struct Mol Biol. 2004;11:714–720. doi: 10.1038/nsmb791. [DOI] [PubMed] [Google Scholar]

[B22] 22.Duguid EM, Rice PA, He C. The structure of the human AGT protein bound to DNA, its implications for damage detection. J Mol Biol. 2005;350:657–666. doi: 10.1016/j.jmb.2005.05.028. [DOI] [PubMed] [Google Scholar]

[B23] 23.Schlitter J, Engels M, Kruger P, Jacoby E, Wollmer A. Targeted molecular-dynamics simulation of conformational change—Application to the T-R transition in insulin. Mol Simul. 1993;10:291–309. [Google Scholar]

[B24] 24.Ma L, Cui Q. Activation mechanism of a signaling protein at atomic resolution from advanced computations. J Am Chem Soc. 2007;129:10261–10268. doi: 10.1021/ja073059f. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B25] 25.Hagan MF, Dinner AR, Chandler D, Chakraborty AK. Atomistic understanding of kinetic pathways for single base-pair binding and unbinding in DNA. Proc Natl Acad Sci USA. 2003;100:13922–13927. doi: 10.1073/pnas.2036378100. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B26] 26.Krosky DJ, Song F, Stivers JT. The origins of high-affinity enzyme binding to an extrahelical DNA base. Biochemistry. 2005;44:5949–5959. doi: 10.1021/bi050084u. [DOI] [PubMed] [Google Scholar]

[B27] 27.Bolhuis PG. Transition-path sampling of hairpin folding. Proc Natl Acad Sci USA. 2003;100:12129–12134. doi: 10.1073/pnas.1534924100. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B28] 28.Radhakrishnan R, Schlick T. Orchestration of cooperative events in DNA synthesis and repair mechanism unraveled by transition path sampling of DNA polymerase's closing. Proc Natl Acad Sci USA. 2004;101:5970–5975. doi: 10.1073/pnas.0308585101. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B29] 29.Quaytman SL, Schwartz SD. Reaction coordinate of an enzymatic reaction revealed by transition path sampling. Proc Natl Acad Sci USA. 2007;104:12253–12258. doi: 10.1073/pnas.0704304104. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B30] 30.Gronemeyer T, Chidley C, Juillerat A, Heinis C, Johnsson K. Directed evolution of O6-alkylguanine-DNA alkyltransferase for applications in protein labeling. Prot Eng Des Sel. 2006;19:309–316. doi: 10.1093/protein/gzl014. [DOI] [PubMed] [Google Scholar]

[B31] 31.Keppler A, et al. A general method for the covalent labeling of fusion proteins with small molecules in vivo. Nat Biotechnol. 2003;21:86–89. doi: 10.1038/nbt765. [DOI] [PubMed] [Google Scholar]

[B32] 32.Ma A, Nag A, Dinner AR. Dynamic coupling between coordinates in a model for biomolecular isomerization. J Chem Phys. 2006;124:144911. doi: 10.1063/1.2183768. [DOI] [PubMed] [Google Scholar]

[B33] 33.Brooks BR, et al. CHARMM: A program for macromolecular energy, minimization, and dynamics calculations. J Comput Chem. 1983;4:187–217. [Google Scholar]

[B34] 34.MacKerell AD, et al. All-atom empirical potential for molecular modeling and dynamics studies of proteins. J Phys Chem B. 1998;102:3586–3616. doi: 10.1021/jp973084f. [DOI] [PubMed] [Google Scholar]

[B35] 35.Foloppe N, MacKerell AD. All-atom empirical force field for nucleic acids: 1. Parameter optimization based on small molecule and condensed phase macromolecular target data. J Comput Chem. 2000;21:86–104. [Google Scholar]

[B36] 36.MacKerell AD, Banavali NK. All-atom empirical force field for nucleic acids: 2. Application to molecular dynamics simulations of DNA, RNA in solution. J Comput Chem. 2000;21:105–120. [Google Scholar]

[B37] 37.So S-S, Karplus M. Genetic neural networks for quantitative structure-activity relationships: Improvements and application of benzodiazepine affinity for benzodiazepine/GABAA receptors. J Med Chem. 1996;39:5246–5256. doi: 10.1021/jm960536o. [DOI] [PubMed] [Google Scholar]

[B38] 38.Peters B, Trout BL. Obtaining reaction coordinates by likelihood maximization. J Chem Phys. 2006;125 doi: 10.1063/1.2234477. 054108. [DOI] [PubMed] [Google Scholar]

[B39] 39.Peters B, Beckham GT, Trout BL. Extensions to the likelihood maximization approach for finding reaction coordinates. J Chem Phys. 2007;127 doi: 10.1063/1.2748396. 034109. [DOI] [PubMed] [Google Scholar]

PERMALINK

A two-step nucleotide-flipping mechanism enables kinetic discrimination of DNA lesions by AGT

Jie Hu

Ao Ma

Aaron R Dinner

Abstract

Fig. 1.

Results

Path Sampling Simulations Reveal a Two-Step Mechanism.

Dynamics of Flipping from the DNA Base Stack.

Fig. 2.

Dynamics of Insertion into the Active Site.

Fig. 3.

Tyr-114 Acts Electrostatically Rather than Sterically.

Comparison of the Energetics for Flipping Gua and mGua Suggests a Kinetic Gate-Keeping Strategy for Lesion Detection.

Fig. 4.

Discussion

Methods

Obtaining Reactive Trajectories.

Identification of Reaction Coordinates.

Supplementary Material

Acknowledgments.

Footnotes

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

A two-step nucleotide-flipping mechanism enables kinetic discrimination of DNA lesions by AGT

Jie Hu

Ao Ma

Aaron R Dinner

Abstract

Fig. 1.

Results

Path Sampling Simulations Reveal a Two-Step Mechanism.

Dynamics of Flipping from the DNA Base Stack.

Fig. 2.

Dynamics of Insertion into the Active Site.

Fig. 3.

Tyr-114 Acts Electrostatically Rather than Sterically.

Comparison of the Energetics for Flipping Gua and mGua Suggests a Kinetic Gate-Keeping Strategy for Lesion Detection.

Fig. 4.

Discussion

Methods

Obtaining Reactive Trajectories.

Identification of Reaction Coordinates.

Supplementary Material

Acknowledgments.

Footnotes

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases