Abstract
Base flipping is a structural mechanism common to many DNA processing and repair enzymes. Changes in the local backbone torsions that occur during base flipping and the effect of environment on their behavior are of particular interest in understanding different base flipping mechanisms. In the present study, structures sampled during umbrella sampling molecular dynamics (MD) simulations of base flipping in aqueous and protein-bound environments, carried out with two different MD simulation strategies, are analyzed to find the most significant phosphodiester backbone distortions in the vicinity of the flipping base. Torsional sampling on the 5′ side of the flipping base during flipping through the major groove show similarities to the torsional sampling on the 3′ side during flipping through the minor groove and vice versa. In differing environments, this behavior varies only marginally. These compensating torsional changes in the DNA backbone on 5′ and 3′ sides of the flipping base limit overall distortion of the DNA double helix during single base flipping. Rotameric intermediate states observed during base flipping are identified and postulated to be metastable states implicated in both large-scale structural changes and functional effects of chemical modifications in DNA.
Keywords: replication, DNA repair, molecular dynamics, umbrella sampling, structural analysis, backbone dihedrals, major groove, minor groove
Introduction
Hydrogen bonds between paired bases (e.g. Watson-Crick interactions) help to stabilize the DNA double helix but they also partially obscure the individual stacked bases from the environment. Although the chemical identity of the bases can be established through major groove interactions, for catalysis of chemical modification or repair of the bases, structural distortion away from the duplex state is required. Base flipping is one such commonly used structural mechanism seen in multiple crystal structures of DNA-protein complexes where the flipped state of a single base is stabilized in the enzyme active site1-5. The details of the intermediates leading from the base stacked state to the final flipped state are diffcult to characterize experimentally because they involve metastable structures that are not significantly populated. Chemical substitutions that trap intermediates by making these usually transient states more stable than either the standard double-helical state or the fully flipped state can sometimes highlight their detailed structural features6-8, but these still represent only snapshots along the flipping pathway.
Umbrella sampling molecular dynamics (MD) simulations have been used to investigate the energetic basis of the structural distortion of DNA associated with DNA base flipping in aqueous solution9-11. This approach allows for visualization of the base flipping intermediates not typically accessible to experimental methods. The relative free energy differences between base stacked and flipped states obtained through these investigations show near quantitative agreement with experiments probing base pair dynamics9;12-15. In the presence of a suitable protein environment, such as that provided by the cofactor-bound cytosine-5-methyltransferase from HhaI (M.HhaI), base flipping can be facilitated through destabilization of all states (including the original Watson-Crick paired stacked state) relative to the completely flipped state, through a balance between protein interactions and solvent exclusion16. Recognition of specific DNA sequences by M.HhaI is related to the sequence-dependent variability in protein facilitation of the base flipping process with the cognate sequence being the one that experiences the greatest extent of such facilitation17.
In the present study, analysis of all torsions in the local backbone region between the two sugars on either side of the flipping base is performed to identify the specific patterns in their distortions that occur during flipping. This focus on internal degrees of freedom of DNA is necessary to identify the details of the structural pathway utilized for base flipping. The energetic aspects of the interactions with the environment16-18 and with the extended neighboring DNA sequence itself9, while included explicitly in the MD simulations, have been addressed before and are not actively considered in the present analysis. The flipping of a central (underlined) cytosine base in the M.HhaI cognate sequence GTCAGCGCATGG has been studied16 using MD studies in four different environmental conditions: (a) in aqueous solution, (b) in the ”open” binary complex with M.HhaI in which the catalytic loop is in an extended, inactive conformation19, (c) in the ”closed” binary complex with M.HhaI in which the catalytic loop is in its closed, active conformation20, and (d) in the closed ternary complex with both the enzyme and cofactor present20. In addition, flipping of the cytosine and its base-paired guanine, was studied using a different simulation methodology in aqueous solution9. The goal of the present study is to identify the torsional sampling occurring during base flipping in different environments, and to determine the commonalities of these torsional changes. To fulfil this goal, the structures gathered during these MD simulation studies are analyzed to identify patterns of torsional changes occurring during base flipping, and to examine the influence of the environment on these patterns.
Motivation for this work came partly from an early study by Olson and coworkers on the mechanics of distortion of DNA to accomodate binding of carcinogenic compounds21. The study explored local torsional changes required to ensure closure of extended DNA sequences with bulges. It was suggested that simultaneous and anticorrelated variations of certain rotations around bonds whose bond vectors were approximately parallel to the helix axis would allow partial base flipping in DNA while preserving the direction of the helical backbone. Using a different nomenclature for backbone torsions than the presently accepted one22, that study also indicated that the torsion pairs (in the present nomenclature): γ and 3′-α or 5′-ε and ε on opposite (5′ vs. 3′) sides of the flipping base along the helical axis can mediate such base flipping. Our present MD simulation results support their prediction of compensating distortions on opposite (5′ vs. 3′) sides of the flipping base, while indicating that the backbone torsion changes involved are not simple pairwise correlations. Instead, they involve sampling of complex combinations of alternate minima in backbone torsion energy surfaces, mostly centered around the phosphodiester moiety, including some minima not typically populated in isolated DNA crystal structures.
Methods
The MD simulation methods used to obtain the structures analyzed in the present study have been described before9;16;17, and are only briefly summarized here. The program CHARMM23 was used to perform all calculations with the all-hydrogen protein24 and nucleic acid25;26 force field parameters, the TIP3P water model27, and published sodium parameters28.
A total of six different umbrella sampling MD simulations were analyzed in this study. Five of these simulations studied the flipping of the cytosine in: (a) aqueous solution with periodic boundary conditions and Particle Mesh Ewald (PME)29 electrostatics, (b) aqueous solution with stochastic boundary conditions, (c) the presence of M.HhaI in the open state (binary, open) with stochastic boundary conditions, (d) the presence of M.HhaI in the closed state (binary, closed) with stochastic boundary conditions, (e) the presence of M.HhaI with S-adenosyl-homocysteine cofactor in the closed state (ternary, closed) with stochastic boundary conditions The sixth simulation studied the flipping of the base-paired guanine in aqueous solution with periodic boundary conditions and PME electrostatics.
The M.HhaI-S-adenosylmethionine binary complex crystal structure19 (PDB ID: 1HMY) was used to model the open-binary complex. The closed-binary and ternary complex structures were modeled based on the M.HhaI-DNA-S-adenosylhomocysteine crystal structure20 (PDB ID: 5MHT). A pseudodihedral linkage between the four center-of-mass points created by the four non-hydrogen atom aggregates: (a) ipping cytosine base, (b) sugar attached to the flipping cytosine base, (c) sugar 3′ to the flipping base position on the same strand, and (d) base-pair 3′ to the flipping base position was the one dimensional reaction coordinate describing base flipping. The 360° range of this center-of-mass pseudodihedral restraint governing base flipping was spanned by generating 72 windows which were each sampled in the four different environments. The intermediate windows in the protein environment were prepared by least-squares fitting separately generated flipped DNA intermediate structures onto the crystal structure of bound DNA excluding the flipping base, followed by minimization to remove bad contacts. Stochastic boundary simulations with a 35 Å TIP3P water sphere with randomly distributed sodium ions were carried out for all four environments.
Periodic boundary PME constant temperature and pressure (NPT)30 base flipping umbrella sampling MD simulations were also carried out for the same sequence in a cubic pree-quilibrated solvent box with dimensions 54×54×54 Å3 with enough randomly distributed sodium ions to neutralize the system. Periodic boundary conditions were used with the images generated using the CRYSTAL module in CHARMM31. To use an integration timestep of 0.002 ps, the fast motion of covalent bonds with hydrogen atoms was constrained using SHAKE32. Long-range electrostatic interactions were treated with the PME approach29 with a B-spline order of 4 and a Fast Fourier Transform grid of one point per Å and a real-space gaussian-width kappa of 0.3 Å-1. Real space and Lennard-Jones (LJ) interaction cutoffs of 10 Å were used with non-bond interaction lists maintained and heuristically updated out to 12 Å.
These umbrella sampling simulations were performed for both cytosine and its base-paired partner, guanine, flipping separately out of the otherwise stacked, equilibrated B-form double helix. In the case of flipping of the guanine base, the atoms defining the pseudodihedral restraint were varied such that the four atom aggregates were: (a) flipping guanine base, (b) sugar attached to the flipping guanine base, (c) sugar 5′ to the flipping base position on the same strand, and (d) base-pair 5′ to the flipping base position.
Analysis of the backbone torsion parameters was carried out using a version of the FREEHELIX program33 modified to read CHARMM DNA trajectories. Time frames for analysis were saved at 1 ps intervals in the umbrella sampling MD simulations. The sampling of torsion pairs such as α - β, γ - δ, or ε - ζ obtained through the FREEHELIX analysis was distributed into two dimensional (2D) density histograms binned at 10° intervals in each torsion. These 2D density distributions were then converted into 2D percentage occupancy histograms by multiplying each bin by 100 such that a cumulative percentage value of 100 was obtained if the values in each bin over the entire 2D torsional space were integrated. The maximum percentage occupancy of each bin in this histogram was cut off at a maximum of 5 % to allow visualization of the regions of 2D torsional space showing less than 1 % occupancy. The minor groove flipping pathway was defined as the sampling that occurred within the pseudodihedral flipping coordinate umbrella sampling windows between 10° and 180°; all the other windows were classified as the major groove pathway. The main aim of this 2D analysis is to identify the sampling of every torsion with respect to every other torsion, for all torsions intimately involved in base flipping. All figures are made using the OPENDX program, version 4.2.0 (http://www.opendx.org) and the Gnu Image Manipulation Program (GIMP) version 1.2 (http://www.gimp.org).
Results and Discussion
Dihedral distortion patterns
According to the standard nomenclature of nucleic acids22 partially depicted in Figure 1A, the torsional degrees of freedom for each base position are described by the backbone torsions: α, β, γ, δ, ε, ζ, and χ and the set of sugar torsions collectively represented by the pseudorotational angle (P) and the pseudorotation amplitude. Taken together, these degrees of freedom can almost completely describe the internal geometry of a single DNA base position. Other descriptors, such as helicoidal parameters34, are necessary to characterize the geometric complexity of the DNA double helix created by multiple combinations of different internal geometries among individual base positions. These helicoidal parameters are not considered here because the present study is focused on understanding the detailed mechanics behind the localized nature of the base flipping distortion in the context of the rotatable bonds in the DNA backbone9. The only parameters analyzed are, therefore, the ones providing the best description for the restriction of the flipping distortion to the continuous DNA backbone stretch between the two base positions on either side of the flipping cytosine or guanine base shown in Figure 1A. Such a localization was also indicated in a survey of experimental flipped DNA structures bound to M.HhaI35.
Figure 1.

A. Depiction of nomenclature for backbone torsions, B. Percentage occupancy of 2D torsional space during restrained umbrella sampling base flipping NPT MD simulations with PME electostatics treatment. Two columns on left labeled ’Cytosine’ show sampling during cytosine base flipping, two columns on right labeled ’Guanine’ show sampling during guanine base flipping. ’Major’ and ’Minor’ refer to sampling through major and minor groove pathways, respectively. Labels on right indicate the torsion pairs plotted in that row, for example, 3′ α - 3′ β refers to the row of panels with α torsion on the X-axis and β torsion on the Y-axis for the base position 3′ to the flipping base. Color bar shows color gradation of percentage occupancy, all dihedral values are in degrees.
Single base flipping can occur through either the major groove or the minor groove of the DNA double helix9. The behavior of the backbone dihedrals analyzed separately for these two pathways is shown in Figure 1B. The 5′α, 5′β, 5′γ, 3′ε, and 3′ζ torsions, as visualized along the appropriate axis of the 2D histograms, sample primarily B-form conformations25, supporting the observation that the two bordering sugar moieties limit the range of the distortion of the DNA backbone during base flipping9.
The majority of the remaining rotable bonds between the sugar moieties surrounding the flipping base sample a wide range that of conformations. The exceptions are the γ-δ and 3′γ-3′δ pairs, show only marginal flexibility. The changes that do occur in the δ torsion, with values shifting from 130° to 80°, are due to changes in sugar pucker from C2′-endo to C3′-endo forms36. Sampling of C3′-endo sugar pucker occurs in both groove pathways for both flipping bases; however, the most substantial sampling for the C3′-endo sugar pucker is seen during guanine flipping through the minor groove for the 5′-sugar moiety, and cytosine flipping through the major groove for the sugar moiety attached to the flipping cytosine itself.
The dihedrals that show the greatest distortion during flipping are those closely associated with the two phosphodiester groups surrounding the flipping base. This class includes the 5′ε, 5′ζ, α, β, ε, ζ, 3′α, and 3′β torsions. Each of these torsions show different sampling behavior for the major versus the minor groove pathways. However, the specific patterns observed during cytosine flipping also occur during guanine flipping in both minor and major groove pathways. Comparing individual panels from the first and third columns, and the second and fourth columns, for the 5′ε-5′ζ, α - β, ε - ζ, and 3′α-3′β torsion pairs in Figure 1B illustrates these similarities. This pattern occurs despite the differences in both sequence context and atom selections used in the pseudodihedral restraint for cytosine versus guanine flipping, suggesting that the sampling patterns in backbone torsions during single base flipping are intrinsically conserved.
There is another interesting pattern of similarities between sampling for torsions on one side (5′ or 3′) of the base flipping through the major groove pathway and the corresponding torsions on the opposite side of the base flipping through the minor groove pathway. This can be visualized by comparing the following panels for both cytosine and guanine base flipping in Figure 1B: the major groove 5′ε-5′ζ with the minor groove ε - ζ, the major groove ε - ζ with the minor groove 5′ε-5′ζ, the major groove α - β with the minor groove 3′α-3′β, and the major groove 3′α-3′β with the minor groove α - β . For these pairs, it is evident that rotameric states sampled on the 5′-side of the flipping base in the major groove pathway are sampled on the 3′-side of the flipping base in the minor groove pathway, and vice versa. Separate analysis of the minor groove and major groove pathways can thus identify the rotameric patterns sampled by the phosphodiester backbone on opposite (5′ vs. 3′) sides of a flipping base while moving through the two helical grooves. Altered sampling of a set of torsions on one side (5′ or 3′) of the flipping base is balanced by compensating alterations in sampling of a similar set of torsions on the opposite side. Moreover, the alterations are specific with respect to the side (i.e. 5′ or 3′) on which the torsions are located and the groove through which the base flips. For example, a greater range of the 3′-α torsion is sampled during base flipping through the major groove while a greater range of the α torsion is sampled during base flipping through the minor groove.
Torsions on opposite (5′ vs. 3′) sides of flipping base
The previous section indicated that certain torsions show different behavior depending on the helical groove through which base flipping occurs. This pattern of behavior repeats itself on the opposite (5′ vs. 3′) sides of the flipping base, during flipping through opposite DNA grooves. To further probe this pattern, 2D percentage occupancy histograms of identical torsion pairs, separated by one nucleotide unit (e.g. α and 3′α), are shown in Figure 2. In addition, the cytosine and guanine base flipping results from the PME periodic boundary simulations are also compared to cytosine base flipping carried out under stochastic boundary (SB) conditions in aqueous solution.
Figure 2.

Percentage occupancy of 2D torsional space during restrained umbrella sampling MD simulations of cytosine and guanine base flipping shown as follows: two columns at left labeled ’Cytosine’ and two columns in center labeled ’Guanine’ show sampling for cytosine and guanine base flipping, respectively, both carried out with PME electrostatics and periodic boundary conditions; two columns on right labeled ’SB’ (for stochastic boundary) show sampling for cytosine base flipping carried out with SB conditions, ’Major’ and ’Minor’ labels refer to sampling through the major and minor groove pathways, respectively. Labels on right indicate torsions plotted in each row, for example, γ 3α refers to the row of panels with α torsion for the flipping base position on the X-axis and α torsion for the 3′ base position on the Y-axis. Color bar shows color gradation of percentage occupancy. All dihedral values are in degrees.
A systematic comparison of the SB and the PME periodic boundary MD simulations for cytosine base flipping is beyond the scope of this study. In terms of relative computational cost, the SB approach allows the simulation to be run about 3 times faster than the PME periodic boundary approach for the isolated DNA system. The SB approach37 is necessarily more approximate since long-range electrostatic effects are ignored through atom-based truncation and Langevin dynamics determines the behavior of the solvent at the boundary of the enclosed spherical system. The PME approach with periodic boundary conditions takes into account long-range electrostatic effects and allows diffusion of solvent molecules in and out of the primary cell, but imposes an artificial periodicity in its behavior29. The choice of SB conditions in a previous study16 was a tradeoff between accuracy and computational cost that was considered adequate because of the similarities between the base flipping free energy profiles for isolated DNA in aqueous solution obtained using both approaches. In the present study, the additional structural analysis criteria considered for comparison between the SB and PME periodic boundary approaches are the detailed conformational behavior of the local torsions during base flipping.
Analysis of Figure 2 again shows the γ and δ torsions to be relatively rigid, mostly occupying either one or two specific conformations. The α, β, ε and ζ torsions, on the other hand, occupy many different conformational states, consistent with the more flexible nature of the phosphodiester moiety38, with the major and minor groove pathways showing distinctly different behavior for each torsion pair (See Supporting Information for a more detailed analysis of pairwise sampling amongst these phosphodiester torsions on opposite (5′ vs. 3′) sides of the flipping base).
The most noticeable patterns are the greater range of sampling for (a) the 3′α torsion in the major groove pathway and the α torsion in the minor groove pathway, and (b) the ζ torsion in the major groove pathway and the 5′ζ torsion in the minor groove pathway, for both cytosine and guanine base flipping in the periodic boundary system MD simulations. The torsional sampling for the more approximate SB simulations of cytosine base flipping is spread out over a larger range for the flexible torsions. Nevertheless, a similar pattern of a wider range of sampling of the 3′α torsion during major groove flipping, and a wider range of sampling of the α torsion during minor groove flipping, can be discerned. As stated in the introduction, Olson and coworkers anticipated anticorrelated behavior in specific backbone torsion pairs on opposite (5′ vs. 3′) sides of DNA bulges21. From Figure 2, the essential insight is that flipping does not seem to occur due to correlated (or anticorrelated) transitions in specific torsion pairs on opposite (5′ vs. 3′) sides of the flipping base, but rather due to adjustments in specific combinations of phosphodiester torsions on opposite (5′ vs. 3′) sides of the flipping base in patterns that compensate for energetic costs associated with the individual dihedral distortions. The distortions seen here are consistent with the distortions observed in the survey of flipped base structures (involving 5′-ε, γ, ζ, and 3′-β torsions) with the same cognate sequence bound to M.HhaI35. These patterns of distortions appear to allow the DNA duplex to maintain an approximately canonical conformation beyond the portion of the strand involved in the localized flipping transition itself.
Effect of environment
DNA in aqueous solution is in a homogeneous environment that imposes minimal asymmetry on its dynamic behavior. When bound to a DNA-binding protein that specifically facilitates base flipping, the stabilization of intermediate flipped states due to specific DNA-protein interactions could, in principle, change the conformational pathway utilized for base flipping entirely as compared to aqueous solution. In Figure 3, percentage occupancy 2D torsional space histograms are shown for SB base flipping MD simulations with: (a) DNA alone in aqueous solution (labeled ’Aqueous Solution’), (b) DNA bound to M.HhaI with the catalytic loop in the open conformation (labeled ’Binary, Open’), (c) DNA bound to M.HhaI with the catalytic loop closed around the DNA (labeled ’Binary, Closed’), and (d) DNA bound to M.HhaI with bound S-adenosyl-homocysteine cofactor and the catalytic loop also closed around the DNA (labeled ’Ternary, Closed’).
Figure 3.
Percentage occupancy of 2D torsional space during restrained umbrella sampling SB MD simulations of cytosine base flipping in four environments shown as follows: two columns on left labeled ’Aqueous Solution’ show sampling in aqueous solution, two columns in center, left labeled ’Binary, Open’ show sampling in the binary, open loop M.HhaI-DNA complex, two columns in center, right labeled ’Binary, Closed’ show sampling in the binary, closed loop M.HhaI-DNA complex, two columns on right labeled ‘Ternary, Closed’ show sampling in the ternary, closed loop M.HhaI-DNA-S-adenosyl- homocysteine complex; ’Major’ and ’Minor’ labels refer to sampling through major and minor groove pathways, respectively. Labels on right indicate torsion pairs plotted in that row, for example, 5′ α - 5′ β refers to the row of panels with α torsion on the X-axis and β torsion on the Y-axis for the base position 5′ to the flipping base. Color bar shows color gradation of percentage occupancy, all dihedral values are in degrees.
As mentioned before, there is a tendency for the SB MD simulation results to show sampling of a greater conformational range than those obtained using PME periodic boundary MD simulations. The clearest example is the much wider range of values sampled in the ε-ζ torsion pair seen by comparing the PME periodic boundary MD simulation results in Figure 1B with the SB results in Figure 3. This increased sampling in the SB simulations may be associated with the different strategy for obtaining starting conformations for the individual umbrella sampling windows, the use of atom truncation versus PME for the treatment of long-range electrostatic interactions, and/or edge effects associated with the SB approach16. Some of the trends, however, do remain consistent with those observed in the PME simulations, e.g., the α-β histograms in Figure 1 and Figure 3 show a tendency for smaller range of torsional sampling during major groove flipping as compared to minor groove flipping.
Comparison of the torsional space sampling during flipping in the presence of M.HhaI shows both similarities and differences as compared to flipping in aqueous solution. The sampling of the 5′α-5′β torsion pair is marginally increased in the binary and ternary complexes presumably due to interactions with the protein. The 5′γ-5′δ and 3′γ-3′δ torsion pairs maintain their relative rigidity for the most part, while the γ - δ torsion pair for the flipping base has increased sampling in the closed binary and ternary complexes, occupying alternate minimum energy conformations not seen in the aqueous solution MD simulations. The other torsional degrees of freedom between the two bordering sugar moieties sample a large conformational range, but maintain their distinct behavior in the major and minor groove pathways in all four environments to a significant extent. The major groove α - β and the minor groove 3′α-3′β torsional distributions provide good examples of this trend. Even with drastic changes in the environment of the DNA, the patterns of the local dihedral distortions associated with base flipping through specific grooves remain similar to those in aqueous solution for the SB MD simulations of cytosine base flipping. Although this could be attributed to the initial configuration of the DNA in all SB simulations being the same, the change in environment clearly does not result in large-scale conformational pathway readjustments.
Alternate metastable rotameric states
The metastable states that are involved in the observed patterns of backbone dihedral distortions are indicated in Table 1 and Table 2. These states are identified by locating bins in the 2D torsional percentage occupancy histograms shown in Figure 1B and Figure 3 that show the greatest occupancy amongst any island cluster of bins with an occupancy greater than 1 %. This criteria eliminates regions of space that are sampled less than 1 % of the time in each 10×10°2 region of 2D torsional space. This selection process is expected to filter out transition state conformations, leaving only metastable intermediate states.
Table 1.
2D backbone torsional states around the underlined flipping cytosine in the sequence GTCAGCGCATGG, or the underlined flipping guanine in the paired opposite strand sequence CCATGCGCTGAC that have substantial occupancy in the MD simulations with periodic boundary systems and PME electrostatics
| Flipping Base | Position | Groove | (α, β) | (γ, δ) | (ε, ζ) |
|---|---|---|---|---|---|
| Cytosine | 5′ | Major | 300,170 A,B | 50,140 B | 190,270 BI 250,280 g-,g- |
| Minor | 300,170 A,B | 50,140 B | 190,250 BI 250,200 BII |
||
| Flipping | Major | 300,170 A,B 320,230 g-,t |
60,140 B 60, 80 A |
190,260 BI | |
| Minor | 300,170 A,B | 60,140B | 190,270 BI 250,290 g-,g- |
||
| 3′ | Major | 300,180 A,B | 60,140 B 180,140 t,t |
190,260 BI | |
| Minor | 300,230 g-,t 300,180 A,B |
60,140 B 60, 90A |
190,260 BI | ||
| Guanine | 5′ | Major | 300,170 A,B | 60,140 B | 190,270 BI 250,290 g-,g- |
| Minor | 290,170 A,B | 60, 90 A 50,140 B |
180,270 BI 250,220 g-,t |
||
| Flipping | Major | 300,170 A,B | 60,140 B 60, 90 A |
190,260 BI 250, 60g-,g+ |
|
| Minor | 300,180 A,B 190,150 t,t |
60,140 B | 190,270 BI 250,300 g-,g- |
||
| 3′ | Major | 300,170 A,B | 60,140B 60, 80 A |
190,260 BI | |
| Minor | 300,180 A,B 300,230 g-, t |
60,140 B | 190,260 BI | ||
All values are in degrees, estimated in 10° intervals. Note that individual torsion pair states on the same row but in different columns are not implied to occur simultaneously. Subscripts A and B refer to conformations occurring in the A- or B-forms of DNA, respectively, subscripts BI and BII refer to BI and BII conformations of the ε-ζ torsion pair respectively. Subscript labels g+, t, and g- are assigned to values in the ranges 0° through 120°, 120° through 240°, and 240° through 360°, respectively.
Table 2.
2D backbone torsional states around the underlined flipping cytosine in the sequence GTCAGCGCATGG showing substantial occupancy in SB MD simulations in different environments
| Aqueous Solution | Binary, Open | ||||||
|---|---|---|---|---|---|---|---|
| Position | Groove | (α, β) | (γ, δ) | (ε, ζ) | (α, β) | (γ, δ) | (ε, ζ) |
| 5′ | Major | 300,180 A,B | 50,140 B 50, 80 A |
190,260 BI 40,160 g+,t 60, 80 g+,g+ |
300,170 A,B | 60,140 B 60, 80 A |
190,260 BI 40,170 g+,t 60, 80 g+,g+ |
| Minor | 300,170 A,B | 50,140 B 50, 80 A |
190,260 BI 260,200 BII |
300,170 A,B | 60,140 B 60, 90 A |
180,260 BI 260,200 BII |
|
| Flipping | Major | 300,170 A,B 220,240 t,t |
60,140 B 60, 90 A |
190,270 t,g- | 300,180 A,B 220,180 t,t 140,270 t,g- |
60,140 B 60, 90 A 270,110 g-,t |
190,260 BI 190, 50 t,g+ 190,110 t,g+ |
| Minor | 300,170 A,B | 60, 80 A 60,140 B |
190,270 BI 40,160 g+,t 70, 80 g+,g+ |
300,170 A,B | 50, 80 A 60,140 B |
190,260 BI 60, 80 g+,g+ 100,190 g+,t 180,130 t,t 180, 50 t,g+ |
|
| 3′ | Major | 300,170 A,B | 60,140 B | 190,270 BI | 300,170 A,B | 60,140 B | 190,260 BI |
| Minor | 300,180 A,B 230,180 t,t |
60,140 B | 190,260 BI | 300,190 A,B 220,190 t,t |
60,140 B 300,140 g-,t |
190,260 BI | |
| Binary, Closed | Ternary, Closed | ||||||
|---|---|---|---|---|---|---|---|
| Position | Groove | (α, β) | (γ, δ) | (ε, ζ) | (α, β) | (γ, δ) | (ε, ζ) |
| 5′ | Major | 290,170 A,B | 60, 80 A 50,140 B |
40,160 g+,t 200,270 BI |
290,170 A,B | 50, 90 A 60,140 B |
40,170 g+,t 200,270 BI |
| Minor | 290,180 A,B | 60,140 B 60, 90 A |
190,260 BI 260,220 g-,t |
290,180 A,B | 60,140 B 60, 90 A |
190,260 BII 260,220 g-,t |
|
| Flipping | Major | 270,170 A,B | 60,140 B 180, 90 t,g+ 180,150 t,t |
180,260 BI 180,110 t,g+ 260,170 BII 50,170 g-,t |
280,170 A,B | 60,140 B 180, 90 t,g+ 180,140 t,t 60, 90 A |
180,120 t,g+ 40,170 g+,t |
| Minor | 300,160 A,B 40,160 g+,t |
60, 90 A 60,130 B |
190,270 BI 180, 40 t,g+ 80,190 g+,t |
300,160 A,B 50,160 g+,t 160,160 t,t |
70, 70 A 50,130 B |
190,270 BI 180, 40 t,g+ 180,270 BI |
|
| 3′ | Major | 320,270 g-,g- 300,200 A,B |
60,140 B 280,150 g-,t 160,100 t,g+ 60, 90 A |
190,270 BI | 280,210 g-,t 180,190 t,t |
60,140 B 280,140 g-,t 60,100 A |
190,270 BI |
| Minor | 300,170 A,B 260,240 g-,t |
60,140 B 180,140 |
190,270 BI | 300,170 A,B 250,250 g-,g- |
60,140 B | 190,270 BI | |
All values are in degrees, estimated in 10° intervals. Note that individual torsion pair states on the same row but in different columns are not implied to occur simultaneously. Subscripts A and B refer to conformations occurring in the A- or B-forms of DNA, respectively, subscripts BI and BII refer to BI and BII conformations of the ε-ζ torsion pair respectively. Subscript labels g+, t, and g- are assigned to values in the ranges 0° through 120°, 120° through 240°, and 240° through 360°, respectively
Analysis of Table 1 and Table 2 shows that the usual local minima seen in A-form and B-form DNA25 are present for all torsion pairs in both pathways and in all environments. However, a variety of additional local minima not seen in canonical structures are also sampled. Specific examples are t,t values for the α - β and γ - δ torsion pairs, and g-,g- for the ε - ζ torsion pair. Specific alternate local conformations of torsions during base flipping identified here are consistent with alternate local energy minima in potential energy surfaces calculated using quantum mechanical (QM) calculations of the same torsions in representative model compounds39;40. The present analysis along with the QM data indicate that these alternate conformations may be useful in mediating important distortions in DNA, even though they may not be substantially represented in isolated DNA crystal structure distributions39.
Conclusions
Structural analysis of umbrella sampling MD simulations of base flipping indicate that the localized distortion of DNA during base flipping involves changes in multiple torsions in the vicinity of the flipping base. The perturbed torsions lie on both the 5′ and 3′ sides of the flipping base and are mostly associated with the phosphodiester moiety. The ability to manipulate the conformational behavior of the phosphodiester moiety through specific charged amino acid sidechain interactions may be critical to achieving these distortions in the presence of nucleic acid-binding proteins. Specific torsion combinations define the intermediate configurations that are sampled during flipping through minor or major groove pathways. The work of Olson and coworkers21 predicted these torsion combinations to be composed of one specific torsion on the 5′-side of a DNA bulge and another specific torsion on the 3′-side of a DNA bulge. The present results are consistent with that general concept, however, the torsion combinations involved in base flipping do not show the anticipated highly correlated pairwise changes. The torsional changes involved instead are complex combinations of canonical and alternate minima in specific individual backbone torsion degrees of freedom on opposite (5′ vs. 3′) sides of the flipping base.
The pseudodihedral restraint used to induce base flipping, the simulation methodology, sequence effects, and environmental effects, all show discernable but limited effects on the dihedral distortion details. The effect of DNA sequence or flipping base identity on the pattern of dihedral distortions is not systematically addressed in this work. The flipping of a guanine surrounded by a different sequence context in aqueous solution does, however, show similar dihedral distortions as those occurring during cytosine flipping in aqueous solution. This suggests that sequence may influence the energetics of the flipping process through intramolecular and intermolecular non-bonded interactions9 but may not substantially affect the structural backbone dihedral distortion pathways used in base flipping. The predicted local backbone transitions, though identified for base flipping only, may also be applicable to other distortions of DNA. If base flipping events occur in close conjunction repeatedly over neighboring base pairs, they can ultimately lead to DNA unwinding41;42. The present observations are therefore potentially useful in probing large-scale DNA distortions involved in the crucial biological functions of replication and transcription.
The identification of conserved distortions in DNA can advance the understanding of the biological effects of introduction of chemical modifications in the nucleic acid backbone. These modifications, utilized in the design of antisense oligonucleotides, can affect backbone torsions involved in base flipping without changing their sequence-dependent behavior35;43. The ability or inability of backbone-modified DNA analogs to attain intermediate rotameric states required in DNA distortion pathways can be tested with ab initio calculations on model compounds and MD simulation studies. Knowledge of common metastable substates sampled during DNA distortions can thus be linked to the impact of chemical modifications on the dynamic behavior of modified nucleic acid analogs. The connection between specific chemical modifications and the resulting change in dynamic behavior can be further exploited to modulate interaction of the modified analogs with nucleic acid binding proteins.
Supplementary Material
Six tables showing linear correlation coefficients for all pairs of torsions analyzed, one table showing linear correlation coe cients between δ and pseudorotation angle that illustrates what the value of the correlation coe cient should be for two torsions that are truly correlated. Five figures showing the configurations visited in multiple torsion pairs on opposite (5′ vs. 3′) sides of the flipping base in di erent MD simulation systems and environments, a text description of the observations in these five figures, and one figure illustrating the five di erent MD simulation systems used in the calculations. This material is available free of charge via the Internet at http://pubs.acs.org.
Acknowledgements
We would like to thank Professor Wilma Olson for helpful discussions and sending us a reprint of her early study on torsional patterns in DNA with bulges. The work was supported by NIH grant GM51505 and the computational resources were provided by National Center for Supercomputing Alliance, Pittsburgh Supercomputing Center, and Department of Defense.
References
- [1].Klimasauskas S, Kumar S, Roberts RJ, Cheng X. Cell. 1994;76:357–369. doi: 10.1016/0092-8674(94)90342-5. [DOI] [PubMed] [Google Scholar]
- [2].Mol CD, Hosfield DJ, Tainer JA. Mutat. Res. 2000;460:211–229. doi: 10.1016/s0921-8777(00)00028-8. [DOI] [PubMed] [Google Scholar]
- [3].Roberts RJ. Cell. 1995;82:9–12. doi: 10.1016/0092-8674(95)90046-2. [DOI] [PubMed] [Google Scholar]
- [4].Roberts RJ, Cheng XD. Ann. Rev. Biochem. 1998;67:181–198. doi: 10.1146/annurev.biochem.67.1.181. [DOI] [PubMed] [Google Scholar]
- [5].Stivers JT. Prog Nucleic Acid Res Mol Biol. 2004;77:37–65. doi: 10.1016/S0079-6603(04)77002-6. [DOI] [PubMed] [Google Scholar]
- [6].van Aalten DMF, Erlanson DA, Verdine GL, Joshua-Tor L. Proc. Natl. Acad. Sci. USA. 1999;96:11809–11814. doi: 10.1073/pnas.96.21.11809. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [7].Verdine GL, Norman DPG. Ann. Rev. Biochem. 2003;72:337–366. doi: 10.1146/annurev.biochem.72.121801.161447. [DOI] [PubMed] [Google Scholar]
- [8].Horton JR, Ratner G, Banavali NK, Huang N, Choi Y, Maier MA, Marquez VE, MacKerell AD, Jr., Cheng XD. Nucleic Acids Res. 2004;32:3877–3886. doi: 10.1093/nar/gkh701. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [9].Banavali NK, MacKerell AD., Jr. J. Mol. Biol. 2002;319:141–160. doi: 10.1016/S0022-2836(02)00194-8. [DOI] [PubMed] [Google Scholar]
- [10].Varnai P, Lavery R. J. Am. Chem. Soc. 2002;124:7272–7273. doi: 10.1021/ja025980x. [DOI] [PubMed] [Google Scholar]
- [11].Priyakumar UD, MacKerell AD., Jr. Chemical Reviews. 2006;106:489–505. doi: 10.1021/cr040475z. [DOI] [PubMed] [Google Scholar]
- [12].Gueron M, Kochoyan M, Leroy JL. Nature. 1987;328:89–92. doi: 10.1038/328089a0. [DOI] [PubMed] [Google Scholar]
- [13].Varnai P, Canalia M, Leroy JL. J. Am. Chem. Soc. 2004;126:14659–14667. doi: 10.1021/ja0470721. [DOI] [PubMed] [Google Scholar]
- [14].Priyakumar UD, MacKerell AD., Jr. Journal of Chemical Theory and Computation. 2006;2:187–200. doi: 10.1021/ct0501957. [DOI] [PubMed] [Google Scholar]
- [15].Priyakumar UD, MacKerell AD., Jr. J. Am. Chem. Soc. 2006;128:678–679. doi: 10.1021/ja056445a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [16].Huang N, Banavali NK, MacKerell AD., Jr. Proc. Natl. Acad. Sci. 2003;100:68–73. doi: 10.1073/pnas.0135427100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [17].Huang N, MacKerell AD., Jr. J. Mol. Biol. 2005;345:265–274. doi: 10.1016/j.jmb.2004.10.042. [DOI] [PubMed] [Google Scholar]
- [18].Huang N, MacKerell AD., Jr. Philosophical Transactions of The Royal Society, London, A. 2004;362:1429–1460. [Google Scholar]
- [19].Cheng XD, Kumar S, Posfai J, Pflugrath JW, Roberts RJ. Cell. 1993;74:299–307. doi: 10.1016/0092-8674(93)90421-l. [DOI] [PubMed] [Google Scholar]
- [20].O’Gara M, Roberts RJ, Cheng X. J. Mol. Biol. 1996;263:597–606. doi: 10.1006/jmbi.1996.0601. [DOI] [PubMed] [Google Scholar]
- [21].Olson WK, Marky NL, Srinivasan AR, Do KD, Cicariello J. Molecular Basis of Cancer: Part A. Alan R. Liss, Inc; New York: 1985. [Google Scholar]
- [22].Saenger W. Principles of Nucleic acid Structure. Springer-Verlag; New York: 1984. [Google Scholar]
- [23].Brooks BR, Bruccoleri RE, Olafson BD, States DJ, Swaminathan S, Karplus M. J. Comput. Chem. 1983;4:187–217. [Google Scholar]
- [24].MacKerell A, Jr., et al. J. Phys. Chem. B. 1998;102:3586–3616. doi: 10.1021/jp973084f. [DOI] [PubMed] [Google Scholar]
- [25].Foloppe N, MacKerell AD., Jr. J. Comput. Chem. 2000;21:86–104. [Google Scholar]
- [26].MacKerell AD, Jr., Banavali NK. J. Comput. Chem. 2000;21:105–120. [Google Scholar]
- [27].Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. J. Chem. Phys. 1983;79:926–935. [Google Scholar]
- [28].Beglov D, Roux B. J. Chem. Phys. 1994;100:9050–9063. [Google Scholar]
- [29].Darden T, York D, Pedersen L. J. Chem. Phys. 1993;98:10089–10092. [Google Scholar]
- [30].Feller SE, Zhang YH, Pastor RW, Brooks BR. J. Chem. Phys. 1995;103:4613–4621. [Google Scholar]
- [31].Field MJ, Karplus M. CRYSTAL: program for crystal calculations in CHARMM. Harvard University; Cambridge, MA: 1992. [Google Scholar]
- [32].Ryckaert JP, Ciccotti G, Berendsen HJC. J. Comp. Phys. 1977;23:327–341. [Google Scholar]
- [33].Dickerson RE. Nucleic Acids Res. 1998;26:1906–1926. doi: 10.1093/nar/26.8.1906. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [34].Lu XJ, Olson WK. Nucleic Acids Res. 2003;31:5108–5121. doi: 10.1093/nar/gkg680. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [35].Wang PY, Brank AS, Banavali NK, Nicklaus MC, Marquez VE, Christman JK, MacKerell AD., Jr. J. Am. Chem. Soc. 2000;122:12422–12434. [Google Scholar]
- [36].Lu XJ, Shakked Z, Olson WK. J. Mol. Biol. 2000;300:819–840. doi: 10.1006/jmbi.2000.3690. [DOI] [PubMed] [Google Scholar]
- [37].Brooks CL, III, Karplus M. J. Mol. Biol. 1989;208:159–181. doi: 10.1016/0022-2836(89)90093-4. [DOI] [PubMed] [Google Scholar]
- [38].Banavali NK, MacKerell AD., Jr. J. Am. Chem. Soc. 2001;123:6747–6755. doi: 10.1021/ja010295w. [DOI] [PubMed] [Google Scholar]
- [39].Foloppe N, MacKerell AD., Jr. J. Phys. Chem. 2000;102:6669–6678. [Google Scholar]
- [40].Foloppe N, MacKerell AD., Jr. Biophys. J. 1999;76:3206–3218. doi: 10.1016/S0006-3495(99)77472-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [41].Patel PH, Suzuki M, Adman E, Shinkai A, Loeb LA. J. Mol. Biol. 2001;308:823–837. doi: 10.1006/jmbi.2001.4619. [DOI] [PubMed] [Google Scholar]
- [42].Schneider TD. Nucleic Acids Res. 2001;29:4881–4891. doi: 10.1093/nar/29.23.4881. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [43].Banavali NK, MacKerell AD., Jr. Nucleic Acids Res. 2001;29:3219–3230. doi: 10.1093/nar/29.15.3219. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Six tables showing linear correlation coefficients for all pairs of torsions analyzed, one table showing linear correlation coe cients between δ and pseudorotation angle that illustrates what the value of the correlation coe cient should be for two torsions that are truly correlated. Five figures showing the configurations visited in multiple torsion pairs on opposite (5′ vs. 3′) sides of the flipping base in di erent MD simulation systems and environments, a text description of the observations in these five figures, and one figure illustrating the five di erent MD simulation systems used in the calculations. This material is available free of charge via the Internet at http://pubs.acs.org.

