Abstract
The Varkud Satellite RNA contains a self-cleaving ribozyme that has been shown to function independently of its surroundings. This 160 nucleotide ribozyme adopts a catalytically active tertiary structure that includes a kissing hairpin complex formed by stem-loop I and stem-loop V (SLV). The five-nucleotide 5′-rUGACU loop of the isolated SLV has been shown to adopt a Mg2+-dependent U-turn structure by solution NMR. This U-turn hairpin is examined here by molecular dynamics simulations in the presence of monovalent and divalent ions. Simulations confirm on an all-atom level the hypotheses for the role of the Mg2+ ions in stabilizing the loop, as well as the role of the solvent exposed U700 base. Additionally, these simulations suggest the Mg2+-free stem-loop adopts a wide range of structures, including energetically favorable structures similar to the Mg2+-bound loop structure. We propose this structure is a “gatekeeper” or precursor to Mg2+ binding when those ions are present.
Introduction
The Varkud Satellite (VS) ribozyme is a catalytic RNA which performs cleavage and ligation of a phosphodiester bond, and it requires Mg2+ for tertiary structure formation and catalysis.1−3 Though there is no available structure of the complete ribozyme, several of the four hairpin loops and internal stems and junctions have been characterized by NMR.4−7 Stem-loop V (SLV) undergoes a Mg2+-dependent conformational change which allows it to orient the substrate stem-loop I (SLI) via Watson–Crick base pairing in a kissing-loop interaction.3,8
The structure and dynamics of SLV were examined by NMR revealing a Mg2+-dependent conformational switch of the loop conformation (shown in Figure 1a,b).4,5 The Mg2+-dependence is particularly interesting since it illustrates a role for divalent ions in the association of SLI with SLV.1,2 SLV is a six base pair stem capped by a five-membered loop with the sequence 5′-U696G697A698C699U700-3′ and it forms a canonical U-turn motif in the presence of Mg2+. The structural features of a U-turn motif include a UNR sequence (where N is any nucleotide and R is a purine); a Y:A, Y:Y, or G:A loop-closing base pair (Y is pyrimidine); a sharp turn of the backbone α torsion angle 3′ to the U; a hydrogen bond from U 2′ OH to R N7 (Figure 1c,d); a hydrogen bond from U NH3 to the phosphate oxygen 3′ to R (Figure 1e,f); and stacking of the U with the R 5′ phosphate group (Figure 1e,f). These canonical features are illustrated in molecular graphics images of the experimental structures in the presence and absence of Mg2+ in Figure 1 panels c, d, e, and f, respectively. In a monovalent ion environment (-Mg2+ or MgFree), SLV is described as a noncanonical or “loose” U-turn motif, and these characteristic intermolecular interactions range from weak to nonexistent.4 In a divalent ion environment these interactions are able to form and stabilize a more compact loop.5 It has been proposed that Mg2+ stabilizes the U-turn of SLV and neutralizes the electronegative charges in the loop, making it easier for the substrate SLI to bind.5
The experimental evidence outside of NMR includes kinetic and thermodynamic characterization of SLI and SLV. A focus on the solvent exposed U700 base showed that mutation to another base maintained the enzymatic rate of the wild type U, but deleting the base reduces kcat/KM.9 Similarly, mutating U700 to C showed no change in binding affinity for SLI, but deletion of U700 reduced the binding affinity.8 This led to the hypothesis that U700 contributes to the enthalpic stability of SLV by donating a phosphate group that coordinates a Mg2+ ion, and that the solvent exposed nature of the base would contribute to increased entropy in the monovalent (unbound) environment.
SLV is a rich source of information about Mg2+-dependent structural changes in small RNA molecules, and so is of particular interest for molecular dynamics (MD) simulations. Nucleic acid force fields lag behind those for proteins, in part due to the high charge density and high flexibility of the nucleic acid backbone.10 In RNA—which adopts many types of structures beyond a simple duplex—capturing the dynamic aspects, coupled with the long time scales over which these conformational changes can occur, is particularly challenging.11−13 However, recent modifications, some specific to RNA, have brought force fields into better alignment with experimental data.14−16
Here we use MD simulations to provide atomic-level detail into the dynamics of ion association with SLV. We show computational evidence supporting the current hypotheses for the role of Mg2+ ions in stabilizing SLV, as well as the role of the U700 base. Additionally, we show that the dynamics of the unbound SLV (in a monovalent ion environment) adopt U-turn characteristics similar to the canonical U-turn characteristics of the Mg2+-bound conformation, though admittedly not identical to the NMR structure. We propose the SLV RNA in the presence of monovalent ions samples a dynamic and wide range of structures, including a “gatekeeper” structure which is a precursor to Mg2+ binding, adopting structural features which are more similar to the bound loop than to the excess of unbound conformations.
Experimental Methods
A flowchart of the simulation protocol is provided in the Supporting Information as Supporting Figure 1. A summary of simulations is provided in Table 1. Starting structures for simulations were taken from the first model of the NMR ensembles of SLV: 1TBK (MgFree) and 1YN2 (MgBound).4,5 Structures were solvated with TIP3P water in an octahedral box.17 The RNA, without the ions that were present in the pdb file, was placed with a buffer of 12 Å of water, resulting in the addition of ∼5800 water molecules. For the MgBound systems eight Mg2+ ions were added back to neutralize the system charge, and five additional MgCl2 molecules were added to generate an ∼40 mM excess Mg2+ concentration. For the MgFree systems 16 Na+ ions were added to neutralize the system charge and six additional NaCl molecules were added to generate an ∼50 mM excess Na+ concentration. All ion initial positions were randomized with placement at least 6 Å from the RNA and at least 4 Å from each other by swapping positions with random water molecules. Joung–Cheatham parameters were used for monovalent ions and Allner et al. parameters were used for Mg2+.18,19
Table 1. Summary of MD Simulations Performed in This Work.
starting RNA structure | ions added | nomenclature | figure |
---|---|---|---|
Rerefinement | |||
1TBK models 2–11 | 20 Na+, 4 Cl– | rerefined MgFree ensemble | Figure 2b |
Long MD Simulations −10 × 250 ns Simulations for Each System | |||
1TBK model 1 | 20 Na+, 4 Cl– | n/a | Supporting Figure 2 |
rerefined 1TBK, all 10 models | 20 Na+, 4 Cl– | MgFree + 50 mM NaCl | Figure 2b; Figure 3c,g; Figure 4, blue; Figure 6, red |
1YN2 model 1 | 20 Na+, 4 Cl– | MgBound + 50 mM NaCl | Figure 3a,e |
1YN2 model 1 | 12 Mg2+, 4 Cl– | MgBound + 40 mM MgCl2 | Figure 3b,f; Figure 4, black and red; Figure 6, black |
rerefined 1TBK, all 10 models | 12 Mg2+, 4 Cl– | MgFree + 40 mM MgCl2 | Figure 3d,h |
Replica Exchange Simulation: 48 replicas × 1.2 μs per Replica | |||
rerefined 1TBK, model 1 | 20 K+, 4 Cl– | MgFree + 50 mM KCl | Figure 6, blue |
Simulations were carried out using the Amber12 and developmental versions of the Amber14 suite of programs20,21 using the GPU (CUDA) version of PMEMD with SPFP (a mixed single/fixed precision model on the GPU).22 The ff12 force field for nucleic acid simulations was used, combining ff99 + parmbsc0 modifications + chiOL3 modifications for RNA.14,23−25 Periodic boundary conditions were used in all simulations, and the particle mesh Ewald method, with default parameters, was used to handle electrostatic interactions with a 9 Å cutoff for direct interactions.26
As presented in the PDB, the original NMR structures of SLV ± Mg2+ were described by an ensemble of 10 structures consistent with the NMR constraints. Our initial simulations used model 1 of each ensemble as a starting structure, and to evaluate experimental variability, 10 simulations were produced from that model 1, with different initial velocities (via randomized random number generator seeds using the ig = −1 option in AMBER) as well as different randomized initial ion positions.27 Minimization and equilibration procedures were the same as previously used.28 In brief, the initial minimization of coordinates and heating to 300 K were followed by successive rounds of minimization and equilibration performed with decreasing positional restraints (from 5 kcal mol–1 Å–2) on the RNA. The system was determined to be equilibrated when the potential energy, density, and pressure of the system stabilized.
Unrestrained production MD simulation runs of 250 ns were begun from these equilibrated systems, and the properties of the RNA and ions were measured from this portion of the trajectory. Production runs were constant volume and temperature; temperature was set to 300 K and regulated using the Langevin algorithm with a collision frequency of 5 ps–1.29 Bonds to hydrogen were constrained using the SHAKE algorithm, allowing use of a 2 fs time step.30
The instability of the SLV RNA in 50 mM NaCl simulations described above led us to rerefine the 1TBK NMR structure in explicit solvent using the published NMR restraint data. Rerefinement simulations were carried out in 50 mM NaCl, with equilibration as described above for models 2 through 11 of the original ensemble (10 structures total). In addition to positional restraints, NMR experimental NOE distance restraints were applied to the RNA. Reported NMR restraints in DYANA format were translated into Amber format using the makeDIST_RST program, totaling 553 restraints; 527 distance restraints with a force constant of 20 kcal mol–1 Å–2 and 26 torsion angle restraints with a force constant of 500 kcal mol–1 rad–1 were used throughout equilibration and rerefinement unless otherwise noted. Equilibration was performed as previously described. Restrained simulations were run for 20 ns at constant temperature and pressure. Temperature was regulated using a Langevin thermostat and a 1 ps–1 collision frequency.29 The pressure coupling time constant was set to 5 ps. The position of the center-of-mass of the molecule was reset to zero every 10 ps.
To analyze the ensemble of 10 SLV structures in the restrained rerefined simulations, the CPPTRAJ analysis program from AmberTools was used to cluster the structures with a hierarchical agglomerative algorithm on all heavy atoms using an epsilon value of 1.0 Å with a sieve of 5 frames (Supporting Script 1).31,32 The representative structure from the largest cluster in each of the 10 trajectories was minimized using 1000 steps of steepest descent followed by 1000 steps of conjugate gradient, with restraints intact to allow structures to relax with respect to the NMR restraints. Each minimized structure was subsequently subjected to heating and annealing over 4 ns, followed by 6 ns restrained NPT dynamics (Supporting Script 2). During this step coupling to the heat bath was lowered to 0.5 ps. Temperature was increased linearly from 300 to 700 K over 1 ns, held constant at 700 K for 1 ns, and then reduced linearly to 300 K over 2 ns. The NMR restraints were lowered to 80% of their strength for the first 1.5 ns, and then linearly increased to 100% over the next 500 ps, and remained at 100% strength for the duration of the simulation. Average structures were generated from the last 6 ns of the trajectory.
The new average structures were used to start a second series of 250 ns unrestrained production runs. Production MD simulation runs were performed for every model in the new ensemble, with constant pressure and volume. Temperature was set to 300 K and regulated using the Langevin algorithm with a collision frequency of 5 ps –1.29 Bonds to hydrogen were constrained using the SHAKE algorithm, allowing use of a 2 fs time step.30
We did not rerefine the NMR structures where Mg2+ was present, since model 1 from 1YN2 was stable in the 10 initial 250 ns simulations. To separate the representative structures present in the bimodal distribution of Mg2+-bound SLV RNA, kmeans clustering was used to group structures into two clusters from all snapshots with an RMSD to the MgBound reference below 2.5 Å (Supporting Script 5). Clustering was performed on the heavy atoms of residues U695 to A701, including the loop and its closing base pair (Supporting Script 6).
Temperature replica exchange MD simulations were performed to enhance the sampling space accessible to the MgFree RNA in the presence of monovalent ions. Model 1 of the MgFree NMR ensemble 1TBK was solvated with TIP3P water in an octahedral box.17 The RNA was placed with a buffer of 9 Å of water, resulting in the addition of 3953 water molecules. Sixteen K+ ions were added to neutralize the system charge and six additional KCl molecules were added to generate an ∼50 mM excess K+ concentration. All initial ion positions were randomized with placement at least 6 Å from the RNA and at least 4 Å from each other by swapping positions with random water molecules. Joung–Cheatham parameters were used for monovalent ions.18
Simulations were carried out using the Amber12 and developmental versions of the Amber14 suite of programs20,21 using the GPU (CUDA) version of PMEMD with SPFP (a mixed single/fixed precision model on the GPU).22 The ff12 force field for nucleic acid simulations was used, which combines ff99 + parmbsc0 modifications + chiOL3 modifications for RNA.14,23−25 Simulations were run with Watson–Crick restraints on the stem (residues 690–694 and 702–706), enhancing sampling of the five loop residues and closing base pair only (residues 695–701), and allowing a smaller solvent box. The restraints were enforced during minimization and equilibration using a force constant of 20 kcal mol–1 Å–2. Minimization and equilibration were performed as previously described.28
For the T-REMD simulations, 48 temperatures spanning 277 to 400 K were carried out in the NVT ensemble using the pmemd.cuda.MPI module of AMBER 14.33 The Langevin thermostat was used to regulate temperature with a collision frequency of 5 ps–1.29 The particle mesh Ewald method, with default parameters, was used to handle electrostatic interactions with a 9 Å cutoff for direct interactions.26 The “ig = −1” option was set to generate random seeds for each restart, avoiding synchronization effects.27 Exchanges were attempted every 1 ps. Repartitioning of the solute hydrogen masses to 3.07 au was performed by decreasing the mass on atoms to which the H is bonded by a respective amount, allowing a 4 fs time step.34 SHAKE was used to constrain bonds to hydrogens.30 Each replica was run for ∼1.2 μs.
Results were plotted using Grace or Microsoft Excel. The sorting of replica trajectories and all analysis was performed using CPPTRAJ and visualized in VMD. Grid density analysis was performed for Mg2+ ions, and the histogram of the ions was calculated with a grid spacing of 0.5 Å and normalized by ion density (Supporting Script 3). Hydrogen bond analysis was performed setting the cations as solvent acceptors and the entire RNA as solvent donors, using a distance cutoff of 4.5 Å (Supporting Script 4).
Results
Simulations of MgFree SLV Deviate from the NMR Structure
The NMR structure of SLV in 50 mM NaCl found that the canonical U-turn characteristics were absent, although stacking of G697A698C699 was observed. Ten 250 ns length-independent MD simulations were carried out in 50 mM NaCl, starting from model 1 of the NMR ensemble (1TBK), each with a different random starting velocity as well as different randomized ion positions. The all-residue RMSD for each simulation was calculated with reference to the model 1 starting structure; Supporting Figure 2 shows that the heavy atom RMSD can be attributed mainly to flexibility seen in the loop region, since the stem RMSD remains consistently around 2.5 Å for the duration of the simulations. The base stacking of the G-A-C triplet was disrupted, and the G697 alpha torsion angle of 116° was not stable (Supporting Figure 3). To test the influence of that torsion angle on the loop structure, it was restrained in a separate 100 ns length MD simulation that started from model 1. The resulting trajectory did not show an improvement in the stability of the loop structure, however, and its RMSD from the starting structure was not reduced suggesting that deviation from the extreme alpha angle of 116° was not the root cause of instability of the loop structure (Supporting Figure 4).
Overall, the resulting MD simulation structures were more compact with respect to the much extended initial NMR ensemble. Supporting Figure 5 shows decreased major groove distances across the loop and stem compared to the same distances in the NMR ensemble. We hypothesize this is due to the inclusion of an electrostatic component in the force field that was absent during the original refinement. To better understand the structure, we rerefined the original MgFree (1TBK) structure using the NMR data in the presence of explicit solvent, counterions, the excess ion environment, and with a proper treatment of the electrostatics.
Rerefinement Results in Two Distinct Conformations Which Equally Satisfy NMR Restraints and Have Properties of the MgBound NMR Ensemble
The NMR structure was originally refined in vacuo with XPLOR using a simplified version of the CHARMM force field that omitted the attractive van der Waals and the proper electrostatic contributions.4 We have previously seen that using explicit water and ions, together with more complete force field parameters (that is, with electrostatic and attractive van der Waals force field components included in addition to explicit solvent and counterions) can improve refinement of structures from NMR data.28,35 Each of the 10 structures in the MgFree NMR ensemble was rerefined as described in the Experimental Methods. From each of the final 6 ns of restrained simulations, which started from each of the rerefined structures, a new average structure was calculated. These minimized average structures make up the “new” rerefined NMR ensemble, and are shown in Figure 2. Deviations from the experimental restraints were calculated using the sander module in AMBER (employing the pencut = −1 input flag for analyzing all restraints), and are shown in Table 1. The members of the rerefined ensemble are in good agreement with the experimental distance and torsion restraints, and are more similar to their minimized average structure, suggesting that the structure determination was more consistent.
The rerefined ensemble contains two main conformations, shown in Figure 2, bottom right, and described in Table 2. Conformation 1 is the dominant conformation, adopted by 70% of the final ensemble, and Conformation 2 is the less dominant conformation, adopted by 30% of the final ensemble (Figure 2, highlighted in green). The characteristic U-turn features of the original NMR ensemble, the original MgBound NMR ensemble (1YN2), the rerefined ensemble, and Conformations 1 and 2 are reported in Table 3. Conformation 1, with its G697 α torsion angle of 139°, is more comparable to the original NMR structure with its G697 α torsion angle of ∼116°, while Conformation 2 adopts a more canonical U-turn α torsion angle of 154°, closer to the MgBound ensemble. For perspective, most canonical U-turns adopt α torsion angle values from 160° to 170° (Supporting Table 1). An exception is HIV-1 A-rich hairpin loop which adopts an average α torsion value of 135°, which is very similar to the rerefined value of Conformation 1.
Table 2. NOE and RMSD Analyses for SLV Ensembles with Na+ Ions.
total no. of restraints =553 | original NMR | rerefinement |
---|---|---|
Deviation from Experimental Restraints | ||
NOE (Å) | 0.00029 ± 0.0004a | 0.0013 ± 0.0002 |
torsion (deg) | 0.0082 ± 0.0051 | 0 |
number of NOE deviations | 16.9 ± 2.9 | 11.2 ± 2.0 |
number of NOE deviations >0.1 Å | 0 | 2.4 ± 0.8 |
number of torsion deviations | 2.7 ± 1.7 | 0 |
number of torsion deviations >0.5 Å | 0 | 0 |
RMS Deviation from Ideal Geometry | ||
bonds (Å) | 0.0096 ± 0 | 0.068 ± 0.003 |
angles (deg) | 1.70 ± 0.01 | 2.89 ± 0.07 |
Heavy Atom RMSD to Minimized Average Structure | ||
overall (690–706, 690–705) (Å) | 1.11 ± 0.14, 1.38 ± 0.57 | 0.46 ± 0.16, 0.56 ± 0.39 |
stem (690–295,701–706) (Å) | 0.80 ± 0.08 | 0.21 ± 0.02 |
loop (696–700) (Å) | 0.73 ± 0.08 | 0.43 ± 0.16 |
Recalculated using the SANDER program in AMBER12.
Table 3. U-turn Characteristics of NMR Ensembles.
U-turn characteristic: | original MgFree ensemble | original MgBound ensemble | rerefined MgFree ensemble | Conf. 1 | Conf. 2 |
---|---|---|---|---|---|
G697 alpha (deg) | 116 ± 7 | 167 ± 34 | 151 ± 10 | 139 ± 4 | 153 ± 8 |
G697/A698 stacking (Å) | 3.6 ± 0.2 | 4.2 ± 0.4 | 3.7 ± 0.1 | 3.7 ± 0.0 | 3.7 ± 0.1 |
A698/C699 stacking (Å) | 3.7 ± 0.3 | 3.8 ± 0.2 | 3.6 ± 0.1 | 3.7 ± 0.0 | 3.6 ± 0.0 |
U696 N3–A698 5′phosphate stacking (Å) | not stacked: 4.2 ± 0.3 | stacked: 4.2 ± 0.6 | 4.2 ± 0.7 | 4.0 ± 0.1 | 5.2 ± 0.2 |
U696 2′OH–A698 N7 h-bond (Å) | 3.4 ± 0.3 | 2.5 ± 0.2 | 3.6 ± 0.6 | 3.8 ± 0.1 | 3.1 ± 0.2 |
U696 H3 – A698 3′phosphate h-bond (Å) | 8.8 ± 0.3 | 5.1 ± 0.7 | 8.0 ± 0.5 | 8.0 ± 0.3 | 8.9 ± 0.6 |
Another characteristic feature of U-turns is the hydrogen bond between U 2′ OH and R N7 on the minor groove side of the loop (refer to Figure 1c,d). In SLV, this hydrogen bond between U696 2′ OH and A698 N7 is present in Conformation 2 at an average heavy atom distance of 3.1 Å, which is within error bars of the original MgBound NMR ensemble. While Conformation 1 maintains a similar geometry, it samples a longer average distance of 3.8 Å, which is more similar to the original MgFree NMR ensemble. Another attribute of U-turns is the stacking of U with the 5′ phosphate group of R. In SLV, both with and without Mg2+ present, these U696 and A698 5′ phosphate distances are 4.2 Å, and remain this average distance in the rerefined ensemble. However, Conformation 1 consistently adopts lower distances of 4.0 Å at which stacking interactions may occur, similar to the MgBound NMR ensemble. Conformation 2 adopts longer distances of 5.2 Å at which no stacking would occur, as in the MgFree original NMR ensemble. What we conclude from this rerefinement is that the MgFree SLV RNA can adopt some canonical U-turn characteristics and equally fulfill the experimental NMR restraints.
MD Simulations Show a Gatekeeper Conformation of MgFree SLV Enables Mg2+ Binding
To understand the differences in dynamics of SLV when Mg2+ ions are either present or absent, we ran four sets of independent simulations, starting from MgBound or MgFree SLV RNA structures in the presence of monovalent or divalent ions. We started from model 1 of the MgBound NMR ensemble (MgBound starting structure) or each model of the rerefined MgFree ensemble (MgFree starting structure), and ran 10 250–300 ns length MD simulations in either 40 mM MgCl2 or 50 mM NaCl. The RMSD correlation plots shown in Figure 3 measure the deviation from the MgBound loop (x-axis) and MgFree loop (y-axis), and are colored by the number of structures adopting a particular RMSD value. Below each correlation plot is a 1-dimensional histogram which shows each of the 10 simulations’ RMSD to the MgBound loop (x-axis). From the 1-dimensional histograms it is apparent that there is a bimodal distribution of the SLV RNA structures in the presence of 40 mM MgCl2 with peaks centered on 1.3 and 2.0 Å. In contrast, there is a single broad distribution of SLV RMSDs to the MgBound loop structures in the presence of 50 mM NaCl centered on 2.0 Å. The SLV RNA prefers the same conformational minima when the ion environment is the same, despite the conformation of the starting structure. Interestingly, the 2-dimensional correlation plots show that in the presence of Mg2+, the population of lower 1.3 Å RMSD structures are directly accessible only from the 2.0 Å structures. In this way, the 2.0 Å structure is a gatekeeper, or an intermediate structure in the path to the Mg2+ bound loop. In the absence of Mg2+, the simulations show the loop populates many structures, but this gatekeeper structure represents the most populated structure. However, in the presence of Mg2+, the loop populates a more restricted set of structures. The gatekeeper structure is still populated, but the RNA is able to form a structure more similar to the MgBound reference with the Mg2+ ions present. The RNA is sampling structures that are close to the folded loop structure (i.e., the gatekeeper structure at 2.0 Å RMSD to the MgBound reference), but Mg2+ is required to stabilize the U-turn in its final form (i.e., at 1.3 Å RMSD to the MgBound reference).
We wanted to compare the 2.0 Å representative structures found in the presence and absence of Mg2+ to the low 1.3 Å representative structure preferred, and only seen, when Mg2+ is present. To do so we only considered structures with RMSDs to the MgBound loop below 2.5 Å, and split each of these structure sets by RMSD similarity into two clusters using the kmeans algorithm in CPPTRAJ (Supporting Scripts 3 and 4). Figure 4a shows the RMSD from the MgBound loop distribution for the top cluster from divalent ion simulations (black), representing the 1.3 Å structure, the top cluster from monovalent ion simulations (blue), representing the 2.0 Å gatekeeper structure, and the second cluster from divalent ion simulations (red), representing the 2.0 Å gatekeeper structure observed in simulations with Mg2+ present. The distance across the major groove of the loop (Figure 4b), represented by a phosphate–phosphate distance of residues U695 to A698, shows a shorter distance is sampled by the top divalent ion cluster, and slightly longer distances are sampled by the second cluster and the top cluster in the absence of Mg2+. Similarly, the radius of gyration (Figure 4c) of the UNR U-turn residues is smaller in the top divalent ion cluster, indicating a tighter U-turn in the presence of Mg2+. The other two clusters sample a larger radius of gyration. The representative structures for the top clusters in divalent and monovalent ions are shown overlapped in Figure 4d, and again demonstrate the more compact U-turn of the loop in the presence of divalent ions. The representative gatekeeper structures from 2.0 Å are overlapped in Figure 4e, and show a very similar looser U-turn structure.
The overall similarity between these structures can be assessed by how well they fit the MgBound NMR restraints. Table 4 shows the results of this analysis for each structure in the three clusters compared to the original MgBound NMR ensemble. Though the top cluster from the divalent ion environment showed the best fit to experiment with an 0.01 Å NOE deviation, the second divalent ion cluster and top monovalent ion cluster’s deviation of 0.02 Å were still remarkably small and very similar to each other. A detailed analysis of deviations in the loop NOEs can be found in Supporting Table 2, and shows that different NOEs are violated by each cluster. Specifically, the less compact gatekeeper loop structure had violations involving residues U696, G697, and A698, again reflecting the less tight U-turn conformation.
Table 4. Top Clusters Populated in Monovalent and Divalent Ion Environments Equally Satisfy MgBound NMR Restraints.
total no. of restraints = 488 | original MgBound NMR (1YN2) | top cluster, divalent ion environment | top cluster, monovalent ion environment | 2nd cluster, divalent ion environment |
---|---|---|---|---|
Deviation from Experimental Restraints | ||||
NOE (Å) | 0.004 ± 0.002 | 0.01 ± 0.08 | 0.02 ± 0.13 | 0.02 ± 0.11 |
torsion (deg) | 0.037 ± 0.010 | 0.11 ± 0.51 | 0.07 ± 0.34 | 0.06 ± 0.28 |
Heavy Atom RMSD to Minimized Average Structure | ||||
overall (690–706) (Å) | 1.11 ± 0.14 | 4.22 ± 0.39 | 3.70 ± 0.53 | 4.03 ± 0.53 |
stem (690–295,701–706) (Å) | 0.63 ± 0.19 | 2.24 ± 0.22 | 2.22 ± 0.27 | 2.32 ± 0.27 |
loop (696–700) (Å) | 0.64 ± 0.01 | 1.39 ± 0.19 | 2.07 ± 0.21 | 1.95 ± 0.27 |
Density-based grid analysis was performed on each cluster of the divalent ion simulations to understand the different influence Mg2+ had on the more compact and less compact U-turn. The localization of the top 10% of Mg2+ ion density is reported in Figure 5 and Table 5. Figure 5 shows a significant amount of density coordinating the phosphates of the UNR motif residues. The association of Mg2+ in this region explains the ability of the U-turn to preferentially adopt compact structures, since charged phosphate groups are shielded from each other by the localized charge density of the Mg2+ ion. Interestingly, the localization of associated Mg2+ did not change in the second cluster from divalent ion simulations (gatekeeper structure), despite the less-compact nature of the U-turn (Supporting Figure 6). What did change was the percent occupancy, or residence time, of the associated Mg2+, which was far lower for most residues in the second less compact structure (and is reported in Table 5). We hypothesize that this is a result of imperfect clustering, and is a combination of Mg2+ ions binding and leaving rather than remaining bound for any length of time. Table 5 also identifies that the majority of density is localized in previously described binding sites 1 and 3.5 In the simulations starting from MgFree + 40 mM MgCl2, Mg2+ binding in sites 1 and 3 correlates with RNA sampling the MgBound conformation, while binding in sites 2 and 4 does not (Supporting Figure 7). Supporting Table 3 details the percent occupancy of all atoms considered part of binding sites 1–4, and all of these sites have nonzero occupancy in the simulations.5 Additionally, this table shows the average distance between Mg2+ ions and their RNA hydrogen-bond acceptors is around 4.0 Å in each case, indicating that the Mg2+ ions associate with the RNA through interactions with first shell water molecules and do not directly chelate RNA phosphate oxygen atoms.
Table 5. Atoms with Mg2+ Ion Occupancy over 10%.
residue and atom | percent occupancy in top cluster from divalent ion environment (%) | percent occupancy in 2nd cluster from divalent ion environment (%) | part of established binding site? |
---|---|---|---|
U695 O4 | 16.9 | 20.5 | 3 |
U696 OP2 | 43.4 | 15.1 | 1 |
G697 OP2 | 5.3 | 14.6 | no |
A698 OP2 | 32.1 | 18.6 | 1 |
C699 OP1 | 14.1 | 13.2 | 3 |
C699 OP2 | 23.1 | 9.2 | 3 |
A701 OP2 | 10.9 | 9.3 | no |
To gain insight into the motion the SLV RNA undergoes to adopt its characteristic conformations in the presence and absence of Mg2+, principal component (PC) analysis was performed on three trajectories: the combined 10 simulations in the presence of monovalent ions, the combined 10 simulations in the presence of divalent ions, and the 300 K replica from temperature replica exchange molecular dynamics (T-REMD). The T-REMD was run starting from the rerefined MgFree structures in the presence of 50 mM KCl, without Mg2+ ions. Because of the enhanced sampling exchanges performed in T-REMD, the conformational space sampled was much larger than the 10 noninteracting simulations combined. PC projections were calculated for the combined trajectories of all systems, and each individual system was independently projected along the top two PCs, shown in Figure 6. The lowest frequency motions described by the PCs are considered relevant to function, with the higher frequency modes describing more local fluctuations.36 It is apparent that the PC space sampled by the divalent ion simulations (black) is more limited than the space sampled by an equivalent amount of simulation with monovalent ions (red). The T-REMD samples a much wider range of PC space (blue) than either set of simulations. Additionally, it is important that the PC space sampled by the divalent ion simulations overlaps with that sampled by the monovalent ion simulations. This indicates that the SLV RNA is not sampling new conformational space or dynamics in the presence of Mg2+ ions, but rather that Mg2+ ions serve to limit the dynamics and conformations adopted by the SLV RNA loop.
Evidence Supporting the Proposed Role of the U700 Base
The U700 base has been proposed to play a significant role in the dynamics of SLV RNA.8 In the presence of Mg2+, the base contributes a phosphate group to a Mg2+ binding site, increasing the enthalpic stability of the MgBound SLV RNA.9 Here, we confirm this contribution of U700 in our divalent ion simulations. Specifically, Figure 5 shows Mg2+ density localized around both the 5′ and 3′ phosphate groups of the U700 backbone and Table 5 and Supporting Table 3 contain the measured percent occupancies (note that the 3′ phosphate of U700 is equivalent to the 5′ phosphate of A701 and is represented as such in the analysis). It was also suggested that the U700 base could increase the dynamics of the unbound loop, providing an entropic advantage for SLI recognition.9Figure 7 shows the RMS fluctuations for each residue of SLV RNA in the presence and absence of Mg2+. The highest fluctuations in the absence of Mg2+ ions can be seen in residue U700, and the largest change upon Mg2+ association is seen in the same residue.
Discussion
The Varkud satellite ribozyme SLV participates in a tertiary structure interaction that aligns the catalytic site of the RNA with its intramolecular cleavage site. In the isolated ribozyme, SLV forms a kissing complex with the loop of SLI, anchoring SLI to the ribozyme body, so the properties of SLV that allow its three GAC nucleotides to form base pairs with SLI are essential for a productive interaction. The NMR solution structure of SLV was solved based on interpretations of the NMR observables measured with and without divalent ions, and here we revisit those structures with MD simulations to describe its molecular processes in more detail.
Rerefinement of the SLV structure ensemble with a more complete description of the NMR solution environment (including ∼50 mM excess NaCl) suggested that the structures separate into two main loop conformations, each of which equally fulfill the experimental NMR restraints. These conformations have characteristics of noncanonical and canonical U-turn loops. Each conformation adopts some characteristics of the MgBound NMR ensemble, as shown in Table 3, though a single conformation does not adopt all representative U-turn characteristics.
MD simulations starting from each solution NMR structure (MgFree and MgBound) were performed in both a divalent and monovalent ion environment. Despite the starting structure, the simulations with divalent ions populated a bimodal distribution of low RMSDs to the MgBound loop, with peaks centering around 1.3 and 2.0 Å, and the simulations with monovalent ions populated a peak at 2.0 Å RMSD from the MgBound loop and did not sample lower RMSD structures. Analysis of the representative clusters in the presence of Mg2+ and in a monovalent ion environment show that Mg2+ binding is required for access to the lower 1.3 Å RMSD. Additionally, the similarity of the monovalent ion cluster and the second cluster of the divalent ion simulations tell us that this structure acts as a gatekeeper for accessing the final Mg2+-bound structure.
Though the subtle differences in populated structures are an influence of Mg2+ binding, analysis of the overall dynamics using principal components shows that the low frequency modes of motion for the SLV RNA in the presence of divalent ions is a subset of the projection seen for monovalent ions. Instead of shifting the SLV RNA to a previously unseen structure, Mg2+ ions instead limit the conformational ensemble and stabilize loop characteristics which are already present in the monovalent ion ensemble.
Molecular dynamics simulations have traditionally struggled to adequately describe RNA structure and dynamics, since they remain extremely sensitive to solution environment and subtle force field parameters that guide nonbonded interactions. Here, we present evidence which both qualitatively and quantitatively matches experiment. Our improved description of the ion environment adequately reproduces the subtle conformational shift seen in multiple experiments of SLV RNA.
Conclusions
Using molecular dynamics simulations of SLV RNA in the presence of divalent and monovalent ions, we are able to confirm experimental hypotheses on an all-atom level. We show that the U-turn prefers a more compact structure in the presence of Mg2+, effectively neutralizing the electronegative charges of the loop phosphate groups. We confirm the proposed role of the U700 base as a contributor to Mg2+ binding site, and as a means to increase entropy of the unbound loop (and therefore conformational search space for its substrate SLI) in a monovalent ion environment. Additionally, we show SLV RNA preferentially samples a “gatekeeper” conformation in the absence of Mg2+, which is a conformational substate leading to Mg2+ binding and formation of the tight U-turn structure.
Acknowledgments
This work was supported by the National Institutes of Health [R01-GM098102]. This research was enabled by the Blue Waters sustained-petascale computing project [NSF OCI 07–25070, PRAC OCI-10440031], the National Science Foundation Extreme Science and Engineering Discovery Environment [XSEDE, OCI-1053575 and allocation MCA01S027P], and the Center for High Performance Computing at the University of Utah.
Supporting Information Available
The Supporting Information is available free of charge on the ACS Publications website at DOI: 10.1021/acs.jpcb.5b05190.
The authors declare no competing financial interest.
Supplementary Material
References
- Beattie T. L.; Olive J. E.; Collins R. A. A Secondary-Structure Model for the Self-Cleaving Region of Neurospora VS RNA. Proc. Natl. Acad. Sci. U. S. A. 1995, 92, 4686–4690 10.1073/pnas.92.10.4686. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rastogi T.; Beattie T. L.; Olive J. E.; Collins R. A. A Long-Range Pseudoknot Is Required for Activity of the Neurospora VS Ribozyme. EMBO J. 1996, 15, 2820–2825. [PMC free article] [PubMed] [Google Scholar]
- Hiley S. L.; Collins R. A. Rapid Formation of a Solvent-Inaccessible Core in the Neurospora Varkud Satellite Ribozyme. EMBO J. 2001, 20, 5461–5469 10.1093/emboj/20.19.5461. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Campbell D. O.; Legault P. Nuclear Magnetic Resonance Structure of the Varkud Satellite Ribozyme Stem-Loop V RNA and Magnesium-Ion Binding from Chemical-Shift Mapping. Biochemistry 2005, 44, 4157–4170 10.1021/bi047963l. [DOI] [PubMed] [Google Scholar]
- Campbell D. O.; Bouchard P.; Desjardins G.; Legault P. NMR Structure of Varkud Satellite Ribozyme Stem-Loop V in the Presence of Magnesium Ions and Localization of Metal-Binding Sites. Biochemistry 2006, 45, 10591–10605 10.1021/bi0607150. [DOI] [PubMed] [Google Scholar]
- Bouchard P.; Legault P. Structural Insights into Substrate Recognition by the Neurospora Varkud Satellite Ribozyme: Importance of U-Turns at the Kissing-Loop Junction. Biochemistry 2014, 53, 258–269 10.1021/bi401491g. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bonneau E.; Legault P. NMR Localization of Divalent Cations at the Active Site of the Neurospora VS Ribozyme Provides Insights into RNA-Metal-Ion Interactions. Biochemistry 2014, 53, 579–590 10.1021/bi401484a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bouchard P.; Legault P. A Remarkably Stable Kissing-Loop Interaction Defines Substrate Recognition by the Neurospora Varkud Satellite Ribozyme. RNA 2014, 20, 1451–1464 10.1261/rna.046144.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bouchard P.; Lacroix-Labonté J.; Desjardins G.; Lampron P.; Lisi V.; Lemieux S.; Major F.; Legault P. Role of SLV in SLI Substrate Recognition by the Neurospora VS Ribozyme. RNA 2008, 14, 736–748 10.1261/rna.824308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheatham T. E. 3rd; Case D. A. Twenty-Five Years of Nucleic Acid Simulations. Biopolymers 2013, 99, 969–977 10.1002/bip.22331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bergonzo C.; Henriksen N. M.; Roe D. R.; Swails J. M.; Roitberg A. E.; Cheatham T. E. 3rd Multidimensional Replica Exchange Molecular Dynamics Yields a Converged Ensemble of an RNA Tetranucleotide. J. Chem. Theory Comput. 2014, 10, 492–499 10.1021/ct400862k. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roe D. R.; Bergonzo C.; Cheatham T. E. 3rd Evaluation of Enhanced Sampling Provided by Accelerated Molecular Dynamics with Hamiltonian Replica Exchange Methods. J. Phys. Chem. B 2014, 118, 3543–3552 10.1021/jp4125099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Henriksen N. M.; Roe D. R.; Cheatham T. E. 3rd Reliable Oligonucleotide Conformational Ensemble Generation in Explicit Solvent for Force Field Assessment Using Reservoir Replica Exchange Molecular Dynamics Simulations. J. Phys. Chem. B 2013, 117, 4014–4027 10.1021/jp400530e. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zgarbová M.; Otyepka M.; Šponer J.; Mládek A.; Banáš P.; Cheatham T. E. 3rd; Jurečka P. Refinement of the Cornell et Al. Nucleic Acids Force Field Based on Reference Quantum Chemical Calculations of Glycosidic Torsion Profiles. J. Chem. Theory Comput. 2011, 7, 2886–2902 10.1021/ct200162x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yildirim I.; Kennedy S. D.; Stern H. A.; Hart J. M.; Kierzek R.; Turner D. H. Revision of AMBER Torsional Parameters for RNA Improves Free Energy Predictions for Tetramer Duplexes with GC and iGiC Base Pairs. J. Chem. Theory Comput. 2012, 8, 172–181 10.1021/ct200557r. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Steinbrecher T.; Latzer J.; Case D. A. Revised AMBER Parameters for Bioorganic Phosphates. J. Chem. Theory Comput. 2012, 8, 4405–4412 10.1021/ct300613v. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jorgensen W. L.; Chandrasekhar J.; Madura J. D.; Impey R. W.; Klein M. L. Comparison of Simple Potential Functions for Simulating Liquid Water. J. Chem. Phys. 1983, 79, 926–935 10.1063/1.445869. [DOI] [Google Scholar]
- Joung I. S.; Cheatham T. E. 3rd Determination of Alkali and Halide Monovalent Ion Parameters for Use in Explicitly Solvated Biomolecular Simulations. J. Phys. Chem. B 2008, 112, 9020–9041 10.1021/jp8001614. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Allnér O.; Nilsson L.; Villa A. Magnesium Ion–Water Coordination and Exchange in Biomolecular Simulations. J. Chem. Theory Comput. 2012, 8, 1493–1502 10.1021/ct3000734. [DOI] [PubMed] [Google Scholar]
- Case D. A.; Cheatham T. E. 3rd; Darden T.; Gohlke H.; Luo R.; Merz K. M.; Onufriev A.; Simmerling C.; Wang B.; Woods R. J. The Amber Biomolecular Simulation Programs. J. Comput. Chem. 2005, 26, 1668–1688 10.1002/jcc.20290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Case D. A.; Darden T. A.; Cheatham T. E. 3rd; Simmerling C. L.; Wang J.; Duke R. E.; Luo R.; Walker R. C.; Zhang W.; Merz K. M.; et al. AMBER 12; University of California, San Francisco: San Francisco, 2012.
- Salomon-Ferrer R.; Götz A. W.; Poole D.; Le Grand S.; Walker R. C. Routine Microsecond Molecular Dynamics Simulations with AMBER on GPUs. 2. Explicit Solvent Particle Mesh Ewald. J. Chem. Theory Comput. 2013, 9, 3878–3888 10.1021/ct400314y. [DOI] [PubMed] [Google Scholar]
- Cheatham T. E. 3rd; Cieplak P.; Kollman P. A. A Modified Version of the Cornell et Al. Force Field with Improved Sugar Pucker Phases and Helical Repeat. J. Biomol. Struct. Dyn. 1999, 16, 845–862 10.1080/07391102.1999.10508297. [DOI] [PubMed] [Google Scholar]
- Wang J.; Cieplak P.; Kollman P. A. How Well Does a Restrained Electrostatic Potential (RESP) Model Perform in Calculating Conformational Energies of Organic and Biological Molecules?. J. Comput. Chem. 2000, 21, 1049–1074. [DOI] [Google Scholar]
- Pérez A.; Marchán I.; Svozil D.; Sponer J.; Cheatham T. E. 3rd; Laughton C. A.; Orozco M. Refinement of the AMBER Force Field for Nucleic Acids: Improving the Description of Alpha/gamma Conformers. Biophys. J. 2007, 92, 3817–3829 10.1529/biophysj.106.097782. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Essmann U.; Perera L.; Berkowitz M. L.; Darden T.; Lee H.; Pedersen L. G. A Smooth Particle Mesh Ewald Method. J. Chem. Phys. 1995, 103, 31–34 10.1063/1.470117. [DOI] [Google Scholar]
- Sindhikara D. J.; Kim S.; Voter A. F.; Roitberg A. E. Bad Seeds Sprout Perilous Dynamics: Stochastic Thermostat Induced Yrajectory Synchronization in Biomolecules. J. Chem. Theory Comput. 2009, 5, 1624–1631 10.1021/ct800573m. [DOI] [PubMed] [Google Scholar]
- Henriksen N. M.; Davis D. R.; Cheatham T. E. 3rd Molecular Dynamics Re-Refinement of Two Different Small RNA Loop Structures Using the Original NMR Data Suggest a Common Structure. J. Biomol. NMR 2012, 53, 321–339 10.1007/s10858-012-9642-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Loncharich R. J.; Brooks B. R.; Pastor R. W. Langevin Dynamics of Peptides: The Frictional Dependence of Isomerization Rates of N-Acetylalanyl-N′-Methylamide. Biopolymers 1992, 32, 523–535 10.1002/bip.360320508. [DOI] [PubMed] [Google Scholar]
- Ryckaert J.-P.; Ciccotti G.; Berendsen H. J. C. Numerical Integration of the Cartesian Equations of Motion of a System with Constraints: Molecular Dynamics of N-Alkanes. J. Comput. Phys. 1977, 23, 327–341 10.1016/0021-9991(77)90098-5. [DOI] [Google Scholar]
- Shao J.; Tanner S. W.; Thompson N.; Cheatham T. E. 3rd Clustering Molecular Dynamics Trajectories: 1. Characterizing the Performance of Different Clustering Algorithms. J. Chem. Theory Comput. 2007, 3, 2312–2334 10.1021/ct700119m. [DOI] [PubMed] [Google Scholar]
- Roe D. R.; Cheatham T. E. 3rd PTRAJ and CPPTRAJ: Software for Processing and Analysis of Molecular Dynamics Trajectory Data. J. Chem. Theory Comput. 2013, 9, 3084–3095 10.1021/ct400341p. [DOI] [PubMed] [Google Scholar]
- Case D. A.; Darden T. A.; Cheatham T. E. 3rd; Simmerling C. L.; Roitberg A.; Wang J.; Duke R. E.; Luo R.; Roe D. R.; Walker R. C.; et al. AMBER 14; University of California, San Francisco: San Francisco, 2014.
- Hopkins C. W.; Le Grand S.; Walker R. C.; Roitberg A. E. Long Time Step Molecular Dynamics through Hydrogen Mass Repartitioning. J. Chem. Theory Comput. 2015, 11, 1864–1874 10.1021/ct5010406. [DOI] [PubMed] [Google Scholar]
- Konerding D. E.; Cheatham T. E. 3rd; Kollman P. A.; James T. L. Restrained Molecular Dynamics of Solvated Duplex DNA Using the Particle Mesh Ewald Method. J. Biomol. NMR 1999, 13, 119–131 10.1023/A:1008353423074. [DOI] [PubMed] [Google Scholar]
- Amadei A.; Linssen A. B. M.; Berendsen H. J. C. Essential Dynamics of Proteins. Proteins: Struct., Funct., Genet. 1993, 17, 412–425 10.1002/prot.340170408. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.