Abstract
Remarkable new insight has emerged into the biological role of RNA in cells. RNA folding and dynamics enable many of these newly discovered functions, calling for an understanding of RNA self-assembly and conformational dynamics. Because RNAs pass through multiple structures as they fold, an ensemble perspective is required to visualize the flow through fleetingly populated sets of states. Here, we combine microfluidic mixing technology and small angle X-ray scattering (SAXS) to measure the Mg-induced folding of a small RNA domain, the tP5abc three helix junction. Our measurements are interpreted using ensemble optimization to select atomically detailed structures that recapitulate each experimental curve. Structural ensembles, derived at key stages in both time-resolved studies and equilibrium titrations, reproduce the features of known intermediates, and more importantly, offer a powerful new structural perspective on the time-progression of folding. Distinct collapse phases along the pathway appear to be orchestrated by specific interactions with Mg ions. These key interactions subsequently direct motions of the backbone that position the partners of tertiary contacts for later bonding, and demonstrate a remarkable synergy between Mg and RNA across numerous time-scales.
INTRODUCTION
Decades of research revealed the dynamics that enable protein folding and conformational transitions, propelling the development of novel biomaterials (1,2), drug design (3,4) and therapeutics (5,6). Despite the growing recognition of its central biological roles as mediators and catalysts of numerous biological reactions (7–9), no similar picture exists for RNA (10). Understanding how RNA structures fold to accomplish these various roles can unlock new strategies for drug design and treatment (11–13). However, as the RNA folding landscape is rugged, involving many intermediates, pathways and co-existing states (11,14,15), experimental results are scarce and challenging to interpret. This richness and complexity demands an experiment that embraces an ensemble view of the process (11).
RNA folding has been studied by many powerful experimental techniques, including those which provide site specific information about folding intermediates (16–20), interconverting species (21–24), and yield intricate frameworks for the reaction (10,25,26). While critical to our full understanding of the process, these experiments provide a limited window into a complete visualization of folding, which also relies on crucial, large scale structural re-arrangements (27).
Small-angle X-ray scattering reports large scale, as well as subtle, changes in global molecular shapes in an ensemble measurement. Time resolution can be achieved by integrating either stopped- or continuous- flow mixing (28–31). These challenging experiments are small in number and, to date, have yielded an incomplete picture of RNA’s route(s) from unfolded to native states. Limitations of previous SAXS experiments reflect difficulties associated with measuring high quality scattering profiles on the shortest time scales and/or over a broad range of scattering angles that reflect both small and large scale changes in conformation. In addition, previous studies relied on the application of standard methods for data interpretation which allow only general statements and rough conformational properties to be discerned.
Here, we reveal the Mg-dependent folding pathway of a ubiquitous RNA motif, the three helix junction (32), using ensemble decomposition methods and atomically detailed models to interpret both static, and time-resolved SAXS data. The latter were acquired with a new type of microfluidic mixer, which is based on a novel design (33). The former relied on advanced, molecular dynamics based simulation techniques to offer comprehensive information on structures and dynamics, charting plausible and consistent folding pathways for this RNA. Experimental refinement of these simulation results is carried out by ensemble optimization methods (EOM), an essential step, and provides atomically detailed structures for evaluation. This tightly coupled approach alleviates the above limitations for interpreting SAXS data. It provides unique insight into the structures that populate the folding pathway of this RNA, offering never-before-seen intermediates that underscore the unique role(s) of Mg2+ in sculpting the folding landscape. The properties of known reaction intermediates are recognized in this analysis, validating the approach.
Our work focuses on the tP5abc three-helix junction (Figure 1) a truncated subdomain from the Tetrahymena ribozyme. At 56 nucleotides in length, tP5abc is ideally sized for these studies. It is small enough to model with atomic detail, yet displays a rich variety of interactions that often characterize RNA folding: electrostatic repulsion (31,34), Mg2+ binding (35–39), both non-specific and specific compactions (28), helix-docking (40), tertiary contact formation (14) and secondary structure shifts (41). Furthermore, previous studies of this molecule using a variety of techniques provide complementary data on which to benchmark and compare our results (e.g. (16,34,41–43)), though past measurements of the rates of P5abc folding show a clear dependence on the ionic conditions employed (44,45). Together with new intermediates presented here, we identify a clear trajectory across the landscape, as the RNA evolves from extended to native structures.
MATERIALS AND METHODS
Detailed descriptions of all experiments, simulations and data analysis are provided in the Supporting Information. Here, we briefly outline the methods employed.
RNA synthesis
TP5abc was synthesized in vitro using PCR amplified DNA and purified by size exclusion chromatography using a Superdex HiLoad 16/600 200 pg column on an AKTA FPLC purifier. After purification, the RNA was buffer exchanged into a solution containing 20 mM KCl, 10 mM KMOPS, 20 μM EDTA, pH 7 (we refer to this as buffer A) using Amicon Ultra-0.5 10KdA concentrators before being stored.
SAXS experiments
For experimental conditions with varying KCl concentrations, the samples were buffer exchanged five times into solutions containing the stated amount of monovalent salt in addition to 10 mM KMOPS and 20 μM EDTA, pH 7. Samples intended for studies as a function of divalent ion concentration were initially prepared in buffer A. For static measurements, Mg was added to the quoted concentration immediately before exposure to X-rays. Details about sample preparation, experimental setup and interpretation of SAXS data for static experiments are provided in the Supporting Information, text and Supplementary Figures S1 and S2.
For the time-resolved experiments, tP5abc in buffer A was rapidly mixed with Mg to reach a concentration of 1 mM at the end of the mixing region for all time points, via rapid diffusion. Details about the time-resolved experiments, including the design, fabrication and testing of the microfluidic mixer, experimental setup and data collection, are provided in the Supporting Information text, Supplementary Figures S3–S7 and Results and Discussion: Time-resolved SAXS Experiments.
Static and time-resolved SAXS data were acquired at beamline G1 of the Cornell High Energy Synchrotron Source (CHESS). The RNA was annealed at 65°C for 5 minutes and subsequently slow cooled to room temperature before both experiments. For static experiments, RNA concentrations of 80, 60 and 40 μM were used. At each salt condition, data were acquired at multiple RNA concentrations to remove the effects of interparticle interference in the data. Buffer subtracted SAXS curves for a given salt condition but different sample concentrations were matched in the range q > 0.1 A−1, and either averaged together, or extrapolated to the zero concentration limit to give the final curves. Time-resolved experiments were carried out at [RNA] of 80 μM. All SAXS data were analyzed using MATLAB (MathWorks, Natick, MA, USA) with code written in-house. The SAXS data for the static and time-resolved experiments are available through the SASBDB, codes are given at the end of the manuscript.
Simulations
Simulations used the known structures of the beginning and ending states of the folding reaction. A minimum energy pathway is computed, connecting the two, and serves as the starting point for extensive MD simulations of the pool of possible structures, including waters and ions. Detailed descriptions of structural pool generation from simulation is provided in the Simulations section within the Supporting Information.
Ensemble optimization
SAXS profiles for all MD structures were calculated with CRYSOL (46), with maximum harmonic order 15, Fibonacci grid of order 18 and default hydration parameters. Ensemble optimization was run to fit the calculated SAXS curves from the pool to the experimental SAXS data using GAJOE 2.0 (47). For each experimental condition, the program was run for 2000 generations, eight ensembles were selected for each generation, each ensemble had a self-optimizing size, and the scheme was iterated 100 times to accumulate statistics with constant subtraction allowed. Selections made in the final rounds of ensemble optimization were interpreted to provide the results and landscapes presented here.
Thus, for each condition (every SAXS curve), 800 ensembles of structures were interpreted, where each ensemble independently fits the experimental SAXS data. The mean number of structures required to fit the data at each condition (in each ensemble) varies between 6 and 10 on average, depending on the experimental condition. Multiple ensembles of structures that equally fit the data (as assessed through a chi-square analysis) are included to accumulate statistics, and to account for slight variations in structures that recapitulate the data.
RESULTS AND DISCUSSION
Static SAXS experiments
RNA folding occurs in several steps or phases, with molecules populating a variety of intermediate, partially folded conformations (40). To assess the global structural properties of these differing states we first performed static SAXS experiments over a range of ionic conditions. To accentuate differing interactions and ion-specific conformations, experiments were carried out in solutions containing various amounts of either KCl or MgCl2. The former promotes the effects of electrostatic screening and non-specific collapse (28), while the latter is used to stabilize tertiary contact formation (48). These studies can also reveal intermediates that rely on the presence of Mg (40).
SAXS profiles display the intensity of scattered X-rays (I) as a function of momentum transfer q, which equals 4πsin θ/λ, where λ is the X-ray wavelength and 2θ is the scattering angle. Scattering into the smallest angles, where the signal is largest, reflects the global size of a molecule through its radius of gyration (Rg). The expected/computed Rg’s of purely extended (unfolded) or native tP5abc, shown in Figure 2A, vary significantly. Interesting differences are observed following the addition of the two different salts. As [KCl] increases, a gradual decrease in Rg is measured, reflecting changes in the screening length of the phosphate charges by additional salt. In contrast, Mg is far more efficient in compacting the RNA, consistent with its role in stabilizing tertiary contacts. While Rg values and trends with increasing salt provide limited insight into the folding landscape, they contain some important information: Mg specific interactions have global consequences for structures in the tP5abc ensemble.
Additional information about RNA’s conformations can be inferred through inspection of the scattering at higher q values than those used to determine Rg. Higher q scattering is emphasized through Kratky plots of the SAXS data, shown in Figure 2C, which display q2I versus q. The height of the mid-q peak (q ∼ 0.1 Å−1) and the slope of the higher q scattering (q > 0.1 Å−1) are important features of the Kratky plots and reveal subtle details of molecular conformations. A pronounced peak in the mid-q regime followed by a negatively sloped tail generally indicates ‘folded’ or compact conformations (49). A shallower peak followed by a less strongly sloped tail signifies ‘unfolded’ or extended conformations. Changes in the higher q portions of the curve are interpreted to indicate subtle structural changes such as extension/expansion for a shallower slope, or compaction for a steeper slope. These trends are illustrated in Figure 2A, using computed profiles for extended and native tP5abc generated from pdb coordinates.
Turning to the data, scattering profiles acquired at the lowest salt conditions display ‘unfolded’ behavior in Kratky plots, with a low mid-q peak and flat slope at higher q. The mid-q peak rises slightly with increasing [KCl], but no discernible trend is seen in the high q tail. These curves display few signatures of the folded state. In contrast, Kratky plots of profiles acquired as a function of increasing [Mg] show a more definitive increase in peak height at mid-q, paired with a gradually declining tail. At the highest Mg concentrations, the curve's features hint that a fraction of the population may be occupying native (or at least highly compact) states.
Ensemble modeling
The complex and subtle behavior of the curves at high q is difficult to interpret by visual inspection. The analysis is further complicated by the possible presence of coexisting states under any given condition. A more quantitative interpretation of these changes can be obtained through a deconvolution of the SAXS profiles into curves that describe representative RNA conformations, using an ensemble optimization method (50). Ensemble optimization identifies sets of model structures, selected from a vast pool representing many potential states, whose summed SAXS profiles best fit the input experimental curve. This method is especially powerful for capturing nucleic acids conformations, as the phosphates along the backbone provide the majority of scattering contrast (51). However, a large and physically reasonable pool of models is required as a starting point.
To generate this pool we performed MD simulations, starting from the NMR derived (no Mg) structure (41), and ending at the native state, truncated from the P4P6 crystal structure (52). Subsequently, we used MD simulations with varying solution conditions to refine the two end state structures (see SI for details). We used the reaction path (RP) approach (53–55) to generate plausible pathways between these states. Given the coordinates of these two states, we sought a curve in configuration space that connects them, with a minimal free energy barrier. We added the structures along this pathway to the pool, and in addition explored and added conformations orthogonal to this path (see Supporting Information, ‘Structure Pool’ section for more discussion on pool design, and Supplementary Figure S8). The theoretical SAXS profiles of each model conformation is then calculated (46). As a next step, sets of structures are selected from this pool through a well-defined iterative refinement procedure (50). The summed scattering profile of each set recapitulates the given input experimental SAXS curve, as assessed by a chi square analysis. The selection of multiple sets (800 for each condition) accounts for variations in structures that can adequately fit the data. With this approach, the MD generated pool is refined against experimental SAXS data for each experimental condition, circumventing challenges associated with current MD force-fields and ion modelling (56). The structures selected from the pool are interpreted to be representative of the underlying folding landscape at each experimental condition. This procedure is summarized in Figure 3 and fully described in the Supporting Information, including a critical assessment of the ensemble approach (the section title).
Static RNA landscapes
The fits achieved using this method show excellent agreement between the computed profiles of selected states and raw SAXS curves (Figure 4C and Supplementary Figure S9). Having identified sets of structures from the pool that match each experimental condition (mean ensemble size shown in Supplementary Figure S10), we developed a parameterization to aid in visualizing the selected conformations. We compute two metrics for each model structure: its Rg and the distance to native state (ΔN). The latter parameter (defined in Figure 4D and Supplementary Figure S11) is computed by summing the distance between all pairs of tertiary contacts in a given model structure, and subtracting from this sum the distances measured in the native structure. The residues that contribute to each tertiary contact are pictured in color in Figure 1. To compute ΔN for one tertiary contact (for example the contact between A139 and G164, depicted in orange in Figure 1), the distance between them is computed within each model and the corresponding distance measured in the native structure subtracted. Thus, the overall value of ΔN drops to zero for the native structure, while high value of ΔN suggests lack of native contact formation. This metric provides a quantitative measure of proximity to the native state.
Each selected model is represented by a point placed on a landscape, whose axes correspond to these two metrics. This landscape is also a heat map, where the density of selected structures within a 0.5 Å circle centered about each point is represented by color. To aid the presentation, these maps are smoothed by a 5-point rolling mean. Landscapes derived from the static salt titrations are shown in Figure 4A and B (with more details in Supplementary Figure S12). A representative structure is shown near highly populated clusters to illustrate the shared conformational properties of these groups of states (to aid in evaluating the range of structures present, an expanded number of structures for each cluster and landscape is given in Supplementary Figures S13–S19).
At the lowest salt condition probed, extended RNA conformations result from repulsive interactions between locally unscreened phosphates along the charged backbone. The majority of selected models populate the large Rg, large ΔN corner, in the upper left of the conformational landscape. These expanded states are consistent with the measured Kratky plot (signalling an ‘unfolded’ rather than ‘folded’ ensemble), as well as the relatively large measured Rg. The more effective backbone screening that occurs with increasing [KCl] enables some compaction, and the nature of the selected models shifts from extended to so-called screened or electrostatically relaxed states. These latter conformations are defined by Rg and ΔN values that are intermediate between the limiting values for tP5abc. They are consistent with a measured decrease in Rg, and a slight increase in the mid-q region of the Kratky plot found in the raw data. A careful examination of the locations of the colored bases in the models, representing the two sides of a potential tertiary contact (Figure 1A), suggests a lack of tertiary interactions. Thus, the reduction in ΔN relative to the lower salt, extended states is a natural consequence of an electrostatic relaxation or collapse. Finally, states with smaller Rg appear at the largest [KCl] probed. Model structures that recapitulate the data suggest that these states lack tertiary contacts, and the backbone conformation in the ensembles suggests that these molecules are non-specifically collapsed. The native tP5abc structure is never achieved in this range of [KCl], nor is evidence found for tertiary contact formation. These findings are consistent with previous studies of tP5abc in KCl, which show no indication of native structure at any concentration of KCl (57).
Taken together, the landscapes are well described by two classes of states at low to moderate [KCl]: extended and screened/electrostatically relaxed. A new class of conformations, non-specifically collapsed states, appear only at the highest salt concentrations. This classification is reinforced by the application of singular value decomposition (SVD) analysis to the KCl salt series. In this analysis, at least three basis states are required to fit the entire set of experimental data with χ2 ≤ 1 (Supplementary Figure S20). The EOM fitting not only captures this behavior, but provides an interpretation in terms of model structures, in place of SVD-derived basis curves that may not be physically relevant.
Completely different sets of states are required to provide a comparable analysis of conformations in a MgCl2 titration, reflected by the appearance of the heat maps (Figure 4B). In Mg, more distinct patches of states are present, with vastly varying Rg and ΔN values, suggesting a more rugged energy landscape. The states identified at the smallest [Mg] studied are similar to those found at low [KCl]. These states have smaller Rg and ΔN values than the extended states. As [Mg] increases to 0.5 mM conformations with similar Rg values to the electrostatically relaxed states appear to funnel toward a region of the landscape which represents states with surprisingly large ΔN values. Structures in this cluster display distinct, expanded orientations of the P5c and P5a helices about the molecular core. Insight from the model conformations suggests the absence of tertiary contacts. The absence of these states at all measured [KCl] and at lower [Mg] suggests that a critical number of Mg ions is required to stabilize these conformations. The unique arrangement of P5a, and the location of P5c relative to P5b would be consistent with specific Mg2+ interactions with both the A-rich bulge and P5c, regions with known Mg2+ binding sites (41,52,58). Given their characteristic structural features, we name these states 'open intermediates’. The properties of these Mg dependent open intermediates are consistent with the intermediate proposed in (43).
At the highest [Mg], the selected populations shift from open intermediates to clusters of states found in the bottom right corner of the landscape, a region with small Rg and ΔN values (Figure 4B). This shift reflects the trends in the raw data: smaller Rg values are measured, and the peaks become enhanced in Kratky plots. The P5a and P5b helices in the selected model conformations are stacked on the molecular core, with orange (A139–G164) and purple (C137–A186) pairs of tertiary contacts formed. The red colored tertiary contact is not detected in the states on the left side of this cluster. We call these states ‘helix-docked’. In contrast, structures that populate the right side of this cluster, with smaller Rg, do display interactions in the red ‘hinge’ region. We call these the native tP5abc states, as all pairs of tertiary contacts have formed. It is interesting that both of these clusters are absent in lower [Mg]. Even at 1 mM Mg, a large percentage of the population still resides in the open intermediates. Thus, these states appear to be promoted from the open intermediates through additional interactions with Mg. Only 12% of the population reaches the native state in 1 mM Mg (Supplementary Figure S21) with the remainder residing either in the helix-docked, or open intermediate states. This relatively small number is in good agreement with NMR studies that report 8% native state population under similar conditions (16). These trends are once again recapitulated in an SVD analysis, where a minimum of 3 basis states are required to define the Mg series (Supplementary Figure S20).
Time-resolved SAXS (TR-SAXS) experiments
With the landscapes of tP5abc established for static salt titrations, the next step is to carry out a folding experiment to study the kinetic pathway from initially extended conformations to the native state. A microfluidic assembly that addresses the shortcomings of previous mixers for TR-SAXS utilizes mixing technology developed for time-resolved crystallographic experiments (33), delivering high signal-to-noise data with low sample consumption on time scales that span the folding reaction (for details, see Supporting Information). The experimental arrangement is sketched in Figure 5A. Extended tP5abc in a low salt state (20 mM KCl) was rapidly combined with Mg2+ to induce folding and subsequently probed at time-points ranging from 10 ms to 3 s after mixing. These time scales were selected to capture both early phase electrostatic collapse and late stage secondary structural shifts (16,34). The final [Mg2+] after mixing was 1 mM, sufficient to yield the native structure (16).
Results of the TR-SAXS experiments are shown in Figure 5B–D. Changes in the profiles are detected within 10ms after mixing to a final concentration of 1 mM Mg2+. The raw data show a large reduction in Rg, and the emergence of a peak in the Kratky plot as the high angle tail drops significantly. Between 10 and 100 ms, the average size of the RNA (Rg) remains constant within error. However, subtle conformational rearrangements during this period are indicted by gradual, upward shifts in the high q region of the Kratky plot.
A second collapse is detected 300 ms after the initiation of folding. A measurable reduction in Rg, is accompanied by an increased negative slope at the largest q in the Kratky plots. Native-like features in the scattering profiles begin to appear on time scales of 1000–3000 ms. A pronounced peak in the mid-q region of the Kratky plot, as well as a moderately downwards slope at higher q are reminiscent of the profile acquired in static titrations, at 1 mM Mg (Figure 2C), and the Rg’s agree within error.
In summary, the raw SAXS data reveal two distinct collapse phases, on short (10 ms) and medium (300 ms) time-scales. More subtle conformational re-arrangements are observed throughout the remaining time-course.
Time-resolved RNA landscapes
To interpret the TR-SAXS data, we performed the same EOM analysis employed for the static titrations, using the full structural pool for all time points. The fits achieved using this pool are in excellent agreement with the data (Figure 6B and Supplementary Figures S22–S24 contain more details of ensemble parameters, including the mean ensemble size), though the reduced signal-to-noise in the TR experiments results in smaller chi-square values for all conditions. The time evolution of the derived landscapes is shown in Figure 6A. A different heat-map color scheme was used for the time-resolved data to increase the dynamic range so that all seven time points can be simultaneously displayed. An expanded structure set for each cluster and time point is shown in Supplementary Figures S25–S31.
In the low salt starting condition for the TR series (20 mM KCl), the majority of selected EOM structures represent extended states. This landscape closely resembles the corresponding static point. Minor differences result from slight structure factor present in the time-resolved SAXS data at this time point, reflecting slight interparticle interference at the higher [RNA] of these studies.
Within 10ms after initiation of folding, dramatic changes to the landscape are already apparent. Although a cluster is visible on the left side of the landscape, a large fraction of EOM-derived structures populate an ill-defined swath of space covering a wide range of Rg and ΔN values (electric blue). Native contacts are sparse within this ensemble: the reduction of ΔN relative to the t = 0 state signals a non-specific (electrostatic) collapse. Surprisingly, no comparable set of structures is found in the ensembles derived from static SAXS experiments. The electrostatically relaxed, non-specifically collapsed states detected in titrations of KCl or MgCl2 reveal preferred conformations that reflect deep, narrow electrostatic wells (Figure 4A and B). In contrast, few preferred conformations are observed in the ensemble measured 10 ms after mixing. The presence of these transient structures likely explains the distinctive behavior observed in the Kratky plots for the TR series, which is absent from the static data.
From 10 to 30 ms, electrostatically relaxed states funnel to globally compact states with large ΔN. After 30ms, the population coalesces into a single cluster, suggestively co-located in the landscape with the open intermediates observed in the static Mg titrations. These highly stable states feature prominently in the TR landscapes, and persist throughout the remaining time series.
At 100 ms after mixing, the selected structures reveal a dramatic backbone reorganization and another cluster of states appears, which we call the second intermediates. In this cluster, models assume a distinct ‘T’ shape as the P5a helix adopts a more vertical orientation under the molecular core in our representation. Close inspection of structures reveals that a twisting of the backbone between P5a and P5b helices appears to induce this re-arrangement. As a result of this global shift, one pair of helix docking interactions (G164–A139 orange bases) appears primed for association.
The Kratky plots shown in Figure 5D reveal that the subtle expansion reverses at ∼300 ms into folding, when a second compaction is detected. The SAXS profiles that follow this compaction begin to resemble those acquired in 1 mM Mg. The selected structures shift out of the open intermediate cluster, and more heavily weight the second intermediates and new states in the middle of the conformational landscape. Examination of these new models, discussed below, reveals the presence of tertiary contacts, thus we name them ‘specifically collapsed’. States in this region show a bowing of P5c helix towards the molecular core, as the P5b helix becomes more vertical. Such an arrangement places the tandem G-A pairs (orange residues) in close proximity. As a result, this pair likely form first. This re-arrangement simultaneously moves one side of the P5c hinge interaction closer to its partners in P5a (red residues), suggesting cooperative formation of tertiary contacts.
The subsequent transition of this specifically collapsed intermediate to an even more compact state is reflected in the EOM structures selections at 1000 ms. A population of molecules show the P5a and P5b helices docked about the molecular core, with both orange and purple residue pairs formed. Interestingly, the selected models still lack both the P5c secondary structure switch (green pair) and hinge interactions (red pairs) that define the native fold. From its position on the landscape, this state is recognizable as the helix docked state (1 mM Mg static landscape). The occupation of the helix-docked and native conformations at this time point and beyond are evident from the Kratky plots of the raw SAXS curves: a high mid q peak is paired with a downward sloping tail.
From 1000 to 3000 ms, the population shifts to occupy more of the native state, suggesting that the final conformational re-arrangement represents a transition from the helix docked to the native state. This shift may proceed through a concurrent P5c switch (green pairs) and hinge formation (red pairs), as has been previously suggested (43). At the final time-point for these experiments, the native state comprises 15% of the population, consistent with our static measurements and close to the results of NMR studies carried out under similar conditions, as previously discussed (Supplementary Figure S32). The majority of remaining molecules either reside in the helix-docked state or revert to the open intermediate cluster. The distribution of molecules on the landscape for this final time point closely resembles that derived for the static titrations (1D projections are compared in Supplementary Figure S34). While the signal-to-noise in the TR SAXS experiments makes it more challenging to distinguish basis components in SVD analysis, at least three states are required to span the space occupied by the seven time points (Supplementary Figure S33).
Distinct collapse phases reflect interactions with Mg
The application of ensemble methods to analyze SAXS data enriches our understanding of the symbiotic relationship between Mg and RNA: Mg ions sculpt the RNA backbone into structures that precisely position the different sides of tertiary contacts for interaction. Notably, although these intermediates are primed for tertiary contact formation, our models suggest that most lack stable tertiary contacts. Strikingly, many of the structures found in Mg titrations are absent from titrations in KCl.
The analysis of time resolved studies, in conjunction with atomistic models, provides additional evidence highlighting the importance of Mg–RNA interactions at all stages of folding. States found in the electrostatically relaxed ensemble have no analogue in static salt titrations. In the next phase of folding (10–100 ms) this unwieldly collection of states funnels rapidly into an open intermediate, collections of structures which are relatively similar to each other. These states are never found in static titrations with KCl, requiring a certain threshold of Mg for formation. Their prevalence in all conditions exceeding this threshold, (and throughout later TR series) suggests that the Mg ions surround RNA in a diffuse manner. The distinct conformations of helices within these states, coupled with MD computations of Mg ion density around these structures (see Supporting Information, Supplementary Figure S35 and accompanying text), suggests that locally high concentrations of ions within the A-rich bulge and P5c stabilize these intermediates. The emergence of these states early in the folding pathway hints that such interactions are crucial in directing the molecule away from the deep, non-specifically collapsed wells found in the KCl titration, maintaining the RNA in folding competent conformations.
Folding proceeds from this open intermediate through states found at higher [Mg] in static titrations, including the second intermediates, helix-docked and native states. Additional Mg mediated events are therefore required to evolve out of the open intermediates, and induce the specific collapse transition. Atomistic models, selected to recapitulate the SAXS data, suggest that each structural re-arrangement is designed to bring the distant sides of tertiary contacts into closer proximity. Transitions between these distinctly structured intermediates illustrate that large scale conformational re-arrangements occur as a direct result of small scale shifts in the molecular backbone.
The final conformational switch, from the helix-docked to the native state, occurs at long time points (1000–3000 ms). Although we cannot directly visualize Mg ions in the experiment, simulations suggest that this re-arrangement is triggered by ion occupation of a phosphate lined pocket in the helix docked state, as observed in the P5abc subdomain from the P4P6 crystal structure (52). This pocket is created by the orientations of the P5a, b and c helices in this late stage intermediate. Interestingly, the ion occupying this pocket is partially dehydrated (52), thus would require the highest degree of RNA ordering to capture (36). Figure 7 shows a cartoon summary of the time-scales, kinetic phases, key intermediates and (inferred) Mg associations.
CONCLUSION
The combination of TR- and static SAXS, ensemble decomposition and atomistic structures generated by MD simulations has elucidated an intricate folding landscape for an RNA three helix junction. This approach grants a wide description of RNA folding for this ubiquitous structural motif, revealing an array of intermediates, their roles in the folding pathway, and suggests a complex interplay of tertiary contact formation and distinct collapse phases with Mg2+ associations. Given the modularity of RNA folding, the themes we have outlined in this work are likely echoed in the folding of many other RNAs.
DATA AVAILABILITY
The SAXS data along with analysis and metrics is deposited in the SASBDB under accession codes: SASDC74, SASDC94 SASDCA4, SASDCB4, SASDCC4, SASDCD4 and SASDCE4 for the time-resolved data and SASDCF4, SASDCG4, SASDCH4, SASDCJ4, SASDCK4, SASDCL4 and SASDCM4 for the static data.
Supplementary Material
ACKNOWLEDGEMENTS
We gratefully acknowledge useful discussions with Pollack lab members and Robb Welty. We thank Scott Showalter for a critical reading of this manuscript, and Arthur Woll and Richard Gillilan for assistance at the CHESS G1 station.
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
FUNDING
National Institutes of Health [R01-GM085062 to L.P. and R.E., R01-GM059796 to R. E., R35-GM122514 to L.P.]; National Science Foundation through a Science and Technology Center [1231306 to L.P.]; Welch Foundation [F-1896 to R.E.]; AD181 faculty research grant (to S.K.); S.K. acknowledges the NYU Abu Dhabi High Performance Computing Center for computational resources; Cornell High Energy Synchrotron Source (CHESS), National Science Foundation and the National Institutes of Health/National Institute of General Medical Sciences under NSF [DMR-1332208]. Funding for open access charge: National Institutes of Health.
Conflict of interest statement. None declared.
REFERENCES
- 1. Porter E.A., Wang X., Lee H., Weisblum B.. Non-haemolytic beta-amino-acid oligomers. Nature. 2000; 404:565. [DOI] [PubMed] [Google Scholar]
- 2. English E.P., Chumanov R.S., Gellman S.H., Compton T.. Rational development of beta-peptide inhibitors of human cytomegalovirus entry. J. Biol. Chem. 2006; 281:2661–2667. [DOI] [PubMed] [Google Scholar]
- 3. Mobley D.L., Dill K.A.. Binding of small-molecule ligands to proteins: ‘what you see’ is not always ‘what you get’. Structure. 2009; 17:489–498. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Mortier J., Rakers C., Bermudez M., Murgueitio M.S., Riniker S., Wolber G.. The impact of molecular dynamics on drug design: applications for the characterization of ligand-macromolecule complexes. Drug Discov. Today. 2015; 20:686–702. [DOI] [PubMed] [Google Scholar]
- 5. Salahuddin P., Siddiqi M.K., Khan S., Abdelhameed A.S., Khan R.H.. Mechanisms of protein misfolding: Novel therapeutic approaches to protein-misfolding diseases. J. Mol. Struct. 2016; 1123:311–326. [Google Scholar]
- 6. Cohen F.E., Kelly J.W.. Therapeutic approaches to protein-misfolding diseases. Nature. 2003; 426:905–909. [DOI] [PubMed] [Google Scholar]
- 7. Cech T.R., Steitz J.A.. The noncoding RNA revolution - trashing old rules to forge new ones. Cell. 2014; 157:77–94. [DOI] [PubMed] [Google Scholar]
- 8. Morris K.V., Mattick J.S.. The rise of regulatory RNA. Nat. Rev. Genet. 2014; 15:423–437. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Sharp P.A. The centrality of RNA. Cell. 2009; 136:577–580. [DOI] [PubMed] [Google Scholar]
- 10. Bisaria N., Greenfeld M., Limouse C., Pavlichin D.S., Mabuchi H., Herschlag D.. Kinetic and thermodynamic framework for P4-P6 RNA reveals tertiary motif modularity and modulation of the folding preferred pathway. Proc. Natl. Acad. Sci. U.S.A. 2016; 113:E4956–E4965. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Herschlag D., Allred B.E., Gowrishankar S.. From static to dynamic: The need for structural ensembles and a predictive model of RNA folding and function. Curr. Opin. Struct. Biol. 2015; 30:125–133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Hermann T., Westhof E.. RNA as a drug target: chemical, modelling, and evolutionary tools. Curr. Opin. Biotechnol. 1998; 9:66–73. [DOI] [PubMed] [Google Scholar]
- 13. Matsui M., Corey D.R.. Non-coding RNAs as drug targets. Nat. Rev. Drug Discov. 2016; 16:167–179. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Behrouzi R., Roh J.H., Kilburn D., Briber R.M., Woodson S.A.. Cooperative tertiary interaction network guides RNA folding. Cell. 2012; 149:348–357. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Solomatin S. V, Greenfeld M., Chu S., Herschlag D.. Multiple native states reveal persistent ruggedness of an RNA folding landscape. Nature. 2010; 463:681–684. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Xue Y., Gracia B., Herschlag D., Russell R., Al-Hashimi H.M.. Visualizing the formation of an RNA folding intermediate through a fast highly modular secondary structure switch. Nat. Commun. 2016; 7:ncomms11768. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Haller A., Rieder U., Aigner M., Blanchard S.C., Micura R.. Conformational capture of the SAM-II riboswitch. Nat. Chem. Biol. 2011; 7:393–400. [DOI] [PubMed] [Google Scholar]
- 18. Bailor M.H., Sun X., Al-Hashimi H.M.. Topology links RNA secondary structure with global conformation, dynamics, and adaptation. Science. 2010; 327:202–206. [DOI] [PubMed] [Google Scholar]
- 19. Sclavi B., Sullivan M., Chance M.R., Brenowitz M., Woodson S.A.. RNA folding at millisecond intervals by synchroton hydroxyl radical footprinting. Science. 1998; 279:1940–1943. [DOI] [PubMed] [Google Scholar]
- 20. Welty R., Hall K.B.. Nucleobases undergo dynamic rearrangements during RNA tertiary folding. J. Mol. Biol. 2016; 428:4490–4502. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Manz C., Kobitski A.Y., Samanta A., Keller B.G., Jäschke A., Nienhaus G.U.. Single-molecule FRET reveals the energy landscape of the full-length SAM-I riboswitch. Nat. Chem. Biol. 2017; 13:1172–1178. [DOI] [PubMed] [Google Scholar]
- 22. Vušurović N., Altman R.B., Terry D.S., Micura R., Blanchard S.C.. Pseudoknot formation seeds the twister ribozyme cleavage reaction coordinate. J. Am. Chem. Soc. 2017; 139:8186–8193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Chandra V., Hannan Z., Xu H., Mandal M.. Single-molecule analysis reveals multi-state folding of a guanine riboswitch. Nat. Chem. Biol. 2016; 13:194–201. [DOI] [PubMed] [Google Scholar]
- 24. Panja S., Hua B., Zegarra D., Ha T., Woodson S.A.. Metals induce transient folding and activation of the twister ribozyme. Nat. Chem. Biol. 2017; 13:1109–1114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Bisaria N., Greenfeld M., Limouse C., Mabuchi H., Herschlag D.. Quantitative tests of a reconstitution model for RNA folding thermodynamics and kinetics. Proc. Natl. Acad. Sci. U.S.A. 2017; 114:E7688–E7696. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Suddala K.C., Wang J., Hou Q., Walter N.G.. Mg2+ shifts ligand-mediated folding of a riboswitch from induced-fit to conformational selection. J. Am. Chem. Soc. 2015; 137:14075–14083. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Brenowitz M., Pollack L.. Following RNA folding from local and global perspectives. Biophysics of RNA Folding. 2013; NY: Springer; 187–203. [Google Scholar]
- 28. Kwok L.W., Shcherbakova I., Lamb J.S., Park H.Y., Andresen K., Smith H., Brenowitz M., Pollack L.. Concordant exploration of the kinetics of RNA folding from global and local perspectives. J. Mol. Biol. 2006; 355:282–293. [DOI] [PubMed] [Google Scholar]
- 29. Das R., Kwok L.W., Millett I.S., Bai Y., Mills T.T., Jacob J., Maskel G.S., Seifert S., Mochrie S.G.J., Thiyagarajan P. et al. . The fastest global events in RNA folding: electrostatic relaxation and tertiary collapse of the tetrahymena ribozyme. J. Mol. Biol. 2003; 332:311–319. [DOI] [PubMed] [Google Scholar]
- 30. Roh J.H., Guo L., Kilburn J.D., Briber R.M., Irving T., Woodson S.A.. Multistage collapse of a bacterial ribozyme observed by time-resolved small-angle x-ray scattering. J. Am. Chem. Soc. 2010; 132:10148–10154. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Russell R., Millett I.S., Tate M.W., Kwok L.W., Nakatani B., Gruner S.M., Mochrie S.G.J., Pande V., Doniach S., Herschlag D. et al. . Rapid compaction during RNA folding. Proc. Natl. Acad. Sci. U.S.A. 2002; 99:4266–4271. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. de la Peña M., Dufour D., Gallego J.. Three-way RNA junctions with remote tertiary contacts: a recurrent and highly versatile fold. RNA. 2009; 15:1949–1964. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Calvey G.D., Katz A.M., Schaffer C.B., Pollack L.. Mixing injector enables time-resolved crystallography with high hit rate at X-ray free electron lasers. Struct. Dyn. 2016; 3:54301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Pabit S.A., Sutton J.L., Chen H., Pollack L.. Role of ion valence in the submillisecond collapse and folding of a small RNA domain. Biochemistry. 2013; 52:1539–1546. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Auffinger P., Grover N., Westhof E.. Sigel A, Sigel H, Sigel RKO. Metal ion binding to RNA. Structural and Catalytic Roles of Metal Ions in RNA. 2011; 9:Cambridge: Royal Society of Chemistry; 1–35. [Google Scholar]
- 36. Bowman J.C., Lenz T.K., Hud N. V., Williams L.D.. Cations in charge: magnesium ions in RNA folding and catalysis. Curr. Opin. Struct. Biol. 2012; 22:262–272. [DOI] [PubMed] [Google Scholar]
- 37. Chen S.-J. RNA folding: conformational statistics, folding kinetics, and ion electrostatics. Annu. Rev. Biophys. 2008; 37:197–214. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Denesyuk N.A., Thirumalai D.. How do metal ions direct ribozyme folding. Nat. Chem. 2015; 7:793–801. [DOI] [PubMed] [Google Scholar]
- 39. Draper D.E. RNA folding: thermodynamic and molecular descriptions of the roles of ions. Biophys. J. 2008; 95:5489–5495. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Woodson S.A. Compact intermediates in RNA folding. Annu Rev Biophys. 2010; 39:61–77. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Wu M., Tinoco I. Jr. RNA folding causes secondary structure rearrangement. Proc. Natl. Acad. Sci. U.S.A. 1998; 95:11555–11560. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Gracia B., Xue Y., Bisaria N., Herschlag D., Al-Hashimi H.M., Russell R.. RNA structural modules control the rate and pathway of RNA folding and assembly. J. Mol. Biol. 2016; 428:3972–3985. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Koculi E., Cho S.S., Desai R., Thirumalai D., Woodson S.A.. Folding path of P5abc RNA involves direct coupling of secondary and tertiary structures. Nucleic Acids Res. 2012; 40:8011–8020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Schlatterer J.C., Kwok L.W., Lamb J.S., Park H.Y., Andresen K., Brenowitz M., Pollack L.. Hinge stiffness is a barrier to RNA folding. J. Mol. Biol. 2008; 379:859–870. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Deras M.L., Brenowitz M., Ralston C.Y., Chance M.R., Woodson S.A. Folding mechanism of the Tetrahymena ribozyme P4-P6 domain. Biochemistry. 2000; 39:10975–10985. [DOI] [PubMed] [Google Scholar]
- 46. Svergun D., Barberato C., Koch M.H.. CRYSOL - a program to evaluate X-ray solution scattering of biological macromolecules from atomic coordinates. J. Appl. Crystallogr. 1995; 28:768–773. [Google Scholar]
- 47. Tria G., Mertens H.D.T., Kachala M., Svergun D.I.. Advanced ensemble modelling of flexible macromolecules using X-ray solution scattering. IUCrJ. 2015; 2:207–217. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Silverman S.K., Cech T.R.. Energetics and cooperativity of tertiary hydrogen bonds in RNA structure. Biochemistry. 1999; 38:8691–8702. [DOI] [PubMed] [Google Scholar]
- 49. Pollack L. SAXS studies of ion-nucleic acid interactions. Annu. Rev. Biophys. 2011; 40:225–242. [DOI] [PubMed] [Google Scholar]
- 50. Bernadó P., Mylonas E., Petoukhov M.V., Blackledge M., Svergun D.I.. Structural characterization of flexible proteins using small-angle X-ray scattering. J. Am. Chem. Soc. 2007; 129:5656–5664. [DOI] [PubMed] [Google Scholar]
- 51. Tokuda J.M., Pabit S.A., Pollack L.. Protein–DNA and ion–DNA interactions revealed through contrast variation SAXS. Biophys. Rev. 2016; 8:139–149. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Juneau K., Podell E., Harrington D.J., Cech T.R.. Structural basis of the enhanced stability of a mutant ribozyme domain and a detailed view of RNA-solvent interactions. Structure. 2001; 9:221–231. [DOI] [PubMed] [Google Scholar]
- 53. Kirmizialtin S., Johnson K.A., Elber R.. Enzyme selectivity of HIV reverse transcriptase: conformations, ligands, and free energy partition. J. Phys. Chem. B. 2015; 119:11513–11526. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Kirmizialtin S., Nguyen V., Johnson K.A., Elber R.. How conformational dynamics of DNA polymerase select correct substrates: experiments and simulations. STRUCTURE. 2012; 20:618–627. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Maragliano L., Vanden-Eijnden E.. On-the-fly string method for minimum free energy paths calculation. Chem. Phys. Lett. 2007; 446:182–190. [Google Scholar]
- 56. Sun L.-Z., Zhang D., Chen S.-J.. Theory and modeling of RNA structure and interactions with metal ions and small molecules. Annu. Rev. Biophys. 2017; 46:227–246. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Takamoto K., He Q., Morris S., Chance M.R., Brenowitz M.. Monovalent cations mediate formation of native tertiary structure of the Tetrahymena thermophila ribozyme. Nat. Struct. Biol. 2002; 9:928–933. [DOI] [PubMed] [Google Scholar]
- 58. Zheng M., Wu M., Tinoco I.. Formation of a GNRA tetraloop in P5abc can disrupt an interdomain interaction in the Tetrahymena group I ribozyme. Proc. Natl. Acad. Sci. U.S.A. 2001; 98:3695–3700. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Leontis N.B., Westhof E.. Geometric nomenclature and classification of RNA base pairs. RNA. 2001; 7:499–512. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The SAXS data along with analysis and metrics is deposited in the SASBDB under accession codes: SASDC74, SASDC94 SASDCA4, SASDCB4, SASDCC4, SASDCD4 and SASDCE4 for the time-resolved data and SASDCF4, SASDCG4, SASDCH4, SASDCJ4, SASDCK4, SASDCL4 and SASDCM4 for the static data.