Abstract
While there is increasing interest in the study of RNA as a therapeutic target, efforts to understand RNA-ligand recognition at the molecular level lag far behind our understanding of protein–ligand recognition. This problem is complicated due to the more than 10 orders of magnitude in time scales involved in RNA dynamics and ligand binding events, making it not straightforward to design experiments or simulations. Here, we make use of artificial intelligence (AI)-augmented molecular dynamics simulations to directly observe ligand dissociation for cognate and synthetic ligands from a riboswitch system. The site-specific flexibility profiles from our simulations are compared with in vitro measurements of flexibility using selective 2′ hydroxyl acylation analyzed by primer extension and mutational profiling (SHAPE-MaP). Our simulations reproduce known relative binding affinity profiles for the cognate and synthetic ligands, and pinpoint how both ligands make use of different aspects of riboswitch flexibility. On the basis of our dissociation trajectories, we also make and validate predictions of pairs of mutations for both the ligand systems that would show differing binding affinities. These mutations are distal to the binding site and could not have been predicted solely on the basis of structure. The methodology demonstrated here shows how molecular dynamics simulations with all-atom force-fields have now come of age in making predictions that complement existing experimental techniques and illuminate aspects of systems otherwise not trivial to understand.
Short abstract
We use artificial intelligence-augmented molecular dynamics to study a riboswitch aptamer in complex with ligands. We obtain ligand dissociation and predict mutations that are then verified.
1. Introduction
RNA regulates diverse cellular processes, far beyond coding for protein sequence or facilitating protein biogenesis.1 In this role, RNA can interact with proteins2 or small molecule factors3,4 to directly or indirectly impact gene expression and cellular homeostasis.5 For example, riboswitches are sequences found primarily in bacterial mRNAs that contain aptamers for small molecules and regulate transcription or translation.6 In humans, mutations in noncoding RNAs can directly impact neurodegenerative diseases and cancer.7,8 The regulatory role of bacterial riboswitches, coupled with the broader link between RNA and human disease, has also led to interest in RNA as a therapeutic target for small molecules.9−11 However, in comparison to proteins, methods to computationally and experimentally interrogate RNA structure are relatively less mature.12,13 Thus, more robust tools are needed to better understand RNA structure, dynamics, and recognition of small molecules. In spite of its clear relevance to fundamental science and drug discovery efforts, the highly dynamic nature of RNA makes it difficult to study due to the diverse conformational ensembles it adopts.14−16 Luckily, riboswitches represent a valuable model for RNA-small molecule recognition. When ligands bind to riboswitch aptamers, the RNA undergoes a conformational change that modulates gene expression.17,18 The cognate ligand for a riboswitch is typically associated with its upstream gene, often part of the ligand’s biosynthetic pathway.18,19 This feedback mechanism, paired with well-established structural analyses, makes riboswitches excellent models for RNA-ligand binding since the target ligand is known and often highly specific.18
Although we can use riboswitches to model RNA-small molecule interactions, visualizing riboswitch-ligand dissociation pathways with high spatiotemporal resolution remains a challenge for in vitro and in silico experiments. Such a femtosecond and all-atom resolution is hard to directly achieve in in vitro experiments, while the associated time scales are several orders of magnitude too slow for molecular dynamics (MD) simulations performed even on the most powerful supercomputers. Previously, ref (20) used Go̅-model simulations of PreQ1 riboswitches, specifically transcriptional Bacillus subtilis (Bs) and translational Thermoanaerobacter tengcongensis (Tte) aptamers. Their work suggests that in respective cases the ligand binds at late and early stages of riboswitch folding, indicating that perhaps these riboswitches respectively fold via mechanisms of conformational selection and induced fit. Since ref (20), RNA force-fields have become more detailed and accurate, but the time scales involved in RNA-small molecular dissociation continue to stay beyond reach.21
In order to better understand the ligand-riboswitch dissociation process, in this work we employ a combination of (i) enhanced sampling methods combining statistical physics with artificial intelligence (AI)22−25 and (ii) experimental techniques to measure RNA flexibility and ligand binding.26−28 The computational sampling techniques allow us to study the dissociation process in an accelerated but controlled manner, while the selective 2′ hydroxyl acylation analyzed by primer extension and mutational profiling (SHAPE-MaP), microscale thermophoresis (MST), and fluorescence intensity assay (FIA) provide a rigorous validation of the computational findings. Specifically, we study the Tt-PreQ1 riboswitch aptamer interacting with its cognate ligand PreQ1 (7-aminomethyl-7-deazaguanine) and a synthetic ligand (2-[(dibenzo[b,d]furan-2-yl)oxy]-N,N-dimethylethan-1-amine).29 See Figure 1 for an illustration of the riboswitch and chemical structures of the two ligands. First, we perform 2 μs long unbiased MD simulations of the ligand-bound and ligand-free systems to quantify the flexibility of individual nucleotides. Comparing the flexibility profiles from unbiased MD with SHAPE-MaP reveals some broad overall agreement; however, a more detailed quantitative analysis reveals the limitations of both approaches. To simulate the dissociation process, which is too slow for MD simulations,21 a key challenge in most enhanced sampling methods is the need for an a priori estimate of the dominant dissociation mechanisms as captured through the reaction coordinate (RC). However, it is very challenging to estimate such an RC without already having simulated the dissociation process, especially for flexible biomolecules like RNA.30 Here, we use our AI-based sampling method RAVE to directly observe the dissociation of both of these ligands from the wild-type (WT) Tte-PreQ1 riboswitch. RAVE automates the learning of RCs through a scheme that iterates between MD to generate data and deep learning to construct an approximate RC from this data. The MD-deep learning iteration continues until we see no further improvement in accelerating the dissociation events as measured through typical MD time to observe dissociation. The RC itself is learned as the most informative low-dimensional representation expressed as a past-future information bottleneck.31
Figure 1.
Conformational change of the PreQ1 riboswitch and chemical structures of PreQ1 and synthetic dibenzofuran ligands.
By efficiently sampling multiple independent riboswitch-ligand dissociation events for both cognate and synthetic ligands, we are able to reproduce relative binding affinity measurements from MST and FIA and give a qualitatively reasonable fit to flexibility measurements from SHAPE-MaP experiments, though the limitations are also strikingly obvious. Furthermore, we are able to pinpoint which nucleotides play the most critical roles in dissociation. These are found to be the nucleotides U22 and A32 for the cognate and synthetic ligands, respectively. On the basis of these, we design and perform a set of in vitro mutagenesis experiments, which validate our mutation predictions. We note that these predicted mutations are unlikely to have been found simply from a structural perspective, as they are distal from the binding site and do not directly interact with either of the ligands. Our findings thus reveal the complex nature of the interaction between RNA and small molecules, and might help guide the design of RNA-binding small molecules in other systems in the future. Thus, this methodology will be broadly useful in understanding increasingly complex, biologically important RNA-small molecule interactions.
2. Results
A. Ligand Binding Induces Flexibility Change in Riboswitch Aptamer
Our first test is to compare the nucleotide-specific flexibility from all-atom simulations to those from SHAPE-MaP measurements. We perform 2 μs unbiased MD simulations (4 independent sets of 500 ns each for both ligands) each for PreQ1 riboswitch bound with the cognate and synthetic ligand, respectively. See Supporting Information for details of the simulation setup. While these simulations are not long enough to observe ligand dissociation, they can still give a robust measure of how the flexibility varies between the two different ligand-bound systems and provides nucleotide-level insight into how specific ligand binding induces different conformation changes in the riboswitch. Specifically, we compare the fluctuations of the distances between consecutive C2 atoms obtained from these simulations with SHAPE-MaP measurements (see Supporting Information for details of SHAPE-MaP). We performed SHAPE-MaP analyses using a PreQ1 riboswitch construct consisting of the entire riboswitch (both the aptamer domain and expression platform). Probing was performed using a recently reported SHAPE reagent, 2A3, that provides improved mutagenesis rates during reverse transcription.32 We analyzed data in the absence of any ligand, and in the presence of either cognate ligand (PreQ1) or synthetic ligand. Our results were in good agreement with previous studies reporting SHAPE-MaP on PreQ1 riboswitches using different SHAPE reagents and a slightly shorter construct.33 As discussed previously,34 base pairing interactions are the most important determinant of RNA structural constraints, and thus base–base distance fluctuations are a crucial metric of RNA backbone flexibility. Indeed, as shown in Figure 2, we obtain good agreement between C2–C2 distances from unbiased MD and experimentally measured SHAPE flexibilities for all three systems: ligand-free PreQ1 aptamer, cognate-ligand-bound PreQ1 aptamer, and synthetic-ligand-bound aptamer. We would like to note that though unbiased MD simulations capture most of the fluctuations measured by SHAPE, some of the peaks shown by SHAPE are not observed in MD, which might be due to the time-scale limitations of unbiased simulations or the kinetic regime measured by SHAPE reactivity. In addition to the agreement between MD and SHAPE, Figure 2 also shows that, for all three systems, nucleotides in the ranges A13–C15, U21–A24, and A32–G33 are more flexible than other regions of the riboswitch.33 The mixed agreement between MD and SHAPE approaches gives some partial amount of confidence into how accurately classical force-fields35 accurately model the behavior of complex RNAs and highlights the need for continued method development from both the computational and experimental sides that can facilitate more direct comparisons of nucleotide-dependent flexibility. The lack of unequivocal quantitative agreement could be due to many reasons. First, while SHAPE is a measurement of flexibility, it is strictly speaking a surrogate measurement relating to the kinetics of acylation of each nucleotide. Although flexibility of each nucleotide is a main factor in determining the probability of a nucleotide being acylated, there are other factors that can determine how the SHAPE profile looks like. Second, our unbiased simulations are not long enough to reduce the statistical error. Finally, as pointed out by the reviewer, we only simulated the aptamer domain of the PreQ1 riboswitch, while in the experiment the SHAPE reactivity of the whole PreQ1 riboswitch was measured. The presence of other domains may also contribute to flexibility changes in the system.
Figure 2.
SHAPE reactivities for the Tte-PreQ1 riboswitch aptamer obtained using the 2A3 reagent (red circles joined by solid lines) are compared with the fluctuations of C2–C2 distances (blue triangles joined by dashed lines) for the PreQ1 riboswitch (a) in the absence of ligand, (b) bound to cognate ligand, and (c) bound to synthetic ligand. Pearson correlation coefficients R are shown in upper left corners.
B. Reaction Coordinates and Approximate Free Energy Profile
Encouraged by the agreement between flexibility profiles from unbiased MD and SHAPE, we next aim to calculate the free energy profiles and reaction coordinates for ligand dissociation from PreQ1. As these time scales are far beyond unbiased MD, here we use our recent AI-augmented MD method RAVE which learns the dominant slow degrees of freedom on-the-fly by iterating between rounds of sampling and AI. See Supporting Information for further details of the protocol. After four rounds of such iterations, we obtain converged RCs for both systems, expressed as a linear combination of different riboswitch-ligand heavy atom contacts. Figure 3 shows different contacts and their weights as obtained from AMINO23 and RAVE simulations. It is interesting to note that the RC for the cognate ligand has many more components than the RC for synthetic ligand, highlighting differences between the molecules. Most of these contacts in the RC are present also in the crystal structure (PDB 6E1W, 6E1U), except carbon 5-U2, carbon 5-A23 for the cognate ligand, and carbon 1-A23 for the synthetic ligand (ligand atom numbering corresponds to numbering in the PDB files). We would like to highlight here that AMINO provides a basis set for RAVE to build the reaction coordinate as a function thereof, and the precise output of AMINO could differ depending on which precise trajectory was used with it. However, as shown previously in ref (22), we do not expect the final result from RAVE to significantly vary with the precise basis set fed into RAVE.
Figure 3.
Weights for different order parameters (OPs) learned by RAVE in the corresponding RC for riboswitch aptamer with (a) cognate ligand and (b) synthetic ligand. The OPs are named in the format “ligand atom-residue name”. Ligand structures and atom names are shown in the insets.
We then used this optimized RC and ran 10 independent biased MD simulations each using well-tempered metadynamics25 to sample the free energy profiles of the systems. The probability distribution was estimated by using the reweighting technique introduced in ref (36). In Figure 4, we provide the free energy profiles as functions of the ligand-riboswitch coordination number for both systems. Note that for both systems there were two predominant pathways as indicated through arrows in Figure 5. A key difference between the two paths is that in the first path the ligand unbinds through the space in between stem 2 and the backbone connecting stem 1 and loop 1 (Figure 5, blue arrows). In the second path, the ligand unbinds through the space in between loop 2 and stem 2 (Figure 5, red arrows). In order to compare the binding free energies of the two systems, we combine the information from both of the pathways by plotting the free energy as a function of the ligand-riboswitch shared coordination number C. During the dissociation process, the coordination number gradually decreases to 0 irrespective of which path is adopted during dissociation. We define the coordination number C as
![]() |
1 |
where N and M are the number of ligand and riboswitch atoms, respectively. rij is the distance between ligand atom i and riboswitch atom j and r0 is set to be 3 Å. It can be seen from Figure 4 that when bound to the riboswitch the cognate ligand has a higher C relative to when the synthetic ligand is bound to it, illustrating that the synthetic ligand makes overall fewer contacts per ligand atom with the riboswitch, as also observed in crystal structures (PDB 6E1W and 6E1U). Though we were not able to simulate the rebinding of ligands to get a converged free energy profile, the probability distributions used to construct these free energies are obtained by averaging over multiple independent dissociation events. This leads to the relatively small error bars shown in Figure 4, which indicates that we can use the free energy profile to give a rough estimation of the strength of the binding affinity. By considering C ≲ 5.6 as the unbound states, we can calculate that the binding affinity difference between bound and unbound state is 2–4 kcal/mol higher for the cognate ligand than that for the synthetic ligand. We however caution the reader that this is an approximate free energy profile in the absence of strict rebinding events during metadynamics. In principle, specialized methods37,38 can be used to address this limitation and will be the subject of careful investigation in future work. This is in quantitative agreement with relative affinity measurements (3.0 ± 1.4 kcal/mol) through fluorescence titrations as reported in ref (29).
Figure 4.
Free energy profiles of riboswitch with different ligands; coordination number 0 (see eq 1) indicates ligand fully dissociated from riboswitch. The error bars are shown by the highlighted colors. The approximate range of coordination numbers corresponding to the unbound pose is highlighted in light green color (C ≲ 5.6).
Figure 5.
Dissociation pathways for (a–b) cognate ligand and (c–d) synthetic ligand. Stem 1 (S1), stem 2 (S2), loop 1 (L1), loop2 (L2), and loop (L3) are colored green, cyan, magenta, gray, and orange, respectively. During the dissociation processes, the ligands at different time frames are colored from white to dark blue and white to red, for the top pathway and bottom pathway, respectively. The dissociation pathways are indicated with arrows. The predicted mutation sites U22 and A32 are highlighted in red and blue, respectively.
C. Predicting Critical Nucleotides for Mutagenesis Experiments
We now analyze our 10 independent dissociation trajectories each obtained for the synthetic ligand bound and cognate ligand bound systems to predict critical nucleotides in the riboswitch that play a role in the dissociation process. We are interested in not just nucleotides we could have predicted from looking at the static structures of ligand-bound complexes but also in nucleotides distal from the binding site. Those nucleotides might play an essential role in determining the structural ensemble of a riboswitch. Thus, changing the properties of those nucleotides can alter the interaction between ligand and other nucleotides. Such critical nucleotides are hard to predict from a purely structural perspective, but our all-atom resolution dissociation trajectories are ideally poised to make such predictions. Predictions made in this section are then validated through mutation experiments reported in section 2D.
From our collection of dissociation trajectories, we observe that dissociation mainly happens through two pathways (Figure 5) and that for the cognate ligand, 7 out of 10 simulations went through the top/blue pathway, while for the synthetic ligand, 6 out of 10 simulations went through the bottom/red pathway. These observations indicate how two ligands interact with the riboswitch differently before fully dissociating from the pocket. To more rigorously quantify the role played by different nucleotides, we monitor the change in the RMSD of every nucleotide in the enhanced MD simulation. In principle, a comprehensive study of ligand–receptor interactions should also include simulations of the rebinding events. As previously stated, that will require the implementation of specialized methods and thus is not explored in this study. Nonetheless, dissociation trajectories should be able to give a good sampling for bound state ensembles, allowing us to make predictions about the effects of mutations on binding affinity. Our motivation here is that the nucleotides showing greatest relative movement during the dissociation process are the ones most likely to be impacted once mutated. As shown in Supplementary Figure 2, every nucleotide’s RMSD changes with the coordination number C, which captures the extent of dissociation. These changes indicate different conformations the riboswitch adopts during ligand dissociation. Through this procedure, we identify U22 and A32, highlighted in Figure 5, to be two such critical nucleotides for the cognate and synthetic ligands respectively (see Supporting Information for further details of this protocol). Figure 6 shows the changes in the RMSDs of these two selected nucleotides with different coordination numbers calculated from the enhanced MD trajectories. For U22, in RAVE simulations the RMSD increases much more for the cognate ligand-bound system than for the synthetic ligand-bound system. However, the reverse is found to be true for the nucleotide A32, which displays more enhanced movement during the dissociation of the synthetic ligand bound complex relative to the cognate ligand bound complex.
Figure 6.
Critical nucleotides prediction. Change in RMSD for nucleotides U22 and A32 during the dissociation process, relative to RMSD variation in the apo state. RMSD profiles as a function of coordination number are shown by blue dashed line and orange solid line for systems with cognate ligand and synthetic ligand separately. The change in RMSDs is calculated by subtracting the mean RMSDs of the corresponding nucleotide in the apo system from the RMSDs measured in the biased simulations.
On the basis of the above observations, we can predict that for the cognate ligand, mutating U22 should display more tangible effects on the strength of the complex than mutating A32. For the synthetic ligand, we predict an opposite trend—mutating A32 should have a more pronounced effect on the interaction relative to mutating U22.
D. Validating Predicted Critical Nucleotides through Mutagenesis Experiments
In section 2C, we predicted on the behalf of our dissociation trajectories that mutating nucleotides U22 and A32 will have differing effects on the cognate ligand-bound and synthetic ligand-bound systems. Also, we would like to note that, as we can see from Figure 6, the RSMD changes appear in the range of coordination numbers that correspond to the bound state. These changes imply the role of these nucleotides in determining the bound state interactions and suggest that the effects of mutations can be reflected by changes in binding affinity. Here we report in vitro tests of our predictions. As shown in Figure 7, we measure the equilibrium dissociation constant KD for six different complexes, namely, the two wild-type complexes, U22A-cognate ligand, U22A-synthetic ligand, A32U-cognate ligand, and A32U-synthetic ligand. We also performed in silico mutagenesis experiments to validate that the mutations change the bound state ensemble of both systems.
Figure 7.
Affinity measurement of PreQ1 (cognate) and dibenzofuran (synthetic) ligands binding to WT Tte riboswitch and U22A and A32U mutants. MST of 5′-cy5-labeled (a) WT Tte, (b) U22A, and (c) A32U mutants in the presence of PreQ1 ligand. FIA of 5′-cy5-labeled (d) WT Tte, (e) U22A, and (f) A32U mutants in the presence of dibenzofuran. Error bars represent the standard deviation of three replicate experiments.
To experimentally test whether the A32U and U22A mutations impacted KD values, we used microscale thermophoresis (MST). KD values were measured using the fluorescently labeled wild type, A32U, and U22A PreQ1 riboswitch aptamer constructs for the cognate ligand. For PreQ1, we observed a KD of 11.7 ± 0.35 nM for the WT construct. In contrast, the A32U mutant had a KD of 22.5 ± 0.42 nM, while the U22A mutant had a KD of 49.4 ± 1.5 nM. For the synthetic ligand, changes were observed in fluorescence of the labeled RNA upon incubation with the ligand, and thus a fluorescence intensity assay (FIA) was used to measure KD values rather than MST. Using FIA, we observed a KD of 1.6 ± 0.7 μM for the wild type aptamer. In contrast, both the U22A and A32U mutant constructs displayed significantly weaker KD values (150 ± 1.3 μM, and ≥100 μM respectively). These KD measurements directly validate the findings from section 2C. There we had predicted that mutating U22 will effect the cognate ligand-bound system more than mutating A32, and indeed we find that KD for U22A is five times weaker than WT, while only two times weaker for A32U. At the same time, we had predicted that mutating A32 will effect the synthetic ligand-bound system more than mutating U22, and indeed we find that, while the KD for U22A is 75 times weaker than WT, there is effectively no tangible association at all for A32U with the synthetic ligand.
The in silico experiments also support that the mutations alter the bound state ensemble and give more insights into the interaction change in the bound state after the mutations. As shown in Supplementary Figure 3, the free energy profiles of the bound state ensemble of the mutated system differ from that of the wild type. Further analyses of the hydrogen bond interactions between ligand and riboswitch show that even though nucleotides 22 and 32 do not directly form hydrogen bonds with both ligands in the bound pose as seen in the crystal structure, interactions between ligands and other nucleotides are still altered when they are mutated and can be seen in simulating the bound pose ensemble of states. See the Supporting Information for detailed discussions of these simulations.
3. Discussion
Over recent years, a consensus view is emerging that a superior understanding of biomolecular structural dynamics, and not just static structures, can be fruitful in predicting key emergent biochemical and biophysical properties.5,39,40 However, in spite of the importance of the problem, tools to study biomolecular structural dynamics are still limited in their scope. This is true for proteins but even more so for RNAs, where it is especially challenging to determine the ensemble of possible structures that the RNA can adopt and pinpoint the specific importance of different structures.39 In this work, we demonstrate that this is indeed feasible—and how one can obtain detailed and robust insights into RNA structural dynamics through AI-augmented molecular simulations that maintain all-atom resolution for RNA, ions, and all water molecules. All our predictions are validated through a gamut of complementary and experimental techniques that measure RNA flexibility and ligand binding. We specifically focused on the Tte-PreQ1 riboswitch aptamer system in its apo state and bound to two different ligands, where we first showed that 2 μs long unbiased MD simulations with classical all-atom force-fields for RNA and water can provide the same flexibility profile as measured in SHAPE experiments. Since the time scales for ligand dissociation are far slower than MD can typically access, we used the RAVE simulation method22 that enhances sampling along a low-dimensional reaction coordinate learned on-the-fly through the past-future information bottleneck framework. Through RAVE, we obtain multiple independent dissociation trajectories for both cognate and synthetic liganded systems, demonstrating how the cognate and synthetic ligands invoke different aspects of the riboswitch’s flexibility in order to dissociate. On this basis, we were able to predict pairs of mutations which would be expected to show contrasting behaviors for the cognate and synthetic ligands. The ability to make such predictions accurately and efficiently is of vital importance, as mutations that can disrupt RNA structural dynamics and ligand recognition processes have been linked to numerous human diseases.39,41 Our predictions for the Tte-PreQ1 aptamer were validated by designing mutant aptamers and performing affinity measurements.
Our work thus demonstrates a pathway to gain predictive insights, as opposed to retrospective validation, from enhanced molecular dynamics RAVE simulations that maintain all-atom resolution while directly observing processes as slow as ligand dissociation. While the focus in this work was on predicting the thermodynamically averaged aspects of riboswitch flexibility, in future work it should also be possible to directly calculate rate constants using complementary approaches and for more complicated RNA systems. More broadly, this work indicates that long time scale MD simulations can accurately model ligand dissociation from RNA and provide validatable hypotheses about structural features that impact small molecule recognition of RNA. Thus, this strategy will be widely applicable not only to study natural systems such as riboswitches but to more completely understand the structural features and dynamics that govern how synthetic small molecules interact with RNA.
Acknowledgments
The authors thank NIH HPC Biowulf cluster, Deepthought2, MARCC, and XSEDE (projects CHE180007P and CHE180027P) for providing computational resources used in this work. Y.W. would like to thank the NCI-UMD Partnership for Integrative Cancer Research for financial support. This work was supported by the intramural program of the National Institutes of Health, National Cancer Institute, Center for Cancer Research (1 ZIA BC011585 07) (J.S.S. and S.P.). Research reported in this publication was supported by the National Institute of General Medical Sciences of the National Institutes of Health under Award Number R35GM142719 (P.T. and Y.W.). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. The authors thank Philip Homan for help analyzing SHAPE-MaP data and Alex Wilson for help setting up the simulations. We also thank Dr. Sergey G. Tarasov and Marzena Dyba from the CCR Center for Structural Biology Biophysics Resource for help with MST experiments.
Supporting Information Available
These include details of: The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acscentsci.2c00149.
In vitro transcription of Tte-PreQ1 RNA; acylation of RNA in vitro; primer extension and mutational profiling (MaP) reverse transcription of RNA; SHAPE-MaP library preparation, sequencing, and data analysis; microscale thermophoresis; fluorescence intensity assay; MD simulation setup; flexibility measurement from MD simulations and comparison with SHAPE data; AI-augmented enhanced sampling method RAVE; network architecture and hyperparameter setting for enhanced sampling simulation; predicting critical nucleotides for mutagenesis experiments; in silico mutagenesis experiments (PDF)
Transparent Peer Review report available (PDF)
Author Contributions
# Y.W. and S.P. contributed equally.
The authors declare no competing financial interest.
Notes
The SHAPE-MaP sequencing reads were deposited into the NLM/NCBI Sequence Read Archive (SRA) under the BioProject ID PRJNA767082. All PLUMED input files required to perform the enhanced MD reported in this paper are available on PLUMED-NEST (www.plumed-nest.org), the public repository of the PLUMED consortium,42 as plumID:22.013. The GROMACS input files and raw data to reproduce the simulation results in this paper can be found at go.umd.edu/wang_ACS_central_science_2022.
Supplementary Material
References
- A focus on regulatory RNAs. Nat. Cell Biol. 2019, 21. 535. 10.1038/s41556-019-0325-2 [DOI] [PubMed] [Google Scholar]
- Sanchez de Groot N.; Armaos A.; Grana-Montes R.; Alriquet M.; Calloni G.; Vabulas R. M.; Tartaglia G. G. RNA structure drives interaction with proteins. Nat. Commun. 2019, 10, 1–13. 10.1038/s41467-019-10923-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Meyer S. M.; Williams C. C.; Akahori Y.; Tanaka T.; Aikawa H.; Tong Y.; Childs-Disney J. L.; Disney M. D. Small molecule recognition of disease-relevant RNA structures. Chem. Soc. Rev. 2020, 49, 7167–7199. 10.1039/D0CS00560F. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Falese J. P.; Donlic A.; Hargrove A. E. Targeting RNA with small molecules: from fundamental principles towards the clinic. Chem. Soc. Rev. 2021, 50, 2224–2243. 10.1039/D0CS01261K. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mustoe A. M.; Brooks C. L.; Al-Hashimi H. M. Hierarchy of RNA Functional Dynamics. Annu. Rev. Biochem. 2014, 83, 441–466. 10.1146/annurev-biochem-060713-035524. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tucker B. J.; Breaker R. R. Riboswitches as versatile gene control elements. Curr. Opin. Struct. Biol. 2005, 15, 342–348. 10.1016/j.sbi.2005.05.003. [DOI] [PubMed] [Google Scholar]
- Lekka E.; Hall J. Noncoding RNAs in disease. FEBS letters 2018, 592, 2884–2900. 10.1002/1873-3468.13182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Esteller M. Non-coding RNAs in human disease. Nat. Rev. Genet. 2011, 12, 861–874. 10.1038/nrg3074. [DOI] [PubMed] [Google Scholar]
- Blount K. F.; Breaker R. R. Riboswitches as antibacterial drug targets. Nature biotechnology 2006, 24, 1558–1564. 10.1038/nbt1268. [DOI] [PubMed] [Google Scholar]
- Connelly C. M.; Moon M. H.; Schneekloth J. S. Jr The emerging role of RNA as a therapeutic target for small molecules. Cell chemical biology 2016, 23, 1077–1090. 10.1016/j.chembiol.2016.05.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Umuhire Juru A.; Hargrove A. E. Frameworks for targeting RNA with small molecules. J. Biol. Chem. 2021, 296, 100191. 10.1074/jbc.REV120.015203. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sponer J.; Bussi G.; Krepl M.; Banáš P.; Bottaro S.; Cunha R. A.; Gil-Ley A.; Pinamonti G.; Poblete S.; Jurečka P.; et al. RNA structural dynamics as captured by molecular simulations: a comprehensive overview. Chem. Rev. 2018, 118, 4177–4338. 10.1021/acs.chemrev.7b00427. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bottaro S.; Lindorff-Larsen K. Biophysical experiments and biomolecular simulations: A perfect match?. Science 2018, 361, 355–360. 10.1126/science.aat4010. [DOI] [PubMed] [Google Scholar]
- Walter N. G. The blessing and curse of RNA dynamics: past, present, and future. Methods 2009, 49, 85–86. 10.1016/j.ymeth.2009.09.002. [DOI] [PubMed] [Google Scholar]
- Weeks K. M. Advances in RNA structure analysis by chemical probing. Curr. Opin. Struct. Biol. 2010, 20, 295–304. 10.1016/j.sbi.2010.04.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Al-Hashimi H. M.; Walter N. G. RNA dynamics: it is about time. Curr. Opin. Struct. Biol. 2008, 18, 321–329. 10.1016/j.sbi.2008.04.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Serganov A.; Nudler E. A decade of riboswitches. Cell 2013, 152, 17–24. 10.1016/j.cell.2012.12.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Connelly C. M.; Numata T.; Boer R. E.; Moon M. H.; Sinniah R. S.; Barchi J. J.; Ferré-D’Amaré A. R.; Schneekloth J. S. Synthetic ligands for PreQ1 riboswitches provide structural and mechanistic insights into targeting RNA tertiary structure. Nat. Commun. 2019, 10, 1501. 10.1038/s41467-019-09493-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roth A.; Breaker R. R. The structural and functional diversity of metabolite-binding riboswitches. Annual review of biochemistry 2009, 78, 305–334. 10.1146/annurev.biochem.78.070507.135656. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Suddala K. C.; Rinaldi A. J.; Feng J.; Mustoe A. M.; Eichhorn C. D.; Liberman J. A.; Wedekind J. E.; Al-Hashimi H. M.; Brooks C. L. III; Walter N. G. Single transcriptional and translational preQ1 riboswitches adopt similar pre-folded ensembles that follow distinct folding pathways into the same ligand-bound structure. Nucleic acids research 2013, 41, 10462–10475. 10.1093/nar/gkt798. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schroeder G. M.; Dutta D.; Cavender C. E.; Jenkins J.; Pritchett E. M.; Baker C. D.; Ashton J. M.; Mathews D. H.; Wedekind J. E. Analysis of a preQ1-I riboswitch in effector-free and bound states reveals a metabolite-programmed nucleobase-stacking spine that controls gene regulation. Nucleic Acids Res. 2020, 48, 8146–8164. 10.1093/nar/gkaa546. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang Y.; Ribeiro J. M. L.; Tiwary P. Past-future information bottleneck for sampling molecular reaction coordinate simultaneously with thermodynamics and kinetics. Nat. Commun. 2019, 10, 3573. 10.1038/s41467-019-11405-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ravindra P.; Smith Z.; Tiwary P. Automatic mutual information noise omission (AMINO): generating order parameters for molecular systems. Mol. Syst. Des. Eng. 2020, 5, 339–348. 10.1039/C9ME00115H. [DOI] [Google Scholar]
- Barducci A.; Bussi G.; Parrinello M. Well-Tempered Metadynamics: A Smoothly Converging and Tunable Free-Energy Method. Phys. Rev. Lett. 2008, 100, 020603. 10.1103/PhysRevLett.100.020603. [DOI] [PubMed] [Google Scholar]
- Valsson O.; Tiwary P.; Parrinello M. Enhancing Important Fluctuations: Rare Events and Metadynamics from a Conceptual Viewpoint. Annu. Rev. Phys. Chem. 2016, 67, 159–184. 10.1146/annurev-physchem-040215-112229. [DOI] [PubMed] [Google Scholar]
- Siegfried N. A.; Busan S.; Rice G. M.; Nelson J. A. E.; Weeks K. M. RNA motif discovery by SHAPE and mutational profiling (SHAPE-MaP). Nat. Methods 2014, 11, 959–965. 10.1038/nmeth.3029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smola M. J.; Rice G. M.; Busan S.; Siegfried N. A.; Weeks K. M. Selective 2’-hydroxyl acylation analyzed by primer extension and mutational profiling (SHAPE-MaP) for direct, versatile and accurate RNA structure analysis. Nat. Protoc. 2015, 10, 1643–1669. 10.1038/nprot.2015.103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moon M. H.; Hilimire T. A.; Sanders A. M.; Schneekloth J. S. Jr Measuring RNA-ligand interactions with microscale thermophoresis. Biochemistry 2018, 57, 4638–4643. 10.1021/acs.biochem.7b01141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Connelly C. M.; Numata T.; Boer R. E.; Moon M. H.; Sinniah R. S.; Barchi J. J.; Ferré-D’Amaré A. R.; Schneekloth J. S. Synthetic ligands for PreQ1 riboswitches provide structural and mechanistic insights into targeting RNA tertiary structure. Nat. Commun. 2019, 10, 1501. 10.1038/s41467-019-09493-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mlỳnskỳ V.; Bussi G. Exploring RNA structure and dynamics through enhanced sampling simulations. Curr. Opin. Struct. Biol. 2018, 49, 63–71. 10.1016/j.sbi.2018.01.004. [DOI] [PubMed] [Google Scholar]
- Wang D.; Tiwary P. State predictive information bottleneck. J. Chem. Phys. 2021, 154, 134111. 10.1063/5.0038198. [DOI] [PubMed] [Google Scholar]
- Marinus T.; Fessler A. B.; Ogle C. A.; Incarnato D. A novel SHAPE reagent enables the analysis of RNA structure in living cells with unprecedented accuracy. Nucleic acids research 2021, 49, e34–e34. 10.1093/nar/gkaa1255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schroeder G. M.; Dutta D.; Cavender C. E.; Jenkins J. L.; Pritchett E. M.; Baker C. D.; Ashton J. M.; Mathews D. H.; Wedekind J. E. Analysis of a preQ1-I riboswitch in effector-free and bound states reveals a metabolite-programmed nucleobase-stacking spine that controls gene regulation. Nucleic acids research 2020, 48, 8146–8164. 10.1093/nar/gkaa546. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pinamonti G.; Bottaro S.; Micheletti C.; Bussi G. Elastic network models for RNA: a comparative assessment with molecular dynamics and SHAPE experiments. Nucleic acids research 2015, 43, 7260–7269. 10.1093/nar/gkv708. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tan D.; Piana S.; Dirks R. M.; Shaw D. E. RNA force field with accuracy comparable to state-of-the-art protein force fields. Proc. Natl. Acad. Sci. U. S. A. 2018, 115, E1346–E1355. 10.1073/pnas.1713027115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tiwary P.; Parrinello M. A time-independent free energy estimator for metadynamics. J. Phys. Chem. B 2015, 119, 736–742. 10.1021/jp504920s. [DOI] [PubMed] [Google Scholar]
- Capelli R.; Carloni P.; Parrinello M. Exhaustive search of ligand binding pathways via volume-based metadynamics. journal of physical chemistry letters 2019, 10, 3495–3499. 10.1021/acs.jpclett.9b01183. [DOI] [PubMed] [Google Scholar]
- Limongelli V.; Bonomi M.; Parrinello M. Funnel metadynamics as accurate binding free-energy method 2013, 110, 6358–6363. 10.1073/pnas.1303186110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ganser L. R.; Kelly M. L.; Herschlag D.; Al-Hashimi H. M. The roles of structural dynamics in the cellular functions of RNAs. Nat. Rev. Mol. Cell Biol. 2019, 20, 474–489. 10.1038/s41580-019-0136-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gruebele M. Protein Dynamics in Simulation and Experiment. J. Am. Chem. Soc. 2014, 136, 16695. 10.1021/ja510614s. [DOI] [PubMed] [Google Scholar]
- Halvorsen M.; Martin J. S.; Broadaway S.; Laederach A. Disease-associated mutations that alter the RNA structural ensemble. PLoS genetics 2010, 6, e1001074. 10.1371/journal.pgen.1001074. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bonomi M.; Bussi G.; Camilloni C.; Tribello G. A.; Banáš P.; Barducci A.; Bernetti M.; Bolhuis P. G.; Bottaro S.; Branduardi D.; et al. Promoting transparency and reproducibility in enhanced molecular simulations. Nat. Methods 2019, 16, 670–673. 10.1038/s41592-019-0506-8. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.