RNA molecules have long been known to adopt unusual conformations. The first crystal structure of an RNA was that of yeast tRNAphe (1, 2), which provided a portent of what was to come. With more structures available now from X-ray crystallography and NMR, that promise has been met beyond expectations. The RNAs that are folded in molecular dynamics simulations by Chen and Garcia (3) are tetraloops (RNA hairpin loops of only four nucleotides) that are among the most unusual small structures known today. Although or because they are unusual with atypical thermal stability, these loops are also ubiquitous in longer RNA strands [rUUCG (4); rGNRA and rCUUG (5)]; they were first identified as the most abundant hairpins in prokaryotic rRNA. The function of these loops varies: the GNRA tetraloops (where N is any base and R is a purine) is often found in RNA:RNA tertiary interactions to anchor a global fold. The UUCG tetraloop is solitary, with no known interactions with RNA or proteins: it is thought to nucleate folding and so direct in vivo RNA folding patterns. These little hairpins have been the objects of considerable in vitro experimental structural characterization by NMR and X-ray crystallography, and are found to consistently form a single unique structure. The rUUCG tetraloop in particular has been subjected to many thermodynamic investigations to measure the sources of its unusual stability (6). The rCUUG hairpin is structurally the most flexible, because it tucks its first U into the minor groove but its second U is relatively unconstrained. Among the three tetraloops, CUUG has a propensity for significant conformational exchange (7). Although the three tetraloops used in the Chen and Garcia (3) computational folding experiments are all unusually stable, they achieve their stability by very different means. Their structural diversity makes them excellent candidates for folding studies, because no one mechanism dominates their ability to adopt their unique structures.
The myriad of conformations that an RNA molecule can adopt makes RNA structural predictions very challenging, even now as the biology of RNA molecules is exploding. It is not an exaggeration to say that RNA molecules are involved at every level of gene regulation, so the accuracy with which their structures can be predicted or determined has become a critical problem for biology. The structures of the three tetraloops here could not have been predicted a priori today, given the intricate hydrogen bonding and base-stacking arrangements that they contain. These tetraloops are but a microcosm of what an RNA molecule can do, and because an RNA structure is an intricate part of its function, there are major research efforts devoted to develop predictive algorithms (8).
The static structure of an RNA is only a part of its characterization, though, because its dynamic motions are also linked to its function. RNAs are typically single-stranded, with short double-stranded (duplex) regions interspersed among local unstructured sequences. The short duplexes can lead to the formation of hairpin loops, internal bulges flanked by duplexes, three-way junctions, four-way junctions, and beyond. It is easy to imagine how the nucleotides in a loop could move, because the constraints on their conformation and consequent motions are greatly reduced from the strictures of a double strand (Fig. 1). The seven degrees-of-freedom of a nucleotide (including backbone torsions and glycosidic bond) allow it in theory to adopt many conformations. One of the experimental challenges is to describe the motions of nucleotides in nonduplex regions, because nucleotides in these regions control RNA folding, tertiary structure formation, and interactions with other molecules (proteins, small molecules, and other RNAs).
Unfortunately, experimental methods to measure dynamic motions and their consequences for RNA properties are rather limited by their timescales and probes. NMR methods are the most versatile for measurements of changes in the ribose pucker, stacking interactions between nucleobases and phosphate backbone flexing, but not every RNA is amenable to NMR experiments, and there are too many biologically functional RNAs that need investigation. Computational methods are ideally suited to explore the structure and dynamics of an RNA, given their timescales, which now range from picoseconds to microseconds in all atom molecular dynamics simulations with explicit solvent, and the implementation of clever sampling methods that bypass linear time lines. One such method, replica exchange molecular dynamics, is used by Chen and Garcia (3) for their RNA folding experiments.
Applications of molecular dynamics simulations to protein structure, dynamics, ligand binding, and folding have become standard, although certainly questions remain about force-field stability, water models that give correct protein correlation times, and sampling methods. In contrast, the need for accurate computational methods to describe RNA molecules has become acute, but molecular dynamics simulations of nucleic acids, in particular of RNAs, have lagged behind protein applications. Many of the questions related to accurate depictions of RNAs in silico remain unanswered. Several groups are making substantial contributions to RNA computational methods, as shown by recent reports from Denning et al. (9), using CHARMM36 to address the contributions of the ribose 2′ OH, and Tubbs et al. (10), incorporating revisions to the glycosidic angle χ using AMBER; AMBER applications for nucleic acids are reviewed in a perspective by Cheatham and Case (11).
The report by Chen and Garcia (3) is unique in successfully describing folding of an RNA to its experimentally determined structure. This achievement is exciting not only for what it promises for computational experiments that are sure to follow, but for the underlying reasons for its success.
One of the driving forces for RNA structure and dynamics is the stacking of nucleobases. Base stacking in the most common force fields, AMBER and CHARMM, has been a source of considerable trouble in practical and theoretical terms. AMBER is generally known to overstack nucleobases (3), which limits conformational sampling and leads to duplexes that are too stable. In contrast, CHARMM is known to understack bases (9), leading to disruption of duplexes and instability of noncanonical structures that are known to stack (such as tetraloops). Experimentally, nucleobase stacking is known to be enthalpically driven; it is not hydrophobic, and is generally considered to be described in chemical physics terms by London dispersion forces. As expressed by the Lennard–Jones potential, V = 4ε[(σ/r)12−(σ/r)6], r(σ) is the distance between molecules, ε the well depth, and σ the distance at which the potential is zero (the van der Waals radius). The first term is the repulsive term, and the second term the attractive term. This potential is heuristic, and the variables must be determined by measurement or fitting. In the RNA folding experiment by Chen and Garcia (3), one of the keys to their success was the optimization of ε and σ for base stacking in the AMBER force field.
As folding targets, Chen and Garcia (3) chose three RNA tetraloops, including the ubiquitous UUCG and GCAA loops, and the CUUG loop. By choosing to look at folding rather than describe the time-dependent motions of the nucleotides, the authors set themselves a more difficult task. Biological RNA folding is notorious for its off-pathway excursions to alternative structures; there is often no single low-energy well at the bottom of a steep folding funnel. Rather, the funnel is rugged, with nooks and crannies that trap the molecule, and the bottom of the funnel is shallow and broad (12–14). Although Chen and Garcia facilitated their task by the selection of three unusually stable hairpin loops, even these RNAs displayed complications. The authors found that the UUCG tetraloop adopted its unusual loop structure via several paths that involved intermediates, in agreement with spectroscopic experiments from the Gruebele laboratory (15). Chen and Garcia’s simulations also predict multiple folding pathways for the GCAA loop, again in agreement with several different in vitro experiments. In vitro folding experiments can identify kinetic intermediates, but are typically unable to assign a specific conformation to them. In silico folding experiments have the potential to provide that information, which can be subsequently used to design modified molecules to test that pathway. The Chen and Garcia experiments using replica exchange molecular dynamics have no kinetic data, so correspondence between their structural intermediates and those inferred from experiment will need to be tested. Their third chosen tetraloop, CUUG, remained recalcitrant; the experimentally determined folded structure evaded capture. When the structure of the CUUG tetraloop was solved by NMR, the investigators noted that its bases underwent considerable internal motion, especially the second U, so perhaps its performance in the simulations is not too far removed from its solution reality. By any criteria, the success of these computational folding experiments brings the field one step closer to accurate simulations of RNAs.
What RNA questions can be addressed now, with this improved AMBER force field? The DNA field has been obsessed with the timescales and geometries of bases that flip out of DNA duplexes. The analogous scenario for RNA is the dynamics of unpaired bases in asymmetric internal bulges. These are difficult to measure experimentally, but are important sites for binding to ligands; TAR:tat is a famous example (16). Beyond repeating computational experiments where RNA conformations are important for RNA function, fundamental questions relating RNA structure/dynamics are everywhere: how does a three-way junction become a site for small-molecule binding (riboswitches) or how does it fold to form a catalytic pocket (hammerhead ribozyme)? How are unpaired nucleotides accommodated in miRNA duplexes and bound to Argonaut proteins? Nature provides many more examples that await examination. RNAs will continue to be challenging molecules to describe experimentally, both in vitro and in silico. The prize when the puzzle is solved is a window into the world of RNA.
Footnotes
The author declares no conflict of interest.
See companion article on page 16820.
References
- 1.Robertus JD, et al. Structure of yeast phenylalanine tRNA at 3 A resolution. Nature. 1974;250(467):546–551. doi: 10.1038/250546a0. [DOI] [PubMed] [Google Scholar]
- 2.Kim SH, et al. Three-dimensional tertiary structure of yeast phenylalanine transfer RNA. Science. 1974;185(4149):435–440. doi: 10.1126/science.185.4149.435. [DOI] [PubMed] [Google Scholar]
- 3.Chen AA, Garcia AE. High-resolution reversible folding of hyperstable RNA tetraloops using molecular dynamics simulations. Proc Natl Acad Sci USA. 2013;110:16820–16825. doi: 10.1073/pnas.1309392110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Tuerk C, et al. CUUCGG hairpins: Extraordinarily stable RNA secondary structures associated with various biochemical processes. Proc Natl Acad Sci USA. 1988;85(5):1364–1368. doi: 10.1073/pnas.85.5.1364. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Woese CR, Winker S, Gutell RR. Architecture of ribosomal RNA: Constraints on the sequence of “tetra-loops”. Proc Natl Acad Sci USA. 1990;87(21):8467–8471. doi: 10.1073/pnas.87.21.8467. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Antao VP, Lai SY, Tinoco I., Jr A thermodynamic study of unusually stable RNA and DNA hairpins. Nucleic Acids Res. 1991;19(21):5901–5905. doi: 10.1093/nar/19.21.5901. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Jucker FM, Pardi A. Solution structure of the CUUG hairpin loop: A novel RNA tetraloop motif. Biochemistry. 1995;34(44):14416–14427. doi: 10.1021/bi00044a019. [DOI] [PubMed] [Google Scholar]
- 8.Cruz JA, et al. RNA-Puzzles: A CASP-like evaluation of RNA three-dimensional structure prediction. RNA. 2012;18(4):610–625. doi: 10.1261/rna.031054.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Denning EJ, Priyakumar UD, Nilsson L, Mackerell AD., Jr Impact of 2′-hydroxyl sampling on the conformational properties of RNA: Update of the CHARMM all-atom additive force field for RNA. J Comput Chem. 2011;32(9):1929–1943. doi: 10.1002/jcc.21777. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Tubbs JD, et al. The nuclear magnetic resonance of CCCC RNA reveals a right-handed helix, and revised parameters for AMBER force field torsions improve structural predictions from molecular dynamics. Biochemistry. 2013;52(6):996–1010. doi: 10.1021/bi3010347. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Cheatham TE, 3rd, Case DA. Twenty-five years of nucleic acid simulations. Biopolymers. 2013;99(12):969–977. doi: 10.1002/bip.22331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Gluick TC, Gerstner RB, Draper DE. Effects of Mg2+, K+, and H+ on an equilibrium between alternative conformations of an RNA pseudoknot. J Mol Biol. 1997;270(3):451–463. doi: 10.1006/jmbi.1997.1119. [DOI] [PubMed] [Google Scholar]
- 13.Laederach A, Shcherbakova I, Liang MP, Brenowitz M, Altman RB. Local kinetic measures of macromolecular structure reveal partitioning among multiple parallel pathways from the earliest steps in the folding of a large RNA molecule. J Mol Biol. 2006;358(4):1179–1190. doi: 10.1016/j.jmb.2006.02.075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Treiber DK, Rook MS, Zarrinkar PP, Williamson JR. Kinetic intermediates trapped by native interactions in RNA folding. Science. 1998;279(5358):1943–1946. doi: 10.1126/science.279.5358.1943. [DOI] [PubMed] [Google Scholar]
- 15.Ma H, et al. Exploring the energy landscape of a small RNA hairpin. J Am Chem Soc. 2006;128(5):1523–1530. doi: 10.1021/ja0553856. [DOI] [PubMed] [Google Scholar]
- 16.Puglisi JD, Tan R, Calnan BJ, Frankel AD, Williamson JR. Conformation of the TAR RNA-arginine complex by NMR spectroscopy. Science. 1992;257(5066):76–80. doi: 10.1126/science.1621097. [DOI] [PubMed] [Google Scholar]