Chemical probing coupled to high throughput sequencing offers a high throughput and flexible approach to uncover many aspects of RNA structure relevant to its cellular function and interactions. Chemical probes work by covalently modifying RNAs preferentially in regions that are unconstrained – i.e. those that are flexible, unstructured, unpaired and unbound by other RNA/protein/ligand interactions. These covalent modifications can then be mapped by sequencing to obtain structural information across a complex pool of RNAs all at once. A wide range of inputs can be used in these experiments, ranging from in vitro purified RNAs to RNAs from whole cells and tissues. Following the probing reaction, RNAs are reverse transcribed into cDNAs, formatted into high throughput sequencing libraries, and sequenced to detect modifications either as truncated cDNA products (RT-stops) or modification-induced mutations (RT-mutations) (Strobel et al., 2018). The distribution of modifications across a molecule is then used to calculate a ‘reactivity’ value for each nucleotide in the RNA, with higher reactivities typically corresponding to more unconstrained positions (Strobel et al., 2018).
Chemical probing reactivities can be used to uncover many layers of RNA structure. Since the probing reaction is sensitive to the general structural environment of each nucleotide (McGinnis et al., 2012), reactivities reflect base-pairing interactions, complex tertiary interactions and other environmental effects such as ion-mediated interactions, proteins/RNA/ligand interactions, and the change of RNA flexibility with different temperatures. Moreover, different probes access different structural information (Strobel et al., 2018): some modify the backbone of the RNA, while others preferentially modify specific bases (Ehresmann et al.). The choice of probe is an important consideration in the design of a chemical probing experiment (Strobel et al., 2018). For example, probes with short-half lives are advantageous for experiments that require rapid sample handling to capture out of equilibrium structures that may be present when RNAs are folded contranscriptionally (Watters et al., 2016), while probes with long half-lives are better suited to probing RNA structures inside cells (Spitale et al., 2015).
Chemical probing can be used to address many biological questions about RNA function, and give insights into the structural mechanisms behind these functions. For example, a protein-RNA binding interaction can be characterized by performing parallel probing experiments on purified RNA alone and RNA folded in the presence of the RNA-binding protein (RBP). By looking for changes in reactivities between the two conditions, RBP binding sites can be uncovered as well as other structural changes that result from the RBP-RNA interaction (Smola et al., 2015). Similar comparisons can reveal where ligands bind RNAs (Stoddard et al., 2008), how RNA folds change in the complex cellular environment (Spitale et al., 2015), how RNA folds change during transcription (Watters et al., 2016), and many others.
Chemical probing data can also be leveraged alongside computational methods to yield higher-resolution RNA structural models. Single-structure methods can use generated or existing databases of reactivity information (Yesselman et al. 2017) to increase the accuracy of 2- or 3-dimensional structural models of the RNA (Lorenz et al., 2016). However, RNAs often fold into an ensemble of different structures in solution which are captured at a population level by the bulk nature of chemical probing experiments. Multistate methods have been recently developed to extract this population-level information from probing data to predict the ensemble of distinct folds of an RNA molecule, as well as their relative distributions within the population (Li and Aviran, 2018). Beyond analysis of individual RNAs, comparative methods have been developed to ask questions about the conservation of structural elements between different sequences, and genome-wide tools are useful for linking reactivity patterns to genomic elements (Spitale et al., 2015, Mustoe et al., 2018, Strobel et al., 2018).
High throughput chemical probing offers a powerful and growing suite of experiments to uncover the RNA structure-function relationship. However, there are several limitations to keep in mind when designing these experiments. In some cases, it can be difficult to unambiguously assign structural changes from reactivity changes, as many different changes in structural context can lead to the same observed change in reactivity. Second, low amounts of input RNA can result in low signal and inaccurate reactivity estimates; in these cases, increasing the amount of starting material and/or increasing sequencing depth is necessary. Third, different steps in probing experiments can introduce bias into the data (Strobel et al., 2018). Protocols are being continuously developed to remove these biases, but careful processing of samples is important for producing high-quality libraries for sequencing. Other best practices include general considerations of sequencing depth. Genome-wide probing (Mustoe et al., 2018), cotranscriptional probing (Watters et al., 2016), and studies of long RNAs require increased sequencing depth compared to single-length experiments to produce robust reactivity estimates. Additionally, the use of multiple probes to study to same RNA is often advantageous to leverage the complementary information offered by different probes.
Chemical probing of RNA structure continues to increase in power and resolution. There are also exciting opportunities to continue to merge these techniques with other high-throughput methods such as CLIP, ribosome profiling and others to study the structural basis of RNA-protein interactions, the impact of RNA structure on translation and many other features of the RNA structure-function relationship. Overall, we anticipate these techniques will help uncover new RNA functional roles and the structural aspects of their mechanisms across the cell.
Acknowledgement:
This work was supported by an Arnold and Mabel Beckman Foundation Postdoctoral Fellowship (to E.J.S.), the Tri-Institutional Training Program in Computational Biology and Medicine (via a National Institutes of Health training grant T32GM083937 to A.M.Y.), the National Institute of General Medical Sciences of the National Institutes of Health (grant numbers 1DP2GM110838 and R01GM120582 to J.B.L.) and Searle Funds at the Chicago Community Trust (to J.B.L.). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
References:
- Ehresmann C, Baudin F, Mougel M, Romby P, Ebel J-P, and Ehresman B (1987) Probing the structure of RNAs in solution. Nucleic Acids Research, 15, 9109–9128. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H, and Aviran S (2018). Statistical modeling of RNA structure profiling experiments enables parsimonious reconstruction of structure landscapes. Nature Communications 9, 606. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lorenz R, Wolfinger MT, Tanzer A, and Hofacker IL (2016). Predicting RNA secondary structures from sequence and probing data. Methods 103, 86–98. [DOI] [PubMed] [Google Scholar]
- McGinnis JL, Dunkle JA, Cate JHD, and Weeks KM (2012). The Mechanisms of RNA SHAPE Chemistry. J. Am. Chem. Soc 134, 6617–6624. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mustoe AM, Busan S, Rice GM, Hajdin CE, Peterson BK, Ruda VM, Kubica N, Nutiu R, Baryza JL, and Weeks KM (2018). Pervasive Regulatory Functions of mRNA Structure Revealed by High-Resolution SHAPE Probing. Cell 173, 181–195.e18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smola MJ, Calabrese JM, and Weeks KM (2015). Detection of RNA–Protein Interactions in Living Cells with SHAPE. Biochemistry 54, 6867–6875. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Spitale RC, Flynn RA, Zhang QC, Crisalli P, Lee B, Jung J-W, Kuchelmeister HY, Batista PJ, Torre EA, Kool ET, et al. (2015). Structural imprints in vivo decode RNA regulatory mechanisms. Nature 2017 548:7665 519, 486–490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stoddard CD, Gilbert SD, and Batey RT (2008). Ligand-dependent folding of the three-way junction in the purine riboswitch. RNA 14,675–684. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Strobel EJ, Yu AM, and Lucks JB (2018). High-throughput determination of RNA structures. Nature Reviews Genetics doi: 10.1038/s41576-018-0034-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Watters KE, Strobel EJ, Yu AM, Lis JT, and Lucks JB (2016). Cotranscriptional folding of a riboswitch at nucleotide resolution. Nature Structural & Molecular Biology 23, 1124–1131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yesselman JD, Tian S, Liu X, Shi L, acids JLN, (2017) Updates to the RNA mapping database (RMDB), version 2. Nucleic Acids Research 46, D375–D379. [DOI] [PMC free article] [PubMed] [Google Scholar]