Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2011 Oct 26.
Published in final edited form as: Biochemistry. 2010 Oct 26;49(42):9058–9062. doi: 10.1021/bi101164s

Thermodynamic Characterization of RNA Triloops

Praneetha Thulasi 1, Lopa K Pandya 1, Brent M Znosko 1,*
PMCID: PMC2966716  NIHMSID: NIHMS238676  PMID: 20843054

Abstract

Relatively few thermodynamic parameters are available for RNA triloops. Therefore, 24 stem-loop sequences containing naturally occurring triloops were optically melted, and the thermodynamic parameters ΔH°, ΔS°, ΔG°37, and TM for each stem-loop were determined. These new experimental values, on average, are 0.5 kcal/mol different from the values predicted for these triloops using the model proposed by Mathews et al. [Mathews, D. H., Disney, M. D., Childs, J. L., Schroeder, S. J., Zuker, M., and Turner, D. H. (2004) Proc. Natl. Acad. Sci. USA 101, 7287–7292]. The data for the 24 triloops reported here were then combined with the data for five triloops that were published previously. A new model was derived to predict the free energy contribution of previously unmeasured triloops. The average absolute difference between the measured values and the values predicted using this proposed model is 0.3 kcal/mol. These new experimental data and updated predictive model allow for more accurate calculations of the free energy of RNA stem-loops containing triloops, and, furthermore, should allow for improved prediction of secondary structure from sequence.


RNA stem-loops containing three nucleotides in the loop, triloops, are common secondary structure motifs found in naturally occurring RNA. For example, bacterial 16S rRNAs strongly favor tetraloops; however, the UUU triloop is the most common replacement (1). In the 16S-like rRNA variable regions, triloops account for 7% of the loops in bacteria and 16% of the loops in eukaryotes (2). Triloops are also found in large subunit rRNAs (3,4), 5S rRNAs (5), signal recognition particles (6), RNase P RNAs (7), and group I introns (8,9). More specifically, triloops are found in Brome mosaic virus (+) strand RNA (10), human rhinovirus isotype 14 (11), iron responsive element RNA (12), and an RNA aptamer for bacteriophage MS2 coat protein (13), to name a few. Although relatively unstable due to the strain in the loop, triloops may be an important structural feature due to the accessibility of the loop nucleotides for recognition by proteins, other nucleic acids, or small molecules. It has been shown that triloops play a role in various biological processes, including virus replication (11,14), viral synthesis (15), and iron response (12), to name a few. Nevertheless, only a few studies have thermodynamically characterized RNA triloops (1620), and only three of these (1618) were done using 1 M NaCl (the salt concentration in which most nearest neighbor parameters are derived).

The current model used by secondary structure prediction algorithms to predict the thermodynamic contribution of RNA triloops to stem-loop stability is sequence independent; all triloops contribute 5.4 kcal/mol to stem-loop stability, with the exception of 5’CCC3’ which contributes 6.9 kcal/mol (21). In addition, there are two unstable triloops sequences (5’CAACG3’ and 5’GUUAC3’) for which this predictive model is not used; instead, the ΔG°37,loop values (6.8 and 6.9 kcal/mol, respectively) for these two triloops are provided in a lookup table (21).An interesting study by the Bevilacqua lab (19) used a combinatorial approach and temperature gradient gel electrophoresis to identify stable and unstable RNA triloops. It was discovered that sequence preferences for exceptionally stable triloops included a U-rich loop and C-G as the closing base pair. Although they used 10 mM NaCl during their melting experiments, they suggested that the rules for predicting triloop stability at 1 M NaCl should be modified; however, this has yet to be done. Here, we report the thermodynamic parameters for 24 previously unmeasured RNA triloops in 1 M NaCl and propose a new algorithm for predicting the contribution of triloops to stem-loop stability, which includes two bonuses for stabilizing sequence features.

MATERIALS AND METHODS

Compiling and Searching a Database for RNA Triloops

The initial aim of this project was to identify the most frequently occurring RNA triloops in nature and to thermodynamically characterize these hairpin triloop sequences. Therefore, a database of 1349 RNA secondary structures containing 123 small subunit rRNAs (22), 223 large subunit rRNAs (3,4), 309 5S rRNAs (5), 484 tRNAs (23), 91 signal recognition particles (6), 16 RNase P RNAs (7), 100 group I introns (8,9), and 3 group II introns (24) was compiled. This database was searched for triloops, and the number of occurrences for each type of triloop was tabulated. In this work, G-U pairs are considered to be canonical base pairs.

Design of Sequences for Optical Melting Studies

Since most thermodynamic parameters for RNA secondary structure motifs are reported for RNA solutions containing 1 M NaCl, the melting buffer used in this work also contained 1 M NaCl. A major limitation of a thermodynamic analysis of RNA hairpins using this high salt concentration is the possible bimolecular association of RNA strands (19,25). To ensure that unimolecular triloop formation out-competed bimolecular association in a 1 M NaCl solution, the following equations, derived from the equilibrium equations and ΔG° = −RT ln K, were utilized:

[H]=1+1+((8*KD*[A]T)/(KH*KH))(4*KD)/(KH*KH) (1)
[D]=([A]T[H])/2 (2)
%H=[H]/([H]+[D])*100 (3)

Here, [H] is the concentration of hairpin, KD is the equilibrium constant for duplex formation, [A]T is the total strand concentration, KH is the equilibrium constant for hairpin formation, [D] is the concentration of duplex, and %H is the percent hairpin in solution. KH and KD values were calculated at 37 °C using ΔG°37 values from RNAstructure (21, 26, 27) for hairpin and duplex formation, respectively. Calculations were done for [A]T = 1 µM and 0.1 mM, which are the typical concentration range for the melting experiments. Due to potential competition from duplex formation, many of the most frequently occurring triloops were not studied here; only those that were likely to form triloops were used. All of the sequences studied here had %[H] > 84% at [A]T = 0.1 mM and %[H] > 99% at [A]T = 1 µM.

Sequences of triloops and closing base pairs were designed to represent those found in the database described above. Each stem contained three Watson-Crick pairs in addition to the closing base pair. The terminal base pair was always a G-C pair in order to prevent end fraying of the sequence during melting. Care was taken to design the stem-loop sequences so that the triloop of interest would form, with little competition from other secondary structure motifs.

RNA Synthesis and Purification

Oligonucleotides were ordered from Integrated DNA Technologies (Coralville, IA). The purification of the oligonucleotides followed standard procedures that were described previously (28).

Optical Melting Experiments and Thermodynamics

Optical melting experiments were performed in 1 M NaCl, 20 mM sodium cacodylate, and 0.5 mM Na2EDTA (pH 7.0). All stem-loops were melted about nine times with approximately a 50-fold concentration range. Each stem-loop melting curve resulted in a single transition, and all melts of a given stem-loop were concentration independent, suggesting stem-loop formation. Stem-loop thermodynamics were determined by averaging the thermodynamics derived from each individual curve fit. The thermodynamic contributions of triloops to stem-loop thermodynamics (ΔG°37,triloop, ΔH°triloop, and ΔS°triloop) were determined by subtracting the Watson-Crick contribution (29) from the measured thermodynamics. Stem sequences in which the terminal pair of the stem (or the triloop closing base pair) is A-U, U-A, G-U, or U-G utilize the 0.45 kcal/mol terminal A-U penalty (29) when calculating the contribution of the Watson-Crick stem; thus, this 0.45 kcal/mol contribution is not included in the parameters for predicting the loop contribution to stem-loop stability.

Linear Regression and Triloop Thermodynamic Parameters

Data collected for the 24 triloops in this study were combined with previously published data for five triloops (1618) that were also melted in 1 M NaCl. Data for seven triloops measured in these previous studies (1618) were omitted from the analysis and discussion here because, using Eq 13, they resulted in %H < 75%. Additional thermodynamic data are available for triloops in 10 mM sodium phosphate (17,19,20) and 0.1 M NaCl (16), but due to different salt concentrations, were not included in the analysis described here.

A predictive model was derived by linear regression as described previously (28). The calculated experimental contribution of the triloop to stem-loop stability was used as a constant when doing linear regression. Many combinations of variables were tested. To simultaneously solve for each variable, the LINEST function of Microsoft Excel was used for linear regression.

RESULTS

Database Searching

A database of RNA secondary structures was searched for triloops. In this database, 822 triloops were found, averaging about one triloop for every two secondary structures. Table 1 shows a summary of the database results obtained. Table S1 shows the complete results. The first set of data in Table 1 lists frequency and percent occurrence when the triloop nucleotides and the closing base pair are specified. Because the stability of hairpin loops depends on both the identity of the nucleotides in the loop and the closing base pair (1618,3038), this categorization is most important. Categorizing triloops in this fashion results in 177 types of triloops in the database. The 10 triloop types listed in the first data set (Table 1) account for 45% of the total number of triloops found. The 167 types of triloops not shown account for the remaining 55%; however, each type represents <1.8% of the total number of triloops found.

Table 1.

Summary of Database Search Results for RNA Triloopsa

Dataset 1: Triloop with
Adjacent Base Pair
Dataset 2: Triloop Dataset 3: Adjacent Base Pairs Dataset 4: Triloop
Nucleotides Classified as
Purine (R) or Pyrimidine (Y)




Triloopa Freqb %c Triloopa Freqb %c Closing bp Freqb %c Triloopb Freqb %c




GGGGC 106 12.9 GGG 108 13.1 C-G 345 42.0 RRR 207 25.2
CGCAG 53 6.4 UAA 96 11.7 G-C 219 26.6 RYR 181 22.0
CUAAG 45 5.5 GCA 66 8.0 U-A 163 19.8 YRR 123 15.0
UUAAA 41 5.0 UUU 53 6.4 A-U 56 6.8 YYY 120 14.6
CAUAG 34 4.1 AUA 48 5.8 U-G 26 3.2 YYR 53 6.4
UGAAA 24 2.9 AAA 32 3.9 G-U 13 1.6 YRY 47 5.7
CUUUG 24 2.9 GAA 32 3.9 RYY 45 5.5
CUCUG 17 2.1 GUA 32 3.9 RRY 41 5.0
CAGAG 15 1.8 UCU 24 2.9
GGUAC 15 1.8 AGA 22 2.7
a

Not all combinations in Datasets 1 and 2 are shown due to space limitations. For each set of sequences, the strand is writen 5' to 3'.

b

Frequency of occurrence in the secondary structure database described in Materials and Methods.

c

Percent out of 822 triloops, the total number of triloops found in the secondary structure database.

The second set of data (Table 1) lists frequency and percent occurrence when only the triloop sequence is specified (the closing base pair is not considered). Categorizing triloops in this fashion results in 62 types of triloops in the database; however, there are 64 sequence possibilities. Therefore, two sequence possibilities, 5’AGG3’ and 5’CUG3’ are not found in the database. The 10 triloops listed in the second data set (Table 1) account for 62% of the total number of triloops found. The 52 types of triloops not shown account for the remaining 38%, with each triloop representing <2.5% of the total number of triloops found. If the occurrence of triloops were completely random, we would expect each triloop sequence to occur ~13 times in the database.

The third set of data (Table 1) lists frequency and percent occurrence of the closing base pair. Categorizing triloops in this fashion results in six types of closing base pairs in the database, representing all possible types. If the occurrence of triloops were completely random, we would expect each closing base pair to occur 137 times in the database.

The fourth set of data (Table 1) lists frequency and percent occurrence of the triloop nucleotides when A and G are categorized as purines (R) and C and U are categorized as pyrimidines (Y). Categorizing triloops in this fashion results in eight types of triloops, representing all possible combinations. If the occurrence of triloops were completely random, we would expect each type to occur ~100 times in the database.

Thermodynamic Parameters

As discussed earlier, due to possible bimolecular association of strands, many of the most frequently occurring triloops were not studied. All of the triloops studied here were found in the secondary structure database described above and were most likely in the hairpin conformation during the optical melting studies (as determined by using Eqs 13). Table 2 shows the thermodynamic parameters of hairpin formation that were obtained from the average of fitting each melting curve to the two-state model. Data for 29 stem-loops containing unique triloops are shown in order of decreasing frequency as determined by the database search described above. Interestingly, all five of the triloops studied previously are of the sequence 5’ANA3’.

Table 2.

Thermodynamic Parameters for Stem-Loop Formation and Contributions of Triloops to Stem-Loop Stabilitya

Frequencyb Sequencec ΔH°
(kcal/mol)
ΔS°
(eu)
ΔG°37
(kcal/mol)
TMd
(°C)
ΔH°,triloope
(kcal/mol)
ΔG°37,triloope
(kcal/mol)
Predicted
ΔG°37,triloopf
(kcal/mol)
34 GGCAUAGCCg −21.9 ± 1.9 −67.5 ± 6.0 −0.94 ± 0.15 51.0 6.4 5.74 5.2
15 GGCCAGAGGCC −39.8 ± 2.1 −113.9 ± 6.2 −4.46 ± 0.22 76.2 1.9 5.48 5.2
14 GCCUUUUAGGC −36.2 ± 1.8 −105.6 ± 5.4 −3.45 ± 0.15 69.7 2.6 4.86 4.7
12 GGCUAAAAGCC −29.9 ± 2.5 −88.5 ± 7.8 −2.47 ± 0.18 64.9 8.9 5.84 5.9
10 GGGAUACAAAGUAUCCAh −56.2 −160.9 −6.3 76.4 3.7 5.38 5.2
10 GGCCUUCGGCC −36.1 ± 2.8 −102.1 ± 7.9 −4.38 ± 0.36 79.9 5.6 5.56 5.2
9 GCCGUUUCGGC −43.4 ± 3.5 −125.7 ± 10.7 −4.40 ± 0.17 72.0 −4.5 4.64 4.7
8 GGCGACACGCC −29.1 ± 3.1 −85.4 ± 9.7 −2.66 ± 0.22 68.2 9.8 6.38 5.9
8 GGCUCAAAGCC −27.9 ± 2.7 −82.2 ± 8.5 −2.40 ± 0.12 66.1 10.9 5.91 5.9
7 GGCAUAUUGCC −33.6 ± 1.4 −99.1 ± 4.5 −2.85 ± 0.06 65.7 5.1 5.49 5.9
7 GGCUUAUAGCC −31.7 ± 1.6 −92.9 ± 4.7 −2.85 ± 0.14 67.7 7.1 5.46 5.9
7 GGCCUCCGGCC −39.4 ± 2.5 −110.6 ± 7.3 −4.77 ± 0.19 80.1 2.3 5.17 5.2
6 GGCGAGACGCC −37.0 ± 2.5 −109.3 ± 8.0 −3.11 ± 0.25 65.4 1.9 5.93 5.9
6 GGGAUACCCg −27.6 ± 1.7 −88.3 ± 5.1 −0.26 ± 0.22 38.9 −0.8 6.26 5.9
6 GGCGCUUCGCC −26.7 ± 2.8 −78.7 ± 8.8 −2.28 ± 0.12 66.0 12.2 6.76 5.9
5 GGCUACAAGCC −30.7 ± 1.9 −90.7 ± 6.0 −2.61 ± 0.15 65.7 8.1 5.70 5.9
5 GCCCGAAGGGC −43.4 ± 1.9 −122.7 ± 5.8 −5.35 ± 0.07 80.6 −1.7 4.59 5.2
5 GCCCUAUGGGC −42.6 ± 1.9 −120.4 ± 6.8 −5.24 ± 0.17 80.6 −0.9 4.70 5.2
4 GGCAAAAUGCC −30.0 ± 2.2 −89.2 ± 6.6 −2.36 ± 0.17 63.5 8.7 5.98 5.9
4 GGCGAAACGCC −38.0 ± 2.3 −110.9 ± 7.3 −3.63 ± 0.13 69.8 0.9 5.41 5.9
4 GGCCACAGGCC −38.2 ± 1.7 −109.8 ± 5.0 −4.16 ± 0.17 74.9 3.5 5.78 5.2
4 GGCGCAACGCC −33.0 ± 1.9 −95.4 ± 5.7 −3.42 ± 0.15 72.9 5.9 5.62 5.9
4 GGCACAUUGCC −31.6 ± 2.4 −93.8 ± 7.4 −2.47 ± 0.16 63.3 7.1 5.87 5.9
4 GGCCCUUGGCC −51.8 ± 1.6 −149.3 ± 4.8 −5.47 ± 0.22 73.6 −10.1 4.47 5.2
2 GGCUAACGGCC −34.5 ± 1.4 −101.8 ± 4.4 −2.91 ± 0.08 65.6 5.9 5.40 5.9
2 GGCGACCCGCC −35.3 ± 1.2 −103.4 ± 3.6 −3.23 ± 0.19 68.3 3.6 5.81 5.9
1 GGCUACCAGCC −30.9 ± 2.3 −91.1 ± 7.3 −2.63 ± 0.10 65.9 7.9 5.68 5.9
0 GGGAAAUCCi −18.4 ± 2.9 −60.4 ± 9.2 0.3 ± 0.2 31.8 3.3 6.37 5.9
0 GGAGAAAUUCCi −31.2 ± 3.6 −97.6 ± 11.9 −0.9 ± 0.4 46.1 −2.2 6.34 5.9
a

Measurements were made in 1.0 M NaCl, 10 mM sodium cacodylate, and 0.5 mM Na2EDTA at pH 7.0.

b

Frequency of occurrence in the database described in Materials and Methods.

c

The sequences are written 5' to 3'. The nucleotides in the triloop are underlined for easy identification.

d

The average Tm difference between the most and least concentrated sample for each stem-loop was 2.7 °C. If the sequences were forming a duplex, nearest neighbor thermodynamic values (29) predict that the Tm should change by about 16 °C for the typical concentration range used here, suggesting the formation of stem-loop structures.

e

The enthalpy and free energy contribution of the triloop was calculated by subtracting the Watson-Crick contribution of the stem (29) from the experimental enthalpy and free energy of the stem-loop, respectively.

f

Predicted triloop free energy contributions using the new predictive model shown in Eq 5.

g

Ref (16).

h

Ref (17).

i

Ref (18).

Contribution of Triloops to Stem-Loop Free Energy

The contributions of the 29 triloops to stem-loop stability are also listed in Table 2. The examination of the free energy contributions of triloops to stem-loop free energy indicates a large variance, with ΔG°37,triloop ranging from 4.5 to 6.8 kcal/mol. Interestingly, the 5’CUU3’ triloop is both the least and most stable triloop reported here, depending on its adjacent pair. It is the least stable triloop (ΔG°37,triloop = 6.8 kcal/mol) when adjacent to a G-C pair and the most stable triloop (ΔG°37,triloop = 4.5 kcal/mol) when adjacent to a C-G pair.

Updated Model for Predicting the Free Energy of Previously Unmeasured Triloops

Currently, in order to predict the free energy contribution of triloops, all triloops are assigned a free energy penalty of 5.4 kcal/mol. The only modifications to this penalty is for a triloop containing all C residues, which includes an additional 1.5 kcal/mol penalty (21) and a lookup table for two unstable triloops, 5’CAACG3’ (6.8 kcal/mol) and 5’GUUAC3’ (6.9 kcal/mol) (21). When predicting the free energy contributions of the 29 triloops reported here, experimental values can now be used. For triloops that still do not have experimental values, a predictive model can be utilized.

The current predictive model was applied to all of the triloops reported here. On average, the predicted value was 0.5 ± 0.4 kcal/mol different than the experimental value. The average thermodynamic contribution of the triloops reported here was 5.6 kcal/mol. Using this average value in place of the 5.4 kcal/mol penalty (21) results in an average difference between the predicted value and the experimental value of 0.4 ± 0.4 kcal/mol.

In an attempt to further improve prediction, various other models were tried. The best model (lowest difference between experimental and predicted values with small deviations for the derived parameters) was based on the findings of Shu and Bevilacqua (19). They discovered that sequence preferences for stable triloops included a U-rich loop and C-G as the closing base pair. Based on these observations and the data reported here, the following model was derived:

ΔG°37,triloop=ΔG°37,i+ΔG°37,UUU+ΔG°37,C-Gclosure (5)

Here, ΔG°37,i is the triloop initiation term, 5.9 ± 0.1 kcal/mol, ΔG°37,UUU is a −1.2 ± 0.3 kcal/mol bonus for a UUU triloop, and ΔG°37,C-G closure is a −0.7 ± 0.2 kcal/mol bonus for triloops adjacent to a C-G pair. Triloops adjacent to a G-C pair are not given the C-G bonus. Additional parameters and/or additional bonuses may be discovered with additional experiments. This model was used to predict the free energy contribution of the triloops measured here and the triloops measured previously (1618) (Table 2). On average, the predicted value was 0.3 ± 0.2 kcal/mol different than the experimental value.

DISCUSSION

Database Searching

Due to the size and diversity of the RNA secondary structure database that was searched, we have assumed that the number and type of triloops found in this database are representative of triloops found in naturally occurring RNA.

From the onset of this study, we anticipated that it would be difficult to find triloop sequences that would outcompete bimolecular association of the RNA strand to form a duplex with a 3×3 nucleotide internal loop (or smaller depending on the sequence of the nucleotides) in 1 M NaCl. In fact, triloop formation outcompeted bimolecular association with only two of the ten most frequently occurring triloop sequences, and one of those had already been studied thermodynamically (16). Therefore, the triloops studied here are not the most frequently occurring triloops (Table 1, Dataset 1); however, all of the triloops studied here do occur in the database and do appear to outcompete bimolecular association of the RNA strand.

It is interesting to note that the most frequent triloop in this database was 5’GGG3’ (representing 13% of all triloops in the database). In a similar database containing tetraloops, 5’GGGG3’ was not found (39). In both the triloop and tetraloop database, a C-G base pair sat atop the list of closing base pairs, representing 49 and 42% of all closing base pairs in tetraloops and triloops, respectively.When categorizing the loop nucleotides as purines and pyrimidines (Table 1, Dataset 4), it is interesting to note that loop sequences containing all purines were the most common type for triloops (25%) as well as tetraloops (35%).

Thermodynamic Contributions of Triloops to Duplex Thermodynamics

From the data in Table 2, it is evident that the stability of a triloop alone does not determine its frequency of occurrence. For example, the most stable triloop reported here (5’CCUUG5’, ΔG°37,triloop = 4.5 kcal/mol) is only the 45th most common in the database. Interestingly, due to its abnormally low destabilizing free energy contribution, it is also the worst predicted (using Eq. 5) of those reported here.

The sequence preferences observed here for stable triloops are consistent with the general trends reported by Shu and Bevilacqua (19). For example, triloops containing all uracils in the loop were, on average, 0.9 kcal/mol more stable than other triloop sequences. Triloops containing two uracils were, on average, 0.2 kcal/mol more stable than other triloop sequences; therefore, only triloops with three uracils are given a bonus in the proposed model (Eq. 5). Similarly, triloops adjacent to a C-G pair were, on average, 0.6 kcal/mol more stable than triloops with other adjacent pairs. More specifically, when adjacent to a C-G pair, both an 5’AUA3’ triloop and an 5’AGA3’ triloop are 0.5 kcal/mol more stable than when adjacent to a G-C adjacent pair. Surprisingly, when adjacent to a C-G pair, a 5’CUU3’ triloop is 2.3 kcal/mol more stable than when adjacent to a G-C pair. Although this difference in stability is almost four times the average stabilization by a C-G closing pair, this magnitude of stabilization is not anomalous. When adjacent to a C-G pair, the tetraloop 5’UUCG3’ was found to be 2.3 kcal/mol more stable than when adjacent to a G-C pair in 10 mM NaCl (32). Similarly, when adjacent to a C-G pair, the 5’UUA3’ triloop found to be 2.1 kcal/mol more stable than when adjacent to a G-C pair in 10 mM NaCl (19).

Updated Model for Predicting Thermodynamics of Triloops

Because we have collected thermodynamic data for 24 triloops that previously did not have experimental values, when predicting the free energy contributions of these triloops in an RNA stem-loop, the experimental values can be used. For triloops that still do not have experimental values, the predictive model (Eq 5) can be utilized. It is unclear why 5’UUU3’ is more stable than any other triloop sequence. Similarly, it is unclear why triloops with adjacent C-G pairs are more stable than triloops with adjacent G-C pairs. Structural studies are currently underway to help understand these thermodynamic patterns. Additional studies with more triloop sequences and adjacent base pairs may result in additional free energy bonuses.

Supplementary Material

1_si_001

Acknowledgments

This work was supported by award number R15GM085699 from the National Institute of General Medical Sciences. The content is solely the responsibility of the authors and does not necessarily represent the official view of the National Institute of General Medical Sciences of the National Institutes of Health.

Abbreviations

[A]T

the total strand concentration

[D]

concentration of duplex

[H]

concentration of hairpin

KD

equilibrium constant for duplex formation

KH

equilibrium constant for hairpin formation

R

purines

Y

pyrimidines

%H

percent hairpin in solution

Footnotes

SUPPLEMENTARY INFORMATION

A table listing all of the triloops found in the secondary structure database and the type of RNA in which each was found. Supplemental materials may be accessed free of charge online at http://pubs.acs.org.

REFERENCES

  • 1.Woese CR, Winker S, Gutell RR. Architecture of ribosomal RNA: Constraints on the sequence of tetra-loops. Proc. Natl. Acad. Sci. U.S.A. 1990;87:8467–8471. doi: 10.1073/pnas.87.21.8467. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Wolters J. The nature of preferred hairpin structures in 16S-like rRNA variable regions. Nucleic Acids Res. 1992;20:1843–1850. doi: 10.1093/nar/20.8.1843. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Gutell RR, Gray MW, Schnare MN. A compilation of large subunit (23s and 23s-like) ribosomal-RNA structures - 1993. Nucleic Acids Res. 1993;21:3055–3074. doi: 10.1093/nar/21.13.3055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Schnare MN, Damberger SH, Gray MW, Gutell RR. Comprehensive comparison of structural characteristics in eukaryotic cytoplasmic large subunit (23 S-like) ribosomal RNA. J. Mol. Biol. 1996;256:701–719. doi: 10.1006/jmbi.1996.0119. [DOI] [PubMed] [Google Scholar]
  • 5.Szymanski M, Specht T, Barciszewska MZ, Barciszewski J, Erdmann VA. 5S rRNA data bank. Nucleic Acids Res. 1998;26:156–159. doi: 10.1093/nar/26.1.156. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Larsen N, Samuelsson T, Zwieb C. The signal recognition particle database (SRPDB) Nucleic Acids Res. 1998;26:177–178. doi: 10.1093/nar/26.1.177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Brown JW. The ribonuclease P database. Nucleic Acids Res. 1998;26:351–352. doi: 10.1093/nar/26.1.351. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Waring RB, Davies RW. Assessment of a model for intron RNA secondary structure relevant to RNA self-splicing - A review. Gene. 1984;28:277–291. doi: 10.1016/0378-1119(84)90145-8. [DOI] [PubMed] [Google Scholar]
  • 9.Damberger SH, Gutell RR. A comparative database of group I intron structures. Nucleic Acids Res. 1994;22:3508–3510. doi: 10.1093/nar/22.17.3508. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Kim CH, Kao CC, Tinoco I. RNA motifs that determine specificity between an viral replicase and its promoter. Nat. Struct. Biol. 2000;7:415–423. doi: 10.1038/75202. [DOI] [PubMed] [Google Scholar]
  • 11.Huang H, Alexandrov A, Chen X, Barnes TW, Zhang HY, Dutta K, Pascal SM. Structure of an RNA hairpin from HRV-14. Biochemistry. 2001;40:8055–8064. doi: 10.1021/bi010572b. [DOI] [PubMed] [Google Scholar]
  • 12.McCallum SA, Pardi A. Refined solution structure of the iron-responsive element RNA using residual dipolar couplings. J. Mol. Biol. 2003;326:1037–1050. doi: 10.1016/s0022-2836(02)01431-6. [DOI] [PubMed] [Google Scholar]
  • 13.Convery MA, Rowsell S, Stonehouse NJ, Ellington AD, Hirao I, Murray JB, Peabody DS, Phillips SE, Stockley PG. Crystal structure of an RNA aptamer-protein complex at 2.8 A resolution. Nat. Struct. Biol. 1998;5:133–139. doi: 10.1038/nsb0298-133. [DOI] [PubMed] [Google Scholar]
  • 14.Olsthoorn RCL, Bol JF. Role of an essential triloop hairpin and flanking structures in the 3'-untranslated region of alfalfa mosaic virus RNA in in vitro transcription. J. Virol. 2002;76:8747–8756. doi: 10.1128/JVI.76.17.8747-8756.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Haasnoot PCJ, Bol JF, Olsthoorn RCL. A plant virus replication system to assay the formation of RNA pseudotriloop motifs in RNA-protein interactions. Proc. Natl. Acad. Sci. U.S.A. 2003;100:12596–12600. doi: 10.1073/pnas.2135413100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Serra MJ, Barnes TW, Betschart K, Gutierrez MJ, Sprouse KJ, Riley CK, Stewart L, Temel RE. Improved parameters for the prediction of RNA hairpin stability. Biochemistry. 1997;36:4844–4851. doi: 10.1021/bi962608j. [DOI] [PubMed] [Google Scholar]
  • 17.Groebe DR, Uhlenbeck OC. Characterization of RNA hairpin loop stability. Nucleic Acids Res. 1988;16:11725–11735. doi: 10.1093/nar/16.24.11725. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Giese MR, Betschart K, Dale T, Riley CK, Rowan C, Sprouse KJ, Serra MJ. Stability of RNA hairpins closed by wobble base pairs. Biochemistry. 1998;37:1094–1100. doi: 10.1021/bi972050v. [DOI] [PubMed] [Google Scholar]
  • 19.Shu Z, Bevilacqua PC. Isolation and characterization of thermodynamically stable and unstable RNA hairpins from a triloops combinatorial library. Biochemistry. 1999;38:15369–15379. doi: 10.1021/bi991774z. [DOI] [PubMed] [Google Scholar]
  • 20.Davis PW, Thurmes W, Tinoco I. Structure of a small RNA hairpin. Nucleic Acids Res. 1993;21:537–545. doi: 10.1093/nar/21.3.537. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Mathews DH, Disney MD, Childs JC, Schroeder SJ, Zuker M, Turner DH. Incorporating chemical modification constraints into a dynamic programming algorithm for prediction of RNA secondary structure. Proc. Natl. Acad. Sci. U.S.A. 2004;101:7287–7292. doi: 10.1073/pnas.0401799101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Gutell RR. Collection of small-subunit (16s- and 16s-like) ribosomal-RNA structures: 1994. Nucleic Acids Res. 1994;22:3502–3507. doi: 10.1093/nar/22.17.3502. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Sprinzl M, Horn C, Brown M, Ioudovitch A, Steinberg S. Compilation of tRNA sequences and sequences of tRNA genes. Nucleic Acids Res. 1998;26:148–153. doi: 10.1093/nar/26.1.148. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Michel F, Umesono K, Ozeki H. Comparative and functional-anatomy of group-Ii catalytic introns - A review. Gene. 1989;82:5–30. doi: 10.1016/0378-1119(89)90026-7. [DOI] [PubMed] [Google Scholar]
  • 25.Bevilacqua PC, Blose JM. Structures, kinetics, thermodynamics, and biological functions of RNA hairpins. Annu. Rev. Phys. Chem. 2008;59:79–103. doi: 10.1146/annurev.physchem.59.032607.093743. [DOI] [PubMed] [Google Scholar]
  • 26.Lu ZJ, Turner DH, Mathews DH. A set of nearest neighbor parameters for predicting the enthalpy change of RNA secondary structure formation. Nucleic Acids Res. 2006;34:4912–4924. doi: 10.1093/nar/gkl472. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Mathews DH, Sabina J, Zuker M, Turner DH. Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. J. Mol. Biol. 1999;288:911–940. doi: 10.1006/jmbi.1999.2700. [DOI] [PubMed] [Google Scholar]
  • 28.Wright DJ, Rice JL, Yanker DM, Znosko BM. Nearest neighbor parameters for inosine-uridine pairs in RNA duplexes. Biochemistry. 2007;46:4625–4634. doi: 10.1021/bi0616910. [DOI] [PubMed] [Google Scholar]
  • 29.Xia T, SantaLucia J, Jr, Burkard ME, Kierzek R, Schroeder SJ, Jiao X, Cox C, Turner DH. Thermodynamic parameters for an expanded nearest-neighbor model for formation of RNA duplexes with Watson-Crick base pairs. Biochemistry. 1998;37:14719–14735. doi: 10.1021/bi9809425. [DOI] [PubMed] [Google Scholar]
  • 30.Serra MJ, Lyttle MH, Axenson TJ, Schadt CA, Turner DH. RNA hairpin loop stability depends on closing base pair. Nucleic Acids Res. 1993;21:3845–3849. doi: 10.1093/nar/21.16.3845. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Tuerk C, Gauss P, Thermes C, Groebe DR, Gayle M, Guild N, Stormo G, d'Aubenton-Carafa Y, Uhlenbeck OC, Tinoco I, Borody EN, Gold L. CUUCGG hairpins: Extraordinary stable RNA secondary structures associated with various biochemical processes. Proc. Natl. Acad. Sci. U.S.A. 1988;85:1364–1368. doi: 10.1073/pnas.85.5.1364. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Antao VP, Lai SY, Tinoco I. A thermodynamic study of unusually stable RNA and DNA hairpins. Nucleic Acids Res. 1991;19:5901–5905. doi: 10.1093/nar/19.21.5901. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Varani G, Cheong C, Tinoco I., Jr Structure of an unusually stable RNA hairpin. Biochemistry. 1991;30:3280–3289. doi: 10.1021/bi00227a016. [DOI] [PubMed] [Google Scholar]
  • 34.Heus HA, Pardi A. Structural features that give rise to the unusual stability of RNA hairpins containing GNRA loops. Science. 1991;253:191–194. doi: 10.1126/science.1712983. [DOI] [PubMed] [Google Scholar]
  • 35.Antao VP, Tinoco I. Thermodynamic parameters for loop formation in RNA and DNA hairpin tetraloops. Nucleic Acids Res. 1992;20:819–824. doi: 10.1093/nar/20.4.819. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Williams DJ, Hall KB. Experimental and computational studies of the G[UUCG]C RNA tetraloop. J. Mol. Biol. 2000;297:10445–11061. doi: 10.1006/jmbi.2000.3623. [DOI] [PubMed] [Google Scholar]
  • 37.Dale T, Smith R, Serra MJ. A test of the model to predict unusually stable RNA hairpin loop stability. RNA. 2000;6:608–615. doi: 10.1017/s1355838200992495. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Proctor DJ, Schaak JE, Bevilacqua JM, Falzone CJ, Bevilacqua PC. Isolation and characterization of a family of stable RNA tetraloops with the motif YNMG that participate in tertiary interactions. Biochemistry. 2002;41:12062–12075. doi: 10.1021/bi026201s. [DOI] [PubMed] [Google Scholar]
  • 39.Sheehy JP, Davis AR, Znosko BM. Thermodynamic characterization of naturally occurrring RNA tetraloops. RNA. 2010;16:417–429. doi: 10.1261/rna.1773110. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1_si_001

RESOURCES