Abstract
The NMR structures of and are reported. The internal loop, , is about 2 kcal/mol more stable than at 37 °C. The duplexes assemble into similar global folds characterized by the formation of tandem sheared GA pairs. The different stabilities of the loops are accompained by differences in the local structure of the closing GU pairs. In the internal loop, the GU pairs form canonical wobble configurations with two hydrogen bonds, whereas in the GU pairs form a single hydrogen bond involving the amino group, GH22, to the carbonyl group, UO4. This pairing is similar to the GU closing pair of the 690 hairpin loop found in E. coli 16S rRNA. The and structures reveal how the subtle interplay between stacking and hydrogen bonding determines sequence dependent conformation and thermodynamic stability. Thus, this work provides structural and thermodynamic benchmarks for theoreticians in the ongoing effort to understand the sequence dependence of RNA physicochemical properties.
Introduction
In the post-genomic era, it is evident that RNA actively choreographs many biological events ranging from catalysis to gene regulation (1-4). This emerging awareness of RNA function compels development of a fundamental understanding of the sequence dependence of RNA stability, structure, and dynamics. The impetus derives from the difficulty of empirically determining important physicochemical properties for every interesting RNA molecule revealed by genome sequencing projects. By systematically studying model systems, rules can be developed for predicting these properties from primary sequence (5-7). This will greatly assist the rapid extraction of important functional RNAs from genome-wide searches (8, 9).
RNA typically exists as a single stranded molecule capable of folding into secondary and tertiary structural elements. One important element of RNA secondary structure is the internal loop, which consists of stretches of non-Watson-Crick pairs flanked by canonical helical segments. The sequence dependence of internal loop stability and structure has been extensively studied experimentally (10-38) and some computational approaches have also been tested (39-41). Despite this effort, there are many unknowns regarding the nature of the physicochemical forces shaping internal loop stability, structure, and dynamics (19, 42, 43). In order to gain further insight into the nature of these forces, this paper presents the solution three dimensional structures of tandem GA pairs closed by GU pairs.
GA pairs are highly represented in known RNA secondary structures, and their occurrence has been linked to both functional and structural roles (44, 45). The sequence dependence of both stability and local structure of tandem GA pairs have been investigated (11, 15, 19, 23, 25-27, 37, 46). The thermodynamic stabilities of symmetric tandem GA pairs vary in free energy over 4 kcal/mol depending on the identity of the adjacent/closing base pair (Table 1) (note negative ΔG° values are stabilizing) (15). The stability increment follows the trend: .
Table 1.
Thermodynamics of loop formation and type of sugar pucker for internal loops.
| Loop | ΔG° 37 loopa (kcal/mol) |
Sugar Pucker of Loop Internal Gb |
GA Type |
|---|---|---|---|
| −2.6 | C3′-endo (N) | Imino | |
| −0.7 | C3′-endo (N) | Sheared | |
| 0.1 | C3′/C2′-endo (N/S) | Sheared | |
| 0.3 | not determined | Sheared | |
| 0.7 | C2′-endo (S) | Sheared | |
| 1.8 | C2′-endo (S) | Sheared |
The reported loop free energy increments for the GU/UG loops are updated with the most recent nearest neighbor parameters (5, 7). References for the original measurements are given in (15). The thermodynamic stability increments were calculated from the following expression (80): ΔG°37, loop = ΔG°37, (duplex w/loop) − ΔG°37, (duplex w/out loop) + ΔG°37, (interrupted basepair)
As an illustration, the loop free energy increment of GGAU is calculated as follows:
ΔG°37 (GGAU)2 = ΔG°37 (rGCGGAUGC)2 − ΔG°37 (rGCGUGC)2 + ΔG°37 (GU)2
where ΔG°37 (rGCGGAUGC)2 is the measured value of the duplex containing the internal loop (25), ΔG°37 (rGCGUGC)2 is the measured value of the duplex without the loop (26), and ΔG°37 (GU)2 is the free energy increment of the nearest neighbor interaction interrupted by the loop(5).
The most stable tandem GA pair internal loop is , which has a favorable loop free energy increment of −2.6 kcal/mol (15, 26). The NMR structure of this loop (23) has head to head GA imino pairs (cis Watson-Crick/Watson-Crick) (47), with Watson-Crick GC closing pairs. The most destabilizing tandem GA internal loop is , which has an unfavorable loop free energy increment of about 2 kcal/mol at 37 °C (15, 26). Thus, the two C to U substitutions in going from to produce an approximately 1000-fold change in the equilibrium constant for folding at 37 °C. This contrasts with a destabilization of about 1 kcal/mol at 37 °C in going from to , which corresponds to less than a 10-fold change in equilibrium constant for folding. The internal loop is less stable than the internal loop by 1.7 kcal/mol at 37 °C (15, 26). According to a hydrogen bonding model, and should be isoenergetic; yet they differ by at least a factor of 10 in Keq at 37 °C.
What are the underlying forces responsible for the disparate stabilities? Part of the ΔΔG°loop in going from to can be parsed into a change in the number of hydrogen bonds in the closing base pairs. The average difference between closing an internal loop with two GC and two AU pairs is 1.3 kcal/mol (19) and is expected to be similar for GC to GU closure if only the number of hydrogen bonds is important. The difference in stabilities of and is consistent with this expectation. This leaves unexplained, however, about 3 kcal/mol of sequence dependence of free energy between and .
To provide insight into correlations between energetics and structure for tandem GA mismatches with GU closing pairs, the local structures of (rGCUGAGGCU)2 and (rGCGGAUGCU)2 have been determined by solution NMR and restrained molecular dynamic simulations (the residues in the antiparallel strand will be denoted with an * throughout the text). The two loops assemble into similar three dimensional structures with sheared GA pairs (trans Hoogsteen/Sugar edge (47)), but with subtle differences in hydrogen bonding and stacking. Comparisons to the thermodynamics and structures of symmetrical tandem GA internal loops with Watson-Crick closing pairs provide new benchmarks for theoreticians investigating the intricate details of the interactions molding tandem GA internal loop stability (39).
MATERIALS AND METHODS
RNA Synthesis and Purification
rGCUGAGGCU and rGCGGAUGCU were prepared by solid phase synthesis on an ABI 392 DNA/RNA synthesizer following the phosphoramidite method (48, 49). CPG support and phosphoramidites were purchased from Proligo and Azco, respectively. Following synthesis, the resulting support bound oligonucleotides were incubated at 55 °C overnight in a 3:1 ammonium hydroxide: ethanol solution to remove the base protecting groups and to cleave the oligonucleotides from the support. The soluble oligonucleotides were separated from the support with a Quik-Sep filtration column (BIO-RAD). Subsequently, the silyl protecting groups were removed by incubating at 55 °C in a 9:1 TEA-3HF (triethylamine trihydrofluoride): DMF solution for 2-4 h at 55 °C. The deprotected oligonucleotides were desalted on a Sep-Pak C18 column (Waters) and purified on a large preparative Whatman TLC plate (20 cm × 20 cm) using a 55:35:10 1-propanol: ammonia: water mobile phase. The main product bands were visualized by UV shadowing, scraped from the plate and extracted with RNase free water. The purity of the oligonucleotides was checked by analytical TLC, and the masses were verified by ESI MS with a Hewlett-Packard 1100 LC/MS chemstation.
UV Melting Experiments
Purified oligonucleotides were lyophilized and reconstituted in 1 M NaCl, 20 mM cacodylate (pH 7), and 0.5 mM Na2EDTA. All experiments were performed on a Beckman Coulter DU-640 spectrophotometer equipped with a Peltier temperature controller cooled with flowing water. Curves of absorbance at 260 nm versus temperature were recorded using a heating rate of 1 °C/min with a 1 min averaging time. The data were fit to a two-state model assuming linear sloping baselines and temperature independent ΔH° and ΔS° using the MeltWin program (50). Additionally, the melting data were analyzed at different concentrations with the following relationship to extract the van’t Hoff thermodynamic parameters (51).
| (1) |
NMR Sample Preparation
Samples were prepared for NMR by reconstituting the purified oligonucleotides in 500 μL of RNase free water. The samples were then dialyzed exhaustively against 1 L of autoclaved double distilled water. Following dialysis, the samples were lyophilized and reconstituted in 300 μL of NMR buffer containing 80 mM NaCl, 0.5 mM Na2EDTA and 10 mM sodium phosphate, pH 5.5 (90% H2O/10% D2O). The pH was measured in the NMR tube using an Orion micro-combination pH/sodium electrode (Thermo Electron Corporation), and was 5.30 and 5.18 respectively, for (rGCUGAGGCU)2 and (rGCGGAUGCU)2. For non-exchangeable proton spectra, the oligonucleotides were lyophilized and reconstituted in 99.96% D2O three times. The final NMR samples were prepared in 99.996% D2O (Cambridge Isotopes) at pH 5.85. The single strand concentrations for (rGCUGAGGCU)2 and (rGCGGAUGCU)2 were ~ 1.5 mM and 2.0 mM, respectively.
NMR Spectroscopy
All of the NMR experiments were performed on a Varian Inova 600 MHz spectrometer. The 1D imino proton spectra were recorded in 90% H2O using an S-shaped excitation pulse with excitation maximum near 12 ppm. SNOESY spectra were recorded at 100, 120, and 200 ms mixing times at various temperatures. NOESY spectra were acquired in D2O at 100, 200, and 400 ms mixing times at 10 °C for (rGCUGAGGCU)2 and 5 °C for (rGCGGAUGCU)2. Scalar coupled protons were identified in TOCSY spectra recorded at 12 and 40 ms mixing time and in DQFCOSY spectra. Data processing were performed with NMRPipe (52).
Restraint Generation and Structure Calculation
Peak integrations were performed in Sparky using the “sum over box” method (53). Data from the 100 ms mixing time NOESY spectra were used to derive distance restraints using the two-spin approximation with 1/r6 scaling. Data from the 400 ms mixing time NOESY spectra were only used to confirm peaks that were weak at 100 ms. The reference volumes were taken from the H1′-H2′ cross-peaks of the stem bases with the reference distance for this pair of protons set to 2.75 Å. To account for relaxation effects, spin diffusion, or base-line distortions, distance bounds were set to ±30% for the 100 ms mixing time data. Distances derived from overlapped cross-peaks were loosened due to inaccuracies in their volume measurements. There were several critical loop defining distance restraints for both (rGCUGAGGCU)2 and (rGCGGAUGCU)2 determined from the SNOESY data, which are discussed below. All of these restraints were placed into bins of 1.8 – 3.5 Å and 2.5 – 5.5 Å based on the average volume of pyrimidine H5-H6 cross-peaks (2.45 Å) at 5 °C for both (rGCUGAGGCU)2 and (rGCGGAUGCU)2. Hydrogen bond restraints were applied to the four GC pairs in the stems of each duplex. The values were set to the average hydrogen bond distance ± the standard deviation measured for a 1.20 Å crystal structure of an all A-form RNA duplex determined by Steitz and coworkers (54) (see caption to Tables S4 and S5 for full detail). The GU pairs in (rGCUGAGGCU)2 were restrained to be in a canonical wobble configuration, but the GU pairs of (rGCGGAUGCU)2 were left unrestrained or restrained to various conformations due to the lack of G and U imino NOE cross-peaks (see below).
Backbone torsion angle restraints were semi-quantitatively determined from 1H-31P HETCOR spectra. The Watson-Crick region of each oligonucleotide displayed a typical A-form spread in the phosphorus chemical shifts (~ 1 ppm) (55). As a result, this region was restrained to have A-form like backbone torsion angles. The loop torsion angles for both duplexes were left unrestrained. The glycosidic torsion angle, χ, was restrained to the anti conformation for all nucleotides in the duplex. This is consistent with the NMR data as there are no unusually large intraresidue H1′-H8 cross-peaks indicative of a syn conformation. Sugar pucker restraints, δ, were derived from the TOCSY and DQFCOSY spectra. All stem bases were restrained to the C3′ endo conformation based on the absence of H1′-H2′ cross-peaks and the existence of strong H2′-H3′ cross-peaks for the duplexes. In (rGCUGAGGCU)2, the δ torsion angles for the G4, A5, and U9 were restrained to cover both the C2′ and C3′ endo conformations. In (rGCGGAUGCU)2, the δ torsion angle for G4 was restrained to the C2′ endo conformation due to the presence of a strong H1′-H2′ (> 10 Hz) cross-peak in the DQFCOSY spectrum. A5 and U9 were restrained to cover both sugar pucker conformations as in (rGCUGAGGCU)2.
Restrained molecular dynamics and simulated annealing calculations were performed in CNS version 1.1 (56). The initial input structures were built as A-form duplexes in the Biopolymer module of Insight II. The rna-dna-allatom parameter and topology files of CNS were used to assign atomic charges, masses, atom linkages and force constants (56). The structure calculations proceeded as follows: (1) High temperature dynamics – the system was raised to 1000 K and allowed to develop over 1 ps under the influence of bond, angle, improper, NOE (5 kcal/mol Å2) and dihedral angle (100 kcal/mol rad2) restraints; (2) Simulated annealing in Cartesian space – the temperature was cooled from 1000 to 0 K over 200 ps during which the NOE and dihedral angle force constants were increased to 150 kcal/mol Å2 and 200 kcal/mol rad2, respectively. The van der Waals scale factor was raised from 1 to 4 and the electrostatics were set to 10%; (3) Energy Minimization – 1,000 steps of Powell energy minimization was applied with full van der Waals and electrostatic terms. Planarity restraints (25 kcal/mol Å2) were applied to the GC stem regions during the course of the simulation. All structures agreed with the experimental NOE and dihedral angle restraints within 0.2 Å and 5°, respectively. All models were visualized and analyzed in Pymol (57) and helical parameters calculated with X3DNA (58).
RESULTS
Thermodynamics
Table 2 shows the thermodynamic parameters measured for duplex formation of (rGCUGAGGCU)2 and (rGCGGAUGCU)2, along with previous values for the duplexes without the 3′ dangling uracils (26). Addition of the 3′ dangling uracils increases the duplex stability by 0.9 and 1.5 kcal/mol at 37 °C for (rGCUGAGGCU)2 and (rGCGGAUGCU)2, respectively. This is less than the 2.2 kcal/mol increase in stability anticipated for two interactions per duplex (59). Evidently, the internal loops destabilize the 3′ favorable 3′G stacking interaction imparted by the dangling U relative to a fully Watson-Crick helix. This is a non-nearest neighbor effect. A similar destabilization for 3′ dangling ends has been observed for bulge loops where the stability increments become less favorable as a function of the loop size (60).
Table 2.
Thermodynamic parameters of duplex formationa.
| 1/Tm VS lOg(CT) | Average of Curve Fits | |||||||
|---|---|---|---|---|---|---|---|---|
| Sequence | −ΔH° (kcal/mol) |
−ΔS° (eu) |
−ΔG°37 (kcal/mol) |
Tbm (°C) |
−ΔH° (kcal/mol) |
−ΔS° (eu) |
−ΔG° 37 (kcal/mol) |
Tbm (°C) |
| 63.5 ± 3 (65.1 ± 2) |
180.0 ± 9.6 (188.4 ± 5) |
7.63 ± 0.07 (6.70 ± 0.03) |
46.9 (42.0) |
69.9 ± 13 (58.4 ± 6) |
200.1 ± 40.8 (166.71 ± 20) |
7.88 ± 0.36 (6.71 ± 0.06) |
47.1 (42.6) |
|
| 58.3 ± 5 (51.5 ± 1) |
168.3 ± 16.2 (151.1 ± 4) |
6.12 ± 0.08 (4.64 ± 0.03) |
39.4 (30.9) |
63.4 ± 9 (50.4 ± 9) |
184.6 ± 29.6 (147.18 ± 31) |
6.19 ± 0.27 (4.79 ± 0.27) |
39.5 (31.6) |
|
All measurements were made in 1 M NaCl, 10 mM sodium cacodylate (pH 7.0) and 0.5 mM Na2EDTA. The values in parentheses were measured without the 3′ dangling U and are taken from reference 26.
Calculated at 10−4 M oligonucleotide strand concentration.
Exchangeable Proton Assignments
The 1D imino proton spectra for both duplexes are shown in Figure 1. In the 12.5-13.5 ppm region, each duplex gives two imino proton peaks, which correspond to GC base pairs. In (rGCUGAGGCU)2 at 5 °C, four imino proton peaks are observed upfield of the GC pairs. These peaks belong to the remaining G and U bases. Even at pH 5.2 and 5 °C, only three of the expected four peaks are distinctly observed in an analogous region of the spectrum for (rGCGGAUGCU)2. A previous 1D NMR analysis at 20 °C of (rGCGGAUGC)2 without a 3′ dangling U exhibited a broad peak around 13 ppm (26). On the basis of the downfield shift of this peak, it was suggested that (rGCGGAUGC)2 forms a tandem head to head Watson-Crick GA pair similar to that found in (rGCGGACGC)2 (23). The work shown here contradicts that assessment, as a similar shifted peak is not observed until the temperature is elevated above 25 °C, which is within 15 °C of the Tm (Figure 1). This peak is probably a result of conformational exchange of one of the stem guanosines. Also, the spectral assignments of (rGCGGAUGCU)2 reported here establish that G4H1 resonates at ~ 9.5 ppm (see below), which is typical for non-hydrogen bonded imino protons of guanosines.
Figure 1.
NMR spectra for the imino proton region as a function of temperature for (left) (rGCUGAGGCU)2 and (right) (rGCGGAUGCU)2. The spectra were recorded in 80 mM NaCl, 0.5 mM Na2EDTA, and 10 mM sodium phosphate pH 5.30 and 5.18, respectively. The bars above demarcate the regions expected for base pair types: GC, Watson-Crick GC; GU, wobble GU; nhbU, non-hydrogen bonded U; and nhbG, non-hydrogen bonded G. The loop G and U bases are colored red.
The imino and stem amino protons were assigned from the SNOESY spectra following established procedures (Figures S1 and S2) (61). Some base-specific assignments of the G and U imino protons were confirmed by natural abundance 1H-15N HSQC spectra (Figure S3). G1 and G7 imino proton peaks were confirmed by cross-strand NOEs to amino and H5 (by spin diffusion through aminos) protons of C8* and C2*, respectively, indicative of the formation of homoduplexes (Figures S1 and S2).
In (rGCUGAGGCU)2, the U3 and G6 imino resonances were assigned by the presence of a strong cross-peak between the two protons (Figure 2), which is expected for a GU wobble pair as the separation distance is less than 3 Å. G6H1 was confirmed to resonate upfield of U3H3 on the basis of the imino resonance connectivity pathway, a strong NOE to its H21 (Figure 2), and from the natural abundance 1H-15N HSQC spectrum (Figure S3). The U9 imino resonance was confirmed by a very weak cross-peak to G1H1 and to C8H41 The chemical shift of U9 at ~ 10.9 ppm is consistent with the chemical shift of the 3′ dangling uracil in 5′GGUGGAGGCU/3′PCCGAAGCCG (10). By process of elimination, the imino resonance at 9.7 ppm was assigned to G4.
Figure 2.
Comparison of the imino region of the 120 ms mixing time SNOESY spectra at 5°C for the internal loops of (left) (rGCUGAGGCU)2 and (right) (rGCGGAUGCU)2. The labels and color scheme are as follows: red, similar NOE cross-peaks in both (rGCUGAGGCU)2 and (rGCGGAUGCU)2; blue, NOE cross-peaks unique to (rGCUGAGGCU)2; green, NOE cross-peaks unique to (rGCGGAUGCU)2; and orange, similar NOE cross-peaks in both (rGCUGAGGCU)2 and (rGCGGAUGCU)2, with ~ 2 ppm chemical shift difference in the ω1 dimension. Notice that the GU pair in (rGCUGAGGCU)2 is characterized by a strong NOE between U3/3*H3 and G6*/6H1, whereas in (rGCGGAUGCU)2 the U6/6*H3 peak is missing (see text for more details).
A similar strategy provided the assignments for (rGCGGAUGCU)2. An important difference between (rGCGGAUGCU)2 and (rGCUGAGGCU)2 is the lack of a G3/3*H1 to U6*/6H3 cross-peak because the U6H3 imino resonance is broad (Figure 1). The lack of a G3/3*H1-U6*/6H3 NOE in (rGCGGAUGCU)2 (Figure 2) suggests that the closing GU pairs are not in a wobble conformation.
Important NOE restraints for defining the loop topology in both duplexes were obtained from SNOESY spectra (Figure 2). In (rGCUGAGGCU)2, there is a cross-strand NOE between the G4 imino proton and G4*H8. This NOE is useful in orienting the G4 base from one strand relative to the G4 base in the other strand. Also, G4H1 has an NOE to G4*H1′ and to G4*H2′. Both cross-peaks are cross-strand NOEs as the intranucleotide separation is > 6 Å, which is too far to give rise to a cross-peak of the observed size. Thus, the G4 base from one strand is close to the G4 from the other strand.
In (rGCGGAUGCU)2, G4H1 shows similar cross-strand NOE patterns as G4H1 in (rGCUGAGGCU)2 (Figure 2), i.e. G4H1-G4*H8 and G4H1-G4*H1′ cross-peaks. Additional NOEs were observed in (rGCGGAUGCU)2 relative to (rGCUGAGGCU)2 (Figure 2). The differences between the two spectra include the following: (1) G4H1 of (rGCGGAUGCU)2 has a cross-strand NOE to its own OH2′ (the OH2′ assignment was confirmed by a H2O TOCSY spectrum (data not shown)); (2) Both A5H61 and A5H62 have a strong cross-strand NOE to G4*H1′; (3) Both G4H21 and G4H22 have a strong cross-strand NOE to G4*H1′. The aforementioned NOE patterns suggest that different interactions are involved in forming the local structure of (rGCGGAUGCU)2 relative to (rGCUGAGGCU)2.
Tandem GA pairs, as well as some GU pairs, have been implicated as metal binding pockets, with cobalt hexamine serving as a spectroscopic probe for binding (30, 62, 63). The addition of 3 mM cobalt hexamine to (rGCUGAGGCU)2 did not provide any further NOE information regarding the exchangeable protons, and the non-exchangeable NOE patterns with and without cobalt hexamine were identical. Similarly, the addition of 1 mM cobalt hexamine did not sharpen the linewidth of U6H3 in (rGCGGAUGCU)2 or provide any additional information about the other exchangeable protons. Only slight spectral differences were observed in the aromatic region of the spectrum, while leaving the NOESY walk pattern unchanged (data not shown). Evidently, cobalt hexamine does not perturb the structures.
Non-exchangeable Proton Assignments
NOESY spectra were recorded at 10 and 5 °C for (rGCUGAGGCU)2 and (rGCGGAUGCU)2, respectively. At higher temperatures, both duplexes show a number of conformational exchange peaks. The 400 ms mixing time NOESY spectra in the (H8/H2/H6) – (H5/H1′) region are shown in Figure 3. Complete sequential NOEs are observed for both duplexes indicating structured loops. Except for H5′, H5″, and U9H4′ of (rGCGGAUGCU)2, complete aromatic and sugar proton assignments (Tables S2 and S3) were made for both duplexes following established procedures (61). Identification of the pyrimidine H5-H6 cross-peaks from TOCSY spectra assisted with initiating the assignments (Figure S4). In (rGCUGAGGCU)2, the G6H1′ is shifted upfield to around 4.3 ppm. This peak was confirmed by the observation of a cross-peak in the natural abundance 1H-13C HSQC spectrum (Figure S5). An upfield shift of the G6H1′ of (rGCUGAGGCU)2 is consistent with this proton being positioned above or below an aromatic ring as observed with GNRA tetraloops and sheared purine – purine pairs (27, 64, 65). The equivalent H1′ in (rGCGGAUGCU)2, U6H1′, has a chemical shift of 5.77 ppm (Figures 3 and S6), which is within the standard chemical shift region for H1′/H5 protons. The H5 proton of U6 in (rGCGGAUGCU)2 is shifted upfield to 4.4 ppm, however (Figures S4 and S6). The differences in the chemical shifts of H1′ at position 6 of (rGCUGAGGCU)2 and (rGCGGAUGCU)2 suggest a difference in the local structure at the base 5 to base 6 step in the duplexes. Also, the A5H2 protons in the two duplexes show medium strength NOEs to both A5H1′ protons and to G6H1′ of (rGCUGAGGCU)2 or U6H1′ of (rGCGGAGUCU)2 (Figure 3). The intensity of the A5H2-H1′ NOE for both duplexes suggest that it is an interstrand NOE implying that the A5 bases stack on each other across-strand and on G6 of (rGCUGAGGCU)2 or U6 of (rGCGGAUGCU)2 as has been observed in (GGCAAGCCU)2 and (GCCGAGGC)2 (14, 27).
Figure 3.
The 400 ms mixing time 2D NOESY spectra of the H8/H2/H6 to H1′/H5 region of (top) (rGCUGAGGCU)2 at 10 °C (pH 5.85) and (bottom) (rGCGGAUGCU)2 at 5 °C (pH 5.85). Black lines trace the connectivity from H8/H6/H2 to H1′ pathway for both duplexes. The green rectangles indicate identical NOE patterns in both (rGCUGAGGCU)2 and (rGCGGAUGCU)2 involving A5H2 to A5*H1′, G6H1′ for (rGCUGAGGCU)2 and A5H2 to A5*H1′, U6H1′ in (rGCGGAUGCU)2. Asterisks denote H5-H6 cross-peaks.
The assignments of H2′ protons follow directly from the strong H1′-H2′ cross-peaks in NOESY spectra. The assignments were confirmed by strong intrastrand (n)H8/H6 – H2′ (n-1) NOEs in the aromatic to sugar region of the spectra. H3′ assignments were made from cross-peaks to H2’s in DQFCOSY spectra and from 1H-31P HETCOR spectra (Figures S7 and S8), which also facilitated assignments of phosphorus resonances. The H3′ assignments were further verified by the presence of medium strength intrastrand NOEs from H8/H6-H3′ protons in NOESY spectra. Some H4′ assignments were made from the scalar coupling to H3′ protons observed in DQFCOSY spectra. All aromatic and most ribose proton assignments were further verified by natural abundance 1H-13C HSQC spectra.
Structure Modeling
Structures for (rGCUGAGGCU)2 and (rGCGGAUGCU)2 were modeled by restrained molecular dynamics and simulated annealing as described in the Materials and Methods section. The closing GU pairs of (rGCUGAGGCU)2 form canonical wobble configurations, which is consistent with the strong cross-peak between U3/U3* and G6*/G6 imino resonances (Figure 2). The conformation of the closing GU pairs in (rGCGGAUGCU)2 is less obvious due to the lack of an NOE between the imino protons of the GU pairs (Figure 2). The NOE is missing because the U6H3 resonance is broad (Figure 1) and lost during the mixing time presumably due to rapid exchange with the solvent. The broad linewidth of U6H3 is unlikely a result of conformational dynamics as the linewidths of its aromatic and ribose protons are comparable to the linewidths of the other aromatic and ribose protons in the duplex. In order to identify possible GU pair structures consistent with the data, the modeling was done several times with different imposed restraints.
Modeling calculations for (rGCGGAUGCU)2 were performed by restraining the GU pairs in four conformations (Figure 4): (a) single hydrogen bond 4 (SHB4); GH22 to UO4, (b) bifurcated 4; GH1/H21 to UO4, (c) GU wobble, and (d) bifurcated 2; GH1/H21 to UO2 (note that the numbering for the SHB and bifurcated structures refer to the number of the acceptor oxygen atom of U). Each set of structures were analyzed for their agreement with the NMR restraints and chemical shifts. Each conformation yielded structures consistent with the NOE derived distance bounds within 0.2 Å, but only the SHB4 and bifurcated 4 conformations satisfied the dihedral angle selection criteria. Structures derived from the GU wobble and bifurcated 2 conformations consistently displayed a G3 χ dihedral angle violation greater than 5°, which is likely a result of the tight contraints used for the χ dihedral angles (−150 ± 20°). Tables S1 (a-d) show the average simulated energies for all converged structures along with the NOE energy components. As judged by the simulated energies, the SHB4 (-453.1 ± 14.3 kcal/mol) and bifurcated 4 (-459.1 ± 9.1 kcal/mol) models satisfy the force field parameters better than the wobble (-359.6 ± 9.0 kcal/mol) and bifurcated 2 (-356.6 ± 13.0 kcal/mol) models. This is in part a reflection of the more unfavorable NOE energies associated with the wobble (10.96 ± 1.19) and bifurcated 2 (12.34 ± 2.05) models compared to the SHB4 (1.84 ± .36) and bifurcated 4 (1.75 ± .25) models (Tables S1).
Figure 4.
Modeled GU pair conformations for (rGCGGAUGCU)2 (see text for more details).
The SHB4 pairing (Figure 4) provided the best agreement between the observed chemical shifts for the loop aromatic and H1′ protons and those predicted by the program Nuchemics (66, 67) (Tables 3 and S1). In particular, the unusual U6H5 chemical shift of 4.41 ppm is predicted to be 4.57 ± 0.4 ppm for SHB4 and 5.45 to 5.59 ± 0.1 ppm for the other GU pairings. SHB4 also shows the best agreement between the observed and predicted U6H1′ shifts (Table 3). The unusual U6H5 shift is a result of ring current effects from the neighboring A4 nucleoside. A similar ring current effect explains the upfield shift to 4.32 ppm of G6H1′ of (rGCUGAGGCU)2, which is consistent with the predicted chemical shift of 4.25 ± 0.3 ppm. Figure S9 shows the position of U6H5 and H1′ for the SHB4 and wobble models along with the position of G6H1′ of (rGCUGAGGCU)2, relative to the respective A5 nucleoside.
Table 3.
Comparison of observed and predicted chemical shift values for U6H5 and H1′ of (rGCGGAUGCU)2 modeled with different GU hydrogen bond conformations.
| U6H5Obs | U6H5Pred | U6H1′Obs | U6H1′Pred | |
|---|---|---|---|---|
| GU SHB4 | 4.41 | 4.57 ± 0.4 | 5.77 | 5.43 ± 0.30 |
| GU bifurcated 4 | 4.41 | 5.45 ± 0.10 | 5.77 | 5.19 ± 0.15 |
| GU wobble | 4.41 | 5.59 ± 0.07 | 5.77 | 3.55 ± 0.46 |
| GU bifurcated 2 | 4.41 | 5.59 ± 0.10 | 5.77 | 3.58 ± 0.43 |
Note, the GU SHB4 model shows the best agreement between the observed and predicted shifts (see text for more detail). The predicted values were calculated with the program Nuchemics (66, 67). As a comparison, the GU SHB4 structure was also calculated in the Discover 98 software package using the AMBER 95 forcefield. The predicted values for U6H5 and H1′ are 4.26 ± 0.16 and 5.87 ± 0.04, respectively.
U6H3 in (rGCGGAUGCU)2 has a broad lineshape compared to the sharp lines for the non-exchangeable protons of U6 and also has a strong cross-peak to water. Both observations indicate that U6H3 is in rapid exchange with water and not in a GU wobble conformation. The sharp linewidth of the G3/3* imino proton (Figure 1), however, suggests that it is protected from exchange with water. In structures with the GU SHB4 conformation, G3/3*H1 is in the major groove underneath the G4/4* carbonyl oxygen. In this orientation, G3/3*H1 could be stabilized by a water bridged hydrogen bond to the G4/4* carbonyl oxygen. Based on functional group substitution (68) and NMR derived structures (64), water bridged hydrogen bonds have been proposed to help stabilize GNRA tetraloops.
The SHB4 conformation of (rGCGGAUGCU)2 agrees best with the NMR data and the ensuing discussions will focus on this model. The structure calculation statistics are provided in Tables S4 and S5. Figure 5 shows the superposition of 15 lowest energy structures for (rGCUGAGGCU)2 and (rGCGGAUGCU)2. Both structures are well-defined as the all-atom pairwise rmsd for the 30 calculated structures of (rGCUGAGGCU)2 and (rGCGGAUGCU)2 are 0.96 and 0.78 Å, respectively. Both internal loops form tandem sheared GA base pairs (trans Hoogsteen/Sugar edge AG) (47, 69) as has been observed elsewhere (11, 27).
Figure 5.
Minor groove view of a superposition of 15 selected structures of the 30 converged structures for (left) (rGCUGAGGCU)2 (rmsd of 0.96 Å) and (right) (rGCGGAUGCU)2 (rmsd of 0.78 Å). The color scheme is indicated in the sequence below each structure. Hydrogens and non-bridging oxygens have been removed for clarity.
DISCUSSION
A complete description of RNA structure-function relationships requires a mastery of the sequence dependence of physicochemical properties of RNA. It is impractical to determine these properties empirically for every known RNA sequence. If the connections between sequence, stability, and structure can be deciphered, then rules for predicting structure and possibly function could be established. While these linkages are largely known for fully paired Watson-Crick regions, internal loops still pose a challenge. This is largely due to the vast number of possible base pair and stacking configurations within internal loops. To provide benchmarks toward advancing basic comprehension of RNA internal loops, the solution structures of two duplexes containing tandem GA pairs adjacent to GU pairs have been solved. Despite being thermodynamically destabilizing (Table 1), both loops adopt relatively ordered structures at low temperatures. The detailed structures of the loops consist of tandem sheared GA mismatches, but with different hydrogen bonding within the GU closing pairs. Presumably, the sequence dependent hydrogen bonding and stacking contribute to the disparate loop thermodynamic stabilities.
Thermodynamics of Internal Loops Containing Tandem GA Pairs
Table 1 shows the experimental free energy increments for all symmetric tandem GA internal loops of the type: , where XY denotes GC/CG, AU/UA or GU/UG base pairs. The loop free energies for tandem GA pairs span 4 kcal/mol at 37 °C. Of the six internal loops listed in Table 1, five have solution NMR structures (23, 27, 37) with one of those also having a crystal structure (11). The exception is the internal loop, which is believed to form a sheared GA pair on the basis of a 1D NMR spectrum (26). The structural and thermodynamic results provide a database to probe structure-stability relationships. All the loops have the nearest neighbor. Thus differences in stability and structure presumably reflect differences in pairing within the adjacent base pairs and stacking between a GA pair and its nearest neighbor.
Five of six internal loops in Table 1 contain two sheared GA pairs. Only has imino GA pairs. It is the most stable internal loop at 37 °C, about 2 kcal/mol more stable than the next most stable internal loop, . The extra stability of may reflect the smaller distortion from A-form like structure for imino GA pairs compared to sheared GA pairs. Stacking interactions within the loop and with the adjacent helix must also be important. Interestingly, the least stable internal loop is , which is about 1 kcal/mol less stable than the next least stable internal loop, . The other sheared GA internal loops have similar stabilities, averaging 0.1 ± 0.6 kcal/mol at 37 °C. Most of this variation can be attributed to the extra hydrogen bond in each closing pair of , relative to , , and . The extra hydrogen bonds are expected to enhance stability by about 1.3 kcal/mol (19). This suggests that interactions in the internal loop differ from those in the other sheared GA structures. As discussed below, the instability of the internal loop is accompanied by unexpected structural features in the closing GU pairs.
Context Dependent Structural Interactions
As outlined above and shown in Figures 2 and 3, the base pair stacking in (rGCUGAGGCU)2 is established by several spectroscopic features, which result in the following stacking patterns: major groove, U3/G4/G4*/U3* and minor groove, G6/A5/A5*/G6* (Figures 6 and 7). Generally, the adenines overlap more than the guanines in the GA pairs, possibly due to the continuous spine of purine bases forming in the minor groove. By comparison, the observed NOE patterns shown in Figures 2 and 3 for (rGCGGAUGCU)2 result in a major groove, G3/G4/G4*/G3* stack and a minor groove, U6/A5/A5*/U6* stack (Figures 6 and 7). In contrast to (rGCUGAGGCU)2, the major groove of (rGCGGAUGCU)2 is characterized by a continuous spine of guanines.
Figure 6.
Major and minor groove view of the packing and stacking topology in the internal loops of (top) (rGCUGAGGCU)2 and (bottom) (rGCGGAUGCU)2. The loop bases are represented as transparent van der Waals spheres (front) and sticks (back). The color scheme is indicated in the sequence below each structure. Minimized average structures are shown. Hydrogens and non-bridging oxygen atoms have been removed to emphasize base ring overlap.
Figure 7.
Major and minor groove functional group disposition in the internal loops of (top) (rGCUGAGGCU)2 and (bottom) (rGCGGAUGCU)2. Notice the differences in the number of hydrogen bond acceptors in the major grooves of (rGCUGAGGCU)2 and (rGCGGAUGCU)2. In both (rGCUGAGGCU)2 and (rGCGGAUGCU)2, the major grooves could provide a suitable molecular surface for ligand docking, tertiary, or quaternary interactions. The loop bases are represented as transparent van der Waals spheres (front) and sticks (back). Hydrogens have been removed for ease of view; therefore GH1 and GH21/2 functional groups are denoted by N1 and N2, respectively. The atom color code is C (light gray), N (blue), and O (red). The phosphate backbone is depicted as a gray tube.
The internal loop of (rGCUGAGGCU)2 is characterized by different functional groups in the major and minor grooves. Due to the U3/G4/G4*/U3* stacking pattern, the major groove has four adjacent carbonyl groups and two adjacent N7 groups providing six hydrogen bond acceptors in the major groove (Figure 7). This arrangement could serve as a docking site for metal ions, protein side chains, or small molecules. Conversely, the minor groove is characterized by a tandem AA arrangement flanked by guanosines from the closing GU pairs. The stacking topology of the adenines results in the N1-C2-N3 base edges forming the minor groove. It has been suggested that this type of stacking pattern could provide the molecular determinants for the formation of A-minor or other tertiary interactions (10). As expected for a GU wobble pair, the G6/6* amino groups (denoted as H22 in Figure 7) protrude into the minor groove capping the side-by-side AA arrangement.
The G3/G4/G4*/G3* stacking pattern in the major groove of (rGCGGAUGCU)2 places eight hydrogen bond acceptors in the major groove, two more than for (rGCUGAGGCU)2 (Figure 7). The minor groove surface of (rGCGGAUGCU)2 is similar to (rGCUGAGGCU)2 locally due to the comparable side-by-side stacking arrangement of the adenines. As a result of the to nearest neighbor switch in going from (rGCUGAGGCU)2 to (rGCGGAUGCU)2, the tandem AA stack in the minor groove of (rGCGGAUGCU)2 is capped by U6/6*O2 instead of G6/G6* amino groups, however. Thus, the minor grooves of (rGCUGAGGCU)2 and (rGCGGAUGCU)2 are characterized by different hydrogen bond readouts, which offers different specificity for ligands or tertiary interactions.
Context Dependent Hydrogen Bonding Interactions
Although explicit hydrogen bonds were not measured or restrained for the GA pairs, a number hydrogen bonding contacts are discernable from the structures (Figure 8). For (rGCUGAGGCU)2, the GA sheared pairs are stabilized by hydrogen bond interactions involving G4/4* N3 to A5*/5 H61 (2.9 Å) and G4/4* H21 to A5*/5 N7 (2.0 Å), consistent with other examples of sheared GA pairs (10, 27, 37, 46). Base to sugar-phosphate backbone hydrogen bonds are also observed. The detailed nature of these interactions is predicated on the G4 sugar pucker. In (rGCUGAGGCU)2, the G4 ribose displayed intermediate (~ 5 Hz) H1′ to H2′ scalar couplings indicative of sampling between C2′ and C3′-endo sugar conformers (data not shown). As a result, the sugar pucker for G4 was restrained to cover both the C2′ and C3′-endo conformations. For all 50 calculated structures, the G4 sugar pucker was in the C3′-endo conformation. The C3′-endo conformation of G4 results in a G4/4* H22 to A5*/5 O2P hydrogen bond (2.5 Å). Similar hydrogen bonds have been observed in other sheared GA pairs and may add stability to the loops (27, 46, 64, 65, 68).
Figure 8.
Hydrogen bond network in the GA sheared pairs of (left) (rGCUGAGGCU)2 and (right) (rGCGGAUGCU)2. The donor and acceptor atoms are rendered as white spheres. The hydrogen bonds are shown as dashed lines connecting the atoms along with the average distance. The detailed hydrogen bond network is predicated on the G4 sugar. In (rGCUGAGGCU)2 the G is displayed as C3′-endo (N) but is likely dynamic (see text for details). In (rGCGGAUGCU)2 the G4 sugar is restricted to the C2′-endo (S) conformation.
In (rGCGGAUGCU)2, the base-to-base hydrogen bond interactions for the GA pairs are comparable to those in (rGCUGAGGCU)2. The average distances between G4/4* N3 to A5*/5 H61 and G4/4* H21 to A5*/5 N7 are 2.1 Å and 1.9 Å, respectively. Thus, the two internal loops form congruent sheared GA pairs stabilized by similar base-to-base hydrogen bond contacts.
Base-to-backbone hydrogen bonds in (rGCGGAUGCU)2 differ from those in (rGCUGAGGCU)2, however. The G4 sugar pucker was restrained to the C2′-endo conformation on the basis of the > 10 Hz G4 H1′ to H2′ cross-peak in the DQFCOSY spectrum (data not shown). As a result, the G4H22 to A5*O2P interaction observed in (rGCUGAGGCU)2 is lost and supplanted with G4H1 to G4*O2′ (3.4 Å) and G4H21 to G4*O2′ (2.0 Å). A similar hydrogen bond interaction has been observed for the internal loop where the equivalent of G4 is in a C2′-endo conformation (37).
Context Dependent Structural Differences in the GU Closures
Perhaps the most striking difference between (rGCUGAGGCU)2 and (rGCGGAUGCU)2 is the base pairing configuration of the closing/adjacent GU pairs. In (rGCUGAGGCU)2, the closing GU pairs form a canonical wobble configuration. This is experimentally verified by the strong NOE between the imino resonances of G3/3* and U6*/6 and the upfield shift (~ 6.5 ppm) of the observable G6/6* amino protons (Figure 2), which is expected for a GU wobble pair as the G amino group is not in a hydrogen bond. The GU pairs in (rGCGGAUGCU)2, however, do not form a canonical wobble configuration. The GU pairs in (rGCGGAUGCU)2 are stabilized by a G3/3*H22 to U6*/6O4 single hydrogen bond (Figure 4). SantaLucia et al. observed an identical non-canonical GU pair in the 690 hairpin from E. coli 16S rRNA modeled with the NMR construct 5′-GGCGGUGAAAUGCC-3′, where the bold nucleotides are G690 and U697 which are paired (70). In their NMR spectra, the G690H1 to U697H3 NOE is also missing, although the G690H1 resonance is as sharp as the stem iminos and the U697H3 resonance is broadened by solvent exchange (70). The adjacent GA pair is also sheared. An analogous base pairing configuration for the 690 hairpin is found in the E. coli ribosome crystal structure (71). In the context of the ribosome, the 690 loop is involved in tertiary and quaternary contacts with the 790 hairpin loop and the S11 protein. Evidently, the nearest neighbor forms a noncanonical ordered structure that is not predicated on tertiary or quaternary interactions, as similar structures are found for the nearest neighbor in the context of an isolated hairpin and internal loop.
The different hydrogen bonding patterns for (rGCUGAGGCU)2 and (rGCGGAUGCU)2 suggest that a competition between stacking and hydrogen bonding determines the structure and that hydrogen bonding is not dominant. Figure 9 shows a comparison of the base overlap of the and nearest neighbor interactions in the NMR structure of (rGCGGAUGCU)2 modeled with SHB4 and GU wobble restraints. The and 14 nearest neighbors taken from the NMR structure of the 690 hairpin loop of E. coli 16S rRNA are also shown for comparison (70). At the step, the helix is overwound 3′ to the CG pair more in the wobble model compared to the SHB4 model. This results in a cross-strand stack of the Gs in the wobble conformation at the step, which leaves the C unstacked on its 3′ side, and a diminution of the G stacking at the step compared to SHB4 (Figure 9). In the SHB4 structure, G3 is rotated towards the major groove and U6 is shifted to the minor groove. This subtle perturbation of G3 and U6 provides more extensive overlap with the adjacent GA pair. Presumably, the more favorable stacking in the SHB4 structure compensates for the difference in hydrogen bonds compared to the GU wobble conformation. As shown in Figure 9, a similar stacking arrangement is observed for the nearest neighbor in the 690 hairpin loop of E. coli 16S rRNA (70, 71). Thus, the base stacking interactions for the nearest neighbors are important determinants in fine-tuning the pairing configuration for this internal loop.
Figure 9.
Comparison of the base stacking differences for the and nearest neighbors in modeled as a SHB4 pair (top) and wobble GU pair (middle). The NMR structure of the E. coli 690 hairpin loop (bottom) is also shown. The base stacks are viewed down the helical axis with the lower base pair rendered as spheres and the upper base pair as sticks. Notice that in the SHB4 model the GU pair overlaps more extensively with the adjacent GA pair compared to the corresponding bases in the GU wobble model. Only the base atoms are shown. The color scheme for the CG, GU, and GA pairs are the same for O (red), N (blue), and H (white), but differ for C (gray for CG, brown for GU, and black for GA pairs).
The two least stable tandem GA internal loops, and (Table 1), both have less than optimal hydrogen bonds given their hydrogen bonding capacity. In , this is manifested as stretched UA pairs with distances of 2.8 and 3.0 Å for the AH61 to UO4 and AN1 to UH3 hydrogen bonds, respectively (37). Presumably, optimal hydrogen bonding would give less favorable free energy due to less favorable stacking. This implies that hydrogen bonds are not dominant in determining local geometry. Local geometry is a competition between stacking and hydrogen bonding. This type of competition has also been suggested for tandem GU pairs (72) and is found in molecular dynamics simulations of of them (41). The results presented here provide more benchmarks for testing simulations (39, 42).
Sugar pucker correlates with stability
It has been suggested that the stability increment of the adjacent pair determines the sugar pucker of the G in tandem sheared GA pairs (73). For instance, in the G in the tandem GA pair has a C3′-endo (N) conformation (27). Switching to results in a G sugar pucker change to C2′-endo (S) (37). Moreover, in the internal loop from the Neurospora VS ribozyme the G that is 3′ to the CA pair is in the C2′-endo (S) conformation while the G that is 3′ to the CG pair is in equilibrium between C3′-endo (N) and C2′-endo (S) (73). The results for and agree with this hypothesis (Table 1). Evidently, stacking and base to backbone hydrogen bonding interactions are strong enough to affect sugar pucker as well as hydrogen bonding. This is not surprising because the free energy difference between C3′-endo and C2′-endo sugars is about 0.5 kcal/mol (74, 75) whereas the base to base hydrogen bonds in RNA typically enhance stability by about 1 kcal/mol (68, 76, 77), and stacking interactions can contribute up to 2 kcal/mol to stability (78).
Correlation with occurences in rRNA
It is probable that the structural polymorphism and thermodynamic stability of GU pairs are related to their phylogenetic occurrence, as the motif occurs more often than the motif (44). In a survey of 30 of 23S and 20 of 16S rRNA secondary structures (79), the motif occurs 394 times, whereas the motif occurs only 47 times. The average number of appearances of the motif per secondary structure for 23S and 16S rRNA are 11.4 and 2.8, respectively. Conversely, the average number of appearances of the motif per secondary structure for 23S and 16S rRNA are 0.5 and 1.7, respectively. Evidently, there is selective pressure to maintain the nearest neighbor interaction relative to , which may be related to the different stability and structure of compared to . Of the 50 rRNA structures surveyed, the internal loop occurs in the 23S rRNA subunits of Micrococcus luteus, Listeria monocytogenes, and Lactococcus lactis. Conversely, the internal loop was not observed in the 50 structures. A internal loop is found in the 5′ half of the S. cerevisiae 23S rRNA, however.
CONCLUSIONS
The local structure and stability of internal loops are highly sequence dependent, which presents a challenge to predicting these properties from sequence information alone. The structural polymorphism of tandem pairs depends upon several factors such as the type and number of hydrogen bonds, electrostatics, base stacking interactions, and the stability increment of the closing/adjacent base pair. A delicate balance between these factors determines local structure. This work and others provide theoreticians (39-41) with experimental structures and thermodynamic data to benchmark theories aimed at understanding the interplay between these components.
Supplementary Material
ACKNOWLEGEMENTS
We thank past and present members of the Turner lab for helpful discussions regarding the manuscript. Special recognition goes to Anthony Furstoss for his help with assigning the (GCGGAUGCU)2:cobalt hexamine spectra. We also thank NIGMS for providing a minority supplement to grant GM22939 for BST.
Footnotes
This work was supported by NIH grant GM22939 (D.H.T.)
The coordinates have been deposited as Protein Data Bank entries 2IRN and 2IRO
Abbreviations: CT, total strand concentration; Tm, melting temperature in degrees Celsius
SUPPORTING INFORMATION AVAILABLE
Tables listing chemical shift assignments, distance restraints, structure calculation statistics, and figures of SNOESY, natural abundance 1H-15N HSQC, 1H-13C HSQC, TOCSY, and 1H-31P HETCOR spectra are available free of charge via the Internet at http://pubs.acs.org.
REFERENCES
- 1.Gesteland RF, Cech TR, Atkins JF. The RNA World. Third ed. Cold Spring Harbor Press; Cold Spring Harbor, NY: 2006. [Google Scholar]
- 2.Sevignani C, Calin GA, Siracusa LD, Croce CM. Mammalian microRNAs: a small world for fine-tuning gene expression. Mamm Genome. 2006;17:189–202. doi: 10.1007/s00335-005-0066-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Stark A, Brennecke J, Bushati N, Russell RB, Cohen SM. Animal MicroRNAs confer robustness to gene expression and have a significant impact on 3′UTR evolution. Cell. 2005;123:1133–46. doi: 10.1016/j.cell.2005.11.023. [DOI] [PubMed] [Google Scholar]
- 4.Winkler WC. Riboswitches and the role of noncoding RNAs in bacterial metabolic control. Curr. Opin. Chem. Biol. 2005;9:594–602. doi: 10.1016/j.cbpa.2005.09.016. [DOI] [PubMed] [Google Scholar]
- 5.Mathews DH, Sabina J, Zuker M, Turner DH. Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. J. Mol. Biol. 1999;288:911–40. doi: 10.1006/jmbi.1999.2700. [DOI] [PubMed] [Google Scholar]
- 6.Mathews DH, Disney MD, Childs JL, Schroeder SJ, Zuker M, Turner DH. Incorporating chemical modification constraints into a dynamic programming algorithm for prediction of RNA secondary structure. Proc. Natl. Acad. Sci. U. S. A. 2004;101:7287–92. doi: 10.1073/pnas.0401799101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Xia T, SantaLucia J, Jr., Burkard ME, Kierzek R, Schroeder SJ, Jiao X, Cox C, Turner DH. Thermodynamic parameters for an expanded nearest-neighbor model for formation of RNA duplexes with Watson-Crick base pairs. Biochemistry. 1998;37:14719–35. doi: 10.1021/bi9809425. [DOI] [PubMed] [Google Scholar]
- 8.Washietl S, Hofacker IL, Lukasser M, Huttenhofer A, Stadler PF. Mapping of conserved RNA secondary structures predicts thousands of functional noncoding RNAs in the human genome. Nat. Biotechnol. 2005;23:1383–90. doi: 10.1038/nbt1144. [DOI] [PubMed] [Google Scholar]
- 9.Uzilov AV, Keegan JM, Mathews DH. Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change. BMC Bioinformatics. 2006;7:173. doi: 10.1186/1471-2105-7-173. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Chen G, Znosko BM, Kennedy SD, Krugh TR, Turner DH. Solution structure of an RNA internal loop with three consecutive sheared GA pairs. Biochemistry. 2005;44:2845–56. doi: 10.1021/bi048079y. [DOI] [PubMed] [Google Scholar]
- 11.Jang SB, Baeyens K, Jeong MS, SantaLucia J, Jr., Turner D, Holbrook SR. Structures of two RNA octamers containing tandem G.A base pairs. Acta Crystallogr D Biol Crystallogr. 2004;60:829–35. doi: 10.1107/S0907444904003804. [DOI] [PubMed] [Google Scholar]
- 12.Chen G, Znosko BM, Jiao X, Turner DH. Factors affecting thermodynamic stabilities of RNA 3 × 3 internal loops. Biochemistry. 2004;43:12865–76. doi: 10.1021/bi049168d. [DOI] [PubMed] [Google Scholar]
- 13.Schroeder SJ, Fountain MA, Kennedy SD, Lukavsky PJ, Puglisi JD, Krugh TR, Turner DH. Thermodynamic stability and structural features of the J4/5 loop in a Pneumocystis carinii group I intron. Biochemistry. 2003;42:14184–96. doi: 10.1021/bi0301587. [DOI] [PubMed] [Google Scholar]
- 14.Znosko BM, Burkard ME, Schroeder SJ, Krugh TR, Turner DH. Sheared Aanti.Aanti base pairs in a destabilizing 2 × 2 internal loop: the NMR structure of 5′(rGGCAAGCCU)2. Biochemistry. 2002;41:14969–77. doi: 10.1021/bi020326f. [DOI] [PubMed] [Google Scholar]
- 15.Schroeder SJ, Turner DH. Thermodynamic stabilities of internal loops with GU closing pairs in RNA. Biochemistry. 2001;40:11509–17. doi: 10.1021/bi010489o. [DOI] [PubMed] [Google Scholar]
- 16.Burkard ME, Xia T, Turner DH. Thermodynamics of RNA internal loops with a guanosine-guanosine pair adjacent to another noncanonical pair. Biochemistry. 2001;40:2478–83. doi: 10.1021/bi0012181. [DOI] [PubMed] [Google Scholar]
- 17.Schroeder SJ, Turner DH. Factors affecting the thermodynamic stability of small asymmetric internal loops in RNA. Biochemistry. 2000;39:9257–74. doi: 10.1021/bi000229r. [DOI] [PubMed] [Google Scholar]
- 18.Burkard ME, Turner DH. NMR structures of r(GCAGGCGUGC)2 and determinants of stability for single guanosine-guanosine base pairs. Biochemistry. 2000;39:11748–62. doi: 10.1021/bi000720i. [DOI] [PubMed] [Google Scholar]
- 19.Schroeder SJ, Burkard ME, Turner DH. The energetics of small internal loops in RNA. Biopolymers. 1999;52:157–67. doi: 10.1002/1097-0282(1999)52:4<157::AID-BIP1001>3.0.CO;2-E. [DOI] [PubMed] [Google Scholar]
- 20.Kierzek R, Burkard ME, Turner DH. Thermodynamics of single mismatches in RNA duplexes. Biochemistry. 1999;38:14214–23. doi: 10.1021/bi991186l. [DOI] [PubMed] [Google Scholar]
- 21.Xia T, McDowell JA, Turner DH. Thermodynamics of nonsymmetric tandem mismatches adjacent to G.C base pairs in RNA. Biochemistry. 1997;36:12486–97. doi: 10.1021/bi971069v. [DOI] [PubMed] [Google Scholar]
- 22.Wu M, SantaLucia J, Jr., Turner DH. Solution structure of (rGGCAGGCC)2 by two-dimensional NMR and the iterative relaxation matrix approach. Biochemistry. 1997;36:4449–60. doi: 10.1021/bi9625915. [DOI] [PubMed] [Google Scholar]
- 23.Wu M, Turner DH. Solution structure of (rGCGGACGC)2 by two-dimensional NMR and the iterative relaxation matrix approach. Biochemistry. 1996;35:9677–89. doi: 10.1021/bi960133q. [DOI] [PubMed] [Google Scholar]
- 24.Schroeder S, Kim J, Turner DH. G.A and U.U mismatches can stabilize RNA internal loops of three nucleotides. Biochemistry. 1996;35:16105–9. doi: 10.1021/bi961789m. [DOI] [PubMed] [Google Scholar]
- 25.Wu M, McDowell JA, Turner DH. A periodic table of symmetric tandem mismatches in RNA. Biochemistry. 1995;34:3204–11. doi: 10.1021/bi00010a009. [DOI] [PubMed] [Google Scholar]
- 26.Walter AE, Wu M, Turner DH. The stability and structure of tandem GA mismatches in RNA depend on closing base pairs. Biochemistry. 1994;33:11349–54. doi: 10.1021/bi00203a033. [DOI] [PubMed] [Google Scholar]
- 27.SantaLucia J, Jr., Turner DH. Structure of (rGGCGAGCC)2 in solution from NMR and restrained molecular dynamics. Biochemistry. 1993;32:12612–23. doi: 10.1021/bi00210a009. [DOI] [PubMed] [Google Scholar]
- 28.SantaLucia J, Jr., Kierzek R, Turner DH. Stabilities of consecutive A.C, C.C, G.G, U.C, and U.U mismatches in RNA internal loops: Evidence for stable hydrogen-bonded U.U and C.C.+ pairs. Biochemistry. 1991;30:8242–51. doi: 10.1021/bi00247a021. [DOI] [PubMed] [Google Scholar]
- 29.Peritz AE, Kierzek R, Sugimoto N, Turner DH. Thermodynamic study of internal loops in oligoribonucleotides: symmetric loops are more stable than asymmetric loops. Biochemistry. 1991;30:6428–36. doi: 10.1021/bi00240a013. [DOI] [PubMed] [Google Scholar]
- 30.Znosko BM, Kennedy SD, Wille PC, Krugh TR, Turner DH. Structural features and thermodynamics of the J4/5 loop from the Candida albicans and Candida dubliniensis group I introns. Biochemistry. 2004;43:15822–37. doi: 10.1021/bi049256y. [DOI] [PubMed] [Google Scholar]
- 31.Morse SE, Draper DE. Purine-purine mismatches in RNA helices: evidence for protonated G.A pairs and next-nearest neighbor effects. Nucleic Acids Res. 1995;23:302–6. doi: 10.1093/nar/23.2.302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Butcher SE, Dieckmann T, Feigon J. Solution structure of a GAAA tetraloop receptor RNA. Embo J. 1997;16:7490–9. doi: 10.1093/emboj/16.24.7490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Butcher SE, Allain FH, Feigon J. Solution structure of the loop B domain from the hairpin ribozyme. Nat. Struct. Biol. 1999;6:212–6. doi: 10.1038/6651. [DOI] [PubMed] [Google Scholar]
- 34.Bourdelat-Parks BN, Wartell RM. Thermodynamics of RNA duplexes with tandem mismatches containing a uracil-uracil pair flanked by C.G/G.C or G.C/A.U closing base pairs. Biochemistry. 2005;44:16710–7. doi: 10.1021/bi051659q. [DOI] [PubMed] [Google Scholar]
- 35.Hoogstraten CG, Legault P, Pardi A. NMR solution structure of the lead-dependent ribozyme: evidence for dynamics in RNA catalysis. J. Mol. Biol. 1998;284:337–50. doi: 10.1006/jmbi.1998.2182. [DOI] [PubMed] [Google Scholar]
- 36.Campbell DO, Legault P. Nuclear magnetic resonance structure of the Varkud satellite ribozyme stem-loop V RNA and magnesium-ion binding from chemical-shift mapping. Biochemistry. 2005;44:4157–70. doi: 10.1021/bi047963l. [DOI] [PubMed] [Google Scholar]
- 37.Heus HA, Wijmenga SS, Hoppe H, Hilbers CW. The detailed structure of tandem G.A mismatched base-pair motifs in RNA duplexes is context dependent. J. Mol. Biol. 1997;271:147–58. doi: 10.1006/jmbi.1997.1158. [DOI] [PubMed] [Google Scholar]
- 38.Chen G, Kennedy SD, Qiao J, Krugh TR, Turner DH. An alternating sheared AA pair and elements of stability for a single sheared purine-purine pair flanked by sheared GA pairs in RNA. Biochemistry. 2006;45:6889–903. doi: 10.1021/bi0524464. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Villescas-Diaz G, Zacharias M. Sequence context dependence of tandem guanine:adenine mismatch conformations in RNA: a continuum solvent analysis. Biophys J. 2003;85:416–25. doi: 10.1016/S0006-3495(03)74486-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Sponer J, Mokdad A, Sponer JE, Spackova N, Leszczynski J, Leontis NB. Unique tertiary and neighbor interactions determine conservation patterns of Cis Watson-Crick A/G base-pairs. J. Mol. Biol. 2003;330:967–78. doi: 10.1016/s0022-2836(03)00667-3. [DOI] [PubMed] [Google Scholar]
- 41.Pan Y, Priyakumar UD, MacKerell AD., Jr. Conformational determinants of tandem GU mismatches in RNA: insights from molecular dynamics simulations and quantum mechanical calculations. Biochemistry. 2005;44:1433–43. doi: 10.1021/bi047932q. [DOI] [PubMed] [Google Scholar]
- 42.Yildirim I, Turner DH. RNA challenges for computational chemists. Biochemistry. 2005;44:13225–34. doi: 10.1021/bi051236o. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Heus HA, Hilbers CW. Structures of non-canonical tandem base pairs in RNA helices: review. Nucleosides Nucleotides Nucleic Acids. 2003;22:559–71. doi: 10.1081/NCN-120021955. [DOI] [PubMed] [Google Scholar]
- 44.Gautheret D, Konings D, Gutell RR. A major family of motifs involving G.A mismatches in ribosomal RNA. J. Mol. Biol. 1994;242:1–8. doi: 10.1006/jmbi.1994.1552. [DOI] [PubMed] [Google Scholar]
- 45.Elgavish T, Cannone JJ, Lee JC, Harvey SC, Gutell RR. AA.AG@helix.ends: A:A and A:G base-pairs at the ends of 16 S and 23 S rRNA helices. J. Mol. Biol. 2001;310:735–53. doi: 10.1006/jmbi.2001.4807. [DOI] [PubMed] [Google Scholar]
- 46.SantaLucia J, Jr., Kierzek R, Turner DH. Effects of GA mismatches on the structure and thermodynamics of RNA internal loops. Biochemistry. 1990;29:8813–9. doi: 10.1021/bi00489a044. [DOI] [PubMed] [Google Scholar]
- 47.Leontis NB, Stombaugh J, Westhof E. The non-Watson-Crick base pairs and their associated isostericity matrices. Nucleic Acids Res. 2002;30:3497–531. doi: 10.1093/nar/gkf481. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Wincott F, DiRenzo A, Shaffer C, Grimm S, Tracz D, Workman C, Sweedler D, Gonzalez C, Scaringe S, Usman N. Synthesis, deprotection, analysis and purification of RNA and ribozymes. Nucleic Acids Res. 1995;23:2677–84. doi: 10.1093/nar/23.14.2677. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Usman N, Ogilvie KK, Jiang MY, Cedergren RJ. Automated chemical synthesis of long oligoribonucleotides using 2′-O-silylated ribonucleoside 3′-O-phosphoramidites on a controlled-pore-glass support: Sythesis of a 43 nucleotide sequence similar to the 3′-half molecule of an Escherichia coli formylmethionine tRNA. J. Am. Chem. Soc. 1987;109:7845–7854. [Google Scholar]
- 50.McDowell JA, Turner DH. Investigation of the structural basis for thermodynamic stabilities of tandem GU mismatches: solution structure of (rGAGGUCUC)2 by two-dimensional NMR and simulated annealing. Biochemistry. 1996;35:14077–89. doi: 10.1021/bi9615710. [DOI] [PubMed] [Google Scholar]
- 51.Borer PN, Dengler B, Tinoco I, Jr., Uhlenbeck OC. Stability of ribonucleic acid double-stranded helices. J. Mol. Biol. 1974;86:843–53. doi: 10.1016/0022-2836(74)90357-x. [DOI] [PubMed] [Google Scholar]
- 52.Delaglio F, Grzesiek S, Vuister GW, Zhu G, Pfeifer J, Bax A. NMRPipe: a multidimensional spectral processing system based on UNIX pipes. J. Biomol. NMR. 1995;6:277–93. doi: 10.1007/BF00197809. [DOI] [PubMed] [Google Scholar]
- 53.Goddard TD, Kneller DG. Sparky 3 – NMR assignment and integration software. 2004. [Google Scholar]
- 54.Klosterman PS, Shah SA, Steitz TA. Crystal structures of two plasmid copy control related RNA duplexes: An 18 base pair duplex at 1.20 A resolution and a 19 base pair duplex at 1.55 A resolution. Biochemistry. 1999;38:14784–92. doi: 10.1021/bi9912793. [DOI] [PubMed] [Google Scholar]
- 55.Gorenstein DG. Phosphorus-31 NMR: Principles and Applications. Academic Press; Orlando, Fl: 1984. [Google Scholar]
- 56.Brunger AT, Adams PD, Clore GM, DeLano WL, Gros P, Grosse-Kunstleve RW, Jiang JS, Kuszewski J, Nilges M, Pannu NS, Read RJ, Rice LM, Simonson T, Warren GL. Crystallography & NMR system: A new software suite for macromolecular structure determination. Acta. Crystallogr. D. Biol. Crystallogr. 1998;54(Pt 5):905–21. doi: 10.1107/s0907444998003254. [DOI] [PubMed] [Google Scholar]
- 57.DeLano WL. The PyMol Molecular Graphics System. 2002. [Google Scholar]
- 58.Lu XJ, Olson WK. 3DNA: a software package for the analysis, rebuilding and visualization of three-dimensional nucleic acid structures. Nucleic Acids Res. 2003;31:5108–21. doi: 10.1093/nar/gkg680. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Burkard ME, Kierzek R, Turner DH. Thermodynamics of unpaired terminal nucleotides on short RNA helixes correlates with stacking at helix termini in larger RNAs. J. Mol. Biol. 1999;290:967–82. doi: 10.1006/jmbi.1999.2906. [DOI] [PubMed] [Google Scholar]
- 60.Longfellow CE, Kierzek R, Turner DH. Thermodynamic and spectroscopic study of bulge loops in oligoribonucleotides. Biochemistry. 1990;29:278–85. doi: 10.1021/bi00453a038. [DOI] [PubMed] [Google Scholar]
- 61.Varani G, Aboulela F, Allain FHT. NMR investigation of RNA structure. Prog. Nucl. Magn. Reson. Spectrosc. 1996;29:51–127. [Google Scholar]
- 62.Kieft JS, Tinoco I., Jr. Solution structure of a metal-binding site in the major groove of RNA complexed with cobalt (III) hexammine. Structure. 1997;5:713–21. doi: 10.1016/s0969-2126(97)00225-6. [DOI] [PubMed] [Google Scholar]
- 63.Rudisser S, Tinoco I., Jr. Solution structure of Cobalt(III)hexammine complexed to the GAAA tetraloop, and metal-ion binding to G.A mismatches. J. Mol. Biol. 2000;295:1211–23. doi: 10.1006/jmbi.1999.3421. [DOI] [PubMed] [Google Scholar]
- 64.Jucker FM, Heus HA, Yip PF, Moors EH, Pardi A. A network of heterogeneous hydrogen bonds in GNRA tetraloops. J. Mol. Biol. 1996;264:968–80. doi: 10.1006/jmbi.1996.0690. [DOI] [PubMed] [Google Scholar]
- 65.Heus HA, Pardi A. Structural features that give rise to the unusual stability of RNA hairpins containing GNRA loops. Science. 1991;253:191–4. doi: 10.1126/science.1712983. [DOI] [PubMed] [Google Scholar]
- 66.Wijmenga SS, Kruithof M, Hilbers CW. Analysis of 1H chemical shifts in DNA: assesment of reliability of 1H chemical shift calculations for use in structure refinement. J. Biomol. NMR. 1997;10:337–350. doi: 10.1023/A:1018348123074. [DOI] [PubMed] [Google Scholar]
- 67.Cromsigt JAMTC, Hilbers CW, Wijmenga SS. Prediction of proton chemical shifts in RNA. Their use in structure refinement and validation. J. Biomol. NMR. 2001;21 doi: 10.1023/a:1011914132531. [DOI] [PubMed] [Google Scholar]
- 68.SantaLucia J, Jr., Kierzek R, Turner DH. Context dependence of hydrogen bond free energy revealed by substitutions in an RNA hairpin. Science. 1992;256:217–9. doi: 10.1126/science.1373521. [DOI] [PubMed] [Google Scholar]
- 69.Lescoute A, Leontis NB, Massire C, Westhof E. Recurrent structural RNA motifs, Isostericity Matrices and sequence alignments. Nucleic Acids Res. 2005;33:2395–409. doi: 10.1093/nar/gki535. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Morosyuk SV, Cunningham PR, SantaLucia J., Jr. Structure and function of the conserved 690 hairpin in Escherichia coli 16 S ribosomal RNA. II. NMR solution structure. J. Mol. Biol. 2001;307:197–211. doi: 10.1006/jmbi.2000.4431. [DOI] [PubMed] [Google Scholar]
- 71.Schuwirth BS, Borovinskaya MA, Hau CW, Zhang W, Vila-Sanjurjo A, Holton JM, Cate JH. Structures of the bacterial ribosome at 3.5 A resolution. Science. 2005;310:827–34. doi: 10.1126/science.1117230. [DOI] [PubMed] [Google Scholar]
- 72.Chen X, McDowell JA, Kierzek R, Krugh TR, Turner DH. Nuclear magnetic resonance spectroscopy and molecular modeling reveal that different hydrogen bonding patterns are possible for G.U pairs: one hydrogen bond for each G.U pair in r(GGCGUGCC)(2) and two for each G.U pair in r(GAGUGCUC)(2) Biochemistry. 2000;39:8970–82. [PubMed] [Google Scholar]
- 73.Michiels PJ, Schouten CH, Hilbers CW, Heus HA. Structure of the ribozyme substrate hairpin of Neurospora VS RNA: a close look at the cleavage site. RNA. 2000;6:1821–32. doi: 10.1017/s1355838200001394. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Brameld KA, Goddard WA. Ab Initio Quantum Mechanical Study of the Structures and Energies for the Pseudorotation of 5′-Dehydroxy Analogues of 2′-Deoxyribose and Ribose Sugars. J. Am. Chem. Soc. 1999;121:985–993. [Google Scholar]
- 75.Foloppe N, MacKerell AD., Jr. Intrinsic conformational properties of deoxyribonucleosides: implicated role for cytosine in the equilibrium among the A, B, and Z forms of DNA. Biophys. J. 1999;76:3206–18. doi: 10.1016/S0006-3495(99)77472-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Turner DH, Sugimoto N, Kierzek R, Dreiker SD. Free Energy Increments for Hydrogen Bonds in Nucleic Acid Base Pairs. J. Am. Chem. Soc. 1987;109:3783–5. [Google Scholar]
- 77.SantaLucia J, Kierzek R, Turner DH. Functional Group Substitutions as Probes for Hydrogen Bonding between GA Mismatches in RNA Internal Loops. J. Am. Chem. Soc. 1991;113:4313–22. [Google Scholar]
- 78.Turner DH. Conformational Changes. In: Bloomfield VA, Crothers DM, Tinoco I Jr., editors. Nucleic Acids: Structures, Properties, and Functions. University Science Books; Sausalito, CA: 2000. pp. 259–334. [Google Scholar]
- 79.Cannone JJ, Subramanian S, Schnare MN, Collett JR, D’Souza LM, Du Y, Feng B, Lin N, Madabusi LV, Muller, K M, Pande N, Shang Z, Yu N, Gutell RR. The comparative RNA web (CRW) site: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs. BMC Bioinformatics. 2002;3:2. doi: 10.1186/1471-2105-3-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Gralla J, Crothers DM. Free energy of imperfect nucleic acid helices. 3. Small internal loops resulting from mismatches. J. Mol. Biol. 1973;78:301–19. doi: 10.1016/0022-2836(73)90118-6. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.









