Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2022 May 7;50(9):5251–5262. doi: 10.1093/nar/gkac261

Nearest neighbor rules for RNA helix folding thermodynamics: improved end effects

Jeffrey Zuber 1,2, Susan J Schroeder 2,2, Hongying Sun 3,4, Douglas H Turner 5,6, David H Mathews 7,8,9,
PMCID: PMC9122537  PMID: 35524574

Abstract

Nearest neighbor parameters for estimating the folding stability of RNA secondary structures are in widespread use. For helices, current parameters penalize terminal AU base pairs relative to terminal GC base pairs. We curated an expanded database of helix stabilities determined by optical melting experiments. Analysis of the updated database shows that terminal penalties depend on the sequence identity of the adjacent penultimate base pair. New nearest neighbor parameters that include this additional sequence dependence accurately predict the measured values of 271 helices in an updated database with a correlation coefficient of 0.982. This refined understanding of helix ends facilitates fitting terms for base pair stacks with GU pairs. Prior parameter sets treated 5′GGUC3′ paired to 3′CUGG5′ separately from other 5′GU3′/3′UG5′ stacks. The improved understanding of helix end stability, however, makes the separate treatment unnecessary. Introduction of the additional terms was tested with three optical melting experiments. The average absolute difference between measured and predicted free energy changes at 37°C for these three duplexes containing terminal adjacent AU and GU pairs improved from 1.38 to 0.27 kcal/mol. This confirms the need for the additional sequence dependence in the model.

INTRODUCTION

Over 80% of the human genome is transcribed into RNA, but <3% of the RNA codes for proteins (1,2). Functions for most RNA in the biosphere are still being discovered but already include catalysis (3), control of transcription, translation and expression (4–6), templating for synthesis of DNA (7) and RNA (8), recognition of sites for modification and editing (9–11) and sometimes combining such functions (12). RNA is the genomic material for many viruses, including human pathogens such as SARS and SARS-CoV-2, influenza, HIV, Ebola and Hepatitis C. RNA can also be the basis for vaccines against some of these viruses. For example, mRNA vaccines are effective against SARS-CoV-2 infections (13).

RNA sequence determines the base pairing and 3D structure as well as function of the RNA. Prediction of secondary structure, i.e. the canonical set of Watson–Crick-Franklin (WCF) and GU base pairs, from sequence is a first step in predicting 3D structure (14) and in finding RNAs with common structures and functions (15,16). Some RNA, such as riboswitches, have more than one structure, and the ability to change structure is critical to function (5).

Secondary structure can be predicted from one or more sequences by minimizing free energy change for folding, ΔG°, often augmented with information from chemical mapping and/or sequence comparison. Usually, about half the nucleotides in structured non-coding RNA are canonically paired (17,18). GU pairs play important roles in RNA structure and function as sites for binding metal ions (19,20), therapeutics (21), proteins or metabolites (22).

A database of thermodynamic measurements for helices with canonical pairs and model non-canonical motifs forms the foundation for folding free energy predictions of RNA structure. These data are then fit to a nearest neighbor (NN) model to estimate parameters that can be used to predict folding stabilities of any RNA secondary structure (23). Hallmarks of the NN model are that each stability increment depends on local sequence and that total stability is the sum of the increments.

The model and parameters for approximating stabilities of WCF base-paired helixes have not changed substantially since 1998 (24,25). Individual parameters for nearest neighbors containing at least one GU pair, however, were revised on the basis of new data (25). In that revision, the penalty of 0.45 kcal/mol applied to terminal AU pairs and previously assumed for terminal GU pairs (24), was found unnecessary for GU pairs. Expansion of the database for duplexes, particularly those with terminal GU pairs (26) and the data presented here, make possible more extensive considerations of terminal effects on base pair stability. In particular, the data allow expansion of the model to include six new parameters specific for terminal nearest neighbors, i.e. sequence-specific terms for the ends of helices that account for the last and penultimate base pairs. Surprisingly, stabilities of terminal GU and AU pairs depend on whether the neighboring pair is a GC, AU or GU pair. Incorporating this effect in the NN model also produces significant revision of parameters for internal Inline graphic and Inline graphic nearest neighbors, where stacks are shown for a top strand in the 5′ to 3′ direction pairing to a bottom strand in the opposite direction. With these changes, Inline graphic fits the NN model rather than being an outlier as considered previously (27). Thus, the new model presented here will be especially important for predicting structures containing GU pairs.

It is not surprising that GU pairs are more idiosyncratic than WCF pairs. Guanine has more hydrogen bonding groups and a larger dipole moment than other bases (28). Base stacking and hydrogen bonding that stabilize GU pairs can vary depending on local context, including position in a helix. Base stacking depends on interactions with both bases of a nearest neighbor. GU pairs can adopt different hydrogen bonded configurations and stacking interactions (Figure 1) (22). In Figure 1A, the terminal GU pair in the foreground is in a Inline graphic nearest neighbor and has a single hydrogen bond while the penultimate UG pair has two hydrogen bonds. The conformation of the terminal GU pair may be influenced by solvent interactions or crystal contacts through stacking interactions with the terminal GU pair of an adjacent molecule. In Figure 1B, the Inline graphic nearest neighbor is flanked by WCF GC pairs on both sides in the middle of the helix, i.e. Inline graphic In that context, both the GU pairs have a hydrogen bond from the G carbonyl to the U imino proton, a bifurcated hydrogen bond between the U carbonyl at C2 and the G imino and amino protons, and extensive cross-strand guanine stacking. In contrast, when the sequence is reversed, i.e. 5′…CGUG… compared to 5′…GUGC…, in the self-complementary duplex (Figure 1C), each GU pair has a single bifurcated hydrogen bond, and there is no cross-strand stacking.

Figure 1.

Figure 1.

GU pair stacking and hydrogen bonding for three contexts of tandem GU pairs. Three distinct patterns of hydrogen bonding (shown in green) are observed in these three examples of cis Watson–Crick/Watson–Crick pairs by the Leontis and Westhof nomenclature (152). (A) X-ray crystal structure (1.4 Å resolution, Rfree = 20.7%) with terminal stacking GU pairs in 5′UGCUCCUAGUACGUAAGGACCGGAGUG, PDB ID# 1MSY (153). Nucleotides in bold in the sequence are shown. The nucleotides with gold carbon atoms are in the forefront and these constitute the terminal base pair, while nucleotides with gray carbon atoms are in the back. The crystal packing has the terminal GU pairs of two molecules stacking on each other. Here, the GU pair, although cis Watson–Crick/Watson–Crick has a single hydrogen bond. (B) NMR structure with internal UG pairs in (5′GAGUGCUC)2, PDB ID# 1EKA (137). 28 unique NOE measurements define the orientation for these nucleotides. (C) NMR structure with internal GU pairs in (5′GGCGUGCC)2. PDB ID# 1EKD (137). 26 unique NOE measurements define the orientation for these nucleotides.

Reported thermodynamic stabilities for the internal NN GU stacks in Figure 1B and C reflect the different nucleotide configurations and effect of considering terminal effects. The ΔG°37 of Inline graphic andInline graphic motifs are –0.38 and –0.19 kcal/mol, respectively, in the new model as compared to –0.57 and + 0.72 kcal/mol in a prior model (25). The new parameters also include an increment of –0.74 kcal/mol for terminal consecutive GU pairs, i.e. a GU end on a GU pair, like those shown in Figure 1A. In contrast, previous models did not add favorable folding free energy for this sequence motif. In many NMR structures with terminal GU and AU pairs, these pairs show more dynamic behavior relative to internal pairs. This is observed as broad imino proton resonances and fewer NOE restraints (26,29–33). Thus, sequence orientation, stacking interactions, hydrogen bonding and nucleotide dynamics are important factors in the structure and stabilities of GU pairs. They are now more accurately accounted for in the new thermodynamic parameter set. This should improve predictions of secondary structure from sequence.

MATERIALS AND METHODS

Optical melting experiment database

For this analysis, optical melting experiments were compiled through an extensive literature review (25,26,29,34–62). Enumeration of all melting experiments included in this analysis is available in Supplementary Table S1 and in the spreadsheet provided in the supplementary materials. Experiments are included if only unmodified nucleotides are present and buffer has 1 M Na+ with pH between 6.5 and 7.5. Additionally, duplexes that were reported by the original authors to have non-2-state unfolding transitions were excluded from this analysis (43). A total of 223 experiments were included in the fits, with 125 for Watson–Crick-Franklin pair parameters and 98 for GU pair parameters.

Most melting experiments for generating NN parameters have used 1 M Na+. This was initially chosen to assure formation of duplexes rather than hairpins, as also seen for deoxy A–T oligonucleotides (63) and to allow measurements of the concentration dependence of duplexes that could only be synthesized with many AU pairs (64,65). The high melting temperatures dependent on 1 M NaCl became more important when comparisons with calorimetry revealed that interpretation of optical melting improved if sloping upper and lower baselines were considered (66).

The most important result from thermodynamic studies is the relative sequence dependence of nearest neighbor stability. This is expected not to depend on salt conditions because there is no site binding of Na+ to fully base paired RNA (67–72). Local concentrations of mobile cations around large folded RNAs, however, depend on the local charge density of phosphate groups. Manning developed a first order cation condensation model that predicts local concentrations of cations around RNA do not depend on bulk concentrations (73). For A-form double helical and single strand RNA, respectively, the local ‘ion atmosphere’ in the absence of multiple charged cations is predicted to have 1.7 M and 0.4 M of M+ ions (74). They respectively neutralize 0.8 and 0.6 of the phosphate charge. In the absence of M+ cations, M2+ cations are predicted to neutralize 0.9 and 0.8, respectively, of backbone charge. More detailed computations and experiments agree qualitatively with expectations from Manning theory (29,52,58,67,68,70,71,75–77). GU pairs can be the sites of metal ion binding (20,58,78,79), but optical melting experiments of duplexes with consecutive GU pairs did not find differences in stabilities in 1 M Na+ and in 150 mM K+ with 10 mM Mg2+ (26). Together these suggest that, while salt conditions vary between and within cells (80–84), they are unlikely to affect dramatically the relative stabilities of NN canonical pairs.

Feature correlations

Feature correlations were calculated for each model using the R statistical programming language. The resulting correlation matrices were then visualized with the R corrplot library, available at https://github.com/taiyun/corrplot.

Fitting linear models

Parameter models were fit using measured ΔG°37 and ΔH° values for each optical melting experiment. For the fit of WCF stacking parameters, the theoretical contribution of RT ln (2) due to 2-fold symmetry of self-complementary duplexes, was subtracted from the experimentally measured duplex ΔG°37 (24,85). For the fit of GU stacking parameters, the contributions due to sequence symmetry and the WCF stacks from each duplex with any GU base pairs were subtracted from measurements. The calculated ΔG°37 and ΔH° are then used to fit linear models in the R statistical programming language using the base function lm. ΔS° values for the nearest neighbor parameters are calculated from the ΔG°37 and ΔH° values.

To estimate uncertainty in NN parameter values, a covariation analysis was used to account for the dependencies (due to the nested nature of the regressions) and correlation (due to a base pair appearing in up to two neighboring stacks) between parameters (86,87). To perform covariation analysis, the optical melting data were resampled within experimental error (ΔH°σ = 12% ΔH° and ΔS°σ = 13.5% ΔS° (24)). The resampling was performed with the mvrnorm function from the R MASS library (88), which preserves the observed correlation between ΔH° and ΔS° (ρ = 0.9996 (24)). The updated experimental values are then used to recalculate multiple sets of model parameters. The sets of model parameters are then used to calculate average values for each parameter as well as covariation (86,87). The standard errors of regression, which neglect the correlations and the effect of nested regressions, for the NN parameters can be found in Supplementary Table S2.

Leave-one-out analysis

To assess the impact of any one experimental value on the fit models, models were fit in which each experimental value was individually excluded from the fitting data. The root mean square deviations (RMSDs) in parameter values were calculated from the model fit to the full data set to measure the impact of excluding each individual experimental value.

Optical melting experiments to test the revised model

Optical melting experiments were conducted on three additional duplexes, (5′UGUCGAUA)2, (5′AUAGCUGU)2 and (5′AUUCGAGU)2, following standard protocols described in (89). Oligonucleotides were purchased from Integrated DNA Technologies including purification with standard desalting procedures and assessment of purity by mass spectrometry. Oligonucleotides were dissolved in milliQ water, and the absorbance at 260 nm at 80°C was measured. The appropriate amount of oligonucleotide was dried in a speed vac and resuspended in standard melting buffer of 1 M NaCl, 20 mM sodium cacodylate, pH 7, and 0.5 mM Na2EDTA. Optical melting experiments were conducted in a Beckman DU800 UV-Vis spectrometer with a custom sample holder and cuvettes at 0.1 cm and 1.0 cm path lengths. Absorbance vs. temperature was measured at 280 nm. Data was analyzed with Meltwin software (52).

Stacking term counts

An archive of RNA sequences of known secondary structure (18,90) was analyzed to count the number of occurrences of each NN stacking parameter. A Python script was used to parse each structure into individual helices and then to parse each helix into component NN stacking and helix end parameters.

RESULTS

AU end parameters depend on penultimate pair

Prior work demonstrated that multiple GU terminal base pairs impact the stability of helical duplexes (26), and this motivated a reexamination of the treatment of helix ends. New terms to account for the end of a helix were introduced into the NN model. This was done by including a parameter for an AU terminal pair on an AU penultimate pair (not accounting for orientation of the two pairs and therefore applying to Inline graphic, Inline graphic, Inline graphic, or Inline graphichelix ends) and a parameter for an AU terminal pair on a CG penultimate pair (applying to Inline graphic, Inline graphic, Inline graphic, or Inline graphic helix ends). In these terminal stacks, the end of the helix is to the right, a top strand is shown from 5′ to 3′, and pairs are shown to a bottom strand running in the opposite direction. These parameters are applied in addition to the base pair stacking parameter for these end stacks. This model for terminal AU pairs considers the identity but not the orientation of pairs and is consistent with experiments on terminal GU pairs (26). Those experiments revealed that a bonus for multiple terminal GU pairs is largely independent of orientation. Those observations inspired this model.

Comparisons between the updated model and those used in the 1998 and 2004 NN models are in Table 1A. Fitting to the model with the modified helix end parameters resulted in only moderate changes to the WCF stacks. The parameters for intermolecular initiation and the individual NN stacks were all within error of the 1998 parameters for both ΔG°37 and ΔH°. The only significant change was that an AU terminal pair on an AU penultimate pair is more favorable compared to the 1998 and 2004 models.

Table 1A.

The ΔG°37 and for ΔH° nearest neighbor parameters for helices composed of WCF pairs. The base pair stacks are represented with the top strand (5′ to 3′), the slash, and then the bottom strand (3′ to 5′). For example, AC/UG is the stack Inline graphic. The end terms are added in addition to the stacks, with example calculations in Figure 4

New Model§ 1998 Model
Feature Inline graphic (kcal/mol) ΔH° (kcal/mol) ΔS° (eu) Inline graphic (kcal/mol) ΔH°(kcal/mol)
GC/CG –3.46 ± 0.08 –16.52 ± 1.57 –42.13 ± 4.26 –3.42 ± 0.08 –14.88 ± 1.58
CC/GG –3.28 ± 0.08 –13.94 ± 1.18 –34.41 ± 3.58 –3.26 ± 0.07 –13.39 ± 1.24
GA/CU –2.42 ± 0.05 –13.75 ± 1.00 –36.53 ± 3.16 –2.35 ± 0.06 –12.44 ± 1.20
CG/GC –2.33 ± 0.09 –9.61 ± 1.57 –23.46 ± 4.74 –2.36 ± 0.09 –10.64 ± 1.65
AC/UG –2.25 ± 0.06 –11.98 ± 1.17 –31.37 ± 3.86 –2.24 ± 0.06 –11.40 ± 1.23
CA/GU –2.07 ± 0.07 –10.47 ± 1.25 –27.08 ± 3.73 –2.11 ± 0.07 –10.44 ± 1.28
AG/UC –2.01 ± 0.07 –9.34 ± 1.23 –23.66 ± 3.63 –2.08 ± 0.06 –10.48 ± 1.24
UA/AU –1.29 ± 0.08 –9.16 ± 1.71 –25.40 ± 5.55 –1.33 ± 0.09 –7.69 ± 2.02
AU/UA –1.09 ± 0.07 –8.91 ± 1.55 –25.22 ± 4.75 –1.10 ± 0.08 –9.38 ± 1.68
AA/UU –0.94 ± 0.04 –7.44 ± 0.80 –20.98 ± 2.56 –0.93 ± 0.03 –6.82 ± 0.79
Initiation +4.10 ± 0.24 +4.66 ± 3.85 +1.78 ± 11.93 +4.09 ± 0.22 +3.61 ± 4.12
Symmetry +0.43 0 –1.38 +0.43 0
AU End on AU +0.22 ± 0.06 +4.36 ± 1.23 +13.35 ± 3.83 +0.45 ± 0.04 +3.72 ± 0.83
AU End on CG +0.44 ± 0.04 +3.17 ± 0.80 +8.79 ± 2.50 +0.45 ± 0.04 +3.72 ± 0.83

The updated model shows excellent correlations between predicted and measured values of ΔG°37 (R2= 0.9830, Figure 2) and ΔH° (R2= 0.8877, Supplemental Figure S1). The correlations between the model feature frequencies are modest and are mostly limited to expected correlations between stacks that can extend on each other (Supplemental Figure S2). The predicted folding ΔG°37 were within 0.5 kcal/mol of the measured value for 86.4% of the experiments (Supplemental Figure S3). Predicted ΔH° are within 5 kcal/mol of the measured value for 76% of the experiments (Supplemental Figure S3).

Figure 2.

Figure 2.

Correlation between predicted and measured ΔG°37 for duplexes with only WCF pairs. ΔG°37 values predicted from updated nearest neighbor parameters for duplexes composed solely of WCF base pairs (Table 1A) are plotted against values determined from optical melting experiments.

The impact of each optical melting experiment was determined by fitting the NN parameters on a data set that excluded that individual experiment and comparing the resulting parameter values to those fit on the full data set. The root mean squared deviations (RMSDs) in ΔG°37 and ΔH° parameter values for these leave-one-out (LOO) data sets can be seen in Supplemental Figure S4. No one individual experiment heavily impacted the parameter values. The biggest impacts were RMSDs of 0.0363 kcal/mol in ΔG°37 and 0.3346 kcal/mol in ΔH°, substantially smaller than uncertainty in the parameter values.

GU stacking parameters

A similar model for terminal AU and GU stacks was used when fitting duplexes with GU base pairs. The model requires terms for an AU end with a penultimate GU pair, a terminal GU pair with a penultimate AU pair, a terminal GU pair with a penultimate GC pair, and a GU pair with a penultimate GU pair. The orientation of the two pairs is not considered.

Prior GU stack NN parameter sets treated Inline graphic as a special, non-nearest neighbor case. When results from a fitting model including a parameter for the non-nearest neighbor quadruplet Inline graphic, however, were compared to results for a model not including that parameter, the other parameter values were all within uncertainty of each other. Additionally, for each duplex containing Inline graphic, predicted ΔG°37 and ΔH° values from each model were also close to each other and to predicted ΔG°37 and ΔH° values from the 2012 model (25) (Supplementary Table S3). Additionally, the inclusion of the special Inline graphic parameter results in almost identical R2 value for the fit (0.9256 versus 0.9267) (Figure 3 and Supplemental Figure S5). Evidently, a special, non-nearest neighbor parameter is not needed in the updated model when end effects for terminal AU and GU nearest neighbors are accounted for.

Figure 3.

Figure 3.

Correlation between predicted and observed ΔG°37 for duplexes with WCF and GU pairs. ΔG°37 values predicted from parameters in Table 1 plotted against values determined from optical melting experiments.

For the GU internal stacking NN parameters, the most substantial change is for the Inline graphic stack, where the ΔG°37 changed from +0.72 kcal/mol to –0.19 kcal/mol between the previous (25) and new models. An additional increment of –0.74 is added for a terminal Inline graphic stack. The second most substantial change is for the Inline graphic stack, which went from having a ΔG°37 contribution of –0.35 kcal/mol (25) to + 0.02 kcal/mol, but with an additional end increment of –0.31 kcal/mol.

The updated model shows good correlation between predicted and measured values for folding ΔG°37 (R2= 0.9256, Figure 3) and ΔH° (R2= 0.7659, Supplemental Figure S6). Correlations between model feature frequencies are mostly limited to expected correlations between the GU on GU end feature and the three possible stacks that can form that end (Supplemental Figure S7). For folding ΔG°37, 53.1% of experiments had predicted values within 0.5 kcal/mol of the measured value (85.7% were within 1 kcal/mol) (Supplemental Figure S8). For ΔH°, 57.1% of experiments had predictions within 5 kcal/mol of the measured value (79.6% were within 10 kcal/mol) (Supplemental Figure S8).

RMSDs in ΔG°37 and ΔH° parameter values for the LOO data sets can be seen in Supplemental Figure S9. As with the WCF stacking parameters, no one individual experiment heavily impacted the parameter values. The biggest impacts are RMSDs of 0.0641 kcal/mol in ΔG°37 and 0.8675 kcal/mol in ΔH°, smaller than the uncertainty in the parameter values.

Uncertainty in parameter values for the updated NN model presented in Tables 1A and 1B were determined from a covariation analysis, which randomly perturbed the experimental values within experimental uncertainty and calculated the covariance matrix from observed changes in parameter values. We previously found that this approach is important for estimating the uncertainties of folding free energies and enthalpies because the correlated nature of the NN parameters and the use of sequential regressions break the assumptions used in the calculations of standard errors of regression (87). Covariances between parameters are presented in Supplemental Figures S10 and S11. Covariances were generally small, with the strongest interactions between the intermolecular initiation parameter and parameters for individual stacks. There are also weaker interactions between GU end terms and equivalent internal GU stacks that can form that end term. For example, the GU on GU end parameter value is negatively correlated with the values for internal Inline graphic, Inline graphic, and Inline graphic stacks.

Table 1B.

The ΔG°37 and for ΔH° nearest neighbor parameters for stacks with GU pairs

New Model§ 2012 Model *
Feature Inline graphic (kcal/mol) ΔH° (kcal/mol) ΔS°(eu) Inline graphic (kcal/mol) ΔH°(kcal/mol)
GC/UG –2.23 ± 0.07 –14.73 ± 1.44 –40.32 ± 4.60 –2.15 ± 0.10 –11.09 ± 1.78
CU/GG –1.93 ± 0.08 –9.26 ± 1.58 –23.64 ± 5.16 –1.77 ± 0.09 –9.44 ± 1.76
GG/CU –1.80 ± 0.07 –12.41 ± 1.52 –34.23 ± 4.72 –1.80 ± 0.09 –7.03 ± 1.75
CG/GU –1.05 ± 0.07 –5.64 ± 1.47 –14.83 ± 4.57 –1.25 ± 0.09 –5.56 ± 1.68
AU/UG –0.76 ± 0.07 –9.23 ± 1.61 –27.32 ± 5.09 –0.90 ± 0.08 –7.39 ± 1.65
GA/UU –0.60 ± 0.06 –10.58 ± 1.52 –32.19 ± 4.81 –0.51 ± 0.08 –10.38 ± 1.79
UG/GU –0.38 ± 0.07 –8.76 ± 1.74 –27.04 ± 5.21 –0.57 ± 0.19 –12.64 ± 4.01
UA/GU –0.22 ± 0.07 –2.72 ± 1.54 –8.08 ± 4.79 –0.39 ± 0.09 –0.96 ± 1.80
GG/UU –0.20 ± 0.08 –9.06 ± 1.89 –28.57 ± 6.04 –0.25 ± 0.16 –17.82 ± 3.75
GU/UG –0.19 ± 0.08 –7.66 ± 1.80 –24.11 ± 5.81 +0.72 ± 0.19 –13.83 ± 4.21
AG/UU +0.02 ± 0.06 –5.10 ± 1.45 –16.53 ± 4.56 –0.35 ± 0.08 –3.96 ± 1.73
GGUC/CUGG (–3.80 ± 0.13) (–32.49 ± 2.75) (–92.57 ± 11.09) –4.12 ± 0.54 –30.80 ± 8.87
AU End on GU –0.71 ± 0.15 +5.16 ± 2.99 +18.96 ± 9.15 +0.45 ± 0.04 +3.72 ± 0.83
GU End on CG +0.13 ± 0.08 +3.91 ± 1.43 +12.17 ± 4.34 0.00 ± 0.00 0.00 ± 0.00
GU End on AU –0.31 ± 0.06 +3.65 ± 1.37 +12.78 ± 4.23 0.00 ± 0.00 0.00 ± 0.00
GU End on GU –0.74 ± 0.08 +6.23 ± 2.12 +22.47 ± 6.65 0.00 ± 0.00 0.00 ± 0.00

§Uncertainty values were calculated from the experiment covariation analysis. Uncertainty values from standard errors of regression are listed in Supplementary Table S2.

Parameters taken from (24).

The 1998 model did not have separate values for each AU End variant.

*Values taken from (25).

The new model does not have this parameter. The shown value is the result of combining the NN stacks for that sequence.

The 2012 model did not include a parameter for terminal GU base pairs.

Additional melting experiments support the model

Three duplexes, designed to test features of the new model for thermodynamic parameters with terminal AU pairs and penultimate GU pairs, were studied by optical melting. A significant difference between the two models occurs for the end parameter for an AU end on a penultimate GU pair. The new and previous (24) models use values of –0.71 kcal/mol and 0.45 kcal/mol at 37°C, respectively. All three duplexes in Table 2 contain this motif, which was represented in only two duplex sequences in the database of optical melting experiments. The duplexes, (5′UGUCGAUA)2 and (5′AUAGCUGU)2 differ in orientation of the terminal AU pair stacking on the penultimate GU pair. Duplex (5′AUUCGAGU)2 contains the motif Inline graphic, which has values of –0.02 and –0.35 kcal/mol for the new and prior models, respectively. Table 2 compares predictions based on the new and prior model with the experimentally measured thermodynamic values for these three duplexes. The new model predicts that on average the duplexes are 1.60 kcal/mol more stable at 37°C than predicted by the prior model. This equates to an average 13-fold more favorable equilibrium constant for duplex formation. The experimental results confirm improvement of the new model.

Table 2.

Optical melting experimental tests of nearest neighbor parameters. Melting buffer was 1 M NaCl, 20 mM sodium cacodylate pH 7, and 0.5 mM Na2EDTA. a) Predicted ΔG°37 are calculated using Table 1 values for the new model parameters. b) Predicted ΔG°37 are calculated from the previous models described in (25) and (24). The duplex values in italics show borderline apparent 2-state behavior with a difference in ΔH° between the two analyses of 18.2%, rather than the typical <15% criterion

New modela Prior modelb Van’t Hoff plot analysis Curve fit analysis
Duplex −Δ37 (kcal/mol) −Δ37 (kcal/mol) −Δ37 (kcal/mol) −Δ (kcal/mol) −Δ (eu) Tm (°C) −Δ37 (kcal/mol) −Δ (kcal/mol) −Δ (eu) T m (°C)
(5′UGUCGAUA)2 6.02 ± 0.31 4.22 ± 0.37 6.10 ± 0.01 59.32 ± 0.01 171.58 ± 0.01 34.8 6.11 ± 0.18 66.83 ± 3.42 195.78 ± 11.21 39.0
(5′AUAGCUGU)2 6.33 ± 0.29 4.74 ± 0.37 5.81 ± 0.02 44.15 ± 0.01 123.63 ± 0.02 32.0 5.86 ± 0.26 47.88 ± 6.08 135.48 ± 19.18 38.2
(5AUUCGAGU)2 5.54 ± 0.26 4.14 ± 0.36 5.32 ± 0.03 34.58 ± 0.01 94.33 ± 0.02 26.5 5.10 ± 0.29 42.30 ± 2.98 119.94 ± 9.69 32.9

T m values were calculated for an RNA strand concentration of 1 × 10 –4 M.

DISCUSSION

The database of thermodynamic parameters forms the foundation for predictions of RNA structure and function in many widely used software suites (14,91–98). These RNA structure prediction programs enable design of mRNA vaccine sequences (13,99,100), analysis of metaproperties of transcriptomic changes in response to stress (101–103), determination of effects of nucleotide modifications on folding stability (104–106), discovery of accessible regions to target with antisense DNA or siRNA (107–110), and rational design of small molecules targeting RNA (111–114). Curation and improvement of the RNA thermodynamic database facilitates hypothesis-driven RNA research in many fields and has significant impact on the RNA community. The progress reported here expands, compiles, and presents the thermodynamic NN parameters for WCF and GU pairs. Statistical significance of the new parameters is robust. Inclusion of helix-end effects for AU and GU pairs improves predictions of helices with these common motifs and resolves previously poorly understood terms for ‘special cases’ of motifs containing GU pairs. Supplemental Figure S12 shows the improvement in the residuals for the new model compared to the previous.

To illustrate a NN calculation to estimate helix stability, Figure 4 provides two example calculations. The first is the sequence (5′UGUCGAUA)2, with experimental stability provided in Table 2. The second is 5′UAGGUCAG paired with 5′CUGGUCUA. This calculation illustrates that the Inline graphic motif, an outlier in prior nearest neighbor models, is now handled with nearest neighbor stacks. An Excel spreadsheet is provided with the Supplementary Materials to calculate user-inputted helical NN stabilities.

Figure 4.

Figure 4.

Example calculations of helical ΔG°37. Panel (A) shows the stability calculation for (5′UGUCGAUA)2, which is shown in Table 2 to have an experimentally determined ΔG°37 of –6.10 kcal/mol. This sequence is self-complementary and therefore the symmetry penalty is added. Panel (B) shows the stability calculation for 5′UAGGUCAG paired to 5′CUGGUCUA. This demonstrates the difference in treatment for the (GGUC)2 motif. For both sequences, calculations are provided for the current parameters derived here and the previous parameters (24,25). The total stability is the sum of the stability increments.

This work presents the next advance in development of a robust NN model for predicting RNA duplex stabilities. The NN model for ΔG° of RNA helixes composed of canonical pairs uses stacks of adjacent base pairs (115–117). This assumes that the total ΔG° and temperature dependence, ΔH°, for helix formation can be approximated by summing ΔG°s and ΔH°s assigned to nearest neighbors of canonical pairs. The experimental foundation for this approach was laid by Uhlenbeck and Martin in the Doty lab when they used optical melting to measure thermodynamics of duplex formation (64,65). Uhlenbeck and the Tinoco lab used biochemical methods to expand the database of sequences. Because WCF base pairing depends on strong, local hydrogen bonding and stacking interactions, a NN model developed for polynucleotides was tested and found to fit the database (118). This suggested that the NN model would allow predictions for unmeasured sequences (116). In collaboration with related efforts in the Crothers lab, this led to original rules for predicting the thermodynamics of RNA folding (115).

Subsequent insights and research have continually improved success of the NN method. A rotational symmetry term was added to the model to account for the difference between duplexes formed by self- or non-self-complementary strands (43,85,119). Application of T4 RNA ligase and development of chemical synthesis on polymer supports allowed expansion of sequences available (42,43,120,121). Particularly important was addition of duplexes not beginning with multiple AU pairs and having melting temperatures near 37°C, human body temperature. Analysis of the number of parameters allowed by the model (122,123) led to discovery that duplexes with the same nearest neighbors can have different thermodynamics depending on the terminal base pair (24).

Applications of the method were expanded to larger RNAs by modifying dynamic programming algorithms to predict folding that optimized ΔG° (91,124,125) rather than base pairing (92). The thermodynamic approach lends itself to modeling ensembles of structures, including calculations of base pairing probabilities and stochastic samples (98,126,127). Additional applications include predicting structures for multiple interacting strands (128), designing sequences to fold to specific structures (129,130), and integrating mapping or conservation data into structure prediction (131–134). Recently, applications to even larger RNAs have become possible due to linearization of the algorithms (135,136).

NN parameters for canonical pairs have undergone substantial revisions over time, including treatment of end effects (24,25,27,43,116). GU terminal base pairs were initially assumed to be equivalent to AU terminal base pairs and were given the same penalty term (27). Expansion of the database and refitting of the model indicated that GU terminal base pairs do not require an end penalty (25). Measurements on helices with consecutive terminal GU pairs, however, revealed they are surprisingly more stable than predicted (26). Our updated NN model includes new parameters that account for end effects of both AU and GU pairs, including dependence on the penultimate pair.

Context-dependent variation of GU pair conformations (Figure 1) provides a structural rationale for treating terminal GU pairs differently. A fundamental assumption of a NN model is that strong local interactions dominate the energetic contributions determining conformation and stability for a particular nucleotide sequence. This implies that a stack of two WCF pairs will have the same thermodynamic stability in the middle of a helix as at the end of a helix. This NN approximation is consistent with structures of WCF RNA helices and the regular periodic shape of an RNA double helix. The diversity of GU pair conformations and stabilities, however, introduces variation into WCF paired helices. The unique structures of GU pairs facilitate binding recognition and specificity for metal ions, RNA tertiary interactions, protein interactions and drug binding (22). The challenge is incorporating this functionally important and structurally diverse motif into a NN model.

Prior models attempting to combine GU and WCF pairs into one set of thermodynamic parameters always had a few unexplained exceptions. For example, the motif Inline graphic had one NN parameter value with an exception for the motif Inline graphic, which had an extra bonus. Analysis of NMR structures and crystal structures of this motif, however, did not indicate a reason for this additional stability (52,53,137,138). In addition, the crystal structure of consecutive terminal GU pairs in (5′GGUGGCUGUU3′)2 had three slightly different helical conformations in the asymmetric unit but an overall remarkably A-form like structure that did not reveal a physical explanation for exceptional thermodynamic parameters (30). NMR studies of duplexes with consecutive terminal GU pairs usually showed broad resonances and few or weak NOEs in the final two GU pairs (26,30). Fluorescence and NMR studies have quantified different base pair dynamics in the middle and ends of helices for various types of base pairs (139–142). Consistent with this, the new NN model has increments for terminal nearest neighbors to distinguish them from internal nearest neighbors.

Interestingly, these increments are penalties of +0.22 and +0.44 kcal/mol at 37°C for an AU end pair on a penultimate AU or CG pair, respectively (Table 1). A similar penalty of +0.45 kcal/mol has previously been attributed to the presence of one fewer hydrogen bond when duplexes with identical nearest neighbors have two terminal AU pairs rather than terminal GC pairs (24). In contrast, incremental bonuses of –0.31 to –0.74 kcal/mol are assigned to terminal nearest neighbors consisting of an AU and a GU pair or two GU pairs. This would be consistent with a ΔS° bonus due to increased base pair dynamics at the ends of helices. For example, equal populations of three conformations at the end of a helix would provide a ΔG° bonus of –RT ln (3) = –0.68 kcal/mol at 37°C.

The NN approximation is essential for efficient dynamic programming approaches to computing the minimum ΔG° secondary structure for an RNA sequence (91,92). In prior models, the special cases for GU pairs required additional considerations in dynamic programming algorithm computations. In current application of the nearest neighbor parameters (143), a helix end occurs not only at the 5′ and 3′ ends of an RNA molecule but also at every junction, internal loop, hairpin loop, and mismatch or bulge in an RNA secondary structure. GU helix end pairs occur in accepted RNA secondary structures at a rate of approximately 13 per 1000 bases in the sequences (Supplemental Figure S13). They occur at a much higher rate in predicted structural ensembles. Thus, consideration of GU pairs and special rules for positional dependence present a frequent step in the computations. The NN parameter model presented here improves predictions for sequences and structures with terminal AU and GU pairs and will also accelerate computation of the minimum free energy structure for any sequence.

For example, several terminal AU and GU motifs occur in the secondary structure for the Ψ packaging sequence in HIV-1 RNA (21) and the motif Inline graphic binds a novel drug. Each helix in the three-way junction that binds the drug has an AU or GU pair at the end, and the new NN parameters in this work would estimate that the ΔG°37 for these three helices is at least 0.9 kcal/mol more stable than current predictions.

Another recent example is the SL3 helix that forms between the 5′ and 3′ ends of SARS-CoV-2. This helix has been identified experimentally (144) and computationally (145). One end of SL3 terminates in a Inline graphic nearest neighbor. Results in Table 1B assign a ΔG°37 of –0.38–0.74 = –1.12 kcal/mol to this end, which is more stable than previous predictions.

Free energy predictions from nearest neighbors for RNA secondary structures provide the base line for analysis of the stabilities of RNA interactions with drugs and proteins, and thus provide a foundational resource for RNA structure and function studies. Our future analyses will evaluate the impact of the new NN parameters on the thermodynamic parameters for mismatches, internal loops, bulges, and helix junctions. These loop motifs form many of the recognition sites for proteins, metal ions, and therapeutics.

While the parameters in Tables 1A and 1B provide excellent predictions of the measured duplex stabilities at 37°C, there is slightly less agreement for duplexes with WCF and GU pairs (Figures 2 and 3). This is not surprising because the database with GU pairs is smaller than that with only WCF pairs. Additionally, GU pairs are more likely to have different structures (Figure 1) (22). One example of this is (GGCGUGCC)2, where measured and predicted values for ΔG°37 are, respectively, –9.72 and –11.24 kcal/mol. On the basis of NMR and nucleobase substitution results (137), and mesoscopic modeling based on melting temperatures (146), the GU pairs in (GGCGUGCC)2 were determined to have only one H-bond each (Figure 1C) rather than the usual two. That may explain the overprediction of thermodynamic stability.

There are known limitations to the current parameterization of the nearest neighbor rules. These parameters depend on the two-state fits of melting data and assume that the enthalpy and entropy changes are temperature independent. It has been shown that parameters can be determined without assuming two-state melting by fitting directly to the optical melting data (absorbance as a function of temperature) (147), although much of these data are not currently available for the duplexes studied here. It is also known that enthalpy change and entropy change both depend on temperature (148–151). The changes have antagonizing effects with respect to free energy change, however, so folding free energy estimates at 37°C are probably little affected by the temperature dependencies because most strands are designed to have melting temperatures close to 37°C (149). On the other hand, extrapolation of folding free energies to other temperatures and estimates of melting temperatures are likely to be affected by these temperature dependencies. Future work could develop new nearest neighbor parameters that do not rely on these assumptions.

In summary, the updated NN model is consistent with previous parameters for WCF pairs, includes new parameters accounting for increased base pair dynamics at ends for helices ending in AU or GU pairs, improves predictions for duplexes with terminal AU or GU pairs, and resolves a prior exceptional parameter for a specific GU motif. The model for the NN parameters has low uncertainty in ΔG° and ΔH° and low correlations between parameters. The statistically robust model maintains the physical basis that differences in hydrogen bonding, stacking, and nucleotide dynamics determine the sequence dependence of NN base stacks. The new thermodynamic parameters will help improve RNA structure prediction tools and facilitate discoveries in RNA biology, catalysis, and therapeutics.

Supplementary Material

gkac261_Supplemental_Files

Contributor Information

Jeffrey Zuber, Alnylam Pharmaceuticals, Inc., Cambridge, MA 02142, USA.

Susan J Schroeder, Department of Chemistry and Biochemistry, and Department of Microbiology and Plant Biology, University of Oklahoma, Norman, OK 73019, USA.

Hongying Sun, Department of Biochemistry & Biophysics, University of Rochester, Rochester, NY 14642, USA; Center for RNA Biology, University of Rochester, Rochester, NY 14642, USA.

Douglas H Turner, Center for RNA Biology, University of Rochester, Rochester, NY 14642, USA; Department of Chemistry, University of Rochester, Rochester, NY 14627, USA.

David H Mathews, Department of Biochemistry & Biophysics, University of Rochester, Rochester, NY 14642, USA; Center for RNA Biology, University of Rochester, Rochester, NY 14642, USA; Department of Biostatistics & Computational Biology, University of Rochester, Rochester, NY 14642, USA.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

National Institutes of Health [R01GM076485]. Funding for open access charge: National Institutes of Health.

Conflict of interest statement. None declared.

REFERENCES

  • 1. Encode Project Consortium An integrated encyclopedia of DNA elements in the human genome. Nature. 2012; 489:57–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Encode Project Consortium Moore J.E., Purcaro M.J., Pratt H.E., Epstein C.B., Shoresh N., Adrian J., Kawli T., Davis C.A., Dobin Aet al.. Expanded encyclopaedias of DNA elements in the human and mouse genomes. Nature. 2020; 583:699–710. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Doudna J.A., Cech T.R.. The chemical repertoire of natural ribozymes. Nature. 2002; 418:222–228. [DOI] [PubMed] [Google Scholar]
  • 4. Tucker B.J., Breaker R.R.. Riboswitches as versatile gene control elements. Curr. Opin. Struct. Biol. 2005; 15:342–348. [DOI] [PubMed] [Google Scholar]
  • 5. Serganov A., Nudler E.. A decade of riboswitches. Cell. 2013; 152:17–24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Wu L., Belasco J.G.. Let me count the ways: mechanisms of gene regulation by miRNAs and siRNAs. Mol. Cell. 2008; 29:1–7. [DOI] [PubMed] [Google Scholar]
  • 7. Blackburn E.H., Collins K.. Telomerase: an RNP enzyme synthesizes DNA. Cold Spring Harb. Perspect. Biol. 2011; 3:a003558. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Dasgupta A., Baron M.H., Baltimore D.. Poliovirus replicase: a soluble enzyme able to initiate copying of poliovirus RNA. Proc. Natl. Acad. Sci. U.S.A. 1979; 76:2679–2683. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Ohman M., Bass B.. Soll D., Nishimura S., Moore P.B.. RNA. 2001; Pergamon/Elsevier Science; 125–136. [Google Scholar]
  • 10. Simpson L. Gesteland R.F., Cech T.R., Atkins J.F.. The RNA World (3rd edn.). 2006; Cold Spring Harbor Laboratory Press; 401–417. [Google Scholar]
  • 11. Abudayyeh O.O., Gootenberg J.S., Essletzbichler P., Han S., Joung J., Belanto J.J., Verdine V., Cox D.B.T., Kellner M.J., Regev A.et al.. RNA targeting with CRISPR-Cas13. Nature. 2017; 550:280–284. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Lambowitz A.M., Zimmerly S.. Group II introns: mobile ribozymes that invade DNA. Cold Spring Harb. Perspect. Biol. 2011; 3:a003616. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Corbett K.S., Edwards D.K., Leist S.R., Abiona O.M., Boyoglu-Barnum S., Gillespie R.A., Himansu S., Schafer A., Ziwawo C.T., DiPiazza A.T.et al.. SARS-CoV-2 mRNA vaccine design enabled by prototype pathogen preparedness. Nature. 2020; 586:567–571. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Watkins A.M., Rangan R., Das R.. FARFAR2: improved de novo rosetta prediction of complex global RNA folds. Structure. 2020; 28:963–976. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Gruber A.R., Findeiss S., Washietl S., Hofacker I.L., Stadler P.F.. RNAz 2.0: improved noncoding RNA detection. Pac. Symp. Biocomput. 2010; 69–79. [PubMed] [Google Scholar]
  • 16. Fu Y., Xu Z.Z., Lu Z.J., Zhao S., Mathews D.H.. Discovery of novel ncRNA sequences in multiple genome alignments on the basis of conserved and stable secondary structures. PLoS One. 2015; 10:e0130200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Petrov A.I., Kay S.J.E., Gibson R., Kulesha E., Staines D., Bruford E.A., Wright M.W., Burge S., Finn R.D., Kersey P.J.et al.. RNAcentral: an international database of ncRNA sequences. Nucleic Acids Res. 2015; 43:D123–D129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Bellaousov S., Mathews D.H.. ProbKnot: fast prediction of RNA secondary structure including pseudoknots. RNA. 2010; 16:1870–1880. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Strobel S.A., Cech T.R.. Minor groove recognition of the conserved G.U pair at the tetrahymena ribozyme reaction site. Science. 1995; 267:675–679. [DOI] [PubMed] [Google Scholar]
  • 20. Keel A.Y., Rambo R.P., Batey R.T., Kieft J.S.. A general strategy to solve the phase problem in RNA crystallography. Structure. 2007; 15:761–772. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Ding P., Kharytonchyk S., Waller A., Mbaekwe U., Basappa S., Kuo N., Frank H.M., Quasney C., Kidane A., Swanson C.et al.. Identification of the initial nucleocapsid recognition element in the HIV-1 RNA packaging signal. Proc. Natl. Acad. Sci. U.S.A. 2020; 117:17737–17746. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Westhof E., Yusupov M., Yusupova G.. The multiple flavors of GoU pairs in RNA. J. Mol. Recognit. 2019; 32:e2782. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Andronescu M., Condon A., Turner D.H., Mathews D.H.. The determination of RNA folding nearest neighbor parameters. Methods Mol. Biol. 2014; 1097:45–70. [DOI] [PubMed] [Google Scholar]
  • 24. Xia T., SantaLucia J. Jr, Burkard M.E., Kierzek R., Schroeder S.J., Jiao X., Cox C., Turner D.H.. Thermodynamic parameters for an expanded nearest-neighbor model for formation of RNA duplexes with watson-crick base pairs. Biochemistry. 1998; 37:14719–14735. [DOI] [PubMed] [Google Scholar]
  • 25. Chen J.L., Dishler A.L., Kennedy S.D., Yildirim I., Liu B., Turner D.H., Serra M.J.. Testing the nearest neighbor model for canonical RNA base pairs: revision of GU parameters. Biochemistry. 2012; 51:3508–3522. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Nguyen M.T., Schroeder S.J.. Consecutive terminal GU pairs stabilize RNA helices. Biochemistry. 2010; 49:10574–10581. [DOI] [PubMed] [Google Scholar]
  • 27. Mathews D.H., Sabina J., Zuker M., Turner D.H.. Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. J. Mol. Biol. 1999; 288:911–940. [DOI] [PubMed] [Google Scholar]
  • 28. Sponer J., Leszczynski J., Hobza P.. Electronic properties, hydrogen bonding, stacking, and cation binding of DNA and RNA bases. Biopolymers. 2001; 61:3–31. [DOI] [PubMed] [Google Scholar]
  • 29. Schroeder S.J., Turner D.H.. Factors affecting the thermodynamic stability of small asymmetric internal loops in RNA. Biochemistry. 2000; 39:9257–9274. [DOI] [PubMed] [Google Scholar]
  • 30. Gu X., Mooers B.H., Thomas L.M., Malone J., Harris S., Schroeder S.J.. Structures and energetics of four adjacent G.U pairs that stabilize an RNA helix. J. Phys. Chem. B. 2015; 119:13252–13261. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Rinnenthal J., Klinkert B., Narberhaus F., Schwalbe H.. Direct observation of the temperature-induced melting process of the salmonella fourU RNA thermometer at base-pair resolution. Nucleic Acids Res. 2010; 38:3834–3847. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Bothe J.R., Nikolova E.N., Eichhorn C.D., Chugh J., Hansen A.L., Al-Hashimi H.M.. Characterizing RNA dynamics at atomic resolution using solution-state NMR spectroscopy. Nat. Methods. 2011; 8:919–931. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Huang Y., Weng X., Russu I.M.. Enhanced base-pair opening in the adenine tract of a RNA double helix. Biochemistry. 2011; 50:1857–1863. [DOI] [PubMed] [Google Scholar]
  • 34. Nelson J.W., Martin F.H., Tinoco I. Jr. DNA and RNA oligomer thermodynamics: the effect of mismatched bases on double-helix stability. Biopolymers. 1981; 20:2509–2531. [DOI] [PubMed] [Google Scholar]
  • 35. Petersheim M., Turner D.H.. Base-stacking and base-pairing contributions to helix stability: thermodynamics of double-helix formation with CCGG, CCGGp, CCGGAp, ACCGGp, CCGGUp, and ACCGGUp. Biochemistry. 1983; 22:256–263. [DOI] [PubMed] [Google Scholar]
  • 36. Freier S.M., Burger B.J., Alkema D., Neilson T., Turner D.H.. Effects of 3′ dangling end stacking on the stability of GGCC and CCGG double helices. Biochemistry. 1983; 22:6198–6206. [Google Scholar]
  • 37. Freier S.M., Alkema D., Sinclair A., Neilson T., Turner D.H.. Contributions of dangling end stacking and terminal base-pair formation to the stabilities of XGGCCp, XCCGGp, XGGCCYp, and XCCGGYp helixes. Biochemistry. 1985; 24:4533–4539. [DOI] [PubMed] [Google Scholar]
  • 38. Freier S.M., Sinclair A., Neilson T., Turner D.H.. Improved free energies for G.C base-pairs. J. Mol. Biol. 1985; 185:645–647. [DOI] [PubMed] [Google Scholar]
  • 39. Freier S.M., Kierzek R., Caruthers M.H., Neilson T., Turner D.H.. Free energy contributions of G.U and other terminal mismatches to helix stability. Biochemistry. 1986; 25:3209–3213. [DOI] [PubMed] [Google Scholar]
  • 40. Freier S.M., Sugimoto N., Sinclair A., Alkema D., Neilson T., Kierzek R., Caruthers M.H., Turner D.H.. Stability of XGCGCp, GCGCYp, and XGCGCYp helixes: an empirical estimate of the energetics of hydrogen bonds in nucleic acids. Biochemistry. 1986; 25:3214–3219. [DOI] [PubMed] [Google Scholar]
  • 41. Sugimoto N., Kierzek R., Freier S.M., Turner D.H.. Energetics of internal GU mismatches in ribooligonucleotide helixes. Biochemistry. 1986; 25:5755–5759. [DOI] [PubMed] [Google Scholar]
  • 42. Kierzek R., Caruthers M.H., Longfellow C.E., Swinton D., Turner D.H., Freier S.M.. Polymer-supported RNA synthesis and its application to test the nearest-neighbor model for duplex stability. Biochemistry. 1986; 25:7840–7846. [DOI] [PubMed] [Google Scholar]
  • 43. Freier S.M., Kierzek R., Jaeger J.A., Sugimoto N., Caruthers M.H., Neilson T., Turner D.H.. Improved free-energy parameters for predictions of RNA duplex stability. Proc. Natl. Acad. Sci. U.S.A. 1986; 83:9373–9377. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Sugimoto N., Kierzek R., Turner D.H.. Sequence dependence for the energetics of dangling ends and terminal base pairs in ribonucleic acid. Biochemistry. 1987; 26:4554–4558. [DOI] [PubMed] [Google Scholar]
  • 45. Sugimoto N., Kierzek R., Turner D.H.. Sequence dependence for the energetics of terminal mismatches in ribooligonucleotides. Biochemistry. 1987; 26:4559–4562. [DOI] [PubMed] [Google Scholar]
  • 46. Longfellow C.E., Kierzek R., Turner D.H.. Thermodynamic and spectroscopic study of bulge loops in oligoribonucleotides. Biochemistry. 1990; 29:278–285. [DOI] [PubMed] [Google Scholar]
  • 47. Peritz A.E., Kierzek R., Sugimoto N., Turner D.H.. Thermodynamic study of internal loops in oligoribonucleotides: symmetric loops are more stable than asymmetric loops. Biochemistry. 1991; 30:6428–6436. [DOI] [PubMed] [Google Scholar]
  • 48. Hall K.B., McLaughlin L.W.. Thermodynamic and structural properties of pentamer DNA.DNA, RNA.RNA, and DNA.RNA duplexes of identical sequence. Biochemistry. 1991; 30:10606–10613. [DOI] [PubMed] [Google Scholar]
  • 49. He L., Kierzek R., SantaLucia J. Jr, Walter A.E., Turner D.H.. Nearest-neighbor parameters for G.U mismatches: 5′GU3′/3′UG5′ is destabilizing in the contexts CGUG/GUGC, UGUA/AUGU, and AGUU/UUGA but stabilizing in GGUC/CUGG. Biochemistry. 1991; 30:11124–11132. [DOI] [PubMed] [Google Scholar]
  • 50. Walter A.E., Wu M., Turner D.H.. The stability and structure of tandem GA mismatches in RNA depend on closing base pairs. Biochemistry. 1994; 33:11349–11354. [DOI] [PubMed] [Google Scholar]
  • 51. Wu M., McDowell J.A., Turner D.H.. A periodic table of symmetric tandem mismatches in RNA. Biochemistry. 1995; 34:3204–3211. [DOI] [PubMed] [Google Scholar]
  • 52. McDowell J.A., Turner D.H.. Investigation of the structural basis for thermodynamic stabilities of tandem GU mismatches: solution structure of (rGAGGUCUC)2 by two-dimensional NMR and simulated annealing. Biochemistry. 1996; 35:14077–14089. [DOI] [PubMed] [Google Scholar]
  • 53. McDowell J.A., He L., Chen X., Turner D.H.. Investigation of the structural basis for thermodynamic stabilities of tandem GU wobble pairs: NMR structures of (rGGAGUUCC)2 and (rGGAUGUCC)2. Biochemistry. 1997; 36:8030–8038. [DOI] [PubMed] [Google Scholar]
  • 54. Xia T., McDowell J.A., Turner D.H.. Thermodynamics of nonsymmetric tandem mismatches adjacent to G.C base pairs in RNA. Biochemistry. 1997; 36:12486–12497. [DOI] [PubMed] [Google Scholar]
  • 55. Kierzek R., Burkard M.E., Turner D.H.. Thermodynamics of single mismatches in RNA duplexes. Biochemistry. 1999; 38:14214–14223. [DOI] [PubMed] [Google Scholar]
  • 56. Burkard M.E., Xia T., Turner D.H.. Thermodynamics of RNA internal loops with a guanosine-guanosine pair adjacent to another noncanonical pair. Biochemistry. 2001; 40:2478–2483. [DOI] [PubMed] [Google Scholar]
  • 57. Schroeder S.J., Turner D.H.. Thermodynamic stabilities of internal loops with GU closing pairs in RNA. Biochemistry. 2001; 40:11509–11517. [DOI] [PubMed] [Google Scholar]
  • 58. Serra M.J., Baird J.D., Dale T., Fey B.L., Retatagos K., Westhof E.. Effects of magnesium ions on the stabilization of RNA oligomers of defined structures. RNA. 2002; 8:307–323. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Ziomek K., Kierzek E., Biala E., Kierzek R.. The thermal stability of RNA duplexes containing modified base pairs placed at internal and terminal positions of the oligoribonucleotides. Biophys. Chem. 2002; 97:233–241. [DOI] [PubMed] [Google Scholar]
  • 60. Serra M.J., Smolter P.E., Westhof E.. Pronounced instability of tandem IU base pairs in RNA. Nucleic Acids Res. 2004; 32:1824–1828. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Chen G., Znosko B.M., Jiao X., Turner D.H.. Factors affecting thermodynamic stabilities of RNA 3 x 3 internal loops. Biochemistry. 2004; 43:12865–12876. [DOI] [PubMed] [Google Scholar]
  • 62. Clanton-Arrowood K., McGurk J., Schroeder S.J.. 3′ terminal nucleotides determine thermodynamic stabilities of mismatches at the ends of RNA helices. Biochemistry. 2008; 47:13418–13427. [DOI] [PubMed] [Google Scholar]
  • 63. Scheffler I.E., Elson E.L., Baldwin R.L.. Helix formation by d(TA) oligomers. II. Analysis of the helix-coli transitions of linear and circular oligomers. J. Mol. Biol. 1970; 48:145–171. [DOI] [PubMed] [Google Scholar]
  • 64. Uhlenbeck O.C., Martin F.H., Doty P.. Self-complementary oligoribonucleotides: effects of helix defects and guanylic acid-cytidylic acid base pairs. J. Mol. Biol. 1971; 57:217–229. [DOI] [PubMed] [Google Scholar]
  • 65. Martin F.H., Uhlenbeck O.C., Doty P.. Self-complementary oligoribonucleotides: adenylic acid-uridylic acid block copolymers. J. Mol. Biol. 1971; 57:201–215. [DOI] [PubMed] [Google Scholar]
  • 66. Breslauer K.J., Sturtevant J.M., Tinoco I. Jr. Calorimetric and spectroscopic investigation of the helix-to-coil transition of a ribo-oligonucleotide: rA7U7. J. Mol. Biol. 1975; 99:549–565. [DOI] [PubMed] [Google Scholar]
  • 67. Draper D.E. A guide to ions and RNA structure. RNA. 2004; 10:335–343. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Draper D.E. RNA folding: thermodynamic and molecular descriptions of the roles of ions. Biophys. J. 2008; 95:5489–5495. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Pabit S.A., Qiu X., Lamb J.S., Li L., Meisburger S.P., Pollack L.. Both helix topology and counterion distribution contribute to the more effective charge screening in dsRNA compared with dsDNA. Nucleic Acids Res. 2009; 37:3887–3896. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70. Kirmizialtin S., Pabit S.A., Meisburger S.P., Pollack L., Elber R.. RNA and its ionic cloud: solution scattering experiments and atomically detailed simulations. Biophys. J. 2012; 102:819–828. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71. Lipfert J., Doniach S., Das R., Herschlag D.. Understanding nucleic acid-ion interactions. Annu. Rev. Biochem. 2014; 83:813–841. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72. Gebala M., Herschlag D.. Quantitative studies of an RNA duplex electrostatics by ion counting. Biophys. J. 2019; 117:1116–1124. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73. Manning G.S. The molecular theory of polyelectrolyte solutions with applications to the electrostatic properties of polynucleotides. Q. Rev. Biophys. 1978; 11:179–246. [DOI] [PubMed] [Google Scholar]
  • 74. Turner D.H. Fundamental interactions in RNA: questions answered and remaining. Biopolymers. 2013; 99:1097–1104. [DOI] [PubMed] [Google Scholar]
  • 75. Bai Y., Greenfeld M., Travers K.J., Chu V.B., Lipfert J., Doniach S., Herschlag D.. Quantitative and comprehensive decomposition of the ion atmosphere around nucleic acids. J. Am. Chem. Soc. 2007; 129:14981–14988. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76. Jiang T., Kennedy S.D., Moss W.N., Kierzek E., Turner D.H.. Secondary structure of a conserved domain in an intron of influenza A M1 mRNA. Biochemistry. 2014; 53:5236–5248. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77. Banerjee A.R., Jaeger J.A., Turner D.H.. Thermal unfolding of a group I ribozyme: the low-temperature transition is primarily disruption of tertiary structure. Biochemistry. 1993; 32:153–163. [DOI] [PubMed] [Google Scholar]
  • 78. Gonzalez R.L. Jr, Tinoco I. Jr. Solution structure and thermodynamics of a divalent metal ion binding site in an RNA pseudoknot. J. Mol. Biol. 1999; 289:1267–1282. [DOI] [PubMed] [Google Scholar]
  • 79. Wang W., Zhao J., Han Q., Wang G., Yang G., Shallop A.J., Liu J., Gaffney B.L., Jones R.A.. Modulation of RNA metal binding by flanking bases: 15N NMR evaluation of GC, tandem GU, and tandem GA sites. Nucleosides. Nucleotides Nucleic Acids. 2009; 28:424–434. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80. Leamy K.A., Assmann S.M., Mathews D.H., Bevilacqua P.C.. Bridging the gap between in vitro and in vivo RNA folding. Q. Rev. Biophys. 2016; 49:e10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81. Alberts B., Bray D., Lewis J., Raff M., Roberts K., Watson J.D.. Molecular Biology of the Cell (3rd edn.). 1994; NY: Garland Publishing. [Google Scholar]
  • 82. London R.E. Methods for measurement of intracellular magnesium: NMR and fluorescence. Annu. Rev. Physiol. 1991; 53:241–258. [DOI] [PubMed] [Google Scholar]
  • 83. Romani A.M. Magnesium homeostasis in mammalian cells. Front. Biosci. 2007; 12:308–331. [DOI] [PubMed] [Google Scholar]
  • 84. Lusk J.E., Williams R.J., Kennedy E.P.. Magnesium and the growth of Escherichia coli. J. Biol. Chem. 1968; 243:2618–2624. [PubMed] [Google Scholar]
  • 85. Cantor C.R., Schimmel P.. Biophysical Chemistry. Part III, The Behavior of Biological Macromolecules. 1980; San Francisco: W.H. Freeman and Company. [Google Scholar]
  • 86. Zuber J., Cabral B., McFadyen I., Mauger D.M., Mathews D.H.. Analysis of RNA nearest neighbor parameters reveals interdependencies and quantifies the uncertainty in RNA secondary structure prediction. RNA. 2018; 24:1568–1582. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87. Zuber J., Mathews D.H.. Estimating uncertainty in predicted folding free energy changes of RNA secondary structures. RNA. 2019; 25:747–754. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88. Venables W.R., Ripley B.D.. Modern Applied Statistics with S. 2002; 4th ednNY: Springer. [Google Scholar]
  • 89. Schroeder S.J., Turner D.H.. Optical melting measurements of nucleic acid thermodynamics. Methods Enzymol. 2009; 468:371–387. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90. Sloma M.F., Mathews D.H.. Exact calculation of loop formation probability identifies folding motifs in RNA secondary structures. RNA. 2016; 22:1808–1818. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91. Zuker M., Stiegler P.. Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information. Nucleic Acids Res. 1981; 9:133–148. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92. Nussinov R., Jacobson A.B.. Fast algorithm for predicting the secondary structure of single-stranded RNA. Proc. Natl. Acad. Sci. U.S.A. 1980; 77:6309–6313. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93. Zuker M. On finding all suboptimal foldings of an RNA molecule. Science. 1989; 244:48–52. [DOI] [PubMed] [Google Scholar]
  • 94. Lorenz R., Bernhart S.H., Honer Zu Siederdissen C., Tafer H., Flamm C., Stadler P.F., Hofacker I.L.. ViennaRNA package 2.0. Algorithms Mol. Biol. 2011; 6:26. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95. Sato K., Akiyama M., Sakakibara Y.. RNA secondary structure prediction using deep learning with thermodynamic integration. Nat. Commun. 2021; 12:941. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96. Shi H., Rangadurai A., Abou Assi H., Roy R., Case D.A., Herschlag D., Yesselman J.D., Al-Hashimi H.M.. Rapid and accurate determination of atomistic RNA dynamic ensemble models using NMR and structure prediction. Nat. Commun. 2020; 11:5531. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97. Reuter J.S., Mathews D.H.. RNAstructure: software for RNA secondary structure prediction and analysis. BMC Bioinf. 2010; 11:129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98. Ding Y., Lawrence C.E.. A statistical sampling algorithm for RNA secondary structure prediction. Nucleic Acids Res. 2003; 31:7280–7301. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99. Mauger D.M., Cabral B.J., Presnyak V., Su S.V., Reid D.W., Goodman B., Link K., Khatwani N., Reynders J., Moore M.J.et al.. mRNA structure regulates protein expression through changes in functional half-life. Proc. Natl. Acad. Sci. U.S.A. 2019; 116:24075–24083. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100. Wayment-Steele H.K., Kim D.S., Choe C.A., Nicol J.J., Wellington-Oguri R., Watkins A.M., Parra Sperberg R.A., Huang P.S., Participants E., Das R.. Theoretical basis for stabilizing messenger RNA through secondary structure design. Nucleic Acids Res. 2021; 49:10604–10617. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101. Ritchey L.E., Tack D.C., Yakhnin H., Jolley E.A., Assmann S.M., Bevilacqua P.C., Babitzke P.. Structure-seq2 probing of RNA structure upon amino acid starvation reveals both known and novel RNA switches in bacillus subtilis. RNA. 2020; 26:1431–1447. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102. Su Z., Tang Y., Ritchey L.E., Tack D.C., Zhu M., Bevilacqua P.C., Assmann S.M.. Genome-wide RNA structurome reprogramming by acute heat shock globally regulates mRNA abundance. Proc. Natl. Acad. Sci. U.S.A. 2018; 115:12170–12175. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103. Tack D.C., Su Z., Yu Y., Bevilacqua P.C., Assmann S.M.. Tissue-specific changes in the RNA structurome mediate salinity response in arabidopsis. RNA. 2020; 26:492–511. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104. Hoshika S., Leal N.A., Kim M.J., Kim M.S., Karalkar N.B., Kim H.J., Bates A.M., Watkins N.E. Jr, SantaLucia H.A., Meyer A.Jet al.. Hachimoji DNA and RNA: a genetic system with eight building blocks. Science. 2019; 363:884–887. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105. Kierzek E., Zhang X., Watson R.M., Kennedy S.D., Szabat M., Kierzek R., Mathews D.H.. Secondary structure prediction for RNA sequences including N6-methyladenosine. Nat. Commun. 2021; 13:1271. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106. Wright D.J., Force C.R., Znosko B.M.. Stability of RNA duplexes containing inosine.cytosine pairs. Nucleic Acids Res. 2018; 46:12099–12108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107. Lu Z.J., Mathews D.H.. Efficient siRNA selection using hybridization thermodynamics. Nucleic Acids Res. 2008; 36:640–647. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108. Lu Z.J., Mathews D.H.. Fundamental differences in the equilibrium considerations for siRNA and antisense oligodeoxynucleotide design. Nucleic Acids Res. 2008; 36:3738–3745. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 109. Shao Y., Chan C.Y., Maliyekkel A., Lawrence C.E., Roninson I.B., Ding Y.. Effect of target secondary structure on RNAi efficiency. RNA. 2007; 13:1631–1640. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110. Tafer H., Ameres S.L., Obernosterer G., Gebeshuber C.A., Schroeder R., Martinez J., Hofacker I.L.. The impact of target site accessibility on the design of effective siRNAs. Nat. Biotechnol. 2008; 26:578–583. [DOI] [PubMed] [Google Scholar]
  • 111. Szabat M., Lorent D., Czapik T., Tomaszewska M., Kierzek E., Kierzek R.. RNA secondary structure as a first step for rational design of the oligonucleotides towards inhibition of influenza a virus replication. Pathogens. 2020; 9:925. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112. Angelbello A.J., Rzuczek S.G., McKee K.K., Chen J.L., Olafson H., Cameron M.D., Moss W.N., Wang E.T., Disney M.D.. Precise small-molecule cleavage of an r(CUG) repeat expansion in a myotonic dystrophy mouse model. Proc. Natl. Acad. Sci. U.S.A. 2019; 116:7799–7804. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113. Costales M.G., Aikawa H., Li Y., Childs-Disney J.L., Abegg D., Hoch D.G., Pradeep Velagapudi S., Nakai Y., Khan T., Wang K.W.et al.. Small-molecule targeted recruitment of a nuclease to cleave an oncogenic RNA in a mouse model of metastatic cancer. Proc. Natl. Acad. Sci. U.S.A. 2020; 117:2406–2411. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114. Suresh B.M., Li W., Zhang P., Wang K.W., Yildirim I., Parker C.G., Disney M.D.. A general fragment-based approach to identify and optimize bioactive ligands targeting RNA. Proc. Natl. Acad. Sci. U.S.A. 2020; 117:33197–33203. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115. Tinoco I. Jr, Borer P.N., Dengler B., Levin M.D., Uhlenbeck O.C., Crothers D.M., Gralla J.. Improved estimation of secondary structure in ribonucleic acids. Nat. New Biol. 1973; 246:40–41. [DOI] [PubMed] [Google Scholar]
  • 116. Borer P.N., Dengler B., Tinoco I. Jr, Uhlenbeck O.C.. Stability of ribonucleic acid double-stranded helices. J. Mol. Biol. 1974; 86:843–853. [DOI] [PubMed] [Google Scholar]
  • 117. Gralla J., Crothers D.M.. Free energy of imperfect nucleic acid helices. II. Small hairpin loops. J. Mol. Biol. 1973; 73:497–511. [DOI] [PubMed] [Google Scholar]
  • 118. Gray D.M., Tinoco Jr. I.. A new approach to the study of sequence-dependent properties of polynucleotides. Biopolymers. 1970; 9:223–244. [Google Scholar]
  • 119. Breslauer K.J., Frank R., Blöcker H., Marky L.A.. Predicting DNA duplex stability from the base sequence. Proc. Natl. Acad. Sci. U.S.A. 1986; 83:3746. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 120. England T.E., Uhlenbeck O.C.. Enzymic oligoribonucleotide synthesis with T4 RNA ligase. Biochemistry. 1978; 17:2069–2076. [DOI] [PubMed] [Google Scholar]
  • 121. Scaringe S.A., Wincott F.E., Caruthers M.H.. Novel RNA synthesis method using 5′-O-Silyl-2′-O-orthoester protecting groups. J. Am. Chem. Soc. 1998; 120:11820–11821. [Google Scholar]
  • 122. Gray D.M. Derivation of nearest-neighbor properties from data on nucleic acid oligomers. I. Simple sets of independent sequences and the influence of absent nearest neighbors. Biopolymers. 1997; 42:783–793. [DOI] [PubMed] [Google Scholar]
  • 123. Gray D.M. Derivation of nearest-neighbor properties from data on nucleic acid oligomers. II. Thermodynamic parameters of DNA.RNA hybrids and DNA duplexes. Biopolymers. 1997; 42:795–810. [DOI] [PubMed] [Google Scholar]
  • 124. Jaeger J.A., Turner D.H., Zuker M.. Improved predictions of secondary structures for RNA. Proc. Natl. Acad. Sci. U.S.A. 1989; 86:7706–7710. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 125. Sieger G., Hofmann H., Förtsch J., Gross H.J., Randies J.W., Sänger H.L., Riesner D.. Conformational transitions in viroids and virusoids: comparison of results from energy minimization algorithm and from experimental data. J. Biomol. Struct. Dyn. 1984; 2:543–571. [DOI] [PubMed] [Google Scholar]
  • 126. McCaskill J.S. The equilibrium partition function and base pair binding probabilities for RNA secondary structure. Biopolymers. 1990; 29:1105–1119. [DOI] [PubMed] [Google Scholar]
  • 127. Mathews D.H. Revolutions in RNA secondary structure prediction. J. Mol. Biol. 2006; 359:526–532. [DOI] [PubMed] [Google Scholar]
  • 128. Dirks R.M., Bois J.S., Schaeffer J.M., Winfree E., Pierce N.A.. Thermodynamic analysis of interacting nucleic acid strands. SIAM Rev. 2007; 49:65–88. [Google Scholar]
  • 129. Zadeh J.N., Wolfe B.R., Pierce N.A.. Nucleic acid sequence design via efficient ensemble defect optimization. J. Comput. Chem. 2011; 32:439–452. [DOI] [PubMed] [Google Scholar]
  • 130. Garcia-Martin J.A., Clote P., Dotu I.. RNAiFold: a web server for RNA inverse folding and molecular design. Nucleic Acids Res. 2013; 41:W465–W470. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 131. Sloma M.F., Mathews D.H.. Improving RNA secondary structure prediction with structure mapping data. Methods Enzymol. 2015; 553:91–114. [DOI] [PubMed] [Google Scholar]
  • 132. Seetin M.G., Mathews D.H.. RNA structure prediction: an overview of methods. Methods Mol. Biol. 2012; 905:99–122. [DOI] [PubMed] [Google Scholar]
  • 133. Havgaard J.H., Gorodkin J.. RNA structural alignments, part I: Sankoff-based approaches for structural alignments. Methods Mol. Biol. 2014; 1097:275–290. [DOI] [PubMed] [Google Scholar]
  • 134. Asai K., Hamada M.. RNA structural alignments, part II: non-Sankoff approaches for structural alignments. Methods Mol. Biol. 2014; 1097:291–301. [DOI] [PubMed] [Google Scholar]
  • 135. Huang L., Zhang H., Deng D., Zhao K., Liu K., Hendrix D.A., Mathews D.H.. LinearFold: linear-time approximate RNA folding by 5′-to-3′ dynamic programming and beam search. Bioinformatics. 2019; 35:i295–i304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 136. Zhang H., Zhang L., Mathews D.H., Huang L.. LinearPartition: linear-time approximation of RNA folding partition function and base-pairing probabilities. Bioinformatics. 2020; 36:i258–i267. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 137. Chen X., McDowell J.A., Kierzek R., Krugh T.R., Turner D.H.. Nuclear magnetic resonance spectroscopy and molecular modeling reveal that different hydrogen bonding patterns are possible for G.U pairs: one hydrogen bond for each G.U pair in r(GGCGUGCC)2 and two for each G.U pair in r(GAGUGCUC)2. Biochemistry. 2000; 39:8970–8982. [PubMed] [Google Scholar]
  • 138. Jang S.B., Hung L-W., Jeong M.S., Holbrook E.L., Chen X., Turner D.H., Holbrook S. R.. The crystal structure at 1.5 Å resolution of an RNA octamer duplex containing tandem G·U base pairs. Biophys. J. 2006; 90:4530–4537. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 139. Liu J.D., Zhao L., Xia T.. The dynamic structural basis of differential enhancement of conformational stability by 5′- and 3′-dangling ends in RNA. Biochemistry. 2008; 47:5962–5975. [DOI] [PubMed] [Google Scholar]
  • 140. Nikolova E.N., Al-Hashimi H.M.. Thermodynamics of RNA melting, one base pair at a time. RNA. 2010; 16:1687–1691. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 141. Dethoff E.A., Petzold K., Chugh J., Casiano-Negroni A., Al-Hashimi H.M.. Visualizing transient low-populated structures of RNA. Nature. 2012; 491:724–728. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 142. Rangadurai A., Szymanski E.S., Kimsey I., Shi H., Al-Hashimi H.M.. Probing conformational transitions towards mutagenic Watson–Crick-like G.T mismatches using off-resonance sugar carbon R1rho relaxation dispersion. J. Biomol. NMR. 2020; 74:457–471. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 143. Turner D.H., Mathews D.H.. NNDB: the nearest neighbor parameter database for predicting stability of nucleic acid secondary structure. Nucleic Acids Res. 2010; 38:D280–D282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 144. Ziv O., Price J., Shalamova L., Kamenova T., Goodfellow I., Weber F., Miska E.A.. The Short- and Long-Range RNA-RNA interactome of SARS-CoV-2. Mol. Cell. 2020; 80:1067–1077. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 145. Li S., Zhang H., Zhang L., Liu K., Liu B., Mathews D.H., Huang L.. LinearTurboFold: linear-time global prediction of conserved structures for RNA homologs with applications to SARS-CoV-2. Proc. Natl. Acad. Sci. U.S.A. 2021; 118:e2116269118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 146. Amarante T.D., Weber G.. Evaluating hydrogen bonds and base stacking of single, tandem and terminal GU mismatches in RNA with a mesoscopic model. J. Chem. Inf. Model. 2016; 56:101–109. [DOI] [PubMed] [Google Scholar]
  • 147. Spasic A., Berger K.D., Chen J.L., Seetin M.G., Turner D.H., Mathews D.H.. Improving RNA nearest neighbor parameters for helices by going beyond the two-state model. Nucleic Acids Res. 2018; 46:4883–4892. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 148. Diamond J.M., Turner D.H., Mathews D.H.. Thermodynamics of three-way multibranch loops in RNA. Biochemistry. 2001; 40:6971–6981. [DOI] [PubMed] [Google Scholar]
  • 149. Lu Z.J., Turner D.H., Mathews D.H.. A set of nearest neighbor parameters for predicting the enthalpy change of RNA secondary structure formation. Nucleic. Acids. Res. 2006; 34:4912–4924. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 150. Vaitiekunas P., Crane-Robinson C., Privalov P.L.. The energetic basis of the DNA double helix: a combined microcalorimetric approach. Nucleic Acids Res. 2015; 43:8577–8589. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 151. Privalov P.L., Crane-Robinson C.. Forces maintaining the DNA double helix. Eur. Biophys. J. 2020; 49:315–321. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 152. Leontis N.B., Stombaugh J., Westhof E.. The non-Watson–Crick base pairs and their associated isostericity matrices. Nucleic Acids Res. 2002; 30:3497–3531. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 153. Correll C.C., Swinger K.. Common and distinctive features of GNRA tetraloops based on a GUAA tetraloop structure at 1.4 a resolution. RNA. 2003; 9:355–363. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

gkac261_Supplemental_Files

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES