Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2020 Sep 15.
Published in final edited form as: J Chem Theory Comput. 2019 Apr 2;15(5):3288–3305. doi: 10.1021/acs.jctc.8b00955

IMPROVING THE PERFORMANCE OF THE AMBER RNA FORCE FIELD BY TUNING THE HYDROGEN-BONDING INTERACTIONS

Petra Kührová 1,#, Vojtěch Mlýnský 2,#, Marie Zgarbová 1, Miroslav Krepl 1,2, Giovanni Bussi 3, Robert B Best 4, Michal Otyepka 1, Jiří Šponer 1,2,*, Pavel Banáš 1,2,*
PMCID: PMC7491206  NIHMSID: NIHMS1626259  PMID: 30896943

Abstract

Molecular dynamics (MD) simulations became a leading tool for investigation of structural dynamics of nucleic acids. Despite recent efforts to improve the empirical potentials (force fields, ffs), RNA ffs have persisting deficiencies, which hamper their utilization in quantitatively accurate simulations. Previous studies have shown that at least two salient problems contribute to difficulties in description of free-energy landscapes of small RNA motifs: (i) excessive stabilization of the unfolded single-stranded RNA ensemble by intramolecular base-phosphate and sugar-phosphate interactions, and (ii) destabilization of the native folded state by underestimation of stability of base pairing. Here, we introduce a general ff term (gHBfix) that can selectively fine-tune non-bonding interaction terms in RNA ffs, in particular the H-bonds. gHBfix potential affects the pair-wise interactions between all possible pairs of the specific atom types, while all other interactions remain intact, i.e., it is not a structure-based model. In order to probe the ability of the gHBfix potential to refine the ff non-bonded terms, we performed an extensive set of folding simulations of RNA tetranucleotides and tetraloops. Based on these data we propose particular gHBfix parameters to modify the AMBER RNA ff. The suggested parametrization significantly improves the agreement between experimental data and the simulation conformational ensembles, although our current ff version still remains far from being flawless. While attempts to tune the RNA ffs by conventional reparametrizations of dihedral potentials or non-bonded terms can lead to major undesired side effects as we demonstrate for some recently published ffs, gHBfix has a clear promising potential to improve the ff performance while avoiding introduction of major new imbalances.

Keywords: RNA, force field, MD simulation, enhanced sampling, folding, tetraloop

INTRODUCTION

Molecular dynamics (MD) simulations have become a very important tool for studies of biomolecular systems such as nucleic acids with routine access to micro- or even millisecond timescales.17 MD simulations are often instrumental for understanding and clarifying experimental results and for obtaining a more complete picture of their biological implications. Nevertheless, a sufficiently realistic description of biopolymers by the used empirical potentials (force fields, ffs) is essential for successful applications of MD.4, 8 The polyanionic nature and high structural variability of ribonucleic acid (RNA) makes the development of RNA ffs an especially daunting task.4 Despite huge efforts to fix problems that have emerged on ns-μs simulation timescales,917 RNA ffs still cause some behaviors in simulations, which are not consistent with experiments.

Limitations of the available RNA ffs have been reviewed in detail, with a suggestion that the currently available pair-additive RNA ffs are approaching the limits of their applicability.4 Examples of such problems for some recently suggested ff versions are documented in the present study. A radical solution of the RNA ff problem could be the use of polarizable ffs.1819 However, their sophisticated parametrization not only results in extra computational cost but it yet remains to be seen if their accuracy will eventually surpass the best non-polarizable RNA ffs in extended simulations. Another option is to augment the existing ff forms by some additional simple ff terms that could be used to tune the ff performance while minimizing adverse side effects. One such ff term is introduced in this study.

Compact folded RNA molecules are typically well-described by modern ffs on a sub-μs timescale when starting simulations from established experimental structures.4 This has allowed many insightful studies on RNAs and protein-RNA complexes. However, the stability of folded RNAs on longer timescales is affected by the free-energy balance between folded, misfolded and unfolded states. Therefore, a common way to identify major problems in ffs is using enhanced-sampling techniques, where, generally, either the probability distribution of a limited number of selected degrees of freedom is changed (importance-sampling methods), or the total energy (equivalently, the temperature) of the system is modified (generalized-ensemble algorithms); see Refs.2021. Considering biomolecular simulations, replica exchange (RE) methods2223 are among the most popular enhanced-sampling techniques. RE simulations profit from multiple loosely coupled simulations running in parallel over a range of different temperatures (T-REMD22 simulations) or Hamiltonians (e.g. REST223 simulations). Exchanges between replicas are attempted at regular time intervals and accepted conforming to a Metropolis-style algorithm. RE methods do not require any prior chemical insights regarding the folding landscapes. Obviously, even enhanced-sampling simulations are not a panacea and their capability to accelerate sampling has some limits; see Refs.4 and24 summarizing recent applications of enhanced-sampling methods to RNA systems.

The structural dynamics of RNA tetranucleotides (TNs) represents one of the key benchmarks for testing RNA ffs.1415, 17, 2532 TNs are ideal testing systems due to their small size and straightforward comparison of their simulations with solution experiments. Obviously, any quantitative ff assessment is critically dependent on the convergence of structural populations because only well-converged simulations can provide unambiguous benchmark datasets. Nevertheless it appears that contemporary simulation methods and hardware already allow to obtain sufficiently converged simulation ensembles for TNs.1415, 17, 2829, 31, 3334 TN simulations can specifically evaluate performance of ffs for several salient energy contributions, namely, (i) sugar-phosphate (SPh) and base-phosphate (BPh) interactions, (ii) base stacking interactions, (iii) backbone conformations and (iv) balance of these contributions with solvation. Experimental data revealed that TNs mostly populate A-form conformations2527 but MD simulations tend to significantly sample also non-native intercalated structures (or some other non-native structures, Figure 1) that are considered to be a ff artifact.1415, 17, 2731, 35 Obviously, when using TNs as a benchmark for ff development, one has to be concerned about a possible over-fitting of the ff towards the canonical A-RNA conformation, which may have detrimental consequences for simulations of folded RNAs (see below).

Figure 1.

Figure 1.

Tertiary structures and detail overview of the three systems: (A) r(GACC) TN in the dominant native A-major conformation (top) and the spurious intercalated structure (bottom), and (B) r(gcGAGAgc) and (C) r(gcUUCGgc) TLs in their native conformations. A, C, G and U nucleotides are colored in sand, white, red, and blue, respectively. The two insets (bottom left) highlight three and five signature H-bonds, i.e., GL1(N2H)…AL4(pro-RP), GL1(N2H)…AL4(N7), and GL1(2’-OH)…GL3(N7) for the GNRA TL (B) and UL1(2’-OH)…GL4(O6), UL2(2’-OH)…GL4(N7), CL3(N4H)…UL2(pro-RP), and bifurcated GL4(N1H/N2H)…UL1(O2) for the UNCG (C) TL, respectively. Note that all analyses involving occurrence of the native state within generated conformational ensembles considered just four signature H-bonds for the UNCG TL, i.e., we counted only the GL4(N1H)…UL1(O2) interaction from the bifurcated GL4(N1H/N2H)…UL1(O2) H-bond.

Another key energy contribution that needs to be described by a ff are base-pair interactions. A large part of the thermodynamic stability of folded RNAs is due to formation of canonical A-RNA helices stabilized by canonical Adenine-Uracil (AU) and Guanine-Cytosine (GC) Watson-Crick base pairs and their stacking interactions.36 Thus, correct description of canonical base pairs is fundamental for folding studies of RNA molecules. Besides this, RNA molecules contain an astonishing variability of other base-pair patterns.4, 37 Assessment of the capability of the ffs to describe these interactions requires analyses of more complicated systems like RNA tetraloops (TLs) and folded RNA motifs.4 The GNRA and UNCG TLs (N and R stand for any and purine nucleotide, respectively) are the most abundant hairpin loops in RNA molecules.3841 These TLs contribute to various biological functions including tertiary folding, RNA-RNA and protein-RNA interactions, ligand binding, and thus play key roles in transcription, translation and gene regulation.4248 Both above-noted TL types possess a clear dominant folded topology that is characterized by a set of signature molecular interactions that determine the consensus sequences (Figure 1).39 The structure of isolated small TL motifs in solution is in dynamic temperature-dependent equilibrium between folded and unfolded conformations. The two main parts of TL’s, i.e., their A-RNA stem and the structured loop itself, contribute differently to the folding free-energy landscape, and thus simulations of TLs allow us to simultaneously probe the capability of the ffs to describe A-RNA duplexes as well as some noncanonical interactions and backbone conformations.4951 In contrast to dimerization of A-form duplexes, sampling of the hairpin free-energy landscape is simplified due to unimolecular nature of its folding. TL’s with longer stems formed by canonical base pairs are more stable and thus require higher temperature for melting (unfolding).5051 Therefore, minimal 8-nucleotide long (8-mers) TL motifs with just two base pairs are the preferred targets for computational studies due to their small size and relative ‘metastability’, which is further affected by the nature and orientation of both the closing and terminal base pairs.50 Nevertheless, there is a clear experimental evidence that even these short oligomers should be dominantly in the folded state.52 The native TL conformations thus represent a genuine benchmark for the ff development, which might unambiguously identify underestimated stability of the folded RNA structures in the ffs. Such problems are manifested through a complete loss of the characteristic TL structures (Figure 1) in MD simulations; note that reversible fluctuations about the native TL conformations are not sufficient for ff verification since the available experimental data do not provide sufficiently unambiguous information about subtle plasticity of the folded TLs. The ffs need to be tested also on broad range of other RNA systems.4

Recent studies generated large conformational ensembles of TNs1415, 17, 2829, 31, 3334 and TLs1415, 17, 2829, 31, 35, 5356 in order to assess the performance of RNA ffs. They showed that the available RNA ffs have persisting deficiencies causing, e.g., (i) shifts of the backbone dihedral angles to non-native values, (ii) problems with the χ-dihedral angle distribution, and (iii) over-populated SPh and BPh hydrogen bond interactions.15, 25, 55 Some of those studies also suggested potential directions for ff improvement, which included modifications of backbone dihedral terms, van der Waals (vdW) radii and charges, RNA interaction with solvent/ions (adjustments of Lennard-Jones parameters typically accompanied by modifications of the Lennard-Jones combining rules via nonbonded fix, NBfix, to balance the RNA-solvent interaction) and enforced distributions from solution experiments.14, 17, 2931, 53, 5760 Computer folding of UNCG TLs appears to be much more challenging than folding of GNRA TLs and description of TNs,15, 35, 53, 55, 61 as for the latter two systems some partial successes have been reported. Note that there have been repeated past claims in the literature about successful folding of RNA TLs in simulations. However, the claims were not confirmed by independent research groups, as extensively reviewed in Ref.4. The performance and possible shortcomings of another recently released ff60 are discussed as part of this work.

Previously, we have shown that at least two different imbalances likely contribute to the (in)correct folded/unfolded free-energy balance of TNs and TLs. The first problem was excessive stabilization of the unfolded ssRNA structure by intramolecular BPh and SPh interactions.62 The excessive binding of 2’-hydroxyl groups (2’-OH) towards phosphate nonbridging oxygens (nbO) was reported earlier27, 58 and could be partially reduced by using alternative phosphate oxygen parameters developed by Case et al.63 in combination with the OPC64 explicit solvent water model.14 However, the OPC water model was shown to destabilize three-tetrad DNA quadruplex stems65 while the modified phosphate parameters were not successful in correcting base-phosphate H-bonding in simulations of Neomycin-sensing riboswitch,66 indicating that the modified phosphate parameters with OPC water model do not represent the ultimate solution for tuning the phosphate-group interactions. The second problem was destabilization of the native folded state by underestimation of the native H-bonds including the stem base pairing. As a general correction for under- or overestimation of base-base and other H-bond interactions remains challenging,4 we recently introduced a weak local structure-specific short-range biasing potential supporting the native H-bonds (HBfix). Its application led to a substantial improvement of GAGA TL folding62 and stabilization of U1A protein-RNA interface.67 The HBfix approach, however, does not provide a transferable ff that can be applied on systems, where the native structure is not known a priori.

In distant past, H-bond specific terms were used both in AMBER68 and CHARMM69 molecular mechanics (MM) packages for tuning H-bond distances and energies. They were introduced in the form of a modified Lennard-Jones 10–12 potential (AMBER ff)70 or more sophisticated Lennard-Jones 10–12 or 4–6 potential accompanied by cosine terms improving the directionality of the H-bond (CHARMM ff).71 Later, the breakthrough paper in ff development by Cornell et al.72 introduced the improved charge model, the restrained electrostatic potential (RESP) fit of charges. The authors claimed that a proper calibration of the point-charge model combined with standard vdW parameters for the AMBER ff is entirely sufficient to describe medium-strength H-bond interactions present in biomolecular systems.73 Similarly for CHARMM ff, Smith and Karplus argued that H-bond specific terms can be sufficiently represented by a balanced combination of electrostatic and vdW interactions.74 Thus, in the last two decades, separate terms for H-bonding interactions were deemed unnecessary. However, those conclusions were based on analyses of rather short simulations (sub ns-long time scales), which provided only limited and crude insight into the stability of the H-bond interactions. On the other hand, nowadays, powerful enhanced-sampling techniques and routine accessibility of μs-long time scales allow us to probe the quantitative details of the H-bond interactions. Recently, we have shown that the fine energy balances of the H-bonds significantly affect the conformational preferences between folded and unfolded states of RNA molecules.53, 62 These new observations fully justify our attempt to re-introduce some H-bond specific terms into the AMBER nucleic acids ff in order to fine-tune its performance. Further, it has been noted that the presently used basic form of the pair-additive ff for nucleic acids simulations may be, after the recent parameter refinements, close to its accuracy limits.4 Then its further tuning becomes impractical, since improving one simulation aspect deteriorates others. To lift this deadlock, addition of new ff terms orthogonal to the common ff terms appears as a viable strategy. Finally, also high-quality QM calculations suggest that relying only on the common Lennard-Jones sphere and point-charge model in description of H-bonding has inevitable accuracy limitations.7576

In this work, we introduce a generalized formulation of the HBfix potential, in order to tune all interactions of the same kind, henceforth labelled as gHBfix. Most importantly, the gHBfix correction can be applied without previous knowledge of the native structure. gHBfix can be used for tunable modification of selected nonbonded terms, namely specified types of H-bond interactions, in order to improve the behavior of the current state-of-the-art RNA ff.11, 13, 73, 77 In the preceding applications, HBfix was used as a native-structure-centered ff correction to support known native interactions. In contrast, gHBfix is an interaction-specific ff correction, whose application depends only on the atom types. It is not biased in favor of any specific fold or RNA sequence. It can be easily applied and does not require any modification of standard simulation codes.7879 We used enhanced-sampling methods and obtained a large amount of data (total simulations timescale more than 4 ms) during the testing phase with several variants of the gHBfix parameters. We mainly focused on structural dynamics and folding of TN and TL systems with the aim to generate ensembles with population of major conformers close to those reported by experimental datasets.5052 The specific variant of the gHBfix potential suggested based on the TN and TL simulations was subsequently tested on various important RNA structural motifs using unbiased MD simulations. Although the suggested ff version certainly does not eliminate all the ff problems, as discussed in detail, it brings valuable improvements so far without visible side effects. We further propose that the gHBfix methodology has a substantial potential for further tuning.

METHODS

Starting structures and simulation setup.

The starting structures of r(gcGAGAgc) and r(gcUUCGgc) (in unfolded states) and r(GACC) as well as the other four TNs were prepared using Nucleic Acid Builder of AmberTools1480 as one strand of an A-form duplex. The topology and coordinates were prepared using the tLEaP module of AMBER 16 program package.78 Single strands were solvated using a rectangular box of OPC64 water molecules with a minimum distance between box walls and solute of 10 Å, yielding ~2000 water molecules added and ~40×40×40 Å3 box size for TN and ~7000 water molecules added and ~65×65×65 Å3 box size for both TLs, respectively. Simulations were run at ~1 M KCl salt excess using the Joung-Cheatham ion parameters81 (K+: R = 1.590 Å, ε = 0.2795 kcal/mol, Cl: R = 2.760 Å, ε = 0.0117 kcal/mol). We used the ff99bsc0χOL311, 13, 73, 77 basic RNA ff version with the vdW modification of phosphate oxygen developed by Case et al.63. These phosphate parameters for phosphorylated aminoacids were shown to improve the performance of RNA ff.14, 62 All the affected dihedrals were adjusted as described elsewhere.58 AMBER library file of this ff version can be found in Supporting Information of Ref.62. Notice that although the OPC water model was used in this study, the derived gHBfix parameters should be applicable also with other water models, though one can expect that for some RNA systems the simulation outcomes may moderately depend on the used water models. We think that it could be possible in future to couple optimization of gHBfix parameters with specific water models. This would, however, be justified when being capable to test a finer “grid” of the gHBfix parameters, e.g., with the help of reweighting parametrization procedures. Based on the available literature data we note that although a choice of the water model may have some influence on RNA simulations of specific systems, it is not a decisive factor.8284 In addition, we do not think it would be appropriate to suggest one water model as being universally best for RNA simulations. In summary, the RNA ff version primarily used here is composed of: ff99bsc0χOL3 basic RNA ff11, 13, 73, 77 combined with (i) the phosphate oxygens by Case et al.,63 (ii) the ion parameters by Joung&Cheatham,81 and (iii) the OPC water model.64 For the sake of simplicity, this RNA ff is abbreviated as χOL3CP through the text and is further tuned by our external gHBfix potential.

REST2 Settings.

The replica exchange solute tempering (REST2)23 simulations of r(GACC) TN, r(gcGAGAgc), and r(gcUUCGgc) TLs were performed at 298 K with 8 (TN) and 12 (TLs) replicas. Details about settings can be found elsewhere.62 The scaling factor (λ) values ranged from 1 to 0.6017 (TN) or to 0.59984 (TLs) and were chosen to maintain an exchange rate above 20%. The effective solute temperature ranged from 298 to ~500 K. The hydrogen mass repartitioning85 with a 4-fs integration time step was used. The standard length of REST2 simulations was 10 μs per replica (specific tests were terminated earlier; see Table S1 in Supporting Information for summary of all enhanced-sampling simulations); the cumulative time of all REST2 simulations was ~4.2 ms.

gHBfix for support/weakening of specific interactions.

The main aim of this work is extension of the previously introduced structure-specific HBfix potential to the generalized interaction-specific gHBfix potential (Figure 2), creating essentially new generally applicable RNA ff versions. The potential function is defined as:

E(r)=|0r<rbegη(rrbeg)2c(rendrbeg)2rbegr<rbeg+c(rendrbeg)η(1c)(rendrbeg)2[2(rendrbeg)(rrbeg)(rrbeg)2c(rendrbeg)2]rbeg+c(rendrbeg)r<rendηrrend

where rbeg and rend delimit the region, in which the potential is not constant, c defines the position of the point of inflection, so that its position is rbeg + c(rend - rbeg), and the η is a controllable parameter, which defines total energy bias (in kcal/mol) for the H-bonds. The potential was originally applied to heavy atom distances of H-bonds, i.e., between proton donor and proton acceptor heavy atoms.62 Here we applied it between hydrogen of the proton donor and proton acceptor heavy atom.

Figure 2.

Figure 2.

Description of the gHBfix potential (green curves) used for either support (A) or weakening (B) of H-bond interactions. In the present study, gHBfix potential is in all cases applied to the distance between hydrogen of the proton donor and proton acceptor heavy atom. The potential is constant (i.e., it provides zero forces) at all distances except for the narrow region between rbeg and rend corresponding roughly to the expected location of the free energy barrier between bound and unbound states. For the used distance between hydrogen and proton acceptor heavy atom of the hydrogen bond we applied here 2 Å and 3 Å for rbeg and rend, respectively. The potential is formally composed of a combination of two flat-well restraints with opposite sign of curvature and linear extensions that cancel each other at distances above rend (red and blue curves). The η parameter defines total energy support (A) or penalty (B) for each H-bond interaction. Although the gHBfix potential is technically constructed using two restraining functions, it is not a restraint; we have used this technical solution solely in order to be able to efficiently use common functions that are available in the simulation codes without any necessity to modify them.

Originally, the HBfix potential was used to support native H-bonds in a structure-specific manner that was sufficient to achieve folding of the r(gcGAGAgc) TL.62 For the sake of completeness, we report equivalent structure-specific HBfix T-REMD folding simulation of r(gcUUCGgc) TL as part of this work. However, the main focus of this study was the development of the gHBfix potential, where all possible interactions between specific groups are affected. We have tested a number of variants and combinations of the gHBfix potential. The list of all RNA groups and atoms involved in the gHBfix is summarized in Tables 1 and 2 (see Table S1 for summary of performed REST2 simulations with particular gHBfix settings). For the clarity, each tested setting of the gHBfix potential is marked as gHBfix(interaction pairη), where the upper label in the parentheses defines the specific interactions between RNA groups (Table 2) and the lower parameter η defines the total energy bias for each H-bond interaction of this kind (in kcal/mol, Figure 2).

Table 1.

The list of groups and atoms from RNA nucleotides whose interactions were modified by the gHBfix.

Donors Nucleotides
A C G U
NH (base) N6(H61, H62) N4(H41, H42) N1(H1), N2(H21, H22) N3(H3)
2-OH (sugar) O2’(HO2’) O2’(HO2’) O2’(HO2’) O2’(HO2’)
Acceptors
N (base) N1, N3, N7 N3 N3, N7 -
O (base) - O2 O6 O2, O4
O (sugar) O4’, O2’ O4’, O2’ O4’, O2’ O4’, O2’
bO (phosphate) O3’, O5’ O3’, O5’ O3’, O5’ O3’, O5’
nbO (phosphate) pro-RP, pro-SP pro-RP, pro-SP pro-RP, pro-SP pro-RP, pro-SP

Table 2.

All possible combinations of groups used for various gHBfix settings and their abbreviations. See Table 1 for atoms involved within each group.

gHBfix name Donors Acceptors
2-OH…nbO 2’-OH (sugar) nbO (phosphate)
2-OH…bO 2’-OH (sugar) bO (phosphate)
2-OH…O2 2’-OH (sugar) O2’ (sugar)
2-OH…O4 2’-OH (sugar) O4’ (sugar)
2-OH…N 2’-OH (sugar) N (base)
2-OH…O 2’-OH (sugar) O (base)
NH…N NH (base) N (base)
NH…O NH (base) O (base)
NH…O2 NH (base) O2’ (sugar)
NH…O4a NH (base) O4’ (sugar)
NH…bOa NH (base) bO (phosphate)
NH…nbOa NH (base) nbO (phosphate)
a

Not tested in this work.

Conformational analysis.

Native states of r(gcGAGAgc) and r(gcUUCGgc) TLs were determined based on the presence of all native H-bonds, i.e., those participating on the base pairing in the stem as well as signature interactions of the loop (presence of a H-bond was inferred from a hydrogen-acceptor distance within cutoff 2.5 Å), in combination with the εRMSD metric.86 The dominant conformations sampled in REST2 simulations were identified using a cluster analysis based on an algorithm introduced by Rodriguez and Laio87 in combination with the εRMSD.86 The algorithm was introduced in Ref.62 and uses the idea that cluster centers are characterized by a higher local density of points than their neighbors and by a relatively large distance from any other points with higher local density. In order to apply the clustering algorithm with the εRMSD metric,86 cluster selection was customized. The point having highest product of the local density and the distance to the nearest point of higher density was assigned as a next cluster center. Subsequently, the other points were assigned to this cluster center based on two rules: their nearest neighbor of higher density is (i) also member of the same cluster and is (ii) closer than 0.7 of εRMSD (see Supporting Information of the Ref.62 for details about implementation of the algorithm). This process was repeated until all points were processed and clusters with population above 1% were further analyzed. The cluster analysis was performed for unscaled replicas (λ=1, T = 298 K).

The estimations of error bars were calculated by bootstrapping88 using the implementation introduced in our recent study.62 The estimations of errors were obtained by resampling of the time blocks of the whole set of replicas and subsequent resampling of the coordinate-following replicas for the purpose of their demultiplexing to follow Hamiltonians in order to obtain a final resampled population at the reference unbiased Hamiltonian state (see Figure S1 in Supporting Information for more details). In time domain, we used a moving block bootstrap where optimal size of the blocks was selected to be close to n1/3, where n is the number of snapshots.89 Subsequently, the continuous replicas following coordinates were resampled (with replacement) and this resampled set of continuous replicas was subsequently demultiplexed, i.e., resorted to follow the Hamiltonians. The conformational ensemble was analyzed at each level of Hamiltonian bias (including the reference unbiased Hamiltonian) from this set of replicas. For each of the 1000 resample trials, we calculated the population of folded and misfolded states at each level of Hamiltonian. Finally, the confidence interval of these populations with a 5% level of significance was calculated from all 1000 resample trials. We would like to note that the bootstrapping is typically implemented in the literature as resampling in the time-domain only. Such simple resampling of time blocks might be used to estimate uncertainties originating from the limited sampling in the time domain. However, the more sophisticated bootstrap implementation used in this study allows us in addition to clearly distinguish between cases where the native state is formed only in one or very few replicas versus the native state being repeatedly formed in most or all replicas. Although in both cases one would observe both folded and unfolded structures in the reference replica, the latter scenario corresponds to populations that can be expected to be significantly closer to convergence, since multiple replicas contribute to the population of the native state. As shown in our previous study,62 the above-described resampling of replicas takes this effect into account and thus provides a considerably more conservative estimate of the convergence than the commonly used plain bootstrapping (see Figure S1 in the Supporting Information). The example of data analysis with error bars calculated only by bootstrap resampling in time-domain is shown on Figure S2 in the Supporting Information.

Comment on convergence analysis of the TL REST2 simulations.

RE simulations are converged once all replicas sample the same ensemble.15 Such ultimate convergence occurs typically on a much longer time-scale than the time-scale needed to reach an apparently steady-state population on the reference replica. Thus, seemingly converged steady-state population of the folded state as revealed by reference replica might still contain some statistical inaccuracy if the folding events were observed only in some replicas (see Supporting Information for details). Note that this is the case of the present REST2 simulations of TLs, where the full convergence is still beyond what is achievable for current computers. On the other hand, the bootstrapping implementation used in this study with resampling both in time- and replica-domain provides a fair estimate of uncertainties stemming from such still rather limited convergence. We further note that folding simulations performed in different ffs might require different simulation time to achieve the same level of convergence. In conclusion, we caution that with presently available computer resources and enhanced-sampling algorithms, quantitatively converged simulations of RNA TLs are not achievable; this unavoidable uncertainty obviously complicates fine-tuning of the RNA ffs.

Data analysis.

All trajectories were analyzed with the PTRAJ module of the AMBER package78 and the simulations were visualized using a molecular visualization program VMD90 and PyMOL91.

Additional simulations.

Besides the REST2 simulations of different variants of our gHBfix potential, we report also a number of other simulations for various RNA systems, to check the performance of the modified ff. The cumulative time of all unbiased simulations was ~100 μs and for space reasons, methodological details of these simulations are given in Supporting Information.

Simulations with other RNA ffs.

As part of our study, we tested the recently published ff by D. E. Shaw and co-workers.60 We applied this ff for folding simulations of RNA TLs with short A-RNA stems as well as for standard simulations of selected RNA motifs. We observed major instabilities caused by this ff version in folded RNAs. In addition, we tested the recently published vdW/NBfix RNA ff modification by Pak and co-workers.31 We also spotted major problems of this ff in description of noncanonical interactions within the Kink-turn RNA motif during standard MD simulations. Methodological details of both tested ffs are given in the Supporting Information of the article.

RESULTS AND DISCUSSION

In the present study, we have attempted to parameterize the gHBfix correction for the χOL3CP13, 73, 77 AMBER RNA ff. Our primary training systems were r(GACC) TN and two TL hairpins, namely r(gcGAGAgc) and r(gcUUCGgc) (see Figure 1 for structures). Our goal was to improve performance of the simulations for these systems while avoiding undesired side effects for other RNA systems. We have used extended REST2 simulations to generate sufficiently converged conformational ensembles of the above small RNA systems while testing various combinations of the gHBfix potentials acting on specific types of H-bonds (see Methods and Table S1 in Supporting Information for overview of REST2 simulations). The obtained results were compared with the available experimental data.2527, 5052 We have found a promising gHBfix correction that decisively improves behavior of the r(GACC) TN and r(gcGAGAgc) TL. We did not detect any side effects so far in standard simulations of a wide range of other RNA structures. This indicates that although the suggested modification is not robust enough to fold the r(gcUUCGgc) TL and is less convincing for some other TNs, it provides a significant improvement of RNA simulations when added to the widely used χOL3CP RNA ff. We reiterate that the primary goal of the paper was introduction of the basic principles of the gHBfix potential and demonstration of its capability for improving RNA simulations.

Weakening of the over-stabilized SPh interactions.

According to the NMR data,2527 RNA TNs adopt mostly two different conformations, A-major and A-minor (Figure S6 in Supporting Information). However, recently published standard MD and replica exchange simulations1415, 17, 27, 29, 31, 3334 using the χOL3CP11, 13, 73, 77 RNA ff reported unsatisfactory populations of canonical A-form conformations and, instead, significant population of artificial intercalated structures (Figures 1, 3, and Figure S6 in Supporting Information) that are stabilized by SPh and BPh interactions.27 Overstabilization of the BPh and especially SPh interactions was repeatedly suggested in our recent studies.58, 62 Among all TNs for which benchmark experimental data is available,2527 i.e., r(GACC), r(CAAU), r(CCCC), r(UUUU), r(AAAA), the r(GACC) sequence was most frequently used in the preceding ff testing, therefore, we primarily focused on this system, while the other sequences were tested afterwards.

Figure 3.

Figure 3.

Application of the gHBfix potential and its impact on tertiary structures of the three main test systems, i.e., r(GACC) TN in the spurious intercalated structure (top left) and the dominant native A-major conformation (top right), and r(gcGAGAgc) (bottom left) and r(gcUUCGgc) (bottom right) TLs in their native conformations. Groups included in the gHBfix potential are highlighted by spheres (H, N, and O atoms in white, blue, and red, respectively). The particular version of the gHBfix depicted in the Figure is tuning H-bond interactions in order to: (i) destabilize sugar – phosphate interactions (red dashed lines highlighted by yellow background on the top left panel) and, simultaneously, (ii) support base – base interactions (lower panels; black dashed lines highlighted by blue and red background for –NH…N– and –NH…O– interactions, respectively). Note that the gHBfix potential is affecting all interactions of the same type, i.e., not only those present in the depicted conformations.

In an attempt to eliminate the unsatisfactory MD behavior, we first designed a negative (destabilizing) gHBfix potential to probe the effect of destabilization of SPh interactions, i.e., H-bonds between all 2’-OH groups and phosphate nbOs (pro-RP, pro-SP), bOs (O3’, O5’) and sugar O4’ oxygens (Figure 3 and Tables 1 and 2). Several variants of the negative gHBfix were tested using a set of REST2 simulations of r(GACC) TN used as the training system. The results from clustering analysis show that all introduced negative gHBfix potentials acting on SPh interactions increased significantly the native RNA A-major population and essentially eliminated the presence of the intercalated structures (observed populations of artificial intercalated structures were marginal, typically ~1.0 %, Table 3). However, as a side effect, the canonical RNA A-minor conformation (characterized by the change of the α backbone dihedral at the 3’-end, allowing formation of the SPh contact, Figure S6 in Supporting Information) appeared to be destabilized and could even be eliminated when the repulsive gHBfix correction was too strong. For further tests, we took the gHBfix(2OHnbO/bO0.5) combination, which provides close agreement with experiments,25 i.e., χ2 value with respect to NMR observables of ~0.15 and populations of A-major/A-minor/Intercalated structures around 65%/18%/0%, respectively (Table 3). The chosen gHBfix(2OHnbO/bO0.5) setting consists of the negative gHBfix applied to all possible interactions between 2’-OH groups as proton donors and nbO and bO oxygens as acceptors with the destabilization constant η of –0.5 kcal/mol. It is notable that comparable improvement over the control χOL3CP simulation was achieved by other tested potentials, especially those involving interactions between 2’-OH groups and nbO oxygens (Table 3), suggesting the possibility of further tuning of the gHBfix parameters for SPh interactions.

Table 3.

Analysis of the most populated structural clusters (in %) obtained from r(GACC) REST2 simulations at the reference replica (T = 298 K) with five variants of the gHBfix parameters.a

Cluster Population (%)
Exp.b Ref.c gHBfix(2OHnbO1.0) gHBfix(2OHnbO/bO1.0) gHBfix(2OHnbO/bO/O41.0) gHBfix(2OHnbO0.5) gHBfix(2OHnbO/bO0.5)
A-major ~65 36.9 ± 2.8 69.2 ± 0.5 76.5 ± 3.2 61.3 ± 4.3 65.1 ± 2.5 58.7 ± 2.1
A-minor ~18 14.4 ± 1.5 4.4 ± 0.5 3.7 ± 0.4 2.8 ± 0.5 7.7± 0.9 7.4 ±1.3
Intercalatedd 0 ~9 ~0 ~0 ~0 ~1 ~1
χ2 e - 0.75 0.14 0.18 1.27 0.13 0.15
a

Clustering was performed from the last 7 μs of the 10-μs-long trajectory.

b

Experimental populations obtained by NMR refinement simulations25 and by reweighting extensive MD simulations based on these NMR data.33

c

Present control χOL3CP simulation without any external potential (see Methods for the detailed setup).

d

See Figures 1, 2 and Figure S6 in Supporting Information for examples of intercalated structures.

e

χ2 values were obtained as described elsewhere33 by comparing calculated and experimental backbone 3J scalar couplings, sugar 3J scalar couplings, nuclear Overhauser effect intensities (NOEs), and the absence of specific peaks in NOE spectroscopy (uNOEs). NOEs and uNOEs were calculated as averages over the N samples, i.e.,(u)NOECALC=(iNri6N)1/6. The calculated NOEs were directly compared against experimental values. Each uNOE signal was considered as violated, when the calculated value from the simulation was shorter than the assigned one from the experiment. 3J scalar couplings were obtained from the Karplus relationships (see Supporting Information of the Ref.33 for the details and experimental datasets).

Stabilization of the base-base H-bonding interactions.

Short TL motifs are ideal model systems for assessing the folding capability of ffs due to their small size, clearly defined native conformation and the possible variety of other competing conformations. We initially applied the gHBfix2-OHnbO/bO-0.5 potential in r(gcGAGAgc) REST2 folding simulation but we did not obtain native states, i.e., neither the stem nor the loop sampled native conformations for the entire 10 μs of the REST2 simulation (Table 4). Nonetheless, the gHBfix(2OHnbO/bO0.5) was able to (at least partially) suppress condensed (‘random-coil’) states with excess of BPh and SPh interactions in favor of A-form single stranded structures (see Figure S7 in Supporting Information for geometries and populations of major clusters during the r(gcGAGAgc) REST2 simulations). The persisting inability to sample the native state was not surprising because the problem with TL folding in χOL3CP RNA ff was deemed to be connected with two separate problems, namely, spurious stabilization of unfolded states by excessive BPh and SPh interactions and underestimation of the native H-bonds within the stem (canonical base-pairing interactions) and the loop (signature BPh and other interactions involving the 2’-OH groups, Figure 3).55, 62 Recently, we have shown that addition of structure-specific HBfix potential supporting native H-bonds results in satisfactory r(gcGAGAgc) TL folding.62 This suggests that the base-pairing interactions should be additionally stabilized, which is also consistent with several recent studies on DNA and RNA guanine quadruplexes and their folding intermediates stabilized by GG Hoogsteen base pairs.9294 In addition, the usefulness of stabilization of base – base H-bonds is also indicated by potentially excessive fraying of base-paired segments occasionally seen in MD simulations of folded RNAs.4

Table 4.

Tuning of the gHBfix potential for base pairing and its effects on the folding of r(gcGAGAgc) TL. Populations (in %) of two major conformations, i.e., the native state (with properly folded stem and loop) and all states with folded A-form stem (independent of the loop conformation), are displayed for each gHBfix combination at the reference REST2 replica (T = 298 K). See Figure S7 for the examples of most populated clusters from REST2 simulations.a

(–NH…N–) interaction bias (kcal/mol) (–NH…O–) interaction bias (kcal/mol)
(NHO+0.0) (NHO+0.5) (NHO+1.0)
(NHN+1.0) 20.0/33.2b 40.3/72.1 0.0/5.6
(NHN+0.5) 5.1/6.9 9.1/26.4 22.1/53.6
(NHN+0.0) 0.0/0.0c 0.0/3.9 0.0/29.1
a

Cluster analysis performed for the last 7 μs from the 10-μs-long trajectory at 298 K; the gHBfix(2OHnbO/bO0.5) potential applied in all simulations.

b

The numbers before and after “/” report population of native state with all signature interactions and all states with folded A-form stem, respectively. The first population is thus inherently included in the second.

c

REST2 simulation with the gHBfix(2OHnbO/bO0.5) potential affecting only SPh interactions.

In an attempt to fix the base-base interactions in a general fashion, i.e., to support both canonical and noncanonical base pairing, we added additional terms to the gHBfix potential, which support H-bonds between any proton donor from nucleobases (–NH groups) and nucleobase proton acceptors (N/O– atoms, see Figures 2, 3 and Methods for details). We would like to reiterate that these gHBfix potentials are biased neither towards any specific type of base pairing95 nor towards any specific pairs of nucleobases in the sequence, so they work irrespective to the base pairing in the native state. They are as general as all other ff terms and can be in principle transferred to other systems.

We have tested several variants of the base–base gHBfix function, by using a combination of separate terms for (–NH…N–) and (–NH…O–) H-bonds, each with three different values of the gHBfix bias constant η, namely 0.0, +0.5, and +1.0 kcal/mol. Thus, we performed nine r(gcGAGAgc) TL REST2 simulations with the gHBfix(2OHnbO/bO0.5) term introduced above for SPh interactions and various combinations of the gHBfix(NHNη) and gHBfix(NHOη) base-base terms (Table 4). Subsequently, we used clustering analysis to estimate the population of the native state (see Methods). It turned out that additional support of (–NH…N–) H-bond interactions (with η = +1.0 kcal/mol) is crucial in order to promote significant population of the folded stem and to stabilize the native arrangement of the TL (Figure 4 and Table 4). This is in agreement with the known ff imbalance, where N atoms tend to have too large vdW radii, forcing the (–NH…N–) H-bonds to fluctuate around larger distances with respect to Quantum Mechanical (QM) calculations and experimental datasets.5052, 96 We observed that a sufficiently strong support of (–NH…O–) H-bonds (with η equal to +1.0 kcal/mol) could also stabilize folded stem and/or native stem/loop even in combination with weaker (–NH…N–) support. This likely compensates for the effect of weaker (–NH…N–) interactions; see the r(gcGAGAgc) REST2 simulation with the gHBfix(2OHnbO/bO0.5)(NHN+0.5)(NHO+1.0) combination (Figure 4 and Table 4). Nonetheless, in such case the population of the native loop conformation is suboptimal and this rather heavy gHBfix combination leads already to some side effects in simulations of TNs (e.g., the r(GACC) TN tends to form non-native loop-like structures with too strong stabilization of base-base interactions, see Supporting Information for details). It also leads to a significant population of the left-handed Z-form helix conformation97 (stem guanines in syn orientation) instead of the dominant A-form in the r(gcGAGAgc) system (Figure 4). In general, it is advisable to keep the gHBfix ff corrections as mild as possible. Despite the fact that r(gcGAGAgc) REST2 simulations are not fully converged (Figure 4 and Figure S5 in Supporting Information), the data strongly suggests applying the positive biases (strengthening) on the (–NH…N–) H-bonds rather than on the (–NH…O–) H-bonds. This observation was also confirmed by nine REST2 simulations of r(GACC) TN differing in base-base gHBfix terms (using the same parameter combinations as tested for the r(gcGAGAgc) TL folding), all with gHBfix(2OHnbO/bO0.5) weakening SPh interactions (Table 5). Hence, REST2 simulations with both r(GACC) TN and r(gcGAGAgc) TL show that the gHBfix(2OHnbO/bO0.5)(NHN+1.0) potential represents currently the best compromise (among those variants tested) for tuning H-bonding interactions with a correct balance between different conformers according to the experiments.25, 27, 50

Figure 4.

Figure 4.

Populations (%) of the most important types of structures in r(gcGAGAgc) REST2 simulations with various gHBfix potentials for all twelve ladder replicas with errors estimated using bootstrapping with resampling both over time- and replica-domains (see Methods for details). Population of the native state is shown in blue. The remaining populations represent correctly folded A-form stem with any loop conformation (in green), correctly folded apical loop with stem not folded (red), and structures with left-handed Z-form stem with any loop conformation, i.e., including structures with properly structured apical loop accompanied with Z-form stem (gray). Displayed numbers highlight the final population in the unbiased replica 1 (T=298 K). The gHBfix(2OHnbO/bO0.5) potential was applied in all simulations.

Table 5.

The effect of gHBfix potential for base pairing on structural dynamics of r(GACC) TN. Populations (in %) of two major conformations, i.e., RNA A-major (the first number) and A-minor (the second number), are displayed for each gHBfix combination at the reference replica (T = 298 K). See Figure S6 in Supporting Information for examples of the other populated conformations from the REST2 simulations. The number in parentheses displays the χ2, which further validates the simulations against the data from experiments.a,b

(–NH…N–) interaction bias (kcal/mol) (–NH…O–) interaction bias (kcal/mol)
(NHO+0.0) (NHO+0.5) (NHO+1.0)
(NHN+1.0) 46.2/9.7 (0.32 c) 35.1/6.0 (0.78) 13.5/2.4 (1.98)
(NHN+0.5) 56.8/5.6 (0.11) 55.9/9.0 (0.23) 33.2/5.7 (0.95)
(NHN+0.0) 58.7/7.4 (0.15) d 57.6/9.8 (0.15) 54.1/9.3 (0.42)
a

gHBfix(2OHnbO/bO0.5) weakening SPh interactions is applied in all simulations.

b

Cluster analysis performed for the last 7 out of 10 μs-long trajectory at 298 K.

c

χ2 values were obtained by calculating and comparing backbone 3J scalar couplings, sugar 3J scalar couplings, NOEs and uNOEs (see Table 3 for details).

d

REST2 simulation with the gHBfix(2OHnbO/bO0.5) potential only (Table 3, last column).

Present combination of gHBfix parameters is not sufficient to entirely eliminate the intercalated state in other TN sequences.

Besides the r(GACC) TN sequence, four other TNs, i.e., r(CAAU), r(CCCC), r(AAAA), and r(UUUU), are commonly used for ff validation1415, 17, 2627, 29, 31, 33 due to availability of the corresponding benchmark NMR data. We thus performed additional REST2 simulations of those TN sequences in order to explore the effects of the gHBfix(2OHnbO/bO0.5)(NHN+1.0) potential. The simulations were initiated with the same setup as for the r(GACC) TN (see Methods). Application of the gHBfix ff to these sequences resulted in a visible but, in contrast to r(GACC), not fully satisfactory improvement over the χOL3CP. Namely, the population of native A-form was increased in all four sequences while the population of artificial intercalated structure was significantly reduced only in two of them, r(CCCC) and r(AAAA) (see Table 6). Population of intercalated structures still remained unsatisfactory for r(CAAU), r(CCCC), and r(AAAA) sequences, and notably, the r(CAAU) sequence revealed intercalated structure to be still more populated than the native A-form (Table 6). This suggests that the gHBfix(2OHnbO/bO0.5)(NHN+1.0) variant is not yet robust enough to entirely eliminate all artificial intercalated states in TNs and further ff modifications would be vital. It is worth to note that with the gHBfix(2OHnbO/bO0.5)(NHN+1.0) correction, all sequences having still unsatisfactory population of the intercalated structures involve C or A as 5’-terminal nucleotide, i.e., nucleotide, which can form 7BPh interaction98 in the intercalated state. This indicates that weakening of BPh interactions that were previously reported to be overpopulated in unfolded states62 might be used to eliminate the artificial intercalated structure. Work is in progress in our laboratories to find appropriate gHBfix potentials and parameters to refine simulations of all TNs simultaneously. However, although we already have promising results, finding the best solution is beyond the scope of the present study. It requires testing of a large number of parameter combinations, including subsequent verification of absence of side effects for a broad set of other RNA systems.

Table 6.

Population (%) of native RNA A-form (including both A-major and A-minor states) and artificial intercalated structures as obtained in standard χOL3CP and with the suggested gHBfix(2OHnbO/bO0.5)(NHN+1.0) modification.

sequence A-form Intercalated
χOL3CP gHBfix χOL3CP gHBfix
r(CAAU) <~2% a ~21% d ~50% a,b ~51% d
r(CCCC) ~20% b,c ~54% d ~56% b,c ~39% d
r(AAAA) ~21% a,b ~25% d ~23% a,b ~11% d
r(UUUU) ~4% a,b ~5% d ~18% b ~3% d
a

Ref.27

b

Ref.33

c

Ref.14

d

this study. Note that we used the implementation of modified vdW parameters of phosphates by Case et al.63 including correction for the affected dihedrals (see Methods).

Comment on BPh interactions.

When tuning H-bond interactions we need to take into account that under or over-estimation of a given H-bond type is likely not uniform across diverse RNA structures. The apparent ff deficiency for a given interaction reflected by its unrealistic population can be substantially context-dependent, as it is also affected by contributions from many other terms such as base stacking, backbone conformations, etc. Thus, one has to be always concerned with potential of over-corrections, which could improve some structures but deteriorate others. This is the reason why we so far did not apply gHBfix for the BPh interactions. Tuning of BPh interactions may have conflicting consequences in simulations of TNs, where they should be avoided and TLs, where they do form significant signature interactions. As BPh interactions are widespread in folded RNAs,98 their uniform weakening or strengthening in the ff could lead to ambiguous results. We nevertheless work on finding suitable gHBfix modifications of the BPh interactions that would improve RNA simulations without introducing side effects.

UNCG TL as a challenging system.

As noted in the Introduction, earlier studies indicated that the UNCG TL is a considerably more difficult system than the TNs and GNRA TLs. In order to prove that the native state99 population of the r(gcUUCGgc) TL might be improved by modification of non-bonded terms, we initially applied the structure-specific HBfix potential to all ten native interactions61, 99 of the r(gcUUCGgc) TL motif, i.e., to the six H-bonds of the two canonical GC base pairs of the stem and the four signature H-bonds within the loop (Figure 1). Each native H-bond was biased by +1.0 kcal/mol in favor of the bound state (see Methods for details). Similar structure-based HBfix successfully folds the r(gcGAGAgc) TL in T-REMD simulations.62 We obtained converged results, with the population of r(gcUUCGgc) TL native state (27.1 ± 9.6% at 298 K in T-REMD) in a reasonable agreement with experiments5052 (see Supporting Information for details). With aid of the structure-based HBfix, the r(gcUUCGgc) TL was readily able to significantly populate the native fold including syn-orientation of the GL4 nucleotide (Figure 1), without any specific tuning of the guanosine dihedral potential. However, we note that the HBfix corrections are structure-specific and cannot thus be used for general RNA sequences.

Next, we tried gHBfix potentials that revealed promising results for structural description and folding of both r(GACC) TN and r(gcGAGAgc) TL. Namely, we performed r(gcUUCGgc) REST2 folding simulation with gHBfix(2OHnbO/bO0.5)(NHN+1.0)(NHO+1.0), gHBfix(2OHnbO/bO0.5)(NHN+1.0), and gHBfix(2OHnbO/bO0.5)(NHN+0.5) potentials (Table S1 in Supporting Information). Unfortunately, we did not detect any presence of the native state. The application of gHBfix(2OHnbO/bO0.5)(NHN+1.0)(NHO+1.0) showed rapid stem folding but the loop fluctuated between several non-native conformers (Figure S8 in Supporting Information). In contrast, the gHBfix(2OHnbO/bO0.5)(NHN+1.0) and gHBfix(2OHnbO/bO0.5)(NHN+0.5) potentials led to either zero or marginal stem folding while misfolded states dominated the simulations (Figure S8 in Supporting Information). The fact that the gHBfix(2OHnbO/bO0.5)(NHN+1.0) potential did not lead to any folding of the native state in r(gcUUCGgc) TL, while it was sufficient to reveal significant folding in r(gcGAGAgc) TL confirms that the inherent stability of the UNCG loop region is underestimated by the ff significantly more than in the case of GNRA TL.53 This conclusion is further supported by the observation that only misfolded loop states are populated when the stem is forced to fold by gHBfix(2OHnbO/bO0.5)(NHN+1.0)(NHO+1.0) potential, which already overstabilizes the base-base interactions (see above).

The analysis of misfolded conformers obtained by the r(gcUUCGgc) REST2 simulations with the presently available gHBfix potentials showed several noncanonical interactions in the loop region dominating over the native ones. Based on this analysis, we decided to probe effect of either stabilization or destabilization of few other kinds of H-bonds. In particular, we tested the effect of the stabilization of all possible interactions between 2’-OH groups and H-bond acceptors of nucleobases, and the destabilization of interactions between all nucleobase proton donors (–NH) and all O2’ oxygens, and between 2’-OH groups and O2’/O4’ oxygens. We observed that the population of structures with correctly folded A-form stem slightly increased (up to ~70% in some cases, see Supporting Information for more details) but the folded native state appeared only in simulation using the most complex gHBfix(2OHnbO/bO0.5)(NHN+0.5)(NHO+0.5)(2OHN/O+0.5)(NHO20.5)(2OHO2/O40.5) potential, with a rather marginal population of ~6 %. It was accompanied with ~14% population of misfolded states involving properly folded A-form stem but non-native loop conformations (Figure S8 and other details in Supporting Information). In summary, additional biases to SPh and base-base interactions were not sufficient to correct the free-energy imbalance between native folded and misfolded states of the r(gcUUCGgc) TL in χOL3CP. Despite some possible convergence uncertainties of REST2 TL folding simulations, the data suggests that correct description of the r(gcUUCGgc) folding might be precluded also by some other ff inaccuracies whose elimination goes beyond fine-tuning of the non-bonded terms. In contrast to TNs which should be relatively easily correctable (see above), we presently have no clues how to decisively improve folding of the UNCG TLs.

Application of the suggested set of gHBfix parameters to a diverse set of folded RNAs shows no side effects.

Structural dynamics and folding simulations of both TN and TL motifs revealed that the combination of the gHBfix(2OHnbO/bO0.5)(NHN+1.0) potential with χOL3CP significantly improves their structural behavior. Although further work is required, especially considering folding of the r(gcUUCGgc) TL, we decided to test the performance of the χOL3CP/gHBfix(2OHnbO/bO0.5)(NHN+1.0) combination for other important (and much larger) RNA motifs and systems. Namely, we performed a set of standard MD simulations of Sarcin-Ricin loop, ribosomal L1-stalk, Kissing-loop complex, Hairpin ribozyme, preQ and Neomycin-sensing riboswitches, Kink-turns, RNA duplex and G-quadruplex (see Table S2 for overview of unbiased MD simulations). Importantly, the gHBfix potential with the presently suggested parameters did not cause any side effects and unexpected rearrangements in comparison to simulations with the unmodified widely used χOL3CP RNA ff (see Supporting Information for details). Therefore, the suggested set of gHBfix parameters appears to significantly improve simulation performance for at least some difficult RNA systems while not causing any undesired side effects in standard simulations of folded RNAs.

The relationship between gHBfix and NBfix approaches.

As discussed above, the gHBfix term is designed for versatile and fully controllable fine-tuning of specific pairwise interactions. It should be noted that at least part of the effect of gHBfix might be also achieved by the standard ff terms, in particular by modification of the pairwise vdW parameters via breakage of the combination (mixing) rules, i.e., by usage of the so-called NBfix term; for a recent review see Ref.100. The NBfix approach has been in the past rather commonly applied in connection with proteins,100105 carbohydrates,106 DNA molecular assemblies57, 100 and for cation interactions.107108 Occasional attempts were made also for RNA molecules.31, 54

Nonetheless, the NBfix term has several drawbacks and limitations both from the perspective of ff development as well as ff performance in practical applications. Further, the earlier attempts to use NBfix term for RNA simulations used an implementation, where the solvent-solute interactions were retained after modifications of the solute atomic vdW parameters.31, 54 In this approach all the intra-RNA interactions involving the modified atoms are scaled. We suggest that such NBfix approaches are not convenient for nucleic acids because they are seriously prone to undesired side effects. For instance, below we show that the recently introduced vdW/NBfix modification by Pak et al.31 designed to improve the behavior of TNs causes swift disruption of the native conformation of the K-turn RNA motif (see the next section called Performance of other ffs for details) and is thus not suitable for common simulations of folded RNAs. This underlines that any significant ff modifications have to be extensively tested prior publication on a broad range of RNA systems to detect uncontrollable side effects. In contrast to the above-mentioned implementations of NBfix to RNA molecules, we would recommend applying the NBfix terms rather on specific pairwise interactions in a similar manner as suggested here for the gHBfix approach. Such pairwise-specific NBfix parametrizations have been used for proteins and ions in the past.100, 104105

However, even performance of the pairwise-specific implementation of NBfix term may be sometimes limited as the modification of vdW parameters affects the interaction energy indirectly via electrostatics. Thus, for example, smaller vdW radii allow closer contact of the atom-centered partial charges, which in turn affects the interaction energy. In other words, although the NBfix approach formally modifies the vdW parameters, its primary effect comes from the electrostatic term and is dominantly localized in the region of short interatomic distances (at the interatomic contact region). As a consequence, the effect of the NBfix term might be ambiguous and substantially dependent on partial charges of the interacting atoms. For instance, although U(N3-H3)…A(N1) and G(N1-H1)…C(N3) interactions would share the same NBfix parameters they would be affected unequally due to the different partial charges (see Figure S9 and other details in the Supporting Information). In addition, only limited range of energy (de)stabilization might be achieved by the NBfix term in order to avoid extensive geometrical distortions of the molecular contacts, e.g., to avoid potentially excessively clashed or elongated geometries caused by modification of pairwise vdW radius in the NBfix term. Finally, some polar hydrogen atoms, like those from the 2’-OH groups, have zero vdW radii in the parent ff, which causes obstacles for fitting extra stabilization typically done by decreasing vdW radii. The way to proceed perhaps would be to combine very small hydrogen-atom radii with very high value of the epsilon parameter (depth of the Lennard-Jones potential). Note that when using the gHBfix modification we ultimately found that it is very useful to implement it via the polar hydrogens. In contrast to NBfix, the gHBfix modification straightforwardly tunes the stability of a given H-bond interaction and its effect is distributed over a broader range of interatomic distances beyond the interatomic contact region (Figure S10 in the Supporting Information). The suggested gHBfix term modifies the pairwise potential at intermolecular distances, where typically polarization is assumed to contribute, which is also one of the reasons, which to our opinion justifies the use of the gHBfix term. In our work we have used a default 1 Å interval for application of the gHBfix bias, however, this parameter (as well as the onset of the biased region), could be further tunable, which may provide an additional flexibility for the refinements.

In order to illustrate the relation between the gHBfix and NBfix approaches, we aimed to parameterize particular pairwise-specific NBfix terms to match as closely as possible the effect of our suggested gHBfix terms. In other words, we attempted to find NBfix parameters that would be equivalent to our so far best gHBfix(2OHnbO/bO0.5)(NHN+1.0) parameters. In order to parameterize the NBfix term, we calculated interaction energies as a function of the separation distance in a small testing system containing only H-bond donor, H-bond acceptor and hydrogen atom. We adjusted the NBfix parameters to reproduce the interaction energy curves obtained with the gHBfix term (see Supporting Information for details, including the NBfix parameters). Thus, we aimed to find (i) NBfix parameters that would provide ~+1.0 kcal/mol stabilization of all possible –NH…N– H-bonds (for stabilization of base-base interactions) and, simultaneously, (ii) another NBfix term that would provide ~–0.5 kcal/mol destabilization of all 2’-OH…bO/nbO H-bonds (for destabilization of SPh interactions, Table S4 in the Supporting Information). Note that the effect of NBfix might differ due to different partial charges of interacting atoms as mentioned in the previous paragraph. Such uncertainties of the NBfix correction derived from gHBfix became apparent, when we validated the fitted NBfix terms for base-base interactions by calculating the interaction energies for simple interacting fragments, i.e., AT and GC Watson-Crick base pairs (see Figure S10 in the Supporting Information), as a function of the base-pair stretch. As expected, the gHBfix approach was able to reproduce exactly the requested energy stabilization of 1.0 kcal/mol without affecting the position of stretch minima. However, the NBfix fit revealed non-uniform effect on the stabilization, as the observed change in the interaction energy in one-dimensional stretch scans was 1.2 kcal/mol and 0.6 kcal/mol for AT and GC base pairs, respectively (see Supporting Information for details). Note that although these optimized NBfix parameters lead to the stabilization effects distributed around the target value of ~1 kcal/mol, the distribution is rather broad and cannot be narrowed by further tuning of the NH…N NBfix parameters due to reasons described above. Subsequently, we tested our best attempt of the NBfix correction derived from the gHBfix for both base-base and SPh interactions (Table S4 in the Supporting Information) in a REST2 folding simulation of GACC TN. The results were rather comparable with the gHBfix folding simulations; the population of A-major, A-minor, and Intercalated states was 37.7%, 16.2%, and 4.6%, respectively, albeit the χ2 value with respect to NMR observables was increased to 0.72.

We would like to reiterate that, for reasons explained above, introduction of NBfix terms is equally unphysical as that of gHBfix terms. The gHBfix implementation is more straightforward for post-processing trajectory analysis (e.g., reweighting). In contrary, NBfix is, probably, easier to handle for users in current implementation of the MM packages. Most importantly, a controllable ff modification by the NBfix is much more difficult to parameterize than the gHBfix (see above). Taken together, we suggest that the gHBfix term may be more flexible for fine-tuning of the non-bonded interactions, avoids some side-effects and provides some advantages with respect to the NBfix approach in pair-specific treatment of H-bonds and interatomic contacts. Obviously, further work will be necessary to fully characterize the similarities and differences between the gHBfix and NBfix approaches. However, this issue is already beyond the scope of the present work and will be in detail addressed in our future studies. It is also possible that both approaches can be combined in future.

Comment on other RNA ffs.

As noted in the Introduction, despite occasional optimistic claims in the literature, none of the other ffs available in the literature as of the end of 2017 was proven to satisfactorily simulate the RNA TLs and TNs without undesired side effects, as reviewed in Ref.4. Another RNA ff modification (abbreviated as DESRES here) has been published recently,60 reparametrizing the non-bonded as well as dihedral terms of the AMBER RNA ff and complementing the resulting parametrization with a specific water model.109 We have tested this ff (see Supporting Information for full details) with the following results.

The DESRES parameters, as implemented by us (see Supporting Information for the parameters), lead to serious side effects for some important folded RNA molecules. For example, structures of RNA Kink-turns and ribosomal L1-stalk RNA segment were entirely and reproducibly disrupted (Figure 6 and Figures S19 and S20 in Supporting Information), suggesting that the DESRES ff may be excessively biased in favor of the A-form RNA while also some noncanonical base pairs may be destabilized by the introduced modifications of the non-bonded terms (Figure 6 and Figures S15S18 in Supporting Information). Similarly to the DESRES potential, the recent vdW/NBfix tuning by Pak and coworkers31 also revealed poor description of Kink-turn noncanonical interactions during classical MD simulations (Figure S20 and Supporting Information for details; further testing of this ff on additional systems was not performed). Such unstable behavior of two recently published ffs has never been observed in common AMBER ffs (including the version used and tested here modified by the gHBfix potential), which work well for these systems. Note that the tested Kink-turn 7 is type of Kink-turn that folds in absence of proteins. In presence of monovalent ions, experiments show that the Kink-turn 7 folds at around 30 mM Na+,110111 which is well below the monovalent ion concentration used in our simulations. Thus, with a correct ff, Kink-turn 7 should be dominantly folded at our simulation conditions. Swift loss of the Kink-turn 7 fold on a sub-μs time scale is thus indication of some major ff imbalances.

Figure 6.

Figure 6.

The example of poor description of the folded RNA molecule containing several noncanonical interactions during classical MD simulations with the recent DESRES60 RNA ff. (A) The starting snapshot from the MD simulation of the ribosomal L1-stalk RNA segment, where the compact fold is characterized by presence of two Kink-turns and by numerous tertiary interactions between three RNA loops, including the UNCG TL (highlighted in red; bases forming base-pair triplets with UNCG TL nucleotides are highlighted in blue). Water molecules, counter ions and hydrogen atoms are not shown for clarity. (B) Last snapshot from the simulation of the same system, where loss of several tertiary interactions, coupled with poor description of Kink-turns, was followed by large-scale degradation of the RNA fold. The degradation began immediately and fully progressed till the end of MD. Basically the same behavior was observed in a number of independent simulations (see Supporting Information for details).

Further, using extensive REST2 simulations with the DESRES potential, we were not capable to fold the r(gcGAGAgc) and r(gcUUCGgc) TLs, despite the fact that even these short constructs should have a non-negligible folded population based on experimental data.52 In other words, the DESRES potential60 with our protocol/implementation did not bring any observable benefit over standard χOL3CP2627, 5052 for the description of small 8-mer TLs. Note that in the original paper, the DESRES ff was reported to fold TLs with very long stems. Thus, the folding events may result from the increased propensity to form A-form double helix. Therefore, as another test, we have carried out series of unbiased DESRES simulations of the UNCG TL with a long stem starting from the folded states and we have observed loss of UNCG signature interactions on a time scale around 5–10 μs in all of them (Figure S13 in the Supporting Information). We thus suggest that the DESRES ff would require more tests before being used to simulate general RNA motifs.

CONCLUDING REMARKS

Several recent studies suggested that the nucleic acid ffs suffer from imbalanced description of the non-bonded interactions.3, 1415, 17, 31, 53, 57, 60, 62 Here we introduce a general pair potential, a generalized HBfix (gHBfix), that can selectively fine-tune specific non-bonded terms, in particular specific types of hydrogen-bonding interactions. The gHBfix potential can be easily combined with any existing ffs to probe the effect of the modification of their non-bonded terms. It is easily applicable for common MM engines such as AMBER and GROMACS packages without any requirement for the source code modifications (see Supporting Information for the attached toolkit, which generates files for MM packages). Most importantly, unlike in case of vdW or point-charge parameter modifications, it affects only the selected type of the interactions, while the others, including, e.g., interactions with the solvent and ions, remain intact. Thus, it reduces the likelihood of introducing undesirable side effects that are frequently associated with other ways of modifying non-bonded terms. The gHBfix term is an entirely legitimate way of modifying the pair-additive NA ff since, in fact, it is as empirical as the other terms. The common MM parameters such as atomic charges and radii are not QM observables and thus do not correspond to any real physical properties. So it seems genuine to straightforwardly target interaction energies of selected molecular contacts instead of trying to refine their properties indirectly via modified parameters of the MM atoms. In other words, the gHBfix term is as general as all the other ff terms and can be to certain extent physically justified, as it may compensate, e.g., for the lack of polarization terms in H-bonding.4, 62 As extensively discussed, a similar effect can be achieved also by modification of vdW parameters via NBfix,31, 54, 57, 100 although to our opinion in significantly less controllable manner with higher risk of causing some major simulation artifacts for other systems. It is possible that in future the NBfix and gHBfix corrections could be applied simultaneously, in a synergy.

We tested the gHBfix potential for fine-tuning of the solute-solute H-bond interactions in folding of small TL and TN RNAs; in particular by strengthening of the base-base interactions and weakening of the SPh interactions. Incorrect description of both interaction types was suggested to, e.g., interfere with balanced description of the TLs folding. We found out that the negative gHBfix applied to all possible interactions between 2’-OH groups as proton donors and nbO and bO oxygens as acceptors (i.e., weakening of the SPh interactions) eliminates (or reduces) some ff artifacts, namely, the presence of intercalated structures of TNs. Furthermore, the additional support of (–NH…N–) H-bonds (i.e., gHBfix strengthening the base pairing interactions) significantly promotes population of the TL folded stem and of the native arrangement of the GNRA TL. The necessity of additional support of H-bonds between groups with N atoms is probably related to their oversized vdW radii in ffs as suggested in some recent studies.5052, 96 In summary, we propose a gHBfix(2OHnbO/bO0.5)(NHN+1.0) potential as an extension of the AMBER χOL3CP ff. The conformational ensembles generated by this ff reveal better agreement with available experimental data than the corresponding ensembles generated by the χOL3CP alone. In particular a significant improvement was observed in population of both canonical A-major and A-minor single stranded structures in r(GACC) TN folding simulations, together with elimination of the intercalated structures. It also achieved a significant population of native state in the simulation of small 8-mer GNRA TL. Most importantly, we have tested the suggested gHBfix tuning on a wide portfolio of other RNA structures without observing any undesired side effects in standard simulations. We suggest that especially the improved stabilization of base pairs may be profitable for many other systems in long simulations. In particular, the gHBfix potential also reduces the A-RNA end fraying (see Supporting Information for details), which was shown in the past to cause problems for example in simulations of some protein-RNA complexes.112 Although not directly tested, the gHBfix ff should be readily applicable to protein-RNA complexes, as the modification does not have any direct impact on the interactions at the protein-RNA interface.

Despite that the suggested specific gHBfix parameters provide undisputable improvement of the standard AMBER RNA ff, the modification is not yet sufficient to fully eliminate intercalated structures for r(CAAU), r(CCCC), r(AAAA) TNs and to correct the free-energy imbalance between folded and misfolded states observed for the challenging r(gcUUCGgc) TL. Thus, further tuning of the gHBfix potential is required and it seems that UNCG TL might require additional corrections that go beyond simple fine-tuning of H-bond interactions. On the other hand, correction of the remaining TNs should be rather straightforward thought, as discussed above, one has to be careful to avoid overstabilization of the A-form RNA, a major problem of several recently introduced ffs. Considering the variability and complexity of RNA structures, and the simplicity of the functional form of current ffs, it is probably naive to suppose that one would be able to introduce a perfect RNA ff with the ability to provide correct behavior of all RNA systems by tuning just some non-bonded terms within the framework of pair-additive ffs.4 Nevertheless, the introduced gHBfix potential increases the flexibility of the ff in a controlled way by adding small number of independent parameters directly modifying stabilities (populations) of the target interactions, indicating that performance of the pair-additive ffs could still be significantly improved. In general, ff improvements may be aided by using new terms that go beyond the basic conventional ff form and that further increase the flexibility of the ff. gHBfix is an example of such modifications.

The computational overhead of the gHBfix is rather insignificant for small systems (~2% for GPU accelerated simulations using fast GTX1080Ti GPU cards and even smaller for slower GPU cards or CPU version of the MM engine code). Although the computational overhead increases with the system size, it might be almost entirely eliminated by the direct implementation of gHBfix potential into the MM engine code in future, which would allow using the Non-Bonded pair list for calculations of the gHBfix terms. The specific version of RNA AMBER ff presented and tested in this work does provide some improvements while not showing any undesired side effect for the broad set of important folded RNA systems. It can thus be quite safely used in RNA simulations. Work is in progress to balance some additional H-bond interactions using the gHBfix potentials, to optimize the biasing parameters η via reweighting and to merge the gHBfix potentials with adjustments of some of the core ff terms. We suggest that future refinements of the pair-additive RNA ffs will likely require adding additional ff terms (such as the gHBfix) to the basic functional form to increase flexibility of the parametrization. This, together with testing on a broad set of RNA systems, will help to avoid over-fitting of the ff in favor of one type of target RNA structures, such as, e.g., the TNs or A-form RNA, a problem demonstrated in our study for several recently published ffs. Due to the huge dimensionality of the ff parameter space and their mutual inter-dependence, finding an optimal ff will be most likely a long-term process, which would profit from collaborative efforts within the RNA simulation community.

Supplementary Material

supporting information

Figure 5.

Figure 5.

Population analysis of r(GACC) REST2 simulation with various gHBfix potentials. Occurrences (in %) of major conformers, i.e., RNA A-major (in blue) and A-minor (in red) is shown for each eight replicas in the ladder. Displayed numbers indicate the final population in the unbiased replica 1 (T=298 K). Dashed horizontal lines indicate populations suggested by experiments.25 Note that the gHBfix(2OHnbO/bO0.5) potential was applied in all simulations.

ACKNOWLEDGMENT

We thank Petr Jurečka and Petr Stadlbauer for useful discussions. Sandro Bottaro is acknowledged for critically reading the manuscript and providing comments. Financial support from the Operational Programme Research, Development and Education – European Regional Development Fund project no. CZ.02.1.01/0.0/0.0/16_019/0000754 [to M.O., P.B., P.K.] of the Ministry of Education, Youth and Sports of the Czech Republic and Czech Science Foundation [18-25349S to P.B., P.K.] are acknowledged. J.S. acknowledges support by Praemium Academiae.

Footnotes

Supporting Information.

The following files are available free of charge. Details about convergence of enhanced-sampling simulations, description of the r(gcUUCGgc) T-REMD folding simulation with structure specific HBfix, detailed description of additional REST2 folding simulations with various gHBfix potentials, further details about the relation between the gHBfix and NBfix approaches, structure preparation and other details about simulation protocols, detailed description of REST2 and standard MD simulations with recently published ffs, description of standard MD simulations with the gHBfix potential, Supporting Tables and Figures (PDF). C++ code to generate inputs for AMBER and GROMACS in order to perform simulations with the gHBfix (gHBfix.pdf). AMBER input files containing DESRES parameters (desres.zip).

REFERENCES

  • 1.Ditzler MA; Otyepka M; Sponer J; Walter NG, Molecular dynamics and quantum mechanics of RNA: conformational and chemical change we can believe in. Acc Chem Res 2010, 43 (1), 40–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Cheatham TE; Case DA, Twenty-Five Years of Nucleic Acid Simulations. Proc Natl Acad Sci U S A 2013, 99 (12), 969–977. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Sponer J; Banas P; Jurecka P; Zgarbova M; Kuhrova P; Havrila M; Krepl M; Stadlbauer P; Otyepka M, Molecular Dynamics Simulations of Nucleic Acids. From Tetranucleotides to the Ribosome. J Phys Chem Lett 2014, 5 (10), 1771–1782. [DOI] [PubMed] [Google Scholar]
  • 4.Sponer J; Bussi G; Krepl M; Banas P; Bottaro S; Cunha RA; Gil-Ley A; Pinamonti G; Poblete S; Jurecka P; Walter NG; Otyepka M, RNA Structural Dynamics As Captured by Molecular Simulations: A Comprehensive Overview. Chem Rev 2018, 118 (8), 4177–4338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Maffeo C; Yoo J; Comer J; Wells DB; Luan B; Aksimentiev A, Close encounters with DNA. J Phys Condens Matter 2014, 26 (41), 413101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Vangaveti S; Ranganathan SV; Chen AA, Advances in RNA molecular dynamics: a simulator’s guide to RNA force fields. Wiley Interdiscip Rev RNA 2017, 8 (2). [DOI] [PubMed] [Google Scholar]
  • 7.Smith LG; Zhao J; Mathews DH; Turner DH, Physics-based all-atom modeling of RNA energetics and structure. Wiley Interdiscip Rev RNA 2017, 8 (5). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Nerenberg PS; Head-Gordon T, New developments in force fields for biomolecular simulations. Curr Opin Struct Biol 2018, 49, 129–138. [DOI] [PubMed] [Google Scholar]
  • 9.Mackerell AD Jr.; Feig M; Brooks CL 3rd, Extending the treatment of backbone energetics in protein force fields: limitations of gas-phase quantum mechanics in reproducing protein conformational distributions in molecular dynamics simulations. J Comput Chem 2004, 25 (11), 1400–15. [DOI] [PubMed] [Google Scholar]
  • 10.MacKerell AD Jr.; Banavali N; Foloppe N, Development and current status of the CHARMM force field for nucleic acids. Biopolymers 2000, 56 (4), 257–65. [DOI] [PubMed] [Google Scholar]
  • 11.Perez A; Marchan I; Svozil D; Sponer J; Cheatham TE; Laughton CA; Orozco M, Refinenement of the AMBER force field for nucleic acids: Improving the description of alpha/gamma conformers. Biophys J 2007, 92 (11), 3817–3829. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Denning EJ; Priyakumar UD; Nilsson L; Mackerell AD Jr., Impact of 2’-hydroxyl sampling on the conformational properties of RNA: update of the CHARMM all-atom additive force field for RNA. J Comput Chem 2011, 32 (9), 1929–43. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Zgarbova M; Otyepka M; Sponer J; Mladek A; Banas P; Cheatham TE 3rd; Jurecka P, Refinement of the Cornell et al. Nucleic Acids Force Field Based on Reference Quantum Chemical Calculations of Glycosidic Torsion Profiles. J Chem Theory Comput 2011, 7 (9), 2886–2902. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Bergonzo C; Cheatham TE, Improved Force Field Parameters Lead to a Better Description of RNA Structure. J Chem Theory Comput 2015, 11 (9), 3969–3972. [DOI] [PubMed] [Google Scholar]
  • 15.Bergonzo C; Henriksen NM; Roe DR; Cheatham TE 3rd, Highly sampled tetranucleotide and tetraloop motifs enable evaluation of common RNA force fields. Rna 2015, 21 (9), 1578–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Lemkul JA; MacKerell AD Jr., Polarizable Force Field for DNA Based on the Classical Drude Oscillator: II. Microsecond Molecular Dynamics Simulations of Duplex DNA. J Chem Theory Comput 2017, 13 (5), 2072–2085. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Aytenfisu AH; Spasic A; Grossfield A; Stern HA; Mathews DH, Revised RNA Dihedral Parameters for the Amber Force Field Improve RNA Molecular Dynamics. J Chem Theory Comput 2017, 13 (2), 900–915. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Zhang C; Lu C; Jing Z; Wu C; Piquemal JP; Ponder JW; Ren P, AMOEBA Polarizable Atomic Multipole Force Field for Nucleic Acids. J Chem Theory Comput 2018, 14 (4), 2084–2108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Lemkul JA; MacKerell AD Jr., Polarizable force field for RNA based on the classical drude oscillator. J Comput Chem 2018, 39 (32), 2624–2646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Bernardi RC; Melo MCR; Schulten K, Enhanced sampling techniques in molecular dynamics simulations of biological systems. Biochim Biophys Acta 2015, 1850 (5), 872–877. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Valsson O; Tiwary P; Parrinello M, Enhancing Important Fluctuations: Rare Events and Metadynamics from a Conceptual Viewpoint. Annu Rev Phys Chem 2016, 67, 159–84. [DOI] [PubMed] [Google Scholar]
  • 22.Sugita Y; Okamoto Y, Replica-exchange molecular dynamics method for protein folding. Chem Phys Lett 1999, 314 (1–2), 141–151. [Google Scholar]
  • 23.Wang L; Friesner RA; Berne BJ, Replica Exchange with Solute Scaling: A More Efficient Version of Replica Exchange with Solute Tempering (REST2) (vol 115, pg 9431, 2011). J Phys Chem B 2011, 115 (38), 11305–11305. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Mlynsky V; Bussi G, Exploring RNA structure and dynamics through enhanced sampling simulations. Curr Opin Struct Biol 2018, 49, 63–71. [DOI] [PubMed] [Google Scholar]
  • 25.Yildirim I; Stern HA; Tubbs JD; Kennedy SD; Turner DH, Benchmarking AMBER Force Fields for RNA: Comparisons to NMR Spectra for Single-Stranded r(GACC) Are Improved by Revised chi Torsions. J Phys Chem B 2011, 115 (29), 9261–9270. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Tubbs JD; Condon DE; Kennedy SD; Hauser M; Bevilacqua PC; Turner DH, The nuclear magnetic resonance of CCCC RNA reveals a right-handed helix, and revised parameters for AMBER force field torsions improve structural predictions from molecular dynamics. Biochemistry-Us 2013, 52 (6), 996–1010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Condon DE; Kennedy SD; Mort BC; Kierzek R; Yildirim I; Turner DH, Stacking in RNA: NMR of Four Tetramers Benchmark Molecular Dynamics. J Chem Theory Comput 2015, 11 (6), 2729–2742. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Bergonzo C; Henriksen NM; Roe DR; Swails JM; Roitberg AE; Cheatham TE 3rd, Multidimensional Replica Exchange Molecular Dynamics Yields a Converged Ensemble of an RNA Tetranucleotide. J Chem Theory Comput 2014, 10 (1), 492–499. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Gil-Ley A; Bottaro S; Bussi G, Empirical Corrections to the Amber RNA Force Field with Target Metadynamics. J Chem Theory Comput 2016, 12 (6), 2790–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Cesari A; Gil-Ley A; Bussi G, Combining Simulations and Solution Experiments as a Paradigm for RNA Force Field Refinement. J Chem Theory Comput 2016, 12 (12), 6192–6200. [DOI] [PubMed] [Google Scholar]
  • 31.Yang C; Lim M; Kim E; Pak Y, Predicting RNA Structures via a Simple van der Waals Correction to an All-Atom Force Field. J Chem Theory Comput 2017, 13 (2), 395–399. [DOI] [PubMed] [Google Scholar]
  • 32.Schrodt MV; Andrews CT; Elcock AH, Large-Scale Analysis of 48 DNA and 48 RNA Tetranucleotides Studied by 1 mus Explicit-Solvent Molecular Dynamics Simulations. J Chem Theory Comput 2015, 11 (12), 5906–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Bottaro S; Bussi G; Kennedy SD; Turner DH; Lindorff-Larsen K, Conformational ensembles of RNA oligonucleotides from integrating NMR and molecular simulations. Sci Adv 2018, 4 (5), eaar8521. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Bottaro S; Gil-Ley A; Bussi G, RNA folding pathways in stop motion. Nucleic Acids Res 2016, 44 (12), 5883–91. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Haldar S; Kuhrova P; Banas P; Spiwok V; Sponer J; Hobza P; Otyepka M, Insights into Stability and Folding of GNRA and UNCG Tetra loops Revealed by Microsecond Molecular Dynamics and Well-Tempered Metadynamics. J Chem Theory Comput 2015, 11 (8), 3866–3877. [DOI] [PubMed] [Google Scholar]
  • 36.Mathews DH; Turner DH, Prediction of RNA secondary structure by free energy minimization. Curr Opin Struct Biol 2006, 16 (3), 270–8. [DOI] [PubMed] [Google Scholar]
  • 37.Sweeney BA; Roy P; Leontis NB, An Introduction to Recurrent Nucleotide Interactions in RNA. Wiley Interdiscip Rev RNA 2015, 6, 17–45. [DOI] [PubMed] [Google Scholar]
  • 38.Hall KB, Mighty tiny. Rna 2015, 21 (4), 630–1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Hsiao C; Mohan S; Hershkovitz E; Tannenbaum A; Williams LD, Single nucleotide RNA choreography. Nucleic Acids Res 2006, 34 (5), 1481–1491. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Tuerk CG,P; Thermes C; Groebe DR; Gayle M; Guild N; Stormo G; d’Aubenton-Carafa Y; Uhlenbeck OC; Tinoco I Jr. et al. , CUUCGG hairpins: extraordinarily stable RNA secondary structures associated with various biochemical processes. Proc Natl Acad Sci U S A 1988, 85 (5), 1364–1368. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Woese CR; Winker S; Gutell RR, Architecture of ribosomal RNA: constraints on the sequence of “tetra-loops”. Proc Natl Acad Sci U S A 1990, 87 (21), 8467–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Varani G, Exceptionally Stable Nucleic-Acid Hairpins. Annu Rev Biophys Biom 1995, 24, 379–404. [DOI] [PubMed] [Google Scholar]
  • 43.Uhlenbeck OC, Nucleic-Acid Structure - Tetraloops and Rna Folding. Nature 1990, 346 (6285), 613–614. [DOI] [PubMed] [Google Scholar]
  • 44.Pley HW; Flaherty KM; Mckay DB, Model for an Rna Tertiary Interaction Front the Structure of an Intermolecular Complex between a Gaaa Tetraloop and an Rna Helix. Nature 1994, 372 (6501), 111–113. [DOI] [PubMed] [Google Scholar]
  • 45.Xin YR; Laing C; Leontis NB; Schlick T, Annotation of tertiary interactions in RNA structures reveals variations and correlations. Rna 2008, 14 (12), 2465–2477. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Chauhan S; Woodson SA, Tertiary interactions determine the accuracy of RNA folding. J Amer Chem Soc 2008, 130 (4), 1296–1303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Marino JP; Gregorian RS; Csankovszki G; Crothers DM, Bent Helix Formation between Rna Hairpins with Complementary Loops. Science 1995, 268 (5216), 1448–1454. [DOI] [PubMed] [Google Scholar]
  • 48.Brion P; Westhof E, Hierarchy and dynamics of RNA folding. Annu Rev Biophys 1997, 26, 113–137. [DOI] [PubMed] [Google Scholar]
  • 49.Antao VP; Tinoco I, Thermodynamic Parameters for Loop Formation in Rna and DNA Hairpin Tetraloops. Nucleic Acids Res 1992, 20 (4), 819–824. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Sheehy JP; Davis AR; Znosko BM, Thermodynamic characterization of naturally occurring RNA tetraloops. Rna 2010, 16 (2), 417–29. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Mathews DH; Disney MD; Childs JL; Schroeder SJ; Zuker M; Turner DH, Incorporating chemical modification constraints into a dynamic programming algorithm for prediction of RNA secondary structure. Proc Natl Acad Sci U S A 2004, 101 (19), 7287–7292. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Abdelkafi M; Ghomi M; Turpin PY; Baumruk V; Herve du Penhoat C; Lampire O; Bouchemal-Chibani N; Goyer P; Namane A; Gouyette C; Huynh-Dinh T; Bednarova L, Common structural features of UUCG and UACG tetraloops in very short hairpins determined by UV absorption, Raman, IR and NMR spectroscopies. J Biomol Struct Dyn 1997, 14 (5), 579–93. [DOI] [PubMed] [Google Scholar]
  • 53.Bottaro S; Banas P; Sponer J; Bussi G, Free Energy Landscape of GAGA and UUCG RNA Tetraloops. J Phys Chem Lett 2016, 7 (20), 4032–4038. [DOI] [PubMed] [Google Scholar]
  • 54.Chen AA; Garcia AE, High-resolution reversible folding of hyperstable RNA tetraloops using molecular dynamics simulations. Proc Natl Acad Sci U S A 2013, 110 (42), 16820–16825. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Kuhrova P; Banas P; Best RB; Sponer J; Otyepka M, Computer Folding of RNA Tetraloops? Are We There Yet? J Chem Theory Comput 2013, 9 (4), 2115–2125. [DOI] [PubMed] [Google Scholar]
  • 56.Smith LG; Tan Z; Spasic A; Dutta D; Salas-Estrada LA; Grossfield A; Mathews DH, Chemically Accurate Relative Folding Stability of RNA Hairpins from Molecular Simulations. J Chem Theory Comput 2018, 14 (12), 6598–6612. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Yoo J; Aksimentiev A, Improved Parameterization of Amine-Carboxylate and Amine-Phosphate Interactions for Molecular Dynamics Simulations Using the CHARMM and AMBER Force Fields. J Chem Theory Comput 2016, 12 (1), 430–443. [DOI] [PubMed] [Google Scholar]
  • 58.Mlynsky V; Kuhrova P; Zgarbova M; Jurecka P; Walter NG; Otyepka M; Sponer J; Banas P, Reactive Conformation of the Active Site in the Hairpin Ribozyme Achieved by Molecular Dynamics Simulations with epsilon/zeta Force Field Reparametrizations. J Phys Chem B 2015, 119 (11), 4220–4229. [DOI] [PubMed] [Google Scholar]
  • 59.Zgarbova M; Jurecka P; Banas P; Havrila M; Sponer J; Otyepka M, Noncanonical alpha/gamma Backbone Conformations in RNA and the Accuracy of Their Description by the AMBER Force Field. J Phys Chem B 2017, 121 (11), 2420–2433. [DOI] [PubMed] [Google Scholar]
  • 60.Tan D; Piana S; Dirks RM; Shaw DE, RNA force field with accuracy comparable to state-of-the-art protein force fields. Proc Natl Acad Sci U S A 2018, 115 (7), E1346–E1355. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Banas P; Hollas D; Zgarbova M; Jurecka P; Orozco M; Cheatham TE; Sponer J; Otyepka M, Performance of Molecular Mechanics Force Fields for RNA Simulations: Stability of UUCG and GNRA Hairpins. J Chem Theory Comput 2010, 6 (12), 3836–3849. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Kuhrova P; Best RB; Bottaro S; Bussi G; Sponer J; Otyepka M; Banas P, Computer Folding of RNA Tetraloops: Identification of Key Force Field Deficiencies. J Chem Theory Comput 2016, 12 (9), 4534–4548. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Steinbrecher T; Latzer J; Case DA, Revised AMBER Parameters for Bioorganic Phosphates. J Chem Theory Comput 2012, 8 (11), 4405–4412. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Izadi S; Anandakrishnan R; Onufriev AV, Building Water Models: A Different Approach. J Phys Chem Lett 2014, 5 (21), 3863–3871. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Havrila M; Stadlbauer P; Islam B; Otyepka M; Sponer J, Effect of Monovalent Ion Parameters on Molecular Dynamics Simulations of G-Quadruplexes. J Chem Theory Comput 2017, 13 (8), 3911–3926. [DOI] [PubMed] [Google Scholar]
  • 66.Krepl M; Vogele J; Kruse H; Duchardt-Ferner E; Wohnert J; Sponer J, An intricate balance of hydrogen bonding, ion atmosphere and dynamics facilitates a seamless uracil to cytosine substitution in the U-turn of the neomycin-sensing riboswitch. Nucleic Acids Res 2018, 46 (13), 6528–6543. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Sponer J; Krepl M; Banas P; Kuhrova P; Zgarbova M; Jurecka P; Havrila M; Otyepka M, How to understand atomistic molecular dynamics simulations of RNA and protein-RNA complexes? Wiley Interdiscip Rev RNA 2017, 8 (3). [DOI] [PubMed] [Google Scholar]
  • 68.Salomon-Ferrer RC,AC; Walker RC, An overview of the Amber biomolecular simulation package. Wires Comput Mol Sci 2013, 3 (2), 198–210. [Google Scholar]
  • 69.Brooks BRB,RE; Olafson BD; States DJ; Swaminathan S, CHARMM: A program for macromolecular energy, minimization, and dynamics calculations. J Comput Chem 1983, Summer, 187–217. [Google Scholar]
  • 70.Weiner SJK,PA; Case DA; Singh UC; Ghio C; Alagona G; Profeta S; Weiner P, A new force field for molecular mechanical simulation of nucleic acids and peptids. J Amer Chem Soc 1984, 106 (3), 765–784. [Google Scholar]
  • 71.Nilsson LK,M, Empirical energy functions for energy minimization and dynamics of nucleic acids. . J Comput Chem 1986, October. [Google Scholar]
  • 72.Cornell WD; Cieplak P; Bayly CI; Gould IR; Merz KM; Ferguson DM; Spellmeyer DC; Fox T; Caldwell JW; Kollman PA, A second generation force field for the simulation of proteins, nucleic acids, and organic molecules J Amer Chem Soc 1995, 117 (19), 5179–5197. [Google Scholar]
  • 73.Cornell WD; Cieplak P; Bayly CI; Gould IR; Merz KM; Ferguson DM; Spellmeyer DC; Fox T; Caldwell JW; Kollman PA, A second generation force field for the simulation of proteins, nucleic acids, and organic molecules (vol 117, pg 5179, 1995). J Amer Chem Soc 1996, 118 (9), 2309–2309. [Google Scholar]
  • 74.Smith JCK, M., Empirical force field study of geometries and conformational transitions of some organic molecules. J Amer Chem Soc 1992, 114 (3), 801–812. [Google Scholar]
  • 75.Sponer J; Jurecka P; Hobza P, Accurate interaction energies of hydrogen-bonded nucleic acid base pairs. J Am Chem Soc 2004, 126 (32), 10142–10151. [DOI] [PubMed] [Google Scholar]
  • 76.Pokorna P; Kruse H; Krepl M; Sponer J, QM/MM Calculations on Protein-RNA Complexes: Understanding Limitations of Classical MD Simulations and Search for Reliable Cost-Effective QM Methods. J Chem Theory Comput 2018, 14 (10), 5419–5433. [DOI] [PubMed] [Google Scholar]
  • 77.Wang JM; Cieplak P; Kollman PA, How well does a restrained electrostatic potential (RESP) model perform in calculating conformational energies of organic and biological molecules? J Chem Theory Comput 2000, 21 (12), 1049–1074. [Google Scholar]
  • 78.Case DA; Betz RM; Cerutti DS; Cheatham TE; Darden TA; Duke RE; Giese TJ; Gohlke H; Goetz AW; Homeyer N; Izadi S; Janowski P; Kaus J; Kovalenko A; Lee TS; LeGrand S; Li P; Lin C; Luchko T; Luo R; Madej B; Mermelstein D; Merz KM; Monard G; Nguyen H; Nguyen HT; Omelyan I; Onufriev A; Roe DR; Roitberg A; Sagui C; Simmerling CL; Botello-Smith WM; Swails J; Walker RC; Wang J; Wolf RM; Wu X; Xiao L; P.A. K, AMBER 2016. San Francisco, 2016. [Google Scholar]
  • 79.Abraham MJ; Murtola T; Schulz R; Páll S; Smith JC; Hess B; Lindahl E, GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 2015, 1–2, 19–25. [Google Scholar]
  • 80.Case DA; Cheatham TE 3rd; Darden T; Gohlke H; Luo R; Merz KM Jr.; Onufriev A; Simmerling C; Wang B; Woods RJ, The Amber biomolecular simulation programs. J Comput Chem 2005, 26 (16), 1668–88. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Joung IS; Cheatham TE, Determination of alkali and halide monovalent ion parameters for use in explicitly solvated biomolecular simulations. J Phys Chem B 2008, 112 (30), 9020–9041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Besseova I; Banas P; Kuhrova P; Kosinova P; Otyepka M; Sponer J, Simulations of A-RNA Duplexes. The Effect of Sequence, Solute Force Field, Water Model, and Salt Concentration. J Phys Chem B 2012, 116 (33), 9899–9916. [DOI] [PubMed] [Google Scholar]
  • 83.Besseova I; Otyepka M; Reblova K; Sponer J, Dependence of A-RNA simulations on the choice of the force field and salt strength. Phys Chem Chem Phys 2009, 11 (45), 10701–10711. [DOI] [PubMed] [Google Scholar]
  • 84.Kuehrova P; Otyepka M; Sponer J; Banas P, Are Waters around RNA More than Just a Solvent? - An Insight from Molecular Dynamics Simulations. J Chem Theory Comput 2014, 10 (1), 401–411. [DOI] [PubMed] [Google Scholar]
  • 85.Hopkins CW; Le Grand S; Walker RC; Roitberg AE, Long-Time-Step Molecular Dynamics through Hydrogen Mass Repartitioning. J Chem Theory Comput 2015, 11, 1864–1874. [DOI] [PubMed] [Google Scholar]
  • 86.Bottaro S; Di Palma F; Bussi G, The role of nucleobase interactions in RNA structure and dynamics. Nucleic Acids Res 2014, 42 (21), 13306–13314. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Rodriguez A; Laio A, Clustering by fast search and find of density peaks. Science 2014, 344 (6191), 1492–1496. [DOI] [PubMed] [Google Scholar]
  • 88.Hall PH,J, A simple bootstrap method for constructing nonparametric confidence bands for functions. Ann Stat 2013, 41, 1892–1921. [Google Scholar]
  • 89.Hall PH,J; Jing BY, On blocking rules for the bootstrap withe dependent data. Biometrika 1995, 82, 561–574. [Google Scholar]
  • 90.Humphrey W; Dalke A; Schulten K, VMD: Visual molecular dynamics. J Mol Graph Model 1996, 14 (1), 33–38. [DOI] [PubMed] [Google Scholar]
  • 91.The PyMOL Molecular Graphics System, Version 2.0 .
  • 92.Sponer J; Bussi G; Stadlbauer P; Kuhrova P; Banas P; Islam B; Haider S; Neidle S; Otyepka M, Folding of guanine quadruplex molecules-funnel-like mechanism or kinetic partitioning? An overview from MD simulation studies. Biochim Biophys Acta 2017, 1861 (5 Pt B), 1246–1263. [DOI] [PubMed] [Google Scholar]
  • 93.Yang C; Kulkarni M; Lim M; Pak Y, Insilico direct folding of thrombin-binding aptamer G-quadruplex at all-atom level. Nucleic Acids Res 2017, 45 (22), 12648–12656. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Havrila M; Stadlbauer P; Kuhrova P; Banas P; Mergny JL; Otyepka M; Sponer J, Structural dynamics of propeller loop: towards folding of RNA G-quadruplex. Nucleic Acids Res 2018, 46 (17), 8754–8771. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Leontis NB; Westhof E, Geometric nomenclature and classification of RNA base pairs. Rna 2001, 7 (4), 499–512. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Banas P; Mladek A; Otyepka M; Zgarbova M; Jurecka P; Svozil D; Lankas F; Sponer J, Can We Accurately Describe the Structure of Adenine Tracts in B-DNA? Reference Quantum-Chemical Computations Reveal Overstabilization of Stacking by Molecular Mechanics. J Chem Theory Comput 2012, 8 (7), 2448–2460. [DOI] [PubMed] [Google Scholar]
  • 97.Hall K; Cruz P; Tinoco I Jr.; Jovin TM; van de Sande JH, ‘Z-RNA’--a left-handed RNA double helix. Nature 1984, 311 (5986), 584–586. [DOI] [PubMed] [Google Scholar]
  • 98.Zirbel CL; Sponer JE; Sponer J; Stombaugh J; Leontis NB, Classification and energetics of the base-phosphate interactions in RNA. Nucleic Acids Res 2009, 37 (15), 4898–918. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Nozinovic S; Furtig B; Jonker HRA; Richter C; Schwalbe H, High-resolution NMR structure of an RNA model system: the 14-mer cUUCGg tetraloop hairpin RNA. Nucleic Acids Res 2010, 38 (2), 683–694. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Yoo J; Aksimentiev A, New tricks for old dogs: improving the accuracy of biomolecular force fields by pair-specific corrections to non-bonded interactions. Phys Chem Chem Phys 2018, 20 (13), 8432–8449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Best RB; Zheng W; Mittal J, Balanced Protein-Water Interactions Improve Properties of Disordered Proteins and Non-Specific Protein Association. J Chem Theory Comput 2014, 10 (11), 5113–5124. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Nerenberg PS; Jo B; So C; Tripathy A; Head-Gordon T, Optimizing solute-water van der Waals interactions to reproduce solvation free energies. J Phys Chem B 2012, 116 (15), 4524–4534. [DOI] [PubMed] [Google Scholar]
  • 103.Miller MS; Lay WK; Li S; Hacker WC; An J; Ren J; Elcock AH, Reparametrization of Protein Force Field Nonbonded Interactions Guided by Osmotic Coefficient Measurements from Molecular Dynamics Simulations. J Chem Theory Comput 2017, 13 (4), 1812–1826. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Yoo J; Aksimentiev A, Refined Parameterization of Nonbonded Interactions Improves Conformational Sampling and Kinetics of Protein Folding Simulations. J Phys Chem Lett 2016, 7 (19), 3812–3818. [DOI] [PubMed] [Google Scholar]
  • 105.Huang J; Rauscher S; Nawrocki G; Ran T; Feig M; de Groot BL; Grubmuller H; MacKerell AD Jr., CHARMM36m: an improved force field for folded and intrinsically disordered proteins. Nat Methods 2017, 14 (1), 71–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Lay WK; Miller MS; Elcock AH, Optimizing Solute-Solute Interactions in the GLYCAM06 and CHARMM36 Carbohydrate Force Fields Using Osmotic Pressure Measurements. J Chem Theory Comput 2016, 12 (4), 1401–1407. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Berneche SR, B., Energetics of ion conduction through the K+ channel. Nature 2001, 414 (6859), 73–77. [DOI] [PubMed] [Google Scholar]
  • 108.Yoo JJ; Aksimentiev A, Improved Parametrization of Li+, Na+, K+, and Mg2+ Ions for All-Atom Molecular Dynamics Simulations of Nucleic Acid Systems. J Phys Chem Lett 2012, 3 (1), 45–50. [Google Scholar]
  • 109.Piana S; Donchev AG; Robustelli P; Shaw DE, Water dispersion interactions strongly influence simulated structural properties of disordered protein states. J Phys Chem B 2015, 119 (16), 5113–23. [DOI] [PubMed] [Google Scholar]
  • 110.Goody TA; Melcher SE; Norman DG; Lilley DM, The kink-turn motif in RNA is dimorphic, and metal ion-dependent. Rna 2004, 10 (2), 254–264. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 111.Huang LLDMJ, The kink-turn in the structural biology of RNA. Quarterly reviews of Biophysics 2018, 51 (E5). [DOI] [PubMed] [Google Scholar]
  • 112.Krepl M; Havrila M; Stadlbauer P; Banas P; Otyepka M; Pasulka J; Stefl R; Sponer J, Can We Execute Stable Microsecond-Scale Atomistic Simulations of Protein-RNA Complexes? J Chem Theory Comput 2015, 11 (3), 1220–1243. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

supporting information

RESOURCES