Skip to main content
RNA logoLink to RNA
. 2010 May;16(5):1053–1061. doi: 10.1261/rna.2008110

Induced fit or conformational selection for RNA/U1A folding

Fang Qin 1, Yue Chen 1, Maoying Wu 1, Yixue Li 2, Jian Zhang 3, Hai-Feng Chen 1,2
PMCID: PMC2856877  PMID: 20354153

Abstract

The hairpin II of U1 snRNA can bind U1A protein with high affinity and specificity. NMR spectra suggest that the loop region of apo-RNA is largely unstructured and undergoes a transition from unstructured to well-folded upon U1Abinding. However, the mechanism that RNA folding coupled protein binding is poorly understood. To get an insight into the mechanism, we have performed explicit-solvent molecular dynamics (MD) to study the folding kinetics of bound RNA and apo-RNA. Room-temperature MD simulations suggest that the conformation of bound RNA has significant adjustment and becomes more stable upon U1A binding. Kinetic analysis of high-temperature MD simulations shows that bound RNA and apo-RNA unfold via a two-state process, respectively. Both kinetics and free energy landscape analyses indicate that bound RNA folds in the order of RNA contracting, U1A binding, and tertiary folding. The predicted Φ-values suggest that A8, C10, A11, and G16 are key bases for bound RNA folding. Mutant Arg52Gln analysis shows that electrostatic interaction and hydrogen bonds between RNA and U1A (Arg52Gln) decrease. These results are in qualitative agreement with experiments. Furthermore, this method could be used in other studies about biomolecule folding upon receptor binding.

Keywords: mRNA splicing complex, RNA folding, binding, transition state, Φ-values

INTRODUCTION

The U1A protein is a component of the U1 small nuclear ribonucleoprotein (snRNP), which is one of five large RNA–protein complexes involved in most eukaryotic pre-mRNA splicing (Lührmann et al. 1990; Green 1991). It contains two RNA recognition motifs (RRMs) (Lu and Hall 1995). The N-terminal domain of U1A binds to hairpin II of U1 snRNA and two adjacent internal loops in the 3′ UTR with high affinity and specificity. The AUUGCAC of the conserved 7-nucleotide (nt) sequence of snRNA is the U1A protein binding site (Scherly et al. 1989; van Gelder et al. 1993).

The crystal structure of U1A and snRNA was released in 1994 (pdb code:1URN) (Oubridge et al. 1994). U1A is comprised of a βαββαβ sandwich fold that forms a four-stranded β-sheet flanked on one side by two α-helices. The 10-nt loop of the RNA hairpin binds to the protein as an open structure. The AUUGCAC sequence of the RNA loop fits into the groove between the β2–β3 loop and the C-terminal domain of protein. The last three nucleotides (UCC) in the loop of RNA have no apparent contact with the β-sheet (see Fig. 1A).

FIGURE 1.

FIGURE 1.

The structures of bound and apo states. (A) Ribbon representation of crystal structure for RNA-U1A (pdb code: 1URN). The location of main secondary structures is indicated. (B) The average structures of TSE for apo-RNA, bound RNA, and bound RNA–U1A (Arg52Gln). (C) Unfolding pathway of bound RNA. (a) <0 ns (F), (b) 1.38 ns (τQf), (c) 9.49 ns (τQb), (d) 11.78 ns (τRg), (e) >15 ns (U).

NMR experiments suggest that the chemical shifts of many protons for bound RNA are substantially different from those for apo-RNA, especially in the loop region of the RNA hairpin (Hall 1994). Furthermore, the hairpin loop of apo-RNA is largely unstructured (Oubridge et al. 1994). Then, RNA undergoes a transition from unstructured to well-folded upon U1A binding. However, the mechanism of RNA folding coupled binding is poorly understood. Fortunately, molecular dynamics (MD) simulation is a powerful tool for analyzing the structural and dynamic features of biomacromolecules. Several MD simulations of apo-RNA, apo-U1A, or their complex have been made to reveal the recognition mechanism and the conformational change (Reyes and Kollman 1999; Tang and Nilsson 1999; Blakaj et al. 2001; Pitici et al. 2002; Showalter and Hall 2005; Kormos et al. 2006, 2007; Zhao et al. 2006). The previous works suggest that the structural adaption of RNA and U1A obeys the induced fit mechanism (Koshland 1958; Tang and Nilsson 1999; Pitici et al. 2002; Showalter and Hall 2005). Here we focus on the RNA folding kinetics upon U1A binding. Our research intends to reveal a series of interesting questions: (1) How does the conformation of the RNA hairpin change upon U1A binding? (2) What is the difference in the folding pathway between bound RNA and apo-RNA? (3) Which mechanism does bound RNA folding obey? To shed light on these questions, we utilize atomic molecular dynamics simulation in explicit solvent to analyze the coupling mechanism between RNA folding and U1A binding (Onoa and Tinoco 2004).

However, all atomic MD simulations are currently restricted to the time scale of less than 1 μs, which is much shorter than the folding half-time of most biomacromolecules at room temperature (at least 1 ms) (Baker 1998; Fersht and Daggett 2002). Fortunately, the rate of unfolding can accelerate by approximately six orders of magnitude at high temperature (usually 498 K) (Shea and Brooks 2001), so most biomacromolecules unfold in the nanoscale time scale at this temperature (Fersht and Daggett 2002). Furthermore, experiments confirm that the transition state for folding and unfolding is expected to be the same from the principle of microscopic reversibility (Fersht and Daggett 2002). Based on these previous works, unfolding simulations at high temperature have been used in the current study.

In this work, we will discuss the folded states, the unfolding kinetics, the unfolding landscape, transition states, and the unfolding pathway for both bound RNA and apo-RNA to understand RNA folding coupled U1A binding.

RESULTS

Folded state

Previous work suggests that a small number of trajectories for MD simulation (5–10) are sufficient to capture the average properties of the protein (Day and Daggett 2005). Therefore, 10 trajectories of 10.0 ns each were simulated at 298 K to analyze the folded state of apo-RNA, apo-U1A, and their complex. To show the influence of wild-type (WT) and mutant U1A on the stability of the folded RNA, C5′ and Cα variations for bound and apo states are illustrated in Figure 2. The C5′ variations of bound RNA for wild type and mutant are significantly smaller than that of apo-RNA, especially in the hairpin–loop region. The Cα variations of bound U1A for wild type and mutant are also smaller than that of apo-U1A, especially in the loop region between β2 and β3 fitting into the RNA hairpin. This suggests that both bound RNA and U1A become less flexible and more stable upon binding, which is consistent with previous experiments (Hall 1994; Oubridge et al. 1994). In general, the angle parameters (α, β, γ, δ, ε, and ζ) are used to describe the secondary structure of RNA (Murray et al. 2003). Surprisingly, the fluctuations of α, β, γ, δ, ε, and ζ for bound RNA seem larger than that of apo-RNA (shown in Fig. 3). This suggests that the secondary structure of bound RNA changes disorders upon U1A binding. However, the Φ/ψ variation of bound U1A is similar to that of apo-U1A except in the loop between β2 and β3 and the C-terminal domain (shown in Supplemental Fig. 1S).

FIGURE 2.

FIGURE 2.

C5′ or Cα variation of bound and apo states.

FIGURE 3.

FIGURE 3.

α, β, γ, δ, ε, and ζ fluctuation for bound RNA and apo-RNA.

The sugar conformations of bound RNA and apo-RNA are illustrated in Figure 4. Our simulation suggests that the conformation of bases U8 and C10 is C2′-endo in the complex during a 10-ns simulation. This is consistent with experimental observation (Oubridge et al. 1994). Besides, the sugar conformations of bases C12 and U13 change to C3′-endo of bound RNA from C2′-endo of apo-RNA. This testifies that the sugar conformation of RNA also becomes more stable upon U1A binding.

FIGURE 4.

FIGURE 4.

The sugar pucker of bound RNA and apo-RNA (black: C3′-endo, gray: C2′-endo). (A) bound RNA; (B) apo-RNA.

The landscape of the distance difference for base pairs between bound RNA and apo-RNA is shown in Figure 5A. The landscape can reflect the relative conformational change of the RNA backbone. The deep gray area indicates that the distance difference for bases 6–8 and 12–14 is a positive value. This suggests that nucleotides 12–14 are stretched upon U1A binding. The French gray area represents that the distance difference is a negative value. This suggests that the stem region of bound RNA bends closer to the loop region, which is consistent with the alignment result of the average structure between bound RNA and apo-RNA (shown in Supplemental Fig. 2S). The distance landscape for residue pairs between bound and apo-U1A is shown in Figure 5B. The result shows that most regions of the U1A backbone did not undergo an obvious conformational change except in the β2–β3 loop and C-terminal region. The β2–β3 loop and C-terminal region might play key roles in the formation of RNA/U1A complex. This is consistent with experimental observation that the C-terminal region participates in base recognition by forming direct and water-mediated hydrogen bonds (Oubridge et al. 1994).

FIGURE 5.

FIGURE 5.

The landscape of distance difference between bound and apo state. (A) RNA; (B) U1A.

To study the drive force of the binding-induced conformer change in the folded state, the electrostatic, hydrophobic, and hydrogen-binding interactions between RNA and U1A were analyzed. The electrostatic interactions in the simulation of 10 trajectories are shown in Supplemental Figure 3S. There are 19 electrostatic interactions between positively charged amino acids and the phosphates of RNA, with populations higher than 60%. The positively charged residues, such as Lys20, Lys22, Lys23, Arg47, Lys50, Arg52, Arg83, Lys88, and Lys96, provide electrostatic interactions with the phosphates of the RNA stem and loop regions. Note, also, these positively charged residues are almost around the RNA (shown in Supplemental Fig. 4S). This suggests that the electrostatic interactions between the positively charged amino acids and the negative phosphates of the RNA are so important in the complex formation and its stability. Our simulations are consistent with the mutant experiment of Lys20, Lys22, and Lys23, in which substitutions of these residues resulted in a loss of RNA-binding affinity (Nagai et al. 1990; Law et al. 2006). The hydrophobic contacts in the simulation of 10 trajectories are also shown in Supplemental Figure 3S. Eight stable hydrophobic interactions can be found: C10/Try13, A11/Phe56, C10/Phe56, G16/Leu49, C12/Leu44, C10/Ala87, A11/Leu44, and U7/Leu49, with populations higher than 60%. These hydrophobic residues of Try13, Leu49, Phe56, and Ala87 are located on the antiparallel four-stranded β-sheet. Furthermore, U7 and G16 interact with Leu49 on the β2–β3 protein loop. This suggests that the RNA hairpin seems to clamp the protein loop. Supplemental Figure 3S also shows 13 hydrogen bonds with populations higher than 60%. These strong hydrogen bonds are between residues Asp92, Ser91, Asp90, Thr89, Lys88, Try86, Glu85, Arg52, and Asn16 and the Watson–Crick edges of A6, G9, C10, A11, C12, and G16. Remarkably, nearly all of the hydrophobic contacts and hydrogen bonds are relative to G9, C10, A11, C12, and G16 of the RNA hairpin, which suggests that these five nucleotides play key roles in RNA conformational adjustment. In summary, U1A binding introduced more electrostatic interactions, hydrophobic contacts, and hydrogen bonds at the interface, and these are responsible for the higher stability in bound RNA.

Unfolding kinetics

The fraction of native tertiary contacts (Qf) and native binding contacts (Qb) is used to monitor unfolding and unbinding kinetics, respectively. Time evolutions of Qb and Qf for bound RNA are shown in Figure 6. Figure 6 suggests that the tertiary unfolding and unbinding kinetics can be represented well by single exponential functions, indicating first-order kinetics in the NVT ensemble at 498 K. The fitted kinetics data are listed in Table 1. The kinetics analysis shows that the unbinding half-time is 9.49 ± 1.03 ns and the tertiary unfolding half-time is 1.38 ± 0.038 ns. This indicates that the tertiary unfolding is much faster than the unbinding for bound RNA. The time evolution of Qf for apo-RNA is shown in Figure 7. It is found that the tertiary unfolding of apo-RNA also obeys first-order kinetics, with a half-time of 1.10 ± 0.034 ns, which is faster than the tertiary unfolding and unbinding of bound RNA. The time evolution of the radius of gyration (Rg) for bound RNA is also presented in Figure 6. The extension of bound RNA obeys first-order kinetics, and the half-time of Rg is 11.77 ± 2.92 ns. This indicates that the extension of bound RNA is slower than the tertiary unfolding and unbinding. The time evolution of the radius of gyration (Rg) for apo-RNA is shown in Figure 8. The half-time of Rg is 2.94 ± 0.33 ns. This suggests that the tertiary unfolding and extension of bound RNA are postponed upon U1A binding.

FIGURE 6.

FIGURE 6.

Kinetics fitting for bound RNA.

TABLE 1.

Unfolding kinetics constants for bound RNA and apo-RNA

graphic file with name 1053tbl1.jpg

FIGURE 7.

FIGURE 7.

Kinetics fitting for apo-RNA.

FIGURE 8.

FIGURE 8.

Unfolding landscapes with respect to Qb, Rg, and Qf for bound RNA.

Unfolding landscape

To further understand the interdependence between the RNA folding and U1A binding, the unfolding landscape of bound RNA was analyzed with the variables Qf and Qb, as shown in Figure 8. The unfolding landscape shows that the tertiary unfolding proceeds first, while the binding contacts are held stable, and then is followed by the unbinding. This is in agreement with the unfolding kinetics analysis for bound RNA and suggests the formation of the tertiary contact depends on the formation of the binding interface.

The coupling between the tertiary unfolding and the extension of RNA is also investigated for both bound (shown in Fig. 8) and apo states (shown in Supplemental Fig. 5S). It was found that Qf decreases first, while the radius of gyration does not change, and then it is followed by the extension of RNA for both systems. This suggests that the extension of RNA is followed by the tertiary unfolding, and this is also consistent with the unfolding kinetics analysis.

Transition state

Kinetics analysis shows that the tertiary unfolding of bound RNA and apo-RNA obeys first-order kinetics. The tertiary unfolding of bound RNA–mutant U1A (Arg52Gln) also obeys first-order kinetics (shown in Supplemental Fig. 6S). This suggests that bound RNA, apo-RNA, and bound RNA–mutant U1A (Arg52Gln) unfold via a two-state process. This is consistent with previous results indicating that mechanical unfolding of the RNA hairpin is stochastic and obeys two-state dynamics (Hyeon and Thirumalai 2005). However, an intermediate state was found during the folding pathway of the RNA hairpin (Bowman et al. 2008). In this simulation, there was a transition state corresponding to the free energy maximum along the unfolding pathway. The structures of the free energy maxima comprise the transition state ensemble (TSE). TSE structures can either fold or unfold, and the transition probability (P) will be 50%. We have scanned MD snapshots for TSE structures in all unfolding trajectories for each of bound RNA, apo-RNA, and bound RNA–mutant U1A(Arg52Gln) (Pande and Rokhsar 1999). The transition probability curves were further fitted by the Boltzmann equation and are shown in Supplemental Figure 7S. The analysis yields 73 snapshots for the bound RNA, 113 snapshots for the apo-RNA, and 13 snapshots for the RNA–mutant U1A(Arg52Gln) TSE.

The average structures of bound RNA, apo-RNA, and bound RNA–U1A(Arg52Gln) TSE are shown in Figure 1B. There are five native contacts (G9/Asn15, C10/Tyr13, C10/Phe56, C10/Ala87, and A11/Phe56) for bound RNA TSE, while there is no native contact for apo-RNA TSE. For the TSE of bound RNA–mutant U1A (Arg52Gln), there are just two native contacts (U7/Glu19 and U8/Asn16). Apparently, it can be concluded that the TSE of bound RNA is more native-like than that of bound RNA–mutant U1A (Arg52Gln) and apo-RNA. This indicates that the relative folding activation free energy of bound RNA(ΔGbound) is probablly lower than that of apo-RNA (ΔGapo), leading to a relatively faster folding rate for bound RNA. This is consistent with the result that the tertiary unfolding time scale of bound RNA is larger than that of apo-RNA. Furthermore, the TSE of the bound RNA is different from that observed for the bound RNA–mutant U1A. This suggests that the mutation of U1A (Arg52Gln) significantly reduces the RNA affinity.

Φ-Value prediction

Φ-Values have been widely used to determine key residues in protein folding by theoretical and experimental investigations (Fersht et al. 1992; Fersht 2000; Fernandez-Escamilla et al. 2004; Sato and Fersht 2007). In this study, all of TSE structures were used to predict Φ-values for bound RNA and apo-RNA. The Φ-values are shown in Figure 9. In general, the Φ-values of bound RNA are significantly larger than those of apo-RNA. This suggests that there are more key bases in RNA folding upon U1A binding. Note also that the highest Φ-values are found for U8, C10, A11, and G16, suggesting that these bases play a key role in the folding of bound RNA. This is consistent with the structural analysis that the critical roles of U8, C10, A11, and G16 form electrostatic interactions, hydrophobic contacts, and hydrogen-bonding networks with U1A. These predicted Φ-values can be confirmed by experiment.

FIGURE 9.

FIGURE 9.

Predicted Φ-values of bound RNA and apo-RNA.

DISCUSSION

Comparison with experiment

The structural analysis suggests that Arg52 of U1A is the critical residue in stabilizing the complex (Oubridge et al. 1994). Our room-temperature simulation finds two stable electrostatic interactions for G16/Arg52 and C15/Arg52. Besides electrostatic interactions, there are also three stable hydrogen bonds for A6/Arg52, G9/Arg52, and G16/Arg52. The simulation of mutating Arg52 to Gln suggests that this mutant completely abolishes these stable hydrogen bonds with RNA bases and electrostatic interactions (shown in Supplemental Fig. 8S). This is in agreement with the mutational experiment in which Arg52Gln completely abolishes RNA binding (Nagai et al. 1990).

Furthermore, an X-ray experiment observed the hydrogen-bonding network between RNA and U1A. Hydrogen bonds, such as G9/Asn16, G9/Asn15, U8/Arg93, U8/Lys80, C10/Gln85, C10/Lys88, A11/Ser91, C12/Asp90, Thr89, and C12/Asp92, are also found in our simulation. Besides, there are also two marginally stable hydrogen bonds (U7/Glu19 and U8/Asn16) with a population around 50%. This is consistent with the observation of the X-ray experiment (Oubridge et al. 1994).

Finally, we predict Φ-values of RNA (shown in Fig. 9) and find that the Φ-values of U8, C10, A11, and G16 are higher than those of other residues for bound RNA. These results are also consistent with previous reports and can be confirmed by experiment.

Comparison with other simulations

The landscape of distance difference between bound RNA and apo-RNA suggests that RNA undergoes a significant conformational change upon U1A binding, especially in the loop region. This is consistent with the observations of the previous simulations (Tang and Nilsson 1999; Pitici et al. 2002; Showalter and Hall 2005).

Our simulation indicates that A8, C10, A11, and G16 are key bases for U1A binding and U13, C14, and C15 have no contact with U1A. This is consistent with the investigation of Tang and Nilsson (1999). Tang and Nilsson (1999). have mentioned that the sugar conformation of U7 changes from 3′-endo to 2′-endo during the simulation. This could be connected to the high mobility of U8, where purine keeps the C3′-endo pucker and pyrimidine prefers the C2′-endo pucker. Those are in accord with our results regarding the sugar conformational adjustment of U7.

Among the 19 stable electrostatic interactions, there were six between Lys20, Lys22, and Lys23 and the phosphates of RNA. This suggests that these positively charged residues are important for complex stability. This is in agreement with previous work showing that electrostatic interactions are important (Hermann and Westhof 1999; Law et al. 2006). In addition to electrostatic interactions, we also found two important hydrophobic contacts (A11/Phe56 and C10/Phe56). This is consistent with the simulation in which the Phe56Ala mutant decreased the affinity of RNA (Blakaj et al. 2001).

Convergence and sampling

Ten trajectories were simulated for bound RNA, apo-RNA, and apo-U1A. First, in this study, we intend to reveal if multiple trajectories are necessary. The populations of eight hydrophobic contacts and 13 hydrogen bonds in 10 trajectories are shown in Supplemental Figures 9S and 10S. The populations of three former hydrophobic contacts and seven hydrogen bonds are very similar among the 10 trajectories. However, the remainders have large fluctuations. If we just sample one or two simulations, some stable hydrophobic contacts and hydrogen bonds will be missed. Therefore, multiple simulations are necessary for this system. Second, we will check if 10 trajectories are sufficient for these systems. The average number of hydrophobic contacts and hydrogen bonds with standard error versus the number of trajectories is plotted in Supplemental Figure 11S. As the number of trajectories increases, the number of the hydrophobic contacts and hydrogen bonds with the standard error gradually changes. Finally, the average number and standard error almost remain constant. This is consistent with a previous report in which a small number of MD simulations (5–10) are sufficient to capture the average properties of a protein observed in experiment (Day and Daggett 2005).

Unfolding and folding pathways

Based on the unfolding kinetics and the unfolding landscape, the unfolding pathway for bound RNA can now be constructed, and it is shown in Figure 1C. (1) At the tertiary unfolding half-time, there are four out of 15 (folded state) native contacts within RNA. Most of the lost native contacts are within the stem region. The native binding contacts between RNA and U1A also start to disappear; still 11 out of 16 exist. (2) At the half-time of U1A unbinding, all of the native contacts within RNA have disappeared. There are four out of 16 native binding contacts remaining. The average time for visiting the transition state is between the half-time of the tertiary unfolding and the unbinding. (3) At the half-time of the extension for RNA, there are one out of 15 native contacts within RNA and five out of 16 between RNA and U1A. (4) For the unfolded state, there are 1 native contact within RNA and 5 binding contacts between RNA and U1A remaining. Interestingly, three native hydrophobic contacts of G9/Asn15, C10/Tyr13, and C10/Lys88 (see Fig. 1C, gray area) always exist during the unfolding of bound RNA. Furthermore, there are also two native contacts of G9/Asn15 and C10/Tyr13 included in the TSE of bound RNA. These base/residue pairs might be nuclei and play key roles in the folding of bound RNA. This is consistent with experimental observation (Oubridge et al. 1994).

Because the unfolding pathways of chymotrypsin inhibitor 2 and the engrailed homeodomain are confirmed to be the reverse of the folding ones (McCully et al. 2008; Day and Daggett 2007), we assume that the folding pathway of RNA also obeys the same rule. Therefore, the proposed folding/binding pathway of bound RNA is RNA contracting, U1A binding, and the tertiary folding. This suggests that U1A binding indeed induces the folding of RNA. The difference between the folding pathways of bound RNA and apo-RNA is the accelerated tertiary folding for RNA upon U1A binding. This is similar to the folding of the TIS11d/RNA system (Qin et al. 2009).

Induced fit or conformational selection

Two main models are used to explain the biomolecule folding coupled receptor binding (Boehr and Wright 2008). One is the “induced-fit” model (Koshland 1958), the other is “conformational selection” (Ma et al. 1999, 2002; Tsai et al. 1999, 2001; Kumar et al. 2000; Weikl and von Deuster 2009). If the bound conformation of the biomolecule exists prior to the receptor binding, the receptor will directly select bound conformation, otherwise it will adjust the biomolecule conformation before binding (Turjanski et al. 2008). Recently, a residual dipolar coupling experiment suggested that the folding of the ubiquitin complex obeys conformational selection, rather than the induced-fit mechanism (Lange et al. 2008). Nevertheless, the kinetics character for both mechanisms has also been observed in the same system (James and Tawfik 2003; Okazaki and Takada 2008). In this system, the folding pathway of bound RNA shows that U1A binding is prior to RNA folding. Furthermore, the average structure of the transition state ensemble includes two native binding contacts. These native contacts are favored in the formation of bound RNA conformation. Finally, the conformational clusters based on Cα RMSD, relative to the average structures of bound RNA and apo-RNA, are illustrated in Figure 10. In the current simulation, the Cα RMSD of bound RNA is between 1 Å and 3.5 Å; this suggests that bound RNA has a uniform binding conformer. The conformational distribution for apo-RNA is from 1.5 Å to 4.5 Å and also has a single peak. Moreover, our simulation does not observe the bound conformer in apo-RNA. This suggests that bound RNA conformation is formed only after U1A binding. In summary, the folding of bound RNA is favorable that it obeys an induced-fit mechanism (Pitici et al. 2002; Turner et al. 2005; Adilakshmi et al. 2008).

FIGURE 10.

FIGURE 10.

Cα RMSD conformational cluster relative to average structure.

CONCLUSION

Both 298 and 498K molecular dynamics simulations are performed for RNA and U1A in bound and apo states. The loop region of RNA becomes more rigid and stable upon U1A binding. This is consistent with experimental observation.

Unfolding landscape and unfolding kinetics analyses suggest that the unfolding order of bound RNA is tertiary unfolding, then U1A unbinding, and finally, RNA extension. The unfolding order of apo-RNA is tertiary unfolding, then RNA expanding. The difference of folding pathways between bound RNA and apo-RNA is the accelerated tertiary folding for RNA upon U1A binding. Unfolding kinetics and conformational cluster also indicate that the folding of bound RNA obeys an induced-fit mechanism.

The simulation results show that bound RNA and apo-RNA unfold via a two-state process. The transition probability was used to determine the TSE for bound RNA and apo-RNA. Transition state analysis suggests that the TSE of bound RNA is more native-like than that of apo-RNA. The predicted Φ-values suggest that U8, C10, A11, and G16 are key bases for bound RNA folding. Furthermore, this method could be used in other studies on biomolecule folding upon receptor binding.

MATERIALS AND METHODS

Room-temperature and high-temperature molecular dynamics simulations

The atomic coordinates of the snRNA–U1A complex were obtained from the crystal structure (pdb code: 1URN) (Oubridge et al. 1994). Hydrogen atoms were added using the LEAP module of AMBER8 (Case et al. 2004). Counter-ions were used to maintain system neutrality. All systems were solvated in a truncated octahedron box of TIP3P (Jorgensen et al. 1983) water with a buffer of 10 Å. Particle mesh Ewald (PME)(Darden et al. 1993) was employed to compute long-range electrostatic interactions with the default setting in AMBER8 (Case et al. 2004). A parm99 force field was used for intramolecular interactions (Wang et al. 2000; Lwin and Luo 2006). The SHAKE algorithm (Rychaert et al. 1977) was used to constrain the bonds involving hydrogen atoms. A 1000-step steepest descent minimization was performed to relieve any structural clash in the solvated systems. Then 20-ps simulation was performed by heating up and performing brief equilibration in the NVT ensemble at 298 K. Langevin dynamics with a time step of 2 fs was used in the heating and equilibration runs with a friction constant of 1 ps−1.

To study the folded state of each solvated system, 10 independent trajectories, 10.0 ns each, in the NPT ensemble (Berendsen et al. 1984) at 298 K were then simulated with PMEMD of AMBER8. To investigate the unfolding pathway of each solvated system, 10 independent trajectories of 20 ns each were performed for the RNA–U1A complexes in the NVT ensemble at 498 K. Ten independent trajectories, 10 ns each, were simulated for the apo-RNA and apo-U1A systems. To confirm the influence of a mutant for Arg52Gln, a 10-ns simulation was performed for the RNA–mutant U1A at 298 and 498K. A total of 720-ns trajectories were collected for four systems (the RNA–U1A complex, apo-RNA, apo-U1A, and RNA–mutant U1A) at both 298 and 498K, taking about 84,200 CPU hours on the xeon (3.0 GHz) cluster.

Native contacts between RNA and U1A were monitored to detect the beginning of the unfolded state. It was found that 15 ns at 498 K were needed to reach the equilibrium stage for the complex, so that the first 15 ns (a total of 150 ns for the complex) were used to study the unfolding kinetics, and the remaining 5 ns (a total of 50 ns for the complex) were used to monitor the unfolded state. Native contacts within apo-RNA itself were also monitored to detect the beginning of its unfolded state, so the first 8 ns were used to study the unfolding kinetics and the remaining 2 ns (a total of 20 ns for apo-RNA) were used to monitor the unfolded state.

Transition state simulations

According to the definition of the transition state (TS), 40 test MD runs for each candidate snapshot were performed to calculate the transition probability (P) (Pande and Rokhsar 1999; Gsponer and Caflisch 2002; Chong et al. 2005). All simulations had the same initial conformation, but differing initial velocities. TS simulations were done at 498 K to accelerate the simulated folding/unfolding rate. Each test trajectory was terminated when the conformation had reached the folded or unfolded state. The folded state is defined as the C5′ RMSD within 9.0 Å from the average structure of the folded state at 298 K. Up to a 1-ns simulation at 498 K was found to be sufficient for each test trajectory (Chen and Luo 2007; Chen 2008; Qin et al. 2009), i.e., P-values for tested snapshots were no longer changing when longer simulations were done.

Data analysis

Tertiary contact assignment was handled with in-house software (Chen and Luo 2007; Chen 2008, 2009a,b; Qin et al. 2009). These residues and nucleotides are in contact when their side chains are closer than 6.5 Å for the complex. The nonadjacent nucleotides are in contact when their bases are closer than 7.5 Å within RNA. Electrostatic interactions are assigned when the distance between the positive charge residue and the RNA phosphate backbone is less than 11 Å (Garcia-Garcia and Draper 2003). Secondary structure assignment was performed with DSSP (Kabsch and Sander 1983). RNA sugar conformation of C3′-endo and C2′-endo was mapped with R language. The unfolding kinetics was fitted in Origin 7.0. The unfolding landscape was performed by calculating the normalized probability from histogram analysis (Pande and Rokhsar 1999). Here we used the fraction of native tertiary contacts (Qf), the fraction of native binding contacts (Qb), and the radius of gyration (Rg) to map the unfolding landscapes. Representative structures at unfolding half-times were used to construct the unfolding pathways.

According to the definition of protein Φ-values, which were computed with a strategy similar to those used in other studies (Caflisch and Karplus 1994; Vendruscolo et al. 2001; Gsponer and Caflisch 2002), this method might be suitable for the calculation of RNA Φ-values. Therefore, the following equation was used to process the RNA Φ-values:

graphic file with name 1053equ1.jpg

where NiTS is the number of native contacts for base i at the transition state, and NiF and NiU are the number of native contacts for base i at the folded and unfolded states, respectively.

SUPPLEMENTAL MATERIAL

Supplemental material can be found at http://www.rnajournal.org.

ACKNOWLEDGMENTS

This work was supported by the Instrumental Analysis Center of Shanghai Jiaotong University; by the National Natural Science Foundation of China (Grant Nos. 30770502 and 20773085); by the Natural Science Foundation of Shanghai China (Grant No. 10ZR1414500); in part by grants from Ministry of Science and Technology China (2010CB833601); and by the National 863 High-Tech Program (2007DFA31040).

Footnotes

Article published online ahead of print. Article and publication date are at http://www.rnajournal.org/cgi/doi/10.1261/rna.2008110.

REFERENCES

  1. Adilakshmi T, Bellur DL, Woodson SA 2008. Concurrent nucleation of 16S folding and induced fit in 30S ribosome assembly. Nature 455: 1268–1272 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Baker D 1998. Metastable states and folding free energy barriers. Nat Struct Biol 5: 1021–1024 [DOI] [PubMed] [Google Scholar]
  3. Berendsen HJC, Postma JPM, van Gunsteren WF, DiNola A, Haak JR 1984. Molecular dynamics with coupling to an external bath. J Chem Phys 81: 3684–3690 [Google Scholar]
  4. Blakaj DM, McConnell KJ, Beveridge DL, Baranger AM 2001. Molecular dynamics and thermodynamics of protein–RNA interactions: Mutation of a conserved aromatic residue modifies stacking interactions and structural adaptation in the U1A-stem–loop 2 RNA complex. J Am Chem Soc 123: 2548–2551 [DOI] [PubMed] [Google Scholar]
  5. Boehr DD, Wright PE 2008. Biochemistry. How do proteins interact? Science 320: 1429–1430 [DOI] [PubMed] [Google Scholar]
  6. Bowman GR, Huang X, Yao Y, Sun J, Carlsson G, Guibas LJ, Pande VS 2008. Structural insight into RNA hairpin folding intermediates. J Am Chem Soc 130: 9676–9678 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Caflisch A, Karplus M 1994. Molecular-dynamics simulation of protein denaturation—solvation of the hydrophobic cores and secondary structure of barnase. Proc Natl Acad Sci 91: 1746–1750 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Case DA, Darden TA, Cheatham TE, Simmerling CL III, Wang J, Duke RE, Luo R, Merz KM, Wang B, Pearlman DA, et al. 2004. AMBER 8 University of California, San Francisco [Google Scholar]
  9. Chen HF 2008. Mechanism of coupled folding and binding in the siRNA–PAZ complex. J Chem Theory Comput 4: 1360–1368 [DOI] [PubMed] [Google Scholar]
  10. Chen HF 2009a. Aggregation mechanism investigation of the GIFQINS cross-β amyloid fibril. Comput Biol Chem 33: 41–45 [DOI] [PubMed] [Google Scholar]
  11. Chen HF 2009b. Molecular dynamics simulation of phosphorylated KID post-translational modification. PLoS One 4: e6516 doi: 10.1371/journal.pone.0006516 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Chen HF, Luo R 2007. Binding induced folding in p53-MDM2 complex. J Am Chem Soc 129: 2930–2937 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Chong LT, Snow CD, Rhee YM, Pande VS 2005. Dimerization of the p53 oligomerization domain: Identification of a folding nucleus by molecular dynamics simulations. J Mol Biol 345: 869–878 [DOI] [PubMed] [Google Scholar]
  14. Darden T, York D, Pedersen L 1993. Particle mesh Ewald: An N log(N) method for Ewald sums in large systems. J Chem Phys 98: 10089–10092 [Google Scholar]
  15. Day R, Daggett V 2005. Ensemble versus single-molecule protein unfolding. Proc Natl Acad Sci 102: 13445–13450 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Day R, Daggett V 2007. Direct observation of microscopic reversibility in single-molecule protein folding. J Mol Biol 366: 677–686 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Fernandez-Escamilla AM, Cheung MS, Vega MC, Wilmanns M, Onuchic JN, Serrano L 2004. Solvation in protein folding analysis: Combination of theoretical and experimental approaches. Proc Natl Acad Sci 101: 2834–2839 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Fersht AR 2000. Transition-state structure as a unifying basis in protein-folding mechanisms: Contact order, chain topology, stability, and the extended nucleus mechanism. Proc Natl Acad Sci 97: 1525–1529 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Fersht AR, Daggett V 2002. Protein folding and unfolding at atomic resolution. Cell 108: 573–582 [DOI] [PubMed] [Google Scholar]
  20. Fersht AR, Matouschek A, Serrano L 1992. The folding of an enzyme. I. Theory of protein engineering analysis of stability and pathway of protein folding. J Mol Biol 224: 771–782 [DOI] [PubMed] [Google Scholar]
  21. Garcia-Garcia C, Draper DE 2003. Electrostatic interactions in a peptide–RNA complex. J Mol Biol 331: 75–88 [DOI] [PubMed] [Google Scholar]
  22. Green MR 1991. Biochemical mechanisms of constitutive and regulated pre-mRNA splicing. Annu Rev Cell Biol 7: 559–599 [DOI] [PubMed] [Google Scholar]
  23. Gsponer J, Caflisch A 2002. Molecular dynamics simulations of protein folding from the transition state. Proc Natl Acad Sci 99: 6719–6724 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Hall KB 1994. Interaction of RNA hairpins with the human U1A N-terminal RNA binding domain. Biochemistry 33: 10076–10088 [DOI] [PubMed] [Google Scholar]
  25. Hermann T, Westhof E 1999. Simulations of the dynamics at an RNA–protein interface. Nat Struct Biol 6: 540–544 [DOI] [PubMed] [Google Scholar]
  26. Hyeon C, Thirumalai D 2005. Mechanical unfolding of RNA hairpins. Proc Natl Acad Sci 102: 6789–6794 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. James LC, Tawfik DS 2003. Conformational diversity and protein evolution—a 60-year-old hypothesis revisited. Trends Biochem Sci 28: 361–368 [DOI] [PubMed] [Google Scholar]
  28. Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML 1983. Comparison of simple potential functions for simulating liquid water. J Chem Phys 79: 926–935 [Google Scholar]
  29. Kabsch W, Sander C 1983. Dictionary of protein secondary structure: Pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22: 2577–2637 [DOI] [PubMed] [Google Scholar]
  30. Kormos BL, Baranger AM, Beveridge DL 2006. Do collective atomic fluctuations account for cooperative effects? Molecular dynamics studies of the U1A–RNA complex. J Am Chem Soc 128: 8992–8993 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Kormos BL, Baranger AM, Beveridge DL 2007. A study of collective atomic fluctuations and cooperativity in the U1A–RNA complex based on molecular dynamics simulations. J Struct Biol 157: 500–513 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Koshland DE 1958. Application of a theory of enzyme specificity to protein synthesis. Proc Natl Acad Sci 44: 98–104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Kumar S, Ma B, Tsai CJ, Sinha N, Nussinov R 2000. Folding and binding cascades: Dynamic landscapes and population shifts. Protein Sci 9: 10–19 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Lange OF, Lakomek NA, Fares C, Schroder GF, Walter KF, Becker S, Meiler J, Grubmuller H, Griesinger C, de Groot BL 2008. Recognition dynamics up to microseconds revealed from an RDC-derived ubiquitin ensemble in solution. Science 320: 1471–1475 [DOI] [PubMed] [Google Scholar]
  35. Law MJ, Linde ME, Chambers EJ, Oubridge C, Katsamba PS, Nilsson L, Haworth IS, Laird-Offringa IA 2006. The role of positively charged amino acids and electrostatic interactions in the complex of U1A protein and U1 hairpin II RNA. Nucleic Acids Res 34: 275–285 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Lu J, Hall KB 1995. An RBD that does not bind RNA: NMR secondary structure determination and biochemical properties of the C-terminal RNA binding domain from the human U1A protein. J Mol Biol 247: 739–752 [DOI] [PubMed] [Google Scholar]
  37. Lührmann R, Kastner B, Bach M 1990. Structure of spliceosomal snRNPs and their role in pre-mRNA splicing. Biochim Biophys Acta 1087: 265–292 [DOI] [PubMed] [Google Scholar]
  38. Lwin TZ, Luo R 2006. Force field influences in β-hairpin folding simulations. Protein Sci 15: 2642–2655 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Ma B, Kumar S, Tsai CJ, Nussinov R 1999. Folding funnels and binding mechanisms. Protein Eng 12: 713–720 [DOI] [PubMed] [Google Scholar]
  40. Ma B, Shatsky M, Wolfson HJ, Nussinov R 2002. Multiple diverse ligands binding at a single protein site: A matter of pre-existing populations. Protein Sci 11: 184–197 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. McCully ME, Beck DA, Daggett V 2008. Microscopic reversibility of protein folding in molecular dynamics simulations of the engrailed homeodomain. Biochemistry 47: 7079–7089 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Murray LJ, Arendall WB 3rd, Richardson DC, Richardson JS 2003. RNA backbone is rotameric. Proc Natl Acad Sci 100: 13904–13909 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Nagai K, Oubridge C, Jessen TH, Li J, Evans PR 1990. Crystal structure of the RNA-binding domain of the U1 small nuclear ribonucleoprotein A. Nature 348: 515–520 [DOI] [PubMed] [Google Scholar]
  44. Okazaki K, Takada S 2008. Dynamic energy landscape view of coupled binding and protein conformational change: Induced-fit versus population-shift mechanisms. Proc Natl Acad Sci 105: 11182–11187 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Onoa B, Tinoco I Jr 2004. RNA folding and unfolding. Curr Opin Struct Biol 14: 374–379 [DOI] [PubMed] [Google Scholar]
  46. Oubridge C, Ito N, Evans PR, Teo CH, Nagai K 1994. Crystal structure at 1.92 A resolution of the RNA-binding domain of the U1A spliceosomal protein complexed with an RNA hairpin. Nature 372: 432–438 [DOI] [PubMed] [Google Scholar]
  47. Pande VS, Rokhsar DS 1999. Molecular dynamics simulations of unfolding and refolding of a β-hairpin fragment of protein G. Proc Natl Acad Sci 96: 9062–9067 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Pitici F, Beveridge DL, Baranger AM 2002. Molecular dynamics simulation studies of induced fit and conformational capture in U1A–RNA binding: Do molecular substates code for specificity? Biopolymers 65: 424–435 [DOI] [PubMed] [Google Scholar]
  49. Qin F, Chen Y, Li YX, Chen HF 2009. Induced fit of mRNA–TIS11D complex. J Chem Phys 131: 115103 doi: 10.1063/1.3224126 [DOI] [PubMed] [Google Scholar]
  50. Reyes CM, Kollman PA 1999. Molecular dynamics studies of U1A–RNA complexes. RNA 5: 235–244 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Rychaert JP, Ciccotti G, Berendsen HJC 1977. Numerical integration of Cartesian equations of motion of a system with constraints: Molecular dynamics of n-alkanes. Comput Phys 23: 327–341 [Google Scholar]
  52. Sato S, Fersht AR 2007. Searching for multiple folding pathways of a nearly symmetrical protein: Temperature dependent ϕ-value analysis of the B domain of protein A. J Mol Biol 372: 254–267 [DOI] [PubMed] [Google Scholar]
  53. Scherly D, Boelens W, van Venrooij WJ, Dathan NA, Hamm J, Mattaj IW 1989. Identification of the RNA binding segment of human U1 A protein and definition of its binding site on U1 snRNA. EMBO J 8: 4163–4170 [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Shea JE, Brooks CL 3rd 2001. From folding theories to folding proteins: A review and assessment of simulation studies of protein folding and unfolding. Annu Rev Phys Chem 52: 499–535 [DOI] [PubMed] [Google Scholar]
  55. Showalter SA, Hall KB 2005. Correlated motions in the U1 snRNA stem–loop 2:U1A RBD1 complex. Biophys J 89: 2046–2058 [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Tang Y, Nilsson L 1999. Molecular dynamics simulations of the complex between human U1A protein and hairpin II of U1 small nuclear RNA and of free RNA in solution. Biophys J 77: 1284–1305 [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Tsai CJ, Ma B, Nussinov R 1999. Folding and binding cascades: Shifts in energy landscapes. Proc Natl Acad Sci 96: 9970–9972 [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Tsai CJ, Ma B, Sham YY, Kumar S, Nussinov R 2001. Structured disorder and conformational selection. Proteins 44: 418–427 [DOI] [PubMed] [Google Scholar]
  59. Turjanski AG, Gutkind JS, Best RB, Hummer G 2008. Binding-induced folding of a natively unstructured transcription factor. PLoS Comput Biol 4: e1000060 doi: 10.1371/journal.pcbi.1000060 [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Turner B, Melcher SE, Wilson TJ, Norman DG, Lilley DM 2005. Induced fit of RNA on binding the L7Ae protein to the kink-turn motif. RNA 11: 1192–1200 [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. van Gelder CW, Gunderson SI, Jansen EJ, Boelens WC, Polycarpou-Schwarz M, Mattaj IW, van Venrooij WJ 1993. A complex secondary structure in U1A pre-mRNA that binds two molecules of U1A protein is required for regulation of polyadenylation. EMBO J 12: 5191–5200 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Vendruscolo M, Paci E, Dobson CM, Karplus M 2001. Three key residues form a critical contact network in a protein folding transition state. Nature 409: 641–645 [DOI] [PubMed] [Google Scholar]
  63. Wang JM, Cieplak P, Kollman PA 2000. How well does a restrained electrostatic potential (RESP) model perform in calculating conformational energies of organic and biological molecules? J Comput Chem 21: 1049–1074 [Google Scholar]
  64. Weikl TR, von Deuster C 2009. Selected-fit versus induced-fit protein binding: Kinetic differences and mutational analysis. Proteins 75: 104–110 [DOI] [PubMed] [Google Scholar]
  65. Zhao Y, Kormos BL, Beveridge DL, Baranger AM 2006. Molecular dynamics simulation studies of a protein–RNA complex with a selectively modified binding interface. Biopolymers 81: 256–269 [DOI] [PubMed] [Google Scholar]

Articles from RNA are provided here courtesy of The RNA Society

RESOURCES