Abstract
Although group II intron ribozymes are intensively studied the question how structural dynamics affects splicing catalysis has remained elusive. We report for the first time that the group II intron domain 6 exists in a secondary structure equilibrium between a single- and a two-nucleotide bulge conformation, which is directly linked to a switch between sugar puckers of the branch site adenosine. Our study determined a functional sugar pucker equilibrium between the transesterification active C2′-endo conformation of the branch site adenosine in the 1nt bulge and an inactive C3′-endo state in the 2nt bulge fold, allowing the group II intron to switch its activity from the branching to the exon ligation step. Our detailed NMR spectroscopic investigation identified magnesium (II) ions and the branching reaction as regulators of the equilibrium populations. The tuneable secondary structure/sugar pucker equilibrium supports a conformational selection mechanism to up- and downregulate catalytically active and inactive states of the branch site adenosine to orchestrate the multi-step splicing process. The conformational dynamics of group II intron domain 6 is also proposed to be a key aspect for the directionality selection in reversible splicing.
INTRODUCTION
Group II introns are catalytically active RNAs with self-splicing activity (1,2). In the branching pathway, a lariat RNA is formed, which resembles lariats formed during splicing of nuclear immature mRNA. This similarity in the reaction pathway suggests that the spliceosome might originate from the older group II introns further supported by an akin structure of domain 5 (D5) in group II introns and the U6/U2 snRNA. Group II introns take influence on gene expression in all domains of life and significant efforts were undertaken to get detailed insights into the chemistry of the ribozyme. X-ray crystallographic studies gave rich information on the intricate three-dimensional architecture of group II introns and the key residues- the conserved catalytic C358-G359-C360 triad and the 2-nucleotide (nt) A376-C377 bulge in D5 and the A287-G288-C289 J2/3 junction along with two catalytically important divalent metal ions (3) – for the second exon ligation step were identified (4–15). Recently, Costa and co-workers succeeded to crystallize a chimeric Oceanobacillus iheyenis/Azotobacter vinelandii (Oc19) group II intron from the IIC subclass, in which the branching and not the hydrolysis pathway was activated (Figure 1A and B) (16,17). The lariat intron was obtained and the X-ray structure gave interesting insights into the branch site arrangement and thus into the first step of group II intron catalysis. Based on extensive comparative analysis the branch-site adenosine (A) in domain 6 (D6) of the group II intron was presumed to form a 1nt bulge structure (Figure 1C, fold 1B) (18,19). In the recent crystal structure of Costa and co-workers, however, a basal 3-base pair (bp) stem with the branch point A in a 2nt bulge was consistent with the electron density maps (Figure 1C, fold 2B). Such a 2nt bulge arrangement in D6 was earlier reported in an X-ray crystal structure of a trans-active 70 nucleotide D56 construct of the ai5γ group II intron (20). The 2nt bulge is speculated to be the favoured conformational arrangement to promote the second step, the exon ligation. The structure and dynamics of domain 6 and the branching site adenosine were further studied by solution NMR spectroscopy with the main focus on the metal ion binding behaviour (21–23). These studies, however, used D6 motifs focusing mainly on the branch site A containing bulge with shortened stems and thus a crucial aspect in the dynamic behaviour of domain 6 was overlooked. Using NMR spectroscopy, we addressed the secondary structure rearrangement in a group II intron D56 construct (Figure 1D). In a 61nt D56 RNA, an associated secondary structure and sugar pucker equilibrium was observed, which was found again in several isolated D6 constructs from other group II intron subclasses. The reversible secondary structure/sugar pucker equilibrium in D6 unveiled by NMR spectroscopy supports a conformational selection mechanism in group II intron splicing. The tuneable conformational equilibrium, which can be influenced by external cues, such as magnesium ions, serves as a conformational pool to guarantee the availability of active and inactive states during the multistep catalysis and also to switch the directionality of the splicing process.
MATERIALS AND METHODS
Sample preparation
Standard 2′-O-TBDMS RNA phosphoramidites (rAAc, rCAc, rGAc and rU, Chemgenes, USA) were used in combination with in-house synthesized 13C-, 15N- and 2H-modified RNA phosphoramidites. The synthetic access to the stable isotope labelled phosphoramidites was earlier described (24–27). Controlled pore glass (CPG) RNA solid support (1000 Å pore size, ChemGenes, USA) with an average loading of 40 μmol g−1 was used to synthesize the RNAs on an ABI 391 PCR Mate using a self-written RNA synthesis cycle. Amidite (0.1 M) and activator (5-benzylthio-1H-tetrazole, 0.25 M) solutions were dried over freshly activated molecular sieves (3 Å) for at least 48 h. The following reagent mixtures were used: Cap A: acetic anhydride/lutidine/tetrahydrofuran 1/1/8 (v/v/v). Cap B: tetrahydrofuran/N-methylimidazole 86/16 (v/v). Oxidation solution: 500 mg iodine dissolved in a mixture of 70 ml THF, 20 ml pyridine and 10 ml water. Detritylation solution: 4% dichloroacetic acid in anhydrous toluene. After complete RNA synthesis, the solid support was dried in 1 h in high vacuum. Standard alkaline deprotection: 650 μl aq. methylamine solution (40%) and 650 μl aq. ammonia solution (28–30%) were added to the solid support. The reaction tube was shaken vigorously and incubated at 37°C for 6 h. The solid support was pelleted via centrifugation and the supernatant was transferred to a 10 ml round bottom flask. The remaining solid support was washed three times with a mixture of THF/water (1/1), the liquid phases were combined with the first filtrate and evaporated to dryness. The residual white precipitate was dried in high vacuum for at least 1 h. 2′-O-TBDMS deprotection: The residue from the previous step was dissolved in 300 μl anhydrous dimethylsulfoxide, and triethylamine trihydrofluoride was added (50 eq. per TBDMS group) and the deprotection mixture incubated at 37°C for at least 16 h. Then, the deprotection mixture was quenched with 2 ml quenching buffer (GlenResearch, USA) and directly applied to a HiPrep 26/10 desalting column (GE Healthcare, Austria) using a ÄKTA start system (GE Healthcare, Austria). The crude RNA was eluted using HPLC grade water and the RNA containing fractions (UV detection at 254 nm) were collected in a 50 ml round bottom flask. After evaporation, the crude RNA was dissolved in 1 ml HPLC grade water and transferred to a 1.5 ml reaction tube. The crude RNA was stored at −20°C. The quality of the crude RNAs was checked via anion exchange chromatography on an analytical Dionex DNAPac PA-200 column (4 × 250 mm; Eluent A: 25 mM Tris−HCl, 6 M urea, pH 8.0; Eluent B: 25 mM Tris−HCl, 500 mM sodium perchlorate, 6 M urea, pH 8.0) and at elevated temperature (80°C). Purification of the RNA sequences was achieved in a single run by applying the crude RNA on a preparative Dionex DNAPac PA-200 column (22 × 250 mm, eluents as before). The fractions containing the desired RNA were pooled and loaded on a C18 SepPak catridge (Waters, Austria) to remove HPLC buffer salts. The RNA sodium salt form was then eluted from the C18 column with water/acetonitrile (1/1, v/v), concentrated and transferred to a 1.5 ml reaction tube for concentration determination and mass spectrometric analysis. Sample concentrations were determined by measuring UV absorption at 260 nm on a NanoPhotometer (Implen, Germany).
Synthetic access to the 2′-photolabile protected adenosine building block
The photolabile α-methyl-o-nitrophenyl (NPE) protecting group was attached to the 2′-hydroxyl of adenosine, which was then transformed into a phosphoramidite building block suitable for solid phase RNA synthesis. In the first step, the NPE group was attached to adenosine 2′-OH and 3′-OH under alkaline conditions. The resulting mixture of 2′- and 3′-substituted products was successfully separated via silica column chromatography after the introduction of a phenoxyacetyl moiety at the exocyclic amino group. Subsequently, the 2′-and N6-protected nucleoside was transformed to its 5′-O-4,4′-dimethoxytrityl derivative under standard conditions before the 2-cyanoethyl-N,N-diisopropyl phosphoramidite function was introduced to yield the 2′-O-photolabile protected adenosine building block 4 for RNA solid phase synthesis. A detailed description for the preparation can be found in the supplementary information (Supplementary Scheme S1).
Synthesis of the branched RNA mimic
For branched RNA synthesis, the 2′-photolabile NPE protected adenosine building block was used in combination with standard RNA phosphoramidites as described in the sample preparation section. Prior to the attachment of nucleotides at the 2′-oxygen of the branch site A23, the 2-cyanoethyl protection groups of the phosphodiester backbone were removed from the solid support bound RNA with 10% diethylamine in acetonitrile (20 ml with a flowrate of 3 ml min−1), followed by washing with 50 ml acetonitrile and drying the solid support in high vacuum for 1h. This step is crucial to avoid a strand cleavage after the removal of the photolabile NPE protecting group. The solid support bound RNA was suspended in 2 ml methanol in a UV cuvette and constantly shaken while exposed to UV light (wavelength 260 nm) for 5 h at room temperature. Then, the solid support was washed 3× with 5 ml methanol, and 3× with 10 ml dry acetonitrile, before it was again connected to the RNA synthesizer. The additional nucleotides were added to the A23 2′-oxygen using commercially available reverse direction RNA phosphoramidites (Chemgenes, USA) giving the desire 2′-5′-phosphodiester linkage in the branched RNA. Concentrated reverse amidite solutions (0.3 M) were dried over freshly activated molecular sieves (3 Å) for at least 48 h. The coupling time for the reverse amidites was increased to 8 min. For capping, oxidation and deblocking the standard reagents described in the sample preparation section were used. The branched RNA was deprotected and purified according to the previously described protocol.
RNA LC–MS analysis
All RNAs were analyzed on Finnigan LCQ Advantage MAX ion trap instrumentation connected to a Thermo Scientific UHPLC (components: Ultimate 3000 RS Pump, Ultimate 3000 RS Autosampler, Ultimate 3000 RS Column Compartment, Ultimate 3000 Diode Array Detector). RNA mass spectra were acquired in the negative-ion mode with a potential of −4 kV applied to the spray needle (capillary temperature: 270°C, capillary voltage: −2−3 V). LC: 250 pmol RNA dissolved in 30 μl of 20 mM ethylenediaminetetraacetic acid (EDTA) solution; average injection volume: 30 μl; column: Waters xBridge C18 2.5 μm column (1.0 × 50 mm) at 30°C; flow rate: 100 μl/min; eluent A: 8.6 mM triethylamine (TEA), 100 mM 1,1,1,3,3,3-hexafluoroisopropanol in H2O (pH 8.0); eluent B: methanol; gradient: 0–100% B in A within 30 min; UV detection at 260/280 nm. The correct assembly of all RNAs used in this study was confirmed by the mass data.
NMR spectroscopy
RNA samples were lyophilized as sodium salts and dissolved in 280 μl NMR buffer (15 mM sodium phosphate, 25 mM NaCl, 0.1% NaN3, pH 6.9) and transferred into restricted volume Shigemi tubes giving 0.7–1 mM sample concentrations. NMR experiments on 13C- and 15N-modified RNA sequences were conducted either on a Bruker 600 MHz Avance II+ NMR with a Prodigy TCI probe or on Bruker 700 MHz Avance Neo system with a QXI (1H/13C/15N/31P) room temperature probe. The 2D 1H–13C correlation spectra were acquired using either a 1H–13C-BEST TROSY pulse sequence or a standard HMQC Bruker pulse program (hmqcetgpsi.2). The 2D 1H–15N correlation spectra were acquired using a 1H–15N-SOFAST HMQC pulse sequence. The fold-specific Watson Crick base pairs were determined using a HNN–COSY experiment from the Bruker pulse sequence library (na_hnncosygpphspwg). The 13C-longitudinal exchange data for the D56 13C1′-modified RNA was determined using an alternative approach combining a T1 and a longitudinal exchange NMR experiment (28). Arrays of T1 and ZZ exchange spectra were recorded at 14.1 T at 25°C with mixing times of 5, 10, 20, 30, 40, 50, 60, 70, 80, 90 and 100 ms. The size of the data matrices for each spectrum was 1024*96 complex data points, the number of scans was 96 and the inter-scan delay was 1.25 s to yield a total measuring time of 84 h for eleven T1 experiments (42h) and eleven ZZ exchange (42 h) experiments, respectively.
13C longitudinal exchange data fitting
Peak intensities at the various mixing periods were obtained by summing over 1 × 1 (f1 × f2) data points using the nmrpipe software package (29). Then, a least square fitting procedure was applied to determine k1B→2B, k2B→1B and R11B and R12B by fitting the following expressions in MATLAB:
and
with MAA and MBB are the intensities of the correlation peaks from the ZZ exchange experiment. These intensities are normalized by the peak volumes at zero mixing time (MAA0 and MBB0). MA(0)/MB(0) and MA(t)/MB(t) are the peak intensities of fold A and B at zero mixing time or the mixing time t from the T1 experiment. The eigenvalues of the spin density matrix with magnetization transfer effects from chemical exchange are given by
with the elements aij defined as:
Parameters k1B→2B and k2B→1B describe the refolding rate constants describing the interconversion between the two folding states (1B and 2B) of the D56 RNA, while R11B and R12B are the longitudinal relaxation rates in conformation 1B and 2B, respectively. Experimental uncertainties were estimated from 1000 Monte Carlo runs in which synthetic data sets were generated from the best-fit values of k1B→2B, k2B→1B and R11B, R12B by adding random errors based on the signal-to-noise ratio to the best-fit curves.
Sugar pucker determination of U12 in 1B and 2B and A23 in 1B fold
To address the sugar pucker in the 1B and 2B fold we adapted the H(N)HA experiment originally proposed for the determination of the three bond scalar coupling constant between the amide proton and the α-proton in proteins (30). The experiment was run as the 13C version—termed H1′(C1′)H2′ experiment—on the two 13C1′-labelled D56 RNA mutants. In detail, the experiment was run on 1 mM 1B and 2B mutant RNA samples in restricted volume Shigemi tubes in a 2D manner without 13C chemical shift evolution. The data matrix for each spectrum was 1024 × 96 complex data points with 12 ppm spectral width in both proton dimension, the number of scans was 384 and the inter-scan delay was 1 s to yield a total measuring time of 12 h per spectrum. The cross peak/diagonal peak ratio of U12 in both folds and A23 in the 1B fold provide a direct measure of the 3JH1’H2’ according to the equation:
with Scross crosspeak intensity, Sdiag crosspeak intensity, JHH the 3-bond H1′–H2′ scalar coupling constant and the ζ delay set to 13.05 ms. The cross- and diagonal peak intensities were obtained by using the nmrpipe nlinLS peak fitting procedure using Lorentzian peak fitting (29).
Melting curve analysis of the 61nt D56 constructs
The RNAs were lyophilized and then dissolved in 800 μl (1 cm cuvettes) or 300 μl (1 mm cuvettes) melting curve buffer (10 mM sodium phosphate, pH 7.0, 150 mM NaCl) to give RNA concentrations of 2 and 5 μM, respectively. Absorbance versus temperature profiles were recorded at 250 and 260 nm on a Cary-100 spectrophotometer equipped with a multiple cell holder and a Peltier temperature-control device. Data were collected for five heating-cooling cycles at a rate of 0.7°C/minute. Melting transitions were essentially the same with respect to the two different wavelengths and heating-cooling cycles. Melting point temperatures are reported as mean value of the five measurements. The thermodynamic parameters of the monomolecular melting processes of the D56 1B and 2B mutants were obtained by plotting the association degree α versus temperature and fitting the experimental data in KaleidaGraph (Synergy Software) using the following equation:
with α association degree and R ideal gas constant to give the enthalpy ΔH° and the entropy ΔS° of the melting transition at 298 K. For all RNAs- the wildtype D56 RNA, the 1B mutant RNA and the 2B mutant RNA-concentration independent melting points indicative for monomolecular folds were observed (Supplementary Figures S13, S14 and Table S3).
RESULTS
A secondary structure equilibrium in a D56 Oc19 RNA construct with a hierarchical kinetic network
Group II introns are composed of six domains with D5 forming extensive interactions with D1 to shape the catalytic center for the second exon ligation step (Supplementary Figure S1). The branch-point adenosine resides in D6, whose 2′-hydroxyl functionality serves as the nucleophile in the branching reaction. An RNA comprising only D5 and D6 is able to catalyse branched RNA formation and the release of the 5′-exon, when added in trans to a D1 RNA (20). Thus, we decided to study the folding behavior of D6 in a D56 sequence spanning a 61-nucleotide region of the group II intron Oc19 intron using solution NMR spectroscopy (Figure 2a). Five residues—C18, C19, U22, A23 and U24—of the D56 RNA were replaced by the 13C-labeled C1′ counterparts. First, we acquired an imino proton spectrum of this D56 RNA and found a complex signal pattern (Figure 2B). Even more surprisingly, instead of five 1H–13C correlations in a heteronuclear multi quantum correlation (HMQC) spectrum nine peaks with strongly varying intensities were observed (Figure 2C). The resonance assignment procedure revealed that the D56 RNA construct is slowly exchanging between two competing secondary structures—the 1B fold and the 2B fold— on the chemical shift time scale resulting in two resonances for each peak but C18 (Figure 2C).
By making extensive use of 13C and 15N-isotope labeling via solid phase synthesis the secondary structure equilibrium between the 1B fold with A23 in the bulged conformation and a 4-base pair (bp) basal stem and the 2B fold with a A23-U24 two nucleotide bulge and a 3-bp basal stem could be verified. All the residues, which were replaced by the 13C or 15N labeled counterparts in the course of this study, are shown in the supporting information (Supplementary Figure S2). For example, the G7-C20 base pair in 1B and the co-existing G7-C19 of 2B could be unambiguously verified by HNN-COSY experiments further confirming the secondary structure equilibrium in D6 (Supplementary Figure S3). The populations of the 1B and 2B fold could be estimated by peak integration and a slight preference for 1B with 60% was found. The secondary structure equilibrium between 1B and 2B was further characterized by 13C-longitudinal exchange NMR spectroscopy and a hierarchical kinetic network was found (Figure 2C and D, Supplementary Figure S4 and S5, Supplementary Table S1). The highest exchange rate, kex, as the sum of forward (kf) and backward (kB) exchange rate, was found for residue U24 with 20.89 ± 0.91 s−1, followed by A23 (15.24 ± 0.57 s−1) and the exchange rate decreases steadily approaching the cUUCGg tetraloop (U22, kex = 10.11 ± 0.47 s−1 and C19, kex = 7.88 ± 0.39 s−1). The same rate differentiated kinetics was found for a D56 construct with aromatic nucleobase 6–13C- and 8–13C-labels on U12, U13, U24, A4 and A23 (Supplementary Figure S4). This hierarchical kinetic network suggests that in both the forward and the backward folding pathway the one or two nucleotide bulge regions, respectively, serve as the starting point for the refolding process. Based on the rate constants, the folding from 1B to 2B is initiated by the formation of the A4–U22 base pair and the concomitant formation of the A23–U24 2nt bulge, followed by the base pair register change of the central G–C rich tract and completed by the base pairing of G8 and C18. In a similar way, refolding from 2B to 1B starts by the formation of the A4–U24 and G5•U22 base pairs again followed by base pair register change of the central G–C rich tract and is finished by the formation of C9-C18 symmetric bulge (Supplementary Figure S6).
Noteworthy, a highly modular structure of the group II intron is confirmed by our NMR data as the secondary structure equilibrium is preserved in a 27nt D6 only construct. Further, chemical shifts of the imino proton and 1H–13C correlations of D6 residues are virtually identical in the 61nt and 27nt RNA indicating no or very weak tertiary interactions between the two domains. The equilibrium between the 1B- and 2B-fold is found in this shorter D6 RNA with unchanged equilibrium populations and similar kinetics (Supplementary Figure S7, Table S1).
Shifting the secondary structure equilibrium
The experiments so far focused on the equilibrium between 1B and 2B in the pre-catalytic state, where the slightly favored 1B-fold is presumed to be the active species in forming the branched RNA. However, during the multi-step catalytic process the group II intron ribozyme must be able to reshuffle the population from 1B towards the 2nt bulge fold, as 2B is speculated to be the active species in the exon ligation step. We thus started to search for triggers switching the equilibrium position in D6 (Figure 3A).
Two magnesium (II) ions are essential to form the active site catalytic triad in D5 and are involved in exon ligation catalysis (15). Thus, we speculated that the addition of magnesium (II) ions might also influence the fold population distribution in D6. Very strikingly, we saw a strong fold shifting effect upon the addition of magnesium (II) chloride. The equilibrium position change was reflected in the intensity change of the G16 N1H resonance in the imino proton spectrum (Figure 3C) and also in the 1H–13C-HMQC spectra of an 8–13C A4 labelled D56 RNA (Figure 3D). By using peak integration of the 1H8–13C8-correlation signal of A4 we found a decrease of fold 1B from 60% to 30% and the corresponding increase from 40% to 70% for 2B (Supplementary Table S2). The equilibrium reshuffling process reached a steady state at 20 eq. of Mg2+-ions (15 mM) with respect to the strand molarity (0.75 mM) of the D56 RNA. As the physiological Mg2+-ion concentration can be up to 1 mM in cells (31), 20 equivalents of magnesium (II) ions would correspond to a plausible group II intron RNA strand concentration of about 50 μM in a cellular environment. We also probed potential magnesium (II) ion binding sites by running an imino NOESY experiment in the presence of hexamminecobalt (III) chloride, a mimic for hydrated magnesium ions (Supplementary Figure S8). Two binding sites could be identified—a strong NOE to the G6 H1 in fold 2B points towards specific magnesium binding in the central G–C rich tract of 2B and a low-intensity NOE to U6 H3 indicates weak and unspecific cation binding in the UUCG tetraloop. The magnesium ion induced switching is fully reversible as the addition of 24 eq. of ethylenediaminetetraacetic acid (EDTA), a chelating agent for magnesium (II) ions,) with respect to the D56 strand molarity led to a change of the fold equilibrium with an 1B-population of 75%.
Next the folding behaviour of a branched D6 RNA was investigated to probe the influence of the first step – the branching reaction. Using solid phase RNA synthesis an 8–13C-A4 modified 27nt Oc19 D6 RNA was synthesized and relying on a 2′-photolabile protected adenosine building block and reverse amidites an RNA tetramer was attached to the 2′-OH of the branch-site adenosine A23 (Figure 3B, Supplementary Scheme S1, Figure S9) (32). Then, the 1B/2B-fold distribution was estimated from the imino proton region of the NMR spectrum again focusing on the resolved G16 N1H resonances and from the 1H–13C-correlation of the 8–13C-A4 in the branched RNA (Figure 3E and F). In the branched RNA, the fold equilibrium is strongly shifted toward the 2B conformation with a population of approximately 75%, which upon the addition of 25 eq. of magnesium (II) ions was almost exclusively pushed towards the 2nt-bulge state (90%). The preference very likely results from the unfavorable steric arrangement in 1B by the attachment of additional nucleotides at the 2′-hydroxyl of A23. The branching puts a high steric pressure at the bulge site in the 1nt-bulge conformation. This steric pressure is counteracted by switching into fold 2B as in this state a structural relief due to the higher conformational flexibility of the 2nt bulge is possible. This leads to a better accommodation of the branched RNA structure within the group II intron RNA and also prepares the ribozyme for the subsequent exon ligation step (Figure 3B). The branching induced equilibrium shift is in accordance with the assumption that 1B is the active conformation in the first branching step. After the formation of the branched lariat, 1B gets unfavored and the population of the state 2B increases up to 90% setting the stage for the subsequent exon-ligation reaction.
Sugar pucker of the branch-site adenosine A23 in the 1B and 2B
To get further insights into the function of the bulge conformation in the branching reaction the sugar pucker of A23 in both states was investigated. First, the D56 RNA construct was trapped in either one of the two bulge conformations by nucleotide mutations. The G6C and C21G mutations lead to a strong preference of 1B as in the competing 2B fold a C6–C20 and a G5–G21 mismatch will occur. The NMR data is fully in support of a switching towards 1B (Figure 4A, Supplementary Figure S10). Similarly, by the introduction of the G6C/C20G mutations a C6–C19 and a G7–G20 mismatch are introduced in 1B and 2B is now exclusively populated (Figure 4D, Supplementary Figure S10).
By using the 13C version of a H(N)HA-NMR experiment, (30,33) here termed H1′(C1′)H2′ experiment, the sugar puckers of 13C1′-labeled U12 and 13C1′-labelled A23 in the 1B and 2B mutant were addressed. The U12 residue served as an internal standard as in the cUUCGg tetraloop the second loop uridine is known to populate the C2′-endo conformation (34,35). The nmrpipe nlinLS peak fitting procedure was used to determine the peak intensities of the diagonal and cross peaks and the 3JH1’H2’ coupling constants of A23 and U12 in the 1B and 2B fold were obtained (29).
First, in 1B for U12 a scalar coupling constant of 8.8 Hz was found in-line with a high population of the C2′-endo sugar pucker, for A23–1B a coupling constant of 4.6 Hz was determined. Assuming a simplified model with a purely south/north sugar pucker equilibrium this would correspond to 46% C2′-endo sugar pucker of A23 in the 1B fold (Figure 4B and C) (36,37). In stark contrast, for the 2B G6C/C20G mutant no cross-peak was observed in the H1′(C1′)H2′ experiment for A23–2B and thus the 3JH1’H2’ coupling constant was not determinable (Figure 4D–F). This confirms an almost exclusive C3′-endo conformation of A23 in the 2nt bulge conformation. For U12, however, a cross-peak was found and a coupling constant of ca. 9 Hz was again confirmed.
Our data supports a functional sugar pucker switching between 1B and 2B. In fold 1B the A23 C2′-endo sugar pucker is the internal transesterification active conformation allowing step one—the branching reaction—in group II intron catalysis. This is in perfect agreement with a recent NMR and computational work, which showed that the ribose south conformation has a lower activation energy in an RNA cleavage step via the internal transesterification reaction (38). Once step one is accomplished the lariat structure is put in a resting state by switching into the north C3′-endo conformation and the group II intron can prepare for the exon ligation step.
Various D6 constructs display structural heterogeneity on various time scales
Six group II intron domain 6 constructs from various organisms were tested for the occurrence of the secondary structure equilibrium. Thereby, we wanted to rule out that the secondary structure equilibrium observed in the Oceanobacillus iheyenis/Azotobacter vinelandii D6 chimera is an artefact originating from this very special sequence context. In detail, we investigated the folding behaviour of the Oceanobacillus iheyensis group II intron truncated domain 6, optimized for the X-ray structure determination by Pyle and co-workers, and compared it to the wildtype sequence (12). For this truncated 20nt D6 RNA with an 8–13C-labelled branch site adenosine we found a single 1H–13C correlation (Supplementary Figure S11A and B) and also the imino proton part of the 1H spectrum indicated a single fold. Strikingly, in the wildtype 35nt OcI D6 sequence we found again two H-C correlations for the branch site adenosine with an 8–13C-label and also low intensity imino proton signals pointed towards an additional fold. Integration of the HMQC correlation peaks gave a fold ratio of 7/3 similar to the fold distribution we found for the Oc19 D56 RNA. Using the MC-sym secondary structure prediction algorithm for the truncated D6 sequence a single fold was suggested, whereas for the 35nt wt construct two main folds with similar free energies were predicted in line with our experimental data (Supplementary Figure S11C and D) (39). We then incorporated the branch-site 8–13C-A label into four other D6 RNAs belonging to various classes, like the group IIC class and the bacterial group B and E class. For all D6 RNAs we observed structural heterogeneity either on the slow chemical shift time scale or as for the Gracilibacillus group IIC D6 on the intermediate exchange regime (Supplementary Figure S12). Our results suggest that the structural heterogeneity of the branch site adenosine in D6 is preserved in various group II intron classes. This points toward true functional dynamics, which is important for the catalytic mechanism of the ribozyme to switch between sugar puckers and conformations between the branching and exon ligation step, respectively.
DISCUSSION
Here, by characterizing secondary structure dynamics and a functional sugar pucker switching of the group II intron D6, a novel aspect in ribozyme catalysis was identified. Our data revealed the co-existence of two secondary structures with either a 1nt or a 2nt bulge in the 61nt Oc19 D56 RNA. This conformational heterogeneity of D6 leads to a sugar pucker change of the branch site adenosine from the C2-’ to C3′-endo state, thereby modulating the transesterification activity. The findings suggest that the group II intron ribozyme utilizes a conformational selection mechanism to vary the population of reaction competent and incompetent states to orchestrate the various steps—branching and exon ligation—of group II intron catalysis. The secondary structure equilibrium is further proposed to be a key aspect for the reversibility in group II intron splicing. The findings of our work are summarized in a revised schematic representation of the branching pathway integrating the secondary structure and sugar pucker equilibrium in the splicing process (Figure 5).
Using solution NMR spectroscopy, we were able to unambiguously determine the slow exchange process of the two D6 secondary structures. Further, residue- and atom-specific 15N stable isotope labeling of the D56 RNA allowed to detect fold-specific Watson–Crick base pairs of both states, thus confirming the two secondary structure predictions.
The application of 13C longitudinal exchange NMR experiments revealed a hierarchical kinetic network with the fastest exchange rates observed at the bulge sites and a steady decrease of the folding rates towards the tetraloop. The kinetic data shed light on the forward folding pathway with an initial base pairing rearrangement process at the 1nt-bulge site of 1B, followed by a sequential base pairing register change in the G-C rich central tract culminating in fold 2B. The backward pathway again starts with the base pair switching in the labile 2nt-bulge sequence part, followed by the G-C base pair rearrangement in the central tract and closed by the formation of the symmetrical C-C bulge of 1B.
We then addressed the question how the secondary structure equilibrium of D6 can be modulated by external cues. A very strong equilibrium shift effect could be observed upon the addition of magnesium (II) ions. The added divalent cations lead to a preference of 2B, thus preparing the group II intron for the second exon ligation step. Magnesium (II) ions are essential co-factors for this step as catalytically relevant Mg2+ ions are found in D5—the catalytic centre of the second step (4). This suggests that by an increase of the magnesium ion concentration the group II intron switches its activity towards the exon ligation step. By establishing a synthetic access to a branched RNA mimic the influence of additional nucleotides attached to the branch site adenosine 2′-oxygen on the secondary structure equilibrium was probed. The branching reaction product favors the 2nt-bulge, which gets even more pronounced in the presence of magnesium (II) ions where the 2B fold is populated at 90%. This equilibrium shift is again in perfect accordance with the assumption that 1B is the active conformation in the first step, followed by switching to 2B by branching to facilitate step two. We also determined the sugar pucker of the branch site adenosine in both the 1B and 2B state. We found a switching from approximately 50% C2′-endo in 1B to an exclusive C3′-endo conformation in 2B. In 1B the adenosine C2′-endo sugar pucker population is the active conformation allowing the branching reaction, as this ribose south state has a lower activation energy in an RNA cleavage step via the internal transesterification reaction (38). Then, the branched RNA structure is put in a resting state by switching into the north conformation and the exon ligation can occur. Our data gives a molecular level explanation for the results of earlier conducted splicing assays on mutants favoring one or the other fold/sugar pucker and thus activating either the branching or exon ligation step (16).
The structural heterogeneity could be recapitulated in five other D6 RNA constructs from various classes suggesting that we indeed observe functional secondary structure and sugar pucker dynamics in D6 with an important role in catalysis.
Taken together, our findings strongly suggest that conformational dynamics is a crucial aspect in group II intron self-splicing. So far, the conformational selection concept was mainly discussed in context of biomolecular recognition and in protein catalysis but not as a key aspect in ribozyme activity (40–49). The dynamic nature of D6 leads to a tuneable abundance of the reaction competent state of the branch site adenosine by re-shuffling the equilibrium populations by specific triggers. Thereby, the ribozyme can move along the reaction coordinate in both directions, thus giving a rational for the reversibility in group II intron splicing. Although we were able to identify some of the triggers to switch the fold/sugar pucker equilibrium within D6, such as magnesium (II) ions or lariat formation, it is very tempting to speculate that another very important group II intron cofactor—the group II intron protein component—could dynamically modulate the secondary structure and the sugar pucker equilibrium in D6 as it was recently shown that the maturase protein is important for the active site substrate exchange (15,50,51). Furthermore, transient RNA-RNA contacts such as the π–π’ or the η–η’ interaction between D2 and D6 could regulate the populations of the secondary structure equilibrium to switch on and off functional states in splicing (10).
Supplementary Material
ACKNOWLEDGEMENTS
CK thanks Ronald Micura (University of Innsbruck) and Hashim Al-Hashimi (Duke University) for intensive discussions and critical reading of the manuscript.
Notes
Present address: Michael Andreas Juen, Roche Diagnostics GmbH, Nonnenwald 2, 82377 Penzberg, Germany.
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
FUNDING
Austrian Science Fund (FWF) [P28725 and P30370 to C.K.]; Austrian Research Promotion Agency FFG [West Austrian BioNMR, 858017]. Funding for open access charge: Austrian Science Fund.
Conflict of interest statement. None declared.
REFERENCES
- 1. Pyle A.M. Group II intron self-splicing. Annu. Rev. Biophys. 2016; 45:183–205. [DOI] [PubMed] [Google Scholar]
- 2. Belfort M., Lambowitz A.M.. Group II intron RNPs and reverse transcriptases: From retroelements to research tools. Cold Spring Harbor Perspect. Biol. 2019; 11:a032375. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Marcia M., Pyle A.M.. Visualizing Group II intron catalysis through the stages of splicing. Cell. 2012; 151:497–507. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Chan R.T., Peters J.K., Robart A.R., Wiryaman T., Rajashankar K.R., Toor N.. Structural basis for the second step of group II intron splicing. Nat. Commun. 2018; 9:4676. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Chan R.T., Robart A.R., Rajashankar K.R., Pyle A.M., Toor N.. Crystal structure of a group II intron in the pre-catalytic state. Nat. Struct. Mol. Biol. 2012; 19:555. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Costa M., Michel F., Westhof E.. A three-dimensional perspective on exon binding by a group II self-splicing intron. EMBO J. 2000; 19:5007–5018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Keating K.S., Toor N., Perlman P.S., Pyle A.M.. A structural analysis of the group II intron active site and implications for the spliceosome. RNA. 2010; 16:1–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Michel F., Costa M., Westhof E.. The ribozyme core of group II introns: a structure in want of partners. Trends Biochem. Sci. 2009; 34:189–199. [DOI] [PubMed] [Google Scholar]
- 9. Peters J.K., Toor N.. Group II intron lariat: Structural insights into the spliceosome. RNA Biol. 2015; 12:913–917. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Robart A.R., Chan R.T., Peters J.K., Rajashankar K.R., Toor N.. Crystal structure of a eukaryotic group II intron lariat. Nature. 2014; 514:193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Toor N., Keating K.S., Fedorova O., Rajashankar K., Wang J., Pyle A.M.. Tertiary architecture of the Oceanobacillus iheyensis group II intron. RNA. 2010; 16:57–69. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Toor N., Keating K.S., Taylor S.D., Pyle A.M.. Crystal structure of a self-spliced group II intron. Science. 2008; 320:77. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Toor N., Rajashankar K., Keating K.S., Pyle A.M.. Structural basis for exon recognition by a group II intron. Nat. Struct. Mol. Biol. 2008; 15:1221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Wiryaman T., Toor N.. Structure determination of group II introns. Methods. 2017; 125:10–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Zhao C., Pyle A.M.. Structural Insights into the Mechanism of Group II Intron Splicing. Trends Biochem. Sci. 2017; 42:470–482. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Costa M., Walbott H., Monachello D., Westhof E., Michel F.. Crystal structures of a group II intron lariat primed for reverse splicing. Science. 2016; 354:aaf9258. [DOI] [PubMed] [Google Scholar]
- 17. Monachello D., Michel F., Costa M.. Activating the branch-forming splicing pathway by reengineering the ribozyme component of a natural group II intron. RNA. 2016; 22:443–455. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Michel F., Kazuhiko U., Haruo O.. Comparative and functional anatomy of group II catalytic introns — a review. Gene. 1989; 82:5–30. [DOI] [PubMed] [Google Scholar]
- 19. Candales M.A., Duong A., Hood K.S., Li T., Neufeld R.A.E., Sun R., McNeil B.A., Wu L., Jarding A.M., Zimmerly S.. Database for bacterial group II introns. Nucleic Acids Res. 2011; 40:D187–D190. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Zhang L., Doudna J.A.. Structural insights into group II intron catalysis and branch-site selection. Science. 2002; 295:2084. [DOI] [PubMed] [Google Scholar]
- 21. Schlatterer J.C., Crayton S.H., Greenbaum N.L.. Conformation of the group II intron branch site in solution. J. Am. Chem. Soc. 2006; 128:3866–3867. [DOI] [PubMed] [Google Scholar]
- 22. Erat M.C., Zerbe O., Fox T., Sigel R.K.O.. Solution structure of domain 6 from a self-splicing group II intron ribozyme: a Mg2+ binding site is located close to the stacked branch adenosine. ChemBioChem. 2007; 8:306–314. [DOI] [PubMed] [Google Scholar]
- 23. Erat M.C., Besic E., Oberhuber M., Johannsen S., Sigel R.K.O.. Specific phosphorothioate substitution within domain 6 of a group II intron ribozyme leads to changes in local structure and metal ion binding. JBIC J. Biol. Inorg. Chem. 2018; 23:167–177. [DOI] [PubMed] [Google Scholar]
- 24. Juen M.A., Wunderlich C.H., Nußbaumer F., Tollinger M., Kontaxis G., Konrat R., Hansen D.F., Kreutz C.. Excited states of nucleic acids probed by proton relaxation dispersion NMR spectroscopy. Angew. Chem. Int. Ed. 2016; 55:12008–12012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Neuner S., Santner T., Kreutz C., Micura R.. The “speedy” synthesis of atom-specific 15N imino/amido-labeled RNA. Chemistry. 2015; 21:11634–11643. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Wunderlich C.H., Spitzer R., Santner T., Fauster K., Tollinger M., Kreutz C.. Synthesis of (6-13C)Pyrimidine Nucleotides as Spin-Labels for RNA Dynamics. J. Am. Chem. Soc. 2012; 134:7558–7569. [DOI] [PubMed] [Google Scholar]
- 27. Wenter P., Reymond L., Auweter S.D., Allain F.H.T., Pitsch S.. Short, synthetic and selectively 13C-labeled RNA sequences for the NMR structure determination of protein–RNA complexes. Nucleic Acids Res. 2006; 34:e79. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Kloiber K., Spitzer R., Grutsch S., Kreutz C., Tollinger M.. Longitudinal exchange: an alternative strategy towards quantification of dynamics parameters in ZZ exchange spectroscopy. J. Biomol. NMR. 2011; 51:123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Delaglio F., Grzesiek S., Vuister G.W., Zhu G., Pfeifer J., Bax A.. NMRPipe: a multidimensional spectral processing system based on UNIX pipes. J. Biomol. NMR. 1995; 6:277–293. [DOI] [PubMed] [Google Scholar]
- 30. Vuister G.W., Bax A.. Quantitative J correlation: a new approach for measuring homonuclear three-bond J(HNH.alpha.) coupling constants in 15N-enriched proteins. J. Am. Chem. Soc. 1993; 115:7772–7777. [Google Scholar]
- 31. Romani A.M.P. Cellular magnesium homeostasis. Arch. Biochem. Biophys. 2011; 512:1–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Katolik A., Johnsson R., Montemayor E., Lackey J.G., Hart P.J., Damha M.J.. Regiospecific Solid-Phase synthesis of branched oligoribonucleotides that mimic intronic lariat RNA intermediates. J. Org. Chem. 2014; 79:963–975. [DOI] [PubMed] [Google Scholar]
- 33. Kuboniwa H., Grzesiek S., Delaglio F., Bax A.. Measurement of HN-Hα J couplings in calcium-free calmodulin using new 2D and 3D water-flip-back methods. J. Biomol. NMR. 1994; 4:871–878. [DOI] [PubMed] [Google Scholar]
- 34. D’Ascenzo L., Leonarski F., Vicens Q., Auffinger P.. Revisiting GNRA and UNCG folds: U-turns versus Z-turns in RNA hairpin loops. RNA. 2017; 23:259–269. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Clay M.C., Ganser L.R., Merriman D.K., Al-Hashimi H.M.. Resolving sugar puckers in RNA excited states exposes slow modes of repuckering dynamics. Nucleic Acids Res. 2017; 45:e134. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Haziri A.I., Leumann C.J.. Synthesis and pairing properties of oligodeoxynucleotides containing Bicyclo-RNA and Bicyclo-ANA modifications. J. Org. Chem. 2012; 77:5861–5869. [DOI] [PubMed] [Google Scholar]
- 37. Altona C., Sundaralingam M.. Conformational analysis of the sugar ring in nucleosides and nucleotides. Improved method for the interpretation of proton magnetic resonance coupling constants. J. Am. Chem. Soc. 1973; 95:2333–2344. [DOI] [PubMed] [Google Scholar]
- 38. Guo F., Yue Z., Trajkovski M., Zhou X., Cao D., Li Q., Wang B., Wen X., Plavec J., Peng Q. et al.. Effect of ribose conformation on RNA cleavage via internal transesterification. J. Am. Chem. Soc. 2018; 140:11893–11897. [DOI] [PubMed] [Google Scholar]
- 39. Parisien M., Major F.. The MC-Fold and MC-Sym pipeline infers RNA structure from sequence data. Nature. 2008; 452:51. [DOI] [PubMed] [Google Scholar]
- 40. Vogt A.D., Di Cera E.. Conformational selection or induced fit? A critical appraisal of the kinetic mechanism. Biochemistry. 2012; 51:5894–5902. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Boehr D.D., Nussinov R., Wright P.E.. The role of dynamic conformational ensembles in biomolecular recognition. Nat. Chem. Biol. 2009; 5:789. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Haller A., Rieder U., Aigner M., Blanchard S.C., Micura R.. Conformational capture of the SAM-II riboswitch. Nat. Chem. Biol. 2011; 7:393. [DOI] [PubMed] [Google Scholar]
- 43. Heppell B., Blouin S., Dussault A.-M., Mulhbacher J., Ennifar E., Penedo J.C., Lafontaine D.A.. Molecular insights into the ligand-controlled organization of the SAM-I riboswitch. Nat. Chem. Biol. 2011; 7:384. [DOI] [PubMed] [Google Scholar]
- 44. Moschen T., Wunderlich C.H., Spitzer R., Levic J., Micura R., Tollinger M., Kreutz C.. Ligand-detected relaxation dispersion NMR spectroscopy: dynamics of preQ1–RNA binding. Angew. Chem. Int. Ed. 2015; 54:560–563. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Boehr D.D., McElheny D., Dyson H.J., Wright P.E.. Millisecond timescale fluctuations in dihydrofolate reductase are exquisitely sensitive to the bound ligands. Proc. Natl. Acad. Sci. U.S.A. 2010; 107:1373. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Oyen D., Fenwick R.B., Aoto P.C., Stanfield R.L., Wilson I.A., Dyson H.J., Wright P.E.. Defining the structural basis for allosteric product release from E. coli dihydrofolate reductase using NMR relaxation dispersion. J. Am. Chem. Soc. 2017; 139:11233–11240. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Henzler-Wildman K.A., Lei M., Thai V., Kerns S.J., Karplus M., Kern D.. A hierarchy of timescales in protein dynamics is linked to enzyme catalysis. Nature. 2007; 450:913. [DOI] [PubMed] [Google Scholar]
- 48. Henzler-Wildman K.A., Thai V., Lei M., Ott M., Wolf-Watz M., Fenn T., Pozharski E., Wilson M.A., Petsko G.A., Karplus M. et al.. Intrinsic motions along an enzymatic reaction trajectory. Nature. 2007; 450:838. [DOI] [PubMed] [Google Scholar]
- 49. Villali J., Kern D.. Choreographing an enzyme's dance. Curr. Opin. Chem. Biol. 2010; 14:636–643. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Haack D.B., Yan X., Zhang C., Hingey J., Lyumkis D., Baker T.S., Toor N.. Cryo-EM Structures of a Group II intron reverse splicing into DNA. Cell. 2019; 178:612–623. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Zhao C., Pyle A.M.. Crystal structures of a group II intron maturase reveal a missing link in spliceosome evolution. Nat. Struct. Mol. Biol. 2016; 23:558. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.