Abstract
Knowledge of RNA structure, either in isolation or in complex, is fundamental to understand the mechanism of cellular processes. Solid‐state NMR (ssNMR) is applicable to high molecular‐weight complexes and does not require crystallization; thus, it is well‐suited to study RNA as part of large multicomponent assemblies. Recently, we solved the first structures of both RNA and an RNA‐protein complex by ssNMR using conventional 13C‐ and 15N‐detection. This approach is limited by the severe overlap of the RNA peaks together with the low sensitivity of multidimensional experiments. Here, we overcome the limitations in sensitivity and resolution by using 1H‐detection at fast MAS rates. We develop experiments that allow the identification of complete nucleobase spin‐systems together with their site‐specific base pair pattern using sub‐milligram quantities of one uniformly labelled RNA sample. These experiments provide rapid access to RNA secondary structure by ssNMR in protein‐RNA complexes of any size.
Keywords: 1H detection, base-pair pattern, RNA structure, RNA-protein complex, solid-state NMR spectroscopy
We developed solid‐state NMR experiments to detect base pair patterns in RNA. The experiments use high‐sensitivity 1H‐detection and allow site‐specific assignment of base pairs to individual nucleotide spin‐systems. In addition, the experiments yield full assignment of nucleobase resonances. The methodology is applicable to RNA embedded in particles of any size and provides rapid access to RNA secondary structure in biological complexes.
Introduction
Ribonucleic acids (RNA) perform a multitude of cellular functions, including the regulation of gene expression. The structures of the RNAs involved in these functions, and of their complexes with proteins, shed light on the molecular mechanisms of the corresponding cellular processes and facilitate intervention in a disease context. Nuclear magnetic resonance (NMR) spectroscopy in solution plays an important role in studying RNA structure, as it can handle well the inherent flexibility exhibited by many RNAs.[ 1 , 2 ] However, due to the strong dependency of the resonance linewidths on the rate of rotational diffusion, solution‐state NMR can be applied straightforwardly only to molecules of moderate size, typically up to molecular‐weights (MW) of ≈100 and 40 kDa for proteins and nucleic acids, respectively. Even for RNAs larger than 20 kDa, tailored labelling schemes, such as segmental or nucleotide specific 13C/15N/2H‐labelling, are required to alleviate spectral overlap and reduce line‐broadening.[ 3 , 4 , 5 ] Recently, the Summers laboratory [6] has used 2H‐edited NMR to probe the three‐dimensional structure of the 42 kDa HIV‐1 Cap1G‐LTPUA RNA; the study required a total of 16 nucleotide‐type specifically labelled samples, demonstrating that the application of solution‐state NMR to RNAs of MW ≥40 kDa remains challenging.
In solid‐state NMR (ssNMR), the linewidth does not increase with the molecular size. As a consequence, the technique can be applied to particles of any size, provided that the sensitivity is sufficiently high and that peak‐crowding does not become a limiting factor. For medium‐sized RNAs embedded in large complexes, peak‐crowding is rarely a problem, as typically only the RNA of interest is visible in isotope‐edited NMR spectra.[ 7 , 8 ] However, high‐MW particles lead to low sensitivity, simply because the number of copies that can be packed in a given volume is inversely proportional to the particle size. Sensitivity has been improved by optimizing both the hardware and the experimental design, particularly with the recent development of ultra‐fast MAS (magic‐angle spinning) probe‐heads that have allowed the advent of 1H‐detection in biomolecular solid‐state NMR.[ 9 , 10 , 11 , 12 , 13 , 14 ]
Over the last decade, we have developed experiments to assign 13C and 15N resonances of RNA bound to proteins. We demonstrated the methodology using the 8.5‐kDa 26mer box C/D RNA in complex with the ∼13.5 kDa protein L7Ae from Pyrococcus furiosus (Pf) and determined its structure by ssNMR (Figure 1 a,b).[ 7 , 8 , 15 , 16 , 17 ] In this work, we used 15N‐ and 13C‐detection, which limited the sensitivity of the experiments and prevented acquisition of three‐dimensional spectra. In addition, at the achievable MAS rate of 20 kHz, the 15N‐ and 13C‐linewidths were such as to lead to significant peak‐overlap, especially in the regularly structured A‐form helical regions. As a consequence, solving the structure of the RNA required eight different samples with single and double nucleotide‐type selective labelling.
1H‐detected ssNMR in combination with high‐speed MAS combines the high sensitivity of 1H detection with sharp and MW‐independent linewidths. To date, 1H‐detection has been applied mostly to proteins,[ 9 , 10 , 11 , 12 , 13 ] with 1H‐detected ssNMR of RNA remaining somewhat of a challenge, due to the low proton density in the nucleobases and the poor 1H chemical‐shift dispersion in the ribose ring. Nonetheless, the improved linewidth and sensitivity of 1H‐detected ssNMR at a MAS rate of 100 kHz have allowed us to measure a range of three‐ and four‐dimensional spectra of uniformly 13C,15N‐labelled 26mer box C/D RNA in complex with L7Ae, [14] such as HCN‐like[ 18 , 19 , 20 ] experiments.
NMR spectroscopy, both in solution‐ and in solid‐state, allows the rapid identification of RNA base pair patterns.[ 2 , 8 , 21 , 22 ] This information, even in the absence of site‐specific assignments, provides accurate characterization of the RNA secondary structure, which is often sufficient to predict the function, the processing pathway and the stability of the RNA.[ 23 , 24 , 25 ] Recently, 1H‐detected ssNMR at 40 kHz MAS was used to observe inter‐nucleotide Watson–Crick hydrogen bonds in the crystallized 23mer DIS‐HIV‐1. Unfortunately, the spectra lacked any site‐specific resolution. [26]
Here we present two‐ and three‐dimensional (2D and 3D) 1H‐detected ssNMR experiments that allow rapid, nucleotide‐specific detection of both Watson–Crick (G:C and A:U) and non‐Watson–Crick (G:A and U:U) base pairs within hours (2D experiments) to days (3D experiments) at 100 kHz MAS. Furthermore, we describe 1H‐detected ssNMR experiments that yield the complete resonance assignment of the nucleobases. This suite of solely 1H‐detected ssNMR experiments is sufficient to accurately define the RNA secondary structure. Overall, the experimental approach described here paves the way to complete structure determination of RNA by ssNMR using only uniformly labelled samples.
Results and Discussion
Detection of Inter‐Strand Hydrogen Bonds
In ssNMR, Watson–Crick (WC) base pairs can be identified by the spatial proximity (2.6–2.9 Å) of the two nitrogen atoms involved in NH⋅⋅⋅N hydrogen bonds (U N3⋅⋅⋅N1 A; G N1⋅⋅⋅N3 C), using 15N‐15N RFDR (radio‐frequency‐driven recoupling) correlations.[ 8 , 21 ] However, this strategy is not suited to observing several non‐Watson–Crick base pairs (e.g. 2‐carbonyl‐N3, 4‐carbonyl‐N3 U:U and trans‐Hoogsteen/sugar‐edge G:A [27] base pairs), which either do not feature NH⋅⋅⋅N hydrogen bonds (U:U) or have longer distances between the donor and acceptor nitrogen atoms (G:A). More conveniently, NHHN and NHHC[ 8 , 22 , 28 ] experiments correlate 15N/13C spins via magnetization transfer through their bound hydrogen atoms, utilizing proton spin diffusion (PSD) between hydrogen atoms that are close in space. In essence, these experiments rely on through‐space 1H‐1H correlations, similar to NOESY in solution‐state NMR. [29] Because at fast MAS rates (≥40 kHz) spin diffusion‐based recoupling schemes are inefficient, here we instead used the RFDR scheme to exploit 1H‐1H through‐space correlations and measure cross‐strand hydrogen bonds at 100 kHz MAS rate.[ 30 , 31 ]
We developed two different 3D 1H,1H RFDR correlation experiments that allow the identification of Watson–Crick (G:C, A:U) and non‐Watson–Crick (G:A, U:U) base pairs and also facilitate nucleobase assignment. The 2D versions of these experiments (2D (H)N(HHC)H and 2D (H)N(HHN)H) at 100 kHz MAS are similar to the NHHC and NHHN[ 8 , 22 , 28 ] experiments at low MAS rates (≤20 kHz), respectively, but the high sensitivity of proton detection allowed reduced experiment times of 12–15‐hours, compared to the 60‐hour experiment time needed for each of the NHHN/NHHC experiments. [32] Moreover, the proton spectral dimension yields improved separation of the correlations in the resulting spectra. In these experiments, the magnetization is first transferred using a short cross polarization (CP) period from 1H to 15N, where it evolves during t 1, before being transferred back to 1H. 1H‐1H inter‐strand transfer across the base pair is then achieved by 0.48‐ms (2D (H)N(HHN)H) or 0.4‐ms (2D (H)N(HHC)H) 1H‐1H RFDR mixing. The magnetization is then transferred to 15N (2D (H)N(HHN)H) or 13C (2D (H)N(HHC)H) by CP, after which the frequency of the heteronucleus can be encoded during t 2 in the 3D version of the experiments. A final CP transfer brings the magnetization back to 1H for direct detection (Figure 1 c–f and Figure S1a‐b).
Both WC A:U and non‐Watson–Crick G:A (trans‐Hoogsteen/sugar‐edge [27] ) base pairs can be detected with the 2D (H)N(HHC)H experiment. In trans‐Hoogsteen/sugar‐edge G:A base pairs, the G‐H1′ is close to the A‐H6, leading to the corresponding A‐N6–G‐H1′ peak (Figure 1 c,g). In A:U base pairs, the proximity of A‐H2 to the U‐H3 (≈2.8 Å) allows their straightforward identification through the U‐N3–A‐H2 correlations (Figure 1 d,h). As expected from the previously determined secondary structure, [8] we observed two G:A and one A:U base pair in the 26mer box C/D RNA (Figure 1 g,h).
In G:C base pairs, the proximity of G‐H1 to the C‐H4 protons (≈2.3–2.7 Å) allows for efficient 1H‐1H magnetization transfer (Figure 1 e). Using the 2D (H)N(HHN)H experiment, we observed three out of the expected four G:C base pairs, identified by their C‐N4–G‐H1 and G‐N1–C‐H41/H42 cross‐peaks (Figure 1 i). In this experiment, the close proximity of intra‐residue G‐H1 and G‐H2 protons (≈2.1–2.3 Å) also results in intra‐residue G‐N1–H2 and G‐N2–H1 correlations. While inter‐strand C‐N4–G‐H1 correlations are well separated from the intra‐residue G‐N2–H1 correlations, the region of inter‐strand G‐N1–C‐H41/H42 cross‐peaks overlaps with that of the intra‐residue G‐N1–H2 peaks, due to similar C‐H41/H42 and G‐H2 chemical shifts (Figure 1 i).
The relatively short distance between the two U‐H3 imino protons in U:U base pairs (≈2.9 Å) should allow their detection in the (H)N(HHN)H experiment (Figure 1 f). However, the presence of the predicted U3:U23 2‐carbonyl‐N3, 4‐carbonyl‐N3 base pair [8] could not be confirmed in the 2D spectrum, due to the near‐identical chemical shifts of U3‐N3 and U23‐N3. The presence of this base pair was confirmed later (Figure 5 e) in a 3D version of this experiment, which featured the evolution of both U‐H3 proton frequencies.
In conclusion, these two 2D experiments allowed the rapid and almost complete identification of the base pairing pattern in the 26mer box C/D RNA.
Assignment of Nucleobases
After initial assessment of the secondary structure of the 26mer box C/D RNA, we proceeded to the site‐specific assignment of nucleobase spins, and thus base pairs, by 1H‐detected ssNMR. Figure 2 shows 2D dipolar‐based 1H‐13C (Figure 2 a) and 1H‐15N (Figure 2 b) cross polarization CP‐HSQC spectra recorded for the nucleobases. With a few modifications, the experiment follows the scheme proposed by Rienstra and co‐workers.[ 33 , 34 , 35 ]
Our strategy for the complete assignment of RNA resonances in individual nucleotides by 1H‐detected ssNMR on uniformly 13C,15N‐labelled RNA comprises five steps. First, C1′‐H1′ resonances are correlated with C6‐H6 and C8‐H8 resonances in pyrimidines and purines, respectively. Second, ribose C‐H resonances (C2′‐H2′, C3′‐H3′, C4′‐H4′, C5′‐H5′ and ‐H5′′) are correlated to the C1′‐H1′ resonances. These two steps were described previously [14] and provided the assignment of 75 % of the ribose spin systems as well as their connection to the pyrimidine C6‐H6 and purine C8‐H8 resonances. Third, all nucleobase carbon resonances are correlated to the previously assigned C6‐H6 (pyrimidines) or C8‐H8 (purines) groups. In the fourth step, the 15N‐1H imino and amino resonances are correlated to the assigned carbons and C‐H groups. Finally, the resonances of the non‐protonated nitrogen atoms (N1, N3, N7 in purines and N3 in cytosine) are correlated to the assigned C‐H groups. Here, we present experiments to accomplish steps 3–5 as well as to assign cross‐strand hydrogen bonds. In the following spectra, for clarity, we annotate the spin systems according to their site‐specific assignment, which we accomplished in previous work.[ 7 , 8 , 14 ] The sequential assignment of RNA relies on through‐space 1H‐1H magnetization transfer, as explained previously; [8] experiments exploiting 1H‐detection to accomplish sequential assignment of RNA are currently in development in our laboratory. The experiments presented here are complementary, allowing the complete assignment of nucleobase spin systems and the identification of the base pair pattern with spin‐system resolution.
The 3D (H)CCH experiment (Figure S1d) accomplishes step 3. The experiment starts with a long‐range 1H‐13C CP transfer followed by evolution of 13C magnetization during t 1. A selective 13C REBURP pulse [36] with a bandwidth of ≈50 ppm centered at ω(13C)=160 ppm allows phase‐cycled cancellation of the signals of the ribose ring, whose carbon spins are not affected by the pulse. The following 8 ms‐long 13C‐13C RFDR mixing period transfers 13C magnetization to nearby 13C atoms, whose chemical shifts are recorded during t 2. We used the 13C‐13C RFDR mixing scheme[ 30 , 31 ] to transfer magnetization among the nucleobase carbon spins (C2, C4, C5, C6, C8) because PDSD [37] (proton‐driven spin‐diffusion) and DARR [38] (dipolar‐assisted rotational resonance) 13C‐13C transfer schemes fail at high spinning rates. Finally, the 13C magnetization is transferred to the directly attached 1H via a second short‐range CP and detected during t3 (Figure 3 a,j). This 3D (H)CCH experiment correlates all carbons of the nucleobase to either the C8‐H8 and C2‐H2 groups in purines, or the C6‐H6 and C5‐H5 groups in pyrimidines. For instance, for the 26mer box C/D RNA, the experiment provided unambiguous assignment of all five adenosine C2‐H2 groups in the well‐structured region (nucleotides 2–9 and 14–24) (Figure 2 a).
The fourth step of the assignment protocol is accomplished by the 3D (H)CNH experiment (Figure S1e). The experiment starts with a long‐range 1H‐13C CP transfer of 3 ms to transfer the 1H magnetization to non‐protonated carbon atoms, whose chemical shifts are recorded during t 1. Next, a 7‐ms 13C‐15N CP transfers the 13C magnetization to nitrogen atoms, whose chemical shifts are recorded in t 2. Here, a selective 15N refocusing pulse (bandwidth of ≈90 ppm), centered at either ω(15N)=154 ppm or ω(15N)=80 ppm after the t 2 period selects for either imino (G‐N1, U‐N3) or amino (A‐N6, G‐N2, C‐N4) nitrogen magnetization, respectively. Finally, the magnetization is transferred by CP from the nitrogen to the directly bound hydrogen for detection (Figure 3 k,l). The experiment yielded imino and amino‐selective CX–NX–HX correlations in only 35 and 60 hours, respectively.
The combination of the 3D (H)CCH and the two 3D (H)CNH experiments allows assignment of the imino and amino resonances to individual nucleotides, provided that the chemical shift of at least one of the correlated carbons is unique to that nucleotide. This condition is not always fulfilled, due to the narrow chemical shift dispersion of A‐C6, C‐C4 and G‐C2, which may hinder the nucleotide‐specific assignment of A, C and G amino groups. Nevertheless, the imino‐selective (H)CNH spectrum provided four U‐C2/C4–N3–H3 and six G‐C2/C6–N1–H1 correlations (Figure 3 b–g), allowing the unambiguous assignment of all four uridine imino resonances and six of the guanosine imino resonances present in the 1H‐15N CP‐HSQC spectrum (Figure 2 b, the peak of the nucleotide G24 is missing). In the amino‐selective (H)CNH spectrum, we detected four sets of cytidine C4–N4–H4 correlations, but the low chemical shift dispersion of the carbon C4 resonances hindered the assignment of the amino groups of the C2, C9 and C17 nucleotides (Figure 4 a), and only the amino group of C7 could be assigned unambiguously (Figure 3 i). Similarly, due to the low chemical shift dispersion of A‐C6 and G‐C2 spins, the adenosine and guanosine amino resonances were poorly resolved. The 3D (H)CNH spectrum yielded the assignment of the amino resonance of nucleotide A19 only, which has a unique carbon C6 chemical shift (156.5 ppm, Figure 3 h). The unambiguous assignment of the other adenosine amino groups was hindered by almost identical C6 chemical shifts, such as those of A15 and A18 (≈157.9–158.0 ppm, Figure 4 b) or A5 and A22 (≈157.3–157.4 ppm, Figure 4 c).
To overcome the problems imposed by the limited chemical shift dispersion described above, we developed 3D (H)N(C)CH and H(NC)CH experiments, with the aim of exploiting the better resolution of protonated carbons to resolve either the nitrogen or the hydrogen chemical shifts of imino and amino groups. The 3D (H)N(C)CH experiment (Figure S1f) starts with a 1H‐15N CP transfer of 1 ms, followed by evolution of 15N chemical shifts during t 1. The following 10 ms‐long 13C‐15N CP period transfers magnetization from the nitrogen to the directly attached carbon. The selective REBURP refocusing pulse (bandwidth of ≈50 ppm) centered at ω(13C)=160 ppm selects for nucleobase carbon magnetization, which is then transferred to all adjacent carbons, including the protonated ones, via a 14 ms‐long 13C‐13C RFDR mixing period. Finally, after evolution of the carbon chemical shift during t 2, the magnetization is transferred from the protonated carbon atoms to the directly attached hydrogen atoms for detection (Figure 4 a–c). In the (H)N(C)CH experiment, the frequencies of the imino and amino nitrogen atoms are correlated to the C5‐H5 or C6‐H6 groups in pyrimidines and to the C2‐H2 or C8‐H8 groups in purines (Figure 4 g–i). The poor resolution of the C8‐H8 peaks of guanosines makes the assignment of their imino and amino nitrogen atoms more difficult than for the other nucleobases, where the assignment is facilitated by the well‐resolved C2‐H2 (adenosines) and C5‐H5 (cytidine and uridine) groups.
Although this experiment has relatively low sensitivity, due to the long contact‐time of 10 ms for the 15N‐13C transfer, together with the low efficiency of 13C‐13C mixing for some spin systems, correlations of imino nitrogen atoms were observed for four uridines (U3‐, U8‐, U20‐, U23‐N3 to C5‐H5 groups and U3‐, U20‐N3 to C6‐H6 groups) and three guanosines (G4‐, G6‐, G21‐N1 to C8‐H8 groups), whereas correlations of amino nitrogen atoms were obtained for two cytidines (C7‐, C17‐N4 to C5‐H5 and C6‐H6 groups), four adenosines (A5‐, A18‐, A19‐, A22‐N6 to C2‐H2 groups and A5‐, A19‐, A22‐N6 to C8‐H8 groups) and three guanosines (G4‐, G6‐, G21‐N2 to C8‐H8 groups) after a measurement time of 80 hours. These correlations allowed us to resolve the ambiguity in the assignment of A15 and A18: while in the (H)CNH experiment, the two corresponding peaks at N6‐H6 chemical shifts of 82.6/7.1 ppm and 79.7/6.8 ppm cannot be attributed to one or other of A15 and A18 due to the degeneracy in the C6 chemical shift (Figure 4 e), the presence of a cross‐peak between the C2‐H2 of A18 with an N6 resonance at 79.7 ppm (Figure 4 h) unambiguously attributes the N6‐H6 group at 79.7/6.8 ppm to A18. Furthermore, the presence of cross‐peaks between an N6‐H6 group with a nitrogen chemical shift at 81.1 ppm and both C8‐H8 and C2‐H2 groups of both A5 and A22 revealed that the amino groups of A5 and A22 overlap (Figure 4 f,i). Finally, we observed N4–C5–H5 cross‐peaks for C7 and C17 (Figure 4 g); however, we find another experiment to be better suited for the assignment of cytidine amino groups (vide infra). By contrast, the sensitivity of the 3D H(N)(C)CH experiment, where the chemical shifts of 1H instead of 15N were evolved during t 1, remained very low after 62 hours of measurement time. This can be explained by the short coherence lifetimes of the imino and amino protons (imino protons T 2 ′≈4 ms, amino protons T 2 ′≈1 ms at 100 kHz MAS).
For the assignment of the cytidine amino groups, we used the 3D (H)N(HH)CH experiment (Figure S1a), which utilizes the efficient 1H‐1H transfer between the proximal H5 and H41/H42 hydrogens (≈2.4–2.7 Å) and provides the same N4–C5–H5 correlations as the 3D (H)N(C)CH experiment (Figure S2a). The efficiency of the 1H‐1H RFDR mixing (0.48 ms‐long) was superior to that of the 13C‐13C RFDR mixing, leading to a higher sensitivity of the 3D (H)N(HH)CH experiment compared to the 3D (H)N(C)CH experiment. The 3D (H)N(HH)CH experiment provided N4–C5–H5 correlations of the four cytidines C2, C7, C9, C17 (excluding C26 which is not visible in any ssNMR spectrum, Figure S2c‐f) and allowed unambiguous assignment of the amino resonances of nucleotides C2, C9 and C17 (Figure S2d‐f).
The 3D (H)N(C)CH experiment did not show any N2–C8–H8 cross‐peaks for G10, G14, G16 and G24. To assign the guanosine amino groups, we exploited the intra‐residue proximity of the H1 and H2 hydrogen atoms (≈2.1–2.3 Å) in a 3D (H)N(HH)NH experiment (Figure S1b and Figure S2b), which utilizes the same magnetization transfer pathway as the 2D (H)N(HHN)H experiment (Figure 1 i) but features an additional 15N evolution time before the final magnetization transfer to 1H for direct detection, thereby yielding N2–N1–H1 and N1–N2–H2 cross‐peaks. In the 3D (H)N(HH)NH spectrum we observed N2–N1–H1 cross‐peaks for the nucleotides G4, G6, G10, G14, G16 and G21 (Figure S2g‐j). Combining the information on N2 and C2 chemical shifts obtained from the 3D (H)N(HH)NH and amino‐selective (H)CNH experiments, we were able to unambiguously assign guanosine amino resonances for five out of the six guanosines in the structured region of the 26mer box C/D RNA (nucleotides 2–9 and 14–24) and of G10 in the apical loop (Figure S2g‐j). The N‐H resonances of G24 at the open end of the helix were not detectable in any of the spectra.
In total, using the combination of experiments presented in Figures 3–4 and Figure S2 we could assign all imino groups of the 19 structured nucleotides (2–10 and 14–24) with only G24 missing, as well as the amino groups of all cytidines, five out of six guanosines (excluding G24 in the helix end) and all adenosines.
In the final step, we assigned the remaining nitrogen resonances (N1, N3, N7 in purines) using a modified version of the base‐tuned 3D (H)NCH experiment developed previously. [14] We shifted the 15N carrier position from 160 to 190 ppm to allow effective manipulation of the purine N1, N3 and N7 resonances. In addition, after t 2, we applied a selective 13C refocusing pulse (bandwidth of 50 ppm) centered at ω(13C)=160 ppm to select for nucleobase 13C resonances. The experiment yields N7–C8–H8 and N9–C8–H8 correlations in purines, N1–C2–H2 and N3–C2–H2 correlations in adenosines and N1–C6–H6 correlations in pyrimidines (Figure S1g and Figure S3) and provided the assignment of all missing nitrogen atoms in purines in 56 hours of measurement time. Finally, to assign N3 resonances in cytidines involved in base pairs, we recorded a 2D 1H‐15N‐HSQC experiments with long‐range 1H‐15N/15N‐1H CP transfers of 8 ms, which allows magnetization transfer from guanosines H1s to cytidine N3s involved in WC base pairs (Figure S1c and Figure S4). However, in this experiment, C‐N3–G‐H1 correlations could be obtained only for the G6:C17 base pair (Figure S4b).
Assignment of Base Pairs
The two‐dimensional spectra of Figure 1 allow the assignment of base pairs to specific pairs of spin‐systems. The WC A:U base pair (Figure 1 h) can be assigned to the uridine spin‐system with U‐N3=163.3 ppm (U8) and to the adenosine spin‐system with A‐H2=7.4 ppm (A15). Similarly, the three WC G:C base pairs belong to three pairs of spin‐systems with G‐H1=13.0 ppm and C‐N4=98.8 ppm (G6:C17), G‐H1=13.3 ppm and C‐N4=98.2 ppm (G14:C9), G‐H1=13.4 ppm and C‐N4=99.4 ppm (G16:C7). Finally, the two A‐N6–G‐H1′ correlations specific for trans‐Hoogsteen/sugar‐edge G:A base pairs (Figure 1 g) involve one guanosine spin‐system with the unique H1′ chemical shift of 6.1 ppm (G21) and one other guanosine spin‐system with the non‐unique chemical shift of 5.7 ppm (G4 or G10). [14] The identity of the adenosine spin‐system remains ambiguous as the two adenosine spin‐systems involved in the G:A base pairs have almost identical N6 chemical shifts of 81.1–81.2 ppm (A5 and A22).
To be able to assign the base pairs to individual spin‐systems in larger RNAs with more spectral crowding, we tested 3D versions of the experiments of Figure 1. The 3D (H)N(HH)CH experiment, with an additional evolution time for 13C frequencies (as in Figure S2c‐f), was acquired with a 1H‐1H RFDR mixing time of 0.96 ms (compared to that of 0.48 ms used in the 2D spectrum of Figure 1). One U‐N3–A‐C2–H2 cross‐peak confirmed the U8:A15 base pair (Figure 5 a). In addition, the different C1′ chemical shifts of G4 and G10 allowed identification of G4 as the nucleotide involved in the second G:A base pair. Using the 1H‐1H RFDR mixing time of 0.96 ms, we observed additional cross‐peaks between nitrogen atoms corresponding to either G‐N2 or A‐N6 and the C8‐H8 groups of two adenosines at 139.9/7.7 ppm and 140.1/7.9 ppm (A5 and A22) (Figure 5 c). If these cross‐peaks originated from G‐N2, they would be indicative of G:A base pairs; alternatively, they could originate from A‐N6 through intra‐residue magnetization transfer. To verify the nature of these cross‐peaks we evaluated intra‐residue and inter‐strand A‐H6–H8 and G‐H2–A‐H8 distances in the ssNMR‐derived structure of this RNA [15] as well as in a few other known RNA structures that contain trans‐Hoogsteen/sugar‐edge G:A base pairs[ 39 , 40 ] (Table S4). Inter‐strand G‐H2–A‐H8 distances (≈3.1 Å) are significantly shorter than intra‐residue A‐H6–H8 distances (≈5.0 Å); furthermore, we did not observe any intra‐residue A‐H6‐H8 correlations for any other adenosine spin system apart from A5 and A22. Thus, we concluded that the correlations observed in the 3D (H)N(HH)CH spectrum are specific for trans‐Hoogsteen/sugar‐edge G:A base pairs. The longer distance of inter‐strand G‐H2–A‐H8 hydrogens compared to that of inter‐strand G‐H1′–A‐H6 hydrogens (≈2.4 Å) explains why the G‐H2–A‐H8 cross‐peaks were observable only when using the longer 1H‐1H RFDR mixing time of 0.96 ms.
We also implemented two different 3D versions of the (H)N(HHN)H experiment (Figure S1b), where we utilized a 1H‐1H RFDR mixing time of 0.48 ms and measured either 1H, 15N & 1H or 15N, 15N & 1H frequencies (as in Figure S2g‐j). Both experiments showed good sensitivity (Figure 5 d and Figure S5), but the 3D (HN)H(H)NH experiment, with acquisition of 1H, 15N & 1H frequencies, had better resolution of the base pairs. C‐H41/H42–G‐N1–H1 and C‐N4–G‐N1–H1 cross‐peaks were obtained for three out of the four G:C base pairs of the 26mer Box C/D RNA (Figure 5 d and Figure S5). No cross‐peak was observed for the terminal base pair C2:G24 due to either poor signal‐to‐noise or to fraying of the helix. The 3D (HN)H(H)NH experiment delivered also intra‐residue C‐H41–N4–H42 cross‐peaks, which correlated directly the H41 and H42 resonances.
Finally, the 3D (HN)H(H)NH experiment yielded the assignment of the U:U base pair by separating the resonances of the degenerate N3 atoms via the different chemical shifts of the H3 hydrogens: we observed a cross‐peak between two uridine imino protons at 9.6 and 11.4 ppm, which confirmed the presence of the predicted U:U base pair [8] between the two unique uridine spin‐systems containing these imino protons (U3:U23, Figure 5 e).
Conclusion
In summary, we have established a set of experiments that accomplishes the rapid identification of RNA base pair patterns, the complete assignment of all nucleobase spin‐systems (Table S5) and the assignment of the base pairs to individual spin‐systems. The experimental workflow requires a single uniformly‐labelled RNA sample and uses 1H‐detected MAS ssNMR at ultrafast (≥100 kHz) spinning rates. We are currently developing strategies for the determination of structural restraints based solely on 1H‐detected ssNMR applied to a single uniformly‐labelled RNA sample. These developments demonstrate that 1H‐detection has the potential to significantly strengthen the role of ssNMR in RNA structural biology and will pave the way for the determination of RNA structures within large biomolecular complexes.
Conflict of interest
The authors declare no conflict of interest.
Supporting information
Acknowledgements
This work was supported by the Deutsche Forschungsgemeinschaft (DFG grant CA294/21‐1 to TC). Open Access funding enabled and organized by Projekt DEAL.
P. I. Aguion, J. Kirkpatrick, T. Carlomagno, A. Marchanka, Angew. Chem. Int. Ed. 2021, 60, 23903.
Contributor Information
Prof. Dr. Teresa Carlomagno, Email: teresa.carlomagno@oci.uni-hannover.de.
Prof. Dr. Alexander Marchanka, Email: alexander.marchanka@oci.uni-hannover.de.
References
- 1. Carlomagno T., J. Magn. Reson. 2014, 241, 126–136. [DOI] [PubMed] [Google Scholar]
- 2. Fürtig B., Richter C., Wöhnert J., Schwalbe H., ChemBioChem 2003, 4, 936–962. [DOI] [PubMed] [Google Scholar]
- 3. Duss O., Maris C., Von Schroetter C., Allain F. H. T., Nucleic Acids Res. 2010, 38, e188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Tzakos A. G., Easton L. E., Lukavsky P. J., Nat. Protoc. 2007, 2, 2139–2147. [DOI] [PubMed] [Google Scholar]
- 5. Carlomagno T., Amata I., Codutti L., Falb M., Fohrer J., Masiewicz P., Simon B., J. Am. Chem. Soc. 2013, 135, 4403–4411. [DOI] [PubMed] [Google Scholar]
- 6. Brown J. D., Kharytonchyk S., Chaudry I., Iyer A. S., Carter H., Becker G., Desai Y., Glang L., Choi S. H., Singh K., Lopresti M. W., Orellana M., Rodriguez T., Oboh U., Hijji J., Ghinger F. G., Stewart K., Francis D., Edwards B., Chen P., Case D. A., Telesnitsky A., Summers M. F., Science 2020, 368, 413–417. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Marchanka A., Simon B., Carlomagno T., Angew. Chem. Int. Ed. 2013, 52, 9996–10001; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2013, 125, 10180–10185. [Google Scholar]
- 8. Marchanka A., Simon B., Althoff-Ospelt G., Carlomagno T., Nat. Commun. 2015, 6, 7024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Schubeis T., Le T., Andreas L. B., Pintacuda G., J. Magn. Reson. 2018, 287, 140–152. [DOI] [PubMed] [Google Scholar]
- 10. Struppe J., Quinn C. M., Lu M., Wang M., Lu X., Kraus J., Andreas L. B., Stanek J., Lalli D., Pintacuda G., Maas W., Gronenborn A. M., Solid State Nucl. Magn. Reson. 2017, 87, 117–125. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Su Y., Andreas L., Griffin R. G., Annu. Rev. Biochem. 2015, 84, 465–497. [DOI] [PubMed] [Google Scholar]
- 12. Andreas L. B., Le T., Jaudzems K., Pintacuda G., J. Magn. Reson. 2015, 253, 36–49. [DOI] [PubMed] [Google Scholar]
- 13. Böckmann A., Ernst M., Meier B. H., J. Magn. Reson. 2015, 253, 71–79. [DOI] [PubMed] [Google Scholar]
- 14. Marchanka A., Stanek J., Pintacuda G., Carlomagno T., Chem. Commun. 2018, 54, 8972–8975. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Ahmed M., Marchanka A., Carlomagno T., Angew. Chem. Int. Ed. 2020, 59, 6866–6873; [DOI] [PMC free article] [PubMed] [Google Scholar]; Angew. Chem. 2020, 132, 6933–6940. [Google Scholar]
- 16. Jehle S., Falb M., Kirkpatrick J. P., Oschkinat H., Van Rossum B. J., Althoff G., Carlomagno T., J. Am. Chem. Soc. 2010, 132, 3842–3846. [DOI] [PubMed] [Google Scholar]
- 17. Asami S., Rakwalska-Bange M., Carlomagno T., Reif B., Angew. Chem. Int. Ed. 2013, 52, 2345–2349; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2013, 125, 2401–2405. [Google Scholar]
- 18. Sklenář V., Peterson R. D., Rejante M. R., Feigon J., J. Biomol. NMR 1993, 3, 721–727. [DOI] [PubMed] [Google Scholar]
- 19. Farmer B. T., Müller L., Nikonowicz E. P., Pardi A., J. Biomol. NMR 1994, 4, 129–133. [DOI] [PubMed] [Google Scholar]
- 20. Fiala R., Czernek J., Sklenář V., J. Biomol. NMR 2000, 16, 291–302. [DOI] [PubMed] [Google Scholar]
- 21. Leppert J., Urbinati C. R., Häfner S., Ohlenschläger O., Swanson M. S., Görlach M., Ramachandran R., Nucleic Acids Res. 2004, 32, 1177–1183. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Riedel K., Leppert J., Ohlenschläger O., Görlach M., Ramachandran R., J. Biomol. NMR 2005, 31, 331–336. [DOI] [PubMed] [Google Scholar]
- 23. Schult P., Roth H., Adams R. L., Mas C., Imbert L., Orlik C., Ruggieri A., Pyle A. M., Lohmann V., Nat. Commun. 2018, 9, 2613. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Lu K., Heng X., Garyu L., Monti S., Garcia E. L., Kharytonchyk S., Dorjsuren B., Kulandaivel G., Jones S., Hiremath A., Divakaruni S. S., LaCotti C., Barton S., Tummillo D., Hosic A., Edme K., Albrecht S., Telesnitsky A., Summers M. F., Science 2011, 334, 242–245. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Watts J. M., Dang K. K., Gorelick R. J., Leonard C. W., W. B. Jr, Swanstrom R., Burch C. L., Weeks K. M., Nature 2009, 460, 711–716. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Yang Y., Xiang S., Liu X., Pei X., Wu P., Gong Q., Li N., Baldus M., Wang S., Chem. Commun. 2017, 53, 12886–12889. [DOI] [PubMed] [Google Scholar]
- 27. Leontis N. B., Westhof E., RNA 2001, 7, 499–512. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Lange A., Luca S., Baldus M., J. Am. Chem. Soc. 2002, 124, 9704–9705. [DOI] [PubMed] [Google Scholar]
- 29. Jeener J., Meier B. H., Bachmann P., Ernst R. R., J. Chem. Phys. 1979, 71, 4546–4553. [Google Scholar]
- 30. Bennett A. E., Ok J. H., Griffin R. G., Vega S., J. Chem. Phys. 1992, 96, 8624–8627. [Google Scholar]
- 31. Sodickson D. K., Levitt M. H., Vega S., Griffin R. G., J. Chem. Phys. 1993, 98, 6742–6748. [Google Scholar]
- 32. Marchanka A., Carlomagno T., Methods Enzymol. 2019, 615, 333–371. [DOI] [PubMed] [Google Scholar]
- 33. Zhou D. H., Rienstra C. M., J. Magn. Reson. 2008, 192, 167–172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Zhou D. H., Shah G., Cormos M., Mullen C., Sandoz D., Rienstra C. M., J. Am. Chem. Soc. 2007, 129, 11791–11801. [DOI] [PubMed] [Google Scholar]
- 35. Zhou D. H., Shea J. J., Nieuwkoop A. J., Franks W. T., Wylie B. J., Mullen C., Sandoz D., Rienstra C. M., Angew. Chem. Int. Ed. 2007, 46, 8380–8383; [DOI] [PMC free article] [PubMed] [Google Scholar]; Angew. Chem. 2007, 119, 8532–8535. [Google Scholar]
- 36. Li Y., Wylie B. J., Rienstra C. M., J. Magn. Reson. 2006, 179, 206–216. [DOI] [PubMed] [Google Scholar]
- 37. Szeverenyi N. M., Sullivan M. J., Maciel G. E., J. Magn. Reson. 1982, 47, 462–475. [Google Scholar]
- 38. Takegoshi K., Nakamura S., Terao T., Chem. Phys. Lett. 2001, 344, 631–637. [Google Scholar]
- 39. Correll C. C., Freeborn B., Moore P. B., Steitz T. A., Cell 1997, 91, 705–712. [DOI] [PubMed] [Google Scholar]
- 40. Cate J. H., Doherty E. A., Doudna J. A., Protein Eng. 1997, 10, 41. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.