Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Apr 21.
Published in final edited form as: J Am Chem Soc. 2015 Feb 2;137(5):2107–2115. doi: 10.1021/ja513080v

Structure and Thermodynamics of N6-Methyladenosine in RNA: A Spring-Loaded Base Modification

Caroline Roost , Stephen R Lynch , Pedro J Batista , Kun Qu , Howard Y Chang ‡,*, Eric T Kool †,*
PMCID: PMC4405242  NIHMSID: NIHMS677932  PMID: 25611135

Abstract

graphic file with name nihms677932f6.jpg

N6-Methyladenosine (m6A) modification is hypothesized to control processes such as RNA degradation, localization, and splicing. However, the molecular mechanisms by which this occurs are unclear. Here, we measured structures of an RNA duplex containing m6A in the GGACU consensus, along with an unmodified RNA control, by 2D NMR. The data show that m6A–U pairing in the double-stranded context is accompanied by the methylamino group rotating from its energetically preferred syn geometry on the Watson–Crick face to the higher-energy anti conformation, positioning the methyl group in the major groove. Thermodynamic measurements of m6A in duplexes reveal that it is destabilizing by 0.5–1.7 kcal/mol. In contrast, we show that m6A in unpaired positions base stacks considerably more strongly than the unmodified base, adding substantial stabilization in single-stranded locations. Transcriptome-wide nuclease mapping of methylated RNA secondary structure from human cells reveals a structural transition at methylated adenosines, with a tendency to single-stranded structure adjacent to the modified base.

INTRODUCTION

Post-transcriptionally modified bases in RNA are numerous and important to cellular function. The most common internal RNA modification in eukaryotes is adenine N6-methylation (Figure 1),1 which occurs, on average, at three sites per mRNA and is found on long noncoding RNAs as well.2,3 Numbers of substitutions per RNA vary from one to as many as 11 or more. Although the effect of methylation in codons on translation has yet to be determined, methylation loci occur most commonly in 3′ UTRs and near splice sites, suggesting a more common role in signaling and control rather than directly in protein coding. In this light, methylation has been shown to shorten the average lifetime of RNAs and to influence their subcellular localization.4 Significantly, the effect of this substitution on RNA structure and folding is not known for any of the thousands of RNAs that contain the modification.

Figure 1.

Figure 1

Structures and conformations of m6A in RNA. (A) Syn methyl orientation is favored over anti when the base is unpaired16 as a result of a steric clash between the methyl group and N7. (B) Space-filling models of m6A in syn and anti conformations (N9 substituent is methyl). (C) Multiple pairing configurations are possible for m6A paired opposite U; these configurations vary on whether the adenine base and the methylamino group are syn or anti. The current work confirms the third (anti/anti) structure in an RNA duplex; the methyl group is spring-loaded into the high-energy anti conformation, trapped there by pairing with U and surrounding duplex structure.

Although this RNA modification has been studied for decades, the biology and biochemistry of m6A methylation and demethylation is emerging rapidly in recent years. A methylation complex including enzyme METTL3 has been identified and shown to perform adenine methylation in eukaryotic cells,5,6 and FTO and AlkBH5 are two oxidative proteins that have been demonstrated to accomplish demethylation in vitro.7,8 The functions of these latter enzymes are associated with important physiological pathways including metabolism and fertility.810 Moreover, polymorphic variations in the FTO gene have been linked to breast and endometrial cancer risk.1113

Because both methylating and demethylating factors have been identified, m6A modification is hypothesized to act as a biological switch of RNA activity.14 However, the molecular mechanism by which methylation status affects biological pathways is as yet unclear. Current hypotheses have focused mainly on the possibility that proteins specifically recognize and bind m6A,15 marking the modification for further downstream events.

Here, we consider another possible mechanism by which N6 methylation might affect biological activities of RNAs: by altering their stability and conformation. The 6-methyl group on an adenine base alone in solution is known to exist in two conformations, with syn being energetically favored by ca. 1.5 kcal/mol over anti (Figure 1).16 Consistent with this, in the solid state it resides in syn orientation.17 The structure of the modified base in paired RNA is unknown; in single strands, it likely adopts the favored syn conformation,18 but in pairing positions, this is not clear. Indeed, simple inspection of base pairing models (Figure 1C) suggests at least three possible structures for m6A paired within duplexes. The question of which of these is formed could well-affect pairing geometry and stability of folded RNAs and ultimately the biology of this modification.

To study this question, here we have carried out biophysical and structural studies of discrete m6A residues in short RNAs. We report that single m6A modifications are destabilizing to RNA duplexes that contain them, but, in contrast, they are strongly stabilizing in unpaired positions adjacent to duplexes. We further show that the N6 methylamino group must distort to a high-energy conformation, rotating the methyl group into the major groove, in order to be accommodated into a locally paired helix. This suggests that enzymatic methylation in paired regions of RNA and, conversely, demethylation in unpaired regions could destabilize existing structure, possibly triggering larger conformation changes in the RNA and altering its susceptibility to degradation. Preliminary data mapping the structure of methylated sites in cellular RNAs reveals the presence of a structural transition near the methylated adenosine, consistent with the notion that m6A is preferentially situated at the transition between unpaired and duplex structure.

EXPERIMENTAL SECTION

RNA Synthesis

RNA oligonucleotides were synthesized using standard β-cyanoethyl phosphoramidite chemistry and 2′-O-TBDMS-protected ribonucleosides. The m6A phosphoramidite derivative was synthesized according to literature procedures.19 RNA oligonucleotides were purified by denaturing polyacrylamide gel electrophoresis and characterized by MALDI-MS. See Supporting Information for details and data.

Thermal Denaturation Experiments

Solutions for the thermal melting experiments contained a 1:1 ratio of two complementary strands in 1–5 µM total RNA concentration (Ct). Self-complementary cases were measured in 2–14 µM concentrations. The oligoribonucleotides were measured in 10 mM sodium phosphate buffer, pH 7.0, with 1 M NaCl. Prior to the measurements, solutions were heated at 90 °C and then slowly cooled to 10 °C to ensure a complete annealing of the strands. Absorbance at 260 nm was monitored while temperature was raised at a rate of 1 °C/min from 10 to 90 °C. Thermodynamics were derived via curve-fitting with the standard two-state model and by van’t Hoff fitting of data from varied concentrations. See Supporting Information for details and data.

NMR Methods

NMR experiments were acquired on a Varian Inova 600 MHz spectrometer with a z-gradient triple resonance HCN probe or a Varian Inova 500 MHz spectrometer with a z-gradient H{X} probe. 1H and 13C chemical shifts were referenced indirectly to tetramethylsilane (TMS); 31P shifts were referenced indirectly referenced to phosphoric acid. The exchangeable 1H resonances of the duplexes were assigned using through-space homonuclear SS-NOESY experiments at 15 °C with different mixing times (100 or 250 ms) on a sample in 90% H20/10% D2O. The nonexchangeable 1H resonances of the RNA were assigned with a combination of homonuclear NOESY experiments at varied mixing times (75, 150, and 300 ms), DQF-COSY, homonuclear TOCSY with 80 ms mixing time, 1H/13C HSQC, and 1H/31P heteronuclear COSY acquired at 25 °C on a sample dissolved in 99.996% D2O. See Supporting Information for details.

Structure Calculations

Structures were calculated using restrained molecular dynamics followed by energy minimization using the program XPLOR-NIH, version 2.35, with a force field consisting of bond lengths, bond angles, improper angles, repulsive van der Waals potentials, and experimental distance and dihedral constraints in the absence of electrostatics. Random starting structures were created by randomizing torsion angles for the calculation. See Supporting Information for details. Structures of the averaged duplexes have been deposited with the PDB (2MVY and 2MVS).

RNA Isolation and m6A Enrichment

GM12878 cells were cultured as described.20 Cells were collected at a confluence of 6 × 105 cells per milliliter, and RNA was extracted with TRIzol according to the manufacturer’s protocol. m6A immunoprecipitation and library construction were done as described by Batista et al.21 PolyA RNA selection was performed using MicroPoly(A) Purist (Life Technologies) according to the manufacturer’s protocol. The PolyA+ RNA was fragmented to ~100 nucleotide fragments by incubation in a zinc buffer (10 mM ZnCl2, 10 mM Tris-HCl, pH 7.0) for 2 min followed by ethanol precipitation. Methylated RNA was selected as described22 using anti-m6A polyclonal antibody (Synaptic Systems). The immunoprecipitated RNA and an equivalent amount of input RNA were used for library generation with the dUTP protocol, as described,23 except that libraries were size selected by gel purification after ligation and after PCR amplification. Libraries were sequenced using an Illumina HiSeq at the Stanford Center for Genomics and Personalized Medicine. For all libraries, single-end RNA-Seq reads were mapped to the human genome (hg19 assembly) using TopHat (version 1.1.3).24 Only uniquely mapped reads were subjected to downstream analyses. A nonredundant hg19 transcriptome was assembled from UCSC RefSeq genes and UCSC genes. The search for enriched peaks was performed by scanning each gene using 100 nucleotide sliding windows and calculating an enrichment score for each sliding window.2 To determine high-confidence sites, we intersected the peaks with m6A sites from ref 16. See Supporting Information for details.

PARS Mapping of RNAs

Nuclease V1 and S1 were used to map secondary structure of RNAs as described.20 PARS scores for the neighborhood of RRACH motifs were obtained from ref 20.

RESULTS

Adenine N6 Methylation Is Destabilizing in Base-Paired Regions of RNA

To evaluate the effect of m6A substitution on the stability of paired regions of RNA, we synthesized 8-mer RNA strands containing single m6A substitutions in two different sequence contexts as well as complementary RNAs for each. We also prepared a 10-mer self-complementary RNAs containing two sites of substitution, and these were used for thermodynamic and structural studies (see below). Sequences are shown in Table 1; one single-substitution sequence and the double-substitution sequence correspond to common methylation consensus sequences (GGAGU and CGACU),2,3,5 and one single-substitution sequence does not. We measured thermodynamics of duplex formation by UV-monitored thermal denaturation and obtained free energies from curve fitting and from van’t Hoff measurements. Primary data are given in the Supporting Information (Figures S1 and S2). All duplexes exhibited two-state behavior, as judged by good agreement of the two-state fitting model with the van’t Hoff data.

Table 1.

Thermal Denaturation Data for Base Pairing of m6A Compared with A

Duplex Tm (°C)* ΔG°37fit
(kcal/mol)**
ΔG°37vH
(kcal/mol)
ΔG°37avg
(kcal/mol)
1 graphic file with name nihms677932t1.jpg 47.4 ± 0.1 −10.7 ± 0.1 −10.9 ± 0.2 −10.8 ± 0.2
2 graphic file with name nihms677932t2.jpg 23.1 ± 0.4 −6.1 ± 0.1 −6.0 ± 0.2 −6.1 ± 0.2
3 graphic file with name nihms677932t3.jpg 29.9 ± 0.1 −7.3 ± 0.1 −6.9 ± 0.2 −7.1 ± 0.2
4 graphic file with name nihms677932t4.jpg 33.2 ± 0.3 −8.0 ± 0.1 −8.5 ± 0.1 −8.3 ± 0.1
5 graphic file with name nihms677932t5.jpg 43.7 ± 0.2 −10.2 ± 0.1 −10.5 ± 0.1 −10.4±0.1
6 graphic file with name nihms677932t6.jpg 24.3 ± 0.1 −6.3 ± 0.1 −5.3 ± 0.5 −5.8 ± 0.5
7 graphic file with name nihms677932t7.jpg 27.3 ± 0.1 −6.8 ± 0.1 −7.0 ± 0.2 −6.9 ± 0.2
8 graphic file with name nihms677932t8.jpg 28.3 ± 0.2 −7.0 ± 0.1 −7.5 ± 0.1 −7.3 ± 0.1
9 graphic file with name nihms677932t9.jpg 49.9 ± 0.2 −11.5 ± 0.1 −11.3 ± 0.2 −11.4 ± 0.2
10 graphic file with name nihms677932t10.jpg 44.2 ± 0.1 −10.2 ± 0.1 −9.2 ± 0.1 −9.7 ± 0.1
11 graphic file with name nihms677932t11.jpg 70.6 ± 0.1 −16.4 ± 0.1 −14.3 ± 0.2 −15.4 ± 0.2
12 graphic file with name nihms677932t12.jpg 64.4 ± 0.4 −15.0 ± 0.2 −13.5 ± 0.2 −14.2 ± 0.2
*

Conditions: 100 mM NaCl, 10 mM MgCl2, 10 mM Na-PIPES (pH 7.0), [5 µM] duplex.

**

Averaged free energies derived from two-state curve fits.

Free energies derived from van’t Hoff linear plots.

Average of free energy from curve fits and van’t Hoff plots.

The experiments show that N6 methylation is consistently destabilizing to RNA duplexes when substituted opposite a complementary U within the helix. In one single-substitution context (Table 1, entries 1 and 5), the amount of destabilization is relatively small (0.5 kcal/mol (37 °C)), whereas in the second context (entries 9 and 10), the cost is considerably larger (1.7 kcal/mol). For the double-substitution context, the destabilization amounts to 0.6 kcal/mol per methyl. These values are consistent with an early study of polymer RNAs, which suggested destabilization of 0.5–1.0 kcal/mol,18 and with a report of Kierzek,25 which identified a destabilization of ~1.1 kcal per methyl in doubly substituted RNAs. The current experiments, the first to test the effects of single methyl groups and multiple sequence contexts, establish that there is a strong dependence of stability on context.

We are aware of no prior reports measuring pairing selectivity of N6-methylA in RNA; therefore, we carried out experiments to evaluate this parameter. Despite the destabilization caused by m6A, our studies pairing it opposite varied bases show that m6A retains selectivity for its Watson–Crick complement U (Table 1, entries 6–8 compared with entries 2–4 in the unmethylated case). The average pairing selectivity of m6A is similar to that of A; the chief differences are the greater selectivity of m6A against pairing with C and the slightly lower selectivity against pairing with A or G. Taken together, the pairing data suggest that the pairing destabilization, more than altered pairing selectivity, is likely to have the largest influence on structure.

Adenine N6 Methylation Stabilizes Base Stacking at Helix Ends

Since stability of folded nucleic acids depends heavily on base stacking,26 we measured the effect of N6 methyation on the stacking of adenine in unpaired (dangling-end) contexts at the 5′ and 3′ ends of a hexamer core duplex. The data and sequences are given in Table 2. As is well-established,26 adenine stacks relatively strongly on adjacent duplexes, especially at their 3′ end; here, we find a stabilization of 1.5 kcal/mol for addition of two 3′ dangling (unmodified) adenosine residues (0.75 kcal/mol per adenosine) and a smaller stacking stabilization of 0.26 kcal per adenosine at the 5′ end in the same context (compare entries 2 and 4 to entry 1), consistent with prior studies.26 Significantly, we find that methylation affects stacking of adenine strongly: m6A stacks 0.42 kcal/mol per methyl more strongly than A at the 3′ end and 0.58 kcal more strongly at the 5′ end (entries 3 and 5). Adenine is the strongest stacking of the four RNA bases,26 and we find here that addition of a methyl group enhances stacking by as much, on average, as the entire nucleobase adenine does alone. This favorable stacking likely arises from hydrophobic shielding of the methyl group against neighboring bases.27 Thus, it is clear that in these contexts m6A modification of an unpaired adenine substantially increases the stability of nearby helices. Importantly, this is the opposite effect that the N6 modification has within duplexes, and it is of similar magnitude.

Table 2.

Thermal Denaturation Data for m6A End Stacking in Self-Complementary Duplexes

Duplex Tm (°C)* ΔG°37fit
(kcal/mol)**
ΔG°37vH
(kcal/mol)
ΔG°37avg
(kcal/mol)
1 graphic file with name nihms677932t13.jpg 53.3 ± 0.2 −10.2 ± 0.1 −11.6 ± 0.1 −10.9 ± 0.1
2 graphic file with name nihms677932t14.jpg 61.4 ± 0.1 −12.1 ± 0.1 −12.7 ± 0.1 −12.4 ± 0.1
3 graphic file with name nihms677932t15.jpg 62.7 ± 0.4 −12.4 ± 0.1 −14.1 ± 0.2 −13.2 ± 0.2
4 graphic file with name nihms677932t16.jpg 56.3 ± 0.2 −10.8 ± 0.2 −12.0 ± 0.2 −11.4 ± 0.2
5 graphic file with name nihms677932t17.jpg 59.7 ± 0.3 −11.6 ± 0.1 −13.5 ± 0.2 −12.6 ± 0.2
*

Conditions: 100 mM NaCl, 10 mM MgCl2, 10 mM Na-PIPES (pH 7.0), [5 µM] duplex.

**

Averaged free energies derived from two-state curve fits.

Free energies derived from van’t Hoff linear plots.

Average of free energy from curve fits and van’t Hoff plots.

Structures of Methylated and Unmethylated RNAs Reveal a Conformational Switch in m6A upon Pairing

To study the structural origins of destabilization by m6A when paired, next we studied the structures of a duplex RNA containing m6A residues (denoted MA) and of a control RNA lacking the modification (AD) in solution. The self-complementary sequences are shown in Table 1, entries 9 and 10, and contain the common GGACU methylation consensus. The downfield portion of the 1D 1H NMR in H2O spectra is shown in Figure 2. The five resonances observed in both spectra are the imino resonances of uridine and guanosine, confirming that both duplexes are fully base-paired (note that each duplex has five symmetrically unique pairs). The imino resonances are shifted downfield as a result of hydrogen bonding and are sharp, indicating a slow exchange rate with water. The only broadened resonance corresponds to G1 in the end base pair, which is more exposed to solvent. The chemical shift for the NH of U8, opposite m6A3/A3, is nearly identical for both sequences. Thus, the data suggest a normal Watson–Crick base pair involving this adenine in both duplexes, as depicted in Figure 1C (right).

Figure 2.

Figure 2

Proton NMR spectra of RNA duplexes containing GGACU methylation motif, confirming fully paired structure. (A) 1D proton spectrum of methylated duplex; (B) spectrum of unmethylated duplex for comparison. Downfield regions of the 1D 1H NMR spectra are shown; they were acquired in 90% H2O/10% D2O at 15 °C with 11 solvent suppression and show evidence of 10 H-bonded base pairs (5 symmetrically unique) for methylated duplex MA (A) and unmodified AD (B).

NMR data were then acquired in D2O to analyze the structures of both oligonucleotides by 2D methods. A comparison of the two duplexes via 150 ms mixing time NOESY experiments is shown in Figure S3; data for the methylated duplex alone are given in Figure 3. The data show that normal base stacking is present for both RNA duplexes, as confirmed by an intact tracing of the intra- and internucleotide NOEs throughout the sequence. The similar NOE intensities for these data as well as other regions of the NOESY spectra (not shown) suggest that the structures of the two duplexes are similar, with 10 base pairs in each and sequential base stacking throughout. Slight chemical shift changes are observed for some resonances, chiefly on the base of N-methyladenosine, as well as the uridine that it is paired with, and adjacent nucleotides.

Figure 3.

Figure 3

H1′/aromatic portion of a 150 ms mixing time NOESY in D2O is shown for the methylated MA duplex, highlighting the sequential assignments of all bases. Sequence is 5′-GGACUAGUCC, where the bold base is methylated.

The N-methyl location and orientation were confidently assigned based upon NOESY data, which show clearly that the methylamino group is in the anti conformation. The NOEs observed in a 100 ms mixing time H2O SS-NOESY experiment for the N-methyl resonance are highlighted in Figure S4. The observed NOEs fit with a group of protons on the Watson–Crick face of the base, as NOEs are observed to multiple NH and NH2 resonances on the N-methyladenine base, the uridine that it is paired with, and the neighboring base pairs. Additionally, a weak NOE is observed to H8 of N-methyladenosine, confirming that the methyl group is rotated toward the H8 proton, and the NH of m6A is rotated to hydrogen-bond with the U8-carbonyl.

The structural arrangement of the modified nucleobase and its methyl group are no doubt essential to its biophysical and biochemical properties. To further confirm the above structural assignment, we considered three possible base-pairing arrangements between the m6A and uridine (Figure 1C). One of these (syn/syn) allows both two hydrogen-bonded base pairing and the favorable syn methyl orientation. However, the data clearly rule this out. Normal base stacking NOEs were observed from the sugar resonances of G2 and the sugar resonances of m6A to the N-methyladenosine H8. If the base were in the syn conformation, then the H1′/H8 NOE would be very intense, but such a NOE is not observed. In addition, moderate intensity NOEs were observed to the H1′ of U4 and C9 from the N-methyladenosine H2, which are normal for an A-form helix. Furthermore, weak NOEs were observed at long mixing time from the methyl to the N-methyladenosine H8, as well as the G2 H8 and the C4 H5, which would not be observed if the methyl group were in the syn conformation. Further confirmation that the assignments of H2 and the H8 of N-methyladenosine are not reversed is obtained from HSQC data, which confirms the H8 assignment at 8.1 ppm with a C8 shift of 137 ppm, in contrast to the H2 at 7.71 correlating to the C2 at 152 ppm.

In H2O, intense NOEs were observed from the U8 imino to the N-methyladenosine H2 and NH but not the methyl group, also consistent with the methyl group rotated away from the U8 imino. Furthermore, the H2 rather than the H8 is close to the U8 imino proton, consistent with a Watson–Crick rather than a Hoogsteen base-pair. In addition, the NH of N-methyladenosine is sharp and shifted downfield to 8.34 ppm, suggesting two hydrogen bonds of a Watson–Crick A–U pair and ruling out the singly H-bonded (base anti/methyl syn) arrangement depicted in Figure 1C.

The amino portion of the 1D 1H NMR in H2O is shown in Figure S5. The NH of the N6-methyladenosine is observed to be significantly sharper than the hydrogen-bonded amino resonance of the A3 in the unmodified duplex. This data suggests that rotation about the NH bond is slower for the N-methyl adenosine than for adenosine, leading to a sharper resonance. This is further highlighted in Figure S5B, showing the NOEs in the imino to amino/aromatic region in an SS-NOESY experiment. The NOE observed between the U8 imino and m6A in the sequence is significantly more intense than that to the NH2 of adenosine in the unmodified duplex. This data suggests that while the NH2 of adenosine and the NH of N6-methyladenosine are both hydrogen-bonded to U the slower rotation of the methylamino group results in a more intense NOE. Slower rotation about the C6–N6 bond in m6A is no doubt the result of steric hindrance in the locally stacked and H-bonded base pairs, which would be temporarily disrupted by rotating the bulky methylamino group. Thus, local pairing traps the methylamino group in a high-energy conformation.

Solution structures were calculated for both oligonucleotide duplexes using 354 NOE derived distance constraints for the sequence with N-methyladenosine and 340 NOE derived distance constraints for the sequence with unmodified adenosine. Starting with 30 random structures, 25 converged to low energy for both molecules. A superposition of these 25 structures for each is shown in Figure S6. The overall heavy atom rmsd to the average structure was 0.87 Å for methylated RNA and 0.96 Å for the unmodified duplex.

A comparison of the average structure for each oligonucleotide is shown in Figure 4A. The rmsd between all heavy atoms of the two structures is 1.07 Å. The two structures are quite similar overall, with A-form helices containing 10 Watson–Crick base pairs. The only minor difference is a slight change in overall bending and winding of the helix, which is difficult to directly measure with only short-range NMR constraints. Superimposing only the six nucleotides of the second, third, and fourth base pairs, including the N-methyladenosine–uridine base pair and the two nucleotides above and below in the sequence, results in an rmsd of 0.22 Å. Thus, the local differences between the two structures are small. The m6A methyl group is located in the major groove (Figures 4B,C and S7), with closest approach to N7 of the modified adenine and to major groove substituents on adjacent G and C bases in the GGACU consensus.

Figure 4.

Figure 4

Structures of methylated and unmodified RNA duplexes, establishing that m6A is oriented anti in a paired duplex, with the methyl group anti as well. (A) The average structure of the entire 10 bp duplexes with the MA in blue and unmodified DA RNA in red. (B) Superimposed structures of the second, third, and fourth base pairs alone; the m6A3/U8*;A3/U8* and the neighboring base pair (C4/G7*) are shown for clarity. The second strands in the symmetric dimer are indicated with an asterisk (*). (C) Surface representation of the structure of the methylated RNA duplex, highlighting the adenine N6 methyl groups in the major groove (light blue).

Nuclease Mapping of the Structures of Methylated Adenine Sites in Mammalian RNA

To determine the structural context and possible impact of m6A in cellular RNAs, we compared the RNA secondary structure around methylated and nonmethylated adenosines in the RRACH motif. We recently determined the RNA secondary structure of the transcriptome of the human lymphoblastoid cell line GM12878 by parallel analysis of RNA structure (PARS).20 To identify m6A modification sites, we performed m6A-IP-sequencing in GM12878 and also intersected our m6A-enriched peaks with a high-resolution catalog of m6A sites in diverse human cells.28 For these high-confidence sites, we calculated the average PARS score for each nucleotide around m6A-modified and unmodified RRACH motifs (Figure 5). The PARS score reflects the per base ratio of the number of reads obtained at cleavage sites by double- vs single-stranded nucleases and indicates whether a base is likely to be paired (positive score) or unpaired (negative score). We observed that methylated sites are clearly distinguished from unmodified sites through their structural profiles. In modified RNAs, there is a clear transition in average structure, as indicated by nuclease reactivity. In particular, the purine nucleotide immediately upstream of the modified A has a high probability of being single-stranded, with a strong negative PARS score. The methylated base itself has intermediate probability of being unpaired. In contrast, the two residues immediately to the 3′ side of m6A show a significant tendency toward paired structure. Because the PARS data were generated from purified RNA refolded in vitro, the observed structural profile reflects information encoded in the RNA molecule itself, in the absence of influence from proteins such as m6A readers.15 These results with cellular mRNAs suggest that methylation occurs to situate the methyl group in between unpaired RNA and adjacent to helices, which our thermodynamic data suggest is the most energetically favorable outcome.

Figure 5.

Figure 5

Nuclease structural mapping reveals a structural transition from unpaired to paired structure surrounding m6A. Plot shows average PARS score for each nucleotide in the vicinity of methylated vs unmethylated adenines in the RRACH methylation consensus, as mapped for polyadenylated RNAs from GM12878 cells. Non-methylated sites were chosen as the 5000 sites with the lowest enrichment score after m6A IP. y-axis represents PARS score. Positive values indicate high probability of double-stranded conformation, as evidenced by RNase V1 cleavage, whereas negative values indicate high probability of single-stranded conformation, as evidenced by RNase S1 cleavage. x-axis represents nucleotide position.

DISCUSSION

Taken together, our data show that the m6A–U pair in a stable RNA duplex is paired in canonical configuration, with two Watson–Crick hydrogen bonds. To accommodate this, the N6-methyl group in the modified duplex is rotated into the high-energy anti conformation, with the methyl group aimed directly into the major groove of the RNA, where the amino proton would normally be. This is further highlighted in the surface representation in Figure 4b, showing the methyl projecting into the groove. Closest contacts with the methyl group include the major groove edges of the bases above and below (G2 and C4, Figure S7).

Previous NMR studies of the m6A nucleobase in free solution have shown that the methylamino group prefers the syn conformation, with an energetic preference of ca. 1.5 kcal/mol.16 This preference is likely due to steric clashing of the methyl group with N7 in the anti arrangement. Thus, our experiments confirm that formation of an m6A–U base pair requires flipping and trapping of the methylamino group into an energetically unfavorable spring-loaded orientation. This is partially compensated for by hydrogen bonding and Watson–Crick geometry matching with U (Figure 1C); the end result is the net destabilization of the duplex by 0.5–1.7 kcal/mol for a single methyl substitution. Studies of 6-methyldeoxyadenosine in duplex DNA suggest that the methyl group slows hybridization,29 which is consistent with a requirement for rotation of the methylamino group into a high-energy conformation before base pairing can occur. Thus, we conclude that, when paired in RNA with stable base pairing surrounding it, m6A acts as a compressed spring that is locked into place by its paired context.

Our finding of this Watson–Crick paired geometry helps to explain the enzymatic properties of m6A. Most reverse transcriptase enzymes are able to read through this modification with little difficulty19 and to pair it with T, as expected for the canonical paired geometry. This would not be expected if non-Watson–Crick orientations (Figure 1C) were found. Such reverse transcriptases may have sufficient steric room in the major groove to accommodate the anti methyl group with little penalty. However, at least one polymerase (Tth, a DNA polymerase that also exhibits reverse transcriptase activity) does show ca. 10-fold selectivity against the methyl modification,19 which may reflect more sterically restrictive interactions with the major groove.

Our data help to clarify conflicting early studies that measured pairing effects of m6A in the duplex context. While experiments by Kierzek with various adenine N-substitutions showed thermodynamic destabilization by methyl substitution,25 a study by Micura of m6A in a longer duplex context showed no destabilization by this modification.30 Our data, measured with short duplexes, are consistent with Kiezek’s measurements, and we hypothesize that a long duplex context masks the effects of single m6A substitutions.

In contrast to this duplex destabilization, methylation of adenosine adjacent to helices is shown here to stabilize the overall structure substantially by enhancing base stacking, contributing 0.4–0.6 kcal of extra stabilization. m6A in unpaired environments is known to adopt the relaxed (syn) conformation.16,17 This would have the effect of placing the methyl group in contact with the adjacent base pair, which likely adds favorable hydrophobic stabilization.27

As a result of these opposing effects, we hypothesize that enzymatic methylation and demethylation could affect RNA conformations more globally. For example, methylation of adenine within a duplex region would destabilize it. If the local duplex surrounding the methylation site is not highly stable, then this could drive unwinding, leading to a less folded structure on average. Such a conformational effect was previously hypothesized by Micura,30 although destabilization by m6A was not observed in that study. Such unwinding could well have biological consequences: experiments have shown that elimination of duplex structures in RNAs can lead to more rapid degradation.31,32 Similarly, genome-wide studies in yeast have shown a correlation of low melting temperatures with degradation by the exosome.33 Thus, methylation in duplex regions could affect cellular lifetime of the RNA by making it more accessible to nucleases. In this light, it has been shown that methylated RNAs do, in fact, have a shorter cellular lifetime than unmethylated RNAs.4 The preferential location of m6A in 3′-UTRs is also consistent with this idea, given that duplex structure in 3′-UTRs is associated with increased lifetimes.31

We also speculate that such methylation of A in a duplex context could also lead, in principle, to a bimodal conformation switch to a second folded structure, which could be further stabilized by stacking if the m6A residue is unpaired but near a helical region in the second structure. The newly emplaced m6A residue in this scenario would act as a spring-loaded switch, driving local helix unpairing and resulting in the methylamino group flipping from anti to the more stable syn orientation. Once unpaired, the modified base could fold into a second conformation in which m6A remains favorably unpaired and gains stacking interactions with neighboring helices. The overall driving force for a single methylation event of this type would be the combined energetic effects of syn preference and stacking, totaling from 0.9 to as much as 2.3 kcal/mol from the current data. Conversely, enzymatic demethylation of unpaired m6A would have the opposite effect, decreasing beneficial stacking and favoring, relatively speaking, an RNA conformation in which the base is paired.

If such a switching mechanism were active, then we would expect it to be found in RNA contexts where two RNA folds were balanced reasonably closely in energy, although changes in methylation state at more than one site could magnify the energetic influence on conformations. If such switching occurs, then one would expect methylated adenine residues in such GGACH motifs to be preferentially located between unpaired bases and helices, having switched to that favorable conformation. Significantly, our PARS mapping data for methylated RNAs isolated from a human cell line do show striking evidence that m6A is preferentially situated in such a structural context. Notably, we find a preference for 5′ location of m6A next to helical structure, which is precisely the most stabilizing context for methylation in our thermodynamics studies (Table 2). Our thermodynamics data suggest that a dangling 3′ location of m6A would also be favorable, although less so than 5′ stacked locations. Where the m6A-stacked situation occurs in cellular RNAs, it appears to be dominated by the 5′ location in frequency, as judged by the PARS data. Whether this nuclease-mapped context is static or changes dynamically with methylation/demethylation remains to be seen; this hypothesis could be tested further by in vivo and in vitro structure probing34 of methylated versus unmethylated RNAs.

Our data show that m6A modification and context have substantial influences on RNA stability and structure. However, such effects may not be limited to RNA alone; for example, m6A-binding proteins4,35,36 may influence, and be influenced by, the dynamic conformation changes associated with the modification. A recent structural study of a YTH domain bound to an m6A-containing RNA reveals m6A in its relaxed syn conformation and present in a deep pocket.36 This suggests that methylation of A in a double-stranded context would assist protein binding either by destabilizing the helix around it, thus allowing protein access, or by causing a conformation change to place m6A in a single-stranded context. Thus, although our data show that methylation can affect RNA structure and stability without the aid of proteins, it is also quite possible that both may act together to affect RNA structure as well.

Also relevant to m6A context are the structural preferences of methylating and demethylating enzymes. The structural preference of the METTL3 methylation complex for adenine methylation in the RRACH consensus is as yet unclear;37 we would predict a preference for duplex contexts (prior to methylation) if methyl-driven unwinding or conformational switching is a widely active mechanism. Conversely, demethylating enzymes should exhibit a preference for unpaired m6A in this scenario. Interestingly, the enzyme FTO, known to have m6A demethylase activity, does appear in preliminary studies to prefer unpaired RNA as a context for demethylation over duplex structure,7 which is consistent with our hypothesis. Future work will address these issues in detail.

Supplementary Material

2

ACKNOWLEDGMENTS

We thank the U.S. National Institutes of Health (GM068122 and GM067201 to E.T.K.; R01-HG004361 and P50-HG007735 to H.Y.C.) for support. C.R. acknowledges the Swiss National Science Foundation for a postdoctoral fellowship.

Footnotes

ASSOCIATED CONTENT

Supporting Information

Additional experimental methods. Table S1: Mass spectrometry data for oligonucleotides. Figure S1: Representative thermal denaturation curves. Figure S2: Representative van’t Hoff plots. Figure S3: The H1′/aromatic portion of NOESY data showing sequence connectivity. Figure S4: NOEs from the methyl group of N-methyl adenosine. Figure S5: Amino/imino NMR resonances and NOEs in water. Figure S6: Comparison of the superposition of 25 structures of each duplex. Figure S7: View of m6A in the major groove of the methylated MA duplex. This material is available free of charge via the Internet at http://pubs.acs.org.

The authors declare no competing financial interest.

REFERENCES

  • 1.Schibler U, Kelley DE, Perry RP. J. Mol. Biol. 1977;115:695–714. doi: 10.1016/0022-2836(77)90110-3. [DOI] [PubMed] [Google Scholar]
  • 2.Dominissini D, Moshitch-Moshkovitz S, Schwartz S, Salmon-Divon M, Ungar L, Osenberg S, Cesarkas K, Jacob-Hirsch J, Amariglio N, Kupiec M, Sorek R, Rechavi G. Nature. 2012;485:201–206. doi: 10.1038/nature11112. [DOI] [PubMed] [Google Scholar]
  • 3.Meyer KD, Saletore Y, Zumbo P, Elemento O, Mason CE, Jaffrey SR. Cell. 2012;150:1–12. doi: 10.1016/j.cell.2012.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Wang X, Lu Z, Gomez A, Hon GC, Yue Y, Han D, Fu Y, Parisien M, Dai Q, Jia G, Ren B, Pan T, He C. Nature. 2014;505:117–120. doi: 10.1038/nature12730. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Csepany T, Lin A, Baldick CJ, Jr, Beemon K. J. Biol. Chem. 1990;265:20117–20122. [PubMed] [Google Scholar]
  • 6.Bokar JA, Shambaugh ME, Polayes D, Matera AG, Rottman FM. RNA. 1997;3:1233–1247. [PMC free article] [PubMed] [Google Scholar]
  • 7.Jia G, Fu Y, Zhao X, Dai Q, Zheng G, Yang Y, Yi C, Lindahl T, Pan T, Yang YG, He C. Nat. Chem. Biol. 2011;7:885–887. doi: 10.1038/nchembio.687. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Zheng G, Dahl JA, Niu Y, Fedorcsak P, Huang CM, Li CJ, Vågbø CB, Shi Y, Wang WL, Song SH, Lu Z, Bosmans RP, Dai Q, Hao YJ, Yang X, Zhao WM, Tong WM, Wang XJ, Bogdan F, Furu K, Fu Y, Jia G, Zhao X, Liu J, Krokan HE, Klungland A, Yang YG, He C. Mol. Cell. 2013;49:18–29. doi: 10.1016/j.molcel.2012.10.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Gerken T, Girard CA, Tung YC, Webby CJ, Saudek V, Hewitson KS, Yeo GS, McDonough MA, Cunliffe S, McNeill LA, Galvanovskis J, Rorsman P, Robins P, Prieur X, Coll AP, Ma M, Jovanovic Z, Farooqi IS, Sedgwick B, Barroso I, Lindahl T, Ponting CP, Ashcroft FM, O’Rahilly S, Schofield CJ. Science. 2007;318:1469–1472. doi: 10.1126/science.1151710. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Zheng G, Dahl JA, Niu Y, Fu Y, Klungland A, Yang YG, He C. RNA Biol. 2013;10:915–918. doi: 10.4161/rna.24711. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Kaklamani V, Yi N, Sadim M, Siziopikou K, Zhang K, Xu Y, Tofilon S, Agarwal S, Pasche B, Mantzoros C. BMC Med. Genet. 2011;12:52–61. doi: 10.1186/1471-2350-12-52. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Garcia-Closas M, Couch FJ, Lindstrom S, Michailidou K, Schmidt MK, Brook MN, Orr N, Rhie SK, Riboli E, Feigelson HS, Le Marchand L, Buring JE, Eccles D, Miron P, Fasching PA, Brauch H, Chang-Claude J, Carpenter J, Godwin AK, Nevanlinna H, Giles GG, Cox A, Hopper JL, Bolla MK, Wang Q, Dennis J, Dicks E, Howat WJ, Schoof N, Bojesen SE, Lambrechts D, Broeks A, Andrulis IL, Guénel P, Burwinkel B, Sawyer EJ, Hollestelle A, Fletcher O, Winqvist R, Brenner H, Mannermaa A, Hamann U, Meindl A, Lindblom A, Zheng W, Devillee P, Goldberg MS, Lubinski J, Kristensen V, Swerdlow A, Anton-Culver H, Dörk T, Muir K, Matsuo K, Wu AH, Radice P, Teo SH, Shu XO, Blot W, Kang D, Hartman M, Sangrajrang S, Shen CY, Southey MC, Park DJ, Hammet F, Stone J, Veer LJ, Rutgers EJ, Lophatananon A, Stewart-Brown S, Siriwanarangsan P, Peto J, Schrauder MG, Ekici AB, Beckmann MW, Dos Santos Silva I, Johnson N, Warren H, Tomlinson I, Kerin MJ, Miller N, Marme F, Schneeweiss A, Sohn C, Truong T, Laurent-Puig P, Kerbrat P, Nordestgaard BG, Nielsen SF, Flyger H, Milne RL, Perez JI, Menéndez P, Müller H, Arndt V, Stegmaier C, Lichtner P, Lochmann M, Justenhoven C, Ko YD, Gene ENvironmental Interaction and breast CAncer (GENICA) Network. Muranen TA, Aittomäki K, Blomqvist C, Greco D, Heikkinen T, Ito H, Iwata H, Yatabe Y, Antonenkova NN, Margolin S, Kataja V, Kosma VM, Hartikainen JM, Balleine R, kConFab Investigators. Tseng CC, Berg DV, Stram DO, Neven P, Dieudonné AS, Leunen K, Rudolph A, Nickels S, Flesch-Janys D, Peterlongo P, Peissel B, Bernard L, Olson JE, Wang X, Stevens K, Severi G, Baglietto L, McLean C, Coetzee GA, Feng Y, Henderson BE, Schumacher F, Bogdanova NV, Labrèche F, Dumont M, Yip CH, Taib NA, Cheng CY, Shrubsole M, Long J, Pylkäs K, Jukkola-Vuorinen A, Kauppila S, Knight JA, Glendon G, Mulligan AM, Tollenaar RA, Seynaeve CM, Kriege M, Hooning MJ, van den Ouweland AM, van Deurzen CH, Lu W, Gao YT, Cai H, Balasubramanian SP, Cross SS, Reed MW, Signorello L, Cai Q, Shah M, Miao H, Chan CW, Chia KS, Jakubowska A, Jaworska K, Durda K, Hsiung CN, Wu PE, Yu JC, Ashworth A, Jones M, Tessier DC, González-Neira A, Pita G, Alonso MR, Vincent D, Bacot F, Ambrosone CB, Bandera EV, John EM, Chen GK, Hu JJ, Rodriguez-Gil JL, Bernstein L, Press MF, Ziegler RG, Millikan RM, Deming-Halverson SL, Nyante S, Ingles SA, Waisfisz Q, Tsimiklis H, Makalic E, Schmidt D, Bui M, Gibson L, Müller-Myhsok B, Schmutzler RK, Hein R, Dahmen N, Beckmann L, Aaltonen K, Czene K, Irwanto A, Liu J, Turnbull C, Familial Breast Cancer Study (FBCS) Rahman N, Meijers-Heijboer H, Uitterlinden AG, Rivadeneira F, Australian Breast Cancer Tissue Bank (ABCTB) Investigators. Olswold C, Slager S, Pilarski R, Ademuyiwa F, Konstantopoulou I, Martin NG, Montgomery GW, Slamon DJ, Rauh C, Lux MP, Jud SM, Bruning T, Weaver J, Sharma P, Pathak H, Tapper W, Gerty S, Durcan L, Trichopoulos D, Tumino R, Peeters PH, Kaaks R, Campa D, Canzian F, Weiderpass E, Johansson M, Khaw KT, Travis R, Clavel-Chapelon F, Kolonel LN, Chen C, Beck A, Hankinson SE, Berg CD, Hoover RN, Lissowska J, Figueroa JD, Chasman DI, Gaudet MM, Diver WR, Willett WC, Hunter DJ, Simard J, Benitez J, Dunning AM, Sherman ME, Chenevix-Trench G, Chanock SJ, Hall P, Pharoah PD, Vachon C, Easton DF, Haiman CA, Kraft P. Nat. Genet. 2013;45:392–398. [Google Scholar]
  • 13.Delahanty RJ, Beeghly-Fadiel A, Xiang YB, Long J, Cai Q, Wen W, Xu WH, Cai H, He J, Gao YT, Zheng W, Shu XO. Am. J. Epidemiol. 2011;174:1115–1126. doi: 10.1093/aje/kwr233. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Jia G, Fu Y, He C. Trends Genet. 2013;29:108–115. doi: 10.1016/j.tig.2012.11.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Wang X, He C. RNA Biol. 2014;11:669–672. doi: 10.4161/rna.28829. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Engel JD, von Hippel PH. Biochemistry. 1974;13:4143–4158. doi: 10.1021/bi00717a013. [DOI] [PubMed] [Google Scholar]
  • 17.Sternglanz H, Bugg CE. Science. 1973;182:833–834. doi: 10.1126/science.182.4114.833. [DOI] [PubMed] [Google Scholar]
  • 18.Engel JD, von Hippel PH. J. Biol. Chem. 1978;253:927–934. [PubMed] [Google Scholar]
  • 19.Harcourt EM, Ehrenschwender T, Batista PJ, Chang HY, Kool ET. J. Am. Chem. Soc. 2013;135:19079–19082. doi: 10.1021/ja4105792. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Wan Y, Qu K, Zhang QC, Flynn RA, Manor O, Ouyang Z, Zhang J, Spitale RC, Snyder MP, Segal E, Chang HY. Nature. 2014;505:706–709. doi: 10.1038/nature12946. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Batista PJ, Molinie B, Wang J, Qu K, Zhang J, Li L, Bouley DM, Lujan E, Haddad B, Daneshvar K, Carter AC, Flynn RA, Zhou C, Lim K, Dedon P, Wernig M, Mullen AC, Xing Y, Giallourakis CC, Chang HY. Cell Stem Cell. 2014;15:707–719. doi: 10.1016/j.stem.2014.09.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Xiao R, Moore DD. Current Protocols in Molecular Biology. Unit 21.21 New York: John Wiley & Sons; 2011. [Google Scholar]
  • 23.Levin JZ, Yassour M, Adiconis X, Nusbaum C, Thompson DA, Friedman N, Gnirke A, Regev A. Nat. Methods. 2010;7:709–715. doi: 10.1038/nmeth.1491. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Trapnell C, Pachter L, Salzberg SL. Bioinformatics. 2009;25:1105–1111. doi: 10.1093/bioinformatics/btp120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Kierzek E, Kierzek R. Nucleic Acids Res. 2003;31:4472–4480. doi: 10.1093/nar/gkg633. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Petersheim M, Turner DH. Biochemistry. 1983;22:256–263. doi: 10.1021/bi00271a004. [DOI] [PubMed] [Google Scholar]
  • 27.Guckian KM, Schweitzer BA, Ren RX-F, Sheils CJ, Tahmassebi DT, Kool ET. J. Am. Chem. Soc. 2000;122:2213–2222. doi: 10.1021/ja9934854. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Schwartz S, Mumbach MR, Jovanovic M, Wang T, Maciag K, Bushkin GG, Mertins P, Ter-Ovanesyan D, Habib N, Cacchiarelli D, Sanjana NE, Freinkman E, Pacold ME, Satija R, Mikkelsen TS, Hacohen N, Zhang F, Carr SA, Lander ES, Regev A. Cell Rep. 2014;8:284–296. doi: 10.1016/j.celrep.2014.05.048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Fazakerley GV, Guy A, Téoule R, Quignard E, Guschlbauer W. Biochimie. 1985;67:819–822. doi: 10.1016/s0300-9084(85)80173-5. [DOI] [PubMed] [Google Scholar]
  • 30.Micura R, Pils W, Höbartner C, Grubmayr K, Ebert MO, Jaun B. Nucleic Acids Res. 2001;29:3997–4005. doi: 10.1093/nar/29.19.3997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Blum E, Carpousis AJ, Higgins CF. J. Biol. Chem. 1999;274:4009–4016. doi: 10.1074/jbc.274.7.4009. [DOI] [PubMed] [Google Scholar]
  • 32.Koval AP, Gogolevskaya IK, Tatosyan KA, Kramerov DA. PLoS One. 2012;7:e44157. doi: 10.1371/journal.pone.0044157. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Wan Y, Qu K, Ouyang Z, Kertesz M, Li J, Tibshirani R, Makino DL, Nutter RC, Segal E, Chang HY. Mol. Cell. 2012;48:169–181. doi: 10.1016/j.molcel.2012.08.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Spitale RC, Crisalli P, Flynn RF, Torre EA, Kool ET, Chang HY. Nat. Chem. Biol. 2012;9:18–20. doi: 10.1038/nchembio.1131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Meyer KD, Jaffrey SR. Nat. Rev. Mol. Cell Biol. 2014;15:313–326. doi: 10.1038/nrm3785. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Xu C, Wang X, Liu K, Roundtree IA, Tempel W, Li Y, Lu Z, He C, Min J. Nat. Chem. Biol. 2014;10:927–929. doi: 10.1038/nchembio.1654. [DOI] [PubMed] [Google Scholar]
  • 37.Liu J, Yue Y, Han D, Wang X, Fu Y, Zhang L, Jia G, Yu M, Lu Z, Deng X, Dai Q, Chen W, He C. Nat. Chem. Biol. 2014;10:93–95. doi: 10.1038/nchembio.1432. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

2

RESOURCES