Abstract
The family of radical SAM RNA-methylating enzymes comprises a large group of proteins that contains only a few functionally characterized members. Several enzymes in this family have been implicated in the regulation of translation and antibiotic susceptibility, emphasizing their significance in bacterial physiology and their relevance to human health. While few characterized enzymes have been shown to modify diverse RNA substrates, highlighting potentially broad substrate scope within the family, many enzymes in this class have no known substrates. The precise knowledge of RNA substrates and modification sites for uncharacterized family members is important for unraveling their biological function. Here, we describe a strategy for substrate identification that takes advantage of mechanism-based cross-linking between the enzyme and its RNA substrates, which we named individual-nucleotide-resolution cross-linking and immunoprecipitation combined with mutational profiling with sequencing (miCLIP-MaPseq). Identification of the position of the modification site is achieved using thermostable group II intron reverse transcriptase (TGIRT), which introduces a mismatch at the site of the cross-link.
Keywords: RNA methylation, Radical SAM, Substrate identification, Methyl adenosine, RlmN, Cfr, TGIRT
1. Introduction
There are more than 100 chemically distinct RNA modifications, out of which methylation is the most common. RNA methylation is ubiquitous across all domains of life; however, the exact location, biological function, and corresponding RNA-methylating enzymes are poorly understood. Recent strategies that combine immunoprecipitation of modified RNA or chemical treatment of RNA with next-generation sequencing have allowed mapping of the location and abundance of a subset of RNA modifications, such as N6-methyladenosine (m6A), 5-methylcytosine (m5C), and pseudouridine (Ψ) [1–8]. These approaches take advantage of either the unique chemical reactivity of the methyl group (e.g., detection of m5C via RNA bisulfite sequencing) [3] or the availability of modification-specific antibodies (e.g., transcriptome-wide identification of m6A and Ψ) [1, 4, 9]. Additionally, strategies based on UV cross-linking and immunoprecipitation, known as CLIP-based methods, have been used to identify enzyme-substrate pairs for RNA-modifying enzymes [10–14]. These methods employ UV irradiation to generate covalent protein-RNA adducts that can be subsequently isolated and enriched to identify RNA-interacting partners. Despite many advances that resulted from CLIP-based methods, the main disadvantage of UV cross-linking is its low cross-linking efficiency. An alternative approach developed for substrate identification for some RNA methyltransferases exploits the formation of a covalent catalytic intermediate between the enzyme and its RNA substrate [3, 15]. While inherently limited to enzyme families that form a covalent intermediate in their catalytic mechanisms, this approach allows for highly efficient cross-linking of enzyme-substrate pairs. This strategy has earlier been applied to NSun RNA methyltransferase family members, which methylate cytosines in RNA to yield m5C. The covalent enzyme-substrate intermediate is trapped either by using a 5-azacitidine (Aza) analog, as in Aza-IP [3], or by mutation of the key cysteine residue in these enzymes that is necessary for the resolution of the covalent intermediate, as in methylation-iCLIP (miCLIP) approach [15].
Radical SAM RNA-methylating enzymes employ a radical-based mechanism to generate 2-methyladenosine (m2A, RlmN enzymes) and 8-methyladenosine (m8A, Cfr enzymes). Mechanistic studies by our group [16] and others [17–20] have revealed that substrate methylation by radical SAM RNA-methylating enzymes proceeds through an enzyme-substrate covalent intermediate distinct from those formed by RNA m5C methyltransferases [21]. The hallmark of the methylation is formation of a methylene-bridged covalent intermediate between a Cys residue in the enzyme (C355 in E. coli RlmN) and amidine carbon of the adenosine substrate (Fig. 1) [16–20, 22, 23]. Subsequently, the enzyme-RNAcovalent adduct is resolved by a second conserved cysteine (C118 in E. coli RlmN) [16]. Mutation of C118 (C118A) stabilizes the protein-RNA intermediate, enabling isolation of the enzyme-RNA covalent pairs by immunoprecipitation (Fig. 1) [16]. By combining this key mechanistic feature with next-generation sequencing, we have developed a novel strategy where individual-nucleotide-resolution cross-linking and immunoprecipitation are combined with mutational profiling with sequencing (miCLIP-MaPseq) [24]. This method can allow for the identification of substrates and modification sites for any member of the radical SAM RNA-methylating enzyme family. The method was developed and validated using the most well-characterized member of the family, RlmN from E. coli that is known to modify 23S rRNA, as well as a subset of tRNA substrates [25, 26].
Fig. 1.

Mechanistic scheme for RlmN-mediated methylation of RNA showing key steps. The stable covalent intermediate trapped by C118A mutation is shown
miCLIP-MaPseq relies on immunoprecipitation of a stable covalent complex between the mutant enzyme and RNA, followed by high-throughput RNA sequencing (Fig. 2). Following isolation, the protein-RNA species are digested using Proteinase K, which leaves a peptide scar on the RNA at the site of the protein-RNA cross-link formation. RNA is then size-selected on a denaturing TBE-urea gel (Fig. 3). RNA species larger than 300 nucleotides are fragmented prior to dephosphorylation [27], while smaller RNA fragments are dephosphorylated without prior fragmentation. After size selection, RNAs are converted to cDNA using the TGIRT reverse transcriptase, and the resulting library is subjected to PCR amplification and high-throughput sequencing. One main advantage of miCLIP-MaPseq is that it uses TGIRT to generate cDNA. This reverse transcriptase is highly processive and introduces a mismatch when it encounters the protein scar on RNA and thus allows identification of methylation sites using mutational profiling. Here, we provide a detailed miCLIP-MaPseq protocol that can be easily modified and implemented to identify substrates of any member of the radical SAM RNA-methylating family.
Fig. 2.

Schematic representation of library preparation strategy for identification of substrates and methylation sites of RlmN. Red bars represent the fraction of mismatches at a specific nucleotide on substrate RNA
Fig. 3.

Gel analysis of isolated RNA after immunoprecipitation and Proteinase K treatment of FLAG-tagged C118A RlmN. RNA was size selected into four regions (A-D) as indicated on the gel, and each region was individually sequenced. Lanes 1–3: Isolated RNA after immunoprecipitation and Proteinase K treatment of FLAG-tagged C118A RlmN, where the amount of sample loaded in lane 1 is half of the amount loaded in each of the lanes 2 and 3; lane 4: low-range single-stranded RNA markers
2. Materials
2.1. Cell Lysis and Target Protein Immunoprecipitation
Lysis buffer: 50 mM Tris–HCl pH 7.5, 150 mM NaCl, 1 mM EDTA, 1% (v/v) Triton X-100.
100 mM PMSF.
TBS buffer: 50 mM Tris–HCl pH 7.5, 150 mM NaCl.
Glycine buffer: 100 mM glycine–HCl pH 3.5.
Stringent-TBS wash: 50 mM Tris–HCl pH 7.5, 500 mM NaCl.
Resin recycle solution: 50% Glycerol, 50% TBS, 0.02% sodium azide.
10 mM Tris pH 7.5.
RQ1 RNase-free DNase.
Anti-FLAG M2 resin.
IP dilution solution: 12 μL of 5 mg/mL 1× FLAG peptide and 388 μL of 10 mM Tris pH 7.5.
LB medium.
1.5 mL Eppendorf tubes.
E. coli strain encoding a FLAG-tagged wild-type RNA-modifying enzyme of interest and the corresponding FLAG-tagged mutant RNA-modifying enzyme. For the example described herein, use E. coli BW25113 strain encoding FLAG-tagged wild-type RlmN and E. coli BW25113 FLAG-tagged C118A RlmN.
2.2. RNA Isolation
Proteinase K.
GlycoBlue.
3 M Sodium acetate pH 5.5.
Isopropanol.
80% Ethanol.
Novex 10% TBE-urea precast gel.
1× TBE running buffer: Dilute 10× TBE running buffer to 1× using DEPC-treated water.
2× RNA loading dye.
Low-range ssRNA ladder.
SYBR Gold Nucleic Acid Gel Stain.
18G × 1 ½ syringe needle.
Costar SpinX column.
Nuclease-free water.
RNase-free non-stick 0.5 mL tubes and RNase-free 1.5 mL tubes.
100% Ethanol.
2.3. RNA Fragmentation
10× Fragmentation Reagent mix.
Nuclease-free water.
RNase-free PCR tubes.
2.4. Library Preparation
T4 polynucleotide kinase.
SUPERase-In.
TGIRT-III template-switching kit.
10× PNK buffer: 70 mM Tris–HCl pH 7.5, 10 mM MgCl2, 5 mM DTT.
5× TGIRT reaction buffer: 100 mM Tris–HCl pH 7.5, 2.25 M NaCl, 25 mM MgCl2.
Novex 8% TBE precast gel.
5× GelPilot DNA Loading Dye.
5′ DNA Adenylation Kit.
Zymo Oligo Clean & Concentrator kit.
Thermostable 5′ AppDNA/RNA Ligase.
10× NEBuffer 1.
50 mM MnCl2.
MiniElute PCR Purification Kit.
Phusion High-Fidelity DNA Polymerase.
Deoxynucleotide (dNTP) Solution Mix.
10 mM Tris, pH 8.
- Oligos used for library preparation:
- R2 RNA (provided in a kit by InGex; 3SpC3 is a C3 Spacer phosphoramidite): 5′-rArGrA rUrCrG rGrArA rGrArG rCrArC rArCrG rUrCrU rGrArArCrUrCrCrArG rUrCrA rC/3SpC3/-3′.
- R2R DNA (provided in a kit by InGex): 5′-GTG ACT GGA GTT CAG ACG TGT GCT CTT CCG ATC TN (N = equimolar A, T, G, C).
- R1R DNA (IDT; 3SpC3 is a C3 Spacer phosphoramidite): 5′-/5Phos/GAT CGT CGG ACT GTA GAA CTC TGA ACG TGT AG/3SpC3/-3′.
- Illumina multiplex primer (IDT): 5′-AAT GAT ACG GCG ACC ACC GAG ATC TAC ACG TTC AGA GTT CTA CAG TCC GAC GAT C-3′.
- Illumina barcode primer (IDT): 5′-CAA GCA GAA GAC GGC ATA CGA GAT [barcode] GTG ACT GGA GTT CAG ACG TGT GCT CTT CCG ATC T-3′.
2.5. Library Quantification
KAPA library quantification kit.
96-Well PCR plate.
BioRad CFX Connect real-time PCR
Agilent 2100 Bioanalyzer.
2.6. Sequencing
Illumina HiSeq4000.
3. Methods
3.1. Expression of the FLAG-Tagged Enzyme
A C-terminal DYKDDDDK octapeptide (FLAG) sequence is fused to the genomic version of rlmN or rlmN containing C118A mutation as described previously [16].
Inoculate 1 L of LB medium with 1:100 from an overnight culture of E. coli BW25113 encoding either the FLAG-tagged WT RlmN or the FLAG-tagged C118A RlmN.
Grow the cells at 37 °C, 200 rpm, for 90 min for WT RlmN, or 150 min for C118A RlmN (see Note 1).
Harvest the cells by centrifugation at 1610 × g (rotor F9S 4 × 1000y) for 15 min at 4 °C. Flash freeze the pellets in liquid nitrogen, and either store them at −80 °C or immediately proceed with the next step.
3.2. Lysis and DNase Treatment
Thaw ~1.5 g of cells and resuspend them in 4 mL of cold lysis buffer. Add 40 μL of 100 mM PMSF.
While keeping the cells on ice, sonicate the cells using the microtip on a power setting 3 and duty cycle 50%, for three 40-s pulses with 1-min breaks between pulses. The probe should be positioned approximately 0.5 cm from the bottom of the tube and should not be touching the tube sides in order to avoid foaming.
Divide 4 mL sample among five 1.5 mL Eppendorf tubes. Add 18 μL RQ1 RNase-free DNase to each tube and incubate at 37 °C for 15 min. Spin down at 19,722 × g (Sorvall Legend Micro 21R) for 10 min at 4 °C. Without disturbing the pellet remove the supernatant to a new 1.5 mL tube. If not immediately proceeding with the next step, store the samples at −20 °C.
3.3. Immunoprecipitation
3.3.1. Resin Preparation
Thoroughly suspend the anti-FLAG M2 affinity resin and immediately transfer 375 μL to a new Eppendorf tube. Centrifuge the resin at 8000 × g, for 1 min at 4 °C. Let the resin settle for 1 min. Remove the supernatant making sure not to transfer any resin.
Wash the resin twice with 1 mL of TBS buffer. For each wash, add 1 mL of TBS buffer, resuspend the resin by gently pipetting, centrifuge the resin at 8000 × g for 1 min at 4 °C, and then let it settle for 1 min.
Wash the resin once with 1 mL glycine buffer (see Note 2).
Wash the resin three times with TBS buffer. For each wash, the process is the same as in Step 2.
3.3.2. Binding of FLAG-Tagged Protein
Add approximately 75 μL of resuspended resin into each tube containing ~800 μL sample. Let the samples incubate with the resin on a rotator for at least 2 h at 4 °C.
3.3.3. Elution of FLAG-Tagged Protein
Centrifuge the samples at 8000 × g, for 1 min at 4 °C. Let the resin settle for 1 min. Remove the supernatant.
Wash the resin three times with 500 μL of stringent TBS wash buffer.
Add 75 μL of IP dilution solution into each sample. Gently resuspend and rotate samples for at least 1 h at 4 °C (see Note 3).
Centrifuge the samples at 8000 × g, for 1 min at 4 °C. Let the resin settle for 2 min.
Carefully transfer the supernatant to a new Eppendorf tube. Put both resin and supernatant on ice. Store resin without glycerol at 4 °C. Store supernatant at −20 °C.
3.3.4. Recycling of the Resin
Resuspend used resin in 500 μL of TBS buffer. Centrifuge the resin at 8000 × g, for 1 min at 4 °C. Remove the supernatant.
Wash the resin three times with 1 mL of glycine buffer. Centrifuge the resin at 8000 × g, for 1 min at 4 °C. Let the resin settle for 1 min.
Wash the resin five times with 1 mL of resin recycle solution. Centrifuge the resin at 8000 × g, for 1 min at 4 °C. Store the resin at −20 °C.
3.4. Proteinase K Treatment
Thaw the sample obtained after immunoprecipitation on ice. Remove 10 μL for subsequent Western blot analysis. Add 9.6 U of Proteinase K into the rest of the sample and incubate the reaction for 2 h at 37 °C.
Divide each sample into two Eppendorf tubes (~230–250 μL). Precipitate RNA by adding 1000 μL of isopropanol, 25 μL of sodium acetate pH 5.5, and 2 μL of GlycoBlue co-precipitant into each tube. Leave the tubes overnight at −20 °C.
The next day precipitate RNA by centrifugation at 20,000 × g for 30–40 min at 4 °C.
Carefully remove the supernatant and wash the pellet with 750 μL of 80% cold ethanol. Precipitate RNA by centrifugation at 20,000 × g for 40 min at 4 °C. Carefully remove the supernatant and air-dry the pellets (see Note 4).
Resuspend pellets with 40 μL nuclease-free water (see Note 5).
3.5. Gel Purification and RNA Extraction
Set up the gel apparatus and pre-run the 10% TBE-urea gel at 180 V for at least 20 min in 1× TBE running buffer.
Add 25 μL of 2× RNA loading dye to 25 μL of sample. Prepare molecular weight marker solution by mixing 1 μL of ssRNA low-range ladder to 9 μL of nuclease-free water, followed by 10 μL of 2× RNA loading dye. Heat the samples at 92 °C for 4 min. In two wells load 20 μL of sample and in one well 10 μL. Load the marker in the last lane (see Note 6).
Run the gel at 180 V until the lower (dark blue) dye is close to the bottom, approximately 70 min. Incubate the gel with 50 mL of TBE buffer containing 5 μL SYBR-Gold dye for 5 min. Wash the gel twice with nuclease-free water.
Visualize and record the gel under UV light.
Prepare 0.5 mL RNase-free non-stick tubes by piercing a hole in the bottom using an 18G syringe needle. Cut the gel on transilluminator as indicated in Fig. 3. Put gel pieces into 0.5 mL tubes and then place the tubes into a 1.5 mL collection tube. Centrifuge at 20,000 × g, for 3 min at 4 °C. Remove 0.5 mL tube and add 300 μL nuclease-free water.
Shake samples on the thermomixer at 157 × g for 10 min at 68 °C, and then freeze them on dry ice for 10 min. Thaw samples at room temperature for 10 min and then incubate on thermomixer at 157 × g for 10 min at 68 °C.
Cut the tips of the P1000 barrier tips, transfer the sample (with gel pieces) onto a Costar SpinX column, and spin at 20,000 × g for 3 min at 4 °C.
Add to each tube 2 μL of GlycoBlue, 33 μL of 3 M sodium acetate pH 5.5, and 900 μL of 100% ethanol. Vortex to mix. Put at −20 °C overnight. Next morning precipitate RNA as described in Subheading 3.4.
3.6. RNA Fragmentation
Fragment RNAs longer than 300 nt to 50–200 nt long fragments using the following protocol:
Resuspend RNA obtained in Subheading 3.5 in 11 μL of nuclease-free water. Transfer to PCR tubes and place in the thermocycler.
Heat the samples for 2 min at 95 °C to denature RNA.
Add 1 μL of 10× Fragmentation Reagent mix. If you are dealing with larger number of tubes, keep the tubes on ice. This will ensure that most of the RNA stays denatured. Keeping samples at the room temperature will allow for the slow refolding of RNA.
Return the samples to thermomixer and incubate them for 2 min at 95 °C. Add 1 μL of stop solution and place the samples on ice.
Purify the RNA on 10% TBE-urea gel. Extract 50–200 nt long fragments and precipitate RNA as described in Subheading 3.5 (see Note 7).
3.7. Library Preparation
3.7.1. RNA 3′ End Dephosphorylation
Determine RNA concentration by NanoDrop. Low RNA concentration should be measured by Qubit or Bioanalyzer.
Prepare 11 μL reaction mixture by mixing 7 μL of RNA sample with 1.1 μL of 10× PNK buffer, 1 μL of Superase-In, and 2 μL of T4 polynucleotide kinase. Incubate reaction mixture at 37 °C for 1 h, followed by 3 min at 90 °C to inactivate the enzyme.
3.7.2. Template-Switching Reaction Using TGIRT-III
In a PCR tube prepare 17.5 μL reaction mixture by mixing nuclease-free water with 4 μL of 5× TGIRT reaction buffer, 2 μL of 50 mM DTT, ~50 ng of dephosphorylated RNA sample, and 2 μL 10× TGIRT-III RT/template-primer substrate mix (see Note 8).
Preincubate reaction mixture at room temperature for 30 min, and then add 2.5 μL of 10 mM dNTPs.
Incubate the reaction at 60 °C for 60 min (see Note 9).
Add 1 μL of 5 M NaOH and incubate the sample at 65 °C for 15 min.
Cool to room temperature and neutralize sample with 1 μL of 5 M HCl.
To each sample add 100 μL of 10 mM Tris pH 8.0, 13 μL of 3 M sodium acetate pH 5.5, 3 μL of GlycoBlue, and 600 μL of 100% ethanol. Incubate overnight at −20 °C. Next morning precipitate cDNA as described in Subheading 3.4.
Pre-run 8% TBE gel at 155 V for at least 20 min.
Resuspend precipitated cDNA in 5 μL of 10 mM Tris pH 8.0 and add 1.25 μL of 5× DNA loading dye. Run gel at 155 V for 40–45 min.
Size-select cDNA using 10 bp ladder as a guide. Add 300 μL of nuclease-free water to cut gel pieces, and extract cDNA from gel pieces by following the general protocol presented in Subheading 3.5. Precipitate cDNA overnight, at −20 °C, by adding 3 μL of GlycoBlue, 33 μL of sodium acetate pH 5.5, and 900 μL of 100% ethanol. Next morning precipitate cDNA as described in Subheading 3.4 (see Note 10).
3.7.3. Oligo Adenylation of Illumina Read 1 Sequencing Primer (R1R DNA)
In a PCR tube prepare 20 μL reaction mixture by combining 2 μL of 10× 5′ DNA Adenylation Reaction Buffer, 2 μL of 1 mM ATP, 1 μL 100 μM R1R DNA, and 2 μL of Mth RNA Ligase. We usually set up 4–8 parallel reactions.
Incubate reactions in a thermocycler at 65 °C for 1 h.
Incubate samples at 85 °C for 5 min to inactivate the enzyme.
Clean up the adenylated R1R DNA with an Oligo Clean & Concentrator kit and elute in 10 μL of nuclease-free water to obtain 10 μM adenylated R1R DNA. If doing several adenylation reactions in separate PCR tubes, combine them for a cleanup since higher elution volume helps with consistent and efficient recovery of adenylated oligos.
Check the extent of adenylation by running the sample on a 20% TBE-7 M-urea gel.
3.7.4. Thermostable Ligation
In a PCR tube prepare 20 μL reaction mixture by combining 2 μL of 10× NEBuffer 1, 2 μL of 50 mM MnCl2, 4 μL of 10 μM adenylated R1R DNA, 10 μL of cDNA from template-switching, and 2 μL of Thermostable 5′ AppDNA/RNA Ligase.
Incubate reactions in thermocycler at 65 °C for 2 h.
Incubate samples at 90 °C for 3 min to inactivate the enzyme.
Clean up the ligated cDNA with a MiniElute PCR Purification Kit and elute in 23 μL of nuclease-free water.
3.7.5. PCR Amplification
In an Eppendorf tube prepare a 53 μL reaction mixture by combining 29.5 μL of nuclease-free water, 10 μL of 5× Phusion HF buffer, 1 μL of 10 μM Illumina multiplex primer, 1 μL of 10 μM Illumina barcode primer, 10 μL of cDNA, 1 μL of 10 mM dNTPs, and 0.5 μL of Phusion High-Fidelity DNA Polymerase.
Divide reaction mixture among three PCR tubes. Heat cDNA at 98 °C for 5 s, then amplify it for 15, 18, or 21 cycles of 98 °C for 5 s, 60 °C for 10 s, and 72 °C for 12 s.
Mix 17 μL of PCR product with 4.25 μL of 5× DNA loading dye, load on an 8% TBE gel, and run the gel at 155 V for 45 min. Stain the gel with SYBR gold.
Size-select amplified DNA using 10 bp ladder as a guide. Add 300 μL of nuclease-free water to cut gel pieces, and extract DNA from gel pieces by following the general protocol presented in Subheading 3.5. Precipitate DNA overnight, at −20 °C, by adding 3 μL of GlycoBlue, 33 μL of sodium acetate pH 5.5, and 900 μL of 100% ethanol. Next morning precipitate as described in Subheading 3.4. Resuspend each library in 10 μL of 10 mM Tris pH 8.
3.7.6. qPCR Quantification
For the quantification of the libraries we use KAPA library quantification kit. If the kit is being used for the first time, add 1 mL of 10× Primer Premix to the bottle of 2× KAPA SYBRFAST qPCR Master Mix (5 mL) and mix by vortexing. Aliquot this solution and store at −20 °C.
Determine the total number of reactions that will be performed. Usually we run six DNA standards in triplicate and each library dilution in duplicate. Using a NanoDrop, estimate the concentration of each library and determine which dilutions to prepare to stay within the dynamic range of the assay. A 1:5000 and 1:10,000 dilution usually fall around the midpoint of the assay standards.
Prepare 1:5, 1:50, 1:5000, and 1:10,000 library dilutions in 10 mM Tris pH 8 buffer.
For each reaction, prepare the following in a 96-well PCR plate: 6 μL of Master Mix containing primers, and 4 μL of either DNA standard or specific library dilution. Seal the plate with optical adhesive film.
Run the plate with the following program in the qPCR machine: 95 °C for 5 min, and 35 cycles of 95 °C for 30 s, and 60 °C for 45 s.
Use the KAPA analysis template to calculate slope and intercept of the standard curve, to convert the average Cq score for each library dilution to pM, to calculate the average size-adjusted concentration (in pM) for each dilution, and to calculate the size-adjusted concentration for the original undiluted library.
Prepare 15 μL of 10 nM library solution, containing up to 20 libraries. Store individual libraries in RNase-free low-retention Eppendorf tubes at −20 °C (see Note 11).
Check the quality of the library on Bioanalyzer prior to submitting sample for sequencing on an Illumina HiSeq4000 or similar (see Note 12).
3.8. Sequencing Read Mapping and Analysis
3.8.1. Sequence Processing and Alignment
Prior to bioinformatic analysis it is important to de-multiplex sequences if multiple samples were run within one sequencing lane. De-multiplexing for our samples was performed by the Center for Advanced Technology at UCSF.
Upload sequencing data to the Galaxy web platform and use the public server at usegalaxy.org to analyze the data [28 ] (see Note 13).
Process reads with FASTQ Groomer [29] and then remove adapters using Clip tool also available through Galaxy web platform.
Go to Ensembl Bacteria Genome Database (EMBL-EBI), https://bacteria.ensembl.org/index.html, and download E. coli BW25113 FASTA file and gtf file.
Align sequences greater than 15 bp to the genome using Bowtie 2 with default options [30, 31]. Default settings for “sensitive-local” are the default option in “local-mode” (details are -D 15 -R 2 -N 0 -L 20 -i S,1,0.75) (see Note 14).
3.8.2. Enrichment Analysis of Reads
Determine the raw counts per gene by using HTSeq-count script, which is available through Galaxy web platform [32]. Select intersection-nonempty mode to handle reads overlapping more than one feature. Summarize counts from regions A-D per replicate. Use this file to perform the enrichment analysis.
For enrichment analysis of reads mapped to any set of genes use DESeq2 module. In DESeq2 specify the factor levels that will be analyzed (e.g., sample vs. control), and select all replicates belonging to a specific factor level. As an input data use summarized HTSeq-count data as described above. Use parametric fitting and leave on the following options: outliers replacement, outliers filtering, and independent filtering. For the control samples, we generated a library from the rRNA-depleted total RNA isolated from E. coli BW25113 strain (see Note 15).
3.8.3. Analysis of Stop Sites and Mismatches
Download Integrated Genomic Viewer (IGV) [33]. Open E. coli BW25113 genome file. Open all BAM files and their corresponding BAM_index files.
Select the gene of interest and determine the percent of mismatches for a specific nucleotide by cumulative analysis of all biological replicates (Fig. 4).
To determine the 5′ end of the reads (stop sites) use script “make_wiggle” to convert sorted and indexed BAM files to wiggle files. This script was developed by the Weissman lab at UCSF and is readily available through Plastid [36]. The results can be readily visualized with IGV.
Fig. 4.

Examples of read profiles for specific RNAs displayed in Integrative Genomic Viewer. (a) Read profiles for tRNAGlnUUG displayed in Integrative Genomic Viewer (IGV) [33–35]. (b) Read profiles for tRNAHisGUG displayed in IGV. tRNAs were isolated after immunoprecipitation of FLAG-tagged C118A RlmN. The depth of the reads (counts) displayed at a specific locus is represented as a gray bar chart (top panel). Alignment of individual reads is represented in the bottom panel. Known modifications are represented using abbreviations and were taken from the MODOMICS database [35]. Abbreviations: 4-thiouridine (s4U), dihydrouridine (D), queuosine (Q), 7-methylguanosine (m7G), 2′-O-methylguanosine (Gm), 2′-O-methyluridine (Um), 5-carboxymethylamino-methyl-2-selenouridine (cmnm5se2U), 5-methyluridine (m5U), 2-methyladenosine (m2A), and pseudouridine (Ψ)
Acknowledgments
This work was supported by UCSF Program for Breakthrough Biomedical Research (PBBR) Postdoctoral Grant (to V.S.), NIAID R01AI137270 (to D.G.F.), UCSF Program for Breakthrough Biomedical Research funded in part by the Sandler Foundation (to D.E.W.), and NIH Director’s Early Independence Award DP5OD017895 (to D.E.W.).
4 Notes
Optimal expression time for an enzyme should be determined empirically prior to proceeding to the next step.
Resin cannot stay in glycine buffer for longer than 20 min.
When dealing with a very small sample volume, combine all the samples from a single experiment into a single tube prior to leaving the sample on a rotator.
After centrifugation remove the ethanol using P1000 pipette. Recap the tube, and pulse centrifuge to bring down the remaining ethanol. Remove the remaining liquid using P200 pipette. Leave the tube open at room temperature before proceeding to another sample. By the time all the samples are finished, the pellets should be sufficiently dry. Do not overdry the pellet since it can be hard to re-solubilize RNA.
Perform Western blot analysis to ensure that the enzyme was successfully digested. We use monoclonal anti-FLAG M2-peroxidase (HRP) antibody.
Prior to loading the samples flush the remaining urea out of the wells using P1000 pipette. This will decrease smearing and abnormal band shapes. Additionally, it is advisable to load a smaller amount of sample in one of the lanes to better see discrete bands, since loading a large amount of sample can lead to increased smearing in the gel.
Under these conditions, the extent of fragmentation will depend on the initial amount of RNA. If substantial amount of RNA is not successfully fragmented, extract the RNA longer than 300 nt, and repeat the fragmentation step. Make sure to decrease the time for the re-fragmentation step (e.g., from 2 min to 1 min, or less).
Add RNA sample and enzyme/template-primer mix last.
For long or heavily modified RNAs, such as tRNAs, it is necessary to run the reaction for 60 min. For short RNAs 5–15 min is usually sufficient, but the exact time should be determined empirically.
In case no pellet is observed after the first centrifugation step, add 1 μL of GlycoBlue, place sample on dry ice for at least 30 min, thaw sample at room temperature, and then repeat centrifugation step. Only then remove the supernatant and perform the wash step with cold 80% ethanol.
When submitting multiple samples within one sequencing lane, approximately equal amounts of each library should be added.
For our application and for cost-effectiveness, 50-nucleotide single-end runs are sufficient.
When uploading large sequencing files, FileZilla, an open-source software, can be used.
If applying this method to a different system, we suggest aligning sequences to the genome of interest using both Bowtie 1 and Bowtie 2 under various settings and then comparing results.
DESeq2 considers the variability between the replicates and normalizes read counts to account for differences in sequencing depth between samples, reporting fold change values between the sample and the control. In our analysis, we use a fourfold increase in abundance and adjusted P value of <0.01 as our threshold for identifying substrates in samples where TGIRT was used as reverse transcriptase. DESeq2-adjusted P-values are adjusted for multiple-comparison testing and are used to lower the false-positive detection.
References
- 1.Dominissini D, Moshitch-Moshkovitz S, Schwartz S, Salmon-Divon M, Ungar L, Osenberg S, Cesarkas K, Jacob-Hirsch J, Amariglio N, Kupiec M, Sorek R, Rechavi G (2012) Topology of the human and mouse m6A RNA methylomes revealed by m6A-seq. Nature 485(7397):201–206. 10.1038/nature11112 [DOI] [PubMed] [Google Scholar]
- 2.Meyer KD, Saletore Y, Zumbo P, Elemento O, Mason CE, Jaffrey SR (2012) Comprehensive analysis of mRNA methylation reveals enrichment in 3’ UTRs and near stop codons. Cell 149(7):1635–1646. 10.1016/j.cell.2012.05.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Khoddami V, Cairns BR (2013) Identification of direct targets and modified bases of RNA cytosine methyltransferases. Nat Biotechnol 31(5):458–464. 10.1038/nbt.2566 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Carlile TM, Rojas-Duran MF, Zinshteyn B, Shin H, Bartoli KM, Gilbert WV (2014) Pseudouridine profiling reveals regulated mRNA pseudouridylation in yeast and human cells. Nature 515(7525):143–146. 10.1038/nature13802 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Schwartz S, Agarwala SD, Mumbach MR, Jovanovic M, Mertins P, Shishkin A, Tabach Y, Mikkelsen TS, Satija R, Ruvkun G, Carr SA, Lander ES, Fink GR, Regev A (2013) High-resolution mapping reveals a conserved, widespread, dynamic mRNA methylation program in yeast meiosis. Cell 155(6):1409–1421. 10.1016/j.cell.2013.10.047 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Delatte B, Wang F, Ngoc LV, Collignon E, Bonvin E, Deplus R, Calonne E, Hassabi B, Putmans P, Awe S, Wetzel C, Kreher J, Soin R, Creppe C, Limbach PA, Gueydan C, Kruys V, Brehm A, Minakhina S, Defrance M, Steward R, Fuks F (2016) RNA biochemistry. Transcriptome-wide distribution and function of RNA hydroxymethylcytosine. Science 351 (6270):282–285. 10.1126/science.aac5253 [DOI] [PubMed] [Google Scholar]
- 7.Li X, Zhu P, Ma S, Song J, Bai J, Sun F, Yi C (2015) Chemical pulldown reveals dynamic pseudouridylation of the mammalian transcriptome. Nat Chem Biol 11(8):592–597. 10.1038/nchembio.1836 [DOI] [PubMed] [Google Scholar]
- 8.Lovejoy AF, Riordan DP, Brown PO (2014) Transcriptome-wide mapping of pseudouridines: pseudouridine synthases modify specific mRNAs in S. cerevisiae. PLoS One 9(10):e110799. 10.1371/journal.pone.0110799 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Linder B, Grozhik AV, Olarerin-George AO, Meydan C, Mason CE, Jaffrey SR (2015) Single-nucleotide-resolution mapping of m6A and m6Am throughout the transcriptome. Nat Methods 12(8):767–772. 10.1038/nmeth.3453 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Ule J, Jensen KB, Ruggiu M, Mele A, Ule A, Darnell RB (2003) CLIP identifies Nova-regulated RNA networks in the brain. Science 302(5648):1212–1215. 10.1126/science.1090095 [DOI] [PubMed] [Google Scholar]
- 11.Hafner M, Lianoglou S, Tuschl T, Betel D (2012) Genome-wide identification of miRNA targets by PAR-CLIP. Methods 58(2):94–105. 10.1016/j.ymeth.2012.08.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Konig J, Zarnack K, Rot G, Curk T, Kayikci M, Zupan B, Turner DJ, Luscombe NM, Ule J (2010) iCLIP reveals the function of hnRNP particles in splicing at individual nucleotide resolution. Nat Struct Mol Biol 17(7):909–915. 10.1038/nsmb.1838 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Haag S, Kretschmer J, Sloan KE, Bohnsack MT (2017) Crosslinking methods to identify RNA methyltransferase targets in vivo. Methods Mol Biol 1562:269–281. 10.1007/978-1-4939-6807-7_18 [DOI] [PubMed] [Google Scholar]
- 14.Zhang CL, Darnell RB (2011) Mapping in vivo protein-RNA interactions at single-nucleotide resolution from HITS-CLIP data. Nat Biotechnol 29(7):607–U686. 10.1038/nbt.1873 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Hussain S, Sajini AA, Blanco S, Dietmann S, Lombard P, Sugimoto Y, Paramor M, Gleeson JG, Odom DT, Ule J, Frye M (2013) NSun2-mediated cytosine-5 methylation of vault noncoding RNA determines its processing into regulatory small RNAs. Cell Rep 4(2):255–261. 10.1016/j.celrep.2013.06.029 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.McCusker KP, Medzihradszky KF, Shiver AL, Nichols RJ, Yan F, Maltby DA, Gross CA, Fujimori DG (2012) Covalent intermediate in the catalytic mechanism of the radical S-adenosyl-L-methionine methyl synthase RlmN trapped by mutagenesis. J Am Chem Soc 134(43):18074–18081. 10.1021/ja307855d [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Grove TL, Benner JS, Radle MI, Ahlum JH, Landgraf BJ, Krebs C, Booker SJ (2011) A radically different mechanism for S-adenosylmethionine-dependent methyltransferases. Science 332(6029):604–607. 10.1126/science.1200877 [DOI] [PubMed] [Google Scholar]
- 18.Grove TL, Livada J, Schwalm EL, Green MT, Booker SJ, Silakov A (2013) A substrate radical intermediate in catalysis by the antibiotic resistance protein Cfr. Nat Chem Biol 9(7):422–427. 10.1038/nchembio.1251 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Silakov A, Grove TL, Radle MI, Bauerle MR, Green MT, Rosenzweig AC, Boal AK, Booker SJ (2014) Characterization of a cross-linked protein-nucleic acid substrate radical in the reaction catalyzed by RlmN. J Am Chem Soc 136(23):8221–8228. 10.1021/ja410560p [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Boal AK, Grove TL, McLaughlin MI, Yennawar NH, Booker SJ, Rosenzweig AC (2011) Structural basis for methyl transfer by a radical SAM enzyme. Science 332(6033):1089–1092. 10.1126/science.1205358 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.King MY, Redman KL (2002) RNA methyltransferases utilize two cysteine residues in the formation of 5-methylcytosine. Biochemistry 41(37):11218–11225 [DOI] [PubMed] [Google Scholar]
- 22.Yan F, LaMarre JM, Rohrich R, Wiesner J, Jomaa H, Mankin AS, Fujimori DG (2010) RlmN and Cfr are radical SAM enzymes involved in methylation of ribosomal RNA. J Am Chem Soc 132(11):3953–3964. 10.1021/ja910850y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Yan F, Fujimori DG (2011) RNA methylation by radical SAM enzymes RlmN and Cfr proceeds via methylene transfer and hydride shift. Proc Natl Acad Sci U S A 108(10):3930–3934. 10.1073/pnas.1017781108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Stojkovic V, Chu T, Therizols G, Weinberg DE, Fujimori DG (2018) miCLIP-MaPseq, a substrate identification approach for radical SAM RNA methylating enzymes. J Am Chem Soc 140(23):7135–7143. 10.1021/jacs.8b02618 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Benitez-Paez A, Villarroya M, Armengod ME (2012) The Escherichia coli RlmN methyltransferase is a dual-specificity enzyme that modifies both rRNA and tRNA and controls translational accuracy. RNA 18(10):1783–1795. 10.1261/rna.033266.112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Fitzsimmons CM, Fujimori DG (2016) Determinants of tRNA recognition by the radical SAM enzyme RlmN. PLoS One 11(11):e0167298. 10.1371/journal.pone.0167298 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Dominissini D, Moshitch-Moshkovitz S, Salmon-Divon M, Amariglio N, Rechavi G (2013) Transcriptome-wide mapping of N(6)-methyladenosine by m(6)A-seq based on immunocapturing and massively parallel sequencing. Nat Protoc 8(1):176–189. 10.1038/nprot.2012.148 [DOI] [PubMed] [Google Scholar]
- 28.Afgan E, Baker D, van den Beek M, Blankenberg D, Bouvier D, Cech M, Chilton J, Clements D, Coraor N, Eberhard C, Gruning B, Guerler A, Hillman-Jackson J, Von Kuster G, Rasche E, Soranzo N, Turaga N, Taylor J, Nekrutenko A, Goecks J (2016) The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update. Nucleic Acids Res 44(W1):W3–W10. 10.1093/nar/gkw343 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Blankenberg D, Gordon A, Von Kuster G, Coraor N, Taylor J, Nekrutenko A, Galaxy T (2010) Manipulation of FASTQ data with Galaxy. Bioinformatics 26(14):1783–1785. 10.1093/bioinformatics/btq281 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Langmead B, Salzberg SL (2012) Fast gapped-read alignment with bowtie 2. Nat Methods 9(4):357–359. 10.1038/nmeth.1923 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Langmead B, Trapnell C, Pop M, Salzberg SL (2009) Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10(3):R25. 10.1186/gb-2009-10-3-r25 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Anders S, Pyl PT, Huber W (2014) HTSeq--a Python framework to work with high-throughput sequencing data. Bioinformatics 31(2):166–169 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Thorvaldsdottir H, Robinson JT, Mesirov JP (2013) Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinform 14(2):178–192. 10.1093/bib/bbs017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, Mesirov JP (2011) Integrative genomic viewer. Nat Biotechnol 29:24–26 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Boccaletto P, Machnicka MA, Purta E, Piątkowski P, Bagiński B, Wirecki TK, de Crécy-Lagard V, Ross R, Limbach PA, Kotter A, Helm M (2017) MODOMICS: a database of RNA modification pathways. 2017 update. Nucleic Acids Res 46(D1):D303–D307 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Dunn JG, Weissman JS (2016) Plastid: nucleotide-resolution analysis of next-generation sequencing and genomics data. BMC Genomics 17(1):958. 10.1186/s12864-016-3278-x [DOI] [PMC free article] [PubMed] [Google Scholar]
