Conspectus
Among the many analytical methods applied to RNA modifications, a particularly pronounced surge has occurred in the past decade in the field of modification mapping. The occurrence of modifications such as m6A in mRNA, albeit known since the 1980s, became amenable to transcriptome-wide analyses through the advent of next-generation sequencing techniques in a rather sudden manner. The term “mapping” here refers to detection of RNA modifications in a sequence context, which has a dramatic impact on the interpretation of biological functions. As a consequence, an impressive number of mapping techniques were published, most in the perspective of what now has become known as “epitranscriptomics”. While more and more different modifications were reported to occur in mRNA, conflicting reports and controversial results pointed to a number of technical and theoretical problems rooted in analytics, statistics, and reagents. Rather than finding the proverbial needle in a haystack, the tasks were to determine how many needles of what color in what size of a haystack one was looking at.
As the authors of this Account, we think it important to outline the limitations of different mapping methods since many life scientists freshly entering the field confuse the accuracy and precision of modification mapping with that of normal sequencing, which already features numerous caveats by itself. Indeed, we propose here to qualify a specific mapping method by the size of the transcriptome that can be meaningfully analyzed with it.
We here focus on high throughput sequencing by Illumina technology, referred to as RNA-Seq. We noted with interest the development of methods for modification detection by other high throughput sequencing platforms that act directly on RNA, e.g., PacBio SMRT and nanopore sequencing, but those are not considered here.
In contrast to approaches relying on direct RNA sequencing, current Illumina RNA-Seq protocols require prior conversion of RNA into DNA. This conversion relies on reverse transcription (RT) to create cDNA; thereafter, the cDNA undergoes a sequencing-by-synthesis type of analysis. Thus, a particular behavior of RNA modified nucleotides during the RT-step is a prerequisite for their detection (and quantification) by deep sequencing, and RT properties have great influence on the detection efficiency and reliability. Moreover, the RT-step requires annealing of a synthetic primer, a prerequisite with a crucial impact on library preparation. Thus, all RNA-Seq protocols must feature steps for the introduction of primers, primer landing sites, or adapters on both the RNA 3′- and 5′-ends.
Key References
Hauenschild R.; Tserovski L.; Schmid K.; Thüring K.; Winz M.-L.; Sharma S.; Entian K.-D.; Wacheul L.; Lafontaine D. L. J.; Anderson J.; Alfonzo J.; Hildebrandt A.; Jäschke A.; Motorin Y.; Helm M.. The Reverse Transcription Signature of N-1-Methyladenosine in RNA-Seq Is Sequence Dependent. Nucleic Acids Res. 2015, 43, 9950–9964 .1 Describes deep sequencing analysis of m1A RT signatures in rRNA and tRNAs with Random Forest algorithm for classification of hits.
Werner S.; Schmidt L.; Marchand V.; Kemmer T.; Falschlunger C.; Sednev M. V.; Bec G.; Ennifar E.; Höbartner C.; Micura R.; Motorin Y.; Hildebrandt A.; Helm M.. Machine Learning of Reverse Transcription Signatures of Variegated Polymerases Allows Mapping and Discrimination of Methylated Purines in Limited Transcriptomes. Nucleic Acids Res. 2020, 48, 3734–3746 .2 Extended study of RT signatures for m1A, m66A, m1G, and m22G in RNA using a set of RT enzymes and Random Forest model.
Marchand V.; Ayadi L.; Ernst F. G. M.; Hertler J.; Bourguignon-Igel V.; Galvanin A.; Kotter A.; Helm M.; Lafontaine D. L. J.; Motorin Y.. AlkAniline-Seq: Profiling of m7G and m3C RNA Modifications at Single Nucleotide Resolution. Angew. Chem., Int. Ed. 201857, 16785–16790.3 Description of highly sensitive protocol for detection of RNA modifications via formation of abasic site and subsequent cleavage.
Marchand V.; Pichot F.; Neybecker P.; Ayadi L.; Bourguignon-Igel V.; Wacheul L.; Lafontaine D. L. J.; Pinzano A.; Helm M.; Motorin Y.. HydraPsiSeq: A Method for Systematic and Quantitative Mapping of Pseudouridines in RNA. Nucleic Acids Res. 2020, 48, e110 .4 Description of the protocol for mapping and quantification of pseudouridines and 5-substituted U residues in RNA.
Introduction
RNA modifications, now known as “epitranscriptomics”,5−8 differ from their parental unmodified nucleotides in their chemical structure and, in consequence, their physicochemical properties and chemical reactivity, including in particular base pairing properties (Figure 1A,B).
Figure 1.
RNA nucleobases and modified nucleotides in RNA. (A) Four canonical RNA purine and pyrimidine nucleobases (A, G, C, U), with atom numbering. Watson–Crick (WC) and Hoogsteen edges as well as donors (red)/acceptors (blue) for H-bond formation are indicated. (B) Some prominent examples of modified RNA nucleobases. Deviations from parental nucleosides are shown in green. Chemical structures are shown for neutral pH (pH 7.0) and protonation status is indicated considering the pKa values.9,10 (C) 2′-O-Methylation and (D) m1A as an example of a modification that affects Watson–Crick pairing.
These features sometimes allow mapping of RNA modifications by RNA-Seq. Altered base pairing, typically linked to the presence of additional chemical groups at the Watson–Crick edge of the base (Figure 1B,D), may be detected by a number of classical methods, by RNA-Seq adaptations, as well as by most other next-generation sequencing (NGS) technologies.11−15 For those methods relying on reverse transcription (RT) prior to sequencing (Figure 2A), modified sites may appear either as strong arrest of primer extension (RT-stop) or, in the majority of cases, as the presence of misincorporations in cDNA (“mutations”) and as base deletions (jumps). These three features in the sequenced cDNA strand together constitute a so-called RT-signature (Figure 2B), which stems from the inability of the enzyme to bind base-pair incoming dNTPs with an RNA modification in the template.
Figure 2.
Synthesis of cDNA by reverse transcriptase (RT) from RNA template. The enzyme requires a hybridized primer and a free 3′-OH to be extended (A). Modified nucleotide in the template (shown as X) can alter the RT-signature, through misincorporations, deletions, or abortive cDNA synthesis (B).
Reverse transcriptases, when moving along an RNA template, employ four different substrates, namely, the four dNTPs, for cDNA synthesis, presumably each with a different affinity and turnover, approximated by different Km and kcat values. Based on this alone, one must assume differential catalytic efficiency for each different templated RNA nucleoside. Every single parameter of the in vitro conditions is, in principle, liable to differentially affect incorporation of each different incorporated dNTP. These parameters include, in particular, the individual concentrations of each of the four dNTPs, pH, divalent cations, ionic strength, temperature, and, of course, the type of enzyme and specific activity of each individual enzyme preparation. Biases in efficiency of cDNA synthesis will be further compounded by sequence context, i.e., RNA primary structure, and by stable RNA secondary and tertiary structures that need to be resolved before the RT can process the template. Consequently, RT-stops will occur in an uneven and currently unpredictable distribution, forming a background of events that constitute substantial noise against which modification-dependent RT-stops must be contrasted with statistical significance. Moreover, RNA modifications are installed site-specifically by dedicated modification enzymes, albeit a given site may not be fully modified. For such substoichiometric modification the RT-stop signal is necessarily lower, decreasing the signal-to-noise ratio and thus detection efficiency and confidence.
Before the advent of NGS, the one modification that was easily detectable by normal RT-based sequencing was inosine, a deamination product of adenosine that very faithfully base-pairs with cytidine. This corresponded to a quantitative misincorporation and allowed large scale mapping approaches even with basic sequencing techniques.16−18 Beyond this, RT-stops caused by different modifications had already been observed and used in an analytical perspective to a limited degree.19,20 The influence of a selected few in vitro parameters on RT-stops had also already been explored and exploited, for example, for the detection of Nm modifications (Figure 1C) by primer extension at low dNTP concentrations.21,22 The influence of such parameters on misincorporation in the sequence of the cDNA was, however, only known from DNA polymerases used in PCR.23 With the onset of the NGS era, altered base pairing properties of RNA modified nucleotides and ensuing misincorporation became more apparent24 and an important source of information to identify positions of RNA in generic RNA-Seq data sets.25−27
In more sophisticated approaches, the above-mentioned modulation of RT conditions, e.g., via concentration of metal ions28 and dNTP substrates, were now adapted to NGS, e.g., for the detection of Nm residues.29 This is one example of many where the Watson–Crick edge is unaffected by the modification, causing a weak or invisible RT-signature.27,30 In the case at hand, it becomes enhanced through modulation of the RT-conditions, namely lowered dNTP conditions which diminish the reverse transcriptase activity to the point where it stalls and produces abortive cDNA. Further sophistication included the combination of stalling and misincorporation as important contributors to the RT-signature,28 which was then shown to vary, e.g., among different enzymes such that the combined RT-signature of two RT-enzymes was more informative than one alone.2 This inspired the directed evolution of enzymes whose RT-signature was specifically sensitive to modifications such as m6A, Nm, Q and Ψ, which were otherwise considered to be RT-silent.31−33 Another sophistication included jumps in the RT-signature, a feature that seems to be fraught with less background.34−38 However, it became clear that the number of RNA modifications whose altered base pairing properties allowed comprehensive detection was limited.39
To address this gap, a growing number of methods were and still are being developed that use chemical or enzymatic treatments, which, by selectively acting on modified or unmodified nucleotides enhance the information content of sequencing data. This amounts to a conversion of RT-silent modifications into chemical structures that can now be detected by RT-signature. A pioneering method that was already developed in the 1970s40,41 uses a carbodiimide, specifically 1-cyclohexyl-(2-morpholinoethyl)carbodiimide metho-p-toluene sulfonate (CMCT), which forms alkaline resistant covalent adducts with pseudouridine. A pseudouridine thus becomes detectable as a CMCT-dependent RT-stop.42−44 Of note, this method has meanwhile been adapted to NGS-based mapping of pseudouridine by a number of groups.45−47 Among the earliest RNA modifications to become accessible to transcriptome-wide mapping was m5C. Bisulfite induced chemical deamination of all unmodified cytosines results in uridines, while methylated m5C is resistant to deamination.48,49
Beyond these two classical examples a large variety of methods using enzymes50−52 and chemicals were developed, some of which derive from very early nucleic acid sequencing.53−55 While we have reviewed and discussed these methods elsewhere,7,8,56 we will here focus on methods jointly developed in our laboratories and the conclusions drawn from these developments.
Exploiting the Reverse Transcription Signature of Modifications on the Watson–Crick Edge
At the onset of our work in this field, it was clear that some modifications, in particular those with an altered Watson–Crick edge,20 would block reverse transcription. With the efficiency of that process unknown, we set out to characterize in depth the properties of m1A in RT reactions. m1A is a noncanonical nucleoside generated both, by dedicated enzymes57 and in alkylation reactions used, e.g., in structural probing.20 Various m1A-containing RNAs, including tRNAs, rRNAs, and some synthetic RNAs, were subjected to RNA-Seq according to a protocol previously developed with an eye to unbiased amplification, thereby specifically including cDNAs resulting from abortive RT events.1,58
Importantly m1A-induced RT-stops turned out to not be quantitative. Rather, a significant fraction of read-through events was consistently observed to contain variegated levels of misincorporation of the three dNTPs noncognate to the unmodified adenosine residue, as well as of dTTP. The latter caused a signal identical to that arising from unmodified RNA, thereby effectively erasing information on the presence of m1A. Moreover, when compiled from 37 tRNA sequences from yeast, an averaged profile only showed about 50% arrest, suggesting a substantial amount of information to be hidden in what we then termed the RT-signature of m1A. Data mining of the combination of RT-arrests and misincorporation suggested itself for the detection of thus far unknown or unconfirmed m1A sites. A follow-up with synthetic m1A-containing oligos showed the RT-signature to be strongly influenced by the identity of the upstream dNTP. Thus, the residue incorporated just prior to the enzyme happening upon the noncanonical nucleoside in the RNA template influences the efficacy of RT-arrest, misincorporation, or read-through, which is plausible, given its spatial proximity to the catalytic center. Further analysis showed a mild contribution even of the penultimately incorporated dNTP. Hence it was clear that the relative contribution to the RT-signature of m1A by misincorporation of different dNTPs, by RT-arrest, or by jumps depends on the sequence context. In subsequent work, we characterized the influence of the concentrations of Mg2+ and Mn2+28 and demonstrated that Mn2+ could be employed to enhance the RT-signature of m1A and therefore facilitate its detection. We furthermore found highly variegated propensities for RT-arrest, mismatch incorporation, and jumps within a panel of 13 reverse transcription enzymes.2 By conjecture, we expect that other reaction conditions such as temperature, pH, and salt are very likely to also affect RT-signatures, possibly along with the specific activity of a given enzyme preparation.
In conclusion, RT-signatures are highly variable and correspondingly difficult to employ in the detection and quantification of modifications. However, while the heterogeneous contributions of the above features to the overall RT-signature make it difficult to define rigid thresholds for detection, machine learning turned out to be a viable option for exploiting the information content of RT-signatures for mapping purposes. In the case at hand, we annotated known sites of m1A in RNA-Seq data as positive learning instances, as well as an equal number of bona fide nonmodified adenosines as negative instances. A fraction of about 80% of the RNA-Seq data was used for training a random forest model, which was then used for predicting occurrence of m1A in the remaining 20% of the data set. After repetition of the procedure with permutations within the same data set, the obtained performance parameters were averaged to provide a combined assessment.1,58 Foremost among the various statistical parameters that can be used to characterize the quality of a prediction method, machine learning or other, are false positive and false negative rates, elsewhere discussed in terms of sensitivity and specificity. We will discuss some of these numbers further below.
The principal insights obtained using m1A as a model modification could be recapitulated for other modifications affecting the Watson–Crick edge of purines. Of interest, two consecutive nucleotides of m66A modifications produce a RT-pattern similar to the signature of m1A,58 but those have not been reported outside rRNA. We characterized RT-signatures of m1G and m22G2 and showed that these could be discriminated based on their signature and that the corresponding signatures could be used for meaningful predictions in a transcriptome of limited size, i.e., restricted to tRNAs and rRNA, or ∼104 nt.
Modulating the RT-Signature by Chemical Treatment
In general, strong signatures are known from modifications that are bulky or have impacts on the Watson–Crick edge of the nucleobase. This can be rationalized with an eye on the catalytic center of a RT enzyme. As part of its catalytic cycle, the enzyme tries to fit incoming dNTPs onto the RNA template, thereby observing Watson–Crick base pairing. Bulky modifications are likely to cause steric clashes in the active site, thereby slowing or entirely preventing catalysis. Similarly, the impossibility of establishing a perfect Watson–Crick pair on a modified nucleobase will lead to rejection rather than the incorporation of a dNTP with higher probability, thereby causing longer pauses and potential abortion at modified sites. Indeed, it seems that under suboptimal reaction conditions, stalling might be more likely, at least for certain modifications. Also, as we have outlined above, even the rather blatant block of the Watson–Crick edge in m1A causes some incorporation of dTTP into the cDNA, effectively erasing modification information. To circumvent such features of an “incomplete” signature, numerous chemical treatments have been developed to enhance detection. Some date back to the time when so-called “direct” RNA sequencing was developed.53,55
NO-Seq
Among the earliest was deamination by bisulfite, a reagent that strongly differentiates between canonical cytidine and m5C and converts the former to a uridine, which can be read-out in sequencing without additional effort, save some adjustment of the mapping algorithm.48 We have developed a similar approach for the detection of m6A (Figure 3A,B). The chemical reasoning involves deamination of canonical adenosines to inosines, which behave like a guanosine in reverse transcription. In contrast, m6A residues are not converted and instead continue to be read out as adenosines. However, the deamination reaction by nitrous acid (Figure 3B) involved conditions that lead to the concomitant deamination of cytosines to uridines and also of some of the guanosines to xanthosine (Figure 3C). The deaminated RNA template leads to cDNA synthesis and subsequently obtained reads which consist of mostly dA and dC signals, which originate from uridines, inosines, and unreacted guanosines, respectively (Figure 3D). The effect of riboxanthosine on reverse transcription remains ill understood. Our own studies showed that xanthosines in deaminated RNA templates were associated with RT-arrest,59 while synthetic xanthosines in oligonucleotides obtained by phosphoramidite chemistry did not.60 Comprehensive deamination has several effects on the information contained in the resulting cDNA sequencing. Given that all information on cytidines has been effectively erased at this point and that dT indicates m6A sites, the remaining sequence elements are composed of only two nucleotides, namely, dA, incorporated at former C and U sites, and dC, incorporated at inosine, i.e., at former A sites.
Figure 3.
Main features of the NO-Seq protocol for detection of m6A. Modified nucleotides (A), reagents used (B), and (C) chemical rationale. m6A reacts with the reagent but still produces a T signal after RT. Other deamination products are formed by the reagent. (D) Scheme depicting changes in base composition and sequence upon deamination, RT, and PCR steps. (E) Substitution matrix used by custom alignment. For example, A or G in the sequencing read may correspond to A in the initial RNA. (F) Adenosine deamination is incomplete to warrant RT and PCR amplification. (G) Library preparation protocol consisting of targeted amplification of the region of interest. The signal in NO-Seq is visible as an A residue resistant to deamination.
Among the consequences is a decrease in uniquely mapped reads as the information content is effectively reduced to one-fourth of the original RNA sequence. Mapping algorithms needed to be adapted to account for the lower number of nucleotides that carry information and ensuing ambiguities (Figure 3E). At the same time, complete deamination with nitrous acid reduced the yield of reverse transcription and ultimately of reads in Illumina sequencing to a point where it became impractical for analysis. We therefore resorted to partial deamination (Figure 3F), which significantly improved read numbers and mapping ambiguity, albeit at the expense of sensitivity. Testing the method with a range of synthetic oligonucleotides containing m6A in varying stoichiometry at a defined site, it was possible to quantify sites containing down to 30% m6A. Overall, the method gives many false positive hits to be used for transcriptome-wide applications and therefore is restricted to analysis of short amplicons (Figure 3G) in the size of 102 nucleotides.59 Meanwhile, the introduction of glyoxal as a protection reagent has opened up new perspectives. Glyoxal reversibly masks the exocyclic amine functions in cytidines and guanosines and thus protects them from deamination by nitrous acid. Mild acid hydrolysis after deamination restores the original nucleobases and their information content and thereby has allowed transcriptome-wide mapping applications.61
The Detection Principle of AlkAniline-Seq Is Independent of RT-Signature
In fundamental contrast to the methods described above, reverse transcription signatures occur only as a byproduct in the Alkaline-Aniline-Sequencing method (AlkAniline-Seq). The central feature of this method is the generation of 5′-phosphates downstream of certain modifications. Because these serve as exclusive entry sites to library preparation, AlkAniline-Seq exhibits significantly lower experimental noise and bias.
The original development of the AlkAniline-Seq protocol was intended for mapping of 7-methylguanosine (m7G) residues in RNA (Figure 4A) and was based on a concept that goes back to the characterization of the particular chemical properties of m7G several decades ago. Reduction by NaBH4 under alkaline conditions was demonstrated to promote cleavage of the N-glycosidic bond, resulting in an RNA abasic site.62,63 Of note, a naked RNA abasic site lacking a nucleobase also shows a distinct RT signature.64 The abasic site also features an accessible aldehyde group as part of an equilibrium and can thus undergo condensation with a reactive amine (Figure 4B), typically aniline, to yield a Schiff base. Under alkaline conditions, this in turn leads to cleavage of the RNA chain via β-elimination.53,55,65 Of note, the RNA fragment 3′-downstream of the elimination site now contains a 5′-phosphate (Figure 4C). Analysis of the cleavage sites by gel electrophoresis or sequencing was used in several perspectives. For example, this reaction was used in early direct (chemical) RNA sequencing protocols, for specific cleavage of RNA at all G residues, after modification of G to m7G by chemical methylation with dimethylsulfate (DMS),53 or for DMS-mediated probing of 3D RNA structure.55,66 Barring the DMS methylation step, the same reaction can also be used for mapping of naturally modified m7G residues by RT primer extension.62,63 Detection of m7G residues in RNA can be achieved in approaches using various combinations of the above elements:
-
(1)
Generation of m7G-derived RNA abasic site by NaBH4, followed by aniline cleavage of the RNA abasic and subsequent detection of RT arrests at the position of cleavage. This approach was used in classical protocols for m7G analysis by primer extension.62,63
-
(2)
Similar to the previous approach, m7G-derived RNA abasic site was generated by treatment with NaBH4, but the aniline cleavage step was omitted. Instead, RT signatures generated at these positions were analyzed in an RNA-Seq context.64,67
-
(3)
Covalent labeling of m7G-derived RNA abasic sites with an Aldehyde Reactive Probe (ARP)-like reagent was performed, followed by detection of RT-stops at the bulky ARP structure. In a variation, the ARP contained a biotinylated moiety, enabling prior enrichment of RNA abasic sites. This approach was implemented in high-throughput versions.68,69
-
(4)
As mentioned, the 5′-phosphate released specifically upon cleavage of the m7G-derived RNA abasic site can be targeted for specific ligation of RNA-Seq adapters. This results in a specific enrichment of modification-derived fragments and is implemented in the AlkAniline-Seq protocol3 as well in TRAC-Seq.70
During the development of what eventually became the AlkAniline-Seq protocol (Figure 4D), we compared approaches 1 and 4 described above in an RNA-Seq coupled setting. Indeed, a specific signal corresponding to m7G1575 in Saccharomyces cerevisiae 18S rRNA was detected based on RT arrest at the cleavage site according to the “classical” approach 1. However, the high background signal rendered detection of m7G residues elsewhere in the transcriptome rather questionable. In contrast, ligation of 5′-adapters to the specifically released 5′-phosphate depressed background and correspondingly increased the signal-to-noise ratio. Omission of NaBH4, in what were meant to be control reactions, still showed chain scission upon subsequent alkaline RNA fragmentation for library preparation at pH 9.3 and 95 °C.71 Omission of NaBH4 avoided side reactions including the reduction of s4U,72 and further depressed noise. Interestingly the replacement of alkaline hydrolysis by Mg2+-driven RNA cleavage at neutral pH produced only very low signals at the m7G sites, identifying alkaline conditions as essential.
Figure 4.
Main features of AlkAniline-Seq protocol for detection of m7G, m3C, D, and ho5C. Modified nucleotides detected (A), reagents used (B), and chemical rationale of the method (C). Decomposition of the RNA abasic site generated 5′-phosphate at the N + 1 nucleotide in the sequence. (D) library preparation protocol. Phosphatase removes all 3′- and 5′-phosphates. (E) Nature of the signal and (F) scoring system for AlkAniline-Seq signals.
Aniline-induced chain scission proceeds via β-elimination, directly forming a 5′-phosphate on the downstream fragment, and indirectly leading to a 3′-phosphate on the upstream fragment. The latter is of little relevance, but the former constitutes a specific molecular entry into library preparation (Figure 4E,F). Therefore, treatment with phosphatase again improved the S/N ratio and in particular the specificity by removing all pre-existing 5′-phosphates resulting from biological RNA processing or degradation events in vivo or in vitro.
Application of this AlkAniline-Seq protocol to bacterial and eukaryotic rRNAs faithfully detected known and anticipated m7G sites3,73,74 with exquisite S/N values. Moreover, inspection of AlkAniline-Seq profiles for E. coli rRNA revealed two additional signals corresponding to modified dihydrouridine (D2449) and 5-hydroxycytosine (ho5C2501)75 with somewhat lower amplitude. Further signals for D were found in bacterial and eukaryotic tRNAs. A plausible molecular basis for these results is a ring opening of the dihydrouracil base under alkaline conditions to give β-ureidopropionic acid.76 The corresponding ring-opened nucleotide derivative is compatible with a β-elimination upon aniline treatment.
Another unanticipated finding consisted of strong signals at all known 3-methylcytosine (m3C) residues in eukaryotic cytoplasmic and mitochondrial tRNAs. While we found m3C to deaminate to m3U under alkaline conditions; m3U sites in rRNA did not show signals. Thus, m3C in RNA must induce chain scission before deamination, apparently activating its downstream phosphate for scission only in the context of the RNA chain.
In summary, AlkAniline-Seq detects at least m7G, m3C, D, and ho5C (Figure 4A), but our unpublished data indicate further signals presumably arising from noncanonical nucleobases of nonenzymatic origin or from RNA abasic sites. The latter might, in turn, arise from spontaneous loss of RNA bases, including damaged ones. In vivo, RNA damage is thought to mostly result from reactive oxygen species (ROS), either directly leading to oxidized bases or through ROS-mediated lipid oxidation, the latter leading to unspecific alkylation77 plausibly including m7G and m3C of nonenzymatic origin. Indeed, most of the noise in AlkAniline-Seq originates from guanosines, making 8-oxo-guanosine and its downstream ring-opened oxidation products78 prime suspects. The critical dependence of AlkAniline-Seq on unique 5′-phosphates substantially complicates precise and accurate quantification, since calibration curves between AlkAniline-Seq signals and modification content are not linear for all used scoring systems.73,74,79 Its very low background and exceptional S/N ratio qualify AlkAniline-Seq analyses for full mRNA transcriptomes with a complexity of 106–107 nt.
In the backdrop of large numbers of mapping techniques appearing in the literature, AlkAniline-Seq protocol was successfully validated with known sites in rRNA and tRNA from different species.73,80,81 However, our analysis did not detect any plausible modification signal for m7G, m3C, D, or ho5C in any of the studied eukaryotic mRNAs (S. cerevisiae, Arabidopsis thaliana, Mus musculus, Homo sapiens). These observations corroborate the results reported by others on m7G in E. coli and S. cerevisiae mRNAs.64
HydraPsiSeq
The interest in techniques for mapping and quantification of pseudouridine (ψ) as one of the most widespread and abundant RNA modifications is high. Pseudouridine (Figure 5A) and its derivatives are the only known natural nucleotides with a C–C glycosidic bond. In addition, the isomer of uridine features an additional N–H group, and these features provide particular physicochemical properties and reactivity. The most prominent of several reactions that distinguish between U and ψ is stable adduct formation with CMCT. Resulting methods (cited above) are relatively sensitive but feature a high RT-stop background, which significantly depresses the S/N signal, requiring appropriate spike-in calibration for adequate quantification.82 As another differential reactivity, ψ residues are insensitive to hydrazine (Figure 5B,D) even at high concentrations, while unmodified U undergoes a Michael addition at C6, with subsequent destruction of the aromatic pyrimidine base. The resulting damaged site, upon treatment with aniline, leads to chain cleavage via β-elimination (Figure 5B). Consequently, detection of ψ and other hydrazine-insensitive residues can be achieved by measuring noncleavage at presumed uridine sites which are protected by their modification chemistry (Figure 5C). Quantification of such negative signals, if RT-based, is subject to the various sources of noise mentioned above. Despite this, hydrazine hydrolysis was occasionally employed as an auxiliary method for detection of ψ (and also m5U) in rRNAs, brome mosaic virus (BMV) viral RNA, and snRNAs.41,83,84
Figure 5.
Detection and quantification of pseudouridines and m5U by HydraPsiSeq. Modified nucleotides detected (A), reagents used (B), and chemical rationale (C). Some modified U nucleotides are resistant to cleavage, while some derivatives of G and C are extensively cleaved (panel A). (D) Decomposition of the RNA abasic site after U cleavage by hydrazine generates 5′-phosphate at the N + 1 nucleotide in the sequence. (E) Library preparation protocol. Removal of only 3′-P residues in RNA is achieved by T4 PNK phosphatase activity. (F) Nature of the signal and (G) scoring system for the HydraPsiSeq signals.
For an adaptation to NGS, we identified two options for the exploitation of random hydrazine cleavage. Option 1, as in the preceding section, leverages RT-stops after primer extension of aniline-cleaved RNA. Option 2 is more specific and conceptually related to AlkAniline-Seq. A graphical overview of HydraPsiSeq4,85 in Figure 5E shows hydrazine-induced cleavage being followed by extensive T4 polynucleotide kinase (T4 PNK) treatment in order to selectively remove the resulting 3′-phosphates without affecting 5′-phosphate in the process. Consequently, the resulting RNA fragments, which are randomly cleaved at all U residues, exhibit 3′-OH extremities and ligation-competent 5′-phosphate used for adapter ligation and conversion into a sequencing library with simultaneous barcoding. Coverage in the order of 1000 reads per position is required, making the method practical almost exclusively for highly abundant RNAs.
The data analysis pipeline includes adapter trimming, alignment to RNA reference, and tallying the 5′-extremities of the reads for each position in the reference (Figure 5F). In a scoring system named NormUcleavage (Figure 5G), raw data are normalized to the local background of A/C/G cleavage in a rolling 11 nt window. Tallied values are used for constructing uridine profiles and subsequent calculation of a protection score at all U-sites. Because of the latter, data analysis is conceptually similar to the popular RiboMethSeq protocol, which identifies cleavage-protected Nm residues,86,87 and consequently the same scores are used.
Apart from ψ, 5-substituted uridines, such as m5U and some hypermodified uridines, typically found at the wobble position of tRNA, were also found to be resistant to cleavage (Figure 5A). Weak signals at D sites suggest at least partial protection, which might also be related to substoichiometrically modified D sites. Unmodified A, C, and G nucleotides are inert; however, m7G and the bacterial wobble modification lysidine (k2C) (Figure 5A) give strong signals.85 Interestingly no signals were apparent for m3C, even though this nucleotide was reported to be sensitive to hydrazine,88 an early finding that was exploited in so-called “direct” RNA sequencing and structural probing.53,55 Cleavage conditions substantially differ in hydrazine and salt concentrations between m3C cleavage (10% hydrazine, 3 M NaCl) and HydraPsiSeq (50% hydrazine).
Apart from ψ quantification in noncoding RNA such as rRNA, HydraPsiSeq can be potentially applied for confirmation of candidate ψ residues in coding or other low abundant RNAs;89 however, substantial sequencing efforts are required to achieve sufficient coverage. In practice, HydraPsiSeq turned out to be rather sensitive to the quality of chemical reagents, in particular, of hydrazine, requiring some testing to account for batch-to-batch variation. Since HydraPsiSeq is based on detection of “negative” signals, a substantial number of false positive hits may be expected, related to irregularities in RNA cleavage by hydrazine and to ligation biases during the library preparation step. The S/N ratio for HydraPsiSeq is compatible with rRNA/tRNA pool transcriptome size, thus on the order of 104–105 nt.
Error Range and Statistics Should Determine the Application Range
After a given method has run its course in the wet lab (Table 1), data treatment takes over. There is vanishingly little common ground among the methods used for identifying modification sites from Illumina data, to the point that comparative analysis suggests that the influence of data analysis pipelines on the results may dominate over that of the underlying biology.90 The developments in RNA modification mapping of the past decade have clearly shown a tendency to overestimate the number of modifications detected,91 and hence a need to depress the number of false positives, i.e., increase specificity, if necessary at the cost of increasing false negative signals, equivalent to accepting a loss in sensitivity. As the field has seen several open controversies over mapping issues, we suggest that upon publication of a new mapping method, one should state the size of the epitranscriptome (“limited epitranscriptome”) within which the method produces reasonable and verifiable predictions. Thus, rather than stating the number of newly identified modification sites in a full cellular transcriptome, one might state, if the method is rather suited to investigating the ensemble of tRNAs and rRNAs, thus a “limited transcriptome”2 of about 104–105 nt in size. From our decade of joint experience, this applies to the majority of methods, whereas only a few are suitable to tackle mRNAs, which invoke an epitranscriptome on the order of 107 nt. Still from our own experience, AlkAniline-Seq is one of very few methods that are up to this task, since this method has a very high S/N ratio and is highly sensitive even for low substoichiometric modification rate. Its particularly high S/N ratio and corresponding sensitivity result, as explained, from its peculiar chemistry that provides privileged entry into sequencing libraries for fragments carrying pertinent information.
Table 1. Comparative Characteristics of the Protocols Discussed in the Review.
Parameters | Protocol | |||
---|---|---|---|---|
RT-Seq | NO-Seq | AlkAniline-Seq | HydraPsiSeq | |
Detected modification | m1A, m1G, m22G, m3U, m3C, m66A, other residues modified at the WC face | m6A | m7G, m3C, D, ho5C/ho5U, RNA abasic sites | Ψ, m5U, 5-modified U*34, k2C (positive cleavage signal), m7G (positive cleavage signal) |
Nature of the signal | RT stop + misincorporation/jump signature | Resistance of m6A to deamination compared to unmodified A | Cleavage at the abasic RNA site resulting from decomposition of the modified residue | Resistance of Ψ, m5U, 5-modified U*34 residues to hydrazine cleavage, over cleaved U residues |
Detection type | positive | negative | positive | negative |
Input RNA required | >100 ng | ∼ 500 ng | >50 ng | >150 ng |
Molar sensitivity | average | low | very high | average |
Quantification | Possible, but complicated | Precise | Possible, but calibration is not linear | Precise |
Reagents used | None (different RT enzymes recommended) | NaNO2/H+ | Alkaline hydrolysis/Aniline | Hydrazine/Aniline |
Library preparation step | Custom protocol | Custom protocol | NEB Small RNA kit | NEB Small RNA kit |
Required sequencing depths | Coverage >100 reads/position, better >500 | Coverage >50 reads/position | >50 reads/RNA position | >1000 reads/RNA position |
Recommended size of ≪ limited transcriptome ≫ | 10 000 bases rRNA/tRNA size | 100 bases Custom amplicons | 10 000 000 bases Transcriptome-wide | 10 000 bases rRNA/tRNA size |
Bioinformatics analysis | straightforward | Complicated (custom alignment algorithm) | straightforward | Intermediate (normalization and score calculations) |
Acknowledgments
Research in the Helm lab was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – Project-ID 439669440 – “RMaP” TRR 319 (C01)“ and by Project HE 3397/21-1. Figures created with BioRender.com.
Biographies
Yuri Motorin studied chemistry and chemical enzymology at Lomonosov Moscow University, USSR, and obtained his Ph.D. in Biochemistry in 1989, at the A.N. Bakh Institute of Biochemistry in Moscow. After a postdoctoral period in Gif-sur-Yvette, France, he established his group at the University of Lorraine in Nancy. His research aims to develop analytical methodology for RNA modification analysis by combination of specific chemical treatment and deep sequencing.
Mark Helm studied chemistry in Würzburg, Germany, and obtained his Ph.D. in Molecular Biology in Strasbourg, France, in 1999. After postdoctoral fellowships at Caltech, USA, and the Free University of Berlin, Germany, he started his independent research group in 2002 in Heidelberg Germany. Since 2009, he is Professor of Pharmaceutical Medicinal Chemistry in Mainz, Germany, where he is currently Chair of the Research Consortium “RMaP: RNA Modification and Processing”.
Supporting Information Available
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.accounts.3c00529.
Comparative characteristics of the protocols discussed (PDF)
Author Contributions
CRediT: Yuri Motorin conceptualization, writing-original draft; Mark Helm conceptualization, writing-original draft.
The authors declare no competing financial interest.
Special Issue
Published as part of the Accounts of Chemical Research special issue “RNA Modifications”.
Supplementary Material
References
- Hauenschild R.; Tserovski L.; Schmid K.; Thüring K.; Winz M.-L.; Sharma S.; Entian K.-D.; Wacheul L.; Lafontaine D. L. J.; Anderson J.; Alfonzo J.; Hildebrandt A.; Jäschke A.; Motorin Y.; Helm M. The Reverse Transcription Signature of N-1-Methyladenosine in RNA-Seq Is Sequence Dependent. Nucleic Acids Res. 2015, 43, 9950–9964. 10.1093/nar/gkv895. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Werner S.; Schmidt L.; Marchand V.; Kemmer T.; Falschlunger C.; Sednev M. V.; Bec G.; Ennifar E.; Höbartner C.; Micura R.; Motorin Y.; Hildebrandt A.; Helm M. Machine Learning of Reverse Transcription Signatures of Variegated Polymerases Allows Mapping and Discrimination of Methylated Purines in Limited Transcriptomes. Nucleic Acids Res. 2020, 48, 3734–3746. 10.1093/nar/gkaa113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marchand V.; Ayadi L.; Ernst F. G. M.; Hertler J.; Bourguignon-Igel V.; Galvanin A.; Kotter A.; Helm M.; Lafontaine D. L. J.; Motorin Y. AlkAniline-Seq: Profiling of m7G and m3C RNA Modifications at Single Nucleotide Resolution. Angew. Chem., Int. Ed. Engl. 2018, 57, 16785–16790. 10.1002/anie.201810946. [DOI] [PubMed] [Google Scholar]
- Marchand V.; Pichot F.; Neybecker P.; Ayadi L.; Bourguignon-Igel V.; Wacheul L.; Lafontaine D. L. J.; Pinzano A.; Helm M.; Motorin Y. HydraPsiSeq: A Method for Systematic and Quantitative Mapping of Pseudouridines in RNA. Nucleic Acids Res. 2020, 48, e110 10.1093/nar/gkaa769. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Saletore Y.; Meyer K.; Korlach J.; Vilfan I. D.; Jaffrey S.; Mason C. E. The Birth of the Epitranscriptome: Deciphering the Function of RNA Modifications. Genome Biol. 2012, 13, 175. 10.1186/gb-2012-13-10-175. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Saletore Y.; Chen-Kiang S.; Mason C. E. Novel RNA Regulatory Mechanisms Revealed in the Epitranscriptome. RNA Biol. 2013, 10, 342–346. 10.4161/rna.23812. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Helm M.; Motorin Y. Detecting RNA Modifications in the Epitranscriptome: Predict and Validate. Nat. Rev. Genet. 2017, 18, 275–291. 10.1038/nrg.2016.169. [DOI] [PubMed] [Google Scholar]
- Motorin Y.; Helm M. Methods for RNA Modification Mapping Using Deep Sequencing: Established and New Emerging Technologies. Genes (Basel) 2019, 10, 35. 10.3390/genes10010035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- La Francois C. J.; Jang Y. H.; Cagin T.; Goddard W. A.; Sowers L. C. Conformation and Proton Configuration of Pyrimidine Deoxynucleoside Oxidation Damage Products in Water. Chem. Res. Toxicol. 2000, 13, 462–470. 10.1021/tx990209u. [DOI] [PubMed] [Google Scholar]
- Jones E. L.; Mlotkowski A. J.; Hebert S. P.; Schlegel H. B.; Chow C. S. Calculations of pKa Values for a Series of Naturally Occurring Modified Nucleobases. J. Phys. Chem. A 2022, 126, 1518–1529. 10.1021/acs.jpca.1c10905. [DOI] [PubMed] [Google Scholar]
- Begik O.; Lucas M. C.; Pryszcz L. P.; Ramirez J. M.; Medina R.; Milenkovic I.; Cruciani S.; Liu H.; Vieira H. G. S.; Sas-Chen A.; Mattick J. S.; Schwartz S.; Novoa E. M. Quantitative Profiling of Pseudouridylation Dynamics in Native RNAs with Nanopore Sequencing. Nat. Biotechnol. 2021, 39, 1278–1291. 10.1038/s41587-021-00915-6. [DOI] [PubMed] [Google Scholar]
- Furlan M.; Delgado-Tejedor A.; Mulroney L.; Pelizzola M.; Novoa E. M.; Leonardi T. Computational Methods for RNA Modification Detection from Nanopore Direct RNA Sequencing Data. RNA Biol. 2021, 18, 31–40. 10.1080/15476286.2021.1978215. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lucas M. C.; Novoa E. M. Long-Read Sequencing in the Era of Epigenomics and Epitranscriptomics. Nat. Methods 2023, 20, 25–29. 10.1038/s41592-022-01724-8. [DOI] [PubMed] [Google Scholar]
- Lucas M. C.; Pryszcz L. P.; Medina R.; Milenkovic I.; Camacho N.; Marchand V.; Motorin Y.; Ribas de Pouplana L.; Novoa E. M. Quantitative Analysis of tRNA Abundance and Modifications by Nanopore RNA Sequencing. Nat. Biotechnol. 2023, 10.1038/s41587-023-01743-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vilfan I. D.; Tsai Y.-C.; Clark T. A.; Wegener J.; Dai Q.; Yi C.; Pan T.; Turner S. W.; Korlach J. Analysis of RNA Base Modification and Structural Rearrangement by Single-Molecule Real-Time Detection of Reverse Transcription. J. Nanobiotechnology 2013, 11, 8. 10.1186/1477-3155-11-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Levanon E. Y.; Eisenberg E.; Yelin R.; Nemzer S.; Hallegger M.; Shemesh R.; Fligelman Z. Y.; Shoshan A.; Pollock S. R.; Sztybel D.; Olshansky M.; Rechavi G.; Jantsch M. F. Systematic Identification of Abundant A-to-I Editing Sites in the Human Transcriptome. Nat. Biotechnol. 2004, 22, 1001–1005. 10.1038/nbt996. [DOI] [PubMed] [Google Scholar]
- Carmi S.; Borukhov I.; Levanon E. Y. Identification of Widespread Ultra-Edited Human RNAs. PLoS Genet 2011, 7, e1002317 10.1371/journal.pgen.1002317. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schrider D. R.; Gout J.-F.; Hahn M. W. Very Few RNA and DNA Sequence Differences in the Human Transcriptome. PLoS One 2011, 6, e25842 10.1371/journal.pone.0025842. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gustafsson C.; Persson B. C. Identification of the rrmA Gene Encoding the 23S rRNA m1G745 Methyltransferase in Escherichia coli and Characterization of an m1G745-Deficient Mutant. J. Bacteriol. 1998, 180, 359–365. 10.1128/JB.180.2.359-365.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lempereur L.; Nicoloso M.; Riehl N.; Ehresmann C.; Ehresmann B.; Bachellerie J. P. Conformation of Yeast 18S rRNA. Direct Chemical Probing of the 5′ Domain in Ribosomal Subunits and in Deproteinized RNA by Reverse Transcriptase Mapping of Dimethyl Sulfate-Accessible Sites. Nucleic Acids Res. 1985, 13, 8339–8357. 10.1093/nar/13.23.8339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maden B. E.; Corbett M. E.; Heeney P. A.; Pugh K.; Ajuh P. M. Classical and Novel Approaches to the Detection and Localization of the Numerous Modified Nucleotides in Eukaryotic Ribosomal RNA. Biochimie 1995, 77, 22–29. 10.1016/0300-9084(96)88100-4. [DOI] [PubMed] [Google Scholar]
- Dong Z.-W.; Shao P.; Diao L.-T.; Zhou H.; Yu C.-H.; Qu L.-H. RTL-P: A Sensitive Approach for Detecting Sites of 2’-O-Methylation in RNA Molecules. Nucleic Acids Res. 2012, 40, e157 10.1093/nar/gks698. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McCullum E. O.; Williams B. A. R.; Zhang J.; Chaput J. C. Random Mutagenesis by Error-Prone PCR. Methods Mol. Biol. 2010, 634, 103–109. 10.1007/978-1-60761-652-8_7. [DOI] [PubMed] [Google Scholar]
- Findeiss S.; Langenberger D.; Stadler P. F.; Hoffmann S. Traces of Post-Transcriptional RNA Modifications in Deep Sequencing Data. Biol. Chem. 2011, 392, 305–313. 10.1515/bc.2011.043. [DOI] [PubMed] [Google Scholar]
- Ryvkin P.; Leung Y. Y.; Silverman I. M.; Childress M.; Valladares O.; Dragomir I.; Gregory B. D.; Wang L.-S. HAMR: High-Throughput Annotation of Modified Ribonucleotides. RNA 2013, 19, 1684–1692. 10.1261/rna.036806.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kuksa P. P.; Leung Y. Y.; Vandivier L. E.; Anderson Z.; Gregory B. D.; Wang L.-S. In Silico Identification of RNA Modifications from High-Throughput Sequencing Data Using HAMR. Methods Mol. Biol. 2017, 1562, 211–229. 10.1007/978-1-4939-6807-7_14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vandivier L. E.; Anderson Z. D.; Gregory B. D. HAMR: High-Throughput Annotation of Modified Ribonucleotides. Methods Mol. Biol. 2019, 1870, 51–67. 10.1007/978-1-4939-8808-2_4. [DOI] [PubMed] [Google Scholar]
- Kristen M.; Plehn J.; Marchand V.; Friedland K.; Motorin Y.; Helm M.; Werner S. Manganese Ions Individually Alter the Reverse Transcription Signature of Modified Ribonucleosides. Genes 2020, 11, 950. 10.3390/genes11080950. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Incarnato D.; Anselmi F.; Morandi E.; Neri F.; Maldotti M.; Rapelli S.; Parlato C.; Basile G.; Oliviero S. High-Throughput Single-Base Resolution Mapping of RNA 2′-O-Methylated Residues. Nucleic Acids Res. 2017, 45, 1433–1441. 10.1093/nar/gkw810. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhou K. I.; Clark W. C.; Pan D. W.; Eckwahl M. J.; Dai Q.; Pan T. Pseudouridines Have Context-Dependent Mutation and Stop Rates in High-Throughput Sequencing. RNA biology 2018, 15, 892–900. 10.1080/15476286.2018.1462654. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Aschenbrenner J.; Marx A. Direct and Site-Specific Quantification of RNA 2’-O-Methylation by PCR with an Engineered DNA Polymerase. Nucleic Acids Res. 2016, 44, 3495–3502. 10.1093/nar/gkw200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Aschenbrenner J.; Werner S.; Marchand V.; Adam M.; Motorin Y.; Helm M.; Marx A. Engineering of a DNA Polymerase for Direct m6A Sequencing. Angew. Chem., Int. Ed. Engl. 2018, 57, 417–421. 10.1002/anie.201710209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huber L. B.; Kaur N.; Henkel M.; Marchand V.; Motorin Y.; Ehrenhofer-Murray A. E.; Marx A. A Dual-Purpose Polymerase Engineered for Direct Sequencing of Pseudouridine and Queuosine. Nucleic Acids Res. 2023, 51, 3971–3987. 10.1093/nar/gkad177. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Khoddami V.; Yerra A.; Mosbruger T. L.; Fleming A. M.; Burrows C. J.; Cairns B. R. Transcriptome-Wide Profiling of Multiple RNA Modifications Simultaneously at Single-Base Resolution. Proc. Natl. Acad. Sci. U.S.A. 2019, 116, 6784–6789. 10.1073/pnas.1817334116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dai Q.; Zhang L.-S.; Sun H.-L.; Pajdzik K.; Yang L.; Ye C.; Ju C.-W.; Liu S.; Wang Y.; Zheng Z.; Zhang L.; Harada B. T.; Dou X.; Irkliyenko I.; Feng X.; Zhang W.; Pan T.; He C. Quantitative Sequencing Using BID-Seq Uncovers Abundant Pseudouridines in Mammalian mRNA at Base Resolution. Nat. Biotechnol. 2023, 41, 344–354. 10.1038/s41587-022-01505-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang L.-S.; Dai Q.; He C. BID-Seq: The Quantitative and Base-Resolution Sequencing Method for RNA Pseudouridine. ACS Chem. Biol. 2023, 18, 4–6. 10.1021/acschembio.2c00881. [DOI] [PubMed] [Google Scholar]
- Zhang M.; Jiang Z.; Ma Y.; Liu W.; Zhuang Y.; Lu B.; Li K.; Peng J.; Yi C. Quantitative Profiling of Pseudouridylation Landscape in the Human Transcriptome. Nat. Chem. Biol. 2023, 19, 1185. 10.1038/s41589-023-01304-7. [DOI] [PubMed] [Google Scholar]
- Katanski C. D.; Watkins C. P.; Zhang W.; Reyer M.; Miller S.; Pan T. Analysis of Queuosine and 2-Thio tRNA Modifications by High Throughput Sequencing. Nucleic Acids Res. 2022, 50, e99 10.1093/nar/gkac517. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Boccaletto P.; Stefaniak F.; Ray A.; Cappannini A.; Mukherjee S.; Purta E.; Kurkowska M.; Shirvanizadeh N.; Destefanis E.; Groza P.; Avşar G.; Romitelli A.; Pir P.; Dassi E.; Conticello S. G.; Aguilo F.; Bujnicki J. M. MODOMICS: A Database of RNA Modification Pathways. 2021 Update. Nucleic Acids Res. 2022, 50, D231–D235. 10.1093/nar/gkab1083. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chang S. E.; Ish-Horowicz D. Selective Modification of Cytidine, Uridine, Guanosine and Pseudouridine Residues in Escherichia coli Leucine Transfer Ribonucleic Acid. J. Mol. Biol. 1974, 84, 375–388. 10.1016/0022-2836(74)90446-X. [DOI] [PubMed] [Google Scholar]
- Bakin A.; Ofengand J. Four Newly Located Pseudouridylate Residues in Escherichia coli 23S Ribosomal RNA Are All at the Peptidyltransferase Center: Analysis by the Application of a New Sequencing Technique. Biochemistry 1993, 32, 9754–9762. 10.1021/bi00088a030. [DOI] [PubMed] [Google Scholar]
- Bakin A.; Kowalak J. A.; McCloskey J. A.; Ofengand J. The Single Pseudouridine Residue in Escherichia coli 16S RNA Is Located at Position 516. Nucleic Acids Res. 1994, 22, 3681–3684. 10.1093/nar/22.18.3681. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bakin A.; Lane B. G.; Ofengand J. Clustering of Pseudouridine Residues around the Peptidyltransferase Center of Yeast Cytoplasmic and Mitochondrial Ribosomes. Biochemistry 1994, 33, 13475–13483. 10.1021/bi00249a036. [DOI] [PubMed] [Google Scholar]
- Bakin A.; Ofengand J. Four Newly Located Pseudouridylate Residues in Escherichia coli 23S Ribosomal RNA Are All at the Peptidyltransferase Center: Analysis by the Application of a New Sequencing Technique. Biochemistry 1993, 32, 9754–9762. 10.1021/bi00088a030. [DOI] [PubMed] [Google Scholar]
- Carlile T. M.; Rojas-Duran M. F.; Zinshteyn B.; Shin H.; Bartoli K. M.; Gilbert W. V. Pseudouridine Profiling Reveals Regulated mRNA Pseudouridylation in Yeast and Human Cells. Nature 2014, 515, 143–146. 10.1038/nature13802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schwartz S.; Bernstein D. A.; Mumbach M. R.; Jovanovic M.; Herbst R. H.; León-Ricardo B. X.; Engreitz J. M.; Guttman M.; Satija R.; Lander E. S.; Fink G.; Regev A. Transcriptome-Wide Mapping Reveals Widespread Dynamic-Regulated Pseudouridylation of ncRNA and mRNA. Cell 2014, 159, 148–162. 10.1016/j.cell.2014.08.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lovejoy A. F.; Riordan D. P.; Brown P. O. Transcriptome-Wide Mapping of Pseudouridines: Pseudouridine Synthases Modify Specific mRNAs in S. cerevisiae. PLoS One 2014, 9, e110799 10.1371/journal.pone.0110799. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schaefer M.; Pollex T.; Hanna K.; Lyko F. RNA Cytosine Methylation Analysis by Bisulfite Sequencing. Nucleic Acids Res. 2008, 37, e12 10.1093/nar/gkn954. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schaefer M. RNA 5-Methylcytosine Analysis by Bisulfite Sequencing. Methods Enzymol. 2015, 560, 297–329. 10.1016/bs.mie.2015.03.007. [DOI] [PubMed] [Google Scholar]
- Clark W. C.; Evans M. E.; Dominissini D.; Zheng G.; Pan T. tRNA Base Methylation Identification and Quantification via High-Throughput Sequencing. RNA 2016, 22, 1771–1784. 10.1261/rna.056531.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dai Q.; Zheng G.; Schwartz M. H.; Clark W. C.; Pan T. Selective Enzymatic Demethylation of N2,N2 -Dimethylguanosine in RNA and Its Application in High-Throughput tRNA Sequencing. Angew. Chem., Int. Ed. Engl. 2017, 56, 5017–5020. 10.1002/anie.201700537. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang Y.; Katanski C. D.; Watkins C.; Pan J. N.; Dai Q.; Jiang Z.; Pan T. A High-Throughput Screening Method for Evolving a Demethylase Enzyme with Improved and New Functionalities. Nucleic Acids Res. 2021, 49, e30 10.1093/nar/gkaa1213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peattie D. A. Direct Chemical Method for Sequencing RNA. Proc. Natl. Acad. Sci. U.S.A. 1979, 76, 1760–1764. 10.1073/pnas.76.4.1760. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mazo A. M.; Mashkova T. D.; Avdonina T. A.; Ambartsumyan N. S.; Kisselev L. L. An Improved Rapid Enzymatic Method of RNA Sequencing Using Chemical Modification. Nucleic Acids Res. 1979, 7, 2469–2482. 10.1093/nar/7.8.2469. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peattie D. A.; Gilbert W. Chemical Probes for Higher-Order Structure in RNA. Proc. Natl. Acad. Sci. U. S. A. 1980, 77, 4679–4682. 10.1073/pnas.77.8.4679. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Helm M.; Schmidt-Dengler M. C.; Weber M.; Motorin Y. General Principles for the Detection of Modified Nucleotides in RNA by Specific Reagents. Adv. Biol. (Weinh) 2021, 5, e2100866 10.1002/adbi.202100866. [DOI] [PubMed] [Google Scholar]
- Motorin Y.; Helm M. RNA Nucleotide Methylation: 2021 Update. Wiley Interdiscip Rev. RNA 2022, 13, e1691 10.1002/wrna.1691. [DOI] [PubMed] [Google Scholar]
- Tserovski L.; Marchand V.; Hauenschild R.; Blanloeil-Oillo F.; Helm M.; Motorin Y. High-Throughput Sequencing for 1-Methyladenosine (m1A) Mapping in RNA. Methods 2016, 107, 110–121. 10.1016/j.ymeth.2016.02.012. [DOI] [PubMed] [Google Scholar]
- Werner S.; Galliot A.; Pichot F.; Kemmer T.; Marchand V.; Sednev M. V.; Lence T.; Roignant J.-Y.; König J.; Höbartner C.; Motorin Y.; Hildebrandt A.; Helm M. NOseq: Amplicon Sequencing Evaluation Method for RNA m6A Sites after Chemical Deamination. Nucleic Acids Res. 2021, 49, e23 10.1093/nar/gkaa1173. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mair S.; Erharter K.; Renard E.; Brillet K.; Brunner M.; Lusser A.; Kreutz C.; Ennifar E.; Micura R. Towards a Comprehensive Understanding of RNA Deamination: Synthesis and Properties of Xanthosine-Modified RNA. Nucleic Acids Res. 2022, 50, 6038–6051. 10.1093/nar/gkac477. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu C.; Sun H.; Yi Y.; Shen W.; Li K.; Xiao Y.; Li F.; Li Y.; Hou Y.; Lu B.; Liu W.; Meng H.; Peng J.; Yi C.; Wang J. Absolute Quantification of Single-Base m6A Methylation in the Mammalian Transcriptome Using GLORI. Nat. Biotechnol. 2023, 41, 355–366. 10.1038/s41587-022-01487-9. [DOI] [PubMed] [Google Scholar]
- Wintermeyer W.; Zachau H. G. Tertiary Structure Interactions of 7-Methylguanosine in Yeast tRNAPhe as Studied by Borohydride Reduction. FEBS Lett. 1975, 58, 306–309. 10.1016/0014-5793(75)80285-7. [DOI] [PubMed] [Google Scholar]
- Zueva V. S.; Mankin A. S.; Bogdanov A. A.; Baratova L. A. Specific Fragmentation of tRNA and rRNA at a 7-Methylguanine Residue in the Presence of Methylated Carrier RNA. Eur. J. Biochem. 1985, 146, 679–687. 10.1111/j.1432-1033.1985.tb08704.x. [DOI] [PubMed] [Google Scholar]
- Enroth C.; Poulsen L. D.; Iversen S.; Kirpekar F.; Albrechtsen A.; Vinther J. Detection of Internal N7-Methylguanosine (m7G) RNA Modifications by Mutational Profiling Sequencing. Nucleic Acids Res. 2019, 47, e126 10.1093/nar/gkz736. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Küpfer P. A.; Leumann C. J. The Chemical Stability of Abasic RNA Compared to Abasic DNA. Nucleic Acids Res. 2007, 35, 58–68. 10.1093/nar/gkl948. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wells S. E.; Hughes J. M.; Igel A. H.; Ares M. Use of Dimethyl Sulfate to Probe RNA Structure in Vivo. Methods Enzymol 2000, 318, 479–493. 10.1016/S0076-6879(00)18071-1. [DOI] [PubMed] [Google Scholar]
- Zhang L.-S.; Ju C.-W.; Liu C.; Wei J.; Dai Q.; Chen L.; Ye C.; He C. m7G-Quant-Seq: Quantitative Detection of RNA Internal N7-Methylguanosine. ACS Chem. Biol. 2022, 17, 3306–3312. 10.1021/acschembio.2c00792. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang L.-S.; Liu C.; Ma H.; Dai Q.; Sun H.-L.; Luo G.; Zhang Z.; Zhang L.; Hu L.; Dong X.; He C. Transcriptome-Wide Mapping of Internal N7-Methylguanosine Methylome in Mammalian mRNA. Mol. Cell 2019, 74, 1304–1316. 10.1016/j.molcel.2019.03.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pandolfini L.; Barbieri I.; Bannister A. J.; Hendrick A.; Andrews B.; Webster N.; Murat P.; Mach P.; Brandi R.; Robson S. C.; Migliori V.; Alendar A.; d’Onofrio M.; Balasubramanian S.; Kouzarides T. METTL1 Promotes Let-7 MicroRNA Processing via m7G Methylation. Mol. Cell 2019, 74, 1278–1290. 10.1016/j.molcel.2019.03.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lin S.; Liu Q.; Jiang Y.-Z.; Gregory R. I. Nucleotide Resolution Profiling of m7G tRNA Modification by TRAC-Seq. Nat. Protoc. 2019, 14, 3220–3242. 10.1038/s41596-019-0226-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rainen L.; Stollar B. D. Antibodies Distinguishing between Intact and Alkali-Hydrolyzed 7-Methylguanosine. Nucleic Acids Res. 1978, 5, 4877–4889. 10.1093/nar/5.12.4877. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cerutti P.; Holt J. W.; Miller N. Detection and Determination of 5,6-Dihydrouridine and 4-Thiouridine in Transfer Ribonucleic Acid from Different Sources. J. Mol. Biol. 1968, 34, 505–518. 10.1016/0022-2836(68)90176-9. [DOI] [PubMed] [Google Scholar]
- Marchand V.; Ayadi L.; Bourguignon-Igel V.; Helm M.; Motorin Y. AlkAniline-Seq: A Highly Sensitive and Specific Method for Simultaneous Mapping of 7-Methyl-Guanosine (m7G) and 3-Methyl-Cytosine (m3C) in RNAs by High-Throughput Sequencing. Methods Mol. Biol. 2021, 2298, 77–95. 10.1007/978-1-0716-1374-0_5. [DOI] [PubMed] [Google Scholar]
- Marchand V.; Bourguignon-Igel V.; Helm M.; Motorin Y. Mapping of 7-Methylguanosine (m7G), 3-Methylcytidine (m3C), Dihydrouridine (D) and 5-Hydroxycytidine (ho5C) RNA Modifications by AlkAniline-Seq. Methods Enzymol 2021, 658, 25–47. 10.1016/bs.mie.2021.06.001. [DOI] [PubMed] [Google Scholar]
- Andersen T. E.; Porse B. T.; Kirpekar F. A Novel Partial Modification at C2501 in Escherichia coli 23S Ribosomal RNA. RNA 2004, 10, 907–913. 10.1261/rna.5259404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xing F.; Hiley S. L.; Hughes T. R.; Phizicky E. M. The Specificities of Four Yeast Dihydrouridine Synthases for Cytoplasmic tRNAs. J. Biol. Chem. 2004, 279, 17850–17860. 10.1074/jbc.M401221200. [DOI] [PubMed] [Google Scholar]
- Yan L. L.; Zaher H. S. How Do Cells Cope with RNA Damage and Its Consequences?. J. Biol. Chem. 2019, 294, 15158–15171. 10.1074/jbc.REV119.006513. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Henderson P. T.; Delaney J. C.; Muller J. G.; Neeley W. L.; Tannenbaum S. R.; Burrows C. J.; Essigmann J. M. The Hydantoin Lesions Formed from Oxidation of 7,8-Dihydro-8-Oxoguanine Are Potent Sources of Replication Errors in Vivo. Biochemistry 2003, 42, 9257–9262. 10.1021/bi0347252. [DOI] [PubMed] [Google Scholar]
- Pichot F.; Hogg M. C.; Marchand V.; Bourguignon V.; Jirström E.; Farrell C.; Gibriel H. A.; Prehn J. H. M.; Motorin Y.; Helm M. Quantification of Substoichiometric Modification Reveals Global tsRNA Hypomodification, Preferences for Angiogenin-Mediated tRNA Cleavage, and Idiosyncratic Epitranscriptomes of Human Neuronal Cell-Lines. Comput. Struct Biotechnol J. 2023, 21, 401–417. 10.1016/j.csbj.2022.12.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schöller E.; Marks J.; Marchand V.; Bruckmann A.; Powell C. A.; Reichold M.; Mutti C. D.; Dettmer K.; Feederle R.; Hüttelmaier S.; Helm M.; Oefner P.; Minczuk M.; Motorin Y.; Hafner M.; Meister G. Balancing of Mitochondrial Translation through METTL8-Mediated m3C Modification of Mitochondrial tRNAs. Mol. Cell 2021, 81, 4810–4825. 10.1016/j.molcel.2021.10.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- García-Vílchez R.; Añazco-Guenkova A. M.; Dietmann S.; López J.; Morón-Calvente V.; D’Ambrosi S.; Nombela P.; Zamacola K.; Mendizabal I.; García-Longarte S.; Zabala-Letona A.; Astobiza I.; Fernández S.; Paniagua A.; Miguel-López B.; Marchand V.; Alonso-López D.; Merkel A.; García-Tuñón I.; Ugalde-Olano A.; Loizaga-Iriarte A.; Lacasa-Viscasillas I.; Unda M.; Azkargorta M.; Elortza F.; Bárcena L.; Gonzalez-Lopez M.; Aransay A. M.; Di Domenico T.; Sánchez-Martín M. A.; De Las Rivas J.; Guil S.; Motorin Y.; Helm M.; Pandolfi P. P.; Carracedo A.; Blanco S. METTL1 Promotes Tumorigenesis through tRNA-Derived Fragment Biogenesis in Prostate Cancer. Mol. Cancer 2023, 22, 119. 10.1186/s12943-023-01809-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lei Z.; Yi C. A Radiolabeling-Free, qPCR-Based Method for Locus-Specific Pseudouridine Detection. Angew. Chem., Int. Ed. Engl. 2017, 56, 14878–14882. 10.1002/anie.201708276. [DOI] [PubMed] [Google Scholar]
- Massenet S.; Motorin Y.; Lafontaine D. L.; Hurt E. C.; Grosjean H.; Branlant C. Pseudouridine Mapping in the Saccharomyces cerevisiae Spliceosomal U Small Nuclear RNAs (snRNAs) Reveals That Pseudouridine Synthase Pus1p Exhibits a Dual Substrate Specificity for U2 snRNA and tRNA. Mol. Cell. Biol. 1999, 19, 2142–2154. 10.1128/MCB.19.3.2142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Baumstark T.; Ahlquist P. The Brome Mosaic Virus RNA3 Intergenic Replication Enhancer Folds to Mimic a tRNA TpsiC-Stem Loop and Is Modified in Vivo. RNA 2001, 7, 1652–1670. [PMC free article] [PubMed] [Google Scholar]
- Marchand V.; Bourguignon-Igel V.; Helm M.; Motorin Y. Analysis of Pseudouridines and Other RNA Modifications Using HydraPsiSeq Protocol. Methods 2021, 383. 10.1016/j.ymeth.2021.08.008. [DOI] [PubMed] [Google Scholar]
- Marchand V.; Blanloeil-Oillo F.; Helm M.; Motorin Y. Illumina-Based RiboMethSeq Approach for Mapping of 2’-O-Me Residues in RNA. Nucleic Acids Res. 2016, 44, e135 10.1093/nar/gkw547. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pichot F.; Marchand V.; Ayadi L.; Bourguignon-Igel V.; Helm M.; Motorin Y. Holistic Optimization of Bioinformatic Analysis Pipeline for Detection and Quantification of 2’-O-Methylations in RNA by RiboMethSeq. Front Genet 2020, 11, 38. 10.3389/fgene.2020.00038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cui J.; Liu Q.; Sendinc E.; Shi Y.; Gregory R. I. Nucleotide Resolution Profiling of m3C RNA Modification by HAC-Seq. Nucleic Acids Res. 2021, 49, e27 10.1093/nar/gkaa1186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Henry B. A.; Marchand V.; Schlegel B. T.; Helm M.; Motorin Y.; Lee N. Pseudouridylation of Epstein-Barr Virus Noncoding RNA EBER2 Facilitates Lytic Replication. RNA 2022, 28, 1542–1552. 10.1261/rna.079219.122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McIntyre A. B. R.; Gokhale N. S.; Cerchietti L.; Jaffrey S. R.; Horner S. M.; Mason C. E. Limits in the Detection of m6A Changes Using MeRIP/m6A-Seq. Sci. Rep. 2020, 10, 6590. 10.1038/s41598-020-63355-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wiener D.; Schwartz S. The Epitranscriptome beyond m6A. Nat. Rev. Genet 2021, 22, 119. 10.1038/s41576-020-00295-8. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.