Skip to main content
Frontiers in Genetics logoLink to Frontiers in Genetics
. 2020 Apr 28;11:394. doi: 10.3389/fgene.2020.00394

Direct RNA Sequencing for the Study of Synthesis, Processing, and Degradation of Modified Transcripts

Mattia Furlan 1,2, Iris Tanaka 1, Tommaso Leonardi 1, Stefano de Pretis 1, Mattia Pelizzola 1,*
PMCID: PMC7212349  PMID: 32425981

Abstract

It has been known for a few decades that transcripts can be marked by dozens of different modifications. Yet, we are just at the beginning of charting these marks and understanding their functional impact. High-quality methods were developed for the profiling of some of these marks, and approaches to finely study their impact on specific phases of the RNA life-cycle are available, including RNA metabolic labeling. Thanks to these improvements, the most abundant marks, including N6-methyladenosine, are emerging as important determinants of the fate of marked RNAs. However, we still lack approaches to directly study how the set of marks for a given RNA molecule shape its fate. In this perspective, we first review current leading approaches in the field. Then, we propose an experimental and computational setup, based on direct RNA sequencing and mathematical modeling, to decipher the functional consequences of RNA modifications on the fate of individual RNA molecules and isoforms.

Keywords: RNA modification, m6A, direct RNA sequencing, metabolic labeling, nascent RNA, RNA metabolism, long reads sequencing, nanopore

Introduction

More than a 100 RNA modifications have been identified since the 1950s (Boccaletto et al., 2018). They were first observed in abundant populations of non-coding transcripts (e.g., tRNAs) and in a second moment, due to the improvement of profiling techniques, their pervasive presence was confirmed in coding transcripts (Roundtree et al., 2017). Different modifications were found to co-occur on the same RNA molecule (Jackman and Alfonzo, 2013). In some cases, rather than a mere stochastic effect due to the modification frequency, their co-occurrence suggested reciprocal regulation mechanisms (Xiang et al., 2018).

The N6-methyladenosine (m6A) emerged as one of the most abundant modifications of coding transcripts (Roundtree et al., 2017), and it was shown to be involved in the regulation of various biological processes, including cellular differentiation (Lin and Gregory, 2014; Wang Y. et al., 2014; Chen et al., 2015; Geula et al., 2015; Zhang et al., 2017a), meiosis (Bushkin et al., 2019), heat stress response (Zhou et al., 2015), gametogenesis (Wojtas et al., 2017), and neurons activity (Engel et al., 2018). Furthermore, aberrant m6A patterning was shown to be associated with diseases insurgence and progression (Tong et al., 2018; Ianniello et al., 2019; Yang et al., 2019). A number of effectors were identified that are responsible for m6A deposition (e.g., METTL3 and METTL14) (Liu et al., 2014; Ping et al., 2014; Schwartz et al., 2014), recognition (e.g., members of the YTH domain family) (Luo and Tong, 2014; Xu et al., 2014; Zhu et al., 2014; Xiao et al., 2016), and removal (FTO and ALKBH5) (Jia et al., 2011; Zheng et al., 2013), suggesting that this mark could be dynamically regulated. Genome-wide m6A profiling, through immunoprecipitation with m6A-specific antibodies followed by short-reads RNA sequencing (srRNA-seq), revealed the preferential, while not exclusive, association of the mark with the central adenosine in the RRACH sequence context around the stop codon of messenger RNAs (R = G or A and H = A, C, or U) (Dominissini et al., 2012; Meyer et al., 2012). Notably, m6A marks have been linked to different biological processes depending on their relative position within a transcript, suggesting a context-specific role for this mark (Shi et al., 2019). However, we have only started revealing the rules that determine the preference of the mark for specific bases, and their impact on specific downstream biological processes (Yue et al., 2018). Altogether, m6A was identified as a key determinant of RNA decay (Wang X. et al., 2014) and translation (Wang et al., 2015), while discordant reports were published about its involvement in splicing regulation (Haussmann et al., 2016; Xiao et al., 2016; Bartosovic et al., 2017; Ke et al., 2017; Darnell et al., 2018; Kasowitz et al., 2018; Louloupi et al., 2018).

RNA metabolic labeling (Dolken et al., 2008) emerged as a powerful approach that not only allows to characterize the association of m6A, or other RNA modifications, with nascent transcripts, but also allows to quantify the impact of these marks on the dynamics of all key steps of the RNA life cycle, and specifically on the kinetic rates of RNA synthesis, processing, and degradation. The application of this technique confirmed the role of m6A on the regulation of RNA stability, and suggested its influence on the dynamics of RNA synthesis and processing (Furlan et al., 2019b).

The application of the current leading approaches for profiling RNA modifications, such as m6A, generated important findings about the functional role of these marks (Roundtree et al., 2017). However, these approaches are heavily based on srRNA-seq, and are afflicted by a number of downsides: different methods were developed for various modifications, they only allow to indirectly map the targeted mark, they are poorly suitable for analyses at the level of single molecules and isoforms, they cannot be readily used to profile co-occurring modifications, and they are difficult to be paired with RNA metabolic labeling. In this perspective, we discuss how direct RNA sequencing (such as nanopore-based sequencing of native RNAs) is rapidly emerging as a powerful alternative approach, which has the potential to overcome these issues, bursting the field of epitranscriptomics.

Experimental and Computational Approaches for the Quantification of RNA Kinetic Rates

The state of the art approach to infer the kinetic rates governing the RNA life cycle – synthesis of premature RNA, its processing into mature RNA, and the degradation of the latter – is based on the joint quantitative analysis of total and nascent RNA (Figure 1). While the former is simply obtained through RNA-seq, the latter can be profiled through RNA metabolic labeling. In this technique, a nucleotide carrying an exogenous modification (e.g., 4-thiouridine, 4sU) is provided in the cells’ medium, and is incorporated into nascent transcripts during the labeling time. Thus, the presence of the exogenous modification can be used for the physical (Dolken et al., 2008) or in silico (Baptista and Dölken, 2018) separation of newly synthetized transcripts from pre-existing ones.

FIGURE 1.

FIGURE 1

Quantification of the RNA kinetic rates through RNA metabolic labeling coupled with srRNA-seq. (A) The key steps of the RNA life cycle, and the corresponding RNA kinetic rates: synthesis (k1) of premature RNA, processing (k2) of premature into mature RNA, and degradation (k3) of mature transcripts. (B) Incorporation of the uridine analog 4sU into newly synthetized transcripts. (C) Pre-existing and nascent RNA purification and sequencing through srRNA-seq. (D) Quantification of premature (P), mature (M), and nascent (N) RNA from srRNA-seq reads. (E) RNA life cycle mathematical modeling and quantification of the RNA kinetic rates in the steady-state limit.

Mathematical modeling is then used for the gene-level quantification of RNA kinetic rates, for example as implemented and documented in the INSPEcT R/Bioconductor library (de Pretis et al., 2015; Furlan et al., 2019a). Briefly, when short labeling times are adopted (<1 h), the quantification of nascent RNA for each gene provides a proxy for the rate of synthesis of premature RNA. Then, total RNA-seq reads are used to measure the abundance of premature and mature transcripts: reads that entirely map to one or more exons are used to quantify mature RNA species, and the remaining mapped reads (entirely, or partially, covering introns) are used for the quantification of premature species. Finally, the combination of synthesis rate and premature RNA abundance is used to quantify the rate of processing, while the combination of synthesis rate and mature RNA abundance allows the quantification of degradation rates (Furlan et al., 2019a).

The joint analysis of the information gained from RNA metabolic labeling experiments, together with the profiling of specific RNA modifications, would be extremely powerful for the study of the functional consequences of these marks on specific RNA life cycle steps. However, while the application of metabolic labeling for the profiling of nascent RNA (Dolken et al., 2008) and for the quantification of the RNA kinetic rates (Dolken et al., 2008; Miller et al., 2011; Rabani et al., 2011, 2014; de Pretis et al., 2015; Furlan et al., 2019a) is an established approach, its combination with the profiling of RNA modifications is more problematic. In fact, the joint profiling of nascent and modified RNA requires the identification of at least two RNA modifications: the endogenous mark (e.g., m6A), and the exogenous modification used for the labeling (e.g., 4sU). As we discuss in the following sections, this is a complex task that can be only indirectly implemented through current approaches.

Detection of Rna Modifications Through Short-Reads RNA Sequencing

Numerous protocols based on srRNA-seq were developed for the identification of either endogenous (e.g., m6A) or exogenous (e.g., 4sU) RNA modifications. A first class of methods is based on the enrichment of modified RNAs before the sequencing. This relies either on the use of specific antibodies [e.g., MeRIP-seq for m6A detection (Dominissini et al., 2012; Meyer et al., 2012)], or the use of enzymes involved in the metabolism of the modification [e.g., tRNA methyltransferase DnmA (Muller et al., 2013)], or on the availability of tags such as biotin on the modified residues [e.g., 4sU-based RNA metabolic labeling (Dolken et al., 2008)]. These techniques do not provide neither the exact modification site (they are limited to 100–200 bp resolution), nor a precise quantification of the proportion of modified transcripts (Molinie et al., 2016), despite the development of ad hoc experimental (Sun et al., 2012) and computational (de Pretis et al., 2015) normalization techniques. Indeed, an alternative approach, m6A-LAIC-seq (Molinie et al., 2016) has been developed that relies on spike-ins to provide a precise quantification of the m6A abundance, at the cost of skipping the RNA fragmentation step and losing positional information on the mark. A second class of methodologies is based on the identification of RNA modifications signatures in the retro-transcribed cDNA. One approach belonging to this class exploits the early interruption of retrotranscription at the modification site to produce specific truncation signatures [e.g., ICE-seq for inosine detection (Sakurai et al., 2010)]. Alternative approaches were developed to retro-transcribe the modified bases and their native counterparts to different nucleotides, thus inferring the site of the modification based on specific mismatches in the reads alignment (Baptista and Dölken, 2018). For example, SLAM-seq allows the in silico identification of reads derived from nascent RNAs by inducing the pairing of alkylated 4sU to guanines (Herzog et al., 2017). These methods markedly increase the resolution, but are typically semi-quantitative, suffering from low sensitivity (Neumann et al., 2019). Hybrid techniques were also developed. For example, methylation induced cross-linking and immunoprecipitation (miCLIP) combines m6A-immunoprecipitation with the antibody cross-linking, leading to conversion and truncation events. Their identification in the sequencing results allows the mapping of m6A at base-resolution (Linder et al., 2015). However, this method is affected by low crosslink efficiency, reducing the sensitivity. Recently, two novel approaches were developed that do not rely on immunoprecipitation. MAZTER-seq (Garcia-Campos et al., 2019) allows the quantitative and base-resolution identification of m6A marks, relying on the use of a restriction enzyme that cuts only when the target site is not methylated. As a downside, the mapping is limited to the identification of m6A marks in specific context sites (16% of all expected m6A sites in mammals). DART-seq (Meyer, 2019) recruits APOBEC1 proteins at m6A sites through readers of the YTH family, allowing the identification of the marks by the detection of adjacent C to U mutations. It was used in combination with srRNA-seq, with as little as 10 ng of total RNA, and with long-reads RNA sequencing (lrRNA-seq), leading to single transcript m6A detection. The key downside of this method is the required cells transfection with APOBEC1-YTH fusion protein. Finally, the ability to quantify the abundance of m6A marks remains to be established.

A number of computational tools were developed that are useful for calling RNA modifications on srRNA-seq data, especially tailored toward the analysis of m6A marks in MeRIP-seq datasets. exomePeak, while not originally developed for this task, is one of the most frequently adopted tools for the identification of m6A peaks (Meng et al., 2013). Indeed, a detailed protocol was described for its application on MeRIP-seq datasets (Meng et al., 2014). This tool adopts a sliding window approach with a conditional test relying on Poisson distributions. HEPeak is an HMM-based tool dedicated to the identification of m6A marks, claiming improved sensitivity and specificity compared to exomePeak (Cui et al., 2015). From the same authors, MeTPeak was later proposed that is able to take advantage of the variance across replicates, and models the reads dependency across a region (Cui et al., 2016). A number of tools were developed that are dedicated to differential RNA methylation analysis, including MeTDiff (Cui et al., 2018), FunDMDeep (Zhang S. Y. et al., 2019), and RADAR (Zhang Z. et al., 2019). Finally, m6A viewer is a Java stand alone application that supports detection, analysis, and visualization of m6A marks, the former relying on the previously described tools (Antanaviciute et al., 2017).

Besides the specific limitations of each technique, all available protocols for the profiling of RNA modifications through srRNA-seq share some key limitations. First, they require specific reagents for each modification of interest, which currently limits the profiling to a handful of modifications (Helm and Motorin, 2017). Second, the library preparations, and the sequencing procedure, remove the RNA marks. As a consequence, most available approaches for the modifications profiling are indirect, reducing specificity and sensitivity (Helm and Motorin, 2017). Third, the reduced length of srRNA-seq reads (50–300 bp) is a major obstacle for the analysis of individual RNA molecules, despite the development of methods to infer isoforms expression from these data (Zhang et al., 2017b). As a consequence, the assignment of individual or co-occurring modifications to a given RNA molecule, or even to a given isoform, is not feasible. Fourth, srRNA-seq protocols are not readily applicable to detect two (or more) RNA modifications simultaneously.

Although recent interesting technical advances are starting to appear [e.g., simultaneous detection of N1-methyladenosine, 5-methylcytosine, and pseudouridine (Khoddami et al., 2019)], these methods highly depend on the specific combination of marks. The reasons for this limitation are manifolds. Likely, the methods for the profiling of different modifications should be consecutively applied, and the output of one method could be poorly suitable for the subsequent. For the same reason, a high amount of starting material is likely to be necessary, to avoid capturing only highly expressed transcripts. Alternatively, numerous rounds of PCR would be necessary, introducing amplification biases (Aird et al., 2011; Kebschull and Zador, 2015). The limitations in specificity and sensitivity of each method would combine. Moreover, it would be crucial and cumbersome to develop normalization procedures for the comparison of the results from each approach, possibly based on spike-ins. Finally, it would be hard to keep track of the positional information of each modification.

Things would get even more complicated when, in addition to the mark of interest, the dynamics of RNA metabolism are also of interest, which require the identification of an exogenous modification as second mark. In this case, to quantify the RNA kinetic rates of modified and unmodified RNAs, it would be necessary to quantify all four possible combinations: nascent/modified, nascent/unmodified, pre-existing/modified, and pre-existing/unmodified transcripts (Figure 2). Currently, the best approach to jointly identify 4sU and m6A would be to start by separating nascent and pre-existing RNA using 4sU metabolic labeling and purification (Dolken et al., 2008). Then, for each of these, the m6A-LAIC-seq protocol could be applied to separate m6A methylated RNAs from unmethylated transcripts. At the end, four samples per condition should be prepared and sequenced. This approach is evidently very complex and onerous, it would require a lot of starting material and complicated downstream analyses, including spike-ins based normalization of the datasets. For all these reasons, the most common compromise is to profile m6A, and to perform metabolic labeling through independent experiments (Li et al., 2017; Furlan et al., 2019b). However, this type of approach completely compromises the possibility of a direct quantification of the dynamics of modified and unmodified transcripts, since it only allows to quantify the dynamics of the pool of transcripts for each gene, and then combine this information with the expected degree of modification for that population. Altogether, approaches based on srRNA-seq are increasingly inadequate and could hamper the progress in the field of epitranscriptomics.

FIGURE 2.

FIGURE 2

srRNA-seq based approach to quantify transcripts’ expression levels in all the four possible combinations given by the presence or absence of 4sU and m6A RNA modifications. (A) RNA metabolic labeling, based on the incorporation of 4sU, is applied to separate the nascent portion of the transcriptome from the pre-existing counterpart. (B) m6A-LAIC-seq is applied for both nascent and pre-existing RNAs to separate methylated from unmethylated transcripts. (C) cDNA library preparation and sequencing for: pre-existing unmethylated RNAs, pre-existing methylated RNAs, nascent unmethylated RNAs, and nascent methylated RNAs. (D) In silico reads alignment, counts quantification, and normalization to estimate transcripts’ expression levels across all the four conditions.

Long-Reads Direct Rna Sequencing for the Identification of Modifications in Native RNAs

In the last few years remarkable efforts were dedicated to overcoming the limitations of srRNA-seq based approaches (Stark et al., 2019) for the identification of RNA modifications within individual RNA molecules and isoforms. As a result, few novel sequencing approaches that emerged recently allow rRNA-seq. One platform, PacBio (developed by Pacific Biosciences), exploits a sequencing by synthesis approach mediated by an immobilized polymerase (Eid et al., 2009). Another one, which will be the main focus in the next sections of this perspective, was developed by Oxford Nanopore Technologies (ONT), and consists of an array of thousands of nanopores which allow a flow of ions across a dielectric membrane, thus generating a measurable current. The active translocation of a molecule of nucleic acids (either DNA, cDNA, or RNA) through each pore, mediated by an engineered motor protein, results in a sequence-specific perturbation of the measured current. In turn, this signal can be exploited to infer the corresponding sequence of nucleotides (Kasianowicz et al., 1996; Smith et al., 2015). lrRNA-seq approaches were successfully used to study transcriptional and post-transcriptional regulation in various physiological and disease conditions (De Roeck et al., 2017; Aneichyk et al., 2018; Anvar et al., 2018; Nattestad et al., 2018), including single-cells (Byrne et al., 2017). Focusing on RNAs, these techniques can produce single reads of up to 104 bases, with an average length of almost 1 Kb for ONT (Workman et al., 2018). Hence, in a number of cases, this allows the profiling of full-length RNA molecules, and the fine characterization of their alternative isoforms. This is especially true for mature transcripts, whose median length for human and mouse mRNAs is around 2 Kb [based on the hg19 and mm10 UCSC genome releases (Haeussler et al., 2019)]. Instead, the likelihood of sequencing full-length premature transcripts is lower. Indeed, their median open reading frame length is in the 13–18 Kb range, although co-transcriptional splicing could significantly reduce this figure (it is likely that some intron was already excised before the completion of RNA synthesis).

The direct RNA sequencing approach developed by ONT does not go through the conversion of RNA into cDNA, and does not rely on amplification steps. For these reasons, the RNA modifications are preserved and can induce specific alterations in the current registered by the sequencer (Garalde et al., 2018). Altogether, this approach represents a potential solution to most of the limitations of srRNA-seq discussed above, due to its ability to directly identify any, and possibly multiple, RNA modification in single, full-length molecules. dRNA-seq was recently applied to study the transcriptome of viruses (Moldován et al., 2018; Tombácz et al., 2018; Boldogkõi et al., 2019; Depledge et al., 2019), yeast (Garalde et al., 2018), animals (Jiang et al., 2019; Roach et al., 2019; Smith et al., 2019), and plants (Zhao et al., 2019).

However, a number of limitations characterize the young field of dRNA-seq. First, current dRNA-seq protocols are available only for the sequencing of targeted, non-polyadenylated RNAs (Keller et al., 2018; Smith et al., 2019) or polyadenylated RNAs. This is due to the library preparation protocolos, which typically targets polyA tails or specific 3′ sequences for ligating sequencing adapters anchoring the motor protein. This limitation could be addressed using adapters with random 3′ sequences, with the risk of introducing a bias for recurrent RNA motifs, or through in vitro polyadenylation of transcripts devoid of a polyA-tail (Wongsurawat et al., 2018). Second, while the throughput of dRNA-seq is rapidly growing, it currently compares to the low- or mid-end coverage of srRNA-seq experiments. This could limit the number of detectable transcripts, although, importantly, the abundance of those that can be detected is well correlated with high-coverage srRNA-seq data (Garalde et al., 2018). This issue could be solved in the future by improving the speed of translocation of RNAs across the nanopore, and/or extending the sequencing time by prolonging the pores’ lifetime. Noteworthy, given the same throughput in terms of sequenced bases, lrRNA-seq vs srRNA-seq data have a substantial difference: while the former allows detecting entire transcripts, the latter offers a more unbiased sampling of any RNA fragment, thus also covering a larger portion of the transcriptome (Soneson et al., 2019). This could in part be obviated by a coarse RNA fragmentation before the library preparation, and would also reduce the 3′ coverage bias of dRNA-seq data, whose reads start from a transcript’s 3′ end. A drawback of this approach is that it would compromise the one-to-one correspondence between reads and RNA molecules. Third, the accuracy of base calling on dRNA-seq data is currently significantly lower than srRNA-seq. When base calling errors occur at sites of RNA modification, they are likely due to the inability of the base caller’s to deal with changes in the signal originated by those marks. However, these errors represent a small fraction of incorrect base calls, due to the low number of marks per transcripts (e.g., 2–3 m6A marks per RNA). Hence, reduced base calling accuracy is not considered a major issue in the field of RNA modifications but, on the contrary, represents an opportunity for aiding the identification of modified bases (Liu et al., 2019). Fourth, there could be limitations on the detectability of specific RNA modifications. For example, in the context of RNA metabolic labeling, the ability of dRNA-seq to identify various (exogenous) modified nucleotides was tested (Maier et al., 2019). This revealed that 4sU modified nucleotides, commonly used in metabolic labeling through srRNA-seq, were not compatible with the nanopores, leading to blockages during the sequencing, although this issue was not confirmed in a more recent report (Drexler et al., 2019). Instead, other marks, such as 5-ethynyluridine (5eU), were found to be suitable for these experiments.

In conclusion, this is a young and rapidly evolving research field, based on a highly collaborative research community. Hence, numerous labs are actively involved to find solutions or improvements to all these limitations, which are likely to be fully or partially overcome in the next few years (Rang et al., 2018).

Computational Tools for the Detection of Modifications in Long-Reads Direct RNA Sequencing

Recent and growing literature is available about the footprints left by RNA modifications on dRNA-seq data, and how to exploit them to detect RNA marks (Xu and Seki, 2019). Differences in current levels between native bases and their modified counterparts were reported for m6A, m5C, m7G, and pseudouridine (Garalde et al., 2018; Workman et al., 2018; Smith et al., 2019). Moreover, the increase of base miscalls frequency in concomitance to modified sites were observed next to “A-to-I,” 7-methylguanosine and pseudouridine sites (Workman et al., 2018; Smith et al., 2019). These observations led to the development of specific computational tools for the detection of RNA modifications.

Tombo, an official tool provided by ONT, requires a model of the signal generated by the modification in all possible sequence contexts, to be used as a baseline for the identification of the same mark at single molecule resolution within a new dRNA-seq dataset (Stoiber et al., 2016). Notably, baseline data for 5-methylcytosine marks are included in the tool (Viehweger et al., 2019). Alternatively, data for a condition devoid of modifications can be provided. With a similar approach, Tombo was recently used to identify m6A in yeast with an accuracy of 69% and a recovery of 59%, compared with m6A peaks identified with MeRIP-seq (Liu et al., 2019). Obviating for the need of these positive or negative baseline data, Tombo can be used to compare the signal observed for each k-mer with that of any possible unmodified k-mer, although this approach is affected by high false positive rates.

EpiNano relies on a support vector machine, and exploits the increased frequency of alignment errors and the low base quality caused by the presence of the modification of interest (Liu et al., 2019). The tool is first trained and tested on two sets of in vitro transcribed synthetic RNAs that contain either m6A only or unmodified adenosine only. Its classification performance in the context of the expected m6A RRACH motif was excellent (area under the curve up to 0.944). Rather, the performance decreased when the tool was applied on in vivo yeast data and benchmarked with MeRIP-seq m6A calls for the same conditions (accuracy: 87% and recovery: 32%). In terms of downsides, EpiNano requires prior knowledge on the sequence motif for the mark of interest, and it cannot achieve single molecule resolution, since it aggregates the information derived from multiple reads alignments.

ELIGOS aims at the unbiased identification of any RNA modification that would impact bases errors frequencies. It relies on the comparison between dRNA-seq of native and cDNA-converted transcripts, the latter used as a reference that is devoid of any mark due to the retro-transcription to cDNA (Wongsurawat et al., 2018). ELIGOS was tested on in vitro fully modified transcripts, rRNAs from various species, and a human lymphoblastoid cell line. Like Tombo, the main downside of ELIGOS is in terms of false positive rates.

A further method for m6A identification that was recently released is called MINES (Lorenz et al., 2019). This software implements a random forest classifier trained on a set of high confidence, experimentally defined, m6A sites within canonical DRACH motifs. This method showed high accuracy and precision, and also has single-isoform, single-base resolution. However, MINES can only predict m6A sites within DRACH motifs, which only comprise a portion of all m6A sites. A further potential limitation is due to the fact that the classifier was trained on m6A sites defined with CLIP and – as such – might suffer of biases similar to those caused by antibody-based methods.

Nano-ID was recently developed for detecting the incorporation of the exogenous mark 5eU into nascent RNA (Maier et al., 2019), implementing the analysis of RNA metabolic labeling on the ONT platform. This tool relies on a neural network trained to distinguish dRNA-seq signal of fully unlabeled from fully labeled RNAs (24 h 5eU labeling time), to classify reads from nascent transcripts, while no positional information on 5eU marks is returned. The results achieved by nano-ID on this test set were very encouraging (area under the curve 0.95), and the tool was applied to infer the isoform-level rates of synthesis and degradation in K562 cells, and how they were affected by heat shock.

Nanocompore is a novel tool recently released, which is based on the comparison of a condition of interest with a condition where the writer for a specific mark was depleted or removed (Leger et al., 2019). The idea is that the removal of the mark leads to a change in the ONT signal, which could be identified through statistical tests by comparing the two conditions. As a result, Nanocompore can provide near base-resolution and single molecule calls for the mark of interest. Alternatively, analogously to ELIGOS, if the baseline condition is depleted of multiple or possibly all marks (e.g., via in vitro transcription), the tool returns the corresponding changes in the signal to identify all marks occurrence, while mark-specific calls are not possible. Advantages and disadvantages of the tools discussed above are reported in Table 1.

TABLE 1.

Comparing strengths and pitfalls of four software packages for m6A detection from Nanopore dRNA-seq data.

EpiNano ELIGOS MINES Nanocompore
Requires training dataset Yes No Yes No
Requires comparison condition No Yes (cDNA) No Yes
Limited to RACH motifs Yes No Yes No
Single nucleotide resolution Yes Yes Yes No
Isoform resolution Yes Yes Yes Yes
Single molecule resolution No No No Yes
Able to distinguish different modifications Yes No Yes Yes

Applying Direct Rna Sequencing to Quantify the Dynamics of Modified RNAs

The recent surge in the number of tools for the identification of specific modifications indicates that the field is quickly progressing. However, a number of improvements are required for the joint analysis of the patterning of an endogenous modification, such as m6A, with the quantification of the corresponding RNA dynamics, via metabolic labeling and profiling of exogenous modifications such as 4sU or 5eU (Figure 3).

FIGURE 3.

FIGURE 3

dRNA-seq based approach to quantify transcripts’ expression levels in all the four possible combinations given by the presence or absence of 5eU and m6A RNA modifications. (A) RNA metabolic labeling, based on the incorporation of 5eU, is applied to mark nascent transcripts, before direct RNA sequencing. (B) Base calling and identification of the two RNA modifications. (C) Reads alignment and in silico separation, according to the presence or absence of each RNA modification, to estimate transcripts’ expression levels across all the four conditions.

First, the modifications have to be profiled at single molecule resolution, a prerequisite for the direct matching of the RNA dynamics with the modification status. This would allow understanding how the RNA kinetic rates are impacted by the presence of a modification, and, potentially, by its patterning (numerosity and position). Notably, the frequency and the specific position of occurrence of the marks is increasingly recognized as an important factor. For example, the fate of RNAs carrying multiple m6A marks was shown to be influenced by a liquid–liquid phase separation processes driven by the binding of readers of the YTH family. Eventually, those transcripts were shown to be targeted to specific cellular compartments, including stress-granules and P-bodies, with important consequences for their translation and stability (Ries et al., 2019).

Second, tools based on supervised machine learning could be preferable in the field, compared to methods for the unsupervised identification of the marks. In fact, various confounding factors could potentially affect direct RNA sequencing data, which could be easier to address in a supervised framework. However, supervised methods require training on sets of modified transcripts, which should be built so that they closely reflect the characteristics of in vivo datasets. For example, for endogenous modifications, rather than producing in vitro fully modified transcripts, the level of modification could be tuned by mixing unmodified and modified nucleotides to match the expected frequency of the mark. For exogenous marks, the approach described in Maier et al. (2019) could be followed, where physiological high-level of incorporation of a modified nucleotides are obtained by its prolonged availability in the cells medium.

Third, the current ONT signal (amplitude and dwell time) is the most direct data type for the identification of the marks, compared to more indirect measurements, such as the error rate. While tools, such as EpiNano, showed a good performance by only using the latter, we would recommend trying to incorporate information from the former. Indeed, indirect measurements could be completely or partially originated by unexpected causes, which could lead to high false positive rates with in vivo datasets.

Fourth, the quantification of RNA dynamics should include the step of premature RNA processing. This is often neglected, by assuming the corresponding rate being constant. However, RNA synthesis and processing are tightly coupled, then when the former is modulated, which often occurs, the latter is also expected to be altered (Neugebauer, 2019). Moreover, recent reports start unveiling the frequency and importance of changes in splicing dynamics (Rabani et al., 2014; de Pretis et al., 2015, 2017; Louloupi et al., 2018; Furlan et al., 2019a; Wachutka et al., 2019). The cost of considering the processing step is two fold: it markedly increases the complexity of the underlying mathematical models, and implies the quantification of the abundance of premature RNA species. The latter is specifically problematic for the ONT platform. Indeed, the library preparation procedure expects transcripts with the polyA tail, which are lacking in premature RNAs. In vitro polyadenylation with m6A could be used for adding m6A-tails to premature transcripts. This would allow the sequencing of premature RNAs, and would preserve the sequencing information about the endogenous tails of mature transcripts, for studies on their functional impact on RNA dynamics.

Fifth, reads from premature RNAs would have to be distinguished from those from mature species. The presence of an endogenous polyA tail would provide a way to computationally identifying reads from mature species. However, this approach would fail for those mRNAs that are not polyadenylated in their endogenous mature form. An alternative criterion is to consider the reads containing introns as premature RNA. This could be problematic in case of intron retention, which in many organisms, including humans, is not infrequent (Chaudhary et al., 2019; Monteuuis et al., 2019). The request of more than one intron in order to classify a read as premature RNA would probably eliminate this issue. Of course, such a strict condition would cause the exclusion of those genes that have less than two introns, which often occurs in some organisms (e.g., yeast or plants). The best criterion could eventually be a mix of the proposed approaches, selected according to the biological system under analysis and the transcripts of interest. For instance, to study mRNA kinetics in mammalian cells, mature RNA could be estimated considering fully spliced, polyadenylated transcripts, while premature RNA could be quantified from the remaining reads, possibly requiring the presence of one or more introns.

Once proficient algorithms for the detection of the endogenous (e.g., m6A) and exogenous (e.g., 5eU) marks at single molecule resolution are in place, they could be used, in series, for the identification of the four possible classes defined by the presence or absence of each modification. The performance of such an approach should be tested on a dataset generated ad hoc. The genesis of reads with both the RNA modifications, or missing only the exogenous mark, is feasible by using or avoiding long-time metabolic labeling, respectively. Instead, reads devoid of both the base analogs can be produced sequencing the corresponding cDNA. It is more difficult to generate transcripts that lack only the endogenous modification, which could be obtained by knocking-out the corresponding writer (for those marks for which this is known). However, genetic compensation (El-Brolosy and Stainier, 2017) or writer’s redundancy could lead to the incomplete depletion of the RNA modification.

Additional Remarks

The study of the impact of RNA modifications on the RNA life cycle dynamics would largely benefit from the development of a unified computational framework. This, starting from long reads dRNA-seq data, should manage the RNA kinetic rates inference, according to their modification status, at the level of individual transcriptional units or specific isoforms.

A convenient starting point could be INSPEcT (de Pretis et al., 2015), a tool developed in our lab for the inference of all RNA kinetic rates (synthesis, processing, and degradation) from srRNA-seq data. The user should only pay attention to quantify premature and mature RNA in both nascent and pre-existing fractions according to the guidelines presented above. Additionally, if the quantification of dynamics at the level of specific isoforms is desired, the analysis should be conducted considering the reads associated with each isoform, rather than those associated with the whole transcriptional unit. Finally, if this analysis is applied independently on the set of modified and unmodified reads, it would allow comparing the kinetic rates among them, as illustrated in Figure 3B.

INSPEcT has been recently extended by implementing a novel approach that allows the inference of synthesis, processing and degradation kinetic rates without nascent RNA profiling (Furlan et al., 2019a). This approach could be an interesting alternative to study the relation between RNA modifications and RNA life cycle dynamics without requiring metabolic labeling and the consequent identification of the exogenous modification. This would also allow studying the impact on RNA dynamics of those modifications that mark the same base targeted by metabolic labeling, such as pseudouridine and 5eU.

In conclusion, a number of recent and on-going technology advancements are significantly facilitating the study of the functional consequences of RNA modifications on the fate of marked transcripts. In particular, the combined application of RNA metabolic labeling, for the profiling of nascent transcripts and the quantification of the kinetic rates governing the RNA life cycle dynamics, and of long-reads direct RNA sequencing, is particularly promising. Indeed, they promise to deliver data of unprecedented quality and resolution, and should allow studying the impact of RNA modifications at the level of individual molecules and isoforms.

Author Contributions

MF and MP conceived the study. MF led the writing and produced the figures. MP supervised the study and the writing of the manuscript. All authors contributed discussing and writing the manuscript.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Footnotes

Funding. This article is based upon work from COST Action EPITRAN (CA16120), supported by COST (European Cooperation in Science and Technology).

References

  1. Aird D., Ross M. G., Chen W.-S., Danielsson M., Fennell T., Russ C., et al. (2011). Analyzing and minimizing PCR amplification bias in Illumina sequencing libraries. Genome Biol. 12:R18. 10.1186/gb-2011-12-2-r18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Aneichyk T., Hendriks W. T., Yadav R., Shin D., Gao D., Vaine C. A., et al. (2018). Dissecting the causal mechanism of X-linked dystonia-parkinsonism by integrating genome and transcriptome assembly. Cell 172 897–909.e21. 10.1016/j.cell.2018.02.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Antanaviciute A., Baquero-Perez B., Watson C. M., Harrison S. M., Lascelles C., Crinnion L., et al. (2017). m6aViewer: software for the detection, analysis, and visualization of N6 -methyladenosine peaks from m6A-seq/ME-RIP sequencing data. RNA 23 1493–1501. 10.1261/rna.058206.116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Anvar S. Y., Allard G., Tseng E., Sheynkman G. M., de Klerk E., Vermaat M., et al. (2018). Full-length mRNA sequencing uncovers a widespread coupling between transcription initiation and mRNA processing. Genome Biol. 19:46. 10.1186/s13059-018-1418-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Baptista M. A. P., Dölken L. (2018). RNA dynamics revealed by metabolic RNA labeling and biochemical nucleoside conversions. Nat. Methods 15 171–172. 10.1038/nmeth.4608 [DOI] [PubMed] [Google Scholar]
  6. Bartosovic M., Molares H. C., Gregorova P., Hrossova D., Kudla G., Vanacova S. (2017). N6-methyladenosine demethylase FTO targets pre-mRNAs and regulates alternative splicing and 3′-end processing. Nucleic Acids Res. 45 11356–11370. 10.1093/nar/gkx778 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Boccaletto P., Machnicka M. A., Purta E., Pia̧tkowski P., Bagiński B., Wirecki T. K., et al. (2018). MODOMICS: a database of RNA modification pathways. 2017 update. Nucleic Acids Res. 46 D303–D307. 10.1093/nar/gkx1030 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Boldogkõi Z., Moldován N., Balázs Z., Snyder M., Tombácz D. (2019). Long-read sequencing – a powerful tool in viral transcriptome research. Trends Microbiol. 27 578–592. 10.1016/j.tim.2019.01.010 [DOI] [PubMed] [Google Scholar]
  9. Bushkin G. G., Pincus D., Morgan J. T., Richardson K., Lewis C., Chan S. H., et al. (2019). m6A modification of a 3′ UTR site reduces RME1 mRNA levels to promote meiosis. Nat. Commun. 10:3414. 10.1038/s41467-019-11232-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Byrne A., Beaudin A. E., Olsen H. E., Jain M., Cole C., Palmer T., et al. (2017). Nanopore long-read RNAseq reveals widespread transcriptional variation among the surface receptors of individual B cells. Nat. Commun. 8:16027. 10.1038/ncomms16027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Chaudhary S., Khokhar W., Jabre I., Reddy A. S. N., Byrne L. J., Wilson C. M., et al. (2019). Alternative splicing and protein diversity: plants versus animals. Front. Plant Sci. 10:708. 10.3389/fpls.2019.00708 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Chen T., Hao Y.-J., Zhang Y., Li M.-M., Wang M., Han W., et al. (2015). m6A RNA methylation is regulated by microRNAs and promotes reprogramming to pluripotency. Cell Stem Cell 16 289–301. 10.1016/j.stem.2015.01.016 [DOI] [PubMed] [Google Scholar]
  13. Cui X., Meng J., Rao M. K., Chen Y., Huang Y. (2015). HEPeak: an HMM-based exome peak-finding package for RNA epigenome sequencing data. BMC Genomics 16:S2. 10.1186/1471-2164-16-S4-S2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Cui X., Meng J., Zhang S., Chen Y., Huang Y. (2016). A novel algorithm for calling mRNA m 6 A peaks by modeling biological variances in MeRIP-seq data. Bioinformatics 32 i378–i385. 10.1093/bioinformatics/btw281 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Cui X., Zhang L., Meng J., Rao M. K., Chen Y., Huang Y. (2018). MeTDiff: a novel differential RNA methylation analysis for MeRIP-Seq data. IEEE/ACM Trans. Comput. Biol. and Bioinform. 15 526–534. 10.1109/TCBB.2015.2403355 [DOI] [PubMed] [Google Scholar]
  16. Darnell R. B., Ke S., Darnell J. E. (2018). Pre-mRNA processing includes N 6 methylation of adenosine residues that are retained in mRNA exons and the fallacy of “RNA epigenetics.”. RNA 24 262–267. 10.1261/rna.065219.117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. de Pretis S., Kress T., Morelli M. J., Melloni G. E. M., Riva L., Amati B., et al. (2015). INSPEcT: a computational tool to infer mRNA synthesis, processing and degradation dynamics from RNA- and 4sU-seq time course experiments. Bioinformatics 31 2829–2835. 10.1093/bioinformatics/btv288 [DOI] [PubMed] [Google Scholar]
  18. de Pretis S., Kress T. R., Morelli M. J., Sabò A., Locarno C., Verrecchia A., et al. (2017). Integrative analysis of RNA polymerase II and transcriptional dynamics upon MYC activation. Genome Res. 27 1658–1664. 10.1101/gr.226035.117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. De Roeck A., Van den Bossche T., van der Zee J., Verheijen J., De Coster W., Van Dongen J., et al. (2017). Deleterious ABCA7 mutations and transcript rescue mechanisms in early onset Alzheimer’s disease. Acta Neuropathol. 134 475–487. 10.1007/s00401-017-1714-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Depledge D. P., Srinivas K. P., Sadaoka T., Bready D., Mori Y., Placantonakis D. G., et al. (2019). Direct RNA sequencing on nanopore arrays redefines the transcriptional complexity of a viral pathogen. Nat. Commun. 10:754. 10.1038/s41467-019-08734-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Dolken L., Ruzsics Z., Radle B., Friedel C. C., Zimmer R., Mages J., et al. (2008). High-resolution gene expression profiling for simultaneous kinetic parameter analysis of RNA synthesis and decay. RNA 14 1959–1972. 10.1261/rna.1136108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Dominissini D., Moshitch-Moshkovitz S., Schwartz S., Salmon-Divon M., Ungar L., Osenberg S., et al. (2012). Topology of the human and mouse m6A RNA methylomes revealed by m6A-seq. Nature 485 201–206. 10.1038/nature11112 [DOI] [PubMed] [Google Scholar]
  23. Drexler H. L., Choquet K., Churchman L. S. (2019). Splicing kinetics and coordination revealed by direct nascent RNA sequencing through Nanopores. Mol. Cell 77 985–998.e8. 10.1016/j.molcel.2019.11.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Eid J., Fehr A., Gray J., Luong K., Lyle J., Otto G., et al. (2009). Real-time DNA sequencing from single polymerase molecules. Science 323 133–138. 10.1126/science.1162986 [DOI] [PubMed] [Google Scholar]
  25. El-Brolosy M. A., Stainier D. Y. R. (2017). Genetic compensation: a phenomenon in search of mechanisms. PLoS Genet. 13:e1006780. 10.1371/journal.pgen.1006780 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Engel M., Eggert C., Kaplick P. M., Röh S., Tietze L., Namendorf C., et al. (2018). The role of m6A/m-RNA methylation in stress response regulation. Neuron 99 389–403.e9. 10.1016/j.neuron.2018.07.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Furlan M., Galeota E., Del Gaudio N., Dassi E., Caselle M., de Pretis S., et al. (2019a). Genome-wide dynamics of RNA synthesis, processing and degradation without RNA metabolic labeling. bioRxiv [Preprint]. 10.1101/520155 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Furlan M., Galeota E., De Pretis S., Caselle M., Pelizzola M. (2019b). m6A-dependent RNA dynamics in T cell differentiation. Genes 10:28. 10.3390/genes10010028 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Garalde D. R., Snell E. A., Jachimowicz D., Sipos B., Lloyd J. H., Bruce M., et al. (2018). Highly parallel direct RNA sequencing on an array of nanopores. Nat. Methods 15 201–206. 10.1038/nmeth.4577 [DOI] [PubMed] [Google Scholar]
  30. Garcia-Campos M. A., Edelheit S., Toth U., Safra M., Shachar R., Viukov S., et al. (2019). Deciphering the “m6A Code” via antibody-independent quantitative profiling. Cell 178 731–747.e16. 10.1016/j.cell.2019.06.013 [DOI] [PubMed] [Google Scholar]
  31. Geula S., Moshitch-Moshkovitz S., Dominissini D., Mansour A. A., Kol N., Salmon-Divon M., et al. (2015). m 6 A mRNA methylation facilitates resolution of naïve pluripotency toward differentiation. Science 347 1002–1006. 10.1126/science.1261417 [DOI] [PubMed] [Google Scholar]
  32. Haeussler M., Zweig A. S., Tyner C., Speir M. L., Rosenbloom K. R., Raney B. J., et al. (2019). The UCSC genome browser database: 2019 update. Nucleic Acids Res. 47 D853–D858. 10.1093/nar/gky1095 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Haussmann I. U., Bodi Z., Sanchez-Moran E., Mongan N. P., Archer N., Fray R. G., et al. (2016). m6A potentiates Sxl alternative pre-mRNA splicing for robust Drosophila sex determination. Nature 540 301–304. 10.1038/nature20577 [DOI] [PubMed] [Google Scholar]
  34. Helm M., Motorin Y. (2017). Detecting RNA modifications in the epitranscriptome: predict and validate. Nat. Rev. Genet. 18 275–291. 10.1038/nrg.2016.169 [DOI] [PubMed] [Google Scholar]
  35. Herzog V. A., Reichholf B., Neumann T., Rescheneder P., Bhat P., Burkard T. R., et al. (2017). Thiol-linked alkylation of RNA to assess expression dynamics. Nat. Methods 14 1198–1204. 10.1038/nmeth.4435 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Ianniello Z., Paiardini A., Fatica A. (2019). N6-methyladenosine (m6A): a promising new molecular target in acute myeloid leukemia. Front. Oncol. 9:251. 10.3389/fonc.2019.00251 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Jackman J. E., Alfonzo J. D. (2013). Transfer RNA modifications: nature’s combinatorial chemistry playground: transfer RNA modifications. Wiley Interdiscip. Rev. RNA 4 35–48. 10.1002/wrna.1144 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Jia G., Fu Y., Zhao X., Dai Q., Zheng G., Yang Y., et al. (2011). N6-Methyladenosine in nuclear RNA is a major substrate of the obesity-associated FTO. Nat. Chem. Biol. 7 885–887. 10.1038/nchembio.687 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Jiang F., Zhang J., Liu Q., Liu X., Wang H., He J., et al. (2019). Long-read direct RNA sequencing by 5′-Cap capturing reveals the impact of Piwi on the widespread exonization of transposable elements in locusts. RNA Biol. 16 950–959. 10.1080/15476286.2019.1602437 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Kasianowicz J. J., Brandin E., Branton D., Deamer D. W. (1996). Characterization of individual polynucleotide molecules using a membrane channel. Proc. Natl. Acad. Sci. U.S.A. 93 13770–13773. 10.1073/pnas.93.24.13770 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Kasowitz S. D., Ma J., Anderson S. J., Leu N. A., Xu Y., Gregory B. D., et al. (2018). Nuclear m6A reader YTHDC1 regulates alternative polyadenylation and splicing during mouse oocyte development. PLoS Genet. 14:e1007412. 10.1371/journal.pgen.1007412 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Ke S., Pandya-Jones A., Saito Y., Fak J. J., Vågbø C. B., Geula S., et al. (2017). m 6 A mRNA modifications are deposited in nascent pre-mRNA and are not required for splicing but do specify cytoplasmic turnover. Genes Dev. 31 990–1006. 10.1101/gad.301036.117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Kebschull J. M., Zador A. M. (2015). Sources of PCR-induced distortions in high-throughput sequencing data sets. Nucleic Acids Res. 43:e143. 10.1093/nar/gkv717 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Keller M. W., Rambo-Martin B. L., Wilson M. M., Ridenour C. A., Shepard S. S., Stark T. J., et al. (2018). Direct RNA sequencing of the coding complete influenza A virus genome. Sci. Rep. 8:14408. 10.1038/s41598-018-32615-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Khoddami V., Yerra A., Mosbruger T. L., Fleming A. M., Burrows C. J., Cairns B. R. (2019). Transcriptome-wide profiling of multiple RNA modifications simultaneously at single-base resolution. Proc. Natl. Acad. Sci. U.S.A. 116 6784–6789. 10.1073/pnas.1817334116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Leger A., Amaral P. P., Pandolfini L., Capitanchik C., Capraro F., Barbieri I., et al. (2019). RNA modifications detection by comparative Nanopore direct RNA sequencing. bioRxiv [Preprint]. 10.1101/843136 [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Li H.-B., Tong J., Zhu S., Batista P. J., Duffy E. E., Zhao J., et al. (2017). m6A mRNA methylation controls T cell homeostasis by targeting the IL-7/STAT5/SOCS pathways. Nature 548 338–342. 10.1038/nature23450 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Lin S., Gregory R. I. (2014). Methyltransferases modulate RNA stability in embryonic stem cells. Nat. Cell Biol. 16 129–131. 10.1038/ncb2914 [DOI] [PubMed] [Google Scholar]
  49. Linder B., Grozhik A. V., Olarerin-George A. O., Meydan C., Mason C. E., Jaffrey S. R. (2015). Single-nucleotide-resolution mapping of m6A and m6Am throughout the transcriptome. Nat. Methods 12 767–772. 10.1038/nmeth.3453 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Liu H., Begik O., Lucas M. C., Ramirez J. M., Mason C. E., Wiener D., et al. (2019). Accurate detection of m6A RNA modifications in native RNA sequences. Nat. Commun. 10:4079. 10.1038/s41467-019-11713-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Liu J., Yue Y., Han D., Wang X., Fu Y., Zhang L., et al. (2014). A METTL3–METTL14 complex mediates mammalian nuclear RNA N6-adenosine methylation. Nat. Chem. Biol. 10 93–95. 10.1038/nchembio.1432 [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Lorenz D. A., Sathe S., Einstein J. M., Yeo G. W. (2019). Direct RNA sequencing enables m6A detection in endogenous transcript isoforms at base specific resolution. RNA 26 19–28. 10.1261/rna.072785.119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Louloupi A., Ntini E., Conrad T., Ørom U. A. V. (2018). Transient N-6-methyladenosine transcriptome sequencing reveals a regulatory role of m6A in splicing efficiency. Cell Rep. 23 3429–3437. 10.1016/j.celrep.2018.05.077 [DOI] [PubMed] [Google Scholar]
  54. Luo S., Tong L. (2014). Molecular basis for the recognition of methylated adenines in RNA by the eukaryotic YTH domain. Proc. Natl. Acad. Sci. U.S.A. 111 13834–13839. 10.1073/pnas.1412742111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Maier K. C., Gressel S., Cramer P., Schwalb B. (2019). Native molecule sequencing by nano-ID reveals synthesis and stability of RNA isoforms. bioRxiv [Preprint]. 10.1101/601856 [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Meng J., Cui X., Rao M. K., Chen Y., Huang Y. (2013). Exome-based analysis for RNA epigenome sequencing data. Bioinformatics 29 1565–1567. 10.1093/bioinformatics/btt171 [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Meng J., Lu Z., Liu H., Zhang L., Zhang S., Chen Y., et al. (2014). A protocol for RNA methylation differential analysis with MeRIP-Seq data and exomePeak R/Bioconductor package. Methods 69 274–281. 10.1016/j.ymeth.2014.06.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Meyer K. D. (2019). DART-seq: an antibody-free method for global m6A detection. Nat. Methods. 16 1275–1280. 10.1038/s41592-019-0570-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Meyer K. D., Saletore Y., Zumbo P., Elemento O., Mason C. E., Jaffrey S. R. (2012). Comprehensive analysis of mRNA methylation reveals enrichment in 3′ UTRs and near Stop Codons. Cell 149 1635–1646. 10.1016/j.cell.2012.05.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Miller C., Schwalb B., Maier K., Schulz D., Dümcke S., Zacher B., et al. (2011). Dynamic transcriptome analysis measures rates of mRNA synthesis and decay in yeast. Mol. Syst. Biol. 7:458. 10.1038/msb.2010.112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Moldován N., Tombácz D., Szûcs A., Csabai Z., Snyder M., Boldogkõi Z. (2018). Multi-platform sequencing approach reveals a novel transcriptome profile in pseudorabies virus. Front. Microbiol. 8:2708. 10.3389/fmicb.2017.02708 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Molinie B., Wang J., Lim K. S., Hillebrand R., Lu Z., Van Wittenberghe N., et al. (2016). m6A-LAIC-seq reveals the census and complexity of the m6A epitranscriptome. Nat. Methods 13 692–698. 10.1038/nmeth.3898 [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Monteuuis G., Wong J. J. L., Bailey C. G., Schmitz U., Rasko J. E. J. (2019). The changing paradigm of intron retention: regulation, ramifications and recipes. Nucleic Acids Res. 47 11497–11513. 10.1093/nar/gkz1068 [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Muller S., Windhof I. M., Maximov V., Jurkowski T., Jeltsch A., Forstner K. U., et al. (2013). Target recognition, RNA methylation activity and transcriptional regulation of the Dictyostelium discoideum Dnmt2-homologue (DnmA). Nucleic Acids Res. 41 8615–8627. 10.1093/nar/gkt634 [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Nattestad M., Goodwin S., Ng K., Baslan T., Sedlazeck F. J., Rescheneder P., et al. (2018). Complex rearrangements and oncogene amplifications revealed by long-read DNA and RNA sequencing of a breast cancer cell line. Genome Res. 28 1126–1135. 10.1101/gr.231100.117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Neugebauer K. M. (2019). Nascent RNA and the coordination of splicing with transcription. Cold Spring Harb. Perspect. Biol. 11 a032227. 10.1101/cshperspect.a032227 [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Neumann T., Herzog V. A., Muhar M., von Haeseler A., Zuber J., Ameres S. L., et al. (2019). Quantification of experimentally induced nucleotide conversions in high-throughput sequencing datasets. BMC Bioinformatics 20:258. 10.1186/s12859-019-2849-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Ping X.-L., Sun B.-F., Wang L., Xiao W., Yang X., Wang W.-J., et al. (2014). Mammalian WTAP is a regulatory subunit of the RNA N6-methyladenosine methyltransferase. Cell Res. 24 177–189. 10.1038/cr.2014.3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Rabani M., Levin J. Z., Fan L., Adiconis X., Raychowdhury R., Garber M., et al. (2011). Metabolic labeling of RNA uncovers principles of RNA production and degradation dynamics in mammalian cells. Nat. Biotechnol. 29 436–442. 10.1038/nbt.1861 [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Rabani M., Raychowdhury R., Jovanovic M., Rooney M., Stumpo D. J., Pauli A., et al. (2014). High-resolution sequencing and modeling identifies distinct dynamic RNA regulatory strategies. Cell 159 1698–1710. 10.1016/j.cell.2014.11.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Rang F. J., Kloosterman W. P., de Ridder J. (2018). From squiggle to basepair: computational approaches for improving nanopore sequencing read accuracy. Genome Biol. 19:90. 10.1186/s13059-018-1462-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Ries R. J., Zaccara S., Klein P., Olarerin-George A., Namkoong S., Pickering B. F., et al. (2019). m6A enhances the phase separation potential of mRNA. Nature 571 424–428. 10.1038/s41586-019-1374-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Roach N. P., Sadowski N., Alessi A. F., Timp W., Taylor J., Kim J. K. (2019). The full-length transcriptome of C. elegans using direct RNA sequencing. bioRxiv [Preprint]. 10.1101/598763 [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Roundtree I. A., Evans M. E., Pan T., He C. (2017). Dynamic RNA modifications in gene expression regulation. Cell 169 1187–1200. 10.1016/j.cell.2017.05.045 [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Sakurai M., Yano T., Kawabata H., Ueda H., Suzuki T. (2010). Inosine cyanoethylation identifies A-to-I RNA editing sites in the human transcriptome. Nat. Chem. Biol. 6 733–740. 10.1038/nchembio.434 [DOI] [PubMed] [Google Scholar]
  76. Schwartz S., Mumbach M. R., Jovanovic M., Wang T., Maciag K., Bushkin G. G., et al. (2014). Perturbation of m6A writers reveals two distinct classes of mRNA methylation at internal and 5′ Sites. Cell Rep. 8 284–296. 10.1016/j.celrep.2014.05.048 [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Shi H., Wei J., He C. (2019). Where, when, and how: context-dependent functions of RNA methylation writers, readers, and erasers. Mol. Cell 74 640–650. 10.1016/j.molcel.2019.04.025 [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Smith A. M., Abu-Shumays R., Akeson M., Bernick D. L. (2015). Capture, unfolding, and detection of individual tRNA molecules using a nanopore device. Front. Bioeng. Biotechnol. 3:91. 10.3389/fbioe.2015.00091 [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Smith A. M., Jain M., Mulroney L., Garalde D. R., Akeson M. (2019). Reading canonical and modified nucleobases in 16S ribosomal RNA using nanopore native RNA sequencing. PLoS One 14:e0216709. 10.1371/journal.pone.0216709 [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Soneson C., Yao Y., Bratus-Neuenschwander A., Patrignani A., Robinson M. D., Hussain S. (2019). A comprehensive examination of Nanopore native RNA sequencing for characterization of complex transcriptomes. Nat. Commun. 10:3359. 10.1038/s41467-019-11272-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Stark R., Grzelak M., Hadfield J. (2019). RNA sequencing: the teenage years. Nat. Rev. Genet. 20 631–656. 10.1038/s41576-019-0150-2 [DOI] [PubMed] [Google Scholar]
  82. Stoiber M., Quick J., Egan R., Eun Lee J., Celniker S., Neely R. K., et al. (2016). De novo Identification of DNA modifications enabled by genome-guided nanopore signal processing. bioRxiv [Preprint]. 10.1101/094672 [DOI] [Google Scholar]
  83. Sun M., Schwalb B., Schulz D., Pirkl N., Etzold S., Lariviere L., et al. (2012). Comparative dynamic transcriptome analysis (cDTA) reveals mutual feedback between mRNA synthesis and degradation. Genome Res. 22 1350–1359. 10.1101/gr.130161.111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Tombácz D., Prazsák I., Szûcs A., Dénes B., Snyder M., Boldogkõi Z. (2018). Dynamic transcriptome profiling dataset of vaccinia virus obtained from long-read sequencing techniques. Gigascience 7:giy139. 10.1093/gigascience/giy139 [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Tong J., Flavell R. A., Li H.-B. (2018). RNA m6A modification and its function in diseases. Front. Med. 12 481–489. 10.1007/s11684-018-0654-8 [DOI] [PubMed] [Google Scholar]
  86. Viehweger A., Krautwurst S., Lamkiewicz K., Madhugiri R., Ziebuhr J., Hölzer M., et al. (2019). Direct RNA nanopore sequencing of full-length coronavirus genomes provides novel insights into structural variants and enables modification analysis. Genome Res. 29 1545–1554. 10.1101/gr.247064.118 [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Wachutka L., Caizzi L., Gagneur J., Cramer P. (2019). Global donor and acceptor splicing site kinetics in human cells. eLife 8:e45056. 10.7554/eLife.45056 [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Wang X., Lu Z., Gomez A., Hon G. C., Yue Y., Han D., et al. (2014). N6-methyladenosine-dependent regulation of messenger RNA stability. Nature 505 117–120. 10.1038/nature12730 [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Wang X., Zhao B. S., Roundtree I. A., Lu Z., Han D., Ma H., et al. (2015). N6-methyladenosine modulates messenger RNA translation efficiency. Cell 161 1388–1399. 10.1016/j.cell.2015.05.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Wang Y., Li Y., Toth J. I., Petroski M. D., Zhang Z., Zhao J. C. (2014). N6-methyladenosine modification destabilizes developmental regulators in embryonic stem cells. Nat. Cell Biol. 16 191–198. 10.1038/ncb2902 [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Wojtas M. N., Pandey R. R., Mendel M., Homolka D., Sachidanandam R., Pillai R. S. (2017). Regulation of m6A Transcripts by the 3′→5′ RNA Helicase YTHDC2 Is essential for a successful meiotic program in the mammalian germline. Mol. Cell 68 374–387.e12. 10.1016/j.molcel.2017.09.021 [DOI] [PubMed] [Google Scholar]
  92. Wongsurawat T., Jenjaroenpun P., Wassenaar T. M., Wadley T. D., Wanchai V., Akel N. S., et al. (2018). Decoding the epitranscriptional landscape from native RNA sequences. bioRxiv [Preprint]. 10.1101/487819 [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Workman R. E., Tang A. D., Tang P. S., Jain M., Tyson J. R., Zuzarte P. C., et al. (2018). Nanopore native RNA sequencing of a human poly(A) transcriptome. bioRxiv [Preprint]. 10.1101/459529 [DOI] [PMC free article] [PubMed] [Google Scholar]
  94. Xiang J.-F., Yang Q., Liu C.-X., Wu M., Chen L.-L., Yang L. (2018). N6-methyladenosines modulate A-to-I RNA editing. Mol. Cell 69 126–135.e6. 10.1016/j.molcel.2017.12.006 [DOI] [PubMed] [Google Scholar]
  95. Xiao W., Adhikari S., Dahal U., Chen Y.-S., Hao Y.-J., Sun B.-F., et al. (2016). Nuclear m6A reader YTHDC1 regulates mRNA splicing. Mol. Cell 61 507–519. 10.1016/j.molcel.2016.01.012 [DOI] [PubMed] [Google Scholar]
  96. Xu C., Wang X., Liu K., Roundtree I. A., Tempel W., Li Y., et al. (2014). Structural basis for selective binding of m6A RNA by the YTHDC1 YTH domain. Nat. Chem. Biol. 10 927–929. 10.1038/nchembio.1654 [DOI] [PubMed] [Google Scholar]
  97. Xu L., Seki M. (2019). Recent advances in the detection of base modifications using the Nanopore sequencer. J. Hum. Genet 65 25–33. 10.1038/s10038-019-0679-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Yang S., Wei J., Cui Y.-H., Park G., Shah P., Deng Y., et al. (2019). m6A mRNA demethylase FTO regulates melanoma tumorigenicity and response to anti-PD-1 blockade. Nat. Commun. 10:2782. 10.1038/s41467-019-10669-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  99. Yue Y., Liu J., Cui X., Cao J., Luo G., Zhang Z., et al. (2018). VIRMA mediates preferential m6A mRNA methylation in 3′UTR and near stop codon and associates with alternative polyadenylation. Cell Discov. 4:10. 10.1038/s41421-018-0019-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  100. Zhang C., Chen Y., Sun B., Wang L., Yang Y., Ma D., et al. (2017a). m6A modulates haematopoietic stem and progenitor cell specification. Nature 549 273–276. 10.1038/nature23883 [DOI] [PubMed] [Google Scholar]
  101. Zhang C., Zhang B., Lin L.-L., Zhao S. (2017b). Evaluation and comparison of computational tools for RNA-seq isoform quantification. BMC Genomics 18:583. 10.1186/s12864-017-4002-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Zhang S.-Y., Zhang S.-W., Fan X.-N., Zhang T., Meng J., Huang Y. (2019). FunDMDeep-m6A: identification and prioritization of functional differential m6A methylation genes. Bioinformatics 35 i90–i98. 10.1093/bioinformatics/btz316 [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Zhang Z., Zhan Q., Eckert M., Zhu A., Chryplewicz A., De Jesus D. F., et al. (2019). RADAR: differential analysis of MeRIP-seq data with a random effect model. Genome Biol. 20:294. 10.1186/s13059-019-1915-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Zhao L., Zhang H., Kohnen M. V., Prasad K. V. S. K., Gu L., Reddy A. S. N. (2019). Analysis of transcriptome and epitranscriptome in plants using PacBio Iso-Seq and nanopore-based direct RNA sequencing. Front. Genet. 10:253. 10.3389/fgene.2019.00253 [DOI] [PMC free article] [PubMed] [Google Scholar]
  105. Zheng G., Dahl J. A., Niu Y., Fedorcsak P., Huang C.-M., Li C. J., et al. (2013). ALKBH5 is a mammalian RNA demethylase that impacts RNA metabolism and mouse fertility. Mol. Cell 49 18–29. 10.1016/j.molcel.2012.10.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  106. Zhou J., Wan J., Gao X., Zhang X., Jaffrey S. R., Qian S.-B. (2015). Dynamic m6A mRNA methylation directs translational control of heat shock response. Nature 526 591–594. 10.1038/nature15377 [DOI] [PMC free article] [PubMed] [Google Scholar]
  107. Zhu T., Roundtree I. A., Wang P., Wang X., Wang L., Sun C., et al. (2014). Crystal structure of the YTH domain of YTHDF2 reveals mechanism for recognition of N6-methyladenosine. Cell Res. 24 1493–1496. 10.1038/cr.2014.152 [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Frontiers in Genetics are provided here courtesy of Frontiers Media SA

RESOURCES