Abstract
Adenovirus is a nuclear replicating DNA virus reliant on host RNA processing machinery. Processing and metabolism of cellular RNAs can be regulated by METTL3, which catalyzes the addition of N6-methyladenosine (m6A) to mRNAs. While m6A-modified adenoviral RNAs have been previously detected, the location and function of this mark within the infectious cycle is unknown. Since the complex adenovirus transcriptome includes overlapping spliced units that would impede accurate m6A mapping using short-read sequencing, here we profile m6A within the adenovirus transcriptome using a combination of meRIP-seq and direct RNA long-read sequencing to yield both nucleotide and transcript-resolved m6A detection. Although both early and late viral transcripts contain m6A, depletion of m6A writer METTL3 specifically impacts viral late transcripts by reducing their splicing efficiency. These data showcase a new technique for m6A discovery within individual transcripts at nucleotide resolution, and highlight the role of m6A in regulating splicing of a viral pathogen.
Subject terms: Biological techniques, Computational biology and bioinformatics, Microbiology, Molecular biology
Adenovirus transcripts contain N6-methyladenosine (m6A) modification. Here the authors profile m6A modification sites on adenovirus mRNAs using Illumina meRIP-Seq and nanopore direct RNA sequencing, and showcase a role for m6A in splicing of viral late mRNAs.
Introduction
Adenovirus is a nuclear-replicating DNA virus with a linear double-stranded genome that is dependent on the host cell machinery for productive infection1. To maximize coding capacity of the 36 kilobase genome, adenovirus employs a tightly regulated gene transcription process. Early genes and subsequent late genes are produced from both strands of DNA using cellular RNA polymerase II and the spliceosomal machinery to generate capped, spliced, and polyadenylated messenger RNAs (mRNA). Besides the four canonical ribose nucleosides, adenoviral RNA is also known to contain RNA modifications2,3. RNA modifications comprise a family of over one hundred chemical modifications of nucleic acid that can play important roles in both RNA biogenesis and function4,5. In eukaryotic messenger RNA, N6-methyladenosine (m6A) is the most prevalent RNA modification besides the 7-methylguanosine cap6. This mark has been implicated in regulating multiple processes of RNA maturation, including splicing, polyadenylation, export, translation, and ultimately decay7–12. Current understanding suggests that m6A is added to messenger RNAs co-transcriptionally in the nucleus by recruitment of a writer complex composed of METTL3, METTL14, WTAP, and other accessory proteins to RNA polymerase II13–15. These modified mRNAs are bound by reader proteins such as the YTH family (YTHDC1-2, YTHDF1-3)16,17, various hnRNPs18,19, and the IGF2B20 family of RNA binding proteins, which affect various downstream fates of the modified mRNAs. This mark is reversible through the action of erasers, and FTO and ALKBH5 have been proposed to demethylate m6A8,21,22.
Shortly after the discovery of m6A in cellular RNAs, RNAs from several diverse viruses such as adenovirus, Rous sarcoma virus, simian virus 40, herpes simplex virus, and influenza A virus were also shown to contain m6A in internal regions2,3,23–26. In particular, adenovirus serotype 2 was shown to contain m6A at sites away from the cap2. These marks were added to nuclear pre-mRNA and retained in the fully processed cytoplasmic RNA3. These two studies also predicted adenovirus RNAs to contain on average four m6A modifications per transcript, along with low levels of methyl-5-cytosine, for a total of 1/400 (0.25%) of modified viral nucleotides2,3. However, no subsequent studies have elucidated the functions of m6A modification in adenovirus. With the advent of high-throughput m6A sequencing methods, interest in viral RNA modifications has been rekindled. RNA viruses such as HIV, influenza A, picornavirus, and various Flaviviridae including Zika, dengue, and hepatitis C virus are influenced both positively and negatively by m6A added via METTL3, and many of these viral RNAs are bound by cytoplasmic YTHDF proteins27–34. In hepatitis B virus, m6A at the same site can both stimulate reverse transcription, as well as reduce mRNA stability35. For DNA viruses such as SV40 and KSHV, deposition of m6A on viral RNA transcripts can enhance viral replication36–39. Interestingly, multiple labs have published conflicting functions for m6A within the same viral transcript of KSHV, which suggests cell type specific roles39. Of note, recent work using human cytomegalovirus also implicates m6A in controlling aspects of the interferon response, thereby indirectly regulating viral infection40,41. Since adenovirus is reliant on cellular polymerases and mechanisms to generate and process its viral RNAs, adenovirus infection provides an excellent opportunity to study the consequences of co-transcriptional m6A addition.
Until recently, sequencing methods to map m6A have relied on antibody-based immunoprecipitations to enrich for methylated RNA within a relatively large nucleotide window (methylated RNA immunoprecipitation sequencing, meRIP-seq or m6A-seq)42,43. These techniques are indirect, because antibody-precipitated RNA has to be converted to cDNA before sequencing. Although other RNA modifications can be located due to mutations or truncations resulting from reverse transcription44–46, these events are not generated in the case of m6A due to efficient base pairing with thymine and uracil. Several techniques have circumvented some of these limitations, such as photo-crosslinking assisted m6A sequencing (PA-m6A-Seq)47, m6A individual nucleotide resolution crosslinking and immunoprecipitation (miCLIP)48,49, and RNA digestion via m6A sensitive RNase (MAZTER-seq)50. In general, these methods are labor intensive, and require either specialized chemical addition to cell culture, large amounts of input material, or higher unique read counts than meRIP-seq. Furthermore, the antibodies used to precipitate m6A may themselves have sequence or structure biases, and cannot distinguish between m6A and the similar modifications m6Am22,51. To this end, the ability to sequence native RNA molecules directly using nanopore arrays provides a new approach to locate RNA modifications. While detecting modified DNA nucleotides is possible using both PacBio and Oxford Nanopore Technologies platforms52,53, detection of RNA modifications has proven much more challenging. Recently, two groups have shown detection of m6A using nanopores in yeast total RNA and in human cell lines54,55. In addition to detecting RNA modifications directly, production of long reads by these platforms provides distinct advantages in the study of gene-dense viral genomes, which encode complex and often overlapping sets of transcripts56. To date, the ability to use direct RNA sequencing to map full-length transcripts and their RNA modifications unambiguously has not been realized.
In this study, we found that adenovirus infection does not alter expression of m6A-interacting enzymes but instead concentrates these host proteins at sites of nascent viral RNA synthesis. While meRIP-Seq was able to identify numerous methylated regions on both early and late kinetic classes of viral mRNA, the complex splicing structure and overlapping nature of the adenovirus transcriptome precluded unambiguous transcript assignments and m6A localization by this method alone. To overcome this limitation, we developed a method to predict sites of m6A modification at single-base resolution within full-length RNA by direct RNA sequencing and used this technique to predict m6A specific to transcript isoforms. While we found that both viral early and late genes are marked by m6A, expression of viral late RNAs in particular decreased dramatically with loss of the cellular m6A writer METTL3. This late gene-biased effect was primarily mediated by decreased RNA splicing efficiency in the absence of METTL3, and could be extended to all of the multiply spliced adenovirus late RNAs. Overall, these results highlight a new technological advancement in long-read RNA sequencing, and reveal that m6A influences the splicing and expression from a viral pathogen.
Results
Nuclear m6A-interacting factors concentrate at viral RNA during adenovirus infection
While it is known that adenovirus RNA transcripts contain m6A, the impact of infection on cellular components involved in writing, reading, or erasing m6A is unknown. The majority of m6A on messenger RNA is installed co-transcriptionally in the nucleus by a writer complex composed minimally of METTL3, METTL14, and WTAP that associates directly with RNA Pol II13–15. To examine whether m6A-interacting enzymes are altered during adenovirus infection, we performed immunoblot analysis over a time-course of infection with adenovirus serotype 5 (Ad5) in A549 lung adenocarcinoma cells (Fig. 1a). Over the course of infection, levels of the assayed writers (METTL3, METTL14, and WTAP) and readers (YTHDC1, YTHDF1, and YTHDF2) remain unchanged. There was a modest increase in levels of both purported erasers, FTO and ALKBH5, including the appearance of a faster migrating band detected with the ALKBH5 antibody.
Adenovirus is known to recruit specific cellular factors to viral replication centers or mislocalize antiviral cellular factors57,58. To determine if localization of m6A-interacting factors were similarly altered, we performed indirect immunofluorescence microscopy to localize cellular proteins, as well including an antibody against the viral DNA binding protein (DBP) to demarcate viral replication centers (Fig. 1b). When comparing mock-infected A549 cells to cells infected with Ad5 for 18 h, we observed that METTL3, METTL14, WTAP, and YTHDC1 relocalized from their diffuse nuclear pattern into ring-like structures surrounding the sites of viral DNA replication marked by DBP. These structures have been previously characterized as sites of viral RNA transcription59, and were consistent with staining for phospho-serine 2 on the RNA Pol II C-terminal domain, a marker of actively transcribing polymerase (Fig. 1c). The localization of cytoplasmic readers (YTHDF1, YTHDF2) and demethylases (ALKBH5 and FTO) was mostly unchanged. These data highlight that while adenovirus does not significantly alter expression levels of known m6A writing enzymes, these nuclear proteins are concentrated at sites of viral RNA synthesis, and not actively excluded or mislocalized, during infection.
Adenovirus transcripts contain METTL3-dependent m6A modifications
While it is known that adenovirus mRNAs contain m6A2,3, it is not known exactly where these marks are located or whether adenovirus infection affects m6A localization within host transcripts. To address these questions, we performed meRIP-seq on poly(A)-selected RNA from A549 cells that were mock-infected or infected with Ad5 for 24 h (Supplementary Data 1). Strand-specific sequencing was performed on fragmented RNA immunoprecipitated (IP) with an anti-m6A antibody, as well as on total input RNA. We used the MACS2 algorithm to call peaks in IP over input reads for both viral and cellular transcriptomes across three biological replicates (Fig. 2a and Supplementary Data 2). HOMER motif analysis revealed the characteristic DRACH signature (Where D = A/G/U, R = A/G, and H = A/U/C) as the highest ranked motif in cellular m6A peaks in both mock and infected conditions, indicating that the immunoprecipitation was successful and that use of the canonical m6A motif was unperturbed by infection (Fig. 2b). Furthermore, the general location of m6A peaks in cellular transcripts was unchanged during infection and showed the characteristic stop codon/3’UTR bias in mRNA metagene plots (Fig. 2c). We next focused on m6A addition to viral transcripts and identified 19 peaks in the forward transcripts and 6 in the reverse transcripts (Fig. 2a). While these peaks covered every viral kinetic class and transcriptional unit produced by the virus, the MACS2-generated peak areas were very broad, and it was impossible to identify which of many overlapping viral transcripts were m6A methylated. Multiple peak callers were used, but ultimately none were successful in deconvoluting the large peaks due to the nature of overlapping short reads from multiple distinct viral transcripts. MACS2 peaks were retained for downstream comparison due to the robust peak calling ability of this software as tested using other meRIP-seq datasets60.
We next asked whether viral RNAs were m6A-methylated by cellular enzyme METTL3. To achieve this, METTL3 was knocked down by siRNA prior to Ad5 infection of A549 cells and meRIP followed by qRT-PCR of total RNA was performed at 24 hs post-infection (Fig. 2d, e). These results were normalized for the amount of input RNA, and demonstrate that the amount of m6A-marked RNA available for immunoprecipitation were all reduced in a METTL3-dependent manner for the early viral transcript DBP and late viral RNAs generated from the Major Late Promoter (MLP), as well as the positive control cellular transcript MALAT1. These data indicate that viral early and late transcriptional units contain m6A, and that this modification is added by the cellular writer complex that includes METTL3. However, the exact location of the m6A mark could not be assessed by this approach.
A statistical framework predicts sites of m6A methylation using direct RNA sequencing
To address shortcomings of short read-based m6A-sequencing platforms, we sought a technique that would provide both single nucleotide resolution, as well as long read length to allow for unambiguous assignment of m6A sites to specific adenovirus mature transcripts. Nanopore sequencing has been used to call DNA modifications directly using differences between measured and expected current values as nucleotides travel through the pore53. It is well established that signal deviations during direct RNA sequencing can result from the presence of one or more base modifications, and that this leads to an increase in base-call error rate around the modified base54,61. The likelihood of a given nucleotide being modified (i.e., carrying an m6A mark) can be assessed by a 2 × 5 contingency table to examine the distribution of base-calls between two datasets (e.g., m6A positive and m6A negative) at a given genome position, as was recently demonstrated62. Here, a G-test on the distribution of A, C, G, U, and indels provides a score and p-value that requires subsequent (Bonferroni) correction for multiple testing (Fig. S1a). This approach is codified in the software package DRUMMER (https://github.com/DepledgeLab/DRUMMER).
To produce m6A positive and negative datasets, we generated METTL3 knockout A549 cells using CRISPR-Cas9 with a strategy that included regulated expression of a nuclease-insensitive transgene (Fig. 2f). We performed direct RNA sequencing using two biological replicates each of RNA collected from parental wild-type A549 cells (WT) and METTL3 knockout A549 cells (M3KO), each infected with Ad5 for 24 h (Fig. 3a and Supplementary Data 3). Since the datasets are unlinked, each METTL3 KO dataset was compared to each WT parental dataset, yielding four distinct comparisons (Fig. S1b). Each comparison yielded between 335 and 452 candidate sites with significant G-test statistics (Supplementary Data 4). To account for only the positions at which the base error rate was greater in the WT (m6A positive) dataset, we filtered for a one-fold or greater increase in the ratio of mismatch:match base-calls compared to the M3KO (m6A negative) dataset. This reduced the number of putative sites to 191–217 (Fig. S1b). Reasoning that multiple sites proximal to a single m6A modification could show significant differences (adjusted p < 0.01 in error rate), we next calculated the distance between each candidate site and its nearest neighbor candidate in a strand-specific manner (Fig. S1c). We determined that the majority of candidate sites had at least one neighboring m6A candidate site within five nucleotides, so we implemented an additional filtering strategy to collapse all clustered candidates to one candidate site (Fig. 3b), retaining only the candidate site with the highest G-test statistic. We subsequently plotted the distance (number of nucleotides) from the identified base to the nearest upstream or downstream AC motif (the minimal possible m6A motif). Most significant candidate sites filtered for mismatch:match rates (94.2–99%) were located within five nucleotides of an AC motif (Fig. 3c), the maximum distance at which a modified base is thought to affect basecall error61. Masking reduced this fraction slightly (92.7–98.9%, Fig. 3c) and yielded 89–111 distinct predicted m6A sites, of which 53 were conserved across all four dataset comparisons (Fig. 3d, highlighted by dark blue line in Fig. 2a). The majority (83.1%) of these 53 sites mapped directly to adenines in AC motifs, with the remainder mapping within four nucleotides (Fig. 3c). When random non-candidate nucleotides were selected from our dataset we found that the distance to the nearest AC motif was much greater than for m6A-candidate sites, indicating that this was not due to random chance (Fig. S1d). We subsequently extracted the seven base sequence centered on a collapsed candidate m6A site and generated a sequence logo (Fig. S1e) that closely matched the m6A DRACH logo that we confirmed for m6A modifications in the human transcriptome. Since all m6A candidate sites mapped to within five nucleotides of nearby AC, subsequent shifting of the motif center to the closest AC revealed that only 13% of sites did not perfectly recapitulate the DRACH motif (Fig. 3e). Of these non-canonical motifs, three were DRACG and the remaining four were GUACU. The overlap between antibody-based meRIP predicted regions and our dRNA-based approach was significantly higher than expected by chance (Fig. S1f-g). Concordance between the predicted location of these m6A sites and the previously established motif supports the validity of this unbiased mapping approach.
Exome versus isoform-level m6A analysis
After identifying putative m6A-modified bases within the viral RNA exome using direct RNA-seq, we extended our approach to transcript isoform-level analysis. Here, we aligned our nanopore sequencing reads against 75 distinct transcripts derived from the recently re-annotated adenovirus Ad5 genome63 and observed that the aligner (MiniMap264) produced secondary alignments for many reads and supplementary alignments for a smaller subset of reads (Fig. S2a). Secondary alignments indicate that a region of a given sequence read aligns with high confidence against two or more distinct transcripts while supplementary alignments indicate potentially chimeric reads where two segments of the same read align to separate overlapping transcripts (Fig. 4a). Given the possibility that overlapping transcript isoforms may share the same DRACN motifs but still undergo differential methylation, we reasoned that reads with multiple alignments could reduce sensitivity of detection. We thus retained only reads that produced unique alignments with mapping qualities >0 and had no insertions greater than 20 nucleotides, with the latter requirement intended to exclude reads from incompletely spliced RNAs. While this approach discarded 63–69% of our sequence reads (Fig. S2a), the specificity achieved by unambiguous isoform-level assignment of sequence reads allowed us to identify 747 putative m6A-modified bases across 47 transcripts. This translated to 352 exome-level sites, of which 204 were conserved across all pairwise comparisons (Fig. 4b and Supplementary Data 5, highlighted by magenta lines in Fig. 2a). Isoform-level analysis recapitulated the majority of exome-level sites identified above (47/53), while identifying an additional 50 m6A sites (Fig. 4c). This represents a four-fold increase when compared to the equivalent exome-level analysis, demonstrating the greater sensitivity of the isoform-level approach. Furthermore, this strategy allowed us to detect transcripts that had unique m6A sites, even compared to overlapping transcripts that shared the same potential DRACH motifs (Fig. 4d). To validate our data further, we also identified putative m6A sites using NanoCompore which performs comparative interrogation of signal level direct RNA-Seq data (dwell time and current intensity) to predict modified nucleotides65. Using the same datasets as inputs, we identified 204 putative m6A sites at the isoform level: 93 of these were also reported by DRUMMER in all four comparisons, while a further 69 were also reported by DRUMMER in 1-3 comparisons. This left just 41 sites predicted by NanoCompore that were not identified by DRUMMER compared to 42 sites predicted by DRUMMER but not NanoCompore (Fig. S2b, c). Overall, these data highlight a novel technique that reveals m6A marks at both single nucleotide and isoform-specific levels, greatly improving our ability to map m6A modifications across complex viral transcriptomes.
Loss of METTL3 and m6A methylation differentially impacts late viral gene expression
Since both viral early and late genes are marked by METTL3-dependent m6A, we asked what role this mark might play in the viral infectious cycle. METTL3 or METTL14 were knocked down by siRNA 48 h prior to infection, a time point that led to stable loss of both RNA and protein expression throughout the subsequent infection (Fig. S3a, b). As reported by others, siRNA-mediated knockdown of METTL3 results in a concomitant loss of METTL14 protein without affecting mRNA, and vice versa14 (Fig. 5a, Fig. S3b). Upon knockdown of either METTL3 or METTL14, we observed reductions of viral late proteins Hexon, Penton, and Fiber. In contrast, the viral early protein DBP was largely unaffected (Fig. 5a), even though the DBP transcript is also marked by m6A (Fig. 2a). When we assessed viral genome amplification by qPCR, we detected only a minimal change after knockdown of METTL3 or METTL14 (Fig. 5b), indicating m6A is not required for early stage adenovirus infection. This contrasts with robust decreases in the number of infectious particles in lysates derived from infected cells knocked down for either METTL3 or METTL14, as measured by plaque assay (Fig. 5c). Consistent with these findings, when we used reverse transcription coupled with quantitative PCR (qRT-PCR) to measure total viral mRNA levels over a time course of infection (Fig. 5d), we observed that early viral genes were only modestly (less than two-fold or not significantly) decreased by METTL3 knockdown, whereas viral late genes were significantly reduced (sometimes greater than 10-fold). To corroborate these findings from siRNA-mediated knockdown, we also assayed specific stages of the viral infectious cycle in METTL3 KO A549 cells. Two independently generated CRISPR-mediated knockout cells were infected with Ad5 for 24 h and assayed for viral early and late genes by immunoblotting and qRT-PCR. Consistent with siRNA findings, accumulation of DBP transcript and protein was not affected, but viral late gene products were significantly reduced in METTL3 KO cells (Fig. S3c, d). Finally, we blocked methylation by treating cells with the small molecule inhibitor 3-Deazadenosine (DAA) at the time of infection. DAA is an S-Adenosylmethionine synthesis inhibitor that has preferential effects on m6A deposition when used at low concentrations66. Although treatment with DAA reduced early gene protein expression and DNA replication more than METTL3 knockdown or knockout, the most robust effects observed were reductions in the amount of viral late gene transcripts, protein production, and plaque forming units (Fig. S3e-h). Overall, these data highlight a preferential effect of m6A addition at late stages of the viral infectious cycle, with viral early gene transcription and genome replication largely unaffected while late RNAs, late proteins, and infectious progeny production are greatly reduced in the absence of m6A.
Two groups have recently highlighted that in primary cells m6A destabilizes the IFNB1 transcript, such that loss of METTL3 results in increased production of IFNβ40,41. When these cells were infected with viruses, including by adenovirus serotype 4, they restricted viral infectivity through indirect priming of the interferon pathway. To determine if loss of METTL3 similarly increased IFNB1 and innate immune-related transcripts in our experiments, we used siRNA to knock down METTL3 for 48 h prior to infection with Ad5 either in the presence or absence of ruxolitinub, a JAK/STAT inhibitor that blocks signaling downstream of type-I interferons such as IFNβ (Fig. S4a). As a positive control for interferon activation and the efficacy of ruxolitinub, uninfected cells were transfected with poly(I:C) as a surrogate viral RNA agonist. Our data demonstrate that in A549 cells, neither type-I interferon (IFNB1) or type-III interferon (IFNL1) transcripts are induced by Ad5 infection or METTL3 depletion, whereas poly(I:C) transfection induces these transcripts several hundred-fold. Furthermore, expression of interferon-stimulated genes such as MX1 and OAS1 was blocked downstream of poly(I:C) transfection by JAK inhibition but not induced by viral infection or METTL3 knockdown. In addition, blocking the interferon pathway with JAK inhibition did not alter the specific decrease in viral late gene we report here (Fig. S4b). Overall these data suggest a specific role for m6A in altering accumulation of late adenoviral RNA transcripts at late stages of the adenoviral infectious cycle, independent of innate immune activation, in the cell types we used.
Cytoplasmic readers and m6A erasers do not affect the adenoviral infectious cycle
Since we found that loss of the m6A writer METTL3 resulted in decreases in viral late gene transcripts and proteins, we next examined whether loss of m6A erasers led to increases in viral protein production. We used siRNA to knockdown either ALKBH5 or FTO in A549 cells and performed a time-course infection with Ad5. There was no defect in either viral early proteins or late proteins with either ALKBH5 or FTO knockdown (Fig. S5a). Furthermore, when assaying viral DNA replication by qPCR or infectious particle production by plaque forming units after knockdown of the two erasers, we saw less than 3-fold or non-significant changes (Fig. S5b, c). Using siRNA in A549 cells, we were able to knock down the cytoplasmic m6A reader proteins YTHDF1-3 efficiently (Fig. S5d) but did not observe changes in viral protein production (Fig. S5e). These results highlight that whatever amount of m6A deposition is necessary for adenoviral function, removal of ALKBH5 and FTO does not increase m6A beyond necessary levels.
Splicing efficiency of late viral RNA is mediated by m6A
We investigated various mechanisms to explain the differential impact of m6A on early and late viral RNAs. To rule out the role of transcription in masking effects of this post-transcriptional modification, we examined early and late promoter activity by profiling nascent RNA transcription with a 4-thiouridine (4sU) labeling approach67. After 24 h infection of A549 cells with Ad5 following knockdown of METTL3, nascent RNA was labeled with 4sU for the last 10 min of infection. Total RNA was harvested, nascent RNA conjugated with biotin, and precipitated for analysis by qRT-PCR with primers to viral transcripts. We examined unspliced regions of viral RNA near the early E1A and E4 regions, as well as within the tripartite leader of the Major Late Promoter (MLP) that precedes nearly all adenovirus late genes. While relative transcription of the E1A promoter was increased two-fold after METTL3 knockdown, a separate early promoter (E4), and the viral late promoter itself showed no significant change (Fig. 6a). RNA half-life was assessed by performing transcriptional shut-off assays with the RNA Pol II elongation inhibitor 5,6-Dichloro-1-β-d-ribofuranosylbenzimidazole (DRB) to assess the stability of viral RNAs. We quantified RNA abundance over an 8 h timecourse in the presence or absence of METTL3, and observed no significant change in RNA decay (Fig. 6b). These data imply that the same amount of late stage viral mRNA are transcribed in adenoviral infections of cells lacking METTL3, and that despite similar spliced RNA stability fewer late viral RNAs still accumulate.
Besides transcription from the MLP, which is dependent on viral DNA replication, the other major difference between early and late viral transcripts is the sheer number of alternative splicing and polyadenylation events that must take place to generate mature mRNAs. While most early transcripts contain only one splice junction and no more than two potential poly(A) sites, the major late transcriptional unit has over 25 splice acceptors and 5 major sites of polyadenylation, leading to over 20 functional open reading frames63. Mounting evidence suggests that m6A and the nuclear reader YTHDC1 can regulate mRNA splicing7,37. We used qRT-PCR to measure spliced and unspliced viral RNAs and determined the splicing efficiency of a particular transcript as a ratio of these two products (Fig. 6c, Fig. S6a-c). We examined the m6A-marked transcripts E1A (early gene) and Fiber (late gene) as representative of their kinetic classes. Of note, Fiber is the only late gene that can be incorporated into this type of qPCR assay, since no other intron amplifying primer set within the late gene transcriptional unit is specific to a single gene. Upon knockdown of METTL3 or WTAP, the splicing efficiency of the E1A gene did not change, whereas the splicing efficiency of Fiber significantly decreased (Fig. 6d). METTL3-dependent loss of late gene splicing efficiency was independent of time post infection (Fig. S6e, f). The effect on late gene splicing, but not early gene splicing, was phenocopied by METTL3 knockout (Fig. 6e, Fig. S6d). Fiber RNA splicing efficiency decrease was also seen by knockdown of the nuclear m6A reader protein YTHDC1 (Fig. 6f). This decreased splicing efficiency appears to explain the total RNA decrease in viral late genes, as well as the decrease in protein expression, since both METTL3 and WTAP knockdown lead to significant losses in viral late gene proteins (Fig. 6g). A similar result was observed with knockdown of YTHDC1, although to a much lesser degree (Fig. S6g, h).
To assay the role of m6A in cis on viral transcript splicing we turned to a transgene splicing assay in which we can ablate m6A sites (Fig. 6h). In this system, we generated a construct with an exogenous plasmid-based promoter to drive expression of an RNA containing the splice donor exon of the viral late tripartite leader, as much intervening Fiber intron as possible without including additional viral genes or splice sites, and the 5’ region of Fiber that encompasses the meRIP-seq peak. This viral cassette was fused to a Renilla luciferase gene in which all m6A DRACH motifs were silently mutated34. These constructs allow transgene expression with wild-type viral context (WT Fiber), or with all 15 DRACH sites present in both exonic and intronic regions silently mutated to ablate deposition of METTL3-dependent m6A (m6A Mut Fiber, mutations shown below in Fig. 6h). Potential intronic sites were also ablated, since both meRIP-seq and direct RNA sequencing were performed on polyadenylated mRNA and would not have detected potential m6A sites within introns. HeLa cells were depleted of METTL3 with siRNA for 48 h before the respective transgene plasmids were transfected into uninfected cells and incubated for a further 24 h. Using this system, splicing efficiency is read out with qRT-PCR using the primers for endogenous Fiber splicing. Upon knockdown of METTL3, the WT Fiber construct showed a decrease in splicing efficiency (Fig. 6i), similar to that observed during viral infection (Fig. 6d). Importantly, m6A Mut Fiber showed a similar decrease in splicing efficiency when compared to the WT Fiber construct during control siRNA knockdown, and had no further decrease upon METTL3 knockdown (Fig. 6i). The same transgene splicing assay was performed following YTHDC1 knockdown (Fig. S6i). A similar decrease in splicing efficiency of the wildtype reporter was observed, with no change in the m6A mutated reporter. There was no change in the accumulation of an unspliced Firefly luciferase RNA expressed from the same plasmid upon METTL3 knockdown (Fig. S6j). Overall, these data suggest m6A positively regulates the splicing reaction of viral late transcripts.
METTL3 knockdown globally dysregulates adenoviral late RNA processing
To determine if m6A positively regulates the splicing of all late transcripts, we returned to two orthologous sequencing technologies, short-read sequencing and long-read direct RNA sequencing. Due to the limitations of short-read mapping to viral genomes with overlapping transcripts, standard expression algorithms such as mapped fragments per kilobase transcript per million base pairs (FPKM) cannot be used to quantify these viral transcripts accurately. Instead, we focused exclusively on the short reads that contained splice junctions, allowing unambiguous assignment to one viral transcript. Since our RNA-seq library preparation was strand-specific, this allowed us to map reads accurately to their strand of origin. Furthermore, we were able to address intron retention of the entire late transcriptional unit by counting only unspliced reads overlapping the major splice donor exiting the conserved third exon of the tripartite leader (intron retention, IR). This strategy is schematized for top-strand viral genes (Fig. 7a). When we applied this technique to adenovirus-infected A549 cells after METTL3 knockdown, we observed that the prevalence of early gene splice junctions was largely unchanged (Fig. 7b). Of these, E1B-19K appears to be an outlier, since the prevalence of this particular splice junction increases as the virus moves into the late stage, and a potential ORF of this transcript encodes for protein IX, an alternate promoter-driven adenovirus late gene. When we performed this analysis on late gene splice sites, we observed that every late gene originating from the major late promoter was significantly decreased, and that intron retention in this transcriptional unit was increased, consistent with our previous results. The only exception we detected was the first possible alternative splice junction encoding L1-52K, which was unchanged. While the increase in intron retention was not as dramatic as the decrease in spliced late transcripts, this can be explained by the relative instability of unspliced RNA transcripts within the nucleus68, and the bias of the poly(A) selection employed for sequencing in capturing mature transcripts.
While short reads targeting splice junctions provided high depth quantitative results, we turned to long-read sequencing with unambiguous transcript mapping as an orthogonal technique to validate these results. We counted full-length RNA transcripts after infection of METTL3 knockdown A549 cells normalized to the read depth of the experiment (Fig. 7c). Confirming the short-read sequencing data, we saw that early transcripts were essentially unaffected by loss of METTL3, while the overall abundance of every viral late transcript, except L1-52K, was decreased. While both short-read and long-read sequencing were performed on poly(A) selected RNA that is biased for detecting mature fully processed transcripts, low levels of incompletely processed RNA can be detected. Since nanopore sequencing starts at the 3’ cleavage and polyadenylation site, we were able to uniquely bin all reads that were cleaved at any of the L1-L5 late gene polyadenylation sites and assay for the presence of intron retention upstream of the first splice acceptor (Fig. S7a). Comparing the METTL3 KO to Wildtype cells or siMETTL3 to siCTRL data revealed evidence of increased intron retention/poor RNA processing in every late transcriptional unit (Fig. S7b). Overall, these data highlight that METTL3 promotes the expression and splicing efficiency of the alternatively spliced viral late transcriptional unit.
Utilizing the power of our long-read dRNA sequencing approach to map m6A sites to unique RNA isoforms, we asked if any specific features of these RNAs correlated with the loss of expression we observed after METTL3 knockdown. The total number of potential m6A sites per transcript was not correlated with change in expression (Spearman ρ = −0.38, Fig. 7d), however the total number of splice sites per transcript and RNA expression were negatively correlated (Spearman ρ = −0.75, Fig. 7e). The strongest negative correlation with METTL3-dependent changes in expression was found to be the length of the longest intron within a specific transcript (Spearman ρ = −0.84, Fig. 7f). Using high-depth short-read sequencing and existing gene annotations to ask similar questions of host cell transcripts upon METTL3 depletion, we found 958 genes increased and 1298 decreased greater than two-fold with an FDR of 0.05% upon siMETTL3 treatment. Focusing on these significantly changed genes, we saw no correlation with the number of m6A regions (defined by meRIP-seq), total number of exons, or the longest intron within a single transcript (data not shown). While these correlations might hold true for specific cellular genes that have features similar to the viral transcriptome (e.g., high expression levels and many alternative splice and polyadenylation sites), global correlations appear confounded by the many diverse roles m6A modification can have on any single transcript.
Discussion
In this work we have identified sites of m6A methylation in both cellular and viral transcripts during infection with adenovirus. Using a conventional antibody-based approach to detect m6A, we discovered that both viral early and late kinetic classes of mRNA were modified. To identify m6A within overlapping viral transcripts, we devised a technique to predict METTL3-dependent methylation sites in full-length RNAs using direct RNA sequencing. This technique identified specific m6A sites at nucleotide resolution within m6A-enriched regions, and revealed adenovirus RNA isoform-dependent sites of methylation. Ultimately, we found that m6A addition in the alternatively spliced viral late transcripts was important for mediating the efficiency of splicing and accumulation of these messages leading to productive infection. The benefit of m6A-mediated splicing efficiency was directly correlated to the intron length of the transcript, implicating a role for m6A in processing of long transcripts with many alternative splicing choices. Taken together, this work demonstrates m6A-mediated control over the splicing of viral transcripts.
We also defined a new method for isoform-specific m6A detection within complex transcriptomes. Compared to conventional antibody-based approaches, direct detection of m6A modification in RNA provides many advantages. Focusing on adenovirus RNAs, we show that our technique is highly reproducible between biological replicates with a very low false-positive rate (here defined as candidates that did not map within five nucleotides of an AC dinucleotide). In fact, after candidate site masking, greater than 80% of all sites mapped directly to the adenine within an AC dinucleotide, with the rest often mapping within four nucleotides. This is consistent with the fact that nanopores read RNA in a five nucleotide window61. While we demonstrate this method by assaying for METTL3-dependent m6A modifications, theoretically this technique could work for other RNA modifications where the writing enzyme is known, such as methyl-5-cytosine and pseudouridine44,45. Compared to other single-nucleotide resolution approaches to m6A detection48–50, direct RNA sequencing avoids the known biases associated with antibody targeting51 and cDNA synthesis42,49 and, unlike endoribonuclease cutting50, is not specific to a small subset of methylated motifs. Currently, our technique works best with samples of high read depth, and is thus ideally suited to the study of viruses that dominate the RNA processing of their host cell. However, with improvements in nanopore sequencing technologies such as the PromethION which can yield much higher read depth, this technique should be scalable to the study of cellular transcripts as well.
The lack of long-read assays to map m6A methylation has hampered the identification of transcript isoform-specific modifications. While our method still does not enable single molecule resolution, the ability to aggregate similar isoforms in bulk using the long reads produced through direct RNA sequencing allows for isoform-level prediction of m6A sites. Indeed, we demonstrate that isoform-level identification was more sensitive for the detection of potential m6A sites in adenovirus mRNA than the aggregate exome-level data by as much as four-fold. While isoform-level alignment and filtering rely on quality annotations, this technique may need additional customization based on transcript structures in the target of choice. Furthermore, we have only shown the efficacy of this design on polyadenylated messenger RNA. Further studies focusing on non-adenylated or nascent RNAs should prove highly informative for defining the role of m6A-mediated regulation within introns and structural RNAs.
While the influence of m6A on splicing has been widely reported in both viral and cellular contexts7,8,37,69, this finding has not been without controversy70. One study in particular showed that while m6A can be detected in nascent pre-mRNA, these locations do not change upon RNA maturation and nuclear export15. Furthermore, knockout of METTL3 in mouse embryonic stem cells led to no detectable change in cassette exon splicing15. In general, our findings agree with this as we do not see the global loss of any particular viral splicing event in the absence of METTL3, only a decrease in total amount of spliced transcripts. However, we did find that m6A modifications present in adenoviral late transcripts increase the splicing efficiency of these RNAs. This result is consistent with the results of others that have found that the presence of m6A near splice sites correlates with the rate and efficiency of splicing when studied in a time-resolved fashion in nascent RNA71. Furthermore, it is known that unspliced cellular RNAs resident in the nucleus can be targeted for decay by the nuclear exosome68,72. Therefore, our hypothesis is that efficient splicing requires m6A and leads to the accumulation of mature mRNAs that would otherwise be destroyed in the nucleus before accumulating in the cytoplasm.
The m6A modification has also been implicated in the regulation of alternative polyadenylation47,62,69. While the majority of the adenovirus early transcripts that were insensitive to METTL3 knockdown contain only one cleavage and polyadenylation site, the adenovirus late transcriptional unit contains five poly(A) sites, and usage must be regulated to be able to splice the downstream isoforms. It is intriguing to note that transcripts generated from the L1 region upstream of the first polyadenylation site were the viral late transcripts least affected by METTL3 knockdown. While our quantitative PCR reaction to test the splicing efficiency of the L5 region cannot fully rule out the possibility of fewer transcripts reaching this region due to altered poly(A) cleavage, our splicing reporter transgene assay that does not contain upstream poly(A) sites argues against this hypothesis. Splicing and polyadenylation are thought to happen co-transcriptionally73, and any effect that m6A might have on coordinating these processes remains an interesting topic for future study.
In summary, we discovered that while both early and late adenoviral genes are marked by m6A, we only detected a loss of expression within late genes after depletion of METTL3. This was because m6A increased the efficiency of splicing within adenovirus late transcriptional units, where a plethora of potential splice sites leads to an abundance of alternative splice isoforms. Our data show that even when rates of transcription and RNA decay remain constant, increased splicing efficiency can lead to increased transcript abundance, presumably via protection against nuclear decay mechanisms that target unspliced transcripts. This work highlights how RNA modifications can regulate distinct stages of viral gene expression.
Methods
Cell culture
All cell lines were obtained from American Type Culture Collection (ATCC) and cultured at 37 °C and 5% CO2. All cell lines tested negative for mycoplasma infection and were routinely tested afterwards using the LookOut Mycoplasma PCR Detection Kit (Sigma-Aldrich). A549 cells (ATCC CCL-185) were maintained in Ham’s F-12K medium (Gibco, 21127-022) supplemented with 10% v/v FBS (VWR, 89510-186) and 1% v/v Pen/Strep (100 U/ml of penicillin, 100 μg/ml of streptomycin, Gibco, 15140-122). HeLa cells (ATCC CCL-2) and HEK293 cells (ATCC CRL-1573) were grown in DMEM (Corning, 10-013-CV) were grown in DMEM supplemented with 10% v/v FBS and 1% v/v Pen/Strep.
Viral infection
Adenovirus serotype 5 (Ad5) was originally purchased from ATCC. All viruses were expanded on HEK293 cells, purified using two sequential rounds of ultracentrifugation in CsCl gradients, and stored in 40% v/v glycerol at −20 °C (short term) or −80 °C (long term). Viral stock titer was determined on HEK293 cells by plaque assay, and all subsequent infections were performed at a multiplicity of infection (MOI) of 20 PFU/cell. Cells were infected at 80-90% confluent monolayers by incubation with diluted virus in a minimal volume of low serum (2%) F-12K for 2 h. After infection viral inoculum was removed by vacuum and full serum growth media was replaced for the duration of the experiment.
Plasmids, siRNA, and transfections
Fiber-Transgene constructs were created from the previously generated m6A-null psiCheck2 reporter plasmid34, where both Firefly and Renilla luciferase genes have all m6A-DRACH motifs ablated by silent mutations. The original 3’UTR reporter was removed from the end of Renilla luciferase using XhoI and NotI restriction enzymes and this site was healed by the insertion of a recombinant multiple cloning site containing XhoI, AgeI, and NotI using two annealed DNA oligos with complementary sticky ends (Obtained from IDT). From this reporter construct lacking significant 3’UTR, the chimeric intron downstream of the SV40 promoter and upstream of Renilla luciferase was excised using StuI and NheI. Using compatible cut sites, a DNA fragment was cloned into this site encoding the third exon of the Ad5 late gene unit (nucleotides 9,644 to 9,733), all intervening intron that did not contain other Ad5 genes or splice sites (nucleotides 9733 to 10,631 and 30,867 to 31,042), and the first 482 nucleotides of the Fiber exon and ORF (31,042 to 31,524) such that the Fiber ORF would continue in frame to the Renilla luciferase ORF (WT Fiber). Alternatively, the same fragment but with all 15 DRACH motifs silently mutated was used to generate m6A Mut Fiber. Both fragments were excised from plasmids created as a PriorityGENE service from GeneWiz from provided sequences. All DNA oligos can be found in Supplementary Table 1. DNA transfections were performed using the standard protocol for Lipofectamine2000 (Invitrogen).
The following siRNA pools were obtained from Dharmacon: non-targeting control (D-001206-13-05), METTL3 (M-005170-01-0005), METTL14 (M-014169-00-0005), WTAP (M-017323-02-0005), YTHDC1 (M-015332-01-0005), FTO (M-004159-01-0005), and ALKBH5 (M-004281-01-0005). The following siRNAs were obtained from Qiagen: YTHDF1 (SI00764715), YTHDF2 (SI04174534), YTHDF3 (SI04205761), FTO (SI04177530), and ALKBH5 (SI04138869). All siRNA transfections were performed using the standard protocol for Lipofectamine RNAiMAX (Invitrogen). Poly(I:C) was provided pre-complexed with transfection reagent (Invivogen tlrl-piclv), and was reconstituted fresh with molecular grade water before added to cells at a concentration of 500 ng/ml.
METTL3 knockout
Since METTL3 has previously been reported to be an essential gene, a rescue cell line was pre-constructed that contained a CRISPR-insensitive METTL3 transgene under a tetracycline-inducible promoter. In brief, the 2X-Flag Tagged METTL3 transgene32 was cloned from pEFTak into the BB72674 entry vector and the NGG Cas9 PAM site at nucleotide position 117 was silently mutated (G to C) via the Stratagene QuickChange Site-Directed Mutagenesis protocol (primer sequence in Supplementary Table 1). This plasmid was then co-transfected into A549 HiLo cells with a transiently expressed Cre Recombinase before being selected by puromycin74. This allows integration of METTL3 into a conserved tetracycline-regulated genomic locus, and expression was tested both by immunoblot and immunofluorescence for selection efficiency.
GFP-Cas9 expressing plasmid pX33075 was constructed to contain the single guide RNA (Supplementary Table 1) from the GECKO CRISPR library. The A549-METTL3-HiLo cell line was then induced to express Cas9-insensitive METTL3 for one day before transfected with the single pX330 plasmid containing both GFP-Cas9 and METTL3 sgRNA. Twenty-four hours post transfection, GFP expressing cells were sorted as single cells by fluorescence-activated cell sorting (FACS) into 96-well plates for clonal expansion. Tetracycline was refreshed biweekly to maintain transgene expression for the entire outgrowth until resulting cell lines reached large enough numbers to be viably frozen. Afterwards, tetracycline was withdrawn and cells were cultured for a further two weeks before the presence of endogenous METTL3 was assayed for by both immunoblot and immunofluorescence. Two clones were picked that displayed undetectable levels of endogenous METTL3 at the single cell level, and Cas9-mediated lesions were detected by Sanger sequencing of a PCR amplicon derived from genomic DNA (PureLink Genomic DNA kit, Invitrogen).
meRIP-Seq and meRIP-qPCR
A549 cells in 10 cm plates were lysed directly in TRIzol (Thermo Fisher) and RNA was extracted and DNaseI treated (Qiagen). Poly(A) RNA was selected with the Poly(A)Purist MAG Kit before being fractionated (Thermo Fisher) with RNA Fragmentation Reagents (Thermo Fisher) and ethanol precipitated. For meRIP-qPCR the following protocol was identical but RNA was not poly(A) purified or fragmented. MeRIP was performed using the EpiMark N6-methyladenosine Enrichment Kit (NEB) following manufacturer’s instructions. The following modifications were made following the previously published protocol32. Immunoprecipitations were washed with low and high salt wash buffers and RNA was eluted from the beads with 5 mM m6A salt (Santa Cruz Biotechnology).
For meRIP-Seq RNA-seq libraries were prepared from both eluate and 10% input mRNA using the TruSeq mRNA library prep kit (Illumina), subjected to quality control (MultiQC), and sequenced on the HiSeq4000 instrument. Both the IP and input samples were mapped to the GRCh37/hg19 genome assembly and the Ad5 genome using the RNA-seq aligner GSNAP76 (version 2019-09-12) or STAR (v2.5.4b)77. We called peaks using MACS2 (v2.1.2)78 with the following flags: “-q 0.05 -B –call-summits –keep-dup auto –nomodel –extsize 150”. HOMER (v4.11)79 was then used to identify motifs enriched in the identified m6A-peaks. Metagene analysis and visualization was done using deepTools2 (v3.3.1)80.
Antibodies, immunoblotting, and immunofluorescence
The following primary antibodies were used for cellular proteins: METTL3 (Novus Biologicals H00056339, WB: 1:400, IF: 1:50), METTL14 (Sigma-Aldrich HPA038002, WB: 1:5000, IF: 1:100), WTAP (Proteintech 60188, WB: 1:400, IF: 1:100), YTHDC1 (Abcam ab122340, WB: 1:2000, IF: 1:100), YTHDF1 (Abcam ab99080, WB: 1:500, IF: 1:100), YTHDF2 (Proteintech 24744-1-AP, WB: 1;1000, IF: 1:100), YTHDF3 (Sigma-Aldrich SAB2102735, WB: 1:500), FTO (Abcam ab92821, WB: 1:300, IF: 1:100), ALKBH5 (Sigma-Aldrich SAB1407587, WB: 1:250, IF: 1:100), β-Actin (Sigma-Aldrich A5441-100UL, WB 1:5000), GAPDH (GeneTex 41577, WB:1:20,000), and RNA Pol II p-Ser2 (Abcam ab5095, IF: 1:400). Primary anti-Flag tag antibody was obtained from Sigma-Aldrich (F7425-.2MG, WB 1:2000). Primary antibodies against viral proteins were obtained from: rabbit polyclonal against adenovirus Hexon, Penton, and Fiber (Gift from J. Wilson, WB 1:10,000), mouse anti-DBP (Gift from A. Levine, Clone: B6-8, WB 1:1000, IF 1:400), polyclonal rabbit anti-DBP (Gift from A. Levine, IF: 1:40,000), mouse anti-E1A (BD 554155, WB: 1:500), and mouse anti-E1B55K (Gift from A. Levine, Clone: 58K2A6, WB 1:500).
For western immunoblotting protein samples were prepared by directly lysing cells in lithium dodecyl sulfate (LDS) loading buffer (NuPage) supplemented with 1% beta-mercaptoethanol (BME) and boiled at 95 °C for 10 min. Equal volumes of protein lysate were separated SDS-PAGE in MOPS buffer (Invitrogen) before being transferred onto a nitrocellulose membrane (Millipore) at 35 V for 90 min in 20% methanol solution. Membranes were stained with Ponceau to confirm equal loading and blocked in 5% w/v non-fat milk in TBST supplemented with 0.05% w/v sodium azide. Membranes were incubated with primary antibodies in milk overnight, washed for three times in TBST, incubated with HRP-conjugated secondary (Jackson Laboratories) for 1 h and washed an additional three times in TBST. Proteins were visualized with Pierce ECL or Femto Western Blotting Substrate (Thermo Scientific) and detected using a Syngene G-Box. Images were processed and assembled in Adobe Photoshop and Illustrator CS6.
For immunofluorescence A549 cells were grown on glass coverslips in 24-well plates, mock-infected or infected with Ad5 for 18 h, washed twice with PBS and then fixed in 4% w/v Paraformaldehyde for 15 min. Cells were permeabilized with 0.5% v/v Triton-X in PBS for 10 min, and blocked in 3% w/v BSA in PBS (+0.05% w/v sodium azide) for 1 h. Primary antibody dilutions were added to coverslips in 3% w/v BSA in PBS (+0.05% w/v sodium azide) for 1 h, washed with 3% BSA in PBS three times, followed by secondary antibodies and 4,6-diamidino-2-phenylindole (DAPI) for 1 h. Secondary antibodies were used at 1:500 dilution and conjugated to Alexa-Fluor 488 (Invitrogen A-11001 or A-11008), 555 (Invitrogen A-21422 or A-21428), or 568 (Invitrogen A-11004 or A-11011) in anti-mouse or anti-rabbit. Coverslips were mounted onto glass slides using ProLong Gold Antifade Reagent (Cell Signaling Technologies). Immunofluorescence was visualized using a Zeiss LSM 710 Confocal microscope (Cell and Developmental Microscopy Core at UPenn) and ZEN 2011 software. Images were processed in FIJI (v1.52p) and assembled in Adobe Photoshop and Illustrator 2020.
Inhibitors and small molecules
3-Deazadenosine was purchased from Sigma-Aldrich (D8296) and resuspended in Dimethyl Sulfoxide (DMSO) before being added to cells at the time of infection to a final concentration of 25 µM. Ruxolitinub was purchased from Sigma-Aldrich (G-6185) and resuspended in DMSO before being added to cells immediately prior to infection with Ad5 or transfection with poly(I:C) at a final concentration of 4 µM. 5,6-dichloro-1-β-D-ribofuranosyl-1H-benzimidazole (DRB) was purchased from Cayman Chemical (Item No 10010302) and resuspended in DMSO before being added to cells at 24 h post infection to a final concentration of 60 µM.
RNA isolation and qPCR
Total RNA was isolated from cells by either TRIzol extraction (Thermo Fisher) or RNeasy Micro kit (Qiagen), following manufacturer protocols. RNA was treated with RNase-free DNase I (Qiagen), either on-column or after ethanol precipitation. RNA was converted to complementary DNA (cDNA) using 1 μg of input RNA in the High Capcity RNA-to-cDNA kit (Thermo Fisher). Quantitative PCR was performed using the standard protocol for SYBR Green reagents (Thermo Fisher) in a QuantStudio 7 Flex Real-Time PCR System (Applied Biosystems). All primers were used at 10 μM and sequences can be found in Supplementary Table 1. All values were normalized by the ΔΔCt method by normalizing first to internal controls such as HPRT1 and GAPDH.
Viral DNA replication qPCR
Infected cells were harvested at the indicated time points post infection by trypsin and total DNA was harvested using the PureLink Genomic DNA kit (Invitrogen). DNA quantity was assessed by qPCR and SYBR green reagents using primers for genomic regions of Ad5 and normalized to cellular tubulin (See Supplementary Table 1 for primer sequence). Entry of the viral genome was assessed at the 4 h infection time point, and all subsequent values were normalized to this by the ΔΔCt method.
Plaque assays
Infected cells seeded in 12-well plates were harvested by scraping at the indicated time points and lysed by three cycles of freeze-thawing in liquid nitrogen. Cell debris was removed from lysates by centrifugation at max speed (21,130 × g), 4 °C, 5 min. Lysates were serially diluted in DMEM supplemented with 2% v/v FBS and 1% v/v Pen/Strep to infect a confluent monolayer of HEK293 cells seeded in 12-well plates. After incubation for 2 h at 37 °C, the infection media was removed, and cells were overlaid with DMEM containing 0.45% w/v SeaPlaque agarose (Lonza) in addition to 2% v/v FBS and 1% v/v Pen/Strep. Plaques were stained using 1% w/v crystal violet in 50% v/v ethanol between 6 to 7 days post-infection.
Metabolic labeling of RNA for transcription rate determination
To assess relative RNA transcription rate, cells were treated with 1 mM 4-thiouridine (4sU; Sigma T4509) for exactly 10 min. Infection was stopped and RNA harvested using 1 ml TRIzol (Thermo Fisher), following manufacturer’s instructions. A fraction of the total RNA was reserved as input, and the remaining 4sU-labeled nascent RNA was biotinylated using MTSEA-Biotin-XX (Biotium, 90066) as previously described67,81. Nascent RNA was separated from unlabeled RNA using MyOne C1 Streptavidin Dynabeads (Thermo Fisher Scientific, 65-001), biotin was removed from nascent RNA using 100 mM dithiothreitol (DTT), and RNA was isopropanol precipitated. One µg of total RNA (T) and an equivalent volume of nascent RNA (N) were converted to cDNA and qPCR was performed as described above. Relative transcription rates were determined by the ΔΔCt method to compare nascent transcript levels between control and siRNA treated cells normalized to nascent GAPDH RNA.
RNA-sequencing
Total RNA from three biological replicates of Control knockdown or three biological replicates of METTL3-knockdown A549 cells infected with Ad5 for 24 h were sent to Genewiz for preparation into strand-specific RNA-Seq libraries. Libraries were then run spread over three lanes of an Illumina HiSeq 2500 using a 150 bp paired-end protocol. Raw reads were mapped to the GRCh37/hg19 genome assembly and the Ad5 genome using the RNA-seq aligner GSNAP76 (version 2019-09-12). The algorithm was given known human gene models provided by GENCODE (release_27_hg19) to achieve higher mapping accuracy. We used R package ggplot2 (v3.3.0) for visualization. Downstream analysis and visualization was done using deepTools2 (v3.3.1)80. Splice junctions were extracted using RegTools (v0.2.0)82 and visualized in Integrative Genomics Viewer.
Nanopore direct RNA sequencing
Direct RNA sequencing libraries were generated from 680 to 1800 ng of poly(A) RNA, isolated using the Dynabeads™ mRNA Purification Kit (Invitrogen, 61006). Isolated poly(A) RNA was subsequently spiked with 0.3 µl of a synthetic Enolase 2 (ENO2) calibration RNA (Oxford Nanopore Technologies Ltd.) and prepared for sequencing as described previously56,61 Sequencing was performed on a MinION MkIb using R9.4.1 (rev D) flow cells (Oxford Nanopore Technologies Ltd.) for 18–23 h (one library per flowcell) and yielded between 1,000,000 – 2,000,000 reads per dataset. Raw fast5 datasets were then basecalled using Guppy v3.2.2 (-f FLO-MIN106 -k SQK-RNA002) with only reads in the pass folder used for subsequent analyses. Sequence reads were aligned against the adenovirus Ad5 genome, using MiniMap2 (v2.15 -ax splice -k14 -uf –secondary = no), a splice aware aligner64, with subsequent parsing through SAMtools (v1.9)83 and BEDtools (v2.27.1)84. Here sequence reads were filtered to retain only primary alignments (SAM FLAG 0 (top strand) or 16 (bottom strand)). Coverage plots (Figs. 2 and 4) were generated using the R packages Gviz (v3.10)85 and GenomicRanges86. Isoform-level analysis was performed by aligning sequence reads with MiniMap2 to an Ad5 transcriptome that we recently re-annotated63. Subsequent parsing with SAMtools was used to filter out sequence reads that could not be unambiguously assigned to a single transcript. Here, only primary alignments (SAM flag 0) with mapping qualities (MapQ) >0 were retained. Unambiguous transcripts were defined as those that began at a designated 3’ poly(A) tail and extended to within 50 nucleotides of an annotated TSS, or within the third exon of the late transcripts tripartite leader.
Nucleotide resolution analysis of m6A sites using direct RNA sequencing
The identification of putative m6A sites on adenoviral RNAs was performed using a new tool, DRUMMER (https://github.com/DepledgeLab/DRUMMER/), which predicts RNA modifications based on comparative profiling of basecall error rates at both exome and isoform level. Briefly, DRUMMER takes sorted.bam files from two datasets as input and parses these individually in a stranded manner using BAM Readcount (https://github.com/genome/bam-readcount) to generate base-call distributions (i.e., the number of A, C, G, U, and indels) at each position in the alignment. For each comparison (e.g., M3KO versus WT), a G-test was performed on the base-call distributions at each position in each dataset with subsequent Bonferroni correction for multiple testing. Positions with a read coverage of <100× in one or both datasets being compared were excluded from this analysis. To further filter the dataset following the G-test, we calculated the ratio of match:mismatch fractions at each position. Candidate sites were those that gave a statistically significant G-test result (bonferroni adjusted p < 0.01) and had a one-fold or higher reduction in the match:mismatch fraction in the WT dataset compared to the METTL3 knockout dataset. Subsequent masking of clustered candidates was performed by collapsing all sites located within five nucleotides of the same AC motif into a single representative, chosen by picking the site with highest G-test score. Where sites were located within ten nucleotides of each other, but further than five nucleotides from an AC motif, collapsing was performed by selecting the site with the highest G-test score.
To further validate our data, we also performed m6A site prediction using NanoCompore, which predicts RNA modifications based on signal level analyses of current intensity and dwell time for nucleotides translocating through nanopores65. Here, read indexing and resquiggling were performed using Nanopolish and NanopolishComp prior to running NanoCompore SampComp (–logit –sequence_context 2 –sequence_context_weights harmonic) to compare both biological replicates of the WT and METTL3-knockout datasets in a 2 × 2 analysis. Note that NanoCompore processing times were extremely slow for transcripts with >1000 dRNA-Seq reads assigned, so a random subsampling approach (1000 reads) was used in these situations. Putative m6A modified sites were identified on the basis of a GMM logit p-value (context 2) <0.01 and an Logit LOR score >0.5 or <−0.5. Where multiple sites were present with five nucleotides, only the site with the lowest p-value was retained. To calculate the degree of methylation in WT and METTL3-knockout datasets, we parsed the cluster counts column to determine the fraction of reads in datasets with an m6A at a defined sequence position with the m6A depletion estimate representing the difference between these values.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
We extend special thanks to Ian Mohr (New York University School of Medicine) for his many insightful comments and for support of DPD in part through National Institutes of Health (NIH) grants R01-AI073898 and R01-GM056927. We also thank members of the Weitzman and Mohr/Wilson Labs for insightful discussions and input. We are grateful to A. Berk, P. Branton, G. Ketner, A. Levine, and D. Ornelles for generous gifts of reagents. We thank our colleagues Kristen Lynch, Fange Liu, and Peter Choi for helpful suggestions and careful reading of the manuscript. We also thank Tommaso Leonardi (Istituto Italiano di Tecnologia) for insightful discussions regarding NanoCompore. We thank the UPenn Cell and Developmental Biology Microscopy Core for imaging assistance. This work was supported through NIH grants R21-AI130618 and R21-AI147163 (ACW), R01-AI125416 and R21-AI129851 (S.M.H. and C.E.M.), and R01-AI145266, R01-AI121321, and R01-CA097093 (MDW). S.M.H. was supported by the Burroughs Wellcome Fund (17PRE336700170). N.S.G. was supported by an American Heart Association Predoctoral Fellowship. ABRM was supported by the Natural Sciences and Engineering Research Council of Canada. J.S.A. was supported by the Sackler Institute of Graduate Biomedical Sciences. Additional support came from the NCI T32 Training Grant in Tumor Virology T32-CA115299 (A.M.P.) and Individual National Research Service Award F32-AI138432 (AMP).
Source data
Author contributions
A.M.P., D.P.D., and M.D.W. conceived of the project and designed the experiments; C.E.M., A.C.W., and S.M.H. provided additional input into study design; A.M.P., N.S.G., A.N.D.F., and D.P.D. performed the experiments; D.P.D. designed the nanopore m6A detection approach; J.S.A. codified the DRUMMER pipeline; K.E.H., A.B.R.M., and D.P.D. performed computational analyses; A.M.P. and D.P.D analyzed all additional data; A.M.P., D.P.D., and M.D.W. wrote the manuscript; All authors read, edited and approved the final paper.
Data availability
Fast5 (Nanopore) and fastq (Illumina) datasets generated as part of this study can be downloaded from the European Nucleotide Archive (ENA) under the following study accession: PRJEB35652. The authors declare that all other data supporting the findings of this study are available within the article and its Supplementary Information files, or are available from the authors upon request. Source data are provided with this paper.
Code availability
All code pertaining to detection of m6A sites via direct RNA Sequencing (DRUMMER) is available at https://github.com/DepledgeLab/DRUMMER/.
Competing interests
C.E.M. is a cofounder and board member for Biotia and Onegevity Health, as well as an advisor or compensated speaker for Abbvie, Acuamark Diagnostics, ArcBio, BioRad, DNA Genotek, Genialis, Genpro, Karius, Illumina, New England Biolabs, QIAGEN, Whole Biome, and Zymo Research. The remaining authors declare no competing interests.
Footnotes
Peer review information Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors jointly supervised this work: Daniel P. Depledge, Matthew D. Weitzman.
Contributor Information
Daniel P. Depledge, Email: daniel.depledge@nyulangone.org
Matthew D. Weitzman, weitzmanm@email.chop.edu
Supplementary information
Supplementary information is available for this paper at 10.1038/s41467-020-19787-6.
References
- 1.Arnold J. Berk Adenoviridae. In Fields Virology (eds. David M. Knipe & Peter M. Howley) Vol. 2, 1704–1731 (Wolters Kluwer Health/Lippincott Williams & Wilkins, 2013).
- 2.Sommer S, et al. The methylation of adenovirus-specific nuclear and cytoplasmic RNA. Nucleic Acids Res. 1976;3:749–765. doi: 10.1093/nar/3.3.749. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Chen-Kiang S, Nevins JR, Darnell JE. N-6-methyl-adenosine in adenovirus type 2 nuclear RNA is conserved in the formation of messenger RNA. J. Mol. Biol. 1979;135:733–752. doi: 10.1016/0022-2836(79)90174-8. [DOI] [PubMed] [Google Scholar]
- 4.Zhao BS, Roundtree IA, He C. Post-transcriptional gene regulation by mRNA modifications. Nat. Rev. Mol. Cell Biol. 2017;18:31–42. doi: 10.1038/nrm.2016.132. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Williams GD, Gokhale NS, Horner SM. Regulation of viral infection by the RNA modification N6-methyladenosine. Annu. Rev. Virol. 2019;6:235–253. doi: 10.1146/annurev-virology-092818-015559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Yue Y, Liu J, He C. RNA N6-methyladenosine methylation in post-transcriptional gene expression regulation. Genes Dev. 2015;29:1343–1355. doi: 10.1101/gad.262766.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Xiao W, et al. Nuclear m(6)A reader YTHDC1 regulates mRNA splicing. Mol. Cell. 2016;61:507–519. doi: 10.1016/j.molcel.2016.01.012. [DOI] [PubMed] [Google Scholar]
- 8.Bartosovic M, et al. N6-methyladenosine demethylase FTO targets pre-mRNAs and regulates alternative splicing and 3’-end processing. Nucleic Acids Res. 2017;45:11356–11370. doi: 10.1093/nar/gkx778. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Roundtree, I. A. et al. YTHDC1 mediates nuclear export of N(6)-methyladenosine methylated mRNAs. Elife6, e31311 (2017). [DOI] [PMC free article] [PubMed]
- 10.Wang X, et al. N6-methyladenosine-dependent regulation of messenger RNA stability. Nature. 2014;505:117–120. doi: 10.1038/nature12730. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Lin S, Choe J, Du P, Triboulet R, Gregory RI. The m(6)A methyltransferase METTL3 promotes translation in human cancer cells. Mol. Cell. 2016;62:335–345. doi: 10.1016/j.molcel.2016.03.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Wang X, et al. N(6)-methyladenosine modulates messenger RNA translation efficiency. Cell. 2015;161:1388–1399. doi: 10.1016/j.cell.2015.05.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Liu J, et al. A METTL3-METTL14 complex mediates mammalian nuclear RNA N6-adenosine methylation. Nat. Chem. Biol. 2014;10:93–95. doi: 10.1038/nchembio.1432. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Schwartz S, et al. Perturbation of m6A writers reveals two distinct classes of mRNA methylation at internal and 5’ sites. Cell Rep. 2014;8:284–296. doi: 10.1016/j.celrep.2014.05.048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Ke S, et al. m(6)A mRNA modifications are deposited in nascent pre-mRNA and are not required for splicing but do specify cytoplasmic turnover. Genes Dev. 2017;31:990–1006. doi: 10.1101/gad.301036.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Meyer KD, Jaffrey SR. Rethinking m(6)A readers, writers, and erasers. Annu. Rev. Cell Dev. Biol. 2017;33:319–342. doi: 10.1146/annurev-cellbio-100616-060758. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Hazra D, Chapat C, Graille M. m6A mRNA destiny: chained to the rhYTHm by the YTH-containing proteins. Genes. 2019;10:49. doi: 10.3390/genes10010049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Liu N, et al. N(6)-methyladenosine-dependent RNA structural switches regulate RNA-protein interactions. Nature. 2015;518:560–564. doi: 10.1038/nature14234. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Zhou KI, et al. Regulation of co-transcriptional pre-mRNA splicing by m6A through the low-complexity protein hnRNPG. Mol. Cell. 2019;76:70–81. doi: 10.1016/j.molcel.2019.07.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Huang H, et al. Recognition of RNA N(6)-methyladenosine by IGF2BP proteins enhances mRNA stability and translation. Nat. Cell Biol. 2018;20:285–295. doi: 10.1038/s41556-018-0045-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Zheng G, et al. ALKBH5 is a mammalian RNA demethylase that impacts RNA metabolism and mouse fertility. Mol. Cell. 2013;49:18–29. doi: 10.1016/j.molcel.2012.10.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Mauer J, et al. Reversible methylation of m 6 A m in the 5′ cap controls mRNA stability. Nature. 2017;541:371–375. doi: 10.1038/nature21022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Kane SE, Beemon K. Precise localization of m6A in Rous sarcoma virus RNA reveals clustering of methylation sites: implications for RNA processing. Mol. Cell. Biol. 1985;5:2298–2306. doi: 10.1128/MCB.5.9.2298. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Lavi S, Shatkin AJ. Methylated simian virus 40-specific RNA from nuclei and cytoplasm of infected BSC-1 cells. Proc. Natl Acad. Sci. USA. 1975;72:2012–2016. doi: 10.1073/pnas.72.6.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Moss B, Gershowitz A, Stringer JR, Holland LE, Wagner EK. 5’-Terminal and internal methylated nucleosides in herpes simplex virus type 1 mRNA. J. Virol. 1977;23:234–239. doi: 10.1128/JVI.23.2.234-239.1977. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Krug RM, Morgan MA, Shatkin AJ. Influenza viral mRNA contains internal N6-methyladenosine and 5’-terminal 7-methylguanosine in cap structures. J. Virol. 1976;20:45–53. doi: 10.1128/JVI.20.1.45-53.1976. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Kennedy EM, et al. Posttranscriptional m(6)A editing of HIV-1 mRNAs enhances viral gene expression. Cell Host Microbe. 2016;19:675–685. doi: 10.1016/j.chom.2016.04.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Lichinchi G, et al. Dynamics of the human and viral m(6)A RNA methylomes during HIV-1 infection of T cells. Nat. Microbiol. 2016;1:16011. doi: 10.1038/nmicrobiol.2016.11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Tirumuru, N. et al. N(6)-methyladenosine of HIV-1 RNA regulates viral infection and HIV-1 Gag protein expression. Elife5, e15528 (2016). [DOI] [PMC free article] [PubMed]
- 30.Courtney DG, et al. Epitranscriptomic enhancement of Influenza A virus gene expression and replication. Cell Host Microbe. 2017;22:377–386. doi: 10.1016/j.chom.2017.08.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Hao H, et al. N6-methyladenosine modification and METTL3 modulate enterovirus 71 replication. Nucleic Acids Res. 2019;47:362–374. doi: 10.1093/nar/gky1007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Gokhale NS, et al. N6-methyladenosine in flaviviridae viral RNA genomes regulates infection. Cell Host Microbe. 2016;20:654–665. doi: 10.1016/j.chom.2016.09.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Lichinchi G, et al. Dynamics of human and viral RNA methylation during Zika virus infection. Cell Host Microbe. 2016;20:666–673. doi: 10.1016/j.chom.2016.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Gokhale, N. S. et al. Altered m6A modification of specific cellular transcripts affects flaviviridae infection. Mol. Cell10.1016/j.molcel.2019.11.007. (2019). [DOI] [PMC free article] [PubMed]
- 35.Imam H, et al. N6-methyladenosine modification of hepatitis B virus RNA differentially regulates the viral life cycle. Proc. Natl Acad. Sci. USA. 2018;115:8829–8834. doi: 10.1073/pnas.1808319115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Tsai K, Courtney DG, Cullen BR. Addition of m6A to SV40 late mRNAs enhances viral structural gene expression and replication. PLoS Pathog. 2018;14:e1006919. doi: 10.1371/journal.ppat.1006919. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Ye, F., Chen, E. R. & Nilsen, T. W. Kaposi’s sarcoma-associated herpesvirus utilizes and manipulates RNA N6-adenosine methylation to promote lytic replication. J. Virol.10.1128/JVI.00466-17. (2017). [DOI] [PMC free article] [PubMed]
- 38.Tan B, et al. Viral and cellular N 6 -methyladenosine and N 6, 2′- O -dimethyladenosine epitranscriptomes in the KSHV life cycle. Nat. Microbiol. 2018;3:108–120. doi: 10.1038/s41564-017-0056-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Hesser CR, Karijolich J, Dominissini D, He C, Glaunsinger BA. N6-methyladenosine modification and the YTHDF2 reader protein play cell type specific roles in lytic viral gene expression during Kaposi’s sarcoma-associated herpesvirus infection. PLoS Pathog. 2018;14:e1006995. doi: 10.1371/journal.ppat.1006995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Rubio RM, Depledge DP, Bianco C, Thompson L, Mohr I. RNA m6 A modification enzymes shape innate responses to DNA by regulating interferon β. Genes Dev. 2018;32:1472–1484. doi: 10.1101/gad.319475.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Winkler R, et al. m 6 A modification controls the innate immune response to infection by targeting type I interferons. Nat. Immunol. 2019;20:173–182. doi: 10.1038/s41590-018-0275-z. [DOI] [PubMed] [Google Scholar]
- 42.Meyer KD, et al. Comprehensive analysis of mRNA methylation reveals enrichment in 3’ UTRs and near stop codons. Cell. 2012;149:1635–1646. doi: 10.1016/j.cell.2012.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Li X, Xiong X, Yi C. Epitranscriptome sequencing technologies: decoding RNA modifications. Nat. Methods. 2017;14:23–31. doi: 10.1038/nmeth.4110. [DOI] [PubMed] [Google Scholar]
- 44.Carlile TM, et al. Pseudouridine profiling reveals regulated mRNA pseudouridylation in yeast and human cells. Nature. 2014;515:143–146. doi: 10.1038/nature13802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Squires JE, et al. Widespread occurrence of 5-methylcytosine in human coding and non-coding RNA. Nucleic Acids Res. 2012;40:5023–5033. doi: 10.1093/nar/gks144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Li X, et al. Base-resolution mapping reveals distinct m1A methylome in nuclear- and mitochondrial-encoded transcripts. Mol. Cell. 2017;68:993–1005. doi: 10.1016/j.molcel.2017.10.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Chen K, et al. High-resolution N6-methyladenosine (m6A) map using photo-crosslinking-assisted m6A sequencing. Angew. Chem. Int. Ed. 2015;54:1587–1590. doi: 10.1002/anie.201410647. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Ke S, et al. A majority of m6A residues are in the last exons, allowing the potential for 3′ UTR regulation. Genes Dev. 2015;29:2037–2053. doi: 10.1101/gad.269415.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Linder B, et al. Single-nucleotide-resolution mapping of m6A and m6Am throughout the transcriptome. Nat. Methods. 2015;12:767–772. doi: 10.1038/nmeth.3453. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Garcia-Campos MA, et al. Deciphering the “m6A Code” via antibody-independent quantitative profiling. Cell. 2019;178:731–747. doi: 10.1016/j.cell.2019.06.013. [DOI] [PubMed] [Google Scholar]
- 51.Zeng Y, et al. Refined RIP-seq protocol for epitranscriptome analysis with low input materials. PLoS Biol. 2018;16:e2006092. doi: 10.1371/journal.pbio.2006092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Flusberg BA, et al. Direct detection of DNA methylation during single-molecule, real-time sequencing. Nat. Methods. 2010;7:461–465. doi: 10.1038/nmeth.1459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.McIntyre ABR, et al. Single-molecule sequencing detection of N 6-methyladenine in microbial reference materials. Nat. Commun. 2019;10:1–11. doi: 10.1038/s41467-019-08289-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Liu H, et al. Accurate detection of m 6 A RNA modifications in native RNA sequences. Nat. Commun. 2019;10:1–9. doi: 10.1038/s41467-018-07882-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Lorenz, D. A., Sathe, S., Einstein, J. M. & Yeo, G. W. Direct RNA sequencing enables m6A detection in endogenous transcript isoforms at base specific resolution. RNA10.1261/rna.072785.119. (2019) [DOI] [PMC free article] [PubMed]
- 56.Depledge DP, et al. Direct RNA sequencing on nanopore arrays redefines the transcriptional complexity of a viral pathogen. Nat. Commun. 2019;10:1–13. doi: 10.1038/s41467-019-08734-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Reyes ED, et al. Identifying host factors associated with DNA replicated during virus infection. Mol. Cell Proteom. 2017;16:2079–2097. doi: 10.1074/mcp.M117.067116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Charman, M., Herrmann, C. & Weitzman, M. D. Viral and cellular interactions during adenovirus DNA replication. FEBS Lett. 10.1002/1873-3468.13695. (2019). [DOI] [PMC free article] [PubMed]
- 59.Pombo A, Ferreira J, Bridge E, Carmo-Fonseca M. Adenovirus replication and transcription sites are spatially separated in the nucleus of infected cells. EMBO J. 1994;13:5075–5085. doi: 10.1002/j.1460-2075.1994.tb06837.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.McIntyre ABR, et al. Limits in the detection of m 6 A changes using MeRIP/m 6 A-seq. Sci. Rep. 2020;10:6590. doi: 10.1038/s41598-020-63355-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Garalde DR, et al. Highly parallel direct RNA sequencing on an array of nanopores. Nat. Methods. 2018;15:201–206. doi: 10.1038/nmeth.4577. [DOI] [PubMed] [Google Scholar]
- 62.Parker MT, et al. Nanopore direct RNA sequencing maps the complexity of Arabidopsis mRNA processing and m6A modification. ELife. 2020;9:e49658. doi: 10.7554/eLife.49658. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Price, A. M., Hayer, K. E., Depledge, D. P., Wilson, A. C. & Weitzman, M. D. Novel splicing and open reading frames revealed by long-read direct RNA sequencing of adenovirus transcripts. bioRxiv10.1101/2019.12.13.876037. (2019). [DOI] [PMC free article] [PubMed]
- 64.Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34:3094–3100. doi: 10.1093/bioinformatics/bty191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Leger, A. et al. RNA modifications detection by comparative Nanopore direct RNA sequencing. bioRxiv 843136 10.1101/843136. (2019). [DOI] [PMC free article] [PubMed]
- 66.Fustin J-M, et al. RNA-methylation-dependent RNA processing controls the speed of the circadian clock. Cell. 2013;155:793–806. doi: 10.1016/j.cell.2013.10.026. [DOI] [PubMed] [Google Scholar]
- 67.Russo J, Heck AM, Wilusz J, Wilusz CJ. Metabolic labeling and recovery of nascent RNA to accurately quantify mRNA stability. Methods San. Diego Calif. 2017;120:39–48. doi: 10.1016/j.ymeth.2017.02.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Doma MK, Parker R. RNA quality control in eukaryotes. Cell. 2007;131:660–668. doi: 10.1016/j.cell.2007.10.041. [DOI] [PubMed] [Google Scholar]
- 69.Kasowitz SD, et al. Nuclear m6A reader YTHDC1 regulates alternative polyadenylation and splicing during mouse oocyte development. PLoS Genet. 2018;14:e1007412. doi: 10.1371/journal.pgen.1007412. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Darnell RB, Ke S, Darnell JE. Pre-mRNA processing includes N6 methylation of adenosine residues that are retained in mRNA exons and the fallacy of ‘RNA epigenetics’. RNA N. Y. N. 2018;24:262–267. doi: 10.1261/rna.065219.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Louloupi A, Ntini E, Conrad T, Ørom UAV. Transient N-6-methyladenosine transcriptome sequencing reveals a regulatory role of m6A in splicing efficiency. Cell Rep. 2018;23:3429–3437. doi: 10.1016/j.celrep.2018.05.077. [DOI] [PubMed] [Google Scholar]
- 72.Bresson SM, Hunter OV, Hunter AC, Conrad NK. Canonical Poly(A) polymerase activity promotes the decay of a wide variety of mammalian nuclear RNAs. PLoS Genet. 2015;11:e1005610. doi: 10.1371/journal.pgen.1005610. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Misra A, Green MR. From polyadenylation to splicing: dual role for mRNA 3’ end formation factors. RNA Biol. 2015;13:259–264. doi: 10.1080/15476286.2015.1112490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Khandelia P, Yap K, Makeyev EV. Streamlined platform for short hairpin RNA interference and transgenesis in cultured mammalian cells. Proc. Natl Acad. Sci. USA. 2011;108:12799–12804. doi: 10.1073/pnas.1103532108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Cong L, et al. Multiplex genome engineering using CRISPR/Cas systems. Science. 2013;339:819–823. doi: 10.1126/science.1231143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Wu TD, Reeder J, Lawrence M, Becker G, Brauer MJ. GMAP and GSNAP for genomic sequence alignment: enhancements to speed. Accuracy Functionality. Methods Mol. Biol. Clifton NJ. 2016;1418:283–334. doi: 10.1007/978-1-4939-3578-9_15. [DOI] [PubMed] [Google Scholar]
- 77.Dobin A, et al. STAR: ultrafast universal RNA-seq aligner. Bioinform. Oxf. Engl. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Zhang Y, et al. Model-based analysis of ChIP-Seq (MACS) Genome Biol. 2008;9:R137. doi: 10.1186/gb-2008-9-9-r137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Heinz S, et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol. Cell. 2010;38:576–589. doi: 10.1016/j.molcel.2010.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Ramírez F, et al. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 2016;44:W160–W165. doi: 10.1093/nar/gkw257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Poling BC, Price AM, Luftig MA, Cullen BR. The Epstein-Barr virus miR-BHRF1 microRNAs regulate viral gene expression in cis. Virology. 2017;512:113–123. doi: 10.1016/j.virol.2017.09.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Feng, Y.-Y. et al. RegTools: integrated analysis of genomic and transcriptomic data for discovery of splicing variants in cancer. bioRxiv10.1101/436634. (2018).
- 83.Li H, et al. The Sequence Alignment/Map format and SAMtools. Bioinform. Oxf. Engl. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–842. doi: 10.1093/bioinformatics/btq033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Hahne, F. & Ivanek, R. Visualizing Genomic Data Using Gviz and Bioconductor. In Statistical Genomics: Methods and Protocols (eds. Mathé, E. & Davis, S.) 335–351 (Springer, 2016). [DOI] [PubMed]
- 86.Lawrence M, et al. Software for computing and annotating genomic ranges. PLoS Comput. Biol. 2013;9:e1003118. doi: 10.1371/journal.pcbi.1003118. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Fast5 (Nanopore) and fastq (Illumina) datasets generated as part of this study can be downloaded from the European Nucleotide Archive (ENA) under the following study accession: PRJEB35652. The authors declare that all other data supporting the findings of this study are available within the article and its Supplementary Information files, or are available from the authors upon request. Source data are provided with this paper.
All code pertaining to detection of m6A sites via direct RNA Sequencing (DRUMMER) is available at https://github.com/DepledgeLab/DRUMMER/.