Extensive translation of circular RNAs driven by N6-methyladenosine

Yun Yang; Xiaojuan Fan; Miaowei Mao; Xiaowei Song; Ping Wu; Yang Zhang; Yongfeng Jin; Yi Yang; Ling-Ling Chen; Yang Wang; Catherine CL Wong; Xinshu Xiao; Zefeng Wang

doi:10.1038/cr.2017.31

. 2017 Mar 10;27(5):626–641. doi: 10.1038/cr.2017.31

Extensive translation of circular RNAs driven by N⁶-methyladenosine

Yun Yang ^1,^2,^3,^4,^✝, Xiaojuan Fan ^2,^✝, Miaowei Mao ^4,^5,^✝, Xiaowei Song ^2,⁴, Ping Wu ^6,⁷, Yang Zhang ⁸, Yongfeng Jin ¹, Yi Yang ⁵, Ling-Ling Chen ⁸, Yang Wang ⁹, Catherine CL Wong ^6,⁷, Xinshu Xiao ³, Zefeng Wang ^2,^4,^*

PMCID: PMC5520850 PMID: 28281539

Abstract

Extensive pre-mRNA back-splicing generates numerous circular RNAs (circRNAs) in human transcriptome. However, the biological functions of these circRNAs remain largely unclear. Here we report that N⁶-methyladenosine (m⁶A), the most abundant base modification of RNA, promotes efficient initiation of protein translation from circRNAs in human cells. We discover that consensus m⁶A motifs are enriched in circRNAs and a single m⁶A site is sufficient to drive translation initiation. This m⁶A-driven translation requires initiation factor eIF4G2 and m⁶A reader YTHDF3, and is enhanced by methyltransferase METTL3/14, inhibited by demethylase FTO, and upregulated upon heat shock. Further analyses through polysome profiling, computational prediction and mass spectrometry reveal that m⁶A-driven translation of circRNAs is widespread, with hundreds of endogenous circRNAs having translation potential. Our study expands the coding landscape of human transcriptome, and suggests a role of circRNA-derived proteins in cellular responses to environmental stress.

Keywords: N⁶-methyladenosine, circular RNA, cap-independent translation, eIF4G2

Introduction

Although circular RNAs (circRNAs) in higher eukaryotes were first discovered more than 20 years ago^1,2, they attracted little attention until recently when a large number of circRNAs were identified by parallel sequencing^3,4,5,6,7. The majority of circRNAs are generated through back-splicing of internal exons, a non-canonical splicing process promoted by dsRNA structures across circularizing exons^3,8,9,10,11. Although there are reports that some circRNAs can function as “decoys” to neutralize miRNA (i.e., as miRNA sponges)^4,7 or to bind and sequester other RNA binding proteins¹², the biological function of most circRNAs is still undetermined. An intriguing possibility is that circRNAs could be translated to produce proteins, because most of circRNAs originate from exons and are localized in the cytoplasm. Indeed, artificial circRNAs with an internal ribosomal entry site (IRES) can be translated in vitro¹³ or in vivo¹¹. However the coding potential of circRNAs remains an open question because of early reports that most circRNAs are not associated with polysomes^3,14,15.

N⁶-methyladenosine (m⁶A) is the most abundant internal modification of RNAs in eukaryotes^16,17. The modification preferably occurs in the consensus motif “RRm⁶ACH” (R = G or A; H = A, C or U)^18,19, and is found in over 7 000 mRNAs and 300 non-coding RNAs in human and mouse using m⁶A-specific immunoprecipitation (MeRIP-Seq)^20,21. The methylation of adenosine is catalyzed by a methyltransferase complex consisting of methyltransferase-like 3 (METTL3), METTL14 and Wilms' tumor 1-associating protein^22,23,24,25, and the m⁶A is demethylated by fat mass and obesity-associated protein (FTO) and alkylated DNA repair protein alkB homolog 5 (AKLBH5)^26,27, which serve as the writers and erasers of m⁶A. Inside cells, m⁶A is specially recognized by the YTH domain family protein YTHDF1, 2 and 3 that bind m⁶A and function as m⁶A readers. The m⁶A modification can affect multiple stages of RNA metabolism, including mRNA localization, splicing, translation and degradation, which in turn regulates important biological processes such as stem cell differentiation^28,29. In particular, m⁶A is reported to have multifaceted affects on translation: m⁶A in 3′ UTRs was found to increase translation efficiency through binding of YTHDF1³⁰, whereas m⁶A in 5′ UTRs was reported to promote cap-independent translation upon heat shock stress through a YTHDF2-protection mechanism^31,32. In addition, a recent study has reported that m⁶A reader YTHDF3 promotes protein synthesis in synergy with YTHDF1³³, which is further supported by the finding that cytoplasmic YTHDF3 interacts with ribosomal proteins to promote mRNA translation³⁴. However the general effect of m⁶A on protein translation is still an incomplete story with the detailed mechanisms being elusive.

Here we report that circRNAs in human cells can be efficiently translated using short sequences containing m⁶A site as IRESs. Consistently, the translation from circRNA is reduced by m⁶A demethylase FTO, promoted by adenosine methyltransferase METTL3/14, and requires eukaryotic translation initiation factor eIF4G2 and m⁶A reader YTHDF3. We found that a large number of circRNAs are methylated, suggesting that such translation could be common to many circRNAs. By sequencing RNase R-resistant RNAs associated with polysomes, we identified hundreds of endogenous translatable circRNAs, many of which contain m⁶A sites. Our finding suggests a general function of circRNA and has important implications in the translation landscape of human genome.

Results

circRNAs containing m⁶A motifs are translated inside cells

Previously we developed a minigene reporter containing split GFP to demonstrate that a circRNA can be translated using a viral IRES¹¹ (Figure 1A). The translation from this circRNA system was rigorously validated by multiple controls, including introducing mutations that disrupt intron pairing, treating the samples with RNase R and cleaving the expression plasmids into linear DNA with restriction enzymes to eliminate potential artifact of exon concatemers¹¹. To further explore the molecular mechanisms governing this phenomenon, we analyzed whether the circRNAs containing several other endogenous IRES or control sequences in human genome can also be translated (Supplementary information, Table S1). Surprisingly, all the inserted sequences, including the three “negative controls” ranging from 38 to 253 nt, efficiently initiated GFP translation as judged by western blots (Figure 1B) and fluorescence microscopy (Supplementary information, Figure S1A). Translation from the circRNA was eliminated only when the sequence of restriction sites was left between the stop and start codons of GFP in the circRNA reporter (Supplementary information, Figure S1B). This unexpected result suggested that IRESs serving to facilitate translation initiation in circRNAs are more prevalent than previously expected.

N⁶-methyladenosine promotes circRNA translation. **(A)** Schematic diagram of a circRNA translation reporter consisting of a single exon and two introns with complement sequences (marked by heart and crown). The exon can be back-spliced to generate circRNAs that drive GFP translation from the IRES. Green arrows indicate PCR primers used in detecting circRNA. **(B)** Translation of circRNA can be driven by different endogenous human IRESs (from IGF2, Hsp70 and XIAP) or three control sequences (short fragments of intron, coding region and 5′ UTR from beta-Actin gene, see Supplementary information, Table S1). Each reporter was transfected into 293 cells, and protein production was detected by western blot 48 h after transfection. CircRNA was detected by semi-quantitative PCR using circRNA specific primers. **(C)** Consensus m⁶A motifs are enriched in the “negative control” sequences. Top, RRACH motif near the start codon, with the putative m⁶A-modified adenosine highlighted in red and the start codon labeled in green. Bottom, enriched motifs discovered by k-mer sampling/clustering. **(D)** Accumulative distribution of m⁶A motif in circRNA and mRNA. **(E)** Density of m⁶A peaks (from MeRIP-seq) that are mapped to all mRNAs and known circRNAs regions. P-value = 3.2e⁻⁷ by student's t-test. **(F)** m⁶A motifs directly promote circRNA translation. 0-2 copies of m⁶A motifs (GGACU) and an adenosine-free control sequence (CGGTGCCGGTGC) were inserted into the upstream of the start codon in circRNA reporters, and circRNA and GFP translation were detected similarly as in panel B. **(G)** Known m⁶A sites (RSV and RSVns) and their mutations were tested for the activity of driving translation. Experimental procedures are the same as in panel B. In last panel, the total RNA was also treated with RNase R before RT-PCR.

To understand how the “negative control” sequences induce circRNA translation, we examined their sequences near the translation start site. Interestingly, all “negative control” sequences contain a RRACH fragment (R = G or A; H = A, C or U) near the start codon (Figure 1C, top), resembling the consensus motif of m⁶A modification (i.e., RRACH motif), the most abundant internal modification of RNAs^16,17. Moreover, by clustering the randomly sampled 10-nt sequence windows in these three sequences, we also found a motif that resembles the consensus site for m⁶A modification (Figure 1C, bottom, see Materials and Methods section). Compared to all coding mRNAs, the putative m⁶A motifs are significantly enriched in known circRNAs (Figure 1D). This observed enrichment is reliable, because in a control analysis, m⁶A motifs are more enriched in snoRNAs yet depleted in snRNAs³⁵, and are more enriched in exons compared to introns as expected (Supplementary information, Figure S1C). Furthermore, compared to randomly selected 1 000 sets of control hexamers, the consensus m⁶A hexamers (i.e., HRRACH) are significantly enriched in known circRNAs (Supplementary information, Figure S1D). Consistently, we found that previously identified m⁶A peaks from transcriptome-wide mapping of m⁶A sites using MeRIP-Seq^20,21 have a higher average density in circRNA regions compared to all mRNAs regardless of the relative positions (Figure 1E).

Because m⁶A was recently found to increase mRNA translation efficiency^30,32, our observations strongly suggested that the m⁶A containing RRACH sequences may be involved in the translation initiation of circRNAs. To test this hypothesis, we inserted a short fragment (19 nt) containing different copies of consensus m⁶A motifs before the start codon of circRNA reporter and measured GFP protein production in transfected 293 cells (Figure 1F). As expected, circRNAs containing one or two m⁶A motifs were efficiently translated into GFP protein, whereas the mutation of both motifs greatly reduced (but did not completely eliminate) the GFP level (Figure 1F). In addition, the circRNA with single m⁶A site has similar translation efficiency compared to circRNA with two m⁶A sites, indicating that a single m⁶A site is sufficient to initiate translation (i.e., excessive m⁶A modification may not increase circRNA translation efficiency). The translation of GFP was eliminated when we inserted a sequence without any adenosine residue (Figure 1F). In addition, we tested two other sequences (RSV and RSVns) that were reported to undergo m⁶A modification²³, and found that both sequences strongly induced protein translation. Importantly, mutants that decrease m⁶A methylation in these sequences²³ also reduced the translation efficiency (Figure 1G), further supporting the notion that m⁶A drives protein translation from circRNAs. The production of circRNAs from all reporters was further validated with northern blot using a probe that specifically recognizes full-length GFP (Supplementary information, Figure S2A).

As previously reported, the circRNA and its translation products were detected even after the linearization of reporter plasmids with MluI digestion (Supplementary information, Figure S2B), strongly supporting translation from circRNA because the pre-mRNA with multiple copies of concatenated GFP fragments cannot be produced in this scenario. The same set of sequences was also able to drive protein translation in HeLa cells (Supplementary information, Figure S3A and S3B), indicating that m⁶A-initiated translation is cell type independent. However in HeLa cells, there is still some expression of GFP following mutation of both m⁶A motifs (Supplementary information, Figure S3A). This might be because either m⁶A can occur at a non-canonical site or some sequences can initiate translation in m⁶A-independent fashion in HeLa cells.

Modulation of m⁶A level in circRNA affects translation efficiency

To further evaluate the importance of m⁶A in translation of circRNAs, we examined whether circRNAs with m⁶A motifs are indeed methylated using RNA-IP. We found that antibody against m⁶A specifically pulled down circRNA containing the RSV m⁶A site and a known m⁶A-containing mRNA (SON mRNA)²⁰, but not a control mRNA without m⁶A (GAPDH) (Figure 2A). CircRNA containing mutated m⁶A site (RSV-mut) was also pulled down by m⁶A antibody, but with dramatically reduced efficiency (Figure 2A). This observation suggests that the mutation can reduce but not completely eliminate the m⁶A modification at RSV site, which is consistent with the small amount of GFP production in RSV-mut reporter (Figure 1G). Alternatively, there may be other minor sites for m⁶A modification in the circRNA. In addition, co-expression of m⁶A demethylase FTO²⁶ significantly reduced the abundance of immunoprecipitated SON mRNA or RSV-containing circRNA (Figure 2A) and decreased the translation of GFP from the circRNA (Figure 2B), further confirming that the circRNA/mRNA with m⁶A sites are methylated and circRNA translation is indeed driven by m⁶A. In agreement with these observations, co-expression of m⁶A methyltransferase METTL3/14 significantly increased the RNA-IP signal from the circRNA or mRNA containing m⁶A but not from the control RNA (Figure 2C), and greatly increased protein translation from circRNA (Figure 2D). Interestingly, expression of METTL14 is unstable by itself but the co-expression of METTL3 greatly stabilized METLL14 and synergistically induced the translation of the GFP protein, supporting a previous report that METTL3 and METLL14 may form a stable complex²³ (Figure 2D). We also noticed that expression of FTO or METTL3/14 did not change the level of circRNA, suggesting that m⁶A modification has little effect on the stability of circRNA. This observation differs from the report that m⁶A modification promotes degradation of linear mRNA³⁶, and probably reflects a greater stability of circRNAs so that the small change of stability resulting from m⁶A modification is not as obvious, or because degradation of circRNA is regulated by a different mechanism compared to linear mRNA.

Methylation of circRNA affects translation efficiency. **(A)** m⁶A in circRNA is reduced by FTO. FTO expression vector was co-transfected with circRNA containing RSV or RSV-mut m⁶A site into 293 cells, and the RNAs from transfected cells were pulled down by m⁶A-specific antibody and analyzed by RT-qPCR. The SON mRNA known to contain multiple m⁶A sites and GAPDH mRNA containing no m⁶A modification were used as controls. Control antibody is anti-GAPDH antibody. The IP experiments were repeated three times, with mean and SD plotted. **(B)** FTO reduces circRNA translation. RNA and protein were analyzed by semi-quantitative RT-PCR and western blots using 293 cells transfected with circRNA reporter containing RSV and FTO (or mock control). **(C)** METTL3 and METTL14 can methylate circRNA. circRNA with RSV or RSV-mut, METTL3 and METTL14 overexpression plasmids were co-transfected into 293 cells as in A (n = 3; mean ± SD). **(D)** circRNA translation is increased by METTL3/14. Experimental procedures are the same as in B. **(E)** 293 cells transiently expressing circRNA with RSV were subjected to heat shock stress. Cells were collected at 0, 1, 2, 4 h after heat shock (1 h at 42 °C) to analyze RNA and protein expression using semi-quantitative RT-PCR and western blots. N, no heat shock. **(F)** Quantification of circRNA RNA and GFP protein levels in heat-shocked cells. GAPDH levels were used for normalization (n = 3, mean ± SD).

N⁶-methylation of adenosine has been shown to affect mRNA translation under heat shock stress^31,32. In line with this idea, translation of GFP protein from the m⁶A-containing circRNA increased in a time-dependent manner during the 37 °C recovery phase following 1-h treatment at 42 °C (Figure 2E and 2F), while the levels of the GFP circRNA remaining unchanged (Figure 2E and 2F). This finding suggests that m⁶A-mediated protein translation, particularly from circRNAs, may be an important element in cellular stress responses. One possible mechanism by which heat shock stress could enhance circRNA translation is by translocation of YTHDF2 from cytosol into nucleus upon heat shock to block the m⁶A “eraser” FTO³¹, thus increasing the level of m⁶A modification in circRNAs. Alternatively, heat shock stress can reduce cap-dependent translation globally and cause cells to shift toward cap-independent translation through IRESs (reviewed by Spriggs et al.³⁷); thus cap-independent translation from circRNA would be increased accordingly.

Protein factors required for m⁶A-initiated circRNA translation

CircRNA translation needs to be initiated through a mechanism fundamentally different from linear mRNA that is initiated by ribosomal scanning. Eukaryotic translation is initiated by eIF4 complex³⁸, of which eIF4E binds to the mRNA cap and eIF4G serves as a protein-binding scaffold to assemble the initiation complex (Figure 3A). Activated 40S ribosomal subunit is subsequently recruited to mRNA through binding of eIF3 to eIF4G³⁸. In the cap-independent translation, a non-canonical eIF4G protein (eIF4G2) directly recognizes an IRES to initiate eIF4 complex assembly in the absence of eIF4E (Figure 3A), leading to translation initiation³⁹. Therefore, to further understand m⁶A-driven translation of circRNAs, we investigated the possible involvement of eIF4G2 in the translation initiation of circRNAs. Using stable cell lines expressing two shRNAs against eIF4G2 (Figure 3B), we examined the expression of GFP encoded by either circRNA or linear mRNA. As expected, eIF4G2 depletion significantly reduced protein translation from circRNA but had no effect on translation from linear mRNA (Figure 3C). Similarly, depletion of eIF3A, an eIF3 subunit bound to viral IRES⁴⁰, modestly reduced protein translation from circRNA but did not affect linear mRNA translation (Figure 3D and 3E). This result is consistent with previous finding that eIF3A is involved in the m⁶A-promoted translation³². As expected, depletion of eIF4G2 by RNAi had little effect on global protein translation rate, whereas eIF3A depletion significantly reduced global protein synthesis (Supplementary information, Figure S4A-S4B). This result is consistent with that eIF4G2 knockdown has more obvious effect in reducing protein translation from circRNA. We further confirmed that the overexpression of eIF4G2 indeed increased GFP translation from circRNA but not from linear mRNA by co-expressing eIF4G2 with the circRNA or linear mRNA encoding for GFP (Figure 3F). Collectively, these results suggest that the translation of circRNA may be initiated by an eIF4G2-dependent mechanism similar to other IRESs.

Initiation factors eIF3A and eIF4G2 affect circRNA translation. **(A)** Schematic diagram of cap-dependent and cap-independent translation initiation in eukaryotic cells. In cap-dependent translation, eIF4 complex recognizes m⁷G and recruits 43S complex to mRNA to initiate translation. In cap-independent translation, eIF4G2 directly binds to the mRNA and recruitments 43S complex to mRNA to initiate translation. **(B)** eIF4G2 knockdown by two different shRNAs stably expressed in 293 cells. **(C)** RNAi of eIF4G2 decreases circRNA translation. 293 cells stably expressing shRNAs were transfected with circRNA reporters containing RSV sequence or linear GFP reporters (pEGFP-C1). RNA and protein expression levels were analyzed by semi-quantitative RT-PCR and western blots (left). Quantification of GFP protein levels was normalized to GAPDH (right; n = 3, mean ± SD). **(D)** eIF3A knockdown by two different shRNAs stably expressed in 293 cells. **(E)** RNAi of eIF3A decreases circRNA translation. Experimental procedures are same as in C (n = 3, mean ± SD). **(F)** eIF4G2 overexpression increases the circRNA translation. circRNA with RSV and eIF4G2 overexpression plasmids were co-transfected into 293 cells, and the levels of proteins and circRNAs were detected with western blots and RT-PCR. **(G)** Expression vectors of eIF4G2 and YTDHF3 with different epitope tags were co-expressed in 293 cells, and the anti-Flag or anti-HA antibodies were used for precipitation. **(H)** Relative fraction of eIF4G2, eIF3A and eIF4G1-binding sites in mRNAs, circRNAs and circRNAs with m⁶A site and translation initiation site. **(I)** eIF4G2 binding site and m⁶A peak in circular ARIH2 RNA.

We next examined whether the m⁶A reader proteins are required in the translation of m⁶A-containing circRNAs. We found that depletion of YTHDF1 by RNAi did not affect translation from circRNA (Supplementary information, Figure S5A-S5B), and YTHDF2 depletion slightly inhibited GFP translation from both circRNA and linear RNA (Supplementary information, Figure S5C and S5D). However, the depletion of YTHDF3 significantly inhibited GFP production from circRNA but not from linear mRNA (Supplementary information, Figure S5E and S5F), suggesting that YTHDF3 is essential for circRNA translation driven by m⁶A. Consistently, YTHDF3 can directly interact with eIF4G2 as judged by the reciprocal co-immunoprecipitation assays (Figure 3G), suggesting a possible role of YTHDF3 in recruiting eIF4G2 to the m⁶A containing RNA. Interestingly, compared to canonical translation from linear mRNAs, m⁶A-driven circRNA translation was more sensitive to treatment of hygromycin B (Supplementary information, Figure S6), a well-studied antibiotic that inhibits ribosome translocation during translation elongation⁴¹, raising the possibility that different modes of translation initiation may affect elongation.

In addition, we examined the possible translation of endogenous circRNA through genomic analyses of the in vivo binding sites for various initiation factors (from CLIP-seq data set of ENCODE project, https://www.encodeproject.org), the transcriptome-wide m⁶A profiles^20,21, and the mapping of translation initiation sites (TIS)⁴². Because CLIP reads across the circular-specific splice junction are too low for a meaningful comparison, we used the total reads that may be contributed by both linear and circRNAs. We computed the frequencies of total mRNAs that are bound by eIF4G2, eIF3A or eIF4G1, and compared them to those of the previously reported circRNAs region³. We found that, although circRNAs generally have reduced binding of translation initiation factors compared to mRNAs (Figure 3H, white vs light blue bars), circRNAs containing pre-mapped m⁶A and TIS are about twice as often bound by eIF4G2 and eIF3A compared to mRNAs (Figure 3H, white vs dark blue bars). The observation that eIF4G2 and eIF3A prefer to bind circRNAs with m⁶A and TIS sites is consistent with our findings that these factors promoted circRNA translation (Figure 3B-3F). As a control, we found both types of circRNAs have reduced binding to the initiation factor eIF4G1 that is required for cap-dependent translation, again suggesting that circRNA translation is initiated in a cap-independent fashion. As expected, the binding sites of eIF4G2 often overlap with m⁶A modification near the predicted TIS, as exemplified by the circRNA of E3 ubiquitin-protein ligase ARIH2 (Figure 3I).

Identification of endogenous circRNAs that contain m⁶A modification

To assess the importance of m⁶A-mediated translation of circRNAs in cells, we conducted parallel sequencing to identify the m⁶A-containing endogenous circRNAs (circRNA-m⁶A-seq) using m⁶A immunoprecipitation of the RNA samples treated with exoribonuclease RNase R (Figure 4A). We mapped the RNA reads from both input and m⁶A-IP samples, and defined circRNAs according to the reads that span a back-splice junction. We identified 85 circRNAs (supported by 2 450 back-splicing junction reads) with m⁶A as judged by m⁶A-IP (Supplementary information, Table S2). We further tested eight circRNAs by RT-PCR with primers across back-spliced junction, and confirmed that all circRNAs tested are enriched by m⁶A-antibody precipitation compared to treatment with control antibody (Figure 4B). By comparing to the circRNA levels in the input sample, we found that circRNA-m⁶A-seq is very sensitive and can detect circRNAs with low m⁶A modification rate (0.6% m⁶A modification rate in cRBM5, Figure 4B). In addition, compared to the read density obtained using control antibodies, both m⁶A sites and eIF4G2-binding sites were relatively enriched in circRNAs that contain a putative start codon AUG (Figure 4C) (see Materials and Methods section). Intriguingly, the m⁶A displayed a broad peak located upstream of eIF4G2-binding sites, supporting a potential functional cooperation between these two elements in driving circRNA translation.

Transcriptome-wide sequencing of m⁶A-modified circRNAs and predictive identification of endogenous circular mRNAs. **(A)** Schematic diagram of circRNA-m⁶A-seq protocol. **(B)** Validation of m⁶A-modified circRNAs in immunoprecipitated samples using m⁶A antibody or control antibody. Arrows indicate predicted circRNA size in the lanes with multiple bands, input (10%) indicates 10% of total input RNAs were used for RT-PCR, % m⁶A indicates percentage of m⁶A modification in target circRNAs (m⁶A antibody/(10× input (10%)). **(C)** Positional distribution of relative density for m⁶A and eIF4G2 binding site in circRNAs as compared to the respective control samples. The putative start codon was used as arbitrary marker to align the plot. When multiple AUG sites are presented in the circRNAs, the AUG that generates the longest ORF is use. **(D)** Number of circRNA reads (i.e., back-splice junction reads) per million of total reads in circRNA-m⁶A-seq samples treated with or without RNase R. **(E)** Schematic diagram of translatable circRNA prediction pipeline. Left, computational filters sequentially applied to identify circRNAs that contain m⁶A site, translation initiation site (TIS) and an ORF with sufficient length. The circRNA, m⁶A and TIS-sequencing data are from published results (see Materials and Methods section). Right, the numbers of circRNAs passing each filters. **(F)** Detection of predicted circRNAs from various host genes in polysome fractions with RT-PCRs. In the lanes with multiple bands, the circRNAs with expected size are indicated with arrows. IP, immunoprecipitation.

On the basis of the number of circRNA reads recovered from m⁶A IP vs the total input circRNA reads sequenced, we estimate that ∼13% of total circRNAs had the m⁶A modification (Figure 4D, 2.6/20=13%). This is probably a conservative estimate due to our stringent experimental design and data analysis, in which only a fraction of m⁶A-containing RNAs were precipitated and only the fragments containing m⁶A sites adjacent to the back-splice junction were recovered as positive circRNA reads (Figure 4A). Nevertheless, these data suggest that circRNAs are extensively modified by m⁶A.

circRNAs with coding potential are common in human transcriptome

On the basis of the above observations, we developed a computational pipeline to predict endogenous translatable circRNAs using a series of filters (Figure 4E). Starting from a trustable set of 7 771 circRNAs previously discovered via sequencing of RNAs resistant to RNase R from Hs68 cells³, we first identified 623 circRNAs containing m⁶A peaks (as judged by m⁶A-seq, from HEK293 cells)^20,21, and further reduced these candidates to 124 circRNAs using the pre-mapped TIS (from HEK293 cells) as a filter⁴². Finally we applied an arbitrary filter of ORF length and selected only those 25 circRNAs with a sufficiently long ORF (≥ 150 nt or encoding a protein longer than 50 aa; Figure 4E). Further validation with polysome profiling confirmed that 10 out of 12 tested circRNAs from selected 25 circRNAs were indeed associated with polysomes, with the circRNA from KLHL24 being the only clear negative (Figure 4F). The other circRNA, cFAM115A, was not detected in total RNAs, presumably because it is not expressed in the cell line we tested (Figure 4F). We also used cMART3 and cARL67P1 as examples to examine the circRNA distribution in the entire polysome gradient. We found that cMART3 was present in all fractions including monosome- and polysome-bound fractions (Supplementary information, Figure S7). As a negative control, cARL67P1, a circRNA that did not pass our computational filters, was not associated with polysome (fractions 8-14) (Supplementary information, Figure S7). As we started with a small trusted data set of 7 700 circRNAs from human fibroblasts, this prediction pipeline is expected to have low sensitivity but high specificity. In addition, because of the incomplete coverage of m⁶A-seq and the limited number of pre-mapped TIS, together with the different cell lines used in existing data, we expect that the 25 circRNAs obtained through these stringent filters only represent a very small fraction of all circRNAs with coding potential in cultured cells. Future studies in a more consistent cellular context will likely increase the sensitivity of detection of translatable circRNAs.

The above findings inspired us to experimentally identify the circRNAs undergoing active translation in human transcriptome. We first used sucrose gradient centrifugation to purify polysome-associated RNAs, and subsequently treated the purified samples with RNase R and subjected them to high-throughput sequencing (Figure 5A). The resulting reads were mapped to human genome using CIRCexplorer to identify back-splicing junctions⁸. We identified 250 circRNAs that are associated with polysomes (Figure 5B; Supplementary information, Table S3), with ∼0.6 circRNA per million total reads in RNase R-treated samples (Figure 5C). It is worth noting that this result was obtained again using stringent criteria because only the polysome-associated fragments containing back-splice junctions were considered as circRNA reads (see Materials and Methods section). As a result, only 1 out of 25 translatable circRNA predicted from above pipeline was recovered in the unbiased polysome profiling/circRNA-seq, suggesting that our methods are far from saturation. When comparing with all circRNAs, we found that polysome-associated circRNAs tend to have fewer exons and are generally shorter (Figure 5D). On the other hand, polysome-associated circRNAs have longer putative ORFs as compared to all circRNAs in general (Figure 5E), consistent with their expected coding potential. This result also suggests that a larger fraction of polysome-associated circRNAs accounts for putative ORFs.

Systematic discovery of circular mRNA in human cells. **(A)** Schematic diagram of polysome-bound circRNA-seq protocol. **(B)** Polysome fractionation of HeLa cell lysate. All Fractions were collected. Fraction 8 was marked as R1, fraction 11 was marked as R2 and fraction 13-20 were combined together and marked as R3. Total RNA from R1, R2 and R3 were isolated separately. **(C)** Numbers and frequencies of circRNA junction reads detected by polysome-bound circRNA seq in samples with or without RNase R treatment. **(D)** Comparison of the number of exons and the length between polysome-associated circRNAs and the total circRNAs. All P-values were calculated with KS-test. **(E)** Accumulative distribution of the length for putative ORFs in polysome-associated circRNAs and the total circRNAs. **(F)** Relative ratios of monosome- and polysome-bound RNAs vs unbound (free) RNAs for several circRNAs. The linear mRNA of GAPDH was used as control. **(G)** HeLa cells were treated with 200 μM puromycin or cycloheximide, lysed and separated by sucrose gradient centrifugation. The RNAs from light fractions (< 60S) and heavy fraction (> 2 ribosomes) were purified and used as template for real-time RT-PCR reactions. The relative levels of RNAs associated with the heavy fraction vs the light fraction were plotted for each circRNA or GAPDH mRNA.

We further examined the monosome- and polysome-associated circRNAs using quantitative RT-PCR (Figure 5F), and confirmed that all seven tested circRNAs were indeed associated with ribosomes, with five out of seven having more RNAs associated with ribosomes than with the unbound fraction. Moreover, the distribution of circRNAs showed greater bias towards monosomes than polysomes compared to the highly expressed GAPDH mRNA that was mainly bound to polysomes (Figure 5F), suggesting a possibly slower initiation process. Nevertheless, for some circRNAs, a large fraction was associated with ribosomes (e.g., > 20-fold more circRNAs are bound by ribosomes in cFKBP8 and cZCCHC7), suggesting that they are under active translation. In addition, the polysome-associated circRNAs (i.e., heavy fractions) were significantly reduced upon treatment of cells with the puromycin that specifically disrupts active translating ribosomes (Figure 5G, compare the treatment with cycloheximide vs puromycin), suggesting that association of circRNAs with polysomes is likely due to active translation. Taken together, our observations demonstrate that m⁶A-containing circRNAs with coding potential are widespread in human transcriptome.

Endogenous peptide translated from circular mRNA junction

To directly identify endogenous proteins encoded by circRNAs, we generated a customized database containing peptides encoded by RNA sequences spanning back-splice junctions of all known circRNAs⁴³, and combined this peptide database with all human proteins from UniProt to search tandem mass spectrometry (MS/MS) data from total lysates of 293 cells. Our search was performed against a database that includes reversed entries, which minimizes the false discovery rate from the random noise of MS/MS data.

We identified 33 peptides (19 unique peptide in total, some being identified multiple times in replicated samples) encoded by the back-splice junctions of circRNAs that do not match any known proteins from UniProt (Supplementary information, Table S4, sheet 1). The collision-induced dissociation MS/MS spectrum from two representative examples was shown (Figure 6A). To further validate candidate circRNA-encoded peptides, we chemically synthesized these two circRNA-encoded peptides and used them to re-run the MS/MS analysis. The collision-induced dissociation MS/MS spectrum from both synthesized peptides closely matched those of the original peptide identified from cell lysate (comparing Figure 6B with 6A), suggesting that the peptides identified by proteomic analyses are likely produced from circRNA translation. These peptides merely represent a small fraction of the circRNA-encoded proteome, because only the peptide sequence spanning the back-splice junction can be unambiguously identified as circRNA-encoded products. However, we did not find any functional enrichment of the host genes of these circRNAs despite circRNA translation being elevated by cellular stress, probably because of the limited coverage of this method.

Identification of circRNA junction-coded peptides. **(A)** The collision-induced dissociation (CID) MS/MS spectrum of the [M+2H]³⁺ ion at *m/z* 465.29 of the human cDGKB peptide ISLSILQR and of human cMYO15B peptide LLGAIAAR ([M+2H]²⁺ ion at *m/z* 392.75) shown as an example. Annotated b- and y-ions are listed above and below the peptide sequence marked in red and green color, respectively. **(B)** The CID spectrum of MS/MS for corresponding synthetic peptides match to human cDGKB peptide ISLSILQR and cMYO15B peptide LLGAIAAR were shown as confirmation for the product of circRNA translation. Annotated b- and y-ions are listed above and below the peptide sequence marked in red and green color, respectively. **(C)** A schematic diagram of circRNA translation driven by m⁶A.

Discussion

In this report, we serendipitously found that a variety of sequences can function as IRESs to drive circRNA translation, and also observed that m⁶A is responsible for the promiscuous circRNA translation (Figure 1). We further demonstrated that circRNAs contain extensive m⁶A modifications, which are sufficient to drive protein translation in a cap-independent fashion involving the m⁶A reader YTHDF3 and the translation initiation factors eIF4G2 and eIF3A (Figure 6C). Consistently, many circRNAs were found to be associated with polysomes, suggesting that a sizable fraction of endogenous circRNAs (but not all circRNAs) is indeed translated. Searches for polysome-associated circRNAs were previously attempted with conventional approaches, but no significant hits were identified^3,14,15. Our approach has increased sensitivity by starting with a large amount of raw material, using RNase R treatment to enrich circRNAs, and sequencing more than 700 million reads. Although we only identified a small number of circRNA-encoded peptides due to the stringent filters used in MS/MS analyses (unique mapped peptides across back-splicing site), m⁶A modifications are very common in circRNAs as judged by circRNA-m⁶A-seq, suggesting that translatable circRNAs may be common in human transcriptome. These results challenge the stereotypic view of circRNAs as non-coding RNAs, and open new paradigms for potential function of circRNAs.

This finding leads to many intriguing questions. For example, what are the possible functions of circRNA-encoded proteins? We found that circRNA translation is increased under heat shock condition, raising the possibility that circRNA-encoded proteins may play roles in stress response. It has been proposed that cap-independent translation through IRESs is increased in cancers to promote translation of genes that play important roles in stress responses, development, apoptosis and cell cycle regulation⁴⁴. Because circRNA can only be translated through cap-independent translation, we speculate that the translation from circRNA may be more prevalent in cancer cells, a question to be addressed in future. In addition, many circRNAs code for N-terminal protein fragments, potentially generating protein isoforms that have overlap sequences with conventionally tranlated protein. As a result, it is possible that the circRNA-coded isoforms can interfere with the function of the respective canonical protein. Interestingly, we also found that several short sequences without consensus m⁶A motifs (RRACH) can also drive translation in our circRNA reporter (data not shown), implying either that circRNA translation could also be driven through some m⁶A-independent mechanisms, or alternatively the methylation of adenosine may occur in non-canonical sites.

Our findings have some fundamental implications: While most circRNAs may be classified as non-coding RNAs, some of them likely function as mRNAs because they contain m⁶A and TIS, and are associated with polysomes, blurring the definition of coding and non-coding RNAs. Previously some non-coding RNAs are reported to have coding potential from 5′ ORFs^45,46, however the extensive cap-independent translation driven by m⁶A in vivo suggests an even more pervasive translation of non-coding RNAs from internal ORFs. Therefore, the translational landscape of a cell may be much more complicated than currently appreciated, i.e., the alternative ORFs commonly found in mRNAs may indeed be translated by internal m⁶A sites. It was generally known that many mass spectrum peaks cannot be reliably assigned to known proteins in proteomic studies and this was mainly attributed to existence of unknown modifications. However, alternative reading frames of mRNA may contribute to some of the unassigned peaks. Just as one gene can produce multiple mRNA isoforms through alternative splicing, we speculate that extensive cap-independent translation may enable one mRNA to be translated into multiple proteins.

Materials and Methods

Plasmid construction, cell culture and transfection

The circRNA reporters containing split GFP¹¹ were inserted with different human endogenous IRES, control sequences and putative m⁶A motifs using EcoRI and EcoRV cloning sites in the reporter (see Supplementary information, Table S1 for inserted sequences). The expression vector for FTO was constructed by cloning HA-tagged FTO cDNA into pcDNA5/FRT/TO using NheI and KpnI sites. The pcDNA3-Flag-METTL3 and pcDNA3-Flag-METTL14 plasmids were obtained from Addgene, and the pcDNA3-Flag-eIF4G2 expression vector is the generous gift from Dr Nahum Sonenberg.

293 and HeLa cells were cultured with DMEM medium containing 10% of FBS. To transiently express circRNA reporter, 293 cells were plated into 24-well plates 1 day before transfection. Of note, 1 μg of the plasmids was transfected using lipofectamine 2000 according to the manufacturer's manual. Transfected cells were collected 48 h after transfection for further RNA and protein analysis. For co-transfection, the circRNA reporter was transfected with protein overexpression plasmids in ratio 1:3.

Semi-quantitative RT-PCR and real-time PCR

Total RNAs were isolated using TRIZOL reagent and treated with DNase I (37 °C, 1 h, followed by heat inactivation). For semi-quantitative PCR, 2 μg total RNA was reverse-transcribed with SuperScript III (Invitrogen), and one-tenth of the RT product was used for PCR (22 cycles, supplemented with trace amount of Cy5-dCTP). The products were separated on 10% PAGE gels, scanned with a Typhoon 9400 scanner, and quantified with ImageQuant 5.2 or stained by SYBR Green I (Thermo Scientific). The real-time PCR was performed using the Maxima SYBR Green qPCR Master Mix (Thermo Scientific) and a 7500 real-time PCR system (Life Technologies) according to the manufacturer's instructions.

Western blot

Cells were lysed in buffer containing 50 mM HEPES, 150 mM NaCl, 1 mM EDTA, 1% (w/v) CHAPS and Sigma protease inhibitor cocktail, and the total cell lysates were resolved with SDS-PAGE gels. The following antibodies were used: GFP antibody (632381) from Clontech; GAPDH antibody (sc-32233) from Santa Cruz; Flag antibody (F1804) from Sigma; HA antibody (SC-805) from Santa Cruz. HRP-linked secondary antibodies were used and blots visualized with the ECL kit (Bio-Rad).

Gene knockdown with lentiviral shRNA

shRNA plasmids were purchased from the TRC library through GE Dharmacon. shRNA plasmids were transfected into 293 cells with psPAX2 and pMD2.G in ratio 4:3:1. Virus was collected at 48 h after transfection. 293 cells were infected by the lentivirus for 48 h followed by 2 μg/ml puromycin selection.

m⁶A immunoprecipitation and quantification

Total RNAs were isolated from cells and treated with DNase I (37 °C, 1 h, followed by heat inactivation). 20 μg total RNA was incubated with 2 μg anti-m⁶A antibody (Synaptic Systems 202003) or GAPDH antibody in 200 μl IP buffer (10 mM Tris-HCl, 150 mM NaCl, 0.1% (vol/vol) Igepal CA-630, 2 mM ribonucleoside vanadyl complexes (Sigma-Aldrich) and 0.5 U/μl RNasin (Promega)) for 2 h at 4 °C. During the incubation, the protein A/G PLUS-agarose beads (Santa Cruz) were blocked by IP buffer supplemented with BSA (0.5 mg/ml) for 2 h at 4 °C, washed three times in 500 μl IP buffer, and then mixed with the total RNAs/anti-m⁶A antibodies in IP buffer (2 h at 4 °C). After the incubation, beads were washed three times with 500 μl IP buffer, and bound RNAs isolated with TRIZOL reagents. Recovered RNA was then analyzed by real-time RT-PCR.

Polysome fractionation and sequencing

HeLa cells were pre-treated with 200 μM cycloheximide for 5 min at 37 °C and washed with ice-cold PBS containing 200 μM cycloheximide. Cells were then lysed with polysome lysis buffer (400 mM KOAc (pH 7.5), 25 mM K-HEPES, 15 mM Mg(OAc)₂, 1 mM DTT, 200 μM cycloheximide, 1% NP-40, 0.5% deoxycholate, 1 mM PMSF and 50 U/ml RNasin) for 10 min on ice. Cell debris was removed by centrifugation at 14 000 rpm for 10 min at 4 °C, and the supernatant was loaded onto 10-ml continuous 15-50% sucrose gradients containing 400 mM KOAc (pH 7.5), 25 mM K-HEPES, 15 mM Mg(OAc)₂, 200 μM cycloheximide and 50 U/ml RNasin. The samples were centrifuged at 4 °C for 3 h at 100 000× g in an SW41 rotor (Beckman), and the fractions were collected using a Brandel Fractionation System and an Isco UA-6 ultraviolet detector used to produce polysome profiles for gradients. Total RNA was extracted from each fraction by TRIZOL.

Ribosomal RNA was depleted from these fractionated RNAs by RiboMinus Human/Mouse Transcriptome Isolation Kit. Half of the recovered RNA was treated with RNase R at 37 °C for 1 h followed by ethanol precipitation. The purified RNA was used for library preparation with KAPA-stranded RNA-seq kit.

Analysis of m⁶A motifs in circRNAs

The circRNA data set was derived from a previous study³ and introns removed based on annotation from circBase (http://www.circbase.org/). For comparison, we also analyzed total mRNAs, which were separated into coding sequences, exons, introns, transcription start sites, transcription termination sites, start codons and stop codons based on Refseq gene annotation. We determined the frequency of m⁶A motif (HRRACH, H=A/C/T, R=A/G) based on counts of these motifs normalized by the length of certain region. As control, we calculated the average frequency of 1 000 random 6-mers in the circRNA data set.

Percent of binding site of eIF4G2, eIF3A and eIF4G1

We defined circRNAs including m⁶A peaks²⁰ and ribosome binding sites⁴⁷ as potential coding circRNAs. Using CLIP-seq data sets of eIF4G2 and eIF4G1 from the ENCODE project (https://www.encodeproject.org), and eIF3A from a published data set³², we computed the percent of these factors' binding sites contained in mRNAs, circRNAs and potential coding circRNAs.

Analysis of the density of m⁶A-seq peaks and CLIP-seq data, and circRNA

The pre-mapped m⁶A-seq reads were downloaded from previous datasets^20,21, and the reads number calculated using sliding 20 nt windows along the full-length circRNA. The CLIP-seq data were downloaded from ENCODE. We calculated the mean coverage of a specific region for each window of the immunoprecipitated samples and controls. To calculate the enrichment of signals in each window, the coverage of the IP samples (m⁶A or eIF4G2) was normalized by the mean coverage of entire gene, then divided by normalized coverage of corresponding window in control samples.

Polysome- and m⁶A-associated circRNA detection

We detected circRNA using CIRCexplorer pipeline. First reads were aligned to GRCh37 human genome with Tophat, and then unmapped reads were realigned with Tophat-Fusion. Finally, back-spliced junction reads were annotated with Refseq gene annotation.

Mass spectrometry detection of circRNA-coded proteins

Proteins were precipitated with trichloroacetic acid. The protein pellet was dried either by air or by using a Speedvac for 1-2 min. The pellet was subsequently dissolved in 8 M urea, 100 mM Tris-HCl, pH 8.5. TCEP (final concentration is 5 mM) (Thermo Scientific) and iodoacetamide (final concentration is 10 mM) (Sigma) for reduction and alkylation were added to the solution and incubated at room temperature for 20 and 15 min, respectively. The protein mixture was diluted four times and digested with Trypsin at 1:50 (w/w) (Promega, http://www.promega.com/).

For multidimensional protein identification technology (MudPIT), total peptide mixtures were pressure-loaded onto a biphasic-fused silica capillary column. The entire column setting (biphasic column-union-analytical column) was placed in line with an Agilent 1200 quaternary HPLC pump (Palo Alto, CA, USA) for MS analysis. The digested proteins were analyzed using an eight-step MudPIT separation method as described previously⁴⁸.

A back-splice junction database was constructed based on circBase⁴³, from which the circRNA sequences were extracted using BEDTools⁴⁹ using hg19 annotation of human genome. The peptides spanning the back-splice junctions were translated in all reading frames from 5′ to 3′. We combined all human protein sequences from UniProt and back-splice junction databases as a customized database to search the spectra. Peptides obtained from MS/MS across back-splice site were also used to search non-redundant human protein database with BALSTP to ensure that these peptides are not from any known human protein. The acquired MS/MS data were analyzed against the customized protein database using Protein Discoverer 2.0 (Thermo Scientific). Mass tolerances for precursor ions were set at 20 ppm and for MS/MS were set at 0.8 Da. Trypsin was defined as cleavage enzyme with three most miss cleavage, the mass of the amino acid cysteine was statically modified by + 57.02146 Da, the FDR was set at 0.01 for protein identification by searching against a database that includes reversed entries.

See Supplementary information, Data S1 for detailed methods.

Author Contributions

YY and ZW designed the study and prepared the manuscript. YY, MM and XS conducted experiments, and XF conducted the computational analyses. PW and CW conducted the mass spectrometry analyses. YZ and LC helped in detecting circRNA with northern blot. YY, JY, YW and XX helped to analyze the data and provided comments in paper revision.

Competing Financial Interests

The authors declare no competing financial interests.

Acknowledgments

We want to thank Dr Sonenberg for generous gift of eIF4G2 expression vectors, ENCODE Consortium and Dr Gene Yeo's lab for generating CLIP-seq data sets of eIF4G1, eIF4G2 and eIF3A, and Dr Bill Marzluff for help in polysome fractionation. We thank Shuaixin Gao from the Mass Spectrometry Facility of National Center for Protein Science Shanghai for assistance with MS data collection, database search and quantitation analysis. We thank Dr Xiaoling Li for critical reading of the manuscript. The work is supported by National Natural Science Foundation of China (31570823 to ZW, 31300655 to YY, and 91540110 to YW). The work is also supported by International Postdoctoral Exchange Fellowship to YY, and NIH grant R01HG006264 to XX. MM is supported by a Chinese Scholarship Council Scholarship (201406740040).

Footnotes

(Supplementary information is linked to the online version of the paper on the Cell Research website.)

Supplementary Material

Supplementary information, Table S1

Click here for additional data file.^{(14.9KB, xlsx)}

Supplementary information, Table S2

Click here for additional data file.^{(82.3KB, xlsx)}

Supplementary information, Table S3

Click here for additional data file.^{(53.5KB, xlsx)}

Supplementary information, Table S4

Click here for additional data file.^{(13.7KB, xlsx)}

Supplementary information, Figure S1

circRNA is translated by m⁶A motif

Click here for additional data file.^{(9.4MB, pdf)}

Supplementary information, Figure S2

circRNA validation by northern blot and RNase R treatment.

Click here for additional data file.^{(559.7KB, pdf)}

Supplementary information, Figure S3

Translation of circRNA in different cell types.

Click here for additional data file.^{(214.9KB, pdf)}

Supplementary information, Figure S4

eIF4G2 and eIF3A knockdown affects global protein synthesis rate

Click here for additional data file.^{(1MB, pdf)}

Supplementary information, Figure S5

m⁶A reader protein affect translation of circRNAs

Click here for additional data file.^{(457.7KB, pdf)}

Supplementary information, Figure S6

m⁶A driven circRNA translation is more sensitive to treatment of hygromycin B.

Click here for additional data file.^{(141.9KB, pdf)}

Supplementary information, Figure S7

Entire expression profile across the polysome gradient.

Click here for additional data file.^{(1.7MB, pdf)}

Supplementary information, Data S1

Supplementary experimental procedures

Click here for additional data file.^{(140.9KB, pdf)}

References

Cocquerelle C, Daubersies P, Majerus MA, Kerckaert JP, Bailleul B. Splicing with inverted order of exons occurs proximal to large introns. EMBO J 1992; 11:1095–1098. [DOI] [PMC free article] [PubMed] [Google Scholar]
Nigro JM, Cho KR, Fearon ER, et al. Scrambled exons. Cell 1991; 64:607–613. [DOI] [PubMed] [Google Scholar]
Jeck WR, Sorrentino JA, Wang K, et al. Circular RNAs are abundant, conserved, and associated with ALU repeats. RNA 2013; 19:141–157. [DOI] [PMC free article] [PubMed] [Google Scholar]
Memczak S, Jens M, Elefsinioti A, et al. Circular RNAs are a large class of animal RNAs with regulatory potency. Nature 2013; 495:333–338. [DOI] [PubMed] [Google Scholar]
Salzman J, Gawad C, Wang PL, Lacayo N, Brown PO. Circular RNAs are the predominant transcript isoform from hundreds of human genes in diverse cell types. PLoS One 2012; 7:e30733. [DOI] [PMC free article] [PubMed] [Google Scholar]
Salzman J, Chen RE, Olsen MN, Wang PL, Brown PO. Cell-type specific features of circular RNA expression. PLoS Genet 2013; 9:e1003777. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hansen TB, Jensen TI, Clausen BH, et al. Natural RNA circles function as efficient microRNA sponges. Nature 2013; 495:384–388. [DOI] [PubMed] [Google Scholar]
Zhang XO, Wang HB, Zhang Y, Lu X, Chen LL, Yang L. Complementary sequence-mediated exon circularization. Cell 2014; 159:134–147. [DOI] [PubMed] [Google Scholar]
Liang D, Wilusz JE. Short intronic repeat sequences facilitate circular RNA production. Genes Dev 2014; 28:2233–2247. [DOI] [PMC free article] [PubMed] [Google Scholar]
Starke S, Jost I, Rossbach O, et al. Exon circularization requires canonical splice signals. Cell Rep 2015; 10:103–111. [DOI] [PubMed] [Google Scholar]
Wang Y, Wang Z. Efficient backsplicing produces translatable circular mRNAs. RNA 2015; 21:172–179. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ashwal-Fluss R, Meyer M, Pamudurti NR, et al. circRNA biogenesis competes with pre-mRNA splicing. Mol Cell 2014; 56:55–66. [DOI] [PubMed] [Google Scholar]
Chen CY, Sarnow P. Initiation of protein synthesis by the eukaryotic translational apparatus on circular RNAs. Science 1995; 268:415–417. [DOI] [PubMed] [Google Scholar]
Guo JU, Agarwal V, Guo H, Bartel DP. Expanded identification and characterization of mammalian circular RNAs. Genome Biol 2014; 15:409. [DOI] [PMC free article] [PubMed] [Google Scholar]
You X, Vlatkovic I, Babic A, et al. Neural circular RNAs are derived from synaptic genes and regulated by development and plasticity. Nat Neurosci 2015; 18:603–610. [DOI] [PMC free article] [PubMed] [Google Scholar]
Li S, Mason CE. The pivotal regulatory landscape of RNA modifications. Annu Rev Genomics Hum Genet 2014; 15:127–150. [DOI] [PubMed] [Google Scholar]
Wei CM, Gershowitz A, Moss B. Methylated nucleotides block 5′ terminus of HeLa cell messenger RNA. Cell 1975; 4:379–386. [DOI] [PubMed] [Google Scholar]
Csepany T, Lin A, Baldick CJ Jr, Beemon K. Sequence specificity of mRNA N⁶-adenosine methyltransferase. J Biol Chem 1990; 265:20117–20122. [PubMed] [Google Scholar]
Harper JE, Miceli SM, Roberts RJ, Manley JL. Sequence specificity of the human mRNA N⁶-adenosine methylase in vitro. Nucleic Acids Res 1990; 18:5735–5741. [DOI] [PMC free article] [PubMed] [Google Scholar]
Meyer KD, Saletore Y, Zumbo P, Elemento O, Mason CE, Jaffrey SR. Comprehensive analysis of mRNA methylation reveals enrichment in 3′ UTRs and near stop codons. Cell 2012; 149:1635–1646. [DOI] [PMC free article] [PubMed] [Google Scholar]
Dominissini D, Moshitch-Moshkovitz S, Schwartz S, et al. Topology of the human and mouse m⁶A RNA methylomes revealed by m⁶A-seq. Nature 2012; 485:201–206. [DOI] [PubMed] [Google Scholar]
Bokar JA, Shambaugh ME, Polayes D, Matera AG, Rottman FM. Purification and cDNA cloning of the AdoMet-binding subunit of the human mRNA (N⁶-adenosine)-methyltransferase. RNA 1997; 3:1233–1247. [PMC free article] [PubMed] [Google Scholar]
Liu J, Yue Y, Han D, et al. A METTL3-METTL14 complex mediates mammalian nuclear RNA N⁶-adenosine methylation. Nat Chem Biol 2014; 10:93–95. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wang Y, Li Y, Toth JI, Petroski MD, Zhang Z, Zhao JC. N⁶-methyladenosine modification destabilizes developmental regulators in embryonic stem cells. Nat Cell Biol 2014; 16:191–198. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ping XL, Sun BF, Wang L, et al. Mammalian WTAP is a regulatory subunit of the RNA N⁶-methyladenosine methyltransferase. Cell Res 2014; 24:177–189. [DOI] [PMC free article] [PubMed] [Google Scholar]
Jia G, Fu Y, Zhao X, et al. N⁶-methyladenosine in nuclear RNA is a major substrate of the obesity-associated FTO. Nat Chem Biol 2011; 7:885–887. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zheng G, Dahl JA, Niu Y, et al. ALKBH5 is a mammalian RNA demethylase that impacts RNA metabolism and mouse fertility. Mol Cell 2013; 49:18–29. [DOI] [PMC free article] [PubMed] [Google Scholar]
Yue Y, Liu J, He C. RNA N⁶-methyladenosine methylation in post-transcriptional gene expression regulation. Genes Dev 2015; 29:1343–1355. [DOI] [PMC free article] [PubMed] [Google Scholar]
Meyer KD, Jaffrey SR. The dynamic epitranscriptome: N⁶-methyladenosine and gene expression control. Nat Rev Mol Cell Biol 2014; 15:313–326. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wang X, Zhao BS, Roundtree IA, et al. N(6)-methyladenosine modulates messenger RNA translation efficiency. Cell 2015; 161:1388–1399. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zhou J, Wan J, Gao X, Zhang X, Jaffrey SR, Qian SB. Dynamic m(6)A mRNA methylation directs translational control of heat shock response. Nature 2015; 526:591–594. [DOI] [PMC free article] [PubMed] [Google Scholar]
Meyer KD, Patil DP, Zhou J, et al. 5′ UTR m(6)A promotes cap-independent translation. Cell 2015; 163:999–1010. [DOI] [PMC free article] [PubMed] [Google Scholar]
Shi H, Wang X, Lu Z, et al. YTHDF3 facilitates translation and decay of N⁶-methyladenosine-modified RNA. Cell Res 2017; 27:315–328. [DOI] [PMC free article] [PubMed] [Google Scholar]
Li A, Chen YS, Ping XL, et al. Cytoplasmic m⁶A reader YTHDF3 promotes mRNA translation. Cell Res 2017; 27:444–447. [DOI] [PMC free article] [PubMed] [Google Scholar]
Linder B, Grozhik AV, Olarerin-George AO, Meydan C, Mason CE, Jaffrey SR. Single-nucleotide-resolution mapping of m⁶A and m⁶Am throughout the transcriptome. Nat Methods 2015; 12:767–772. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wang X, Lu Z, Gomez A, et al. N⁶-methyladenosine-dependent regulation of messenger RNA stability. Nature 2014; 505:117–120. [DOI] [PMC free article] [PubMed] [Google Scholar]
Spriggs KA, Stoneley M, Bushell M, Willis AE. Re-programming of translation following cell stress allows IRES-mediated translation to predominate. Biol Cell 2008; 100:27–38. [DOI] [PubMed] [Google Scholar]
Sonenberg N, Hinnebusch AG. Regulation of translation initiation in eukaryotes: mechanisms and biological targets. Cell 2009; 136:731–745. [DOI] [PMC free article] [PubMed] [Google Scholar]
Liberman N, Gandin V, Svitkin YV, et al. DAP5 associates with eIF2beta and eIF4AI to promote internal ribosome entry site driven translation. Nucleic Acids Res 2015; 43:3764–3775. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sun C, Querol-Audi J, Mortimer SA, et al. Two RNA-binding motifs in eIF3 direct HCV IRES-dependent translation. Nucleic Acids Res 2013; 41:7512–7521. [DOI] [PMC free article] [PubMed] [Google Scholar]
Gonzalez A, Jimenez A, Vazquez D, Davies JE, Schindler D. Studies on the mode of action of hygromycin B, an inhibitor of translocation in eukaryotes. Biochim Biophys Acta 1978; 521:459–469. [DOI] [PubMed] [Google Scholar]
Lee S, Liu B, Huang SX, Shen B, Qian SB. Global mapping of translation initiation sites in mammalian cells at single-nucleotide resolution. Proc Natl Acad Sci USA 2012; 109:E2424–2432. [DOI] [PMC free article] [PubMed] [Google Scholar]
Glazar P, Papavasileiou P, Rajewsky N. circBase: a database for circular RNAs. RNA 2014; 20:1666–1670. [DOI] [PMC free article] [PubMed] [Google Scholar]
Leprivier G, Rotblat B, Khan D, Jan E, Sorensen PH. Stress-mediated translational control in cancer cells. Biochim Biophys Acta 2015; 1849:845–860. [DOI] [PubMed] [Google Scholar]
Ruiz-Orera J, Messeguer X, Subirana JA, Alba MM. Long non-coding RNAs as a source of new peptides. Elife 2014; 3:e03523. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ingolia NT, Brar GA, Stern-Ginossar N, et al. Ribosome profiling reveals pervasive translation outside of annotated protein-coding genes. Cell Rep 2014; 8:1365–1379. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wan J, Qian SB. TISdb: a database for alternative translation initiation in mammalian cells. Nucleic Acids Res 2014; 42:D845–D850. [DOI] [PMC free article] [PubMed] [Google Scholar]
Washburn MP, Wolters D, Yates JR, III. Large-scale analysis of the yeast proteome by multidimensional protein identification technology. Nat Biotechnol 2001; 19:242–247. [DOI] [PubMed] [Google Scholar]
Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 2010; 26:841–842. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary information, Table S1

Click here for additional data file.^{(14.9KB, xlsx)}

Supplementary information, Table S2

Click here for additional data file.^{(82.3KB, xlsx)}

Supplementary information, Table S3

Click here for additional data file.^{(53.5KB, xlsx)}

Supplementary information, Table S4

Click here for additional data file.^{(13.7KB, xlsx)}

Supplementary information, Figure S1

circRNA is translated by m⁶A motif

Click here for additional data file.^{(9.4MB, pdf)}

Supplementary information, Figure S2

circRNA validation by northern blot and RNase R treatment.

Click here for additional data file.^{(559.7KB, pdf)}

Supplementary information, Figure S3

Translation of circRNA in different cell types.

Click here for additional data file.^{(214.9KB, pdf)}

Supplementary information, Figure S4

eIF4G2 and eIF3A knockdown affects global protein synthesis rate

Click here for additional data file.^{(1MB, pdf)}

Supplementary information, Figure S5

m⁶A reader protein affect translation of circRNAs

Click here for additional data file.^{(457.7KB, pdf)}

Supplementary information, Figure S6

m⁶A driven circRNA translation is more sensitive to treatment of hygromycin B.

Click here for additional data file.^{(141.9KB, pdf)}

Supplementary information, Figure S7

Entire expression profile across the polysome gradient.

Click here for additional data file.^{(1.7MB, pdf)}

Supplementary information, Data S1

Supplementary experimental procedures

Click here for additional data file.^{(140.9KB, pdf)}

[bib1] Cocquerelle C, Daubersies P, Majerus MA, Kerckaert JP, Bailleul B. Splicing with inverted order of exons occurs proximal to large introns. EMBO J 1992; 11:1095–1098. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib2] Nigro JM, Cho KR, Fearon ER, et al. Scrambled exons. Cell 1991; 64:607–613. [DOI] [PubMed] [Google Scholar]

[bib3] Jeck WR, Sorrentino JA, Wang K, et al. Circular RNAs are abundant, conserved, and associated with ALU repeats. RNA 2013; 19:141–157. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib4] Memczak S, Jens M, Elefsinioti A, et al. Circular RNAs are a large class of animal RNAs with regulatory potency. Nature 2013; 495:333–338. [DOI] [PubMed] [Google Scholar]

[bib5] Salzman J, Gawad C, Wang PL, Lacayo N, Brown PO. Circular RNAs are the predominant transcript isoform from hundreds of human genes in diverse cell types. PLoS One 2012; 7:e30733. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib6] Salzman J, Chen RE, Olsen MN, Wang PL, Brown PO. Cell-type specific features of circular RNA expression. PLoS Genet 2013; 9:e1003777. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib7] Hansen TB, Jensen TI, Clausen BH, et al. Natural RNA circles function as efficient microRNA sponges. Nature 2013; 495:384–388. [DOI] [PubMed] [Google Scholar]

[bib8] Zhang XO, Wang HB, Zhang Y, Lu X, Chen LL, Yang L. Complementary sequence-mediated exon circularization. Cell 2014; 159:134–147. [DOI] [PubMed] [Google Scholar]

[bib9] Liang D, Wilusz JE. Short intronic repeat sequences facilitate circular RNA production. Genes Dev 2014; 28:2233–2247. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib10] Starke S, Jost I, Rossbach O, et al. Exon circularization requires canonical splice signals. Cell Rep 2015; 10:103–111. [DOI] [PubMed] [Google Scholar]

[bib11] Wang Y, Wang Z. Efficient backsplicing produces translatable circular mRNAs. RNA 2015; 21:172–179. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib12] Ashwal-Fluss R, Meyer M, Pamudurti NR, et al. circRNA biogenesis competes with pre-mRNA splicing. Mol Cell 2014; 56:55–66. [DOI] [PubMed] [Google Scholar]

[bib13] Chen CY, Sarnow P. Initiation of protein synthesis by the eukaryotic translational apparatus on circular RNAs. Science 1995; 268:415–417. [DOI] [PubMed] [Google Scholar]

[bib14] Guo JU, Agarwal V, Guo H, Bartel DP. Expanded identification and characterization of mammalian circular RNAs. Genome Biol 2014; 15:409. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib15] You X, Vlatkovic I, Babic A, et al. Neural circular RNAs are derived from synaptic genes and regulated by development and plasticity. Nat Neurosci 2015; 18:603–610. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib16] Li S, Mason CE. The pivotal regulatory landscape of RNA modifications. Annu Rev Genomics Hum Genet 2014; 15:127–150. [DOI] [PubMed] [Google Scholar]

[bib17] Wei CM, Gershowitz A, Moss B. Methylated nucleotides block 5′ terminus of HeLa cell messenger RNA. Cell 1975; 4:379–386. [DOI] [PubMed] [Google Scholar]

[bib18] Csepany T, Lin A, Baldick CJ Jr, Beemon K. Sequence specificity of mRNA N⁶-adenosine methyltransferase. J Biol Chem 1990; 265:20117–20122. [PubMed] [Google Scholar]

[bib19] Harper JE, Miceli SM, Roberts RJ, Manley JL. Sequence specificity of the human mRNA N⁶-adenosine methylase in vitro. Nucleic Acids Res 1990; 18:5735–5741. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib20] Meyer KD, Saletore Y, Zumbo P, Elemento O, Mason CE, Jaffrey SR. Comprehensive analysis of mRNA methylation reveals enrichment in 3′ UTRs and near stop codons. Cell 2012; 149:1635–1646. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib21] Dominissini D, Moshitch-Moshkovitz S, Schwartz S, et al. Topology of the human and mouse m⁶A RNA methylomes revealed by m⁶A-seq. Nature 2012; 485:201–206. [DOI] [PubMed] [Google Scholar]

[bib22] Bokar JA, Shambaugh ME, Polayes D, Matera AG, Rottman FM. Purification and cDNA cloning of the AdoMet-binding subunit of the human mRNA (N⁶-adenosine)-methyltransferase. RNA 1997; 3:1233–1247. [PMC free article] [PubMed] [Google Scholar]

[bib23] Liu J, Yue Y, Han D, et al. A METTL3-METTL14 complex mediates mammalian nuclear RNA N⁶-adenosine methylation. Nat Chem Biol 2014; 10:93–95. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib24] Wang Y, Li Y, Toth JI, Petroski MD, Zhang Z, Zhao JC. N⁶-methyladenosine modification destabilizes developmental regulators in embryonic stem cells. Nat Cell Biol 2014; 16:191–198. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib25] Ping XL, Sun BF, Wang L, et al. Mammalian WTAP is a regulatory subunit of the RNA N⁶-methyladenosine methyltransferase. Cell Res 2014; 24:177–189. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib26] Jia G, Fu Y, Zhao X, et al. N⁶-methyladenosine in nuclear RNA is a major substrate of the obesity-associated FTO. Nat Chem Biol 2011; 7:885–887. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib27] Zheng G, Dahl JA, Niu Y, et al. ALKBH5 is a mammalian RNA demethylase that impacts RNA metabolism and mouse fertility. Mol Cell 2013; 49:18–29. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib28] Yue Y, Liu J, He C. RNA N⁶-methyladenosine methylation in post-transcriptional gene expression regulation. Genes Dev 2015; 29:1343–1355. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib29] Meyer KD, Jaffrey SR. The dynamic epitranscriptome: N⁶-methyladenosine and gene expression control. Nat Rev Mol Cell Biol 2014; 15:313–326. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib30] Wang X, Zhao BS, Roundtree IA, et al. N(6)-methyladenosine modulates messenger RNA translation efficiency. Cell 2015; 161:1388–1399. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib31] Zhou J, Wan J, Gao X, Zhang X, Jaffrey SR, Qian SB. Dynamic m(6)A mRNA methylation directs translational control of heat shock response. Nature 2015; 526:591–594. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib32] Meyer KD, Patil DP, Zhou J, et al. 5′ UTR m(6)A promotes cap-independent translation. Cell 2015; 163:999–1010. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib33] Shi H, Wang X, Lu Z, et al. YTHDF3 facilitates translation and decay of N⁶-methyladenosine-modified RNA. Cell Res 2017; 27:315–328. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib34] Li A, Chen YS, Ping XL, et al. Cytoplasmic m⁶A reader YTHDF3 promotes mRNA translation. Cell Res 2017; 27:444–447. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib35] Linder B, Grozhik AV, Olarerin-George AO, Meydan C, Mason CE, Jaffrey SR. Single-nucleotide-resolution mapping of m⁶A and m⁶Am throughout the transcriptome. Nat Methods 2015; 12:767–772. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib36] Wang X, Lu Z, Gomez A, et al. N⁶-methyladenosine-dependent regulation of messenger RNA stability. Nature 2014; 505:117–120. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib37] Spriggs KA, Stoneley M, Bushell M, Willis AE. Re-programming of translation following cell stress allows IRES-mediated translation to predominate. Biol Cell 2008; 100:27–38. [DOI] [PubMed] [Google Scholar]

[bib38] Sonenberg N, Hinnebusch AG. Regulation of translation initiation in eukaryotes: mechanisms and biological targets. Cell 2009; 136:731–745. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib39] Liberman N, Gandin V, Svitkin YV, et al. DAP5 associates with eIF2beta and eIF4AI to promote internal ribosome entry site driven translation. Nucleic Acids Res 2015; 43:3764–3775. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib40] Sun C, Querol-Audi J, Mortimer SA, et al. Two RNA-binding motifs in eIF3 direct HCV IRES-dependent translation. Nucleic Acids Res 2013; 41:7512–7521. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib41] Gonzalez A, Jimenez A, Vazquez D, Davies JE, Schindler D. Studies on the mode of action of hygromycin B, an inhibitor of translocation in eukaryotes. Biochim Biophys Acta 1978; 521:459–469. [DOI] [PubMed] [Google Scholar]

[bib42] Lee S, Liu B, Huang SX, Shen B, Qian SB. Global mapping of translation initiation sites in mammalian cells at single-nucleotide resolution. Proc Natl Acad Sci USA 2012; 109:E2424–2432. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib43] Glazar P, Papavasileiou P, Rajewsky N. circBase: a database for circular RNAs. RNA 2014; 20:1666–1670. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib44] Leprivier G, Rotblat B, Khan D, Jan E, Sorensen PH. Stress-mediated translational control in cancer cells. Biochim Biophys Acta 2015; 1849:845–860. [DOI] [PubMed] [Google Scholar]

[bib45] Ruiz-Orera J, Messeguer X, Subirana JA, Alba MM. Long non-coding RNAs as a source of new peptides. Elife 2014; 3:e03523. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib46] Ingolia NT, Brar GA, Stern-Ginossar N, et al. Ribosome profiling reveals pervasive translation outside of annotated protein-coding genes. Cell Rep 2014; 8:1365–1379. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib47] Wan J, Qian SB. TISdb: a database for alternative translation initiation in mammalian cells. Nucleic Acids Res 2014; 42:D845–D850. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib48] Washburn MP, Wolters D, Yates JR, III. Large-scale analysis of the yeast proteome by multidimensional protein identification technology. Nat Biotechnol 2001; 19:242–247. [DOI] [PubMed] [Google Scholar]

[bib49] Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 2010; 26:841–842. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Extensive translation of circular RNAs driven by N6-methyladenosine

Yun Yang

Xiaojuan Fan

Miaowei Mao

Xiaowei Song

Ping Wu

Yang Zhang

Yongfeng Jin

Yi Yang

Ling-Ling Chen

Yang Wang

Catherine CL Wong

Xinshu Xiao

Zefeng Wang

Abstract

Introduction

Results

circRNAs containing m6A motifs are translated inside cells

Figure 1.

Modulation of m6A level in circRNA affects translation efficiency

Figure 2.

Protein factors required for m6A-initiated circRNA translation

Figure 3.

Identification of endogenous circRNAs that contain m6A modification

Figure 4.