Abstract
The recent discovery of circular RNAs (circRNAs) and characterization of their functional roles have opened a new avenue for understanding the biology of genomes. circRNAs have been implicated to play important roles in a variety of biological processes, but their precise functions remain largely elusive. Currently, a few approaches are available for novel circRNA prediction, but almost all these methods are intended for animal genomes. Considering that the major differences between the organization of plant and mammal genomes cannot be neglected, a plant-specific method is needed to enhance the validity of plant circRNA identification. In this study, we present CircPlant, an integrated tool for the exploration of plant circRNAs, potentially acting as competing endogenous RNAs (ceRNAs), and their potential functions. With the incorporation of several unique plant-specific criteria, CircPlant can accurately detect plant circRNAs from high-throughput RNA-seq data. Based on comparison tests on simulated and real RNA-seq datasets from Arabidopsis thaliana and Oryza sativa, we show that CircPlant outperforms all evaluated competing tools in both accuracy and efficiency. CircPlant is freely available at http://bis.zju.edu.cn/circplant.
Keywords: Circular RNA, Plant, MicroRNA, RNA-seq, Identification
Introduction
Circular RNAs (circRNAs) constitute a large class of non-coding RNAs that were previously dismissed as molecular flukes or by-products of canonical RNA splicing. High-throughput sequencing technologies and bioinformatics approaches have revealed that circRNAs are endogenous, abundant, conserved across species, and widespread in eukaryotic transcriptomes [1], [2]. Emerging evidence suggests that circRNAs play important roles in diverse biological processes, including acting as competing endogenous RNAs (ceRNAs) or microRNA (miRNA) sponges, regulating alternative RNA splicing or transcription, and potentially enhancing host gene transcription [2]. Furthermore, circRNAs are involved in aging [3] and neural development [4], and their aberrant expression is associated with several diseases, including atherosclerotic vascular disease [5], colorectal and ovarian cancers [6], and esophageal squamous cell carcinoma [7]. As such, circRNAs could become promising biomarkers for disease diagnosis and prognosis. Although research of circRNAs has mainly focused on humans and animals, our knowledge and understanding on plant circRNAs has improved substantially as well.
Similar to animal circRNAs, plant circRNAs are conserved across different species, which have long flanking introns, are expressed at low levels with development/tissue-specific expression profiles, and may act as miRNA sponges or regulate the transcription of their parental genes [8], [9], [10], [11], [12], [13]. However, plant circRNAs have their own unique features. Studies of rice, soybeans, and cotton (Gossypium) have shown that plant exonic circRNAs are less likely to be generated from exons flanked by introns containing repetitive or reverse complementary sequences [11], [12], [13], [14]. Meanwhile, circRNAs have been observed to be paralog- and species-specific in soybean [11] and cotton [14]. For expression levels of exonic circRNAs and their parental genes, few have shown significant positive correlation, and no negatively-correlated pair has been found in rice [13]. A recent study in Arabidopsis has revealed that circRNA could bind to the DNA of its host gene through an R-loop to control linear alternative splicing, consequently regulating the transcription of its host gene [15]. As for biotic or abiotic stress conditions in plants, circRNAs are differentially expressed at specific time points/stages, which may act as important functional regulators involved in stress-specific biological processes in plants [16], [17], [18].
Given the implications of plant circRNAs in many biological processes, it is important to better understand their molecular mechanisms, characteristics, and functions. It is important to make improvements to our identification abilities. A key step is to improve our capabilities in identifying plant circRNAs. Many circRNA prediction tools have been developed (Table 1), which have been used to detect tens of thousands of circRNAs in human, mouse, fly, and other model organisms. Almost all the pipelines or programs are designed for mammals. However, the accuracy of predicting plant circRNAs is relatively low with available methods, potentially attributing to a high false positive rate or overlapping between identified plant circRNAs [9], [31], [32]. For example, in Ye et al.’s study [11], 10 of the 18 predicted exonic circRNAs were confirmed by sequencing PCR products amplified by divergent primers, but none of the 30 predicted non-exonic circRNAs was confirmed. Besides common canonical splicing signals in animals and humans, the majority of circRNAs (92.7%) in rice were flanked by non-GT/AG canonical splicing signals [33]. Considering the differences between plant and mammal genomes, a computational tool is urgently needed for improved identification of plant circRNAs.
Table 1.
Tool | De novo | Mapper | Dependency | Check splice site | Language | Single-end | Latest release date | Refs. |
---|---|---|---|---|---|---|---|---|
circRNA_finder | Yes | STAR | Awk, Samtools | GU-AG GC-AG AU-AC | Perl | No | 2015 | [3] |
find_circ | Yes | Bowtie2 | Pysam, Samtools | GU-AG | Python | No | 2016 | [19] |
CIRCexplorer | Yes | Bowtie, TopHat | Cufflinks, Bedtools, pysam, Pandas, docopt | GU-AG GC-AG AU-AC | Python | Yes | 2018 | [20], [21] |
Segemehl | Yes | Segemehl | Samtools | no | C | Yes | 2015 | [22] |
CIRI | Yes | BWA-MEM | / | GU-AG | Perl | Yes | 2017 | [23], [24] |
KNIFE | No | Bowtie, Bowtie2 | Samtools, numpy, scipy, data.Table | no | Perl, Python, R | Yes | 2017 | [25] |
MapSplice | No | Bowtie | Samtools | no | Python, C++ | Yes | 2016 | [26] |
UROBORUS | No | Bowtie, Bowtie2 | TopHat, Samtools | no | Perl | Yes | 2017 | [27] |
ACFS | Yes | BWA-MEM | / | full | Perl | Yes | 2016 | [28] |
DCC/CircTest | No | STAR | Pysam, pandas, numpy, HTSeq | GU-AG | Python | Yes | 2018 | [29] |
PcircRNA_finder | Yes | Bowtie, Bowtie2, STAR, Segemehl, TopHat | Pysam, docopt, Samtools, Bedtools | GU-AG GC-AG AU-AC | Perl, Python | No | 2016 | [30] |
CircPlant | Yes | BWA-MEM | FASTA36, TAPIR, Bioperl | GU-AG GC-AG AU-AC | Perl | Yes | 2018 | – |
Note: FASTA36 and TAPIR are required for the interaction prediction and network construction modules, while Bioperl is needed for the GO annotation module. The circRNA detection module in CircPlant has no dependencies.
According to previous results, circRNAs interact with miRNAs to regulate transcription and cellular pathways, act as miRNA sponges to naturally sequester and competitively inhibit the activity of miRNAs, or compete with miRNA-binding mRNAs [2]. circRNAs and those mRNAs with common miRNA target sites could form a complex interactive regulatory network, known as a ceRNA network [34]. Here, we present CircPlant, an integrated tool for plant circRNA detection and functional prediction, with circRNA-miRNA interactions, ceRNA pairs (or circRNA-miRNA-mRNA networks), and annotation of Gene Ontology (GO). Based on comprehensive comparisons applied to both simulated and real datasets, we have shown that CircPlant has the highest accuracy and precision among competing tools. More importantly, CircPlant enables huge reductions in runtime for detecting circRNAs from high-throughput sequencing data.
Methods
As illustrated in Figure 1, the CircPlant tool consists of four modules (see details in File S1).
Module 1: circRNA detection
Firstly, CircPlant uses a BWA-MEM algorithm [35] to map sequencing reads, and a plant-specifically modified CIRI2 [23], [24] algorithm is integrated to collect back-splice sites from rRNA-depleted total RNA or poly(A)-depleted RNA sequencing data. CIRI2 is a widely used computational detection method for circRNA identification, using an adapted maximum likelihood estimation based on multiple seed matching with remarkably balanced sensitivity, reliability, and duration relative to other methods [24], [31]. Several plant-specific factors are integrated into the modified CIRI2 (File S1), which account for the relatively small lengths of plant circRNAs, the high copy number of plant genes, and the non-canonical types of splicing signals [25], [33], [36], [37]. If gene annotation is provided, CIRI2 will combine the annotation file and plant-specific factors to screen candidate circRNA sites. Subsequently, these candidate sites are further screened through the second alignment. A pseudo reference genome (pseudoRef) is created with chiastic back-splice site flanking sequences. Raw reads are then mapped to it to check the back-splice sites [30]. Finally, full-length sequences of circRNAs are extracted, including isoforms possibly originated from the same back-splice sites [38]. For exonic circRNAs, which are generated from exons of a single protein-coding gene, CircPlant only extracts exon regions based on genome annotation. However, a few circRNAs were proven to contain sequences that cannot be found in mRNAs [10], [23], which may arise from the alternative splicing events within circRNAs [39]. These alternative splicing events in plants are not considered in the current algorithms, which should be addressed in future investigations.
Module 2: circRNA–miRNA interaction prediction
To predict the interaction between circRNAs and miRNAs, all circRNA sequences are regarded as candidate miRNA targets. The target sequences that could be perfectly bound by mature miRNAs are identified by TargetFinder [40] with a score parameter less than three, and the potential target mimics are predicted by Tapir [41] with three filtering rules [42], [43] (File S1).
Module 3: circRNA–miRNA–mRNA network construction
After obtaining the interactions between known miRNAs and mRNAs by TargetFinder, a hypergeometric test is applied to assess the reliability of the ceRNA pair prediction (File S1). The circRNA–miRNA–mRNA networks are then generated from ceRNA pairs, which can be exported as a file for visualization and modification in Cytoscape [44].
Module 4: circRNA annotation
CircPlant applies the GO enrichment method to predict the potential biological functions of circRNAs based on their correlated ceRNA pairs. Given that members in a ceRNA pair are expected to have similar functions, the potential functions of a circRNA are assigned as enriched GO terms of all associating mRNAs in ceRNA pairs.
Results
The characteristics and methodological aspects of CircPlant
CircPlant has several key differences in comparison to other circRNA detection tools. Firstly, CircPlant is a lightweight tool with fewer dependencies (Table 1). Except for the read mapping tool BWA-MEM, no other software or package is required to use CircPlant, which allows for a more straightforward installation. Secondly, CircPlant is memory- and time-efficient. We tested five prediction tools on a real paired-end RNA-seq dataset (ERR748773; paired-end, 100 bp, 123,120,011 × 2 reads), and CircPlant had a higher read mapping rate (Table S1) with relatively short computation time (Figure 2 and Table S2). This high efficiency will provide a considerable advantage for working with large-scale datasets. Thirdly, CircPlant applies several procedures to increase the sensitivity and precision of circRNA identification. Raw reads are mapped to a pseudoRef that is created with chiastic back-splice site flanking sequences. For paired-end datasets, candidate circRNAs should have back-splice reads in inferred regions of both paired reads, and that paired-end information would be consistent with corresponding templates of putative circRNAs. Maximum likelihood estimation based on multiple seed matching was adapted in CIRI2, resulting in the paired-end mapping information being consistent with stricter criteria in length, alignment score, and mismatch numbers to filter false positives derived from repetitive sequences and mapping errors [45]. It is recommended to use genome annotation and splicing signal at the same time, as they could be complementary to each other. Although exon–intron boundary usually has a canonical splicing signal, it is better to harness splicing signals in complete genome annotations for non-model species, usually with GT-AG [46], [47].
To better detect plant circRNAs, CircPlant has considered the differences between plant and mammal genomes, and applies several plant-specific criteria for circRNA identification. (1) According to the length distribution of circRNAs belonging to 12 plant species in PlantCircBase [48], the default length threshold of circRNA is set to 20 kb, and the maximum threshold is set to 100 kb (Figure S1). (2) In the read mapping process, CircPlant applies stricter criteria, including alignment score and mismatch numbers, to exclude some false predictions due to high copy number of genes, many rounds of segment duplications, and whole-genome duplications in plant genomes. Genome annotations are extremely recommended to use for plant circRNA identification. (3) 5-bp flanking regions of the two canonical back-splice sites (acceptor and donor) are allowed for candidate back-splice sites in CircPlant [30]. circRNAs with minor changes in alternative donor/acceptor sites are another form of alternative circulation [25], while much of alternative splicing of circRNAs occurs nearby canonical splicing sites. (4) Besides the splicing by U2 based spliceosome (usually with consensus sequence GT-AG and GC-AG), CircPlant also considers a U12 based minor spliceosome (usually with consensus sequence AT-AC) for each candidate circRNA’s splicing signal. These two types account for over 99.5% of splicing sites [47]. These plant-specific features are integrated into CIRI2 for back-splice site collection.
Benchmark based on simulated and real datasets
In order to fully gauge the performance of CircPlant, we compared it to four popular circRNA detection tools (MapSplice, ACFS2, PcircRNA_finder, and CIRI2) using both simulated and real datasets [30]. MapSplice is a highly accurate algorithm for detecting novel canonical and non-canonical splices, and is not dependent on splice site features or intron length [26]. ACFS allows de novo, accurate, and fast identification and abundance quantification of circRNAs from RNA-seq data, which is used more in human circRNA identification [28]. ACFS version 2 (ACFS2) was used in this study. PcircRNA_finder is the first precise prediction method for plant exonic circRNAs [30], and CIRI2 is a widely used tool for circRNA identification based on multiple seed matching [24].
Two simulated RNA-seq datasets (paired-end reads, 100 bp, and 6000 back-splice reads for each sample; Figure S2) were generated by randomly choosing 200 chiastic transcripts based upon the Arabidopsis thaliana (TAIR10) and Oryza sativa (IRGSP-1.0.38) genome annotations, respectively. Sensitivity, precision, and F1 score (the harmonic mean of precision and sensitivity) were used to evaluate the performance of these methods [49]. The results indicated that CircPlant had the highest overall sensitivity among all five tools, and its precision reached 0.99 for both test genomes (Table 2). Considering that the ACFS2 algorithm contains over 95% of all canonical splicing sites in human transcriptome, it is not surprising that ACFS2 performed poorly with simulated datasets. Finally, CircPlant also obtained the highest F1 score in both test plant species compared to MapSplice, ACFS2, PcircRNA_finder, and CIRI2 (Table 2).
Table 2.
Species | Software | Sensitivity | Precision | F1 score |
---|---|---|---|---|
A. thaliana | MapSplice | 0.30 | 0.91 | 0.45 |
ACFS2 | 0.01 | 0.33 | 0.01 | |
PcircRNA_finder | 0.85 | 0.90 | 0.87 | |
CircPlant | 0.92 | 0.99 | 0.95 | |
CIRI2 | 0.88 | 0.95 | 0.91 | |
O. sativa | MapSplice | 0.26 | 0.76 | 0.39 |
ACFS2 | 0.00 | / | 0.00 | |
PcircRNA_finder | 0.69 | 0.92 | 0.79 | |
CircPlant | 0.96 | 0.99 | 0.98 | |
CIRI2 | 0.96 | 0.99 | 0.97 |
Note: Sensitivity = TP/(TP + FN); Precision = TP/(TP + FP); F1 score = 2 × TP/(2 × TP + FP + FN), in which TP (true positive), FP (false positive), and FN (false negative) represent the number of correctly identified circRNAs, the number of incorrectly identified circRNAs, and the number of missing circRNAs, respectively. Sensitivity, precision, and F1 score all range from 0 to 1; higher value means higher accuracy and balance.
To test the circRNA detection capability of current tools on real datasets, we have obtained poly(A)-selected (ERR748773; paired-end, 100 bp, 123,120,011 × 2 reads) and poly(A)-depleted (ERR748783; paired-end, 100 bp, 207,396,249 × 2 reads) samples from mature leaves of O. sativa. These data were used to predict circRNAs by all five circRNA detection methods. In the poly(A)-selected sample, 287 circRNAs were predicted by CircPlant, while 1700 circRNAs were predicted by MapSplice, 2804 by ACFS2, 297 by PcircRNA_finder, and 331 by CIRI2 (Figure S3A). In the poly(A)-depleted sample, 173 circRNAs were predicted by CircPlant, while 79 circRNAs were predicted by MapSplice, 1103 by ACFS2, 190 by PcircRNA_finder, and 185 by CIRI2 (Figure S3B). Considering the secondary structure of circRNAs, RNA-seq of poly(A)-selected samples preferentially detects linear but not circular RNAs [50]. CircPlant predicted fewer circRNAs in the poly(A)-selected sample than the other four tools. However, in the poly(A)-depleted sample, CircPlant predicted more circRNAs than MapSplice and slightly less circRNAs than CIRI2 and PcircRNA_finder. Meanwhile, the circRNAs predicted by CircPlant exhibited relatively high levels of overlap with the circRNAs predicted by other tools (32.4% for poly(A)-selected sample and 58.4% for poly(A)-depleted sample). It suggested that CircPlant performs well in identifying the circRNAs.
Although PcircRNA_finder and CircPlant are both specifically developed for plant circRNAs, the overlapping circRNAs of these two tools in the poly(A)-selected and poly(A)-depleted samples are few, while the majority of circRNAs detected by CircPlant are overlapped with other tools (Figure S3). The limitations of PcircRNA_finder could be an important reason, as it is only able to detect exonic circRNAs. Interestingly, PcircRNA_finder uses the integration of five software results, including Bowtie used by MapSplice, but the overlap of these two tools is relatively low.
Conclusion
Detection of circRNAs is the basis for in-depth study of circRNA’s biogenesis, functions, and corresponding molecular mechanisms. Considering the differences between plant and mammal genomes, most prevailing tools are unsuitable for plant circRNA detection. Computational requirements are also a concern when the identification process is scaled up to a large set of samples. CircPlant is a reliable, time-saving, and easy-to-use tool for plant circRNA detection and functional prediction. The CircPlant package is developed in Perl language, which has been tested on Ubuntu Linux 14.04. The software package and a user manual can be obtained from http://bis.zju.edu.cn/circplant.
Code availability
CircPlant is freely available at http://bis.zju.edu.cn/circplant.
CRediT author statement
Peijing Zhang: Conceptualization, Software, Formal analysis, Writing - original draft, Writing - review & editing. Yongjing Liu: Writing - original draft, Writing - review & editing. Hongjun Chen: Software. Xianwen Meng: Formal analysis. Jitong Xue: Software. Kunsong Chen: Writing - review & editing. Ming Chen: Conceptualization, Writing - original draft, Writing - review & editing, Supervision. All authors read and approved the final manuscript.
Competing interests
The authors have declared no competing interests.
Acknowledgments
We would like to thank Michael Wade (Vanderbilt University, USA) and Youhuang Bai (Fujian Agriculture and Forestry University, China) for their assistance with linguistic issues. This work has been supported by the National Key Research and Development Program of China (Grant Nos. 2016YFA0501704 and 2018YFC0310602), the National Natural Science Foundation of China (Grant Nos. 31771477 and 31571366), the Fundamental Research Funds for the Central Universities, Jiangsu Collaborative Innovation Center for Modern Crop Production, and the 2018 Zhejiang University Academic Award for Outstanding Doctoral Candidates.
Handled by Fangqing Zhao
Footnotes
Peer review under responsibility of Beijing Institute of Genomics, Chinese Academy of Sciences and Genetics Society of China.
Supplementary data to this article can be found online at https://doi.org/10.1016/j.gpb.2020.10.001.
Supplementary material
The following are the Supplementary data to this article:
References
- 1.Li X., Yang L., Chen L.L. The bogenesis, functions, and challenges of circular RNAs. Mol Cell. 2018;71:428–442. doi: 10.1016/j.molcel.2018.06.034. [DOI] [PubMed] [Google Scholar]
- 2.Kristensen L.S., Andersen M.S., Stagsted L.V.W., Ebbesen K.K., Hansen T.B., Kjems J. The biogenesis, biology and characterization of circular RNAs. Nat Rev Genet. 2019;20:675–691. doi: 10.1038/s41576-019-0158-7. [DOI] [PubMed] [Google Scholar]
- 3.Westholm J.O., Miura P., Olson S., Shenker S., Joseph B., Sanfilippo P. Genome-wide analysis of Drosophila circular RNAs reveals their structural and sequence properties and age-dependent neural accumulation. Cell Rep. 2014;9:1966–1980. doi: 10.1016/j.celrep.2014.10.062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.You X.T., Vlatkovic I., Babic A., Will T., Epstein I., Tushev G. Neural circular RNAs are derived from synaptic genes and regulated by development and plasticity. Nat Neurosci. 2015;18:603–610. doi: 10.1038/nn.3975. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Burd C.E., Jeck W.R., Liu Y., Sanoff H.K., Wang Z., Sharpless N.E. Expression of linear and novel circular forms of an INK4/ARF-associated non-coding RNA correlates with atherosclerosis risk. PLoS Genet. 2010;6:e1001233. doi: 10.1371/journal.pgen.1001233. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Bachmayr-Heyda A., Reiner A.T., Auer K., Sukhbaatar N., Aust S., Bachleitner-Hofmann T. Correlation of circular RNA abundance with proliferation - exemplified with colorectal and ovarian cancer, idiopathic lung fibrosis, and normal human tissues. Sci Rep. 2015;5:8057. doi: 10.1038/srep08057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Chen H., Zheng B., Huang L., Zheng W., Chen C. Microarray analysis reveals altered expression of multiple circular RNAs in the pathogenesis of esophageal squamous cell carcinoma. Transl Cancer Res. 2017;6:1248–1257. [Google Scholar]
- 8.Ashwal-Fluss R., Meyer M., Pamudurti N.R., Ivanov A., Bartok O., Hanan M. circRNA biogenesis competes with pre-mRNA splicing. Mol Cell. 2014;56:55–66. doi: 10.1016/j.molcel.2014.08.019. [DOI] [PubMed] [Google Scholar]
- 9.Chen L., Zhang P., Fan Y., Lu Q., Li Q., Yan J. Circular RNAs mediated by transposons are associated with transcriptomic and phenotypic variation in maize. New Phytol. 2018;217:1292–1306. doi: 10.1111/nph.14901. [DOI] [PubMed] [Google Scholar]
- 10.Li Z., Huang C., Bao C., Chen L., Lin M., Wang X. Exon-intron circular RNAs regulate transcription in the nucleus. Nat Struct Mol Biol. 2015;22:256–264. doi: 10.1038/nsmb.2959. [DOI] [PubMed] [Google Scholar]
- 11.Zhao W., Cheng Y., Zhang C., You Q., Shen X., Guo W. Genome-wide identification and characterization of circular RNAs by high throughput sequencing in soybean. Sci Rep. 2017;7:5636. doi: 10.1038/s41598-017-05922-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Lu T., Cui L., Zhou Y., Zhu C., Fan D., Gong H. Transcriptome-wide investigation of circular RNAs in rice. RNA. 2015;21:2076–2087. doi: 10.1261/rna.052282.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ye C.Y., Chen L., Liu C., Zhu Q.H., Fan L.J. Widespread noncoding circular RNAs in plants. New Phytol. 2015;208:88–95. doi: 10.1111/nph.13585. [DOI] [PubMed] [Google Scholar]
- 14.Zhao T., Wang L., Li S., Xu M., Guan X., Zhou B. Characterization of conserved circular RNA in polyploid Gossypium species and their ancestors. FEBS Lett. 2017;591:3660–3669. doi: 10.1002/1873-3468.12868. [DOI] [PubMed] [Google Scholar]
- 15.Conn V.M., Hugouvieux V., Nayak A., Conos S.A., Capovilla G., Cildir G. A circRNA from SEPALLATA3 regulates splicing of its cognate mRNA through R-loop formation. Nat Plants. 2017;3:17053. doi: 10.1038/nplants.2017.53. [DOI] [PubMed] [Google Scholar]
- 16.Pan T., Sun X., Liu Y., Li H., Deng G., Lin H. Heat stress alters genome-wide profiles of circular RNAs in Arabidopsis. Plant Mol Biol. 2018;96:217–229. doi: 10.1007/s11103-017-0684-7. [DOI] [PubMed] [Google Scholar]
- 17.Wang Y., Yang M., Wei S., Qin F., Zhao H., Suo B. Identification of circular RNAs and their targets in leaves of Triticum aestivum L. under dehydration stress. Front Plant Sci. 2016;7:2024. doi: 10.3389/fpls.2016.02024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Zhou R., Zhu Y., Zhao J., Fang Z., Wang S., Yin J. Transcriptome-wide identification and characterization of potato circular RNAs in response to Pectobacterium carotovorum subspecies brasiliense infection. Int J Mol Sci. 2017;19:71. doi: 10.3390/ijms19010071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Memczak S., Jens M., Elefsinioti A., Torti F., Krueger J., Rybak A. Circular RNAs are a large class of animal RNAs with regulatory potency. Nature. 2013;495:333–338. doi: 10.1038/nature11928. [DOI] [PubMed] [Google Scholar]
- 20.Zhang X.O., Wang H.B., Zhang Y., Lu X., Chen L.L., Yang L. Complementary sequence-mediated exon circularization. Cell. 2014;159:134–147. doi: 10.1016/j.cell.2014.09.001. [DOI] [PubMed] [Google Scholar]
- 21.Zhang X.O., Dong R., Zhang Y., Zhang J.L., Luo Z., Zhang J. Diverse alternative back-splicing and alternative splicing landscape of circular RNAs. Genome Res. 2016;26:1277–1287. doi: 10.1101/gr.202895.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Hoffmann S., Otto C., Doose G., Tanzer A., Langenberger D., Christ S. A multi-split mapping algorithm for circular RNA, splicing, trans-splicing and fusion detection. Genome Biol. 2014;15:R34. doi: 10.1186/gb-2014-15-2-r34. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Gao Y., Wang J., Zhao F. CIRI: an efficient and unbiased algorithm for de novo circular RNA identification. Genome Biol. 2015;16:4. doi: 10.1186/s13059-014-0571-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Gao Y., Zhang J., Zhao F. Circular RNA identification based on multiple seed matching. Brief Bioinform. 2018;19:803–810. doi: 10.1093/bib/bbx014. [DOI] [PubMed] [Google Scholar]
- 25.Szabo L., Morey R., Palpant N.J., Wang P.L., Afari N., Jiang C. Statistically based splicing detection reveals neural enrichment and tissue-specific induction of circular RNA during human fetal development. Genome Biol. 2015;16:126. doi: 10.1186/s13059-015-0690-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Wang K., Singh D., Zeng Z., Coleman S.J., Huang Y., Savich G.L. MapSplice: accurate mapping of RNA-seq reads for splice junction discovery. Nucleic Acids Res. 2010;38:e178. doi: 10.1093/nar/gkq622. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Song X., Zhang N., Han P., Moon B.S., Lai R.K., Wang K. Circular RNA profile in gliomas revealed by identification tool UROBORUS. Nucleic Acids Res. 2016;44:e87. doi: 10.1093/nar/gkw075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.You X., Conrad T.O. Acfs: accurate circRNA identification and quantification from RNA-Seq data. Sci Rep. 2016;6:38820. doi: 10.1038/srep38820. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Cheng J., Metge F., Dieterich C. Specific identification and quantification of circular RNAs from sequencing data. Bioinformatics. 2016;32:1094–1096. doi: 10.1093/bioinformatics/btv656. [DOI] [PubMed] [Google Scholar]
- 30.Chen L., Yu Y.Y., Zhang X.C., Liu C., Ye C.Y., Fan L.J. PcircRNA_finder: a software for circRNA prediction in plants. Bioinformatics. 2016;32:3528–3529. doi: 10.1093/bioinformatics/btw496. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Zeng X., Lin W., Guo M., Zou Q. A comprehensive overview and evaluation of circular RNA detection tools. PLoS Comput Biol. 2017;13:e1005420. doi: 10.1371/journal.pcbi.1005420. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Tan J., Zhou Z., Niu Y., Sun X., Deng Z. Identification and functional characterization of tomato circRNAs derived from genes involved in fruit pigment accumulation. Sci Rep. 2017;7:8594. doi: 10.1038/s41598-017-08806-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Ye C.Y., Zhang X.C., Chu Q.J., Liu C., Yu Y.Y., Jiang W.Q. Full-length sequence assembly reveals circular RNAs with diverse non-GT/AG splicing signals in rice. RNA Biol. 2017;14:1055–1063. doi: 10.1080/15476286.2016.1245268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Meng X., Zhang P., Chen Q., Wang J., Chen M. Identification and characterization of ncRNA-associated ceRNA networks in Arabidopsis leaf development. BMC Genomics. 2018;19:607. doi: 10.1186/s12864-018-4993-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv 2013;1303.3997.
- 36.Staiger D., Brown J.W.S. Alternative splicing at the intersection of biological timing, development, and stress responses. Plant Cell. 2013;25:3640–3656. doi: 10.1105/tpc.113.113803. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Starke S., Jost I., Rossbach O., Schneider T., Schreiner S., Hung L.H. Exon circularization requires canonical splice signals. Cell Rep. 2015;10:103–111. doi: 10.1016/j.celrep.2014.12.002. [DOI] [PubMed] [Google Scholar]
- 38.Salzman J., Chen R.E., Olsen M.N., Wang P.L., Brown P.O. Cell-type specific features of circular RNA expression. PLoS Genet. 2013;9:e1003777. doi: 10.1371/journal.pgen.1003777. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Gao Y., Wang J., Zheng Y., Zhang J., Chen S., Zhao F. Comprehensive identification of internal structure and alternative splicing events in circular RNAs. Nat Commun. 2016;7:12060. doi: 10.1038/ncomms12060. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Fahlgren N., Howell M.D., Kasschau K.D., Chapman E.J., Sullivan C.M., Cumbie J.S. High-throughput sequencing of Arabidopsis microRNAs: evidence for frequent birth and death of MIRNA genes. PLoS One. 2007;2:e219. doi: 10.1371/journal.pone.0000219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Bonnet E., He Y., Billiau K., Van de Peer Y. TAPIR, a web server for the prediction of plant microRNA targets, including target mimics. Bioinformatics. 2010;26:1566–1568. doi: 10.1093/bioinformatics/btq233. [DOI] [PubMed] [Google Scholar]
- 42.Wu H.J., Wang Z.M., Wang M., Wang X.J. Widespread long noncoding RNAs as endogenous target mimics for microRNAs in plants. Plant Physiol. 2013;161:1875–1884. doi: 10.1104/pp.113.215962. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Chen D., Yuan C., Zhang J., Zhang Z., Bai L., Meng Y. PlantNATsDB: a comprehensive database of plant natural antisense transcripts. Nucleic Acids Res. 2012;40:D1187–D1193. doi: 10.1093/nar/gkr823. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Shannon P., Markiel A., Ozier O., Baliga N.S., Wang J.T., Ramage D. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Gao Y., Zhao F. Computational strategies for exploring circular RNAs. Trends Genet. 2018;34:389–400. doi: 10.1016/j.tig.2017.12.016. [DOI] [PubMed] [Google Scholar]
- 46.Reddy A.S.N., Marquez Y., Kalyna M., Barta A. Complexity of the alternative splicing landscape in plants. Plant Cell. 2013;25:3657–3683. doi: 10.1105/tpc.113.117523. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Marquez Y., Brown J.W.S., Simpson C., Barta A., Kalyna M. Transcriptome survey reveals increased complexity of the alternative splicing landscape in Arabidopsis. Genome Res. 2012;22:1184–1195. doi: 10.1101/gr.134106.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Chu Q., Zhang X., Zhu X., Liu C., Mao L., Ye C. PlantcircBase: a database for plant circular RNAs. Mol Plant. 2017;10:1126–1128. doi: 10.1016/j.molp.2017.03.003. [DOI] [PubMed] [Google Scholar]
- 49.Chuang T.J., Wu C.S., Chen C.Y., Hung L.Y., Chiang T.W., Yang M.Y. NCLscan: accurate identification of non-co-linear transcripts (fusion, trans-splicing and circular RNA) with a good balance between sensitivity and precision. Nucleic Acids Res. 2016;44:e29. doi: 10.1093/nar/gkv1013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Lasda E., Parker R. Circular RNAs: diversity of form and function. RNA. 2014;20:1829–1842. doi: 10.1261/rna.047126.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.