Abstract
Non-coding ribonucleic acid (RNA) has without a doubt captured the interest of biomedical researchers. The ability to screen the entire human genome with high-throughput sequencing technology has greatly enhanced the identification, annotation and prediction of the functionality of non-coding RNAs. In this review, we discuss the current landscape of non-coding RNA research and quantitative analysis. Non-coding RNA will be categorized into two major groups by size: long non-coding RNAs and small RNAs. In long non-coding RNA, we discuss regular long non-coding RNA, pseudogenes and circular RNA. In small RNA, we discuss miRNA, transfer RNA, piwi-interacting RNA, small nucleolar RNA, small nuclear RNA, Y RNA, single recognition particle RNA, and 7SK RNA. We elaborate on the origin, detection method, and potential association with disease, putative functional mechanisms, and public resources for these non-coding RNAs. We aim to provide readers with a complete overview of non-coding RNAs and incite additional interest in non-coding RNA research.
Keywords: long non-coding RNA, small RNA, detection, functional mechanism, disease, resource
1. Introduction
It is commonly believed that protein-coding messenger ribonucleic acids (mRNAs) are the primary controller of cells, carrying out the necessary functions for life. Our understanding of how a gene functions has been significantly challenged due to the advances in high-throughput sequencing (HTS) technology, which has provided researchers with an expanded view of the complexity of the human genome, allowing for the identification of diverse types of RNAs. Mounting evidence from various studies has shown that the non-coding portion of the genome plays a more significant role in human biology than previously thought. Depending on the version of the annotation, the protein-coding genome regions represent around 1–3% of the human genome. However, the rest of the genome plays an important role in controlling the expression of the coding deoxyribonucleic acid (DNA), and in the spatial organization of the genome. One of the most critical corroborations of this is the distribution of results of genome-wide association studies (GWAS). Based on the latest GWAS Catalog (June 2017), only 3.56% of disease-associated single-nucleotide polymorphisms (SNPs) reside in the protein-coding region and 96.44% lie in non-coding regions equally proportioned between the intergenic regions and intron regions [1,2].
The Encyclopedia of DNA Elements (ENCODE) [3] is a large consortium project aimed at mapping all functional elements in the human genome. The initial results of the ENCODE project suggest that as much as 80% of the genome is biologically active and functional. However, this conclusion has drawn many sharp criticisms. The arguments against ENCODE’s conclusion include the C-value paradox [4], which argues that transcription activity does not necessarily indicate biological functionality [5], and the current estimated fraction of the genome that is conserved through purifying selection is less than 10% [6]. A later study was able to narrow down the fraction of the conserved region to 8.2% [7]. The argument between ENCODE and the other researchers who reject ENCODE’s conclusion primarily lies in the definition of functionality. For example, the encode researchers consider all DNAs that are transcribed to RNA including non-coding RNA as functional. However, some researchers only consider that evolutionarily conserved regions are functional.
Non-coding RNA (ncRNA) is an RNA molecule that is transcribed from DNA but not translated to a protein. The last decade has witnessed a sharp rise of interest in non-coding RNA research (Figure 1). There are multiple classes and sub-classes of ncRNA. RNA size is a commonly used division of ncRNA: with small ncRNA (sRNA, less than 200 nucleotides), and long ncRNA (lncRNA, more than 200 nucleotides). Our review reflects on how the recent widespread interest in non-coding RNA is a result of the combination of the maturity of HTS technology and the bioinformatics development supporting that technology.
2. lncRNA
LncRNAs are a class of long transcribed but not translated RNAs that are longer than 200 nucleotides [8,9]. lncRNAs are often transcribed by RNA polymerase II, and can be classified as antisense, intronic, intergenic, divergent and enhancer lncRNAs according to their relative genome position (Figure 1) [10]. They have many characteristics similar to mRNAs such as 5’ capping, exon–intron splicing, and poly-adenylation, and the primary distinction from mRNAs is the lack of open reading frames (ORFs) [11]. LncRNA was discovered through the bioinformatics analysis of transcriptome data. Unlike protein coding mRNAs, lncRNA was traditionally believed to be non-functional. However, many recent studies have shown evidence for the functionality of lncRNA [9,12], such as roles in high-order chromosomal dynamics [13], embryonic stem cell differentiation [14], telomere biology [15] and subcellular structural organization [16]. The best characterized lncRNAs come from cancer studies. For example, MALAT-1 was associated with poor prognosis in non-small cell lung cancer [17,18], and oral squamous cell carcinoma [19]. Similarly, the lncRNA HOTAIR has been shown to promote metastasis in multiple cancers including breast, gastric, colorectal, cervical and liver cancer [20,21,22,23,24]. LncRNA SNHG1 has been found to regulate NOB1 expression by sponging miR-326 and promotes tumorigenesis in osteosarcoma [25]. Gioia R et al. observed that silencing lncRNA RP11-624C23.1 or RP11-203E8 could provide a selective advantage to leukemic cells by increasing resistance to genotoxic stress, possibly by modulating the DNA damage response (DDR) pathway [26]. Moreover, lncRNA was recently hailed as a possible biomarker for cancer [27,28,29]. For instance, lncRNA ZEB1-AS1 predicts unfavorable prognosis in gastric cancer [30] and lncRNA-ATB has potential as a biomarker for the prognosis of hepatocellular carcinoma and as a targeted therapy for afflicted patients [31].
The interest in lncRNA has grown considerably as evidence of lncRNA’s role in various biological contexts has accumulated in recent years (Figure 2). lncRNAdb is a database that provides comprehensive annotations of eukaryotic lncRNAs manually curated from referenced literature [32]. The current version of lncRNAdb is released in Jan 2015 providing information of sequence, genomic context, expression profile, structure, subcellular localization, conservation, and function for 287 eukaryotic lncRNAs. LncRNADisease, another database that documents previously identified and predicts novel lncRNA-disease associations for human and mouse was published in 2013 [33]. The LncRNADisease curated the experimentally supported lncRNA-disease association data and integrated a tool for predicting potential associated diseases for a novel lncRNA based on its genomic context. In addition, LncRNADisease also collected lncRNA interactions in various levels, including protein, RNA, miRNA, and DNA. Current version (July 2017) of LncRNADisease documents ~3000 published lncRNA-disease associations including 914 lncRNAs and 329 diseases. Another notable lncRNA resource is LNCipedia (v4.1), a database which contains 146,742 annotated human lncRNAs [34]. Other published lncRNA database that are worth noting are lncRNome, a comprehensive database of lncRNA in humans [35]; MONOCLdb, which provides the annotations and expression profiles of mouse lncRNAs involved in influenza and SARS-CoV infections [36]; and NRED, a database of long non-coding RNA expression [37]. A list of currently available databases for lncRNA is shown in Table 1.
Table 1.
Name | Link | Dataset | Organism | PMID |
---|---|---|---|---|
ChIPBase v2.0 | http://rna.sysu.edu.cn/chipbase/ | transcriptional regulatory networks of ncRNAs and protein-coding genes | 10 species | 27924033 |
DIANA-LncBase v2 | http://www.microrna.gr/LncBase/ | miRNA-lncRNA interactions | human and mouse | 26612864 |
LNCipedia 4.1 | http://www.lncipedia.org/ | annotated lncRNA transcripts | human | 25378313 |
lncRNAdb v2.0 | http://www.lncrnadb.org/ | functional lncRNAs | 68 species | 25332394 |
LncRNADisease | http://www.cuilab.cn/lncrnadisease/ | lncRNA-disease associations | human | 23175614 |
LncRNAWiki | http://lncrna.big.ac.cn/ | lncRNA knowledgebase | human | 25399417 |
lncRNome | http://genome.igib.res.in/lncRNome/ | lncRNA knowledgebase | human | 23846593 |
MONOCLdb | http://www.monocldb.org/ | lncRNAs expressed in collaborative cross founder mice in response to respiratory virus infection | mouse | 24922324 |
NONCODE (v5.0) | http://www.noncode.org/ | lncRNA knowledgebase | 17 species | 26586799 |
NRED | http://nred.matticklab.com/ | lncRNA expression | human and mouse | 18829717 |
The identification of lncRNA requires the detection of transcription from unannotated genomic regions. This can be done by a number of techniques, including tiling array, serial analysis of gene expression (SAGE) [38], cap analysis gene expression (CAGE) [39], and the most powerful technique to date: RNA-seq, which prompts the development of multiple RNA-seq based pipelines for identifying lncRNAs [40,41]. Furthermore, chromatin immunoprecipitation (ChIP) technology, either ChIP-chip or Chip-seq can also identify novel lncRNAs indirectly by studying genomic regions with protein or histone modifications [42,43].
Traditional RNAseq libraries were built based on the presence of a poly(A) tail, which is present in all mRNA except for histone encoded mRNAs [44]. It is estimated that 60% of lncRNAs also have poly(A) tails [45]. Thus poly(A) tail-based RNA capture is not able to describe the entire range of lncRNAs. The preferred RNA library preparation method for this is total RNA library construction, which depletes ribosomal RNA (rRNA) and washes out small RNAs by size selection [46]. The current common total RNA library kits are Ribo-Zero and RNase H, each of which has its own strengths and weaknesses [47]. The total library usually requires more sequencing reads due to the multiple RNA species that are present in the library, and the rRNA reduction is not 100% efficient, usually leaving a portion of the rRNA in the library. In the last few years, lncRNA-focused microarray products have become available for purchase, such as Array star INC’s Human LncRNA Expression Microarray V4.0. However, these products are limited to known lncRNAs. In our opinion, RNAseq remains the ideal technology for detecting lncRNAs.
The number of definable lncRNAs varies by study. In 2005, a study claimed to have identified over 35,000 lncRNAs [11]. Another study in 2007 estimated that there are four times more lncRNA compared to protein-coding RNA [48]. In the ENCODE lncRNA release (Version 26), 15,787 lncRNAs were identified, further categorizing them into four sub-classes: antisense, large intergenic non-coding RNAs (lincRNA), sense intronic, and processed transcripts. In the latest study in 2017 by Hon et al., 27,919 human lncRNAs with high confidence 5’ end were described [49].
According to the latest gene annotation (GRCh38) gene transfer format file (GTF) (Homo_sapiens.GRCh38.89.gtf) from Ensemble, the total length of coding RNA is 97,662,789 nucleotides compared to 9,699,539 nucleotides for lincRNA. How lncRNA exerts its influence over disease is not well-known. The lncRNA-disease association may be activated through the regulation of protein-coding gene expression by a trans-expression quantitative trait locus (eQTL) and these trans-eQTL SNPs could be a surrogate SNP for SNPs residing in protein-coding RNA within the same high-linkage disequilibrium haplotype block. Cabili et al. found that many tissue-specific cis-eQTL are SNPs with known diseases or trait associations [50]. Hon et al. found that eQTL-linked lncRNA-mRNA pairs were more co-expressed than random lncRNA-mRNA pairs [49].
Even though the majority of lncRNAs are believed to be non-coding, some of them may potentially harbor novel unannotated proteins. In recent years, there have been increasing efforts to predict the functional potential of lncRNA. The first tool created for such a purpose was the Coding Potential Calculator (CPC) [51]. CPC assesses coding-potential by incorporating features such as the length of the open reading frame, coverage and integrity of the predicted open reading frame into a support vector machine (SVM). Getorf is a tool implemented by the European Molecular biology Open Software Suite [52], which identifies ORFs by identifying start and stop codons. The coding-potential assessment tool (CPAT) was designed to access coding potential through a logistic regression model accessing similar features used in CPC, such as length of ORF and ORF coverage. CPAT utilized two additional features, the Fickett score and the hexamer score, where the Fickett score describes the nucleotide composition of the RNA, and the hexamer score indicates the relative degree of hexamer usage bias in a particular sequence. The latest entry to this tool category is slncky [41], which is designed to discover lncRNAs from an RNA-seq dataset and assess their functional importance. Slncky assesses the coding potential by identifying conserved ORFs in syntenic regions across multiple species, with the syntenic regions identified by liftOver [53].
3. Circular RNA (circRNA)
CircRNA is a type of evolutionarily conserved RNA that forms a covalently closed continuous RNA loop. There are two scenarios of the formation of circRNAs: direct ligation of 5’ and 3’ ends of linear RNA, and backsplicing, wherein a downstream 3’ splice site joins to an upstream 5’ splice site (Figure 3). It has been suggested that the formation mechanism is associated with the RNA editing enzyme adenosine deaminase acting on RNA [54], and that the RNA-binding protein quaking regulates the formation of circRNAs [55]. It has been suggested that circRNA can be formed from mRNA and other non-coding RNA in the intergenic regions [56]. CircRNAs do not possess poly(A) tails [56,57], thus total RNA library preparation is more suitable for circRNA detection. There have been multiple reports that circRNA can be detected in non-tissue samples, such as saliva [58], blood plasma [59] and seminal plasma [60]. Currently, a major recognized function of circRNA is their action as a micro RNA (miRNA) sponge. It was found that circRNAs are much easier to bind with miRNA than with mRNA [61], allowing circRNA to repress miRNA regulation on mRNA.
CircRNA was originally identified through the analysis of scrambled exons over two decades ago using older, non-high throughput technology [62]. CircRNA was considered a rare event and exclusive to only a few viruses [63], and has evaded detection in mammalian genomes for a long time due to the lack of poly(A) tails. Advances in HTS technology have allowed researchers to scrutinize the mammalian genome in unprecedented detail, allowing for the identification of thousands of additional circRNAs. A large portion of the identified circRNAs are derived from protein-coding genes. However, after formation, these circRNAs are not able to undergo translation to proteins, and thus they have been categorized as non-coding RNA. Interestingly, recently several groups have discovered a protein-encoding function for some circRNAs, revealing an unexplored model of gene expression [64,65,66]. There have been increasingly more human disease studies devoted to circRNAs (Figure 2). For example, circRNA has been found to be a biomarker for cancer [67,68] and is associated with neurological disease [69]. The molecular roles and function, and how circRNA dysregulation affects disease, were reviewed in [70].
Novel bioinformatic tools have contributed greatly to the identification of the new circRNAs from RNA-seq data. Notable circRNA identification tools are find_circ [71], MapSplice2 [72], Segemehl [73], circExplorer [74], circRNA_finder [75], CIRI [76], ACFS [69], KNIFE [77], NCLscan [78], DCC [79] and UROBORUS [80]. Information regarding these tools is listed in Table 2. In addition, CircView, a platform visualization tool, is specifically developed to visualize circRNAs detected from these tools [81]. Furthermore, unlike other types of ncRNAs, circRNAs are not well-annotated. The traditional RNA annotation database RefSeq and Ensembl do not contain circRNA information currently. Instead, seven independent circRNA databases are available (Table 3) [82,83,84,85,86,87,88].
Table 2.
Tools | Language | Prerequisites | Input | PMID |
---|---|---|---|---|
find_circ | Python | Python (2.7) (numpy, pysam) Bowtie2 SAMtools |
reads in fastq | 23446348 |
MapSplice2 | Python | g++ (≥4.3.3) Python (≥2.4.3) SAMtools Bowtie |
reads in fasta/fastq | 20802226 |
Segemehl | C | NA | reads in fasta | 1975021224512684 |
circExplorer2 | Python | Python (≥2.7) (pysam ≥ 0.8.4, pybedtools, pandas, docopt, and scipy) TopHat (≥2.0.9) Cufflinks (≥2.1.1) BEDTools genePredToGtf gtfToGenePred bedGraphToBigWig (optional) bedToBigBed (optional) STAR (≥2.4.0j) MapSplice (≥2.1.9) BWA (≥0.6.2-r126) segemehl (≥0.2.0) |
reads in fastq | 27365365 |
circRNA_Finder | Perl | Perl awk STAR (versions 2.4.1c) samtools |
reads in fasta | 25544350 |
CIRI | Perl | Perl | alignment in SAM | 2558336528334140 |
ACFS | Perl | BWA-0.7.3a Perl BLAT (optional) |
reads in fasta | 27929140 |
KNIFE | Perl, Python & R | Bowtie2 (≥2.2.1) Bowtie (≥0.12.7) Perl Python 2.7.5 (numpy, scipy) R 3.0.2 (data.table) SAMtools |
reads in fastq | 26076956 |
NCLscan | Python & C++ | Python 2.7 BEDTools SAMtools BLAT BWA |
reads in fasta | 26442529 |
DCC | Python | Python (pysam, pandas, numpy, and HTSeq) | alignment in BAM | 26556385 |
UROBORUS | Perl | TopHat SAMtools Bowtie1/Bowtie2 |
alignment in SAM | 26873924 |
Table 3.
Name | Link | Dataset | Organism | PMID |
---|---|---|---|---|
circBase | http://www.circbase.org/ | merged and unified data sets of circRNAs | Human, Mouse, Worm, fly, and coelacanth | 25234927 |
circRNADb | http://202.195.183.4:8000/circrnadb/circRNADb.php | exonic circRNAs | Human | 27725737 |
Circ2Traits | http://gyanxet-beta.com/circdb/ | disease-circRNA association | Human | 24339831 |
CircNet | http://circnet.mbc.nctu.edu.tw/ | tissue-specific circRNA expression profiles and circRNA–miRNA-gene regulatory networks | Human | 26450965 |
circRNABase (in starBase v2.0) | http://202.116.90.187/mirCircRNA.php | miRNA-circRNA interactions | Human, Mouse, and worm | 24297251 |
PlantcircBase | http://ibi.zju.edu.cn/plantcircbase/ | plant circular RNA Database | Rice, Mouse-ear cress, corn, Tomato, and Barley | 28315753 |
4. Pseudogenes
Pseudogenes are a category of ncRNA that resembles mRNA, but they are not transcribed into proteins. Currently there are two major mechanisms for the formation of pseudogenes. The first one suggests that pseudogenes are the products of the process of genomic duplication [89]. Genes in the duplicated regions often retain the original functions of their parent protein-coding genes. It was shown that high mutation rates were observed near or on duplicated regions [90]. When mutations such as stop-gain and frameshifts disrupt the original function of the duplicated genes, pseudogenes are formed. The second mechanism describes how pseudogenes can be formed through retrotransposition, where the process of reverse transcription of mRNA re-integrates cDNA sequences into the genome, forming new pseudogenes [91]. The formation of pseudogenes provides vital clues on how genomic DNA has adapted to evolutionary pressure to ensure survival. Pseudogenes are often regarded as dead or disabled with respect to protein synthesis. However, evolutionary studies have found evidence that a small number of pseudogenes in the human lineage have regained their protein-coding function [92].
The detection of pseudogenes usually relies on the careful analysis of sequence alignment. A common approach is to determine the homologous sequences to protein coding genes using tools such as FASTA or BLAST. Several more complicated computational approaches have been developed over the years to detect pseudogenes [93,94,95]. However, their utilization has been low, probably partially due to the lack of interest in pseudogenes. Despite this lack of enthusiasm, several interesting studies have shown how pseudogenes can potentially affect human health. Most of the proposed pseudogene functions are facilitated through the homology of their sequences. An example is PTENP1 which is the pseudogene of PTEN, a well-characterized tumor suppressor [96]. PTENP1 regulates cellular levels of PTEN by both sense and antisense RNAs which act as decoys for PTEN targeting microRNAs and also exert tumor-suppressive activities [97]. It has been shown that a missense mutation in PTENP1 can eliminate a codon associated with methionine initiation, thus inhibiting the translation of regular PTEN protein [97]. Another example is a pseudogene acting as a miRNA sponge. Studies have shown that BRAFP1 [98] and PTENP1 [97] compete for miRNA binding with their mRNA counterparts. The majority of the pseudogenes have been characterized in RefSeq and Ensembl. They are usually part of the regular RNA-seq analysis. Additional pseudogene resources for non-human species can be found at pseudogene.org [99]. Pseudofam provides family based pseudogene resources [100].
5. Small RNA
sRNAs are non-coding RNAs with a length of less than 200 nucleotides. The discovery of sRNAs has substantially enriched our understanding of the diverse world of RNAs. There are many species of sRNA. Here we will discuss eight species that have been proven to be detectable from HTS data; miRNA, transfer RNA (tRNA), piwi-interacting RNA (piRNA), small nucleolar RNA (snoRNA), small nuclear RNA (snRNA), Y RNA (yRNA), single-recognition particle RNA (7SL RNA), and 7SK RNA.
The most studied smallest RNAs are miRNAs with more than 10,000 published manuscripts on PubMed (Figure 2). Originally discovered in 1993 [101], miRNAs are single-stranded ncRNAs of 19–25 nucleotides, which modulate translation by binding mRNAs through the seed sequence (up to seven nucleotides). Prior to the introduction of HTS technology, high-throughput miRNAs studies were conducted using hybridization-based technology, which limited the detection of miRNA to known and annotated miRNAs. The advancement of HTS has substantially increased the detection throughput of miRNA. More importantly, HTS enables the examination of miRNA at a single nucleotide resolution in addition to the quantification of abundance. By scrutinizing the precise nucleotide sequences of miRNA, researchers have discovered the phenomenon of miRNA isoforms (isomiR). The isomiRs are miRNAs with clipped seed sequences, compared to reference miRNA sequences [102] (Figure 4). The seed sequence of isomiRs and their parent miRNAs can differ by up to two nucleotides, causing a substantial difference in the repertoire of predicted mRNAs targets. Because miRNAs are well studied, their annotations are readily available in most annotation databases, such as Ensemble and Refseq. In addition, more than 50 miRNA and/or miRNA target resources are available now (Table 4 and Table 5). The most commonly used independent miRNA database is miRBase [103], and the most used miRNA target prediction webserver is TargetScan [104]. Another feature of miRNA that can be characterized by HTS technology is the non-templated nucleotide additions at the 3’ end of miRNAs [105,106,107,108]. The miRNA generally function as the transcriptional control of the regulatory elements of other protein-coding genes [109]. Thus, miRNAs play important roles in most biological processes, including development, proliferation, differentiation, immune reaction, apoptosis, tumorgenesis, adaptation to stress, and etc [101,109,110,111,112]. The miRNAs have exhibited potential as biomarkers or therapeutic targets for human diseases including cancer. For example, overexpression of miR-185 was shown to inhibit autophagy and apoptosis of dopaminergic cells in Parkinson’s disease, potentially via regulation of the AMPK/mTOR signaling pathway [113]. Several miRNAs have been repeatedly reported to be significantly dys-expressed in prostate cancer, including the down-regulated miR-143/145 and up-regulated let-7a, miR-130b, miR-141, and miR-17-5p [46,114,115]. In lung cancer, the 5q33 region containing miR-143 and miR-145 is often deleted which implies decreased expression of both miRNAs [116].
Table 4.
Name | Link | Organism | PMID |
---|---|---|---|
ComiR | http://www.benoslab.pitt.edu/comir/ | human, mouse, worm and fly | 23703208 |
MBSTAR | http://www.isical.ac.in/~bioinfo_miu/MBStar30.htm | human | 25614300 |
miRDB | http://www.mirdb.org/ | human, mouse, rat, dog and chicken | 25378301 |
miRmap | http://mirmap.ezlab.org/ | human, mouse, cow, opossum, chicken, chimpanzee, zebrafish and other | 23716633 |
mirPath v.3 | http://www.microrna.gr/miRPathv3 | human, mouse, rat, worm, fly, chicken and zebrafish | 25977294 |
miRNA_Targets | http://mamsap.it.deakin.edu.au/mirna_targets/ | human, mouse, chicken, cow, worm, fly and zebrafish | 22940442 |
miRSystem | http://mirsystem.cgm.ntu.edu.tw/ | human and mouse | 22870325 |
miRTar2GO | http://www.mirtar2go.org/ | human (cell type specific) | 27903911 |
miRwalk2.0 | http://zmf.umm.uni-heidelberg.de/mirwalk2 | 15 species | 26226356 |
MR-microT | http://diana.imis.athena-innovation.gr/DianaTools/index.php?r=mrmicrot/index | human, mouse and fly | 22285563 |
PITA | https://genie.weizmann.ac.il/pubs/mir07/mir07_prediction.html | human, mouse, fly and worm | 17893677 |
psRNATarget | http://plantgrn.noble.org/psRNATarget/ | plant | 21622958 |
STarMir | http://sfold.wadsworth.org/starmir.html | mammalian | 24803672 |
TAPIR | http://bioinformatics.psb.ugent.be/webtools/tapir | 10 plant species | 20430753 |
TargetScan v7.1 | http://www.targetscan.org/vert_71/ | human, mouse, worm, fly and fish | 26267216 |
Tools4miRs | https://tools4mirs.org/ | any | 27153626 |
Table 5.
Name | Link | Description | Organism | PMID |
---|---|---|---|---|
ChIPBase v2.0 | http://rna.sysu.edu.cn/chipbase/ | regulatory networks of ncRNAs and protein-coding genes | human | 27924033 |
deepBase v2.0 | http://biocenter.sysu.edu.cn/deepBase/ | small RNAs, LncRNAs and circular RNAs | 19 species | 26590255 |
DIANA-TarBase v7.0 | http://www.microrna.gr/tarbase | experimentally validated miRNA:gene interactions | 24 species | 25416803 |
doRiNA v2.0 | http://dorina.mdc-berlin.de/ | RNA interactions in post-transcriptional regulation | human, mouse, fly and worm | 25416797 |
dPORE | http://cbrc.kaust.edu.sa/dpore | SNPs on the regulation of miRNAs and predicted TFBSs | human | 21326606 |
Firefly Discovery Engine | https://www.fireflybio.com/portal/search | published miRNA papers | 6 species | NA |
HOCTAR2 | http://hoctar.tigem.it/ | miRNA target and expression | human | 21435384 |
mESAdb | http://konulab.fen.bilkent.edu.tr/mirna/ | sequences and expression of miRNA | human, mouse and zebrafish | 21177657 |
microRNA.org | http://www.microrna.org | targets and expression | human, mouse, rat, fly and worm | 18158296 |
miRBase | http://www.mirbase.org/ | miRNA sequences and annotation | 206 species | 24275495 |
miRCancer | http://mircancer.ecu.edu/ | miRNA Cancer Association | human | 23325619 |
miRDB | http://www.mirdb.org/ | targets and functional annotations | human, mouse, rat, dog and chicken | 25378301 |
miRecords | http://miRecords.umn.edu/miRecords | animal miRNA-target interactions | 9 species | 18996891 |
miRGate | http://mirgate.bioinfo.cnio.es | miRNA-mRNA targets | human, mouse and rat | 25858286 |
miRGator v3.0 | http://mirgator.kobic.re.kr/ | expression profiles and target relationships | human | 23193297 |
miRGen v2.0 | http://www.microrna.gr/mirgen | cell-line-specific miRNA TSSs and TF binding sites | human and mouse | 26586797 |
miRmine | http://guanlab.ccmb.med.umich.edu/mirmine | miRNA expression profiles | human | 28108447 |
miRNAMap v2.0 | http://miRNAMap.mbc.nctu.edu.tw/ | experimentally verified microRNAs and miRNA target genes | human, mouse, rat and other metazoan genomes | 18029362 |
miRNEST v2.0 | http://mirnest.amu.edu.pl | collection of animal, plant and virus microRNA data | 544 species | 24243848 |
miROrtho | http://cegg.unige.ch/mirortho | miRNA gene candidates | 45 species | 18927110 |
miRSel | http://services.bio.ifi.lmu.de/mirsel | literature derived miRNA-gene association | human, mouse and rat | 20233441 |
miRTarBase v7.0 | http://miRTarBase.mbc.nctu.edu.tw/ | experimentally validated miRNA-target interactions | 23 species | 29126174 |
miRWalk v2.0 | http://zmf.umm.uni-heidelberg.de/mirwalk2 | predicted and experimentally verified miRNA-target interactions | 15 species | 26226356 |
PMRD | http://bioinformatics.cau.edu.cn/PMRD | plant microRNA | 21 plant species | 19808935 |
PolymiRTS Database | http://compbio.uthsc.edu/miRSNP/ | DNA variations in miRNA seed regions and miRNA target sites | human and mouse | 24163105 |
Psmir | http://www.bio-bigdata.com/Psmir/ | potential associations between small molecules and miRNAs | human | 26759061 |
RepTar | http://reptar.ekmd.huji.ac.il | predicted miRNA target | human and mouse | 21149264 |
SonamiR DB v2.0 | http://compbio.uthsc.edu/SomamiR/ | cancer somatic mutations in miRNAs and their target sites | human | 26578591 |
StarBase v2.0 | http://starbase.sysu.edu.cn/ | RNA-RNA and protein-RNA interaction networks from CLIP-Seq data | human, mouse and worm | 24297251 |
STarMirDB | http://sfold.wadsworth.org/starmirDB.php | miRNA binding sites | human, mouse and worm | 27144897 |
TargetScan v7.1 | http://www.targetscan.org/vert_71/ | miRNA targets | human, mouse, worm, fly and fish | 26267216 |
TMREC | http://bioinfo.hrbmu.edu.cn/TMREC/ | transcription factor and miRNA regulatory cascades in human diseases | human | 25932650 |
TransmiR | http://cmbi.bjmu.edu.cn/transmir | transcription factor–microRNA regulation | 14 species | 19786497 |
VIRmiRNA | http://crdd.osdd.net/servers/virmirna | experimental viral miRNA and their targets | 68 viruses | 25380780 |
HTS technology has also facilitated the discovery and identification of a wide range of sRNA species. At the infancy of HTS technology, small RNA-sequencing was often referred to as miRNA-sequencing as the goal was mostly to study miRNA. Through meticulous examination of HTS data, researchers became aware that miRNAs are only a fraction of the sRNA-sequencing data. The sequencing libraries for sRNA are constructed with size-selected gel electrophoresis, which is agnostic to sRNA categories. All RNA of less than 50 nucleotides in size are selected into the library. The most, or second most, abundant sRNA species is often tRNA (Figure 5A). tRNAs have a length of between 76 to 90 nucleotides, and serve as the physical link between mRNA and protein. The tRNAs detected with sRNA-sequencing are usually tRNA-derived sRNAs, a fragment of the parent tRNA, usually the 33 nucleotide sequences before the anticodon or the 33 nucleotide sequence after the anticodon (Figure 5B,C). The fragmentation of tRNA is caused by the cleavage by RNAse III enzyme, producing tRNA-derived halves [117,118]. Moreover, tRNAs can also be cleaved in a Dicer-dependent manner or as an in-vitro phenomenon by incubation with MgCl2 or nuclease S1 [119]. The 5’ tRNA fragments has been found to inhibit the translation initiation by interfering with the cap binding complex elF4F [120,121]. The production of the tRNA fragments has been shown to be associated with stress [118,122]. The detection of tRNA nucleotide variants and their association with diseases has also been increasingly reported [123,124,125,126]. For example, in animals, the tRNA fragment abundance has been found to be correlated with the severity of tissue damage in kidneys [127]. These 5’ tRNA fragments may be captured by high-throughput sequencing. In 2015, a novel technique was developed to sequence the entire tRNA by removing the bases with potential modification [128].
Other noticeable species of sRNA that can be detected through HTS are piRNA, yRNA, snRNA, snoRNA, 7SL, and 7SK. piRNA is a small RNA with a length of 24 to 32 nucleotides, and is considered by many to be the most abundant species of sRNA [129]. PiRNAs form RNA-protein complexes with the piwi proteins. Although the piRNA pathway has been commonly perceived as germline-specific, recent studies have demonstrated that the piRNA pathway has somatic functions [130], and potential associations with cancer [131]. Several studies have suggested that piRNA can be derived from pseudogenes [132,133]. yRNAs are components of the Ro60 ribonucleoprotein particle. Their primary function is DNA replication through interaction with chromatin and initiation proteins [134]. The latest studies have suggested that yRNAs regulate cell death and inflammation in monocytes [135]. snRNA is a species of small RNA confined to the splicing speckles and Cajal bodies of the nucleus in eukaryotic cells. The average length of snRNA is 150 nucleotides. The snRNAs bind with proteins to form small nuclear ribonucleoprotein particles (snRNPs). Some snRNAs are engaged in the formation and function of spliceosomes where pre-mRNA splicing occurs [136]. The transcription of snRNA is carried out by RNA polymerase II or III [137]. snoRNA is a species of sRNA with a length of 60 to 300 nucleotides. Its primary function is to guide chemical modifications of ribosomal RNA and tRNA. The two main classes of snoRNA are C/D box snoRNA and H/ACA box snoRNA. The primary function of C/D box snoRNA is to regulate methylation and pre-mRNA splicing [138]. The H/ACA box snoRNAs have been associated with pseudouridine, the most abundant modified nucleoside in RNA. Discovered in 1970 [139], 7SL is a species of sRNA, and a component of the signal-recognition particle ribonucleoprotein complex. The primary functions of 7SL RNA are to regulate protein translation [140], and post-translational transport [141]. 7SK is a species of sRNA found in metazoans. Its primary role is regulating transcription through the regulation of the positive transcription elongation factor P-TEFb [142].
Many miRNA processing pipelines have been established for utilizing HTS data. The major pipelines include: Oasis [143], Chimira [144], miRge [145], and TIGER [146], etc. Several tools have also been developed to detect isomiRs such as SeqBuster [147], isomiRID [148] and DeAnnIso [149]. One tRNA detection tool, tDRMapper [150], is currently available. For other species of sRNAs, the majority of the alignment-based pipelines would be sufficient for detection given that the annotation is available.
6. Conclusions
The biology of the human body is a vast and complex system and we are just beginning to understand the role of non-coding RNA in regulating that system. The advancement of biotechnology contributes to every breakthrough in our understanding of human biology. ncRNAs, the supposedly insignificant portion of the RNA universe, have exploded with an array of studies centered around the potential functions and disease associations related to ncRNAs. The sudden interest in ncRNA is due to the maturity of HTS technology and the development of bioinformatics allowing the interpretation of HTS data.
The interest in ncRNA is reflected by the increasing number of manuscripts published pertaining to ncRNA and the initiation of large consortium projects focused on ncRNA, such as ENCODE. The controversy surrounding whether ncRNAs are functional really comes down to the definition of “functional.” In the 2012 ENCODE publication [4], all transcribed RNAs were considered to be functional, but some researchers require a stricter definition. Nevertheless, the majority of the human genome should have a purpose, whether it is to synthesize proteins or to serve as a sponge for miRNA, or with lost or undiscovered mechanisms.
The common consensus of the functions of ncRNA is that they regulate gene expression at both transcriptional and epigenetic levels. The exact mechanism of this regulation varies by ncRNA categories; and some may be yet to be discovered. As HTS technology and sequencing library construction methods advance, we come closer to elucidating the entire human RNA spectrum and uncovering the secrets of ncRNAs.
Acknowledgments
This study was supported by grant 5P30CA118100 from National Cancer Institute.
Author Contributions
Y.G., Y.X. and J.W. wrote the manuscript. D.C.S. provided conceptual and editorial assistant. Y.-Y.Z. and S.Z. designed figures and tables.
Conflicts of Interest
The authors declare no conflict of interest.
References
- 1.Freedman M.L., Monteiro A.N., Gayther S.A., Coetzee G.A., Risch A., Plass C., Casey G., De Biasi M., Carlson C., Duggan D., et al. Principles for the post-GWAS functional characterization of cancer risk loci. Nat. Genet. 2011;43:513–518. doi: 10.1038/ng.840. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Blattler A., Yao L., Witt H., Guo Y., Nicolet C.M., Berman B.P., Farnham P.J. Global loss of DNA methylation uncovers intronic enhancers in genes showing expression changes. Genome Biol. 2014;15 doi: 10.1186/s13059-014-0469-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Consortium E.P., Bernstein B.E., Birney E., Dunham I., Green E.D., Gunter C., Snyder M. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489:57–74. doi: 10.1038/nature11247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Eddy S.R. The C-value paradox, junk DNA and encode. Curr. Biol. 2012;22 doi: 10.1016/j.cub.2012.10.002. [DOI] [PubMed] [Google Scholar]
- 5.Niu D.K., Jiang L. Can ENCODE tell us how much junk DNA we carry in our genome? Biochem. Biophys. Res. Commun. 2013;430:1340–1343. doi: 10.1016/j.bbrc.2012.12.074. [DOI] [PubMed] [Google Scholar]
- 6.Graur D., Zheng Y., Price N., Azevedo R.B., Zufall R.A., Elhaik E. On the immortality of television sets: “Function” in the human genome according to the evolution-free gospel of ENCODE. Genome Biol. Evol. 2013;5:578–590. doi: 10.1093/gbe/evt028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Rands C.M., Meader S., Ponting C.P., Lunter G. 8.2% of the human genome is constrained: variation in rates of turnover across functional element classes in the human lineage. PLoS Genet. 2014;10 doi: 10.1371/journal.pgen.1004525. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Perkel J.M. Visiting “noncodarnia”. Biotechniques. 2013;54:301, 303–304. doi: 10.2144/000114037. [DOI] [PubMed] [Google Scholar]
- 9.Mercer T.R., Dinger M.E., Mattick J.S. Long non-coding RNAs: insights into functions. Nat. Rev. Genet. 2009;10:155–159. doi: 10.1038/nrg2521. [DOI] [PubMed] [Google Scholar]
- 10.Knoll M., Lodish H.F., Sun L. Long non-coding RNAs as regulators of the endocrine system. Nat. Rev. Endocrinol. 2015;11:151–160. doi: 10.1038/nrendo.2014.229. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N., Oyama R., Ravasi T., Lenhard B., Wells C., et al. The transcriptional landscape of the mammalian genome. Science. 2005;309:1559–1563. doi: 10.1126/science.1112014. [DOI] [PubMed] [Google Scholar]
- 12.Dinger M.E., Amaral P.P., Mercer T.R., Mattick J.S. Pervasive transcription of the eukaryotic genome: Functional indices and conceptual implications. Brief. Funct. Genom. Proteom. 2009;8:407–423. doi: 10.1093/bfgp/elp038. [DOI] [PubMed] [Google Scholar]
- 13.Amaral P.P., Mattick J.S. Noncoding RNA in development. Mamm. Genome. 2008;19:454–492. doi: 10.1007/s00335-008-9136-7. [DOI] [PubMed] [Google Scholar]
- 14.Dinger M.E., Amaral P.P., Mercer T.R., Pang K.C., Bruce S.J., Gardiner B.B., Askarian-Amiri M.E., Ru K., Solda G., Simons C., et al. Long noncoding RNAs in mouse embryonic stem cell pluripotency and differentiation. Genome Res. 2008;18:1433–1445. doi: 10.1101/gr.078378.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Schoeftner S., Blasco M.A. Developmentally regulated transcription of mammalian telomeres by DNA-dependent RNA polymerase II. Nat. Cell Biol. 2008;10:228–236. doi: 10.1038/ncb1685. [DOI] [PubMed] [Google Scholar]
- 16.Mercer T.R., Dinger M.E., Sunkin S.M., Mehler M.F., Mattick J.S. Specific expression of long noncoding RNAs in the mouse brain. Proc. Natl. Acad. Sci. USA. 2008;105:716–721. doi: 10.1073/pnas.0706729105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Schmidt L.H., Spieker T., Koschmieder S., Schaffers S., Humberg J., Jungen D., Bulk E., Hascher A., Wittmer D., Marra A., et al. The long noncoding MALAT-1 RNA indicates a poor prognosis in non-small cell lung cancer and induces migration and tumor growth. J. Thorac. Oncol. 2011;6:1984–1992. doi: 10.1097/JTO.0b013e3182307eac. [DOI] [PubMed] [Google Scholar]
- 18.Gutschner T., Hammerle M., Eissmann M., Hsu J., Kim Y., Hung G., Revenko A., Arun G., Stentrup M., Gross M., et al. The noncoding RNA MALAT1 is a critical regulator of the metastasis phenotype of lung cancer cells. Cancer Res. 2013;73:1180–1189. doi: 10.1158/0008-5472.CAN-12-2850. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Zhou X., Liu S., Cai G., Kong L., Zhang T., Ren Y., Wu Y., Mei M., Zhang L., Wang X. Long non coding RNA MALAT1 promotes tumor growth and metastasis by inducing epithelial-mesenchymal transition in oral squamous cell carcinoma. Sci. Rep. 2015;5 doi: 10.1038/srep15972. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Gupta R.A., Shah N., Wang K.C., Kim J., Horlings H.M., Wong D.J., Tsai M.C., Hung T., Argani P., Rinn J.L., et al. Long non-coding RNA HOTAIR reprograms chromatin state to promote cancer metastasis. Nature. 2010;464:1071–1076. doi: 10.1038/nature08975. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Bhan A., Hussain I., Ansari K.I., Bobzean S.A., Perrotti L.I., Mandal S.S. Bisphenol-A and diethylstilbestrol exposure induces the expression of breast cancer associated long noncoding RNA HOTAIR in vitro and in vivo. J. Steroid Biochem. Mol. Biol. 2014;141:160–170. doi: 10.1016/j.jsbmb.2014.02.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Bhan A., Hussain I., Ansari K.I., Kasiri S., Bashyal A., Mandal S.S. Antisense transcript long noncoding RNA (lncRNA) HOTAIR is transcriptionally induced by estradiol. J. Mol. Biol. 2013;425:3707–3722. doi: 10.1016/j.jmb.2013.01.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Hajjari M., Salavaty A. Hotair: An oncogenic long non-coding RNA in different cancers. Cancer Biol. Med. 2015;12:1–9. doi: 10.7497/j.issn.2095-3941.2015.0006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Zhong D.N., Luo Y.H., Mo W.J., Zhang X., Tan Z., Zhao N., Pang S.M., Chen G., Rong M.H., Tang W. High expression of long noncoding HOTAIR correlated with hepatocarcinogenesis and metastasis. Mol. Med. Rep. 2017 doi: 10.3892/mmr.2017.7999. [DOI] [PubMed] [Google Scholar]
- 25.Wang J., Cao L., Wu J., Wang Q. Long non-coding RNA SNHG1 regulates NOB1 expression by sponging miR-326 and promotes tumorigenesis in osteosarcoma. Int. J. Oncol. 2017 doi: 10.3892/ijo.2017.4187. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Gioia R., Drouin S., Ouimet M., Caron M., St-Onge P., Richer C., Sinnett D. LncRNAs downregulated in childhood acute lymphoblastic leukemia modulate apoptosis, cell migration, and DNA damage response. Oncotarget. 2017;8:80645–80650. doi: 10.18632/oncotarget.20817. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Shi T., Gao G., Cao Y. Long noncoding RNAs as novel biomarkers have a promising future in cancer diagnostics. Dis. Markers. 2016 doi: 10.1155/2016/9085195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Yarmishyn A.A., Kurochkin I.V. Long noncoding RNAs: A potential novel class of cancer biomarkers. Front. Genet. 2015;6:145. doi: 10.3389/fgene.2015.00145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Tian S., Rulli S., Lader E. Serum lncRNA detection as potential biomarker of lung cancer. FASEB J. 2015;29 [Google Scholar]
- 30.Li Y., Wen X., Wang L., Sun X., Ma H., Fu Z., Li L. LncRNA ZEB1-AS1 predicts unfavorable prognosis in gastric cancer. Surg. Oncol. 2017;26:527–534. doi: 10.1016/j.suronc.2017.09.008. [DOI] [PubMed] [Google Scholar]
- 31.Jang S.Y., Kim G., Park S.Y., Lee Y.R., Kwon S.H., Kim H.S., Yoon J.S., Lee J.S., Kweon Y.-O., Ha H.T., et al. Clinical significance of lncRNA-ATB expression in human hepatocellular carcinoma. Oncotarget. 2017;8:78588–78597. doi: 10.18632/oncotarget.21094. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Quek X.C., Thomson D.W., Maag J.L., Bartonicek N., Signal B., Clark M.B., Gloss B.S., Dinger M.E. LncRNAdb v2.0: Expanding the reference database for functional long noncoding RNAs. Nucleic Acids Res. 2015;43 doi: 10.1093/nar/gku988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Chen G., Wang Z., Wang D., Qiu C., Liu M., Chen X., Zhang Q., Yan G., Cui Q. LncRNADisease: A database for long-non-coding RNA-associated diseases. Nucleic Acids Res. 2013;41 doi: 10.1093/nar/gks1099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Volders P.J., Verheggen K., Menschaert G., Vandepoele K., Martens L., Vandesompele J., Mestdagh P. An update on LNCipedia: A database for annotated human lncRNA sequences. Nucleic Acids Res. 2015;43:4363–4364. doi: 10.1093/nar/gkv295. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Bhartiya D., Pal K., Ghosh S., Kapoor S., Jalali S., Panwar B., Jain S., Sati S., Sengupta S., Sachidanandan C., et al. lncRNome: a comprehensive knowledgebase of human long noncoding RNAs. Database (Oxford) 2013;2013 doi: 10.1093/database/bat034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Josset L., Tchitchek N., Gralinski L.E., Ferris M.T., Eisfeld A.J., Green R.R., Thomas M.J., Tisoncik-Go J., Schroth G.P., Kawaoka Y., et al. Annotation of long non-coding RNAs expressed in collaborative cross founder mice in response to respiratory virus infection reveals a new class of interferon-stimulated transcripts. RNA Biol. 2014;11:875–890. doi: 10.4161/rna.29442. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Dinger M.E., Pang K.C., Mercer T.R., Crowe M.L., Grimmond S.M., Mattick J.S. NRED: A database of long noncoding RNA expression. Nucleic Acids Res. 2009;37 doi: 10.1093/nar/gkn617. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Velculescu V.E., Zhang L., Vogelstein B., Kinzler K.W. Serial analysis of gene expression. Science. 1995;270:484–487. doi: 10.1126/science.270.5235.484. [DOI] [PubMed] [Google Scholar]
- 39.Shiraki T., Kondo S., Katayama S., Waki K., Kasukawa T., Kawaji H., Kodzius R., Watahiki A., Nakamura M., Arakawa T., et al. Cap analysis gene expression for high-throughput analysis of transcriptional starting point and identification of promoter usage. Proc. Natl. Acad. Sci. USA. 2003;100:15776–15781. doi: 10.1073/pnas.2136655100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Sun L., Zhang Z., Bailey T.L., Perkins A.C., Tallack M.R., Xu Z., Liu H. Prediction of novel long non-coding RNAs based on RNA-Seq data of mouse Klf1 knockout study. BMC Bioinform. 2012;13:331. doi: 10.1186/1471-2105-13-331. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Chen J., Shishkin A.A., Zhu X.P., Kadri S., Maza I., Guttman M., Hanna J.H., Regev A., Garber M. Evolutionary analysis across mammals reveals distinct classes of long non-coding RNAs. Genome Biol. 2016;17 doi: 10.1186/s13059-016-0880-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Guttman M., Amit I., Garber M., French C., Lin M.F., Feldser D., Huarte M., Zuk O., Carey B.W., Cassady J.P., et al. Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals. Nature. 2009;458:223–227. doi: 10.1038/nature07672. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Cottrell K.A., Djuranovic S. Urb-RIP - an adaptable and efficient approach for immunoprecipitation of RNAs and associated RNAs/proteins. PLoS ONE. 2016;11 doi: 10.1371/journal.pone.0167877. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Guo Y., Zhao S., Sheng Q., Guo M., Lehmann B., Pietenpol J., Samuels D.C., Shyr Y. RNAseq by total RNA library identifies additional RNAs compared to poly(a) RNA library. BioMed Res. Int. 2015;2015 doi: 10.1155/2015/862130. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Cheng J., Kapranov P., Drenkow J., Dike S., Brubaker S., Patel S., Long J., Stern D., Tammana H., Helt G., et al. Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution. Science. 2005;308:1149–1154. doi: 10.1126/science.1108625. [DOI] [PubMed] [Google Scholar]
- 46.Song C., Chen H., Wang T., Zhang W., Ru G., Lang J. Expression profile analysis of microRNAs in prostate cancer by next-generation sequencing. Prostate. 2015;75:500–516. doi: 10.1002/pros.22936. [DOI] [PubMed] [Google Scholar]
- 47.Guo Y., Wu J., Zhao S., Ye F., Su Y., Clark T., Sheng Q., Lehmann B., Shu X.O., Cai Q. RNA sequencing of formalin-fixed, paraffin-embedded specimens for gene expression quantification and data mining. Int. J. Genom. 2016;2016:9837310. doi: 10.1155/2016/9837310. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Kapranov P., Cheng J., Dike S., Nix D.A., Duttagupta R., Willingham A.T., Stadler P.F., Hertel J., Hackermuller J., Hofacker I.L., et al. RNA maps reveal new RNA classes and a possible function for pervasive transcription. Science. 2007;316:1484–1488. doi: 10.1126/science.1138341. [DOI] [PubMed] [Google Scholar]
- 49.Hon C.C., Ramilowski J.A., Harshbarger J., Bertin N., Rackham O.J., Gough J., Denisenko E., Schmeier S., Poulsen T.M., Severin J., et al. An atlas of human long non-coding RNAs with accurate 5’ ends. Nature. 2017;543:199–204. doi: 10.1038/nature21374. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Cabili M.N., Trapnell C., Goff L., Koziol M., Tazon-Vega B., Regev A., Rinn J.L. Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses. Genes Dev. 2011;25:1915–1927. doi: 10.1101/gad.17446611. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Kong L., Zhang Y., Ye Z.Q., Liu X.Q., Zhao S.Q., Wei L., Gao G. CPC: Assess the protein-coding potential of transcripts using sequence features and support vector machine. Nucleic Acids Res. 2007;35:345–349. doi: 10.1093/nar/gkm391. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Rice P., Longden I., Bleasby A. EMBOSS: The European Molecular Biology Open Software Suite. Trends Genet. 2000;16:276–277. doi: 10.1016/S0168-9525(00)02024-2. [DOI] [PubMed] [Google Scholar]
- 53.Hinrichs A.S., Karolchik D., Baertsch R., Barber G.P., Bejerano G., Clawson H., Diekhans M., Furey T.S., Harte R.A., Hsu F. The UCSC genome browser database: update 2006. Nucleic Acids Res. 2006;34:590–598. doi: 10.1093/nar/gkj144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Ivanov A., Memczak S., Wyler E., Torti F., Porath H.T., Orejuela M.R., Piechotta M., Levanon E.Y., Landthaler M., Dieterich C., et al. Analysis of intron sequences reveals hallmarks of circular RNA biogenesis in animals. Cell Rep. 2015;10:170–177. doi: 10.1016/j.celrep.2014.12.019. [DOI] [PubMed] [Google Scholar]
- 55.Conn S.J., Pillman K.A., Toubia J., Conn V.M., Salmanidis M., Phillips C.A., Roslan S., Schreiber A.W., Gregory P.A., Goodall G.J. The RNA binding protein quaking regulates formation of circRNAs. Cell. 2015;160:1125–1134. doi: 10.1016/j.cell.2015.02.014. [DOI] [PubMed] [Google Scholar]
- 56.Chen I., Chen C.Y., Chuang T.J. Biogenesis, identification, and function of exonic circular RNAs. Wiley Interdiscip. Rev. RNA. 2015;6:563–579. doi: 10.1002/wrna.1294. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Jeck W.R., Sorrentino J.A., Wang K., Slevin M.K., Burd C.E., Liu J.Z., Marzluff W.F., Sharpless N.E. Circular RNAs are abundant, conserved, and associated with ALU repeats RNA 2013, 19, 426. RNA. 2013;19:426. doi: 10.1261/rna.035667.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Bahn J.H., Zhang Q., Li F., Chan T.M., Lin X., Kim Y., Wong D.T.W., Xiao X. The landscape of microRNA, Piwi-interacting RNA, and circular RNA in human saliva. Clin. Chem. 2015;61:221–230. doi: 10.1373/clinchem.2014.230433. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Koh W., Pan W., Gawad C., Fan H.C., Kerchner G.A., Wyss-Coray T., Blumenfeld Y.J., El-Sayed Y.Y., Quake S.R. Noninvasive in vivo monitoring of tissue-specific global gene expression in humans. Proc. Natl. Acad. Sci. USA. 2014;111 doi: 10.1073/pnas.1405528111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Dong W.W., Li H.M., Qing X.R., Huang D.H., Li H.G. Identification and characterization of human testis derived circular RNAs and their existence in seminal plasma. Sci. Rep. 2016;6 doi: 10.1038/srep39080. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Hansen T.B., Jensen T.I., Clausen B.H., Bramsen J.B., Finsen B., Damgaard C.K., Kjems J. Natural RNA circles function as efficient microRNA sponges. Nature. 2013;495:384–388. doi: 10.1038/nature11993. [DOI] [PubMed] [Google Scholar]
- 62.Nigro J.M., Cho K.R., Fearon E.R., Kern S.E., Ruppert J.M., Oliner J.D., Kinzler K.W., Vogelstein B. Scrambled exons. Cell. 1991;64:607–613. doi: 10.1016/0092-8674(91)90244-S. [DOI] [PubMed] [Google Scholar]
- 63.Jeck W.R., Sharpless N.E. Detecting and characterizing circular RNAs. Nat. Biotechnol. 2014;32:453–461. doi: 10.1038/nbt.2890. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Yang Y., Fan X., Mao M., Song X., Wu P., Zhang Y., Jin Y., Yang Y., Chen L.L., Wang Y., et al. Extensive translation of circular RNAs driven by N6-methyladenosine. Cell Res. 2017;27:626–641. doi: 10.1038/cr.2017.31. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Pamudurti N.R., Bartok O., Jens M., Ashwal-Fluss R., Stottmeister C., Ruhe L., Hanan M., Wyler E., Perez-Hernandez D., Ramberger E., et al. Translation of circRNAs. Mol. Cell. 2017;66 doi: 10.1016/j.molcel.2017.02.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Legnini I., Di Timoteo G., Rossi F., Morlando M., Briganti F., Sthandier O., Fatica A., Santini T., Andronache A., Wade M., et al. Circ-ZNF609 is a circular RNA that can be translated and functions in myogenesis. Mol. Cell. 2017;66:22–37. doi: 10.1016/j.molcel.2017.02.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Qin M.L., Liu G., Huo X.S., Tao X.M., Sun X.M., Ge Z.H., Yang J., Fan J., Liu L., Qin W.X. Hsa_circ_0001649: A circular RNA and potential novel biomarker for hepatocellular carcinoma. Cancer Biomark. 2016;16:161–169. doi: 10.3233/CBM-150552. [DOI] [PubMed] [Google Scholar]
- 68.Zhang Y., Liang W., Zhang P., Chen J., Qian H., Zhang X., Xu W. Circular RNAs: Emerging cancer biomarkers and targets. J. Exp. Clin. Cancer Res. 2017;36:152. doi: 10.1186/s13046-017-0624-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.You X.T., Vlatkovic I., Babic A., Will T., Epstein I., Tushev G., Akbalik G., Wang M.T., Glock C., Quedenau C., et al. Neural circular RNAs are derived from synaptic genes and regulated by development and plasticity. Nat. Neurosci. 2015;18:603–610. doi: 10.1038/nn.3975. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Holdt L.M., Kohlmaier A., Teupser D. Molecular roles and function of circular RNAs in eukaryotic cells. Cell. Mol. Life Sci. 2017;7 doi: 10.1007/s00018-017-2688-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Memczak S., Jens M., Elefsinioti A., Torti F., Krueger J., Rybak A., Maier L., Mackowiak S.D., Gregersen L.H., Munschauer M., et al. Circular RNAs are a large class of animal RNAs with regulatory potency. Nature. 2013;495:333–338. doi: 10.1038/nature11928. [DOI] [PubMed] [Google Scholar]
- 72.Wang K., Singh D., Zeng Z., Coleman S.J., Huang Y., Savich G.L., He X., Mieczkowski P., Grimm S.A., Perou C.M., et al. MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery. Nucleic Acids Res. 2010;38:178. doi: 10.1093/nar/gkq622. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Hoffmann S., Otto C., Doose G., Tanzer A., Langenberger D., Christ S., Kunz M., Holdt L.M., Teupser D., Hackermuller J., et al. A multi-split mapping algorithm for circular RNA, splicing, trans-splicing and fusion detection. Genome Biol. 2014;15 doi: 10.1186/gb-2014-15-2-r34. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Zhang X.O., Wang H.B., Zhang Y., Lu X., Chen L.L., Yang L. Complementary sequence-mediated exon circularization. Cell. 2014;159:134–147. doi: 10.1016/j.cell.2014.09.001. [DOI] [PubMed] [Google Scholar]
- 75.Westholm J.O., Miura P., Olson S., Shenker S., Joseph B., Sanfilippo P., Celniker S.E., Graveley B.R., Lai E.C. Genome-wide analysis of drosophila circular RNAs reveals their structural and sequence properties and age-dependent neural accumulation. Cell Rep. 2014;9:1966–1980. doi: 10.1016/j.celrep.2014.10.062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Gao Y., Wang J., Zhao F. CIRI: An efficient and unbiased algorithm for de novo circular RNA identification. Genome Biol. 2015;16:4. doi: 10.1186/s13059-014-0571-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Szabo L., Morey R., Palpant N.J., Wang P.L., Afari N., Jiang C., Parast M.M., Murry C.E., Laurent L.C., Salzman J. Statistically based splicing detection reveals neural enrichment and tissue-specific induction of circular RNA during human fetal development. Genome Biol. 2015;16 doi: 10.1186/s13059-015-0690-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Chuang T.J., Wu C.S., Chen C.Y., Hung L.Y., Chiang T.W., Yang M.Y. NCLscan: Accurate identification of non-co-linear transcripts (fusion, trans-splicing and circular RNA) with a good balance between sensitivity and precision. Nucleic Acids Res. 2016;44 doi: 10.1093/nar/gkv1013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Cheng J., Metge F., Dieterich C. Specific identification and quantification of circular RNAs from sequencing data. Bioinformatics. 2016;32:1094–1096. doi: 10.1093/bioinformatics/btv656. [DOI] [PubMed] [Google Scholar]
- 80.Song X., Zhang N., Han P., Moon B.S., Lai R.K., Wang K., Lu W. Circular RNA profile in gliomas revealed by identification tool UROBORUS. Nucleic Acids Res. 2016;44 doi: 10.1093/nar/gkw075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Feng J., Xiang Y., Xia S., Liu H., Wang J., Ozguc F.M., Lei L., Kong R., Diao L., He C., et al. CircView: A visualization and exploration tool for circular RNAs. Brief. Bioinform. 2017 doi: 10.1093/bib/bbx070. [DOI] [PubMed] [Google Scholar]
- 82.Glazar P., Papavasileiou P., Rajewsky N. circBase: A database for circular RNAs. RNA. 2014;20:1666–1670. doi: 10.1261/rna.043687.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Chen X., Han P., Zhou T., Guo X., Song X., Li Y. CircRNADb: A comprehensive database for human circular RNAs with protein-coding annotations. Sci. Rep. 2016;6:34985. doi: 10.1038/srep34985. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Ghosal S., Das S., Sen R., Basak P., Chakrabarti J. Circ2Traits: A comprehensive database for circular RNA potentially associated with disease and traits. Front. Genet. 2013;4:283. doi: 10.3389/fgene.2013.00283. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Liu Y.C., Li J.R., Sun C.H., Andrews E., Chao R.F., Lin F.M., Weng S.L., Hsu S.D., Huang C.C., Cheng C., et al. CircNet: A database of circular RNAs derived from transcriptome sequencing data. Nucleic Acids Res. 2016;44 doi: 10.1093/nar/gkv940. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Li J.H., Liu S., Zhou H., Qu L.H., Yang J.H. starBase v2.0: Decoding miRNA-ceRNA, miRNA-ncRNA and protein-RNA interaction networks from large-scale CLIP-Seq data. Nucleic Acids Res. 2014;42 doi: 10.1093/nar/gkt1248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Chu Q., Zhang X., Zhu X., Liu C., Mao L., Ye C., Zhu Q.H., Fan L. PlantcircBase: A database for plant circular RNAs. Mol. Plant. 2017;10:1126–1128. doi: 10.1016/j.molp.2017.03.003. [DOI] [PubMed] [Google Scholar]
- 88.Xia S., Feng J., Lei L., Hu J., Xia L., Wang J., Xiang Y., Liu L., Zhong S., Han L., et al. Comprehensive characterization of tissue-specific circular RNAs in the human and mouse genomes. Brief. Bioinform. 2016 doi: 10.1093/bib/bbw081. [DOI] [PubMed] [Google Scholar]
- 89.Bailey J.A., Gu Z.P., Clark R.A., Reinert K., Samonte R.V., Schwartz S., Adams M.D., Myers E.W., Li P.W., Eichler E.E. Recent segmental duplications in the human genome. Science. 2002;297:1003–1007. doi: 10.1126/science.1072047. [DOI] [PubMed] [Google Scholar]
- 90.Bailey J.A., Eichler E.E. Primate segmental duplications: crucibles of evolution, diversity and disease. Nat. Rev. Genet. 2006;7:552–564. doi: 10.1038/nrg1895. [DOI] [PubMed] [Google Scholar]
- 91.Harrison P.M., Gerstein M. Studying genomes through the aeons: Protein families, pseudogenes and proteome evolution. J. Mol. Biol. 2002;318:1155–1174. doi: 10.1016/S0022-2836(02)00109-2. [DOI] [PubMed] [Google Scholar]
- 92.Zhang Z.D., Frankish A., Hunt T., Harrow J., Gerstein M. Identification and analysis of unitary pseudogenes: Historic and contemporary gene losses in humans and other primates. Genome Biol. 2010;11 doi: 10.1186/gb-2010-11-3-r26. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Zheng D., Gerstein M.B. A computational approach for identifying pseudogenes in the ENCODE regions. Genome Biol. 2006;7(Suppl 1) doi: 10.1186/gb-2006-7-s1-s13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Valdes C., Capobianco E. Methods to detect transcribed pseudogenes: RNA-Seq discovery allows learning through features. Methods Mol. Biol. 2014;1167:157–183. doi: 10.1007/978-1-4939-0835-6_11. [DOI] [PubMed] [Google Scholar]
- 95.Khurana E., Lam H.Y.K., Cheng C., Carriero N., Cayting P., Gerstein M.B. Segmental duplications in the human genome reveal details of pseudogene formation. Nucleic Acids Res. 2010;38:6997–7007. doi: 10.1093/nar/gkq587. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Dahia P.L.M., FitzGerald M.G., Zhang X., Marsh D.J., Zheng Z.M., Pietsch T., von Deimling A., Haluska F.G., Haber D.A., Eng C. A highly conserved processed PTEN pseudogene is located on chromosome band 9p21. Oncogene. 1998;16:2403–2406. doi: 10.1038/sj.onc.1201762. [DOI] [PubMed] [Google Scholar]
- 97.Poliseno L., Salmena L., Zhang J., Carver B., Haveman W.J., Pandolfi P.P. A coding-independent function of gene and pseudogene mRNAs regulates tumour biology. Nature. 2010;465:1033–1038. doi: 10.1038/nature09144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Karreth F.A., Reschke M., Ruocco A., Ng C., Chapuy B., Leopold V., Sjoberg M., Keane T.M., Verma A., Ala U., et al. The BRAF pseudogene functions as a competitive endogenous RNA and induces lymphoma in vivo. Cell. 2015;161:319–332. doi: 10.1016/j.cell.2015.02.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Karro J.E., Yan Y., Zheng D., Zhang Z., Carriero N., Cayting P., Harrrison P., Gerstein M. Pseudogene.org: A comprehensive database and comparison platform for pseudogene annotation. Nucleic Acids Res. 2007;35 doi: 10.1093/nar/gkl851. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Lam H.Y.K., Khurana E., Fang G., Cayting P., Carriero N., Cheung K.-H., Gerstein M.B. Pseudofam: The pseudogene families database. Nucleic Acids Res. 2009;37 doi: 10.1093/nar/gkn758. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Lee R.C., Feinbaum R.L., Ambros V. The C. elegans heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14. Cell. 1993;75:843–854. doi: 10.1016/0092-8674(93)90529-Y. [DOI] [PubMed] [Google Scholar]
- 102.Morin R.D., O’Connor M.D., Griffith M., Kuchenbauer F., Delaney A., Prabhu A.L., Zhao Y.J., McDonald H., Zeng T., Hirst M., et al. Application of massively parallel sequencing to microRNA profiling and discovery in human embryonic stem cells. Genome Res. 2009;19:610–621. doi: 10.1101/gr.7179508. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Kozomara A., Griffiths-Jones S. mirBase: Annotating high confidence microRNAs using deep sequencing data. Nucleic Acids Res. 2014;42 doi: 10.1093/nar/gkt1181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Agarwal V., Bell G.W., Nam J.W., Bartel D.P. Predicting effective microRNA target sites in mammalian mRNAs. eLife. 2015;4 doi: 10.7554/eLife.05005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Berezikov E., Robine N., Samsonova A., Westholm J.O., Naqvi A., Hung J.H., Okamura K., Dai Q., Bortolamiol-Becet D., Martin R., et al. Deep annotation of Drosophila melanogaster microRNAs yields insights into their processing, modification, and emergence. Genome Res. 2011;21:203–215. doi: 10.1101/gr.116657.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Rajagopalan R., Vaucheret H., Trejo J., Bartel D.P. A diverse and evolutionarily fluid set of micrornas in Arabidopsis thaliana. Genes Dev. 2006;20:3407–3425. doi: 10.1101/gad.1476406. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Westholm J.O., Ladewig E., Okamura K., Robine N., Lai E.C. Common and distinct patterns of terminal modifications to mirtrons and canonical microRNAs. RNA. 2012;18:177–192. doi: 10.1261/rna.030627.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Larter C.Z., Yeh M.M. Animal models of NASH: Getting both pathology and metabolic context right. J. Gastroenterol. Hepatol. 2008;23 doi: 10.1111/j.1440-1746.2008.05543.x. [DOI] [PubMed] [Google Scholar]
- 109.Bartel D.P. MicroRNAs: Genomics, biogenesis, mechanism, and function. Cell. 2004;116:281–297. doi: 10.1016/S0092-8674(04)00045-5. [DOI] [PubMed] [Google Scholar]
- 110.van Rooij E., Sutherland L.B., Qi X., Richardson J.A., Hill J., Olson E.N. Control of stress-dependent cardiac growth and gene expression by a microRNA. Science. 2007;316:575–579. doi: 10.1126/science.1139089. [DOI] [PubMed] [Google Scholar]
- 111.Xiao C., Calado D.P., Galler G., Thai T.H., Patterson H.C., Wang J., Rajewsky N., Bender T.P., Rajewsky K. MiR-150 controls B cell differentiation by targeting the transcription factor c-Myb. Cell. 2007;131:146–159. doi: 10.1016/j.cell.2007.07.021. [DOI] [PubMed] [Google Scholar]
- 112.Guarnieri D.J., DiLeone R.J. MicroRNAs: A new class of gene regulators. Ann. Med. 2008;40:197–208. doi: 10.1080/07853890701771823. [DOI] [PubMed] [Google Scholar]
- 113.Wen Z., Zhang J., Tang P., Tu N., Wang K., Wu G. Overexpression of mir185 inhibits autophagy and apoptosis of dopaminergic neurons by regulating the ampk/mtor signaling pathway in parkinson’s disease. Mol. Med. Rep. 2017 doi: 10.3892/mmr.2017.7897. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114.Casanova-Salas I., Rubio-Briones J., Calatrava A., Mancarella C., Masia E., Casanova J., Fernandez-Serra A., Rubio L., Ramirez-Backhaus M., Arminan A., et al. Identification of miR-187 and miR-182 as biomarkers of early diagnosis and prognosis in patients with prostate cancer treated with radical prostatectomy. J. Urol. 2014;192:252–259. doi: 10.1016/j.juro.2014.01.107. [DOI] [PubMed] [Google Scholar]
- 115.Kumar B., Lupold S.E. MicroRNA expression and function in prostate cancer: A review of current knowledge and opportunities for discovery. Asian J. Androl. 2016;18:559–567. doi: 10.4103/1008-682X.177839. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116.Calin G.A., Croce C.M. MicroRNAs and chromosomal abnormalities in cancer cells. Oncogene. 2006;25:6202–6210. doi: 10.1038/sj.onc.1209910. [DOI] [PubMed] [Google Scholar]
- 117.Garcia-Silva M.R., Cabrera-Cabrera F., Guida M.C., Cayota A. Hints of tRNA-derived small RNAs role in RNA silencing mechanisms. Genes. 2012;3:603–614. doi: 10.3390/genes3040603. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 118.Fu H., Feng J., Liu Q., Sun F., Tie Y., Zhu J., Xing R., Sun Z., Zheng X. Stress induces tRNA cleavage by angiogenin in mammalian cells. FEBS Lett. 2009;583:437–442. doi: 10.1016/j.febslet.2008.12.043. [DOI] [PubMed] [Google Scholar]
- 119.Harada F., Dahlberg J.E. Specific cleavage of tRNA by nuclease S1. Nucleic Acids Res. 1975;2:865–871. doi: 10.1093/nar/2.6.865. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 120.Nekrasov M.P., Ivshina M.P., Chernov K.G., Kovrigina E.A., Evdokimova V.M., Thomas A.A.M., Hershey J.W.B., Ovchinnikov L.P. The mRNA-binding protein YB-1 (p50) prevents association of the eukaryotic initiation factor eIF4G with mRNA and inhibits protein synthesis at the initiation stage. J. Biol. Chem. 2003;278:13936–13943. doi: 10.1074/jbc.M209145200. [DOI] [PubMed] [Google Scholar]
- 121.Bakowska-Zywicka K., Kasprzyk M., Twardowski T. tRNA-derived short RNAs bind to Saccharomyces cerevisiae ribosomes in a stress-dependent manner and inhibit protein synthesis in vitro. FEMS Yeast Res. 2016;16 doi: 10.1093/femsyr/fow077. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 122.Yamasaki S., Ivanov P., Hu G.F., Anderson P. Angiogenin cleaves tRNA and promotes stress-induced translational repression. J. Cell Biol. 2009;185:35–42. doi: 10.1083/jcb.200811106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 123.Guo Y., Bosompem A., Mohan S., Erdogan B., Ye F., Vickers K.C., Sheng Q., Zhao S., Li C.I., Su P.F., et al. Transfer RNA detection by small RNA deep sequencing and disease association with myelodysplastic syndromes. BMC Genom. 2015;16:727. doi: 10.1186/s12864-015-1929-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 124.Abbott J.A., Francklyn C.S., Robey-Bond S.M. Transfer RNA and human disease. Front. Genet. 2014;5:158. doi: 10.3389/fgene.2014.00158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 125.Anderson P., Ivanov P. tRNA fragments in human health and disease. FEBS Lett. 2014;588:4297–4304. doi: 10.1016/j.febslet.2014.09.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 126.Fu Y., Lee I., Lee Y.S., Bao X. Small non-coding transfer RNA-derived RNA fragments (tRFs): Their biogenesis, function and implication in human diseases. Genom.Inform. 2015;13:94–101. doi: 10.5808/GI.2015.13.4.94. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 127.Mishima E., Inoue C., Saigusa D., Inoue R., Ito K., Suzuki Y., Jinno D., Tsukui Y., Akamatsu Y., Araki M., et al. Conformational change in transfer RNA is an early indicator of acute cellular damage. J. Am. Soc. Nephrol. 2014;25:2316–2326. doi: 10.1681/ASN.2013091001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 128.Zheng G., Qin Y., Clark W.C., Dai Q., Yi C., He C., Lambowitz A.M., Pan T. Efficient and quantitative high-throughput tRNA sequencing. Nat. Methods. 2015;12:835–837. doi: 10.1038/nmeth.3478. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 129.Seto A.G., Kingston R.E., Lau N.C. The coming of age for Piwi proteins. Mol. Cell. 2007;26:603–609. doi: 10.1016/j.molcel.2007.05.021. [DOI] [PubMed] [Google Scholar]
- 130.Ross R.J., Weiner M.M., Lin H. PIWI proteins and PIWI-interacting RNAs in the soma. Nature. 2014;505:353–359. doi: 10.1038/nature12987. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 131.Ng K.W., Anderson C., Marshall E.A., Minatel B.C., Enfield K.S., Saprunoff H.L., Lam W.L., Martinez V.D. Piwi-interacting RNAs in cancer: emerging functions and clinical utility. Mol. Cancer. 2016;15:5. doi: 10.1186/s12943-016-0491-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 132.Guo X., Zhang Z., Gerstein M.B., Zheng D. Small RNAs originated from pseudogenes: cis- or trans-acting? PLoS Comput. Biol. 2009;5 doi: 10.1371/journal.pcbi.1000449. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 133.Parrish N.F., Fujino K., Shiromoto Y., Iwasaki Y.W., Ha H., Xing J., Makino A., Kuramochi-Miyagawa S., Nakano T., Siomi H., et al. piRNAs derived from ancient viral processed pseudogenes as transgenerational sequence-specific immune memory in mammals. RNA. 2015;21:1691–1703. doi: 10.1261/rna.052092.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 134.Zhang A.T., Langley A.R., Christov C.P., Kheir E., Shafee T., Gardiner T.J., Krude T. Dynamic interaction of Y RNAs with chromatin and initiation proteins during human DNA replication. J. Cell Sci. 2011;124:2058–2069. doi: 10.1242/jcs.086561. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 135.Hizir Z., Bottini S., Grandjean V., Trabucchi M., Repetto E. RNY (YRNA)-derived small RNAs regulate cell death and inflammation in monocytes/macrophages. Cell Death Dis. 2017;8 doi: 10.1038/cddis.2016.429. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 136.Valadkhan S., Gunawardane L.S. Role of small nuclear RNAs in eukaryotic gene expression. Essays Biochem. 2013;54:79–90. doi: 10.1042/bse0540079. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 137.Henry R.W., Mittal V., Ma B., Kobayashi R., Hernandez N. SNAP19 mediates the assembly of a functional core promoter complex (SNAPc) shared by RNA polymerases II and III. Genes Dev. 1998;12:2664–2672. doi: 10.1101/gad.12.17.2664. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 138.Falaleeva M., Pages A., Matuszek Z., Hidmi S., Agranat-Tamir L., Korotkov K., Nevo Y., Eyras E., Sperling R., Stamm S. Dual function of C/D box small nucleolar RNAs in rRNA modification and alternative pre-mRNA splicing. Proc. Natl. Acad. Sci. USA. 2016;113 doi: 10.1073/pnas.1519292113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 139.Bishop J.M., Levinson W.E., Sullivan D., Fanshier L., Quintrell N., Jackson J. The low molecular weight RNAs of Rous sarcoma virus. II. The 7 S RNA. Virology. 1970;42:927–937. doi: 10.1016/0042-6822(70)90341-7. [DOI] [PubMed] [Google Scholar]
- 140.Shan S.O., Walter P. Co-translational protein targeting by the signal recognition particle. FEBS Lett. 2005;579:921–926. doi: 10.1016/j.febslet.2004.11.049. [DOI] [PubMed] [Google Scholar]
- 141.Abell B.M., Pool M.R., Schlenker O., Sinning I., High S. Signal recognition particle mediates post-translational targeting in eukaryotes. EMBO J. 2004;23:2755–2764. doi: 10.1038/sj.emboj.7600281. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 142.Peterlin B.M., Brogie J.E., Price D.H. 7SK snRNA: A noncoding RNA that plays a major role in regulating eukaryotic transcription. Wiley Interdiscip. Rev. RNA. 2012;3:92–103. doi: 10.1002/wrna.106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 143.Capece V., Garcia Vizcaino J.C., Vidal R., Rahman R.U., Pena Centeno T., Shomroni O., Suberviola I., Fischer A., Bonn S. Oasis: Online analysis of small RNA deep sequencing data. Bioinformatics. 2015;31:2205–2207. doi: 10.1093/bioinformatics/btv113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 144.Vitsios D.M., Enright A.J. Chimira: Analysis of small RNA sequencing data and microRNA modifications. Bioinformatics. 2015;31:3365–3367. doi: 10.1093/bioinformatics/btv380. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 145.Baras A.S., Mitchell C.J., Myers J.R., Gupta S., Weng L.C., Ashton J.M., Cornish T.C., Pandey A., Halushka M.K. miRge - a multiplexed method of processing small RNA-seq data to determine microRNA entropy. PLoS ONE. 2015;10 doi: 10.1371/journal.pone.0143066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 146.Guo Y., Strickland S.A., Mohan S., Li S., Bosompem A., Vickers K.C., Zhao S., Sheng Q., Kim A.S. MicroRNAs and tRNA-derived fragments predict the transformation of myelodysplastic syndromes to acute myeloid leukemia. Leuk. Lymphoma. 2017;58:1–15. doi: 10.1080/10428194.2016.1272680. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 147.Pantano L., Estivill X., Marti E. SeqBuster, a bioinformatic tool for the processing and analysis of small RNAs datasets, reveals ubiquitous miRNA modifications in human embryonic cells. Nucleic Acids Res. 2010;38 doi: 10.1093/nar/gkp1127. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 148.de Oliveira L.F., Christoff A.P., Margis R. isomiRID: A framework to identify microRNA isoforms. Bioinformatics. 2013;29:2521–2523. doi: 10.1093/bioinformatics/btt424. [DOI] [PubMed] [Google Scholar]
- 149.Zhang Y., Zang Q., Zhang H., Ban R., Yang Y., Iqbal F., Li A., Shi Q. DeAnniso: A tool for online detection and annotation of isomirs from small RNA sequencing data. Nucleic Acids Res. 2016;44 doi: 10.1093/nar/gkw427. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 150.Selitsky S.R., Sethupathy P. tDRmapper: Challenges and solutions to mapping, naming, and quantifying tRNA-derived RNAs from human small RNA-sequencing data. BMC Bioinform. 2015;16 doi: 10.1186/s12859-015-0800-0. [DOI] [PMC free article] [PubMed] [Google Scholar]