Abstract
MicroRNA (miRNA) isoforms (“isomiRs”) and tRNA-derived fragments (“tRFs”) are powerful regulatory non-coding RNAs (ncRNAs). In human tissues, both types of molecules are abundant, with expression patterns that depend on a person’s race, sex and population origin. Here, we present our analyses of the Prostate Cancer (PRAD) datasets of The Cancer Genome Atlas (TCGA) from the standpoint of isomiRs and tRFs. This study represents the first simultaneous examination of isomiRs and tRFs in a large cohort of PRAD patients. We find that isomiRs and tRFs have extensive correlations with messenger RNAs (mRNAs). These correlations are disrupted in PRAD, which suggests disruptions of the regulatory network in the disease state. Notably, we find that the profiles of isomiRs and tRFs differ in patients belonging to different races. We hope that the presented findings can lay the groundwork for future research efforts aimed at elucidating the functional roles of the numerous and distinct members of these two categories of ncRNAs that are present in PRAD.
Introduction
The rapidly decreasing cost of next generation sequencing has enabled the unbiased profiling of short and long non-coding RNAs (ncRNAs) in many settings. This has facilitated the quantification of select RNAs under a variety of conditions, the discovery of new categories of transcripts, and the identification of new members of previously established categories1,2. In this report, we focus on two specific categories of ncRNAs: the isoforms of microRNAs (miRNAs), or isomiRs, and fragments that are derived from transfer RNAs (tRNAs), or tRFs. We examine the profiles of isomiRs and tRFs in the PRAD samples that are available in the TCGA repository.
MiRNAs are ~22 nucleotide (nt) long non-coding RNAs3 that derive from hairpin-shaped precursor transcripts. They promote the degradation of sequence-matched mRNA targets or inhibit their translation4–8. The precursors of miRNAs were initially thought to produce at most a single mature product from each hairpin arm9,10. It is now known that miRNA precursor arms give rise to many distinct isoforms, termed “isomiRs”11,12. Any two isomiRs from the same miRNA arm typically differ in either their 5′ termini, 3′ termini, or both. In previous work, we showed that the most abundant isoform is frequently different from the one annotated as the “canonical” or “archetypal” sequence13 that is found in public databases like miRBase9,14.
In previous work, we analyzed hundreds of transcriptomic datasets from LCL samples and showed that isomiRs are produced constitutively and are differentially abundant among people in a manner that depends on the individuals’ sex, population origin, and race13. We also showed that similar isomiR dependencies exist in the context of breast cancer and demonstrated experimentally that isomiRs from the same hairpin arm can target largely distinct sets of mRNAs15. We also examined whether isomiR profiles differ across cancer types. We analyzed 10,271 cancer samples from TCGA that represent 32 cancer types and found that by simply examining whether an isomiR is present or absent in a sample, we could build highly sensitive and specific biomarkers that distinguish among the 32 cancer types16. In the case of PRAD, we showed that isomiRs can correctly classify TCGA datasets (obtained by the Illumina platform) as well as datasets obtained from two other platforms (Affymetrix, and ABI SOLiD)16.
tRFs represent a new class of short regulatory non-coding RNAs that were recently discovered through analysis of short RNA sequencing data17–19. tRFs are relatively short RNA oligonucleotides, with lengths ranging from 18 to ~34 nt. In this presentation, we study only tRFs that overlap with mature tRNAs. These can be grouped into five classes17,18,20–22: 5′-tRFs, i-tRFs, 3′-tRFs, 5′-tRNA halves (5′-tRHs), and 3′-tRNA halves (3′-tRHs).
Several proteins have been implicated in the production of tRFs. Chronologically first, Angiogenin was shown to be a producer of tRNA halves under stress conditions17,21–23. RNAse Z produces tRFs derived from premature tRNA transcripts24. Finally, the miRNA-producing enzyme Dicer has also been implicated in the production of tRFs in several species24–27, but many tRFs are also produced independently of Dicer27. In complete analogy to isomiRs, we showed that tRFs are produced constitutively and are differentially abundant among people in a manner that depends on the individuals’ sex, population origin, and race as well as on tissue, and disease subtype20. Functionally, tRFs have also been reported to increase in abundance after hepatitis B and C infection28, to act as miRNAs26, to act as molecular decoys of an RNA binding protein29, to mediate neuroprotective responses30, to inhibit angiogenesis after ischemic injury31, and to be involved in sex-hormone pro-proliferative signaling23. Collectively, these results suggest important functional roles for these molecules.
Recent work by the TCGA PRAD Consortium has improved our understanding of the biochemistry of prostate cancer subtypes32, but many questions regarding the biochemical progression of the disease remain unanswered33–37. Several of the currently open questions include the identification of new and potentially actionable biochemical targets for treatment, the relative merit of treatment vs. active surveillance38, the relevance of race in establishing risk, and the impact of race in disease prognosis33,39,40. To date, and to the best of our knowledge, the significance of isomiRs and tRFs has not been studied systematically in PRAD.
In this study, we characterize the expression of isomiRs and tRFs in the context of PRAD. Specifically, we investigate the possibility of associations between clinical categories of PRAD and the expression of isomiRs and tRFs, and report on whether these ncRNAs are associated with disease progression.
Results
Patients, samples, and sequencing limitations
526 samples were obtained from 472 patient donors. The patients span a range of ages and several races (Table 1). These samples form evenly sized subgroups from the standpoint of Gleason score at initial diagnosis (Table 1). It is important to note here that short RNA-seq for the TCGA project was capped at 30 cycles of sequencing. This choice does not pose a problem for miRNAs and their isoforms, because their typical length is ~22 nt. However, in the case of tRFs, the 30-cycle limit prevents us from identifying and quantitating longer molecules, such as the 5′-tRHs and 3′-tRHs41.
Table 1.
Sample Characteristics (n = 526) | ||||
---|---|---|---|---|
Race | White | 431 | Normal Tissue < 6 | 50 |
Black_or_African_American | 63 | Tumor Tissue | 476 | |
Asian | 12 | Gleason Score | ||
American_Indian_or_Alaska_Native | 1 | 3 + 3, 2 + 4 | 48, 1 | |
[Not_Available] | 19 | 3 + 4 | 143 | |
Age at diagnosis | 40–49 | 27 | 4 + 3 | 93 |
50–59 | 166 | 4 + 4, 5 + 3, 3 + 5 | 54, 6, 8 | |
60–69 | 230 | 4 + 5, 5 + 4 | 67, 45 | |
70+ | 49 | 5+5 | 11 |
Prostate isomiRs - summary statistics
For each sample in turn, we used Threshold-seq42 (see Methods) to automatically determine a dataset-dependent, adaptive threshold. Using these thresholds across the 526 datasets, we identified 3,178 isomiRs that were used in the subsequent analyses. Each of these 3,178 isomiRs exceeded Threshold-seq’s threshold in at least one of the analyzed datasets. 3,104 of these isomiRs are unambiguously annotated in our analyses, i.e. they map to a single mature miRNA or miRNA precursor, whereas the remaining 74 map to a total of 28 miRNA loci with known paralogues in the human genome. The 3,104 isomiRs arise from 600 distinct miRNA precursor arms (Fig. 1A) that correspond to 391 unique miRNA loci.
The isomiRs in TCGA PRAD have lengths that are typical of mature miRNAs. The most frequently encountered isomiR length is 22 nt, followed closely by 21 nt (Fig. 1B). Moreover, the endpoints of isomiRs (Fig. 1C) follow essentially the same distributions that we reported in our prior work15. We note that nearly one half (43.0%) of all identified isomiRs have 5′ termini that differ from the 5′ termini of the miRNA sequences found in miRBase, in agreement with what we found previously in a different tissue13. IsomiRs with different 5′ termini have different seed sequences compared to the archetype in miRBase, and, thus, different mRNA targets than the archetype15,43–45. Consequently, the identification of so many isomiRs with 5′ termini is of high potential importance, as it suggests the existence of a large repertoire of genes targeted by miRNAs that have not been explored to date. Moreover, 73.3% of the isomiRs have 3′ termini that differ from the 3′ termini of the corresponding archetypes found in miRBase.
Across all miRNA precursors, a few arms appear to be the greatest producers of distinct isomiRs in the PRAD context: 42 isomiRs are produced from miR-143-3p, 41 from miR-182-5p, 40 from miR-21-5p, 36 from miR-183–5p, 36 from miR-10a-5p, and 33 from let-7a-5p. (Fig. 1D). Supp. Table 1 lists all of the mature miRNAs that produce more than one isomiR above threshold in at least one of the analyzed datasets.
Prostate tRFs – summary statistics
We mined the sequenced reads using our MINTmap tRF mining method41, then used the Threshold-seq method42 to determine which tRFs to retain for each dataset. Across the 526 TCGA PRAD datasets, we identified 6,296 distinct tRFs that pass threshold in at least one dataset. Of these, 3,903 (62.0%) arise from nuclearly-encoded tRNAs, whereas the remaining 2,393 (38.0%) arise from mitochondrially-encoded tRNAs. In other words, the 22 mitochondrial (MT) tRNAs contribute almost half as many unique tRFs as do the 610 nuclear tRNAs, a rather unexpected finding. 140 tRFs show non-zero expression across all 526 (100%) datasets, which suggests possible relevance of tRFs for PRAD biology. Moreover, 83 tRFs show a normalized mean abundance at least as high as 10 reads per million (RPM), in either normal or prostate cancer samples.
Prostate tRFs – lengths, structural categories, source isoacceptors
The majority (66%) of the 6,296 tRFs are i-tRFs20, i.e. they overlap the interior of the mature tRNA sequence. Interestingly, nearly one half (42%) of all i-tRFs are produced by MT tRNAs, a disproportionately high contribution considering that there are only 22 MT tRNA sequences. The remainder of the tRFs are almost evenly split among nuclear 5′-tRFs, nuclear 3′-tRFs, and mitochondrial 3′-tRFs (Fig. 2A). Interestingly, five isoacceptors contribute 44% all of the tRFs we identified in PRAD. They are: Leucine (Leu), Glycine (Gly), Histidine (His), Glutamine (Gln), and Glutamic acid (Glu) (Fig. 2B). The lengths of nuclearly-encoded tRFs differ characteristically from those of MT-encoded tRFs (Fig. 2C). This is similar to what we observed previously in other human tissues20. Nuclearly-encoded tRFs appear to favor shorter lengths (16–20 nt), whereas mitochondrially-encoded tRFs tend to favor specific and generally-longer lengths (17, 21, 26, and 29 nt) (Fig. 2C).
Prostate isomiRs, prostate tRFs, and comparisons to clinical characteristics
Tissue state
When we compared tumor and normal samples, we observed marked differences in the lengths of tRFs from the structural categories described above. We computed density functions for the probability of encountering a tRF of length X in each structural category, and included genome of origin (nuclear or mitochondrial) as an additional variable. For several lengths, we found statistically-significant (Mann-Whitney U-test) differences between normal and tumor. Figure 3 shows these findings. In it, a few representative cases are highlighted in the plots with asterisks. Individual differences between structural categories appear to be specific to the tissue and tissue-state combination. For example, in TCGA’s breast cancer (BRCA) samples, we found short (18–20 nt) 5′-tRFs to be more prevalent in the tumor samples20. Unlike BRCA, in TCGA PRAD, 5′-tRFs with these lengths are more prevalent in the normal samples. Additionally, 3′-tRFs also tend to be short (18–20 nt) in normal prostate samples, which is again unlike what we found in normal breast. Finally, we note that 3′-tRFs from mitochondrially-derived tRNAs differ between normal prostate and tumor by length: short (19 nt) tRFs are more frequent in normal samples, whereas longer (23 nt) tRFs are more frequent in prostate tumor samples.
Tissue state and patient race
We next used Significance Analysis of Microarrays (SAM)46 to compare the abundance of isomiRs and tRFs, respectively, across tissue states (both races) and across tissue states within each race group in turn. We retained only isomiRs or tRFs that were differentially abundant and had an associated False Discovery Rate (FDR) of 5% (Supp. Table 2). Of 3,178 isomiRs, 524 are differentially abundant between PRAD and normal prostate in both Black or African American patients (B/Aa) and White (Wh) patients (Fig. 4A). A larger number of isomiRs (1,519) are differentially abundant in PRAD vs. normal comparisons, exclusively in Wh patients. We also analyzed the 6,296 tRFs and found 35 tRFs to be differentially abundant between PRAD and normal in both Wh and B/Aa patients (Fig. 4B). A much larger portion comprising 3,307 tRFs is differentially abundant exclusively in Wh patients. Interestingly, 119 tRFs are exclusively differentially abundant between PRAD Wh and PRAD B/Aa.
Gleason score and prognostic group
In order to determine whether the profiles of tRFs or isomiRs are associated with progression of disease, we performed multi-class SAM analysis with the Gleason score and the newly-proposed five-grade prognostic system (Supp. Table 2)47. We used only samples for which Gleason score agreed in both clinical and pathologic grading (n = 518 samples), and also kept normal samples (with Gleason score lower than 6) in the comparisons. In these analyses, we maintained the FDR threshold at 5%. We found that n = 2,441 isomiRs and n = 4,331 tRFs are differentially abundant between Gleason-score subgroups. We find that n = 2,493 and 4,652 isomiRs and tRFs are differentially abundant when we instead reclassify tumors using the alternative five-grade system. Importantly, the identified groups of isomiRs and tRFs are highly congruent: the same n = 2,401 isomiRs and n = 4,187 tRFs arise from both of these two independent analyses. Though prognostic groups are based on Gleason score, the subgroups do differ in that the samples of Gleason score 7 are split into two categories, each of which is characterized by a different prognosis. The respective isomiR and tRF profiles do differ between these two groups. All of these isomiRs and tRFs are shown in Supp. Table 2.
Using the abundance of all isomiRs from a miRNA locus as a proxy for isomiR profiles
Quantitative RT-PCR (qRT-PCR) methods such as Taqman miRNA qPCR and Exiqon LNA miRCuRY are designed with an eye towards distinguishing among miRNAs that have base differences that are internal to the target sequence. The ability of these methods to distinguish isoforms that differ at their endpoints has been a topic of scientific debate. In recent work, we examined both Taqman miRNA qPCR and Exiqon LNA miRCuRY and found that these methods cannot distinguish end-point isoforms, in either synthetic or true cellular contexts48. While we experimented with isomiRs only, we expect that these findings extend to tRFs as well. Our findings suggest that SYBR Green-based methods are also affected. This includes variants of the SYBR Green method such as the one used in a previous B-cell lymphoma study26, which is similar in spirit to the LNA miRCuRY method, but without the latter’s additional constraints. Importantly, neither the SYBR Green variant nor the LNA miRCuRY can guarantee the specific 5′ and 3′ endpoints of the target.
In light of this limitation, we carried out a second analysis of the deep sequencing data where we aggregated the abundance of all isoforms from a given miRNA arm into a single measure. Starting with the pool of isomiRs, and for each sample in turn, we summed the RPM values of all isomiRs from each annotated miRNA arm that was transcribed. This resulted in 628 observations per sample. Figure 5A–D show boxplots for several miRNA loci that showed significant differences between clinical groups (Welch’s independent t-test, p-value ≤ 0.05). In each plot, the statistical significance of the shown miRNA arm decreases from left to right. Not more than the top 20 most significant comparisons are shown in each case.
Using the abundance of all tRFs from a tRNA isoacceptor as a proxy for tRF profiles
As we mentioned in the previous section, the current commercially available quantitative methods that target specific molecules cannot accurately determine the abundance of isomiRs or tRFs. Thus, we repeated the above re-analysis, this time for tRFs, wherein we summed up the RPM values of tRFs from the same isoacceptor. This resulted in 47 observations per sample. Figure 5E–H show the resulting boxplots (Welch’s independent t-test, p-value ≤ 0.05). Again, the statistical significance of the shown isoacceptors decreases from left to right. Not more than the top 20 most significant comparisons are shown in each case.
Correlation and anti-correlation of the abundance of prostate isomiRs
We also determined how well the levels of individual miRNAs correlate with each other in the context of PRAD tumors. First, we examined correlations at the level of the mature miRNA locus. In this case, the abundances of isomiRs from a miRNA locus were summed into a single number (see above). We computed the Spearman correlation for all 628 miRNA arms that produced one or more isomiRs across the 526 PRAD datasets. We only retained correlations if their absolute value was ≥0.75 and FDR ≤ 5% (Supp. Table 3). Figure 6A shows the correlations/anti-correlations at the level of the miRNA locus, which survive this filtering. Each node in the network represents an individual miRNA arm that produces at least one isomiR. Green edges indicate positive correlations, whereas red edges indicate negative correlations. Supp. Table 3 also lists all isomiR-isomiR correlations whose absolute value was ≥0.75 and FDR ≤ 5%.
Correlation and anti-correlation of the abundance of prostate tRFs
We repeated the correlation analysis using tRFs this time. As with the miRNAs/isomiRs, we carried out the computations in two ways. First, we computed correlations using the summed abundances for tRFs from the various tRNA isoacceptors that are active in PRAD (see above). Second, we computed correlations using the individual tRFs. As before, we retained only Spearman correlations with absolute value ≥ 0.75 and FDR ≤ 5%. Figure 6B shows the correlations/anti-correlations at the level of the tRNA isoacceptor, which survive this filtering (see Supp. Table 3 for the complete list). Each node represents an isoacceptor that produces at least one tRF. Green edges indicate positive correlations. Supp. Table 3 also lists all tRF-tRF correlations that survive the correlation and FDR filtering.
Correlation between isomiRs and mRNAs
To investigate the possibility of regulatory networks involving isomiRs and mRNAs, we selected mRNAs with a mean value of reads per thousand mapped reads (RPKM) value that was at most 10 log2 units from the most abundant observed value for ACTB across the 526 samples. By design, these mRNAs have average expression that is ≥1/1,024-th of ACTB (210 = 1,024). We retained 9,871 transcribed mRNAs. Importantly, we used the mRNA RPKM values that are reported in the RSEM normalized output that is part of the UNC mRNA pipeline of the TCGA data. Then, we examined the correlation between the abundance of these mRNAs and the abundance of isomiRs across all samples. To elucidate potentially meaningful relationships, we limited our focus on the 932 isomiRs that are differentially abundant (according to SAM) between normal and tumor samples at FDR = 0% and have median abundance ≥1 RPM. A first set of 210,682 isomiR-mRNA correlations emerged (Supp. Table 4).
IsomiRs (i.e. miRNAs) interact directly with mRNA transcripts degrading them and leading to a concomitant decrease of protein abundance. Such direct interactions are expected to be captured by negatively-correlated isomiR-mRNA pairs. On the other hand, positive correlations between miRNAs and mRNAs may reflect indirect events downstream of direct targets. In previous work, we showed experimentally that two isomiRs x and y from the same miRNA arm can differentially impact the abundance of an mRNA m: transfection of x could lead to a decrease in the abundance of m, whereas transfection of y could lead to an increase in the abundance of m, and vice versa49. Thus, we computed both positive and negative correlations and list them in Supp. Table 4. There are 19,812 pairs with a correlation with absolute value ≥ 0.33, including 412 isomiRs and 1,683 mRNAs.
From these correlations with absolute value ≥ 0.33, we sub-selected those from 60 isomiRs from the top five isomiR-producing miRNAs shown in Fig. 1D. We combined these with the 12 isomiRs from the 5 miRNA loci that participate in the denser networks of the miRNA-miRNA interactions shown in Fig. 6A. We then visualize only mRNAs that participate in isomiR-mRNA pairs with ≥15 distinct isomiRs originating from at least two different miRNA loci. Also, we used a single node labeled with the name of the isomiRs’ parental miRNA precursor to capture multiple isomiRs. Figure 7A shows the resulting network, which involves 57 isomiRs from 7 miRNA loci and 27 mRNAs.
For functional enrichment analysis, we concentrated only on negatively-correlated isomiR-mRNA pairs. These pairs were formed by 775 isomiRs and 2,912 mRNAs (Supp. Table 4). Using four different tools (TargetScan50, rna2251, miRanda50, RNAhybrid52), we determined which mRNAs were also predicted targets of the isomiRs with which they were anti-correlated. For rna22, miRanda, and RNAhybrid, we generated predictions using the full-length of the isomiRs of interest and the 3′UTRs of the corresponding anti-correlated mRNA. For Targetscan, we used the seed (positions 2–8 of the miRNA) of the corresponding isomiR of interest with the full length of the corresponding anti-correlated mRNA. See Supp. Table 4.
We used the target predictions to sub-select the negatively-correlated isomiR-mRNA pairs. Specifically, we kept only those mRNAs that were also supported by the 4 target-prediction tools. 310 mRNAs survived this filtering (Supp. Table 4). We analyzed them using DAVID and found enrichment for 25 GO terms. The top 5 GO terms include: regulation of nucleobase-containing compound metabolism, DNA binding, regulation of nitrogen compound metabolic process, regulation of macromolecule biosynthetic process, and regulation of cellular macromolecule biosynthetic process.
For comparison, we also investigated how functional enrichments might change if we used the strength of correlation as a filter, instead of target predictions. Doing so retains 754 mRNAs that are negatively correlated with isomiRs at r ≤ −0.33. DAVID analysis revealed 57 GO terms enriched in this group of mRNAs. The corresponding GO terms are listed in Supp. Table 4. The top 5 GO terms include: RNA splicing; mRNA processing; mRNA splicing, via spliceosome; RNA splicing, via transesterification reactions with bulged adenosine as nucleophile; and RNA splicing, via transesterification reactions.
To assess whether race differences could play a part in these dynamics, we focused on the 196 isomiRs that we observed to be differentially abundant between Wh PRAD and B/Aa PRAD (Fig. 4A). We sub-selected among all isomiR-mRNA correlations only those associated with these race-dependent isomiRs at r ≤ −0.33, obtaining 13,230 negatively-correlated isomiR-mRNA pairs. We analyzed the corresponding 93 mRNAs with DAVID and found 42 enriched GO terms (Supp. Table 4). Among these terms, we find enrichment for terms such as: cellular response to hormone stimulus, reproductive process, reproduction, and multicellular organism reproduction.
Correlation between tRFs and mRNAs
Next, we extended our analysis to correlations involving tRFs and mRNAs. We again correlated the expression of the mRNAs identified above with the expression of tRFs across all samples, using Spearman correlation. We limited our focus to the 192 tRFs that are differentially abundant between normal and tumor samples and have median abundance ≥ 1RPM (Supp. Table 3) and filtered them further retaining only those with absolute value ≥ 0.33 and FDR ≤ 5%. We kept both negative and positive correlations, because the mode in which tRFs function has yet to be understood well. As mentioned in the Introduction, these ncRNAs have been demonstrated to function as miRNAs – suggesting negative correlations would represent functional relationships – but have also been shown to decoy RNA binding proteins – suggesting positive correlations would show functional relationships as well. For presentation purposes, we then collapsed the correlation relationships by replacing individual tRF sequences with the name of the tRNA isoacceptor from which they arise. This left us with 423 correlation pairs (23 isoacceptors and 223 mRNAs) that are listed in Supp. Table 4. In Fig. 7B we depict these pairs. For clarity, we limited our graph and show only those mRNAs that participate in ≥4 correlations. DAVID analysis of the 223 mRNAs that survived the filtering showed 10 enriched GO terms: cellular response to endogenous stimulus, response to endogenous stimulus, response to organic substance, cellular response to hormone stimulus, response to hormone, response to oxygen-containing compound, collagen fibril organization, response to organic cyclic compound, response to organonitrogen compound, and response to nitrogen compound. Thirteen mRNAs were negatively correlated with tRFs (Supp. Table 4). Removing them from the list of 223 mRNAs and repeating the DAVID analysis did not change the list of enriched GO terms. Lastly, we examined the mRNAs that were correlated with the 200 tRFs that are differentially abundant between the Wh PRAD samples and the B/Aa PRAD samples. We did not find any enriched GO terms in this group.
TCGA PRAD tRFs vs. tRFs obtained from other prostate cancer samples
In previous work20, we demonstrated consistent differences in the tRF profiles of individuals that differ by gender, population origin or race. This observation thus becomes an important consideration when comparing tRF profiles from samples of different origin. With that in mind, we sought to compare our TCGA PRAD tRF profiles with those from two different collections of prostate samples for which deep sequencing data is available, but for which neither the population origin nor the race of the patient donors is reported. We note here that, by design, the TCGA repository comprises samples from individuals whose race is self-reported.
The first non-TCGA collection comprises the 11 samples described in earlier work41, a study that sought to establish a prostate-cancer-specific tRF signature. An additional characteristic of this collection is that each of its 11 samples was created by pooling together samples from multiple patients. The second non-TCGA collection comprises five normal prostate and five PRAD samples that we generated for a different project16 (available at http://trace.ddbj.nig.ac.jp/DRASearch/study?acc = SRP034550). To ensure consistency in the mining steps, we analyzed the 21 samples in these two collections using MINTmap and Threshold-seq (see Materials and Methods), just as we did for the TCGA PRAD datasets.
Our mining of the 11 datasets of the first collection41 uncovered 3,341 tRFs above Threshold-seq threshold in at least one of the samples. Of the 6,296 tRFs that we find in the TCGA PRAD collection, 1,856 (29.48%) are also present among the tRFs that MINTmap finds in these 11 samples, and we list them in Supp. Table 5 together with their respective universal identifiers, i.e. their license plates, which we described elsewhere53. The Table also indicates which of the tRFs are exclusive to tRNA space or can be found in locations of the genome that do not harbor tRNAs.
Our mining of the 10 datasets from our parallel project16 uncovered 10,505 tRFs above Threshold-seq’s threshold in at least one of the samples. Of the 6,296 tRFs that we find in the TCGA PRAD collection, 2,090 (33.20%) are also present among the tRFs found in this collection and are listed in Supp. Table 5.
Finally, we compared all three of these collections and found 923 tRFs that are present consistently in all of them. Considering the potential diversity of the patients who contributed all these samples, it is encouraging to see such a high number of tRFs being consistently present across collections. This observation coupled to the demonstrated tissue specificity of tRFs suggests that devising tRF-based biomarkers for this cancer or for other types of cancer is a real possibility.
Discussion
In recent work, we identified potential roles for miRNAs/isomiRs and tRFs as tissue- and patient-specific post-transcriptional regulators. Here, we characterized the tRF and isomiR profiles of 526 human prostate samples from TCGA and examined whether these molecules exhibit differential abundance in disease compared to normal and whether there is a race-specific component to the abundance of these molecules. Our analyses also investigated the correlations between isomiRs and mRNAs, and between tRFs and mRNAs, to elucidate putative regulatory connections between these short non-coding RNAs and the post-transcriptional regulation layer that is at work in prostate.
Our analyses showed that the TCGA PRAD datasets are characterized by very rich tRF profiles. Interestingly, these profiles are largely similar to tRF profiles that we generated by analyzing two different cohorts1,41, suggesting a consistency of tRF profiles in prostate tumors across likely diverse populations. Arguably, the 923 tRFs that we identified as being present in all three cohorts represent a good starting point for researchers interested in studying the roles of these molecules in the prostate context.
DAVID analyses of the mRNAs that are correlated with isomiRs and tRFs reveal interesting potential roles for each of these two classes of regulators in disease. When we filtered our correlations by retaining only those with predicted miRNA target interactions from all 4 different algorithms, our mRNA input list led to an enrichment for GO terms relating to metabolic and biosynthetic processes. When we filtered isomiRs based on the strength of anticorrelation with mRNAs (r ≤ −0.33), pathways that are critical for the production of mature mRNA products were enriched instead, including: mRNA processing, RNA splicing, mRNA splicing via spliceosome, RNA metabolic process, and mRNA metabolic process.
We also find specific GO enrichments for isomiR-mRNA correlations with isomiRs that were differentially expressed between PRAD Wh and PRAD B/Aa samples. Specifically, GO enrichments from these 93 mRNAs pertain to reproductive pathways and are characterized by an overabundance of androgen receptor signaling machinery genes. Future work should look to determine whether isomiRs could play a race-independent role in PRAD biology, as well as whether they could influence the variability of androgen receptor activity seen in different race groups.
For tRFs that are differentially abundant between normal prostate and PRAD (both races), we found that correlated mRNAs enriched in GO terms pertaining to cellular responses to stimuli. These terms include: GO:0071495-cellular response to endogenous stimulus, GO:0009719-response to endogenous stimulus, GO:0010033-response to organic substance, GO:0032870-cellular response to hormone stimulus, and GO:0009725-response to hormone. We note that the large majority of tRFs are positively correlated with mRNAs and that negative correlations between tRFs and mRNAs do not significantly contribute to GO enrichments.
Collectively, these findings raise the possibility of a disease biology in which dysregulation of miRNAs/isomiRs might give rise to accumulating oncogenic transcripts, whereas dysregulation of tRFs might render cells insensitive to control signals in the microenvironment. Specifically, isomiRs may be involved in regulating nucleic acid and protein metabolism or may be involved in regulating splicing machinery. Losing these isomiRs in prostate cancer contexts may contribute to the accumulation of alternatively spliced oncogenic transcripts that both contribute to disease progression and account for some differential characteristics of disease54,55. Alternatively, isomiRs may accumulate or be co-regulated with mRNAs involved in upregulation of the proliferative pathways that contribute to tumor biology – i.e., those involved in nuclear division, cell cycle processes, or metabolism (see above). tRFs might be produced in cells actively responding to metabolic changes in the microenvironment or to paracrine/autocrine signaling, whereas isomiRs might become dysregulated in cells experiencing pro-proliferative or cancerous transformations driven by other mechanisms. We note here that tRNA halves have been reported to be produced in response to hormone signaling23. Follow-up studies will need to establish causal relationships from these correlations. The investigation of a possible regulatory relationship between these two groups of ncRNAs and prostate cancer specific gene pathways is warranted.
Our findings suggest the possibility of developing a deep-sequencing-based approach that examines the abundance of a specific subset of short ncRNAs and allows the automated detection of prostate cancer. This would enable diagnosis in a manner that is minimally dependent on human observation. It is important to note here that our earlier findings indicate that it may not be feasible to base such a diagnosis on qRT-PCR techniques aimed at estimating the abundance of specific isomiRs or tRFs. Indeed, as we showed, the simultaneous presence of near-identical variants of the targeted isomiR or tRF introduces cross-talk that skews these measurements48.
In terms of prioritizing future work, we believe that initial efforts should be directed at elucidating the biogenesis and functional roles of these molecules. Doing so will have a direct beneficial impact when designing experiments to study these molecules. In the absence of this knowledge, it is difficult to know the extent to which attempts to over-express or silence a tRF disrupt other cellular processes leading to unintended consequences.
Materials and Methods
Sample Collection and Patient Enrollment
The TCGA initiative enrolled all of the PRAD patients whose samples were analyzed in this presentation. We focused on and analyzed the short RNA profiles from 526 prostate cancer datasets from 472 patients that were part of TCGA PRAD as of October 2015. Only samples that were ‘white-listed’ by the PRAD consortium were analyzed, meaning that these samples did not bear an annotation calling for their exclusion from the TCGA PRAD project. We also included 10 samples from an independent dataset first discussed in1, which includes deep sequencing from 5 prostate cancer tumor tissue samples, as well as 5 normal prostate tissue controls. We further included 11 datasets that represent pooled prostate tissue RNA samples from 11 patient groups, including one FFPE tumor preserved sample set and one normal healthy prostate tissue sample set.
Mapping
For the isomiR analyses, we mapped the reads on the full genome, as we previously described13,15,20. For the tRF analyses, we used the MINTmap tool we reported recently56 to mine for and quantify tRFs. The mRNA data were generated by the TCGA initiative and downloaded from the rsem.genes.normalized_expression files generated as output of the UNC mRNA analysis pipeline. All mRNA values represent RPKM values as computed by the RSEM RNA-seq analysis platform.
Data Thresholds and Normalization
After mapping, and for each of the 526 TCGA samples in turn, we thresholded isomiRs and tRFs using the Threshold-seq method42. Read counts were normalized to Reads Per Million (RPM) by referring to the number of reads that survive quality trimming and adapter removal.
Statistical Analyses
We used the R software package for statistical analyses. We used the prcomp package for PCA analysis57 and the samr package for the Significance Analysis of Microarrays46. Plots were generated using the gplots, matplotlib and igraph packages.
miRNA Target Prediction
We used TargetScan 7.150, miRanda50, RNAhybrid52, and rna2258 (http://cm.jefferson.edu/rna22/) to generate miRNA target predictions. We sought targets in the 3′UTRs of mRNAs in all four tools. We used mRNAs and isomiRs that were negatively correlated in our data for predictions in all tools.
DAVID Functional Analysis
GO term enrichments were computed using DAVID59, with an FDR threshold of less than or equal to 1%59 (Release 6.8 of October 2016). We performed functional analysis first using mRNAs that are predicted as targets of isomiRs with which they are negatively correlated. We considered only targets predicted by all four prediction tools. For comparison purposes, we also performed functional analysis using those mRNAs that were negatively correlated with isomiRs with correlation value less than or equal to −0.33.
Data Availability
The TCGA datasets that are described in this study are available from The Cancer Genome Atlas data portal at: https://portal.gdc.cancer.gov/. The deep sequencing datasets from normal prostate and PRAD originally discussed in our previous work1 are available at http://trace.ddbj.nig.ac.jp/DRASearch/study?acc = SRP034550. The data of the earlier prostate cancer study41 are available through GEO accession GSE80400. All of the data that we generated in our analyses are included in the Supplemental material accompanying this manuscript.
Declarations
Ethics approval and consent to participate
Not applicable for the TCGA datasets. For the other two datasets, see previous publications1,41.
Availability of data and materials
The original datasets that we used and/or analyzed during the current study are available from The Cancer Genome Atlas data portal at: https://portal.gdc.cancer.gov/. The alternative datasets are available through http://trace.ddbj.nig.ac.jp/DRASearch/study?acc = SRP034550 and through GEO accession GSE80400.
Electronic supplementary material
Acknowledgements
This work was supported by NIH/NCI grant R21-CA195204 (IR), a W. M. Keck Foundation grant (IR), and by Institutional funds.
Author Contributions
I.R. designed and supervised the study. R.M., A.G.T. and P.L. downloaded and collected the data. P.L. mapped the data. R.M., A.G.T. and I.R. analyzed the data. R.M., I.R. and E.L. wrote the manuscript. All authors read and approved the final manuscript.
Competing Interests
The authors declare no competing interests.
Footnotes
Electronic supplementary material
Supplementary information accompanies this paper at 10.1038/s41598-018-22488-2.
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Londin E, et al. Analysis of 13 Cell Types Reveals Evidence for the Expression of Numerous Novel Primate- and Tissue-Specific microRNAs. Proceedings of the National Academy of the Sciences. 2014;112:E1106–E1115. doi: 10.1073/pnas.1420955112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Friedlander MR, et al. Discovering microRNAs from Deep Sequencing Data Using miRDeep. Nature Biotechnology. 2008;26:407–415. doi: 10.1038/nbt1394. [DOI] [PubMed] [Google Scholar]
- 3.Bartel DP. MicroRNAs: Genomics, Biogenesis, Mechanism, and Function. Cell. 2004;116:281–297. doi: 10.1016/S0092-8674(04)00045-5. [DOI] [PubMed] [Google Scholar]
- 4.Iorio, M. V. & Croce, C. M. MicroRNA dysregulation in cancer: diagnostics, monitoring and therapeutics. A comprehensive review. EMBO Mol Med4, 10.1002/emmm.201100209 (2012). [DOI] [PMC free article] [PubMed]
- 5.Kayani, M., Kayani, M. A., Malik, F. A. & Faryal, R. Role of miRNAs in breast cancer. Asian Pac J Cancer Prev12 (2011). [PubMed]
- 6.Lee, Y. S. & Dutta, A. MicroRNAs in cancer. Annu Rev Pathol4, 10.1146/annurev.pathol.4.110807.092222 (2009). [DOI] [PMC free article] [PubMed]
- 7.Leva, G. & Croce, C. M. miRNA profiling of cancer. Curr Opin Genet Dev23, 10.1016/j.gde.2013.01.004 (2013). [DOI] [PMC free article] [PubMed]
- 8.Lu, J. et al. MicroRNA expression profiles classify human cancers. Nature435, 10.1038/nature03702 (2005). [DOI] [PubMed]
- 9.Griffiths-Jones S. The microRNA Registry. Nucleic Acids Research. 2004;32:D109–D111. doi: 10.1093/nar/gkh023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Ambros V, et al. A Uniform System for microRNA Annotation. RNA. 2003;9:277–279. doi: 10.1261/rna.2183803. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Morin RD, et al. Application of Massively Parallel Sequencing to microRNA Profiling and Discovery in Human Embryonic Stem Cells. Genome Research. 2008;18:610–621. doi: 10.1101/gr.7179508. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Koppers-Lalic D, et al. Non-invasive Prostate Cancer Detection by Measuring miRNA Variants (isomiRs) in Urine Extracellular Vesicles. Oncotarget. 2016;7:22566–22578. doi: 10.18632/oncotarget.8124. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Loher P, Londin E, Rigoutsos I. IsomiR Expression Profiles in Human Lymphoblastoid Cell Lines Exhibit Population and Gender Dependencies. Oncotarget. 2014;5:8790–8802. doi: 10.18632/oncotarget.2405. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Griffiths-Jones S, Grocock RJ, van Dongen S, Bateman A, Enright A. J. miRBase: microRNA Sequences, Targets and Gene Nomenclature. Nucleic Acids Research. 2006;34:D140–D144. doi: 10.1093/nar/gkj112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Telonis AG, Loher P, Jing Y, Londin E, Rigoutsos I. Beyond the One-Locus-One-miRNA Paradigm: microRNA Isoforms Enable Deeper Inisights into Breast Cancer Heterogeneity. Nucleic Acids Research. 2015;43:9158–9175. doi: 10.1093/nar/gkv922. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Telonis AG, et al. Knowledge about the presence or absence of miRNA isoforms (isomiRs) can successfully discriminate amongst 32 TCGA cancer types. Nucleic Acids Res. 2017;45:2973–2985. doi: 10.1093/nar/gkx082. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Sobala A, Hutvagner G. Transfer RNA-Derived Fragments: Origins, Processing, and Functions. Wiley Interdisciplinary Reviews RNA. 2011;2:853–862. doi: 10.1002/wrna.96. [DOI] [PubMed] [Google Scholar]
- 18.Anderson P, Ivanov P. tRNA Fragments in Human Health and Disease. FEBS Letters. 2014;588:4297–4304. doi: 10.1016/j.febslet.2014.09.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Shigematsu M, Honda S, Kirino Y. Transfer RNA as a Source of Small Functional RNA. Journal of Molecular Biological Molecular Imaging. 2014;1:8. [PMC free article] [PubMed] [Google Scholar]
- 20.Telonis AG, et al. Dissecting tRNA-Derived Fragment Complexities Using Personalized Transcriptomes Reveals Novel Fragment Classes and Unexpected Dependencies. Oncotarget. 2015;6:24797–24822. doi: 10.18632/oncotarget.4695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Fu H, et al. Stress Induces tRNA Cleavage by Angiogenin in Mammalian Cells. FEBS Letters. 2009;583:437–442. doi: 10.1016/j.febslet.2008.12.043. [DOI] [PubMed] [Google Scholar]
- 22.Yamasaki S, Ivanov P, Hu G-F, Anderson P. Angiogenin Cleaves tRNA and Promotes Stress-Induced Translation Repression. Journal of Cell Biology. 2009;185:35–42. doi: 10.1083/jcb.200811106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Honda S, et al. Sex Hormone-Dependent tRNA Halves Enhance Cell Proliferation in Breast and Prostate Cancers. Proceedings of the National Academy of the Sciences. 2015;112:E3816–3825. doi: 10.1073/pnas.1510077112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Li Z, et al. Extensive Terminal and Asymmetric Processing of Small RNAs from rRNAs, snoRNAs, snRNAs, and tRNAs. Nucleic Acids Research. 2012;40:6787–6799. doi: 10.1093/nar/gks307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Cole C, et al. Filtering of Deep Sequencing Data Reveals the Existence of Abundant Dicer-Dependent Small RNAs Derived from tRNAs. RNA. 2009;15:2147–2160. doi: 10.1261/rna.1738409. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Maute RL, et al. tRNA-Derived microRNA Modulates Proliferation and the DNA Damage Response and is Down-Regulated in B Cell Lymphoma. Proceedings of the National Academy of the Sciences. 2013;110:1404–1409. doi: 10.1073/pnas.1206761110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Kumar, P., Ananya, J., Mudunuri, S. B. & Dutta, A. Meta-analysis of tRNA Derived RNA Fragments Reveals That They Are Evolutionarily Conserved and Associate with AGO Proteins to Recognize Specific RNA Targets. BMC Biology12, online (2014). [DOI] [PMC free article] [PubMed]
- 28.Selitsky SR, et al. Small tRNA-Derived RNAs Are Increased and More Abundant Than MicroRNAs in Chronic Hepatitis B and C. Scientific Reports. 2015;5:7675. doi: 10.1038/srep07675. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Goodarzi H, et al. Endogenous tRNA-Derived Fragments Suppress Breast Cancer Progression Via YBX1 Displacement. Cell. 2015;161:709–802. doi: 10.1016/j.cell.2015.02.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Ivanov, P., et al. G-Quadruplex Structures Contribute to the Neuroprotective Effects of Angiogenin-Induced tRNA Fragments. Proceedings of the National Academy of the Sciences111, 18201-18206 (2014). [DOI] [PMC free article] [PubMed]
- 31.Li, Q., et al. tRNA-Derived Small Non-Coding RNAs in Response to Ischemia InhibitAngiogenesis. Scientific Reports6, online (2016). [DOI] [PMC free article] [PubMed]
- 32.Consortium TCGA. The Molecular Taxonomy of Primary Prostate. Cancer. Cell. 2015;163:1011–1025. doi: 10.1016/j.cell.2015.10.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Cooperberg M. Re-Examining Racial Disparities in Prostate Cancer Outcomes. Journal of Clinical Oncology. 2013;31:2979–2980. doi: 10.1200/JCO.2013.50.7723. [DOI] [PubMed] [Google Scholar]
- 34.Eble M. Complications from Treatment of Localised Prostate Cancer. Lancet Oncology. 2014;15:134–135. doi: 10.1016/S1470-2045(14)70015-4. [DOI] [PubMed] [Google Scholar]
- 35.Nam RK, et al. Incidence of Complications Other Than Urinary Incontinence or Erectile Dysfunction After Radical Prostatectomy or Radiotherapy for Prostate Cancer: A Population Based Cohort Study. Lancet Oncology. 2014;15:223–231. doi: 10.1016/S1470-2045(13)70606-5. [DOI] [PubMed] [Google Scholar]
- 36.Bosset JF, et al. Fluorouracil-Based Adjuvant Chemotherapy After Preoperative Chemoradiotherapy in Rectal Cancer: Long-Term Results of the EORTC 22921 Randomised Study. Lancet Oncology. 2014;15:184–190. doi: 10.1016/S1470-2045(13)70599-0. [DOI] [PubMed] [Google Scholar]
- 37.Barrington WE, et al. Difference in Association of Obesity with Prostate Cancer Risk Between US African American and Non-Hispanic White Men in the Selenium and Vitamin E Cancer Prevention Trial (SELECT) Journal of the American Medical Assocation Oncology. 2015;1:342–349. doi: 10.1001/jamaoncol.2015.0513. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.D’Amico, A. V. Treatment or Monitoring for Early Prostate Cancer. New England Journal of Medicine0, null, 10.1056/NEJMe1610395 (2016). [DOI] [PubMed]
- 39.Hoffman RM, et al. Racial and Ethnic Differences in Advanced-Stage Prostate Cancer: the Prostate Cancer Outcomes Study. Journal of National Cancer Institute. 2001;93:388–395. doi: 10.1093/jnci/93.5.388. [DOI] [PubMed] [Google Scholar]
- 40.Khierandish P, Chinegwundow F. Ethnic Differences in Prostate Cancer. British Journal of Cancer. 2011;105:481–485. doi: 10.1038/bjc.2011.273. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Olvedy M, et al. A Comprehensive Repertoire of tRNA-Derived Fragments in Prostate Cancer. Oncotarget. 2016;7:24766–24777. doi: 10.18632/oncotarget.8293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Magee, R., Loher, P., Londin, E. & Rigoutsos, I. Threshold-seq: a tool for determining the threshold in short RNA-seq datasets. Bioinformatics, 10.1093/bioinformatics/btx073 (2017). [DOI] [PMC free article] [PubMed]
- 43.Tan, G. C. et al. 5′ isomiR variation is of functional and evolutionary importance. Nucleic Acids Research, 10.1093/nar/gku656 (2014). [DOI] [PMC free article] [PubMed]
- 44.Baran-Gale J, Fannin EE, Kurtz CL, Sethupathy P. Beta Cell 5′-Shifted isomiRs Are Candidate Regulatory Hubs in Type 2 Diabetes. PLoS ONE. 2013;8:e73240. doi: 10.1371/journal.pone.0073240. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Manzano M, Forte E, Raja AN, Schipma MJ, Gottwein E. Divergent target recognition by coexpressed 5′-isomiRs of miR-142-3p and selective viral mimicry. RNA. 2015;21:1606–1620. doi: 10.1261/rna.048876.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Tusher VG, Tibshirani R, Chu G. Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci USA. 2001;98:5116–5121. doi: 10.1073/pnas.091062498. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Epstein JI, et al. A Contemporary Prostate Cancer Grading System: A Validated Alternative to the Gleason Score. Eur Urol. 2016;69:428–435. doi: 10.1016/j.eururo.2015.06.046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Magee, R. et al. Assessment of isomiR Discrimination Using Commercial qPCR Methods. Non-Coding RNA3 (2017). [DOI] [PMC free article] [PubMed]
- 49.Pliatsika V, Loher P, Telonis AG, Rigoutsos I. MINTbase: a framework for the interactive exploration of mitochondrial and nuclear tRNA fragments. Bioinformatics. 2016;32:2481–2489. doi: 10.1093/bioinformatics/btw194. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Telonis AG, et al. Dissecting tRNA-derived fragment complexities using personalized transcriptomes reveals novel fragment classes and unexpected dependencies. Oncotarget. 2015;6:24797–24822. doi: 10.18632/oncotarget.4695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Lewis BP, Burge CB, Bartel DP. Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell. 2005;120:15–20. doi: 10.1016/j.cell.2004.12.035. [DOI] [PubMed] [Google Scholar]
- 52.Miranda KC, et al. A pattern-based method for the identification of MicroRNA binding sites and their corresponding heteroduplexes. Cell. 2006;126:1203–1217. doi: 10.1016/j.cell.2006.07.031. [DOI] [PubMed] [Google Scholar]
- 53.Kruger J, Rehmsmeier M. RNAhybrid: microRNA target prediction easy, fast and flexible. Nucleic Acids Res. 2006;34:W451–454. doi: 10.1093/nar/gkl243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Rajan P, Elliott DJ, Robson CN, Leung HY. Alternative splicing and biological heterogeneity in prostate cancer. Nat Rev Urol. 2009;6:454–460. doi: 10.1038/nrurol.2009.125. [DOI] [PubMed] [Google Scholar]
- 55.Wang BD, et al. Alternative splicing promotes tumour aggressiveness and drug resistance in African American prostate cancer. Nat Commun. 2017;8:15921. doi: 10.1038/ncomms15921. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Loher P, Telonis AG, Rigoutsos I. MINTmap: fast and exhaustive profiling of nuclear and mitochondrial tRNA fragments from short RNA-seq data. Sci Rep. 2017;7:41184. doi: 10.1038/srep41184. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Sigg, C. D. & Buhmann, J. M. Expectation-Maximization for Sparse and Non-Negative PCA. Proceedings of the 25th International Conference on Machine Learning (2008).
- 58.Loher P, Rigoutsos I. Interactive exploration of RNA22 microRNA target predictions. Bioinformatics. 2012;28:3322–3323. doi: 10.1093/bioinformatics/bts615. [DOI] [PubMed] [Google Scholar]
- 59.Huang da W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4:44–57. doi: 10.1038/nprot.2008.211. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The TCGA datasets that are described in this study are available from The Cancer Genome Atlas data portal at: https://portal.gdc.cancer.gov/. The deep sequencing datasets from normal prostate and PRAD originally discussed in our previous work1 are available at http://trace.ddbj.nig.ac.jp/DRASearch/study?acc = SRP034550. The data of the earlier prostate cancer study41 are available through GEO accession GSE80400. All of the data that we generated in our analyses are included in the Supplemental material accompanying this manuscript.
The original datasets that we used and/or analyzed during the current study are available from The Cancer Genome Atlas data portal at: https://portal.gdc.cancer.gov/. The alternative datasets are available through http://trace.ddbj.nig.ac.jp/DRASearch/study?acc = SRP034550 and through GEO accession GSE80400.