Abstract
Eukaryotic organisms have developed a variety of mechanisms to regulate translation post-transcriptionally, including but not limited to the use of miRNA silencing in many species. One method of post-transcriptional regulation is through miRNAs that bind to the 3′ UTRs to regulate mRNA abundance and influence protein expression. Therefore, the diversity of mRNA 3′ UTRs mediating miRNA binding sites influence miRNA-mediated regulation. Alternative polyadenylation, by shortening mRNA isoforms, increases the diversity of 3′ UTRs; moreover, short mRNA isoforms elude miRNA-medicated repression. Because no current prediction methods for putative miRNA target sites consider whether or not 1) splicing-informed miRNA binding sites and/or 2) the use of 3′ UTRs provide higher resolution or functionality, we sought to identify not only the genome-wide impact of using exons in mRNA 3′ UTRs but also their functional connection to miRNA regulation and clinical outcomes in cancer. With a genome-wide expression of mRNA and miRNA quantified by 395 bladder cancer cases from The Cancer Genome Atlas (TCGA), we 1) demonstrate the diversity of 3′ UTRs affecting miRNA efficiency and 2) identify a set of genes clinically associated with mRNA expression in bladder cancer. Knowledge of 3′ UTR diversity will not only be a useful addition to current miRNA target prediction algorithms but also enhance the clinical utility of mRNA isoforms in the expression of mRNA in cancer. Thus, variability among cancer patient’s variability in molecular signatures based on these exon usage events in 3′ UTR along with miRNAs in bladder cancer may lead to better prognostic/treatment strategies for improved precision medicine.
Introduction
The 3′ untranslated region (UTR), located at the 3′ end of mRNA, is not translated into protein. However, the 3′ UTR includes important sequences that affect not only the fate of the mRNA, but also regulation by non-coding RNA that influences the total amount of protein [1]. microRNA (miRNA) is a type of non-coding RNA molecule approximately 22 nucleotides in length, which can post transcriptionally regulate gene expression. miRNA primarily binds to its target mRNA at the 3′ UTR of target mRNA of protein-coding genes, which results in repressed translation of mRNA or cleaved mRNA [2–4].
The diversity of mRNA 3′ UTRs in human genes is influenced by alternative polyadenylation (APA) and alternative splicing (AS). In general, mRNA stability is affected by the length of 3′ UTRs because longer mRNA 3′ UTRs provide more regulatory regions, which results in less protein production by degrading its mRNA or attenuating translation [5]. In other words, because of the alternative usage of polyadenylation, shorter mRNA 3′ UTR isoforms lose miRNA binding sites, which initiates not only an evasion of miRNA-mediated inhibition, but also reduces transcriptional capacity [6]. Shorter mRNA isoforms have been identified in the proto-oncogene IGF2BP1/IMP-1 [7]. Moreover, the reduced expression of longer 3′ UTRs is widespread and correlated with proliferation in immune cells [8]. Furthermore, miRNA-mediated repression of mRNA isoforms influences cancer development [9, 10]. For example, the overexpression of the cell cycle gene, Cyclin D1 (CCND1), is very common for many cancers, including bladder cancer, and contributes to cancer development [11–14]. While CCND1 gene generates two mRNA isoforms [15, 16], the shorter mRNA isoform, which excludes miRNA binding site, results in higher expression of CCNDI. Taken together, these results suggest that APA that affects miRNA binding is important for dissecting the etiology of cellular tumorigenesis [17].
AS allows many eukaryotic organisms to generate multiple mRNA isoforms from a single gene through diverse combinations of exons and introns. Exon skipping, intron retention, and alternative 5′ and 3′ splice sites are prevalent events that increase the diversity of mRNA isoforms [18]. Approximately 90% of human genes undergo AS and produce various mRNA isoforms, which results from a different combination of exons [19]. Such diverse exon usage via AS (i.e., exon skipping, intron retention, and alternative 5′ and 3′ splice sites) can occur within 3′ UTR of the mRNA, possibly influencing the ability of miRNA recognition of its target mRNA.
APA can result in shorter mRNA 3′ UTRs, resulting in lower expression of the synthesized protein. Intron retention demonstrates a potential connection between human miRNA targets and mRNA 3′ UTRs, by increasing putative miRNA targets within the 3′ UTRs of human mRNAs [20].
Although APA and intron retention have been known to change putative miRNA target sites, other potential dynamics in 3′ UTRs have not been systematically studied in human mRNA. These potential dynamics cause shorter mRNA 3′ UTR because of 1) splicing events (i.e., exon skipping and alternative 5′ and 3′ splices) and 2) mutually exclusive usage of 3′ UTR region (i.e., no common 3′ UTR region between mRNA isoforms and 3′ UTRs—coding regions in other mRNA isoforms). The objective of this study was to determine the genome-wide impact of different usage of exons in mRNA 3′ UTRs which contain miRNA binding sites, and their clinical relevance. Our hypothesis was that absence of an entire or partial exon in mRNA 3′ UTRs allows mRNA to evade miRNA repression; therefore, miRNA-mediated repression mediates the functional consequence of changed mRNA post-transcription and oncologic outcomes. We tested this hypothesis for muscle invasive bladder cancer using the genome-wide expression of mRNA and miRNA quantified in 395 bladder cancer cases from the Cancer Genome Atlas (TCGA).
Material and methods
Data
miRNA expression and RNA-Seq datasets for all cases of bladder urothelial carcinoma were downloaded from the online Genomic Data Commons (GDC) Data Portal (https://gdc-portal.nci.nih.gov/) and Broad Institute GDAC Firehose (http://gdac.broadinstitute.org/) in September 2016. The GDC Data Portal featured 430 cases for RNA-Seq and 437 cases for miRNA. We downloaded 395 cases that matched in both the RNA-Seq and miRNA data sets and clinical information of Stage and Survival (See Methods below). The RNA-seq data for all cases were obtained by Illumina HiSeq and reads featured strand-specific paired-ends. miRNA data was quantified by using a miRNA-seq maturation process. For each bladder cancer case (n = 395), we mapped the RNA-seq reads to the reference genome (i.e., release 75, GRCh37.75) using TopHat v2.1.1 [21]. Additionally, to quantify all mRNA expression levels (i.e., fragment per kilobase per million—FPKM), the mapped bam files from TopHat were assembled using Cufflinks, a tool for comprehensive expression analysis of RNA-seq data [22], with the GTF guide and normalization of the upper quartile-norm options. The miRNA-seq data was downloaded from the Broad Institute GDAC Firehose (analysis version 2016_01_28). In particular, we used the mature miRNA data that was generated from read per million miRNA mature reads in level 3 data. All the RPM values from the mature strands were summed and log transformed to generate the miRNA data matrix.
Compiling miRNA-binding sites from publically available sources
As most existing miRNA target prediction methods are based on the sequence similarity between mature miRNA and the 3′ UTR region of mRNA. These sequence-based miRNA target predictions frequently produce false-positive binding sites due to the lack of exponential validation. Therefore, to obtain comprehensive relations between miRNA and mRNA, we compiled one experimentally validated set with two existing miRNA target prediction databases: 1) miRTarBase, which is based on experimentally validated miRNA targets by reporter assay, western blots, and etc. [23]; 2) TargetScan (Release 7.0), which is based on conserved complementarity between targets of miRNAs and mRNAs [24]; 3) MicroRNA.org, which is based on the miRanda algorithm, and from which only predictions with high confidence scores (alignment score ≥ 120 and binding energy ≤-7.0) were included in this study [25]. Our compilation with data quality control followed three steps: First, we identified experimentally validated pairs between mRNAs and miRNAs using miRTarBase. Second, for the pairs with experimental validations, we obtained mRNA-target sites in 3′ UTR and their genomic coordinates by matching miRNA IDs (i.e., hsa-miR-199) with TargetScan and MicroRNA.org. Third, we then defined 3′ UTR regions of each mRNA based on Ensembl (Release version 75) reference information (http://ftp.ensembl.org/pub/release-75/gtf/homo_sapiens/, September 2016). This compilation yielded 439,404 pairs of miRNAs and their target sites, which is 2.7% of the total predictions (16,228,619 pairs) reported in the MicroRNA.org database. The resultant dataset consists of 1) the genomic coordinates of all miRNA binding sites in 3′ UTRs, 2) the target gene id (i.e., ENSG id), 3) the target mRNA name (i.e., ENST id), and 4) target exons in 3′ UTRs. In our study, miRNA binding exon (MBE) refers to any exons including at least one miRNA target site in 3′ UTR of mRNA.
3′ UTR miRNA binding site annotations
We mapped miRNA binding sites in 3′ UTR exons. Overall, there are two types of mRNA isoforms that do not include MBEs in 3′ UTRs: 1) AS-derived and 2) non-AS-derived. The AS-derived absence of MBEs includes exon skipping, intron retention, and alternative 5′ and 3′ splice sites. On the other hand, the non-AS-derived absence of MBEs includes mutually exclusive 3′ UTR between mRNA isoforms, alternative last exon usage, alternative polyadenylation, and non-coding mRNA. For the AS-derived case, we defined which MBEs are constitutively or alternative spliced by comparing all mRNAs in a given gene. If an MBE is skipped in a mRNA isoform, then the isoform is defined as MBE-free, in reference to exon skipping or the loss of an entire exon. If part of the MBE is skipped as a result of the alternative 5′ or 3′ splice sites in a mRNA isoform, then the mRNA isoform is defined as MBE-free (see Fig 1, Step 2). Otherwise, a given mRNA isoform is a MBE-retaining isoform, which is regulated by miRNA.
Alternative polyadenylation sites in 3′ UTR
We downloaded the APA sites in normal bladder cells from APASdb [26]. APASdb provides tissue-specific APA sites with the supporting read data. We used APA sites with supporting reads >10.
Define and estimate expression ratio of mRNA isoforms that are repressed by miRNA
We obtained the quantifications of all mRNA expression levels (i.e., FPKM) using Cufflinks. With the expression data, we estimated a relative ratio of MBE-retaining mRNA expression to the summation of the expression of MBE-retaining and MBE-free mRNA isoforms. This relative ratio is presented by percent splice-in (PSI), which is the fraction of affected expression by a single miRNA binding site in a given gene. As such, PSI is a fair estimate of the potential decrease of mRNA expression by miRNA-mediated repression without all possible MBE-free transcript isoforms. For a specific exon t, that is an MBE in a 3′ UTR, the PSI value (denoted as a ratio between two groups of MBE-retaining and MBE-deleting mRNA isoforms t), Ψt, was estimated by using the identification of genetic variants affecting alternative splicing (IVAS), a Bioconductor package, which utilizes the following algorithm [27]:
Here, Xit and Xst are the expression of MBE-retaining and MBE-free mRNA isoforms t, respectively. Therefore, we obtained expressions of the two groups and estimated the expression ratio. Our assumption was that the sum of Xit would be lower than that of Xst by miRNA targets because transcript isoforms in the Xit group are more likely to be repressed by miRNA.
With each pair of miRNA expression and PSI value (i.e., their ratios), we performed a linear regression to determine whether or not mRNA expression is inversely correlated with miRNA expression. The miRNA expression values were the predictor variable and PSI values of the given exon were the outcome variable. Statistically significant associations were determined if Padj (i.e., the Bonferroni correction) was less than 0.05 with anti-correlation (r < -0.3).
Association of mRNA expression with stage and histology groups in bladder cancer cases
We used the clinical status information of 395 bladder cancer cases, which have both the RNA-Seq and miRNA data as described above, obtained from the Broad Institute Firehose (analysis_version_2016_01–28). From this clinical data, we classified the 395 cases into subgroups according to the cancer stage, which were based on 1) the progression—from I to IV—of the disease and 2) the histology status, which comprises non-papillary (i.e., 0) or papillary (i.e., 1). Stage II was grouped together (i.e., 130 cases), and stage III and IV were grouped together (i.e., 265 cases). There were 268 cases that exhibited histology 0 status, and 127 cases exhibited histology 1 status. The 395 cases that had both stage and histology information were carried forward. For mRNA isoforms that contained exons with their PSI value anti-correlated with miRNA expressions, we performed a linear regression to identify which genes show differential expression of MBE-retaining mRNAs of a gene having at least two isoforms (each with and without MBE and potentially repressed by miRNA) between the stages and the histology. Age and smoking status were added to the regression model as covariates. The statistical significance between PSI values and clinical groups was defined as p < 0.05 via Bonferroni correction. To validate the p-value from the regression, we performed a permutation test by generating 1,000 sets of random assignments for the stage and histology of the 395 cases. With 1,000 sets of randomized subgroups, we conducted the same linear regression.
Survival analysis
We divided the 395 cases into two groups according to the PSI values of mRNA (i.e., high and low) for each gene having a statistically significant differential PSI value between the stage groups and the histology groups. We then carried out a Kaplan-Meier survival analysis to assess differences in overall survival between the two groups of cases with high and low PSI values.
Results
Genes escaping miRNA-mediated repression because of the diversity of the mRNA 3′ UTR
An overview of our study is depicted in Fig 1, and our results are summarized in S1 Fig.
A comprehensive dataset of miRNAs and their target sites was compiled by integrating three repositories of miRNA targets: one experimentally validated miRNA target and two predicted miRNA targets. We obtained the experimentally validated unique pairs of miRNA and target which comprised 1,410 miRNAs and 9,793 genes. Among these 9,793 genes, 9,142 genes had at least two mRNA isoforms, and only these genes were included for further analysis. A summary of our results is depicted in S1 Fig. Among the 439,404 pairs of miRNAs and their binding sites, we defined MBEs in 3′ UTRs (see Materials and Methods). For the 9,142 genes that generated more than two mRNA isoforms, we identified one mRNA per gene that had multiple miRNA target sites in the 3′ UTR. In addition, 40.9% and 44.5% of genes had three and four miRNA target sites in the 3’ UTR per a single gene and per single miRNA, respectively (see Fig 2A). In other words, most of each miRNA have three or four binding sites in 3’ UTR per target gene. Interestingly, among 50,245 pairs that had three miRNA binding sites, 86% lost all three miRNA binding sites, which suggests that at least one mRNA (i.e., miRNA-binding exon [MBE]-free—see Materials and Methods for definition) completely eluded miRNA-mediated repression (see Fig 2B). In addition, 86% (i.e., 7,841) of the genes produced at least one such mRNA isoform that is MBE-free (i.e., 313,091 unique pairs of miRNA and their target sites), while a mere 8% (i.e., 699 genes) generated only mRNA isoforms that retain MBE (see Fig 2C). Furthermore, we not only examined the diverse features of MBE-free that occurred in 3′ UTRs of every transcript isoform per miRNA per single gene, but also further classified the feature of losing miRNA binding sites in 3′ UTRs into four categories (see Fig 1, Step 2): 1) alternative 3′ or 5′ splice sites (i.e., miRNA 3 in mRNA1), 2) mutually exclusive 3′ UTR regions (i.e., miRNA case in mRNA3 vs. mRNA5 and mRNA5 vs. mRNA6), 3) exon skipping (i.e., miRNA 2 in mRNA2), and 4) others (i.e., retained introns, non-coding RNA, and alternative polyadenylation). Fig 2D shows the distribution of each feature that results in MBE-free. As the AS-related event, alternative 5′ or 3′ splice sites were the major features that resulted in MBE-free. Moreover, mutually exclusive 3′ UTR regions was a frequent feature that also may contribute to change-final mRNA expression.
Diversity of mRNA 3′ UTRs conferring multiple transcript isoforms may affect miRNA-mediated repression of mRNAs
We obtained 214,293 unique mRNA isoforms using Cufflinks. There were 395 cases of miRNA expression that matched to cases with RNA-seq data. Only miRNAs that had an expression value in more than 50% of cases (>200 cases) were included in our analysis. As a result, we obtained 548 miRNAs, which corresponded to 166,551 pairs (i.e., 548 miRNAs and 7,607 genes) among 313,091 pairs in 7,841 genes, which generate transcript isoforms that are MBE-free 3′ UTRs (see the right panel of S1 Fig, right). Next, using linear regression to identify a statistically significant correlation of miRNA expression with mRNA expression (i.e., PSI) for each of the 166,551 pairs across 548 miRNAs and 7,607 genes in 395 bladder cancer cases, we identified 2,324 statistically significant pairs, which comprised 182 miRNAs and 619 genes (adjusted Padj < 0.05 via Bonferroni correction). Among them, 569 unique pairs (i.e., 78 miRNAs and 155 genes) were inversely correlated (i.e., r < -0.3, adjusted p < 0.05, see S1 Table, the bottom panel of S1 Fig). However, when the miRNAs were tested for correlation with gene level expression, 1,979 pairs (i.e., 299 miRNAs and 1,158 genes) were statistically significant, and 148 pairs (i.e., 47 miRNAs and 103 genes) exhibited anti-correlations. Interestingly, only eight genes were identified in both calculations for mRNA and gene level expression, indicating concordant impacts of miRNA-mediated repression on both levels. In other words, the summation of all mRNAs expressed from a gene is likely to be similar to the given gene’s expression level, which is the level of miRNA-mediated regulation. Such concordance between mRNA and gene-level expression affected by miRNA can be explained by the cases we observed. First, the intrinsic expression level of each transcript isoform differs. For example, when transcript isoforms targeted by a miRNA are predominantly expressed over isoforms not so targeted, the miRNA-targeted mRNAs are major contributors to the total gene expression level. Second, mRNAs have different numbers and locations of miRNA binding sites. For example, when a mRNA has only one miRNA binding site, the mRNA expression level repressed by the given miRNA can be similar to the gene expression level repressed by the miRNA. S2 Fig. shows one example, COLEC12, out of eight genes that have two transcript isoforms. The mRNA expression level of transcript ENST00000400256 was inversely correlated with expression of hsa-miR-148b-3p, indicating that the miRNA might repress the mRNA. Since the other mRNA isoform, ENST00000582147, is a non-coding RNA with an expression level of almost zero, miRNA-mediated repression showed concordant impacts at the both mRNA and gene levels (S2D Fig).
Links between alternative splicing and alternative polyadenylation
As stated in the introduction, APA is one factor that increases 3′ UTR diversity—a factor well-studied in miRNA regulation. APA in transcript isoforms is known to be linked to alternative splicing [28]. We estimated how many cases among 569 unique pairs were coupled with APA using a database of potential APA sites (See Methods). When the APA is located upstream of the alternative splicing events of each pair, these splicing events can be diminished due to the early cleavage site. Among 569 pairs, 27.5% (157 pairs) were predicted to have APA sites upstream of the splicing events, suggesting that splicing events in these 157 pairs are linked to APA. Although our study has limitations such as examining which transcript isoforms differ in 3′UTRs by an APA and whether or not APA is AS-dependent, it benefits from our classification of all possible dynamics in 3′ UTRs; moreover, we tested purely miRNA-binding transcript isoforms for their associations with the given miRNA.
Differential expressions of MBE-retaining transcript isoforms between stage and histology subgroups in bladder cancer
American Joint Committee on Cancer (AJCC) cancer stage is a standardized way to assess the level of 1) tumor invasion, 2) spread to lymph nodes, and 3) metastasis to distant sites. Cancer stage is often determined through a histology report which will then be used to determine a prognosis. Therefore, by developing more precise measurements for defining stage can potentially inform and improve treatment. Since 3’UTR isoforms were shown to regulate gene expression with respect to bladder cancer, we hypothesized that the 3’ UTR isoforms in bladder cancer samples could potentially differ between cancer stage and histology subgroups, thus providing a new means to assess the state of cancer.
As described in Materials and Methods, we grouped 395 bladder cancer cases into two AJCC stage groups and two histology groups. For 155 genes, which exhibit statistically significant reverse-correlation between the expression of miRNA and mRNA expression, we examined whether or not these mRNA isoforms are differentially expressed between each of the two stage and histology groups. From a regression test, a total of 7 and 37 genes, were identified (see S1 Table, bottom and right panel of S1 Fig), which were differentially expressed between stage and histology groups (adjusted p < 0.05 via Bonferroni correction), respectively. Seven genes were differentially expressed in both of stage and histology groups.
Functional enrichment analysis on identified genes
We performed a pathway over-representation analysis using ConsensusPathDB (CPDB) to identify enriched pathways, given identified genes from the analyses [29]. For 7 genes associated with stage, we observed an enriched KEGG pathway (i.e., miRNAs in cancer) (FDR q < 0.05), which suggests that these 7 genes are regulated by previously identified cancer-related miRNAs; moreover, this regulation induces subsequent, malignant effects on cell differentiation, proliferation, and apoptosis that lead to tumor growth and progress. An over representation analysis also demonstrated that 37 genes associated with histology were over-represented in nine pathways (FDR q < 0.05) (S2 Table). Furthermore, 16 genes associated with survival were enriched in eight pathways, which included photodynamic therapy-induced HIF-1 survival signaling and HIF-1-alpha transcription factor network pathways (S3 Table). Hypoxia-inducible factor (HIF)-1 is a transcription factor that regulates genes involved in cancer development and contributes to anaerobic metabolism and angiogenesis [30, 31]. Notably, HIF-1 is a well-known transcriptional factor associated with bladder cancer [32] and other cancers [33], and Nadaoka et al. assert that HIF-1 alpha polymorphism is a prognostic factor in bladder carcinoma [34]. Taken together, these observations suggest that HIF-1 may play an important role in variability, based on the different usage of exons in mRNA 3'UTRs and their functional connection to miRNA regulation efficiency. Therefore, HIFs not only mediate resistance to radio- and chemotherapy, but also are associated with poor prognosis in bladder cancer patients. In addition to the over-representation analysis, we also performed a network analysis of the interaction- and pathway-centric analysis of lists of genes, known as induced network modules analysis [35]. Given 16 genes associated with survival, these seed genes were to interconnect those genes through different types of interactions (e.g., physical, biochemical, and regulatory) (S3 Fig). In particular, VEGFA, which was associated with stage, histology, and survival, was connected to multiple genes associated with HIF-1 as gene regulatory interactions (S3 Fig, see Fig 3and below for detail). A previous study supports the evidence of an activation of vascular endothelial growth factor gene transcription by hypoxia-inducible factor 1[36].
Clinical interpretation
For the 7 and 37 genes that 1) generated mRNA isoforms that lose miRNA target exons and 2) exhibited differential expression in cancer stage and histology, respectively, we examined whether or not these genes inform bladder cancer prognosis for a given patient. Through Kaplan-Meier survival analysis (see Materials and Methods), 5 and 16 genes, respectively, exhibited a statistically significant association with stage and histology (bottom and right panel of S1 Fig); meanwhile, five genes exhibited a statistically significant association with both of stage and histology: CYBRD1, H2AFY, PCSK7, VEGFA, and ZFPM2.
VEGFA: Statistical significance for three clinical features
The hsa-miR-361-5p miRNA potentially binds to the 3′ UTR (i.e., chromosome 6, 43,753,919–43,753,940) of mRNA transcribed from the Vascular Endothelial Growth Factor A (VEGFA) gene. As shown in Fig 3, the PSI value (i.e., the relative expression ratio between mRNA isoforms with and without a miRNA target exon) is likely to be highly expressed in stage group 1 and histology group 0. Moreover, of the two subgroups (i.e., low and high PSI expression), we found that, through K-means clustering, the subgroup with a higher PSI ratio comprised more severe cases. The VEGFA gene is part of the platelet-derived growth factor (PDGF) / vascular endothelial growth factor (VEGF) family, which is associated with angiogenesis in cancer. In fact, in an examination of the association between VEGF-A expression in bladder cancer tissues and clinical outcomes, Fauconnet et al. suggest that transcript levels are higher among bladder cancer stages T2 to T4 [37].
BACE1: Statistical significance in stage and histology only
Hsa-miR-17-5p miRNA potentially targets the chr11:117160243–117160223 chromosome in the 3′ UTR of the Beta-Secretase 1 (BACE1) gene. As shown in Fig 4, the PSI level was high for stage group 1 and histology group 0. However, we observed no statistically significant differential survival outcomes between the two subgroups based on low and high PSI values. BACE1 is a gene that encodes the Beta-secretase 1 (BACE1) enzyme, which has important roles in forming myelin sheaths in peripheral nerves [38]. BACE1 not only holds clinical relevance in the pathogenesis of Alzheimer’s disease, but also has been a molecular target for drug development [39]. Little is known about the association between BACE1 and cancer; however, because bladder cancer exhibits the highest median age (i.e., 73 years) at diagnosis of all cancer types, this gene may have an association with disease development in aging populations.
Discussion
Bladder cancer is the fifth most common malignancy among American men, with 74,000 new cases diagnosed in 2015 [40]. Approximately 25% of bladder cancers are muscle-invasive at diagnosis (i.e., tumor stage T2 or greater), which confers a worse prognosis. The standard treatment for muscle-invasive bladder cancer is platinum-based neoadjuvant chemotherapy followed by radical cystectomy and urinary diversion, which is a major extirpative surgical procedure to remove the bladder and reconstruct the urinary tract [7] with high complication rates [41]. The overall 5-year survival for a muscle-invasive disease is 50%, even with radical treatment [42]. Moreover, few changes in mortality rates or advancements in disease management this disease have been reported over the past 30 years. Clearly, better understanding of the molecular basis of the disease offers the possibility of changing the treatment paradigm to offer personalized, molecular drug target-based approaches based on an individual’s tumor biology, stage, and tumor morphology.
Genes in eukaryote cells undergo post-transcriptional regulation as part of their translation into proteins, and one of the main forms of this regulation is coordinated by miRNA. In the process of this regulation, 3′ UTRs serve as a regulatory sequence onto which miRNA binds and, therefore, adjust the stability of mRNA. Up to 90% of human genes generate multiple mRNA isoforms, and mRNA isoforms can differ in their 3′ UTRs. Our examination of mRNA 3′ UTRs revealed the different features of 3′ UTRs that allow transcript isoforms to elude miRNA-mediated regulation. As a result, our study provides comprehensive knowledge on the diversity of 3′ UTRs, which can function as a useful addition to current miRNA target prediction algorithms [18]. The relative ratio of transcript isoform expression and signature differ between cancer and normal tissues; moreover, the ratio characterizes tumor phenotypes and progression—but not gene expression [43–45]. These results also suggest that information about transcript isoforms is necessary for understanding miRNA mediated gene regulatory mechanisms and advancing precision medicine in cancer.
In our study, we explored two sources of omics data (i.e., RNA-seq and miRNA) for bladder cancer from TCGA. In the coming iteration of our research, we will extend our examination to PanCancer in TCGA to further investigate this comprehensive feature of diversity in 3′ UTRs across human cancers. Furthermore, this work helps elucidate the architecture of cancer specific transcript isoform signatures and their regulation by miRNAs through the diversity of 3′ UTRs. In summary, the cancer patient’s characterizing alternative splicing events along with miRNA binding sites in bladder cancer may lead to better prognostic/treatment strategies.
Supporting information
Acknowledgments
The support and resources from the Center for High Performance Computing and Vice President’s Clinical and Translational Research Scholar Program at the University of Utah are gratefully acknowledged. We gratefully acknowledge the TCGA Consortium, and all its members for the TCGA Project initiative, for providing a sample, tissues, data processing and making data and results available. The results published here are in whole or part based upon data generated by The Cancer Genome Atlas pilot project established by the NCI and NHGRI. Information about TCGA and the investigators and institutions that constitute the TCGA research network can be found at http://cancergenome.nih.gov. We also extend our thanks to Brian Greene, EdD(c), Scientific Editor and Writer, University of Pittsburgh, School of Nursing, for his assistance in preparing this manuscript for submission.
Data Availability
All relevant data are within the paper and its Supporting Information files.
Funding Statement
This project is funded, in part, under a grant with the Pennsylvania Department of Health (#SAP 4100070267). The Department specifically disclaims responsibility for any analyses, interpretations or conclusions. This work was also funded by NIGMS grant P50GM115318. In addition, we gratefully acknowledge the TCGA Consortium and all its members for the TCGA Project initiative, for providing sample, tissues, data processing and making data and results available. The results published here are in whole or part based upon data generated by The Cancer Genome Atlas pilot project established by the NCI and NHGRI. Information about TCGA and the investigators and institutions that constitute the TCGA research network can be found at http://cancergenome.nih.gov.
References
- 1.Moore MJ. From birth to death: the complex lives of eukaryotic mRNAs. Science. 2005;309(5740):1514–8. doi: 10.1126/science.1111443 . [DOI] [PubMed] [Google Scholar]
- 2.Lee RC, Feinbaum RL, Ambros V. The C. elegans heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14. Cell. 1993;75(5):843–54. . [DOI] [PubMed] [Google Scholar]
- 3.Pillai RS, Bhattacharyya SN, Artus CG, Zoller T, Cougot N, Basyuk E, et al. Inhibition of translational initiation by Let-7 MicroRNA in human cells. Science. 2005;309(5740):1573–6. doi: 10.1126/science.1115079 . [DOI] [PubMed] [Google Scholar]
- 4.Yekta S, Shih IH, Bartel DP. MicroRNA-directed cleavage of HOXB8 mRNA. Science. 2004;304(5670):594–6. doi: 10.1126/science.1097434 . [DOI] [PubMed] [Google Scholar]
- 5.Barrett LW, Fletcher S, Wilton SD. Regulation of eukaryotic gene expression by the untranslated gene regions and other non-coding elements. Cell Mol Life Sci. 2012;69(21):3613–34. doi: 10.1007/s00018-012-0990-9 ; PubMed Central PMCID: PMCPMC3474909. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Legendre M, Ritchie W, Lopez F, Gautheret D. Differential repression of alternative transcripts: a screen for miRNA targets. PLoS Comput Biol. 2006;2(5):e43 doi: 10.1371/journal.pcbi.0020043 ; PubMed Central PMCID: PMCPMC1458965. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Mayr C, Bartel DP. Widespread shortening of 3'UTRs by alternative cleavage and polyadenylation activates oncogenes in cancer cells. Cell. 2009;138(4):673–84. doi: 10.1016/j.cell.2009.06.016 ; PubMed Central PMCID: PMCPMC2819821. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Sandberg R, Neilson JR, Sarma A, Sharp PA, Burge CB. Proliferating cells express mRNAs with shortened 3' untranslated regions and fewer microRNA target sites. Science. 2008;320(5883):1643–7. doi: 10.1126/science.1155390 ; PubMed Central PMCID: PMCPMC2587246. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Bandres E, Agirre X, Ramirez N, Zarate R, Garcia-Foncillas J. MicroRNAs as cancer players: potential clinical and biological effects. DNA Cell Biol. 2007;26(5):273–82. doi: 10.1089/dna.2006.0544 . [DOI] [PubMed] [Google Scholar]
- 10.O'Rourke JR, Swanson MS, Harfe BD. MicroRNAs in mammalian development and tumorigenesis. Birth Defects Res C Embryo Today. 2006;78(2):172–9. doi: 10.1002/bdrc.20071 . [DOI] [PubMed] [Google Scholar]
- 11.Elsheikh S, Green AR, Aleskandarany MA, Grainge M, Paish CE, Lambros MB, et al. CCND1 amplification and cyclin D1 expression in breast cancer and their relation with proteomic subgroups and patient outcome. Breast Cancer Res Treat. 2008;109(2):325–35. doi: 10.1007/s10549-007-9659-8 . [DOI] [PubMed] [Google Scholar]
- 12.Nosho K, Kawasaki T, Chan AT, Ohnishi M, Suemoto Y, Kirkner GJ, et al. Cyclin D1 is frequently overexpressed in microsatellite unstable colorectal cancer, independent of CpG island methylator phenotype. Histopathology. 2008;53(5):588–98. doi: 10.1111/j.1365-2559.2008.03161.x ; PubMed Central PMCID: PMCPMC2719983. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Radovic S, Babic M, Doric M, Hukic A, Kuskunovic S, Hadzismajlovic A, et al. Non-small cell lung carcinoma: cyclin D1, bcl-2, p53, Ki-67 and HER-2 proteins expression in resected tumors. Bosn J Basic Med Sci. 2007;7(3):205–11. . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Shakir R, Ngo N, Naresh KN. Correlation of cyclin D1 transcript levels, transcript type and protein expression with proliferation and histology among mantle cell lymphoma. J Clin Pathol. 2008;61(8):920–7. doi: 10.1136/jcp.2008.057455 . [DOI] [PubMed] [Google Scholar]
- 15.Wiestner A, Tehrani M, Chiorazzi M, Wright G, Gibellini F, Nakayama K, et al. Point mutations and genomic deletions in CCND1 create stable truncated cyclin D1 mRNAs that are associated with increased proliferation rate and shorter survival. Blood. 2007;109(11):4599–606. doi: 10.1182/blood-2006-08-039859 ; PubMed Central PMCID: PMCPMC1885523. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Chen RW, Bemis LT, Amato CM, Myint H, Tran H, Birks DK, et al. Truncation in CCND1 mRNA alters miR-16-1 regulation in mantle cell lymphoma. Blood. 2008;112(3):822–9. doi: 10.1182/blood-2008-03-142182 ; PubMed Central PMCID: PMCPMC2481543. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Yu Z, Wang C, Wang M, Li Z, Casimiro MC, Liu M, et al. A cyclin D1/microRNA 17/20 regulatory feedback loop in control of breast cancer cell proliferation. J Cell Biol. 2008;182(3):509–17. doi: 10.1083/jcb.200801079 ; PubMed Central PMCID: PMCPMC2500136. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Wu CT, Chiou CY, Chiu HC, Yang UC. Fine-tuning of microRNA-mediated repression of mRNA by splicing-regulated and highly repressive microRNA recognition element. BMC Genomics. 2013;14:438 doi: 10.1186/1471-2164-14-438 ; PubMed Central PMCID: PMCPMC3708814. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Chen M, Manley JL. Mechanisms of alternative splicing regulation: insights from molecular and genomics approaches. Nat Rev Mol Cell Biol. 2009;10(11):741–54. doi: 10.1038/nrm2777 ; PubMed Central PMCID: PMCPMC2958924. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Tan S, Guo J, Huang Q, Chen X, Li-Ling J, Li Q, et al. Retained introns increase putative microRNA targets within 3' UTRs of human mRNA. FEBS Lett. 2007;581(6):1081–6. doi: 10.1016/j.febslet.2007.02.009 . [DOI] [PubMed] [Google Scholar]
- 21.Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013;14(4):R36 doi: 10.1186/gb-2013-14-4-r36 ; PubMed Central PMCID: PMCPMC4053844. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, van Baren MJ, et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol. 2010;28(5):511–5. doi: 10.1038/nbt.1621 ; PubMed Central PMCID: PMCPMC3146043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Chou CH, Chang NW, Shrestha S, Hsu SD, Lin YL, Lee WH, et al. miRTarBase 2016: updates to the experimentally validated miRNA-target interactions database. Nucleic Acids Res. 2016;44(D1):D239–47. doi: 10.1093/nar/gkv1258 ; PubMed Central PMCID: PMCPMC4702890. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Lewis BP, Burge CB, Bartel DP. Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell. 2005;120(1):15–20. doi: 10.1016/j.cell.2004.12.035 . [DOI] [PubMed] [Google Scholar]
- 25.Betel D, Wilson M, Gabow A, Marks DS, Sander C. The microRNA.org resource: targets and expression. Nucleic Acids Res. 2008;36(Database issue):D149–53. doi: 10.1093/nar/gkm995 ; PubMed Central PMCID: PMCPMC2238905. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.You L, Wu J, Feng Y, Fu Y, Guo Y, Long L, et al. APASdb: a database describing alternative poly(A) sites and selection of heterogeneous cleavage sites downstream of poly(A) signals. Nucleic Acids Res. 2015;43(Database issue):D59–67. doi: 10.1093/nar/gku1076 ; PubMed Central PMCID: PMCPMC4383914. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Han S, Jung H, Lee K, Kim H, Kim S. Genome wide discovery of genetic variants affecting alternative 3 splicing patterns in human using bioinformatics method. Genes Genom. 2017. doi: 10.1007/s13258-016-0466-7 [Google Scholar]
- 28.Tian B, Manley JL. Alternative cleavage and polyadenylation: the long and short of it. Trends Biochem Sci. 2013;38(6):312–20. doi: 10.1016/j.tibs.2013.03.005 ; PubMed Central PMCID: PMCPMC3800139. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Kamburov A, Stelzl U, Lehrach H, Herwig R. The ConsensusPathDB interaction database: 2013 update. Nucleic Acids Res. 2013;41(Database issue):D793–800. doi: 10.1093/nar/gks1055 ; PubMed Central PMCID: PMCPMC3531102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Shweiki D, Itin A, Soffer D, Keshet E. Vascular endothelial growth factor induced by hypoxia may mediate hypoxia-initiated angiogenesis. Nature. 1992;359(6398):843–5. doi: 10.1038/359843a0 . [DOI] [PubMed] [Google Scholar]
- 31.Ebert BL, Gleadle JM, O'Rourke JF, Bartlett SM, Poulton J, Ratcliffe PJ. Isoenzyme-specific regulation of genes involved in energy metabolism by hypoxia: similarities with the regulation of erythropoietin. Biochem J. 1996;313 (Pt 3):809–14. ; PubMed Central PMCID: PMCPMC1216982. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Hunter BA, Eustace A, Irlam JJ, Valentine HR, Denley H, Oguejiofor KK, et al. Expression of hypoxia-inducible factor-1alpha predicts benefit from hypoxia modification in invasive bladder cancer. Br J Cancer. 2014;111(3):437–43. doi: 10.1038/bjc.2014.315 ; PubMed Central PMCID: PMCPMC4119984. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Wigerup C, Pahlman S, Bexell D. Therapeutic targeting of hypoxia and hypoxia-inducible factors in cancer. Pharmacol Ther. 2016;164:152–69. doi: 10.1016/j.pharmthera.2016.04.009 . [DOI] [PubMed] [Google Scholar]
- 34.Nadaoka J, Horikawa Y, Saito M, Kumazawa T, Inoue T, Narita S, et al. Prognostic significance of HIF-1 alpha polymorphisms in transitional cell carcinoma of the bladder. Int J Cancer. 2008;122(6):1297–302. doi: 10.1002/ijc.23256 . [DOI] [PubMed] [Google Scholar]
- 35.Berger SI, Posner JM, Ma'ayan A. Genes2Networks: connecting lists of gene symbols using mammalian protein interactions databases. BMC Bioinformatics. 2007;8:372 doi: 10.1186/1471-2105-8-372 ; PubMed Central PMCID: PMCPMC2082048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Forsythe JA, Jiang BH, Iyer NV, Agani F, Leung SW, Koos RD, et al. Activation of vascular endothelial growth factor gene transcription by hypoxia-inducible factor 1. Mol Cell Biol. 1996;16(9):4604–13. ; PubMed Central PMCID: PMCPMC231459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Fauconnet S, Bernardini S, Lascombe I, Boiteux G, Clairotte A, Monnien F, et al. Expression analysis of VEGF-A and VEGF-B: relationship with clinicopathological parameters in bladder cancer. Oncol Rep. 2009;21(6):1495–504. . [DOI] [PubMed] [Google Scholar]
- 38.Hu X, Hicks CW, He W, Wong P, Macklin WB, Trapp BD, et al. Bace1 modulates myelination in the central and peripheral nervous system. Nat Neurosci. 2006;9(12):1520–5. doi: 10.1038/nn1797 . [DOI] [PubMed] [Google Scholar]
- 39.Vassar R. BACE1 inhibitor drugs in clinical trials for Alzheimer's disease. Alzheimers Res Ther. 2014;6(9):89 doi: 10.1186/s13195-014-0089-7 ; PubMed Central PMCID: PMCPMC4304279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Siegel RL, Miller KD, Jemal A. Cancer statistics, 2015. CA Cancer J Clin. 2015;65(1):5–29. doi: 10.3322/caac.21254 . [DOI] [PubMed] [Google Scholar]
- 41.Shabsigh A, Korets R, Vora KC, Brooks CM, Cronin AM, Savage C, et al. Defining early morbidity of radical cystectomy for patients with bladder cancer using a standardized reporting methodology. Eur Urol. 2009;55(1):164–74. doi: 10.1016/j.eururo.2008.07.031 . [DOI] [PubMed] [Google Scholar]
- 42.Yang L, Parkin DM, Li LD, Chen YD, Bray F. Estimation and projection of the national profile of cancer mortality in China: 1991–2005. Br J Cancer. 2004;90(11):2157–66. doi: 10.1038/sj.bjc.6601813 ; PubMed Central PMCID: PMCPMC2409509. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Trincado JL, Sebestyen E, Pages A, Eyras E. The prognostic potential of alternative transcript isoforms across human tumors. Genome Med. 2016;8(1):85 doi: 10.1186/s13073-016-0339-3 ; PubMed Central PMCID: PMCPMC4989457. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Gonzalez-Porta M, Frankish A, Rung J, Harrow J, Brazma A. Transcriptome analysis of human tissues and cell lines reveals one dominant transcript per gene. Genome Biol. 2013;14(7):R70 doi: 10.1186/gb-2013-14-7-r70 ; PubMed Central PMCID: PMCPMC4053754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Sebestyen E, Zawisza M, Eyras E. Detection of recurrent alternative splicing switches in tumor samples reveals novel signatures of cancer. Nucleic Acids Res. 2015;43(3):1345–56. doi: 10.1093/nar/gku1392 ; PubMed Central PMCID: PMCPMC4330360. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are within the paper and its Supporting Information files.