Abstract
BACKGROUND:
Accumulating evidence shows that clinical factors alone are not adequate for predicting the survival of patients with urothelial bladder cancer (UBC), and many genes have been found to be associated with UBC prognosis.
PURPOSE:
The objective of this study is to develop a signature which integrates clinical and molecular information to predict the overall survival of UBC patients more accurate.
MATERIALS AND METHODS:
We integrated messenger RNA (mRNA) and microRNA (miRNA) expression profiles and the corresponding clinical data of 402 UBC patients and 19 normal controls from The Cancer Genome Atlas. Univariate Cox regression followed by a multiple testing correction and an elastic net-regulated Cox regression were adopted to identify a prognostic signature.
RESULTS:
We generated an integrated clinical-RNA signature which consisting of 3 clinical variables, 3 protective mRNAs, 7 risky mRNAs, 2 protective miRNAs and 1 risky miRNA. The area under the receiver operating characteristic curve of the integrated clinical-RNA signature was 0.802, larger than that of the clinical-alone signature (0.709) or the RNA-alone signature (0.726). UBC patients in the high-risk group had a significantly shorter overall survival time compared with patients in the low-risk group (clinical-RNA signature, hazard ratio 2.441).
CONCLUSIONS:
Our conclusions that we have identified an integrated clinical-RNA signature that was superior to the traditional clinical-alone signature for ascertaining the overall survival prognosis of patients with UBC. These findings provide some novel genes for tumor molecular biologist to further study their functions and mechanisms in UBC tumorigenesis and malignance, and may be useful for effective clinical risk management of UBC patients.
Keywords: Urothelial bladder cancer, clinical, RNA, signature, prognosis
1. Introduction
Urothelial bladder cancer (UBC) is most common pathological subtype of bladder cancer, and the clinical prognosis of patients with UBC differs significantly which is hard to predict. If a new criterion had been developed to identify the group of high-risk patients ahead, they could get more suitable treatment including extended curative resection or higher radiation/chemo doses, which may greatly ameliorate the poor survival of UBC patients. So, an accurate prediction model is essential for efficient management of UBC which may bring better clinical outcomes to patients with UBC.
A variety of clinical variables in UBC have been studied with regard to their ability to predict disease recurrence, response to therapy, progression, and survival [1, 2, 3, 4, 5]. In the clinical guideline, clinical stage and tumor grade had been recommended as the most important prognostic indicators to forecast disease recurrence and progression [6]. However, significant variability in patient outcome is observed perhaps due to the underlying molecular heterogeneity within the clinically homogeneous tumor groupings.
In recent years, efforts have been made to better understand UBC through basic researches (molecular and genetic), and many studies have revealed that RNAs could serve as novel biomarkers for UBC prognosis [7, 8, 9, 10, 11, 12, 13, 14]. Kim et al. [9] reported that a four-gene signature could predict prognosis of patients with muscle-invasive bladder cancer. Ratert et al. [11] described that expressions of miR-141 and miR-205 were associated with overall survival in patients with bladder cancer. However, these studies mainly focused on messenger RNA (mRNA) profiles or microRNA (miRNA) profiles of UBC genomes independently, and the prognostic signatures were quite different due to a low sample size or the use of an inappropriate regression method for parameter estimation.
Biologically, identification of prognostic signature by integration of mRNA and miRNA expression profiles incorporating more information and may discover more robust prognostic signature due to potential interactions between miRNAs and mRNAs. Methodologically, in genomic expression analysis, there is a so-called “curse of dimensionality” problem in that the number of genes is much larger than the number of samples [15]. In this setting, ordinary regression is subject to over-fitting and instable coefficients, and stepwise variable selection methods do not scale well [16, 17]. Regression by penalization methods has been successfully adapted to high-dimensional multiple genomic datasets and outperforms univariate and multivariate regression methods [18]. At present, the most commonly used penalization methods are ridge regression, Least Absolute Shrinkage and Selection Operator (LASSO) regression and a hybrid of these (elastic net regression); all three methods are based on penalizing the L1 norm, the L2 norm, and both the L1 norm and L2 norm with tuning parameters. Although the traditional Cox proportional hazards model is widely used to discover cancer prognostic factors, it is not appropriate for the genomic setting due to the high dimensionality and colinearity. Several groups have proposed to combine the Cox regression model with the elastic net dimension reduction method to select survival-correlated genes within a high-dimensional expression dataset and have made available the associated computation procedures [19, 20, 21].
By characterizing genetic alterations, epigenetic alterations, and the expression of cancer genomes, The Cancer Genome Atlas (TCGA) project has provided a comprehensive way to understand UBC [22]. As the number of UBC samples in this project grows, the opportunity to identify prognostic molecular signature for patients with UBC is increasing. In the current study, we subjected the integrated mRNA and miRNA profiles of UBC patients to elastic net-regulated Cox regression analysis and identified an integrated prognostic signature that can predict the overall survival (OS) of UBC patients. These may be helpful for selecting high risk UBC patients for better clinical management.
2. Materials and methods
2.1. Patient characteristics and integrated RNA profiles
Clinical data and RNA expression profiles of UBCs were retrieved from TCGA (https://cancergenome.nih.gov/). The dataset contained 412 UBC patients and 19 patients have matched normal samples. A total of 402 patients were included for further study, and 10 patients were excluded from the study by the following criterions: First, the patients that not having both fully characterized mRNA and miRNA profiles. Second, the patients were alive but the last followup times were unavailable. mRNA expression data were generated by RNA-Seq with HTSeq-Counts workflow type and miRNA expression data were generated by miRNA-Seq with GCGSC miRNA profiling workflow type. The level-3 raw counts of mRNA and miRNA sequencing data were downloaded, combined, full-quantile-normalized, and quantile-filtered.
2.2. Prognostic clinical factors
Survival analysis with log-rank test was preliminarily applied to filter out the demographic and clinical factors that were correlated with the OS of UBC patients from among 9 factors: gender, age at initial diagnosis, clinical stage, tobacco use, anatomic neoplasm subdivision, Karnofsky performance status, human race, lymphovascular invasion, and neoplasm histologic grade. Demographic and clinical factors resulting in a log-rank test P value 0.05 were further selected into a multivariate stepwise Cox regression analysis to evaluate the contribution of the preliminary selected clinical factors as independent clinical prognostic factors.
2.3. Prognostic mRNAs and miRNAs
mRNAs and miRNAs that differentially expressed between UBCs and normal samples were selected using “glmLRT” method implemented in “edger” R package with FDR 0.01 and logFC 1 as parameters. A univariate Cox regression method was used to assess the relationship between OS and expressions of differentially expressed RNAs. To alleviate the effect of “curse of dimensionality” for general multivariate Cox regression analysis, we adopted an elastic net-regulated Cox regression analysis with 200,000 iterations and 10 cross-validations [20, 21] to estimate the regression coefficients more accurate. RNAs with elastic net-regulated Cox regression coefficients 0, which were considered candidate RNAs, were included in the RNA signature. RNAs that had a hazard ratio (HR) for death 1 were considered to be risky RNAs, and those with an HR 1were defined as protective RNAs.
2.4. Prognosis index and integrated clinical-RNA signature
We adopted a prognosis index (PI) which defined as a linear combination of the observed value weighted by the regression coefficients as indicator for UBC patient prognosis. Specifically,
where, PI is the prognostic index of the jth observation, is the regression coefficient of the th variable. is the observed value of the th variable in the th observation. For the RNA signature, is the relative expression value of the th RNA in the th patient, is the elastic net-regulated Cox regression coefficient of the th RNA, and the calculated PI was termed as RNA-PI. For the clinical factors, is the observed value of the th clinical factor in the th patient, is the general multivariate Cox regression coefficient of the th clinical factor, and the calculated PI was termed as clinical-PI.
We also used multivariate stepwise Cox regression analysis to determine the prognostic contributions and independence of RNA-PI and the associated clinical factors [15, 23]. RNA-PI and clinical factors with a multivariate Cox regression P value 0.05 were considered to be independent predictors for UBC patient prognosis. Furthermore, we constructed a new PI termed as clinical-RNA-PI as an integrated indicator of RNA expressions and clinical factors for UBC patient prognosis. The clinical-RNA-PI was also calculated as a linear combination. For both forms of PI, a weighed prognostic index (WPI) was calculated as the standard form of the PI. i.e.,
where, Mean(PI) and sd(PI) are the mean and standard deviation of the PI vector, respectively.
2.5. Model evaluation
UBC patients that have WPI 0 were classified into a high-risk group and the others were classified into a low-risk group. OS between the high-risk group and the low-risk group was compared using Kaplan-Meier method with 2-sided log-rank test. The ability and efficiency of the integrated clinical-RNA signature to predict UBC patient outcome was assessed by calculating the area under the curve (AUC) of the time dependent receiver operating characteristic (ROC). Analysis was conducted by the “survivalROC” package in R software for time dependent ROC analysis at 5 years of OS time [24]. Furthermore, we compared the prognostic ability that indicated by the ROC curves of the integrated clinical-RNA signature with the RNA-alone signature and the clinical-alone signature.
2.6. GO and pathway of mRNA and targets of miRNAs
Validated mRNA and targets of miRNAs were available from miRTarBase [25]. Only miRNA and mRNA target interactions with strong evidences, either by reporter assay or western blot, were extracted. Gene ontology enrichment and pathway enrichment were carried out based on these validated targets using online tool STRING [26, 27, 28]. The interactions among products of these targets were visualized by the same tool.
2.7. Statistical analysis
All statistical analyses were completed by R software [29] (version 3.3.2). Survival analysis with log-rank test was used to compare survival distributions between different groups, and it is implemented by “survfit” function that incorporated in “survival” R package. Univariate Cox regression analysis was used to evaluate the association between RNA expressions across samples with OS of UBC patients. Multivariate Cox regression analysis with step-wise procedure to select independent variables. Both univariate and multivariate Cox regression analyses with Wald test were implemented by “coxph” function that incorporated in “survival” R package. The generalized linear model (GLM) likelihood ratio test is based on the idea of fitting negative binomial GLMs with the Cox-Reid dispersion estimates. The testing can be done by using the “glmLRT” function implemented in “edger” R package [30]. The elastic-net penalized Cox regression was implemented by “glmnet” function and cross validation was implemented by “cv.glmnet” function incorporated in “glmnet” R package. All statistical tests were considered to be significant with 0.05 or FDR 0.05, as appropiate.
3. Results
3.1. Clinical characteristics correlated with UBC patient survival
Of the 402 patients in the TCGA UBC cohort, 176 patients were deceased and 226 were alive at the time of last follow-up. The average age at initial diagnosis of the patients in this cohort was 60.14 years, and the median overall survival time was 813 days (95% confidence interval [CI], 731–895 days). Demographic and clinical data for the TCGA UBC cohort were summarized in Table 1. Gender, age at initial diagnosis, clinical stage, tobacco use, anatomic neoplasm subdivision, Karnofsky performance status, human race, lymphovascular invasion, and neoplasm histologic grade were evaluated by survival analysis. Among these clinical characteristics, only age at diagnosis, lymphovascular invasion, and clinical stage had a P value less than 0.05 by Log-rank test (Table 1). Multivariate Cox regression analysis of these 3 factors revealed that all of them were independent prognostic factors (Table 1). The results of this preliminary assessment demonstrated that the survival data in the TCGA UBC cohort, despite the censored data, were informative and applicable to identifying a prognostic signature.
Table 1.
Univariate survival analysis and multivariate Cox regression of demographic and clinical variables
| Variables | Deaths/Patients (%) | Median | 95% CI | Log-rank | Multivariate Cox | Multivariate |
|---|---|---|---|---|---|---|
| survival | test P | HR (95%) CI | Cox | |||
| Age at diagnosis | ||||||
| y | 45/149 (30.20) | 2828 | 1423–NA | 9.24E05 | 2.26 (1.44–3.56) | 4.34E04 |
| 65 y | 131/252 (51.98) | 819 | 602–1064 | |||
| Clinical stage | ||||||
| Stage I | 0/2 (0) | – | – | 3.71E07 | 1.47 (1.11–1.95) | 7.43E3 |
| Stage II | 35/128 (27.34) | 2828 | 1869–NA | |||
| Stage III | 56/138 (40.58) | 1367 | 795–NA | |||
| Stage IV | 84/132 (63.64) | 593 | 508–739 | |||
| Lymphovascular invasion | ||||||
| No | 39/114 (34.21) | 2828 | 1556-NA | 2.12E05 | 1.83 (1.18–2.86) | 7.34E3 |
| Yes | 69/125 (55.20) | 623 | 547–1348 | |||
| Gender | ||||||
| Female | 51/104 (49.04) | 941 | 623–NA | 0.364 | ||
| Male | 125/298 (41.95) | 1077 | 819–1804 | |||
| Tobacco use | ||||||
| Non-smoker | 5/13 (38.46) | 778 | 254–NA | 0.573 | ||
| Current smoker | 88/220 (40.00) | 1163 | 823–2828 | |||
| Current reformed smoker | 83/169 (49.11) | 974 | 685–1869 | |||
| Anatomic neoplasm subdivision | ||||||
| Bladder-NOS | 89/204 (43.63) | 1367 | 1004–2641 | 0.379 | ||
| Dome | 11/24 (45.83) | 712 | 547–NA | |||
| Neck | 1/7 (14.29) | – | – | |||
| Trigone | 13/31 (41.94) | 795 | 623–NA | |||
| Wall anterior | 8/24 (33.33) | 835 | 617–NA | |||
| Wall lateral | 19/46 (41.30) | 904 | 577–NA | |||
| Wall NOS | 12/22 (54.55) | 510 | 415–NA | |||
| Wall posterior | 19/37 (55.89) | 706 | 579–NA | |||
| Karnofsky performance status | ||||||
| 3/14 (21.43) | 630 | 630–NA | 0.559 | |||
| 15/30 (50.00) | 734 | 612–NA | ||||
| 43/81 (53.01) | 1670 | 823–2828 | ||||
| Neoplasm histologic grade | ||||||
| High grade | 174/378 (46.03) | 1005 | 778–1670 | 0.119 | ||
| Low grade | 2/21 (9.52) | 904 | – | |||
| Human race | ||||||
| Asian | 9/44 (20.45) | 1670 | 674–NA | 0.262 | ||
| Black | 13/23 (56.52) | 696 | 510–NA | |||
| White | 147/318 (46.23) | 1005 | 778–1718 |
Abbreviation: CI, confidence interval. P-values were obtained from log-like test for survival analysis and Wald test for Cox regression, patients were omitted when data is unavailable. For multivariate Cox regression, variable coding is as follows: Age at diagnosis (1 65 years; 2 65 years), Clinical stage (1 leve I; 2 level II; 3 level III; 4 level IV), Lymphovascular invasion (0 No; 1 Yes).
3.2. Prognostic RNA signature for UBC patients
Differential expression analysis revealed that 3043 RNAs were differentially expressed between UBCs and normal samples. Among these RNAs, 211 RNAs (adjusted P value 0.05) were preliminarily identified as OS predictors by univariate Cox regression. Finally, elastic net regulated Cox regression resulted in the selection of 10 mRNAs and 3 miRNA (Table 2) as the prognostic RNA signature with minimal lambda 0.05 (Fig. 1). Multivariate stepwise Cox regression analysis revealed that RNA-PI, age at initial diagnosis, clinical stage, and lymphovascular invasion were independent prognostic predictors for UBC patient survival (Table 3). The HR (HR 2.351) of the RNA signature was greater than that of the demographic and clinical variables (Table 3), and it implied that the RNA signature had superior performance compared with traditional clinical variables. We also calculated PIs for clinical-alone and clinical-RNA signatures. As continuous variables, the RNA-PI and clinical-RNA-PI were both significantly correlated with UBC patient survival by univariate Cox regression analysis (HR 18.12; 95% CI 8.361–39.29; 2.136E-13 for RNA-PI; HR 3.113; 95% CI 2.304–4.206; 1.37E-13 for clinical-RNA-PI).
Table 2.
RNAs correlated with UBC prognosis
| No | miRNA | P Value | FDR | Type |
|---|---|---|---|---|
| 1 | hsa-mir-200c | 1.11E-04 | 2.32E-02 | Protective |
| 2 | hsa-mir-598 | 1.35E-04 | 2.32E-02 | Protective |
| 3 | hsa-mir-143 | 3.24E-04 | 3.71E-02 | Risky |
| 4 | RUNX2 | 7.09E-05 | 8.61E-04 | Risky |
| 5 | LAMA2 | 1.67E-04 | 1.37E-02 | Risky |
| 6 | UBD | 2.20E-04 | 1.52E-02 | Protective |
| 7 | INHBB | 2.59E-04 | 1.65E-02 | Risky |
| 8 | CATSPER2 | 5.42E-04 | 2.18E-02 | Protective |
| 9 | KCNK6 | 5.81E-04 | 2.26E-02 | Risky |
| 10 | PLSCR4 | 6.02E-04 | 2.27E-02 | Risky |
| 11 | ZNF600 | 1.83E-03 | 3.94E-02 | Protective |
| 12 | PAM | 2.08E-03 | 4.05E-02 | Risky |
| 13 | PDGFD | 2.21E-03 | 4.16E-02 | Risky |
P-values were obtained from Wald test in univariate Cox’s model. Types included protective (HR 1) and risky (HR 1).
Figure 1.
Cross-validation error curve. The left vertical line shows where the cross-validation error curve hits its minimum (lambda 0.05). The right vertical line shows the most regularized model with cross-validation error within 1 standard deviation of the minimum. The minimum was achieved by a fairly regularized model ( 13), but the right line indicates that the null model (no coefficients included). The numbers at the top of the figure indicate the number of the nonzero coefficients.
Table 3.
Multivariate Cox regression of demographic and clinical variables
| Variables | HR | 95% CI | Multivariate Cox P |
|---|---|---|---|
| RNA-PI | 2.351 | 1.565–3.533 | 3.87E-05 |
| Age at diagnosis | 2.214 | 1.404–3.491 | 6.29E-04 |
| Clinical stage | 1.382 | 1.053–1.813 | 0.0195 |
| Lymphovascular | 1.690 | 1.102–2.593 | 0.0163 |
| invasion | |||
| 1.907E-12 |
P-values of multivariate Cox regression model were obtained from Wald test.
The WPIs ranged from 2.13 to 1.46 for clinical-signature (Fig. 2A), 5.77 to 3.99 for RNA-signature (Fig. 2B) and 3.14 to 2.09 for clinical-RNA-signature (Fig. 2C). We split the UBC patients into high-risk and low-risk groups by WPIs for patients that have full age at diagnosis, lymphovascular invasion, and clinical stage information if necessary ( 238 for clinical-signature and clinical-RNA signature, 402 for RNA-signature). Specifically, patients with WPIs 0 were classified as high-risk group and patients with WPIs 0 were classified as low-risk group (Fig. 2). Survival analysis showed that the median OS time was significantly shorter in the high-risk group than in the low-risk group for all WPI based stratifications (761.2 versus 1007.2 days, HR 1.794, 95% CI 1.457–2.208, 0.001, for clinical-signature, Fig. 3A; 748.4 versus 905.8 days, HR 1.876, 95% CI 1.586–2.220, 0.01, for RNA-signature, Fig. 3C; 765.7 versus 1039.2 days; HR 2.441; 95% CI 1.927–3.092; 0.001, for clinical-RNA-signature, Fig. 3E).
Figure 2.
WPI distributions and stratifications of UBC patients of clinical signature (A), RNA signature (B) and clinical-RNA signature (C), respectively.
Figure 3.
Kaplan-Meier plots for stratifications based on clinical signature (A), RNA signature (C) and clinical-RNA signature (E), respectively. ROC for WPIs based on clinical signature (B), RNA signature (D) and clinical-RNA signature (F), respectively.
In all of the survival analyses, fewer events occurred after 5 years, we tested the ability of the integrated RNA signature to predict the survival outcome of UBC patients at, and around, this time point by time dependent ROC analysis. The AUCs were 0.709, 0.726 and 0.802 at 5 years of OS for clinical-signature (Fig. 3B), RNA-signature (Fig. 3D) and clinical-RNA signature (Fig. 3F), respectively. Thus, the integrated clinical-RNA signature had superior performance compared with clinical alone and RNA alone signatures whatever in terms to greater HR or greater AUC.
3.3. Gene ontology and pathway enrichment
STRING database (http://string-db.org/) [28] is employed to analyze the pathway enrichment of miRNAs’ targets (Supplementary Table 1) and mRNAs of the RNA signature. The gene ontology included here are just slim gene ontology, which gives an overview without the details of the specific terms [26]. The results show that these genes are mainly enriched in protein sumoylation biological process (Table 4). Others are enriched in transcription biological process. Protein sumoylation is closely associated with interaction and regulation to protein-protein interaction in living cells. And it is also closely associated with tumorigenesis [31].
Table 4.
Slim gene ontology enrichment
| Category | ID | Pathway description | Observed gene count | FDR |
|---|---|---|---|---|
| Biological | GO.0016925 | Protein sumoylation | 13 | 5.9E19 |
| processes | GO.0018205 | Peptidyl-lysine modification | 13 | 6.12E13 |
| GO.0043687 | Post-translational protein modification | 13 | 5.45E12 | |
| GO.0070647 | Protein modification by small protein conjugation or removal | 14 | 6.59E09 | |
| GO.0018193 | Peptidyl-amino acid modification | 14 | 1.91E08 | |
| GO.0045892 | Negative regulation of transcription, DNA-templated | 15 | 3.42E08 | |
| GO.0010629 | Negative regulation of gene expression | 16 | 5.68E08 | |
| GO.0032446 | Protein modification by small protein conjugation | 12 | 1.75E07 | |
| GO.1903507 | Negative regulation of nucleic acid-templated transcription | 14 | 4.01E07 | |
| Cellular | GO.0031519 | PcG protein complex | 7 | 3.58E11 |
| components | GO.0035102 | PRC1 complex | 5 | 2.99E09 |
| GO.0016604 | Nuclear body | 10 | 2.97E08 | |
| GO.0016605 | PML body | 6 | 1.69E06 | |
| GO.0043234 | Protein complex | 21 | 1.69E06 | |
| GO.0044451 | Nucleoplasm part | 10 | 8.03E06 | |
| GO.0000803 | Sex chromosome | 4 | 1.19E05 | |
| GO.0001739 | Sex chromatin | 3 | 1.39E05 | |
| GO.0044427 | Chromosomal part | 9 | 6.79E05 | |
| Molecular | GO.0019789 | SUMO transferase activity | 5 | 8.20E09 |
| function | GO.0005515 | Protein binding | 24 | 4.97E07 |
| GO.0003682 | Chromatin binding | 10 | 2.45E06 | |
| GO.0071535 | RING-like zinc finger domain binding | 3 | 3.93E06 | |
| GO.0008134 | Transcription factor binding | 8 | 1.15E04 | |
| GO.0003713 | Transcription coactivator activity | 7 | 2.05E04 | |
| GO.0044877 | Macromolecular complex binding | 11 | 2.05E04 | |
| GO.1901363 | Heterocyclic compound binding | 22 | 2.84E04 | |
| GO.0097159 | Organic cyclic compound binding | 22 | 3.22E04 |
In cellular component analysis, we find that many gene products locate in chromosome and involve in regulating target genes by chromosome modification (Table 4). These gene products generally constitute some protein complex to bind chromatin.
In molecular function analysis (Table 4), molecular function of SUMO transferase activity is a major function in UBC patients. The enrichment of molecular function supports biological process. Besides, we can find that many genes are involved in protein binding, chromatin binding and domain binding. The main functions of these genes were binding and activity. The results indicated that these genes as direct regulators for regulation gene expression.
In KEGG pathway analysis, miRNA’s target genes and mRNA are mainly enriched in four KEGG pathways (FDR 0.01) (Table 5). The enrichment of KEGG has some different from biological process. They are miRNAs in cancer, RNA transport, neurotrophin signaling pathway, and ubiquitin mediated proteolysis. In these four pathways, RNA transport, ubiquitin mediated proteolysis, and MAPK signaling pathway are related with tumor. Of these KEGG pathways, MAPK play critical roles in tumor formation and metastasis.
Table 5.
KEGG pathway enrichment
| ID | Term | Observed | FDR |
|---|---|---|---|
| gene count | |||
| 5206 | MicroRNAs in cancer | 6 | 2.86E-05 |
| 3013 | RNA transport | 5 | 5.91E-04 |
| 4722 | Neurotrophin signaling pathway | 4 | 3.27E-03 |
| 4120 | Ubiquitin mediated proteolysis | 4 | 4.46E-03 |
| 4010 | MAPK signaling pathway | 4 | 3.93E-02 |
3.4. Network analysis
STRING was used to visualize the protein-protein interaction network among miRNAs’ targets and mRNAs of the RNA signature. Figure 4A showed the network with methods of neighborhood, gene fusion, co-occurrence, co-expression, experiments, databases and text-mining. This network wags enriched in interaction, which assumed that these protein worked altogether and joined in the related pathways. After removing evidences with lower confidence and only using data experiments, a network with fewer genes was built (Fig. 4B). In both networks, TP53 was the center and had high degrees, indicating the important role of it in the prognosis of UBC.
Figure 4.
Network of proteins. A. The network predicted with methods of neighborhood, gene fusion, co-occurrence, co-expression, experiments, databases and text-mining. This network was enriched in interaction with a p-value of 4.22e-15. B. The network predicted only with experiments. Disconnected nodes were hidden. This network was enriched in interaction with a p-value of 0.0324.
4. Discussion
In this study, we proposed a novel integrated mRNA, miRNA and clinical signature that could predict UBC patient survival more accurate than clinical-only and RNAs-only signatures. Pathway analysis revealed that genes in our signature were involved in cell death and survival. All results suggested that the integrated signature was suitable and had superior prognostic value for UBC patient prognosis. Our study provides novel insights into the significance of bringing molecular and clinical information together for UBC prognosis.
Although markers for classifying UBC molecular subtypes have been identified, markers associated mainly with the pathogenesis of UBC may not predict the prognosis [32]. Furthermore, it is impertinent to use the clinical characteristics alone to distinguish the high-risk patients due to the underlying molecular heterogeneity within the clinically homogeneous tumor groupings. Thus we objected to identify common RNAs that consistently drive the outcome of UBC patients irrespective of the clinical or molecular subtypes. For UBC prognostic markers discovery, studies have been focused on either mRNA [9, 33] or miRNA profiles [7, 10, 11, 12, 13, 14] with limited sample size. To our knowledge, there is no study have been proposed a combination signature of mRNA expression, miRNA expression, and clinical characteristics to improve the UBC patient prognosis.
Of the prognostic RNAs, hsa-mir-200c was previously reported to be associated with UBC patient survival [34]. It has been reported that hsa-mir-200c regulates epithelial to mesenchymal transition (EMT) and restores expression of E-cadherin in breast and ovarian cancer cells [35, 36, 37]. Yu et al. reported that high levels of hsa-mir-200c expression inhibit cancer invasion and stimulate cancer cell proliferation, possibly via up-regulation of E-cadherin, and that high levels of hsa-mir-200c expression correlate with better survival of patients with curative resection of pancreatic cancer [38]. Furthermore, to the best of our knowledge, we are the first to report the hsa-mir-598, UBD, CATSPER2 and ZNF600 were also associated with UBC patient survival. Expression of miR-598-3p was down-regulated in the serum of breast cancer patients, using miR-598-3p as a biomarker, the sensitivity and specificity for the detection of breast cancer was 95.0 and 87.5% [39]. High expression of UBD correlates with epirubicin resistance and indicates poor prognosis in triple-negative breast cancer [40]. ZNF600 were down regulated in smokers compared to non-smokers [41], and cigarette smoking was an important risk factor for bladder cancer in both sexes [42]. Although the specific function of ZNF600 is unclear, ZNF600 expression is closely associated with smoking. Consistent with the GO result, UBD is not only an ubiquitin protein associated with ubiquitylation, which plays a vital role in survival [43]. CATSPER2 plays a central role in calcium-dependent physiological responses essential for successful fertilization, such as sperm hyperactivation, acrosome reaction and chemotaxis towards the oocyte. The relationship between prognosis of UBC and CATSPER2 expression has not been reported yet. ZNF600 belongs to the krueppel C2H2-type zinc-finger protein family which involved in transcriptional regulation. Transcriptional regulation factor such as ZNF600 generally affects cancer by promoting cell proliferation.
Among the risky genes, hsa-mir-143, RUNX2,LAMA2, INHBB, KCNK6, PLSCR4, PAM andPDGFD are associated with tumor development. In previous report of study, microRNA-143 as a tumor suppressor for bladder cancer and microRNA-143 inhibits cell migration and invasion by targeting matrix metalloproteinase 13 in prostate cancer. Thus, the function of microRNA-143 is heterogeneous in different study. And this condition needs to be further study. RUNX2 and TP53 might be functionally related and are likely involved in bladder tumor carcinogenesis and aggressiveness, and RUNX2 and p53 independently predict early tumor recurrence in bladder carcinoma patients, with the highest prediction accuracy being achieved on their combined high expression [44]. TP53-induced miR-34a contributes liver regeneration termination via regulation of INHBB and MET. Other risky genes have not been reported in bladder cancer yet. However, LAMA2, INHBB and PDGFD all involved in cell migration and embryonic development [45, 46]. In fact, cell migration is closely associated with cancer. On the other hand, embryonic development and cancer development have some similar biological processing such as rapid cell proliferation and EMT pathway. KCNK6, PLSCR4 and PAM performed their special role in bladder cancer. Although these three genes are high-risk genes in bladder cancer, it is difficult to elaborate their functions. According to main functions of PLSCR4 and PAM, they are ion binding proteins. And KCNK6 is a transport for K. Therefore, we inferred that the three genes might be associated with drug resistance. And we expected that these genes can be validated by further biological experiment.
In conclusion, we have identified a 3-clinical and 13-RNA integrated signature that can predict the survival outcome of UBC patients more accurate than RNA-alone or clinical-alone signatures. Our findings may help researchers understanding of UBC cell death and survival, develop targeted therapy, and identify high-risk UBC patients for better disease management.
Acknowledgments
which made the genomic data of UBC available. We also thank the anonymous reviewers for helpful comments. This work was supported by The Postdoctoral Science Foundation of Central South University.
Supplement material
Supplementary Table 1.
Mature sequences of miRNA signature and experimentally validated mRNA targets
| Mature miRNA | Targets | Mature miRNA | Targets |
|---|---|---|---|
| hsa-mir-143-3p | KRAS | hsa-miR-200c-3p | TUBB3 |
| MYO6 | BMI1 | ||
| DNMT3A | SIP1 | ||
| FNDC3B | BAP1 | ||
| MAPK7 | ZEB2 | ||
| FSCN1 | ZEB1 | ||
| HK2 | Zeb2 | ||
| SERPINE1 | Zeb1 | ||
| MACC1 | FN1 | ||
| PTGS2 | ZFPM2 | ||
| JAG1 | UBE2I | ||
| AKT1 | PTPN13 | ||
| COL1A1 | RNF2 | ||
| HRAS | RCOR3 | ||
| MDM2 | BRD7 | ||
| BCL2 | ACVR2B | ||
| MMP13 | MSN | ||
| SDC1 | NTRK2 | ||
| RREB1 | ERRFI1 | ||
| CD44 | CCNE2 | ||
| KLF5 | XIAP | ||
| BRAF | BCL2 | ||
| TNF | TIMP2 | ||
| NR2C2 | FBLN5 |
| Supplementary Table 1, continued | |||
|---|---|---|---|
| Mature miRNA | Targets | Mature miRNA | Targets |
| IL13RA1 | VEGFA | ||
| DYT10 | NCAM1 | ||
| COL3A1 | Vldlr | ||
| DDX6 | Reln | ||
| LIMK1 | IKBKB | ||
| HNF4A | FLT1 | ||
| KLF9 | |||
| hsa-miR-143-5p | COX2 | hsa-miR-200c-3p | TBK1 |
| IGF1R | PMAIP1 | ||
| NTF3 | |||
| LPAR1 | |||
| EDNRA | |||
| RHOA | |||
| KLHL20 | |||
| PTPRD | |||
| ELMO2 | |||
| ERBB2IP | |||
| WDR37 | |||
| VAC14 | |||
| TCF7L1 | |||
| RASSF2 | |||
| HOXB5 | |||
| RIN2 | |||
| KLF11 | |||
| SEPT7 | |||
| SHC1 | |||
| MYB | |||
| ETS1 | |||
| USP25 | |||
| EFNA1 | |||
| RND3 | |||
| DNMT3A | |||
| DNMT3B | |||
| SP1 | |||
| CFL2 | |||
| CDH11 | |||
| SEC23A | |||
| KDR | |||
| HFE | |||
| DLC1 | |||
| ATRX | |||
| ZNF217 | |||
| BTC | |||
| ZFPM1 | |||
| PIN1 | |||
| KRAS | |||
| NOTCH1 | |||
| GATA4 | |||
| SUZ12 | |||
| ROCK2 | |||
| UBQLN1 | |||
| E2F3 | |||
Conflict of interest
The authors have declared that no competing interests exist.
References
- [1]. Dalesio O., Schulman C.C., Sylvester R. et al., Prognostic factors in superficial bladder A study of the european organization for research on treatment of cancer: Genitourinary tract cancer cooperative group, Journal of Urology 129(4) (1983), 730–733. [DOI] [PubMed] [Google Scholar]
- [2]. Fitzpatrick J.M., West A.B., Butler M.R. et al., Superficial bladder tumors (stage pTa, grades 1 and 2): The importance of recurrence pattern following initial resection, Journal of Urology 135(5) (1986), 920–922. [DOI] [PubMed] [Google Scholar]
- [3]. Parmar M.K.B., Freedman L.S., Hargreave T.B. et al., Prognostic factors for recurrence and followup policies in the treatment of superficial bladder cancer: Report from the british medical research council subgroup on superficial bladder cancer (urological cancer working party), Journal of Urology 142(2) (1989), 284–288. [DOI] [PubMed] [Google Scholar]
- [4]. Soloway M.S., The relationship among multiple recurrences, progression and prognosis of patients with stages TA and T1 transitional cell cancer of the bladder followed for at least 20 years – editorial comment, Journal of Urology 153(6) (1995), 1826–1827. [Google Scholar]
- [5]. Sylvester R.J., van der Meijden A.P.M., Oosterlinck W. et al., Predicting recurrence and progression in individual patients with stage Ta T1 bladder cancer using EORTC risk tables: A combined analysis of 2596 patients from seven EORTC trials, European Urology 49(3) (2006), 466–477. [DOI] [PubMed] [Google Scholar]
- [6]. Hall M.C., Chang S.S., Dalbagni G. et al., Guideline for the management of nonmuscle invasive bladder cancer (stages Ta, T1, and Tis): 2007 update, Journal of Urology 178(6) (2007), 2314–2330. [DOI] [PubMed] [Google Scholar]
- [7]. Rosenberg E., Baniel J., Spector Y. et al., Predicting progression of bladder urothelial carcinoma using microRNA expression, Bju International 112(7) (2013), 1027–1034. [DOI] [PubMed] [Google Scholar]
- [8]. Konety B.R., Molecular markers in bladder cancer: A critical appraisal, Urologic Oncology 24(4) (2006), 326–337. [DOI] [PubMed] [Google Scholar]
- [9]. Kim W.J., Kim S.K., Jeong P. et al., A four-gene signature predicts disease progression in muscle invasive bladder cancer, Molecular Medicine 17(5–6) (2011), 478–485. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [10]. Xu Z., Yu Y.Q., Ge Y.Z. et al., MicroRNA expression profiles in muscle-invasive bladder cancer: Identification of a four-microRNA signature associated with patient survival, Tumor Biology 36(10) (2015), 8159–8166. [DOI] [PubMed] [Google Scholar]
- [11]. Ratert N., Meyer H.A., Jung M. et al., miRNA profiling identifies candidate miRNAs for bladder cancer diagnosis and clinical outcome, Journal of Molecular Diagnostics Jmd 15(5) (2013), 695–705. [DOI] [PubMed] [Google Scholar]
- [12]. Wang J., Zhang X., Wang L. et al., MicroRNA-214 suppresses oncogenesis and exerts impact on prognosis by targeting PDRG1 in bladder cancer, Plos One 10(2) (2015), e0118086. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
- [13]. Fujii T., Tatsumi Y., Fujimoto K. et al., Microrna-145 promotes differentiation in human urothelial carcinoma through down-regulation of syndecan-1, BMC Cancer 15(1) (2015), 818. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [14]. Zhou H., Tang K., Xiao H. et al., A panel of eight-miRNA signature as a potential biomarker for predicting survival in bladder cancer, Journal of Experimental & Clinical Cancer Research 34(1) (2015), 1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [15]. Yoshihara K., Tajima A., Yahata T. et al., Gene expression profile for predicting survival in advanced-stage serous ovarian cancer across two independent datasets, Plos One 5(3) (2010), 1191–1204. [Google Scholar]
- [16]. Jr H.F., Lee K.L. and Mark D.B., Multivariable prognostic models: Issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors, Statistics in Medicine 15(4) (1996), 361–387. [DOI] [PubMed] [Google Scholar]
- [17]. Yuan M. and Lin Y., Model selection and estimation in regression with grouped variables, Journal of the Royal Statistical Society 68(1) (2006), 49–67. [Google Scholar]
- [18]. Bøvelstad H.M., Nygård S., Størvold H.L. et al., Predicting survival from microarray data – a comparative study, Bioinformatics 23(16) (2004), 2080–2087. [Google Scholar]
- [19]. Simon N., Friedman J., Hastie T. et al., Regularization paths for Cox’s proportional hazards model via coordinate descent, Journal of Statistical Software 39(5) (2011), 1–13. [Google Scholar]
- [20]. Tibshirani R., Bien J., Friedman J. et al., Strong rules for discarding predictors in lasso-type problems, Journal of the Royal Statistical Society 74(2) (2010), 245–266. [Google Scholar]
- [21]. Friedman J., Hastie T. and Tibshirani R., Regularization paths for generalized linear models via coordinate descent, Journal of Statistical Software 33(1) (2010), 1. [PMC free article] [PubMed] [Google Scholar]
- [22]. Network TCGA, Comprehensive molecular characterization of urothelial bladder carcinoma, Nature 507(7492) (2014), 315–322. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [23]. Xiong J., Bing Z., Su Y. et al., An integrated mRNA and microRNA expression signature for glioblastoma multiforme prognosis, Plos One 9(5) (2014), e98419. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [24]. Heagerty P.J., Lumley T. and Pepe M.S., Time-dependent ROC curves for censored survival data and a diagnostic marker, Biometrics 56(2) (2000), 337–344. [DOI] [PubMed] [Google Scholar]
- [25]. Hsu S.D., Tseng Y.T., Shrestha S. et al., miRTarBase update 2014: An information resource for experimentally validated miRNA-target interactions, Nucleic Acids Research 42(1) (2014), 78–85. [Google Scholar]
- [26]. Consortium TGO, Gene ontology consortium: Going forward, Nucleic Acids Research 43(1) (2015), 1049–1056. [Google Scholar]
- [27]. Kanehisa M., Goto S., Sato Y. et al., Data, information, knowledge and principle: Back to metabolism in KEGG, Nucleic Acids Research 42(1) (2014), 199–205. [Google Scholar]
- [28]. Szklarczyk D., Franceschini A., Wyder S. et al., STRING v10: Protein-protein interaction networks, integrated over the tree of life, Nucleic Acids Res 43(1) (2015), 447–452. [Google Scholar]
- [29]. Ihaka R. and Gentleman R., R: A language for data analysis and graphics, Journal of Computational & Graphical Statistics 5(3) (1996), 299–314. [Google Scholar]
- [30]. Robinson M.D., McCarthy D.J. and Smyth G.K., edgeR: A bioconductor package for differential expression analysis of digital gene expression data, Bioinformatics 26(1) (2010), 139–140. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [31]. Masoumi K.C., Marfany G., Wu Y. et al., Putative role of SUMOylation in controlling the activity of deubiquitinating enzymes in cancer, Future Oncology 12(4) (2016), 565–574. [DOI] [PubMed] [Google Scholar]
- [32]. Hurst C.D., Platt F.M., Taylor C.F. et al., Novel tumor subgroups of urothelial carcinoma of the bladder defined by integrated genomic analysis, Clinical Cancer Research 18(21) (2012), 5865–5877. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [33]. Riester M., Taylor J.M., Feifer A. et al., Combination of a novel gene expression signature with a clinical nomogram improves the prediction of survival in high-risk bladder cancer, Clinical Cancer Research an Official Journal of the American Association for Cancer Research 18(5) (2012), 1323–1333. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [34]. Liu L., Qiu M., Tan G. et al., miR-200c inhibits invasion, migration and proliferation of bladder cancer cells through down-regulation of BMI-1 and E2F3, Journal of Translational Medicine 12(1) (2014), 305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [35]. Hurteau G.J., Carlson J.A., Roos E. et al., Stable expression of miR-200c alone is sufficient to regulate TCF8 (ZEB1) and restore E-cadherin expression, Cell Cycle 8(13) (2009), 2064–2069. [DOI] [PubMed] [Google Scholar]
- [36]. Park S.M., Gaur A.B., Lengyel E. et al., The miR-200 family determines the epithelial phenotype of cancer cells by targeting the E-cadherin repressors ZEB1 and ZEB2, Genes & Development 22(7) (2008), 894–907. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [37]. Li Y., Vandenboom T.G., Kong D. et al., Up-regulation of miR-200 and let-7 by natural agents leads to the reversal of epithelial-to-mesenchymal transition in gemcitabine-resistant pancreatic cancer cells, Cancer Research 69(16) (2009), 6704–6712. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [38]. Yu J., Ohuchida K., Mizumoto K. et al., MicroRNA, hsa-miR-200c, is an independent prognostic factor in pancreatic cancer and its upregulation inhibits pancreatic cancer invasion but increases cell proliferation, Molecular Cancer 9(1) (2010), 169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [39]. Fu L., Li Z., Zhu J. et al., Serum expression levels of microRNA-382-3p, -598-3p, -1246 and -184 in breast cancer patients, Oncology Letters 12(1) (2016), 269. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [40]. Han T., Liu Z., Li H. et al., High expression of UBD correlates with epirubicin resistance and indicates poor prognosis in triple-negative breast cancer, Oncotargets & Therapy 8 (2015), 1643–1649. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [41]. Eckelpassow J.E., Serie D.J., Bot B.M. et al., ANKS1B is a smoking-related molecular alteration in clear cell renal cell carcinoma, BMC Urology 14(1) (2014), 14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [42]. Quirk J.T., Li Q., Natarajan N. et al., Cigarette smoking and the risk of bladder cancer in men and women, Tobacco Induced Diseases 2(3) (2004), 141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [43]. Schmidtke G., Kalveram B., Weber E. et al., The UBA domains of NUB1L are required for binding but not for accelerated degradation of the ubiquitin-like modifier FAT10, Journal of Biological Chemistry 281(29) (2006), 20045–20054. [DOI] [PubMed] [Google Scholar]
- [44]. Abdelzaher E. and Kotb A.F., High coexpression of runt-related transcription factor 2 (RUNX2) and p53 independently predicts early tumor recurrence in bladder urothelial carcinoma patients, Applied Immunohistochemistry & Molecular Morphology Aimm 24(5) (2015), 345. [Google Scholar]
- [45]. Ehrig K., Leivo I., Argraves W.S. et al., Merosin, a tissue-specific basement membrane protein, is a laminin-like protein, Proceedings of the National Academy of Sciences of the United States of America 87(9) (1990), 3264–3268. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [46]. Ying S.Y., Inhibins and activins: Chemical properties and biological activity, Experimental Biology and Medicine 186(3) (1987), 253–264. [Google Scholar]




