Abstract
Growing evidence showed that alternative splicing (AS) event is significantly related to tumor occurrence and progress. This study was performed to make a systematic analysis of AS events and constructed a robust prediction model of hepatocellular carcinoma (HCC). The clinical information and the genes expression profile data of 335 HCC patients were collected from The Cancer Genome Atlas (TCGA). Information of seven types AS events were collected from the TCGA SpliceSeq database. Overall survival (OS) related AS events and splicing factors (SFs) were identified using univariate Cox regression analysis. The corresponding genes of OS-related AS events were sent for gene network analysis and functional enrichment analysis. Optimal OS-related AS events were selected by LASSO regression to construct prediction model using multivariate Cox regression analysis. Prognostic value of the prediction models were assessed by receiver operating characteristic (ROC) curve and KaplanMeir survival analysis. The relationship between the Percent Spliced In (PSI) value of OS-related AS events and SFs expression were analyzed using Spearman correlation analysis. And the regulation network was generated by Cytoscape. A total of 34,163 AS events were identified, which consist of 3,482 OS-related AS events. UBB, UBE2D3, SF3A1 were the hub genes in the gene network of the top 800 OS-related AS events. The area under the curve (AUC) of the final prediction model based on seven types OS-related AS events was 0.878, 0.843, 0.821 in 1, 3, 5 years, respectively. Upon multivariate analysis, risk score (All) served as the risk factor to independently predict OS for HCC patients. SFs HNRNPH3 and HNRNPL were overexpressed in tumor samples and were signifcantly associated with the OS of HCC patients. The regulation network showed prominent correlation between the expression of SFs and OS-related AS events in HCC patients. The final prediction model performs well in predicting the prognosis of HCC patients. And the findings in this study improve our understanding of the association between AS events and HCC.
Keywords: Alternative splicing, Hepatocellular carcinoma, Prediction model, Splicing factor, Risk score, Prognosis, Overall survival, TCGA, PSI value, LASSO regression
Introduction
As the main component of tissues and cells, proteins play a vital role in many life activities. The diversity of proteins contributes to the functional diversity. Alternative splicing (AS) is a significant and ubiquitous post-transcriptional regulatory mechanism that enables eukaryotic cells to generate vast protein diversity based on a limited number of genes (Baralle & Giudice, 2017). Genome-wide research indicated that up to 95% of human genes experience some level of AS in physiological processes (Pan et al., 2008; Wang et al., 2008). Precursor mRNA can be transformed to mature mRNA and further produce versatile protein by removing introns and selectively including or excluding specific exons in human multi-exon genes (Kelemen et al., 2013). Recently, growing evidence showed that AS is significantly related to tumor occurrence, progress and therapeutic resistance, and AS is involved in the process of invasion and metastasis of cancer cells (Wan et al., 2019; Oltean & Bates, 2014). Splicing factor (SF) is the executor of AS events and aberrant expression of SF was associated with oncogenesis process (Dvinge et al., 2016).
Liver cancer ranks the 6th place in terms of global tumor incidence, and it is the 4th leading cause of cancer-related death (Villanueva, 2019). Hepatocellular carcinoma (HCC), one of the frequently seen primary liver tumors, occupies about 80% of liver cancers. Worldwide, the highest liver cancer morbidity is reported in Asia and Africa. About 75% liver cancers take place in Asian areas, among which China accounts for more than half of the total global cases (McGlynn, Petrick & London, 2015). Although great progresses have been achieved in diagnosing and treating liver tumor over past few decades, prognosis for liver tumor remains very poor. Liver cancer has become the second most fatal tumor after pancreatic cancer with a 5-year survival rate of 18% (Jemal et al., 2017). In the recent years, some studies have reported the crucial significance of AS in HCC occurrence and development (Yuan et al., 2017; Luo et al., 2017). However, systematic analysis of the predictive value of AS events in HCC is scarce.
In this study, we collected RNA-seq data, AS events data and corresponding clinical information of 335 HCC patients from TCGA database. Overall survival (OS) related AS events and splicing factors (SFs) were identified. Additionally, a reliable prediction model on the basis of AS events, correlation network between AS events and SFs were constructed.
Materials and Methods
TCGA-based data collection
The genes expression profile data as well as the clinical information of HCC patients had been collected from TCGA (https://cancergenome.nih.gov/). Information of seven types AS events were collected from the TCGA SpliceSeq database (https://bioinformatics.mdanderson.org/TCGASpliceSeq/), including alternate acceptor site (AA), alternate promoter (AP), alternate donor site (AD), alternate terminator (AT), exon skip (ES), mutually exclusive exons (ME), and retained intron (RI). Due to defining perioperative mortality as death that occurs within 30 days of surgery may underestimate ‘true’ mortality among patients undergoing hepatic resection (Mayo et al., 2011). We only included patients with a survival time more than 90 days, finally, 335 HCC patients were selected in this study. The Percent Spliced In (PSI) value, rating from 0 to 1 which is used to quantify AS events in general. To generate a reliable set of AS events, we applied a series of stringent filters (Percentage of samples with PSI value ≥75%, standard deviation of PSI value >0.1 and average PSI value > 0.05).
Every AS event was assigned a unique identifier by combining the gene symbol, the ID number in the SpliceSeq database and splicing type. For instance, in the identifier term “CSAD-21952-ES”, the gene symbol is CSAD, ID number is 21,952 and splicing type is ES.
Identification of overall survival related AS events, functional enrichment analysis and gene network construction
We identified overall survival (OS) related AS events using univariate Cox regression analysis with a P value <0.05. Upset plot which is similar to a Venn diagram, was introduced to depict the intersections between the seven types of AS events in HCC (Khan & Mathelier, 2017). The corresponding genes of OS-related AS events were sent for functional enrichment analysis. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways with P value <0.05 and false discovery rate (FDR) <0.05 were considered significantly. The top significant pathways in KEGG and GO were showed with bar plots. Gene network analysis was performed by inputting the corresponding genes of 800 most significant OS-related AS events to Cytoscape and Reactome FI (version 3.7.1), further, hub genes were selected at the same time.
Construction of the prediction model based on AS events
Least absolute shrinkage and selection operator (LASSO) regression is suitable for the reduction of high-dimensional data to avoid model overfitting (Sauerbrei, Royston & Binder, 2007). When we performed LASSO regression, some variables were eliminated through penalty rules and potential predictors with non-zero coefficients were finally leaved (Gao, Kwan & Shi, 2010). We determined the penalty parameter lambda by the cross-validation using the glmnet package. The optimal lambda value corresponding to the minimum value of the cross-validation error mean was identified to determine the potential OS-related AS events (Tibshirani, 1997). We selected the optimal OS-related AS events in each type respectively with nonzero coefficients in the LASSO regression, and constructed the prediction models by using multivariate Cox regression analysis. The final prediction model (all types) was constructed by combing seven types of AS events also selected via LASSO regression.
The prediction model was established according to risk score, and the risk score was calculated by the PSI value of each AS events and the corresponding regression coefficient (lnHR) generated from multivariate Cox regression analysis. The formula is as follows: , where β is the regression coefficient.
Prognostic value evaluation of the prediction model
The HCC patients were divided into low and high risk groups according to the median value of the risk score and Kaplan–Meier survival analysis was performed to compare the OS rate between the two groups within five years. The p-values were computed using log-rank test. Furthermore, ROC curves of 1, 3, 5 years were generated to compare the predictive accuracy of each prediction model.
Construction of the regulation network between SFs and AS events
SF data was obtained from the SpliceAid2 database (http://www.introni.it/splicing.html). The association of SF expression with OS was analyzed using univariate Cox regression. The relationship between the PSI value of OS-related AS events and SF expression were analyzed using Spearman correlation analysis. Regulation network was generated by Cytoscape and Reactome FI (version 3.7.1).
Statistical analysis
The R software 3.5.0 was utilized for all statistical analysis. A P value <0.05 was deemed to be of statistical significance.
Results
Overview of AS events in HCC cohort
A flowchart of our study design is showed in Fig. 1. We detected a total of 34,163 AS events in 8985 genes in 335 HCC patients, comprised of 12,327 ES events in 5,343 genes, 8087 AT events in 3,532 genes, 6,352 AP events in 2,566 genes, 2,666 AA events in 1,937 genes, 2,331 AD events in 1,663 genes, 2,263 RI events in 1,561 genes and 137 ME events in 135 genes (Table 1). We can find that one single gene undergoes up to six types of AS events from the UpSet plot (Fig. 2A). In addition, ES was the most common among seven types of AS events.
Table 1. Overview of total AS events and OS-related AS events.
Type | Total AS events | OS-related AS events | ||
---|---|---|---|---|
AS events | Genes | AS events | Genes | |
AA | 2666 | 1937 | 217 | 203 |
AD | 2331 | 1663 | 231 | 205 |
AP | 6352 | 2566 | 643 | 411 |
AT | 8087 | 3532 | 887 | 534 |
ES | 12327 | 5343 | 1264 | 988 |
ME | 137 | 135 | 16 | 16 |
RI | 2263 | 1561 | 224 | 196 |
ALL | 34163 | 8985 | 3482 | 2203 |
The OS-related AS events of HCC
A total of 3,482 OS-related AS events in 2,203 genes were detected using univariate Cox analysis, including 1,264 ES events in 988 genes, 887 AT events in 534 genes, 643 AP events in 411 genes, 217 AA events in 203 genes, 231 AD events in 205 genes, 224 RI events in 196 genes and 16 ME events in 16 genes (Table 1). A volcano plot of these AS events was provided in Fig. 2B. With the display of UpSet plot, one single gene could have up to four types OS-related AS events (Fig. 2C). The top 20 significant OS-related AS events (if available) for each AS type were showed by bubble plots in Figs. 3A–3G. Obviously, most of AS events in ES were favorable prognostic elements. However, most of these AS events in RI, AA and AD were adverse prognostic elements.
We established a gene interaction network by sending the corresponding genes of the top 800 most significant OS-related AS events to Cytoscape. The results indicated that UBB, UBE2D3, SF3A1 were the hub genes of this gene network (Fig. 4A). Furthermore, 2,203 genes from 3,482 OS-related AS events were used for KEGG and GO enrichment analysis to explore the pathways and biological functions of the OS-related AS genes. The top 10 significant terms of GO enrichment analysis were presented in Fig. 4B. Such as spliceosomal complex, adherens junction in cellular component (CC); damaged DNA binding, cell adhesion molecule binding in molecular function (MF); protein targeting, actin cytoskeleton reorganization in biological process (BP). A total of nine KEGG pathways were identified, such as base excision repair, pyruvate metabolism and PPAR signaling pathway (Fig. 4C).
Construction of the prediction model for HCC patients
We used LASSO regression to select the top 10 optimal OS-related AS events (if available) and then construct the prediction model (Fig. 5). Three OS-related AS events were selected for ES; four OS-related AS events for ME; five OS-related AS events for AP and AT; six OS-related AS events for AA, AD and RI; nine OS-related AS events for the final prediction model (Table 2). Risk scores were computed according to the selected AS events, and HCC patients were divided into low and high risk groups on the basis of the median value of risk scores. The distribution of survival status in low and high risk groups, risk score curves and the PSI value heat map of the AS events for eight prediction models were visualized in Fig. 6. The results of Kaplan–Meier survival analysis showed that all of the eight prediction models possess significant ability to predict the prognosis of HCC patients between low and high risk group (Fig. 7). However, according the results of ROC curves, the final prediction model exhibited the most powerful predictive efficiency than other models established by one single AS type with the maximum AUCs of 0.878, 0.843, 0.821 in 1, 3, 5 years ROC curves (Fig. 8).
Table 2. Information of AS events used for construction of prediction model.
Type | ID | Coef | HR | 95% Lower | 95% Up | P value |
---|---|---|---|---|---|---|
AA | SWI5-87732-AA | 3.44 | 31.34 | 4.32 | 227.57 | 0.00066043 |
MFSD10-68617-AA | 5.53 | 251.81 | 4.79 | 13244.14 | 0.006246456 | |
FDPS-8074-AA | 1.72 | 5.58 | 0.96 | 32.52 | 0.055702088 | |
FUOM-13553-AA | 4.50 | 89.61 | 2.18 | 3689.99 | 0.017794651 | |
ZFAND6-32171-AA | −18.97 | 5.79e−9 | 1.84e−14 | 1.83e−3 | 0.003324932 | |
ABCG5-53407-AA | 1.45 | 4.24 | 1.54 | 11.67 | 0.005133725 | |
AD | CCDC90B-18085-AD | 2.73 | 15.27 | 2.46 | 94.69 | 0.003404273 |
FAXDC2-74229-AD | 3.72 | 41.20 | 1.74 | 973.72 | 0.021200397 | |
C1orf159-13-AD | −2.02 | 0.13 | 0.04 | 0.42 | 0.000547079 | |
BSCL2-16408-AD | 6.33 | 560.69 | 9.32 | 33724.29 | 0.002462146 | |
TXNDC17-38768-AD | 12.15 | 1.89e+5 | 8.72 | 4.12e+9 | 0.017081663 | |
RPL13-392312-AD | 3.25 | 25.81 | 3.24 | 205.55 | 0.002134468 | |
AP | KANK1-85710-AP | −1.12 | 0.33 | 0.16 | 0.68 | 0.003039732 |
NUDT6-70521-AP | −1.40 | 0.25 | 0.09 | 0.71 | 0.009102069 | |
S100A13-7731-AP | 1.33 | 3.76 | 0.93 | 15.27 | 0.063560786 | |
MAEA-68467-AP | −11.36 | 1.16e−5 | 1.41e−8 | 0.01 | 0.000914974 | |
MXI1-13079-AP | −1.25 | 0.29 | 0.09 | 0.89 | 0.030826651 | |
AT | JUP-40930-AT | −4.18 | 0.02 | 8.21e−4 | 0.29 | 0.005110540 |
TSPAN31-22724-AT | −5.98 | 2.52e−3 | 9.11e−5 | 0.07 | 0.000412724 | |
ZNF706-84737-AT | 9.10 | 8974.41 | 116.68 | 690281.40 | 3.99e−5 | |
NEIL3-71227-AT | 1.45 | 4.26 | 1.78 | 10.19 | 0.001109076 | |
EPS15L1-48154-AT | 5.79 | 327.41 | 15.76 | 6803.41 | 0.000183173 | |
ES | CRELD1-63291-ES | −13.64 | 1.19e−6 | 5.85e−10 | 2.44e−3 | 0.000452970 |
CSAD-21952-ES | −3.21 | 0.04 | 0.01 | 0.19 | 4.24e−5 | |
IRF3-50995-ES | −2.62 | 0.07 | 0.03 | 0.20 | 6.14e−7 | |
ME | SLC39A14-140283-ME | −1.96 | 0.14 | 0.04 | 0.45 | 0.001037644 |
H2AFY-96931-ME | 3.18 | 24.02 | 2.71 | 211.85 | 0.004205834 | |
CYP4F3-48101-ME | −4.59 | 0.01 | 2.74e−4 | 0.37 | 0.012551164 | |
MTFR1L-1211-ME | −2.30 | 0.10 | 0.03 | 0.38 | 0.000662517 | |
RI | VPS28-85600-RI | 6.06 | 429.08 | 5.62 | 32758.26 | 0.006135308 |
NUDT22-16590-RI | 2.69 | 14.79 | 3.18 | 68.86 | 0.000599344 | |
NAA60-33527-RI | −1.87 | 0.15 | 0.04 | 0.65 | 0.010841633 | |
MOGAT3-81047-RI | 5.24 | 188.66 | 5.22 | 6812.78 | 0.004190482 | |
SNX5-58749-RI | −3.94 | 0.02 | 3.24e−3 | 0.12 | 1.70e−5 | |
UBB-39434-RI | 44.68 | 2.54e+19 | 2.40e+10 | 2.69e+28 | 2.50e−5 | |
ALL | CSAD-21952-ES | −2.15 | 0.12 | 0.02 | 0.58 | 0.008527832 |
NUDT22-16590-RI | 2.26 | 9.58 | 2.24 | 40.90 | 0.00228534 | |
TSPAN31-22724-AT | −6.17 | 2.08e−3 | 4.74e−5 | 0.09 | 0.001376182 | |
CCDC90B-18085-AD | 1.53 | 4.62 | 0.68 | 31.15 | 0.116422196 | |
IRF3-50994-ES | −1.71 | 0.18 | 0.03 | 1.02 | 0.053074975 | |
ZNF706-84737-AT | 8.94 | 7608.05 | 76.67 | 754961.11 | 0.000138991 | |
KANK1-85710-AP | −0.80 | 0.45 | 0.22 | 0.94 | 0.033403 | |
PGAP3-40670-ES | −6.81 | 1.11e−3 | 4.06e−7 | 3.02 | 0.091751791 | |
NAA60-33527-RI | −2.27 | 0.10 | 0.03 | 0.41 | 0.001319454 |
Moreover, we assessed the prognostic value of risk score and other clinicopathological characteristics using univariate and multivariate Cox regression analysis. Univariate analysis suggested that advanced clinical stage (HR = 2.053, 95% CI [1.553–2.714], p < 0.001), high T classification (HR = 1.962, 95% CI [1.515–2.542], p < 0.001) and high risk score (All) (HR = 1.105, 95% CI [1.076–1.135], p < 0.001) were associated with poor prognosis (Fig. 9A). Upon multivariate analysis, risk score (All) (HR = 1.101, 95% CI [1.069–1.135], p < 0.001) served as the risk factor to independently predict OS for HCC patients (Fig. 9B).
The regulation network of SFs and OS-related AS events
SF is the executor of AS events and aberrant expression of SF was related to oncogenesis process. A total of 71 SFs data were collected from the SpliceAid2 database (http://www.introni.it/splicing.html). We identified 21 SFs associated with OS of HCC patients using univariate analysis. Then Spearman correlation analysis was performed to determine the correlation between the PSI value of OS-related AS events and SF expression. The significant correlations (—R—≥0.6, p < 0.001) were selected to construct the regulation network (Fig. 10A). The regulation network consists of 29 OS-related AS events, of which 16 were adverse AS events (red dots) and 13 were favorable AS events (green dots), were significantly correlated with the 13 SFs (blue dots). We can find that all of the SFs were correlated with multiple AS events and played opposite roles in regulating different AS events. Similarly, a part of the AS events could be regulated by different SFs. This phenomenon partly explains that the same transcript can produce multiple different splicing events. In addition, we can detect that the adverse AS events were positively correlated with SF expression (red lines), whereas the favorable AS events were negatively correlated with SF expression (green lines).
The top four most significant correlations between SFs and OS-related AS events (—R— ≥0.7) are shown in Figs. 10D, 10E, 10H and 10I. The top two counterpart SFs were HNRNPL and HNRNPH3, and the expression of HNRNPL and HNRNPH3 in tumor sample were significantly higher than adjacent normal sample (Figs. 10B and 10F). HCC patients were classified into low and high risk groups according to the median value of the two SFs expression. Also, Kaplan–Meier survival analysis showed statistical difference in survival time between the two groups (Figs. 10C and 10G).
Discussion
AS is a significant regulatory process for generating protein isoforms with a variety of functional characteristics. Abnormality of AS events are closely associated with tumor occurrence, development and metastasis (Oltean & Bates, 2014; Liu & Cheng, 2013; Spaethling et al., 2016). In recent decades, with the rapid development of high-throughput sequencing technology, the potential significance of AS events in malignant tumor has achieved great advancement. However, there are few studies focus on the systematic analysis of AS events in HCC patients.
Then we screened OS-related AS events and OS-related SFs in HCC patients via the analysis of TCGA and SpliceAid2 database. A total of 3,482 OS-related AS events in 2,203 genes and 21 OS-related SFs were detected using univariate Cox analysis, which shows that AS events are ubiquitous in HCC patients and limited SFs can regulate massive AS. The gene interaction network was established using Cytoscape based on the top 800 OS-related AS events, and UBB, UBE2D3, SF3A1 were identified as the hub genes of this gene network. Ubiquitin is a small and highly conserved protein expressed in all eukaryotic cells. Over expression of ubiquitin B (UBB) was reported in non-small cell lung cancer and cervical cancer, UBB may serve as a potential therapy and prevention target (Tang et al., 2015; Tian et al., 2013). Nevertheless, the relationship between HCC and UBB is not clear. Ubiquitin-conjugating enzyme E2D3 (UBE2D3) is a member of the E2 family, which is involved in the ubiquitin proteasome pathway to regulate the basic activities of cells, such as DNA damage response, cell cycle control, apoptosis, and tumorgenesis. A previous study demonstrated that UBE2D3 plays a significant role in the development of esophageal cancer (Guan et al., 2015). SF3A1 is a critical spliceosome gene which participated in normal splicing events and spliceosome assembly (Chen et al., 2015). SF3A1 has been reported to be related to susceptibility of breast cancer and lung cancer (Hu et al., 2011; Michailidou et al., 2013). And the mutation of SF3A1 were involved in some cancers, including esophageal adenocarcinoma, osteosarcomas, ovarian carcinoma and gastric cancer (Chen et al., 2015). Additionally, KEGG and GO enrichment analysis were performed and we found that these genes were closely associated with spliceosomal complex, cell adhesion molecule binding, actin cytoskeleton reorganization, base excision repair, etc. From the above mentioned, UBB, UBE2D3 and SF3A1 as the hub genes in this gene interaction network and may be potential targets for the prevention and treatment of HCC in the future.
In the recent years, with the fast development of high-throughput sequencing, massive potential prognosis biomarkers or therapy targets of tumor were emerging, such as mRNA, miRNA, lncRNA, and methylation (Zhao et al., 2018; Ji et al., 2018; Cai et al., 2019; Wu et al., 2017). However, the focus of these studies is limited to the transcriptome-level analysis. The prognostic value of AS events has a great potential for development. Tremblay et al. (2016) firstly investigated differential AS events between HCC sample and normal liver sample based on TCGA database, and they provided an overview of misregulated AS events in different types of HCC (e.g., HBV-related HCC, HCV-related HCC, HBV&HCV-associated HCC and virus-free HCC). But they did not explore the association between the AS events and the prognosis of HCC patients. Subsequently, some researchers identified OS-related AS events to establish a prediction model in HCC patients (Chen et al., 2019; Zhu et al., 2019). However, Chen et al. reported that the AUC of ROC curve for the final prediction model constructed with 10 AS events was only 0.752. Zhu et al. constructed the final prediction model with up to 33 AS events, but the AUC of ROC curve was only 0.806. In this study, the final prediction model exhibited the most powerful predictive efficiency than other models established by one single AS type with the maximum AUCs of 0.878, 0.843, 0.821 in 1, 3, 5 years ROC curves. And our final prediction model was constructed with only nine OS-related AS events.
SF plays a significant role in the regulation of AS events, which affects the selection of exons and splicing sites by identifying and combining to cis-regulatory elements of pre-mRNA. Aberrant expression of SF was related to oncogenesis process. The regulation network was constructed to show the correlation between the PSI value of OS-related AS events and SF expression. The top two SFs were HNRNPL and HNRNPH3 identified according to the correlation coefficient (—R— ≥0.7). Compared to adjacent normal sample, the expression of HNRNPL and HNRNPH3 were higher in tumor sample. And patients with higher expression of HNRNPL and HNRNPH3 had more dismal prognosis. From the regulation network, we found that the adverse AS events were positively correlated with the expression of HNRNPL and HNRNPH3, whereas the favorable AS events were negatively correlated with the expression of HNRNPL and HNRNPH3, which was consistent with the results of survival analysis. HNRNPL and HNRNPH3 are members of HNRNPs family. HNRNPs directly modulate the AS of a group of RNAs and serve as multifunctional RNA-binding proteins for mRNA stabilization, transportation and translation (Fei et al., 2017). Aberrant expression of HNRNPL and its RNA target are closely associated with the proliferation, invasion and metastasis of tumor cells (Kedzierska & Piekielko-Witkowska, 2017; Geuens, Bouhy & Timmerman, 2016). A recent study have demonstrated that HNRNPL was highly expressed in HCC samples and down-regulation of HNRNPL expression can significantly inhibit the proliferation and migration of liver cancer cells (Yau et al., 2013), which was in accordance with our results. However, to the best knowledge of us, there is no research report the actual regulatory mechanism between the two prognostic SFs and OS-related AS events, and further elucidation with in vivo or vitro experiments is urgently needed.
Although the findings in the present study improves our understanding of the association between AS events and HCC, some limitations existed in this study. First, this study was conducted based on the data obtained from one public database with a relatively small sample size. Second, due to the lack of external data, we did not make a cohort verification for the prediction model. Third, we did not perform a functional experiment and could not clearly elucidate the underlying mechanism between AS events and SFs in HCC patients. It is essential to carry out functional experiments and clinical trials with a large sample size of HCC patients to confirm the findings of this study in the future.
Conclusions
In summary, we performed a systematic analysis of AS events in HCC and constructed prediction models based on OS-related AS events with well performance in predicting the prognosis of HCC patients. The AS events used for the construction of the final prediction model may be the most significant AS events in exploring the latent mechanism in initiation and development of HCC, have a great potential for clinical application as therapeutic and preventive targets of HCC patients. Furthermore, we established a regulation network between SFs and OS-related AS events. Although the findings in the present study improve our understanding of the association between AS events and HCC to some extent. In vitro/vivo function experiments are also urgently needed in the future to understand the mechanism between AS events and SFs in the HCC patients.
Supplemental Information
Funding Statement
The current work was supported by the National Natural Science Foundation of China (No. 81770566), the New Medical Technology Foundation of West China Hospital of Sichuan University (No. XJS2016004), and the Science and Technology Program of Sichuan Science and Technology Department (No. 2019YFS0029). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Additional Information and Declarations
Competing Interests
The authors declare there are no competing interests.
Author Contributions
Lingpeng Yang performed the experiments, prepared figures and/or tables, authored or reviewed drafts of the paper, approved the final draft.
Yang He and Zifei Zhang analyzed the data, contributed reagents/materials/analysis tools, prepared figures and/or tables, approved the final draft.
Wentao Wang conceived and designed the experiments, authored or reviewed drafts of the paper, approved the final draft.
Data Availability
The following information was supplied regarding data availability:
The raw data is available in the Supplemental Files.
References
- Baralle & Giudice (2017).Baralle FE, Giudice J. Alternative splicing as a regulator of development and tissue identity. Nature Reviews Molecular Cell Biology. 2017;18:437–451. doi: 10.1038/nrm.2017.27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cai et al. (2019).Cai J, Tong Y, Huang L, Xia L, Guo H, Wu H, Kong X, Xia Q. Identification and validation of a potent multi-mRNA signature for the prediction of early relapse in hepatocellular carcinoma. Carcinogenesis. 2019;40:840–852. doi: 10.1093/carcin/bgz018. [DOI] [PubMed] [Google Scholar]
- Chen et al. (2019).Chen QF, Li W, Wu P, Shen L, Huang ZL. Alternative splicing events are prognostic in hepatocellular carcinoma. Aging. 2019;11:4720–4735. doi: 10.18632/aging.102085. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen et al. (2015).Chen X, Du H, Liu B, Zou L, Chen W, Yang Y, Zhu Y, Gong Y, Tian J, Li F, Zhong S. The associations between RNA splicing complex gene SF3A1 polymorphisms and colorectal cancer risk in a chinese population. PLOS ONE. 2015;10:e130377. doi: 10.1371/journal.pone.0130377. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dvinge et al. (2016).Dvinge H, Kim E, Abdel-Wahab O, Bradley RK. RNA splicing factors as oncoproteins and tumour suppressors. Nature Reviews Cancer. 2016;16:413–430. doi: 10.1038/nrc.2016.51. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fei et al. (2017).Fei T, Chen Y, Xiao T, Li W, Cato L, Zhang P, Cotter MB, Bowden M, Lis RT, Zhao SG, Wu Q, Feng FY, Loda M, He HH, Liu XS, Brown M. Genome-wide CRISPR screen identifies HNRNPL as a prostate cancer dependency regulating RNA splicing. Proceedings of the National Academy of Sciences of the United States of America. 2017;114:E5207–E5215. doi: 10.1073/pnas.1617467114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gao, Kwan & Shi (2010).Gao J, Kwan PW, Shi D. Sparse kernel learning with LASSO and Bayesian inference algorithm. Neural Networks. 2010;23:257–264. doi: 10.1016/j.neunet.2009.07.001. [DOI] [PubMed] [Google Scholar]
- Geuens, Bouhy & Timmerman (2016).Geuens T, Bouhy D, Timmerman V. The hnRNP family: insights into their role in health and disease. Human Genetics. 2016;135:851–867. doi: 10.1007/s00439-016-1683-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guan et al. (2015).Guan GG, Wang WB, Lei BX, Wang QL, Wu L, Fu ZM, Zhou FX, Zhou YF. UBE2D3 is a positive prognostic factor and is negatively correlated with hTERT expression in esophageal cancer. Oncology Letters. 2015;9:1567–1574. doi: 10.3892/ol.2015.2926. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hu et al. (2011).Hu Z, Wu C, Shi Y, Guo H, Zhao X, Yin Z, Yang L, Dai J, Hu L, Tan W, Li Z, Deng Q, Wang J, Wu W, Jin G, Jiang Y, Yu D, Zhou G, Chen H, Guan P, Chen Y, Shu Y, Xu L, Liu X, Liu L, Xu P, Han B, Bai C, Zhao Y, Zhang H, Yan Y, Ma H, Chen J, Chu M, Lu F, Zhang Z, Chen F, Wang X, Jin L, Lu J, Zhou B, Lu D, Wu T, Lin D, Shen H. A genome-wide association study identifies two new lung cancer susceptibility loci at 13q12.12 and 22q12.2 in Han Chinese. Nature Genetics. 2011;43:792–796. doi: 10.1038/ng.875. [DOI] [PubMed] [Google Scholar]
- Jemal et al. (2017).Jemal A, Ward EM, Johnson CJ, Cronin KA, Ma J, Ryerson B, Mariotto A, Lake AJ, Wilson R, Sherman RL, Anderson RN, Henley SJ, Kohler BA, Penberthy L, Feuer EJ, Weir HK. Annual report to the nation on the status of cancer, 1975-2014, featuring survival. Journal of the National Cancer Institute. 2017;109(9) doi: 10.1093/jnci/djx030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ji et al. (2018).Ji J, Chen H, Liu XP, Wang YH, Luo CL, Zhang WW, Xie W, Wang FB. A miRNA combination as promising biomarker for hepatocellular carcinoma diagnosis: a study based on bioinformatics analysis. Journal of Cancer. 2018;9:3435–3446. doi: 10.7150/jca.26101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kedzierska & Piekielko-Witkowska (2017).Kedzierska H, Piekielko-Witkowska A. Splicing factors of SR and hnRNP families as regulators of apoptosis in cancer. Cancer Letters. 2017;396:53–65. doi: 10.1016/j.canlet.2017.03.013. [DOI] [PubMed] [Google Scholar]
- Kelemen et al. (2013).Kelemen O, Convertini P, Zhang Z, Wen Y, Shen M, Falaleeva M, Stamm S. Function of alternative splicing. Gene. 2013;514:1–30. doi: 10.1016/j.gene.2012.07.083. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Khan & Mathelier (2017).Khan A, Mathelier A. Intervene: a tool for intersection and visualization of multiple gene or genomic region sets. BMC Bioinformatics. 2017;18:287. doi: 10.1186/s12859-017-1708-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu & Cheng (2013).Liu S, Cheng C. Alternative RNA splicing and cancer. Wiley Interdisciplinary Reviews: RNA. 2013;4:547–566. doi: 10.1002/wrna.1178. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luo et al. (2017).Luo C, Cheng Y, Liu Y, Chen L, Liu L, Wei N, Xie Z, Wu W, Feng Y. SRSF2 regulates alternative splicing to drive hepatocellular carcinoma development. Cancer Research. 2017;77:1168–1178. doi: 10.1158/0008-5472.CAN-16-1919. [DOI] [PubMed] [Google Scholar]
- Mayo et al. (2011).Mayo SC, Shore AD, Nathan H, Edil BH, Hirose K, Anders RA, Wolfgang CL, Schulick RD, Choti MA, Pawlik TM. Refining the definition of perioperative mortality following hepatectomy using death within 90 days as the standard criterion. HPB. 2011;13:473–482. doi: 10.1111/j.1477-2574.2011.00326.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McGlynn, Petrick & London (2015).McGlynn KA, Petrick JL, London WT. Global epidemiology of hepatocellular carcinoma: an emphasis on demographic and regional variability. Clinics in Liver Disease. 2015;19:223–238. doi: 10.1016/j.cld.2015.01.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Michailidou et al. (2013).Michailidou K, Hall P, Gonzalez-Neira A, Ghoussaini M, Dennis J, Milne RL, Schmidt MK, Chang-Claude J, Bojesen SE, Bolla MK, Wang Q, Dicks E, Lee A, Turnbull C, Rahman N, Fletcher O, Peto J, Gibson L, Dos SSI, Nevanlinna H, Muranen TA, Aittomaki K, Blomqvist C, Czene K, Irwanto A, Liu J, Waisfisz Q, Meijers-Heijboer H, Adank M, Van der Luijt RB, Hein R, Dahmen N, Beckman L, Meindl A, Schmutzler RK, Muller-Myhsok B, Lichtner P, Hopper JL, Southey MC, Makalic E, Schmidt DF, Uitterlinden AG, Hofman A, Hunter DJ, Chanock SJ, Vincent D, Bacot F, Tessier DC, Canisius S, Wessels LF, Haiman CA, Shah M, Luben R, Brown J, Luccarini C, Schoof N, Humphreys K, Li J, Nordestgaard BG, Nielsen SF, Flyger H, Couch FJ, Wang X, Vachon C, Stevens KN, Lambrechts D, Moisse M, Paridaens R, Christiaens MR, Rudolph A, Nickels S, Flesch-Janys D, Johnson N, Aitken Z, Aaltonen K, Heikkinen T, Broeks A, Veer LJ, Van der Schoot CE, Guenel P, Truong T, Laurent-Puig P, Menegaux F, Marme F, Schneeweiss A, Sohn C, Burwinkel B, Zamora MP, Perez JI, Pita G, Alonso MR, Cox A, Brock IW, Cross SS, Reed MW, Sawyer EJ, Tomlinson I, Kerin MJ, Miller N, Henderson BE, Schumacher F, Le Marchand L, Andrulis IL, Knight JA, Glendon G, Mulligan AM, Lindblom A, Margolin S, Hooning MJ, Hollestelle A, Van den Ouweland AM, Jager A, Bui QM, Stone J, Dite GS, Apicella C, Tsimiklis H, Giles GG, Severi G, Baglietto L, Fasching PA, Haeberle L, Ekici AB, Beckmann MW, Brenner H, Muller H, Arndt V, Stegmaier C, Swerdlow A, Ashworth A, Orr N, Jones M, Figueroa J, Lissowska J, Brinton L, Goldberg MS, Labreche F, Dumont M, Winqvist R, Pylkas K, Jukkola-Vuorinen A, Grip M, Brauch H, Hamann U, Bruning T, Radice P, Peterlongo P, Manoukian S, Bonanni B, Devilee P, Tollenaar RA, Seynaeve C, Van Asperen CJ, Jakubowska A, Lubinski J, Jaworska K, Durda K, Mannermaa A, Kataja V, Kosma VM, Hartikainen JM, Bogdanova NV, Antonenkova NN, Dork T, Kristensen VN, Anton-Culver H, Slager S, Toland AE, Edge S, Fostira F, Kang D, Yoo KY, Noh DY, Matsuo K, Ito H, Iwata H, Sueta A, Wu AH, Tseng CC, Van Den Berg D, Stram DO, Shu XO, Lu W, Gao YT, Cai H, Teo SH, Yip CH, Phuah SY, Cornes BK, Hartman M, Miao H, Lim WY, Sng JH, Muir K, Lophatananon A, Stewart-Brown S, Siriwanarangsan P, Shen CY, Hsiung CN, Wu PE, Ding SL, Sangrajrang S, Gaborieau V, Brennan P, McKay J, Blot WJ, Signorello LB, Cai Q, Zheng W, Deming-Halverson S, Shrubsole M, Long J, Simard J, Garcia-Closas M, Pharoah PD, Chenevix-Trench G, Dunning AM, Benitez J, Easton DF. Large-scale genotyping identifies 41 new loci associated with breast cancer risk. Nature Genetics. 2013;45:353–361. doi: 10.1038/ng.2563. 361e. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Oltean & Bates (2014).Oltean S, Bates DO. Hallmarks of alternative splicing in cancer. Oncogene. 2014;33:5311–5318. doi: 10.1038/onc.2013.533. [DOI] [PubMed] [Google Scholar]
- Pan et al. (2008).Pan Q, Shai O, Lee LJ, Frey BJ, Blencowe BJ. Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nature Genetics. 2008;40:1413–1415. doi: 10.1038/ng.259. [DOI] [PubMed] [Google Scholar]
- Sauerbrei, Royston & Binder (2007).Sauerbrei W, Royston P, Binder H. Selection of important variables and determination of functional form for continuous predictors in multivariable model building. Statistics in Medicine. 2007;26:5512–5528. doi: 10.1002/sim.3148. [DOI] [PubMed] [Google Scholar]
- Spaethling et al. (2016).Spaethling JM, Sanchez-Alavez M, Lee J, Xia FC, Dueck H, Wang W, Fisher SA, Sul JY, Seale P, Kim J, Bartfai T, Eberwine J. Single-cell transcriptomics and functional target validation of brown adipocytes show their complex roles in metabolic homeostasis. Faseb Journal. 2016;30:81–92. doi: 10.1096/fj.15-273797. [DOI] [PubMed] [Google Scholar]
- Tang et al. (2015).Tang Y, Geng Y, Luo J, Shen W, Zhu W, Meng C, Li M, Zhou X, Zhang S, Cao J. Downregulation of ubiquitin inhibits the proliferation and radioresistance of non-small cell lung cancer cells in vitro and in vivo. Scientific Reports. 2015;5:9476. doi: 10.1038/srep09476. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tian et al. (2013).Tian Y, Ding W, Wang Y, Ji T, Sun S, Mo Q, Chen P, Fang Y, Liu J, Wang B, Zhou J, Ma D, Wu P. Ubiquitin B in cervical cancer: critical for the maintenance of cancer stem-like cell characters. PLOS ONE. 2013;8:e84457. doi: 10.1371/journal.pone.0084457. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tibshirani (1997).Tibshirani R. The lasso method for variable selection in the Cox model. Statistics in Medicine. 1997;16:385–395. doi: 10.1002/(SICI)1097-0258(19970228)16:4<385::AID-SIM380>3.0.CO;2-3. [DOI] [PubMed] [Google Scholar]
- Tremblay et al. (2016).Tremblay MP, Armero VE, Allaire A, Boudreault S, Martenon-Brodeur C, Durand M, Lapointe E, Thibault P, Tremblay-Letourneau M, Perreault JP, Scott MS, Bisaillon M. Global profiling of alternative RNA splicing events provides insights into molecular differences between various types of hepatocellular carcinoma. BMC Genomics. 2016;17:683. doi: 10.1186/s12864-016-3029-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Villanueva (2019).Villanueva A. Hepatocellular Carcinoma. New England Journal of Medicine. 2019;380:1450–1462. doi: 10.1056/NEJMra1713263. [DOI] [PubMed] [Google Scholar]
- Wan et al. (2019).Wan L, Yu W, Shen E, Sun W, Liu Y, Kong J, Wu Y, Han F, Zhang L, Yu T, Zhou Y, Xie S, Xu E, Zhang H, Lai M. SRSF6-regulated alternative splicing that promotes tumour progression offers a therapy target for colorectal cancer. Gut. 2019;68:118–129. doi: 10.1136/gutjnl-2017-314983. [DOI] [PubMed] [Google Scholar]
- Wang et al. (2008).Wang ET, Sandberg R, Luo S, Khrebtukova I, Zhang L, Mayr C, Kingsmore SF, Schroth GP, Burge CB. Alternative isoform regulation in human tissue transcriptomes. Nature. 2008;456:470–476. doi: 10.1038/nature07509. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu et al. (2017).Wu HC, Yang HI, Wang Q, Chen CJ, Santella RM. Plasma DNA methylation marker and hepatocellular carcinoma risk prediction model for the general population. Carcinogenesis. 2017;38:1021–1028. doi: 10.1093/carcin/bgx078. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yau et al. (2013).Yau WY, Shih HC, Tsai MH, Sheu JC, Chen CH, Chow LP. Autoantibody recognition of an N-terminal epitope of hnRNP L marks the risk for developing HBV-related hepatocellular carcinoma. Journal of Proteomics. 2013;94:346–358. doi: 10.1016/j.jprot.2013.10.003. [DOI] [PubMed] [Google Scholar]
- Yuan et al. (2017).Yuan JH, Liu XN, Wang TT, Pan W, Tao QF, Zhou WP, Wang F, Sun SH. The MBNL3 splicing factor promotes hepatocellular carcinoma by increasing PXN expression through the alternative splicing of lncRNA-PXN-AS1. Nature Cell Biology. 2017;19:820–832. doi: 10.1038/ncb3538. [DOI] [PubMed] [Google Scholar]
- Zhao et al. (2018).Zhao QJ, Zhang J, Xu L, Liu FF. Identification of a five-long non-coding RNA signature to improve the prognosis prediction for patients with hepatocellular carcinoma. World Journal of Gastroenterology. 2018;24:3426–3439. doi: 10.3748/wjg.v24.i30.3426. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhu et al. (2019).Zhu GQ, Zhou YJ, Qiu LX, Wang B, Yang Y, Liao WT, Luo YH, Shi YH, Zhou J, Fan J, Dai Z. Prognostic alternative mRNA splicing signature in hepatocellular carcinoma: a study based on large-scale sequencing data. Carcinogenesis. 2019;40:1077–1085. doi: 10.1093/carcin/bgz073. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The following information was supplied regarding data availability:
The raw data is available in the Supplemental Files.