Abstract
This study was conducted to screen prognosis-associated long-noncoding RNAs (lncRNAs) and a prognosis assessment model in hepatocellular carcinoma (HCC). lncRNA- and mRNA-sequencing data of early-stage HCC samples were downloaded from The Cancer Genome Atlas database. The samples were divided into training set and validation set. Differentially expressed lncRNAs (DELs) between poor prognosis and good prognosis samples were screened with DEseq and edgeR. Cox regression analysis was conducted to identify prognosis-associated lncRNAs in the training set. A prognosis risk assessment model was established to calculate the risk score for each patient in the training set, and the prognosis prediction function was tested and validated in the validation dataset. The connection between the risk assessment model and clinical features was analyzed. A co-expression network between lncRNAs and corresponding genes was constructed, and functional enrichment was performed for these genes. A total of 81 DELs were screened between poor and good prognosis samples in the training set, and 43 prognosis-associated lncRNAs were observed. Of these DELs, five were used to construct the risk assessment model (RP11-325L7.2, DKFZP434L187, RP11-100L22.4, DLX2-AS1 and RP11-104L21.3). Low-risk samples exhibited longer survival time compared with the high-risk samples. The five lncRNAs exhibited significant differences in expression levels between different prognosis groups. Risk score was an independent prognostic factor for HCC. In the entire set, the low-risk group demonstrated significantly better prognosis compared with the high-risk group, even across all age, sex and alcohol consumption subgroups. ‘Nucleoside-triphosphatase regulator activity’, ‘GTPase regulator activity’, ‘enzyme binding’, ‘peroxisome proliferator-activated receptor signaling pathway’ and ‘fatty acid metabolism’ were the most significantly enriched functional terms. The signature lncRNAs screened in this study may have constitute novel strategies and biomarkers that predict the prognosis of HCC, and these may also contribute to a deeper understanding of the mechanisms underlying HCC development.
Keywords: HCC, DELs, risk assessment model, co-expression network, prognosis
Introduction
Hepatocellular carcinoma (HCC), a primary malignancy of the liver, predominantly occurs in patients with chronic liver disease and cirrhosis. HCC is the most common type of primary liver cancer in adults, and the most common cause of mortality in patients with cirrhosis (1). An estimated 662,000 HCC-associated mortalities are reported annually worldwide, and half of these cases have occurred in China (2).
The prognosis of HCC is poor, as only 10–20% carcinomas can be completely removed by surgery. The patients usually succumb to the disease within 3–6 months (3). With the approval of sorafenib for advanced HCC treatment, the prognosis of metastatic or unresectable HCC has improved (4). However, more advanced and efficient therapies, and better prognostic molecular markers are required.
Long-noncoding RNAs (lncRNAs), accounting for 80% of noncoding RNAs, are defined as endogenous cellular RNAs longer than 200 nucleotides that lack an obvious open-reading frame (5). The role of lncRNAs in the carcinogenesis, microvascular invasion and metastasis of HCCs has been reported in a number of studies (6–8). For instance, the overexpression of lncRNA-urothelial cancer associated 1 contributed to HCC progression through the inhibition of miR-216b, and subsequent activation of the fibroblast growth factor receptor 1/extracellular signal-regulated kinase signaling pathway (9). lncRNA HOX transcript antisense RNA (HOTAIR) overexpression was demonstrated to predict tumor recurrence in patients with HCC following liver transplantation (3). Furthermore, upregulation of lncRNA ZEB1-antisense RNA 1 (AS1) and ANRIL expression was reported to predict a poorer prognosis in HCC (10,11). In comparison with HCC patients without HOTAIR expression, those with high HOTAIR expression exhibited significantly poorer prognosis (12). Downregulation of lncRNA growth arrest specific 5 expression was also associated with HCC prognosis (13). Despite these studies, the functions of multiple lncRNAs in HCC remain unclear. Therefore, a comprehensive analysis of HCC-associated lncRNAs is necessary in order to reveal possible biomarkers and/or potential therapeutic targets.
In the present study, RNA-sequencing (RNA-seq) data of HCC samples were studied to select signature lncRNAs for the prognosis of HCC. Differentially expressed lncRNAs (DELs) were identified, and prognosis-associated lncRNAs were screened. Based on a risk score calculated using prognosis-linked lncRNAs, a risk assessment model was established. This model was tested by performing a survival analysis of high-risk and low-risk samples from the validation and entire datasets. Multivariate and univariate analyses were performed to investigate prognosis-associated clinical factors. Moreover, the genes associated with signature lncRNAs were screened for co-expression network and functional enrichment analyses.
Materials and methods
Data acquisition and processing
The RNA-seq data of HCC samples were downloaded from The Cancer Genome Atlas database (https://gdc-portal.nci.nih.gov/; May 8, 2017). A total of 424 samples were analyzed based on Illumina HiSeq 2000 RNA Sequencing platform. Of these, 261 samples from early stage (stage I and II) cancer were used in the present study. Among these samples, 194 samples with available clinical information were retained, and the samples with <6-month recurrence-free survival (RFS) time and the non-recurrent samples were discarded. Hence, 167 samples remained for further analyses, and these were randomly divided into training (83 samples) and validation datasets (84 samples).
Screening of DELs
The 83 training samples were divided into poor prognosis samples (recurrence samples with RFS <24 months) and good prognosis samples (non-recurrent samples with RFS >24 months). DEseq (14) and edgeR (15) packages in R (version 3.1.0; R-project.org/) were used for the screening of the DELs between poor and good prognosis samples. The threshold was set as false discovery rate <0.05 and |log(fold change)|>1.3. Overlapping lncRNAs screened using the two packages were used for further analyses.
Screening of prognosis-associated lncRNAs
Cox regression analysis was conducted for the identification of prognosis-associated lncRNAs in the training set using the survival package in R (16). The significance of the identified lncRNAs was tested with log-rank test, and P<0.05 was considered to indicate a statistically significant difference.
Construction of a risk assessment model
The prognostic risk of each sample in the training set was calculated using the regression coefficients of prognosis-associated lncRNAs via multivariate Cox regression analysis. The following formula was used for calculation, wherein the regression coefficient of each lncRNA was weighted:
where, β lncRNAn indicates the multivariate Cox regression coefficient and exprlncRNAn indicates the expression level of lncRNAn.
Association between risk assessment model and clinical feature
The obtained formula was used to calculate the risk score of patients in the validation set. The median risk score was used to distinguish between high-risk and low-risk samples. The survival information was compared using Kaplan-Meier survival analysis and receiver operating characteristic (ROC) curve (17). The area under the ROC curve (AUC) indicated the accuracy of prognosis. An AUC >0.5 indicated a relatively high accuracy of prognosis, with higher AUC values indicating a more accurate prognosis. The high-risk and low-risk samples were also subjected to Cox regression analysis to screen for prognosis risk-associated clinical features. The independent prognosis factors were identified using both univariate and multivariate Cox regression analyses. Moreover, the association between high or low risk and prognosis under the same clinical status was tested.
Co-expression network and functional enrichment
The co-expression network between lncRNAs and the corresponding mRNAs was constructed using the MEM package (18,19). P<0.05 was used as the cut-off selection criterion. Gene pairs associated with lncRNAs were obtained from the Search Tool for the Retrieval of Interacting Genes/Proteins database (string-db.org/), using a connection score >0.4 as threshold. A co-expression network of these genes was also constructed. Functional enrichment of these lncRNA-associated genes was conducted using the Database for Annotation, Visualization and Integrated Discovery (david.ncifcrf.gov/), and P<0.05 was set as the cut-off value (20).
Results
Identification and validation of five lncRNA prognostic signatures in two datasets
The clinical information of the samples in the training set, validation set and entire set is presented in Table I. A total of 23 poor prognosis samples and 18 good prognosis samples were reported in the training set. Using edgeR and DEseq packages for R, 117 and 225 DELs were screened comparing the poor and good prognosis samples, respectively. The bidirectional hierarchical clustering results of the overlapped 81 lncRNAs are displayed in Fig. 1.
Table I.
Clinical information of samples in training set, validation set and entire set.
| Stage I + II | |||
|---|---|---|---|
| Clinical parameters | Training set (n=83) | Testing set (n=84) | Entire set (n=167) |
| Age (years, mean ± sd) | 58.99±11.82 | 58.42±13.35 | 59.14±12.59 |
| Gender (male/female) | 61/22 | 56/28 | 117/50 |
| Pathologic M (M0/-) | 64/19 | 70/14 | 134/33 |
| Pathologic N (N0/-) | 61/22 | 63/21 | 124/43 |
| Pathologic T (T1/T2) | 55/28 | 58/26 | 113/54 |
| Virus infection (HBV/HCV/mixed/non) | 14/5/14/50 | 10/4/8/62 | 24/9/22/112 |
| Alcohol consumption (yes/no/-) | 29/47/7 | 22/59/3 | 51/106/10 |
| Recurrence (yes/no) | 31/52 | 32/52 | 63/104 |
| Recurrence free survival time (months, mean ± sd) | 24.07±20.45 | 24.56±21.41 | 24.32±20.87 |
sd, standard deviation; -, data unavailable; HBV, hepatitis B virus; HCV, hepatitis C virus.
Figure 1.
DEL analysis. (A) DELs identified using the DESeq package. (B) DELs identified using the edgeR package. (C) Overlapping DELs between the selection methods. (D) Heatmap of the 81 common DELs in the good and poor prognosis groups. Blue represents bad prognosis sample, while yellow indicates good prognosis sample. DEL, differentially expressed long noncoding RNA; FDR, false discovery rate.
A total of 43 prognosis-associated lncRNAs were obtained using univariate Cox regression analysis. Five of these lncRNAs were selected for multivariate Cox regression analysis to construct the risk assessment model. These five signature lncRNAs were RP11-325L7.2, DKFZP434L187, RP11-100L22.4, DLX2-AS1 and RP11-104L21.3 (Table II).
Table II.
lncRNAs in the risk assessment model.
| lncRNA | Coefficient | HR | Lower .95 | Upper .95 | P-value |
|---|---|---|---|---|---|
| RP11-325L7.2 | −0.9283 | 0.3952 | 0.2193 | 0.7124 | 0.002 |
| DKFZP434L187 | −0.2867 | 0.7508 | 0.6238 | 0.9036 | 0.002 |
| RP11-100L22.4 | −0.6785 | 0.5074 | 0.2808 | 0.9167 | 0.025 |
| DLX2-AS1 | 0.3590 | 1.4319 | 1.0420 | 1.9678 | 0.027 |
| RP11-104L21.3 | 0.6786 | 1.9711 | 1.0793 | 3.5996 | 0.027 |
lncRNA, long noncoding RNA; HR, hazard ratio.
These five lncRNAs were used to evaluate the HCC risk in each patient. The risk score distribution, RFS status and expression of five signature lncRNAs in training set, validation set and entire set are depicted in Fig. 2. Their distributions were similar, supporting the robust prediction ability of the five lncRNA-based risk score assessment model.
Figure 2.
Risk score distributions, RFS status, and expressions of five signature lncRNAs in three datasets. Risk score distributions, RFS status, and expressions of lncRNAs in the (A) training set, (B) validation set and (C) entire set. In the risk score distribution graphs, the horizontal axis represents samples categorized between low and high-risk scores, while the vertical axis represents risk scores. In the RFS status graphs, the horizontal axis represents samples with low to high-risk scores, while vertical axis represents RFS time. Recurrence samples and non-recurrence samples are marked in red and black, respectively. RFS, recurrence-free survival; lncRNAs, long noncoding RNAs.
Survival analysis using the five lncRNAs in the training, validation and entire sets
Using the risk assessment model, samples in the training set were divided into high-risk group (42 samples) and low-risk group (41 samples). The Kaplan-Meier survival curves indicated that the samples from the low-risk group had longer survival times compared with those from the high-risk group (average RFS was 27.22±22.23 vs. 21.01±18.28 months) in the training set (Fig. 3A). Similar results were observed for the samples from the validation set. Samples from the low-risk group had longer survival times compared with those from the high-risk group (average RFS was 33.50±24.67 vs. 15.62±12.45 months; Fig. 3B). The samples from the low-risk group similarly exhibited longer survival times compared with those from the high-risk group (average RFS was 30.39±23.57 vs. 18.31±5.78 months; Fig. 3C) in the entire set. The average AUC value of these five lncRNAs in the training set was 0.905 (Fig. 3D), suggestive of the highly accurate prognosis using these five lncRNAs. The average AUC value of these five lncRNAs in the validation set was 0.849 (Fig. 3E). The average AUC value of the lncRNAs for the entire dataset was 0.942 (Fig. 3F).
Figure 3.
Kaplan-Meier curves of recurrence free survival and receiver operating characteristic for the five long noncoding RNAs. Kaplan-Meier curves in training set (A), validation set (B) and entire set (C). Horizontal axis represents the survival period and the vertical axis represents the frequency. Red lines represent high-risk group samples, blue lines represent low-risk group samples. Receiver operating characteristic curves in training set (D), validation set (E) and entire set (F). The abscissa represents sensitivity and the ordinate represents specificity. AUC, area under the curve.
Significant differences were observed in the expression levels of these five lncRNAs between samples from the low- and high-risk groups in training set, validation set and entire set (P<0.05; Fig. 4A-C, respectively). Expression levels of DLX2-AS1 and RP11-104L21.3 were significantly higher in the samples from the high-risk group compared with those from the low-risk group, whereas the expression levels of DKFZP434L187, RP11-100L22.4 and RP11-325L7.2 were significantly lower in the samples from the high-risk group compared with those from the low-risk group (P<0.05).
Figure 4.
Expression level comparisons of the five signature lncRNAs in high- and low-risk samples across the three datasets. Expression levels in (A) training set, (B) validation set and (C) entire set. The expression level in low-risk samples is displayed in green, while the expression level in high-risk samples is displayed in red. *P<0.05, **P<0.01 and ***P<0.005. lncRNA, long noncoding RNA.
Association between survival information and expression pattern was different for the five lncRNAs. In the validation set, high expression samples had a longer survival time compared with the low expression group for RP11-325L7.2, DKFZP434L187, RP11-100L22.4 and RP11-104L21.3 (Fig. 5A-D, respectively), suggesting that the high expression of these lncRNAs may be associated with good prognosis. Low DLX2-AS1 expression was associated with longer survival time compared with those from the high expression group (Fig. 5E), indicating that high expression of DLX2-AS1 may be associated with poor prognosis. The AUC values for prognosis using RP11-325L7.2, DKFZP434L187, RP11-100L22.4 and RP11-104L21.3 (Fig. 5F-I, respectively) were 0.650, 0.666, 0.613, and 0.638, respectively, indicating that these four lncRNAs demonstrated relatively high accuracy for prognosis, and the best performance was produced by DKFZP434L187. The AUC value for DLX2-AS1 prognosis was 0.609 (Fig. 5J), indicative of the relatively and equally high accuracy of prognosis using this lncRNA.
Figure 5.
Predictive effect of each signature lncRNA for high-risk and low-risk samples in the validation set. Kaplan-Meier survival curves for (A) RP11-325L7.2, (B) DKFZP434L187, (C) RP11-100L22.4, (D) RP11-104L21.3 and (E) DLX2-AS1 lncRNAs. High-risk and low-risk samples are marked in red and blue, respectively. Receiver operating characteristic curves for (F) RP11-325L7.2, (G) DKFZP434L187, (H) RP11-100L22.4, (I) DLX2-AS1 and (J) RP11-104L21.3 lncRNAs. lncRNA, long noncoding RNA; AUC, area under the curve.
Prognostic factors for HCC
The results of univariate and multivariate Cox regression analyses are presented in Table III. In the training, validation and entire sets, the risk score was significantly associated with the prognosis of patients and served as an independent prognostic factor (P<0.05). Alcohol consumption was also an independent predictive index in training and entire sets (P<0.05). Virus infection was not a significant factor (P>0.05).
Table III.
Cox regression results between clinical features and prognosis.
| A, Training set (n=83) | ||||||
|---|---|---|---|---|---|---|
| Univariate analysis | Multivariate analysis | |||||
| Variables | HR | 95% CI | P-value | HR | 95% CI | P-value |
| Risk score (high/low) | 2.557 | 1.200–5.447 | 0.009 | 2.452 | 0.544–3.048 | 0.024 |
| Age (≤60/>60) | 0.943 | 0.464–1.914 | 0.869 | 2.172 | 0.526–3.964 | 0.283 |
| Gender (male/female) | 0.938 | 0.429–2.050 | 0.872 | 0.528 | 0.495–1.291 | 0.265 |
| Virus infection (HBV/HCV/mixed) | 1.317 | 0.723–2.398 | 0.369 | 1.283 | 0.653–2.519 | 0.469 |
| Alcohol consumption (yes/no) | 1.402 | 0.752–1.963 | 0.046 | 1.103 | 0.713–1.933 | 0.043 |
| B, Validation set (n=84) | ||||||
| Univariate analysis | Multivariate analysis | |||||
| Variables | HR | 95% CI | P-value | HR | 95% CI | P-value |
| Risk score (high/low) | 2.723 | 1.289–5.751 | 0.012 | 2.992 | 1.196–3.458 | 0.034 |
| Age (≤60/>60) | 1.107 | 0.547–2.241 | 0.777 | 0.265 | 0.040–1.767 | 0.170 |
| Gender (male/female) | 0.645 | 0.319–1.300 | 0.216 | 0.561 | 0.090–3.522 | 0.538 |
| Virus infection (HBV/HCV/mixed) | 1.098 | 0.467–2.059 | 0.958 | 2.243 | 0.678–7.430 | 0.186 |
| Alcohol consumption (yes/no) | 1.132 | 0.298–1.802 | 0.495 | 1.358 | 0.520–2.466 | 0.296 |
| C, Entire set (n=167) | ||||||
| Univariate analysis | Multivariate analysis | |||||
| Variables | HR | 95% CI | P-value | HR | 95% CI | P-value |
| Risk score (high/low) | 2.569 | 1.530–4.315 | 0.001 | 1.906 | 0.697–5.205 | 0.021 |
| Age (≤60/>60) | 1.018 | 0.619–1.672 | 0.944 | 0.215 | 0.437–3.518 | 0.686 |
| Gender (male/female) | 0.773 | 0.462–1.294 | 0.328 | 0.424 | 0.566–4.132 | 0.402 |
| Virus infection (HBV/HCV/mixed) | 1.178 | 0.746–1.860 | 0.481 | 1.180 | 0.715–1.948 | 0.517 |
| Alcohol consumption (yes/no) | 1.542 | 0.803–1.648 | 0.047 | 1.209 | 0.554–1.792 | 0.021 |
HR, hazard ratio; CI, confidence interval; HBV, hepatitis B virus; HCV, hepatitis C virus.
The impact of high and low risk on prognosis within subgroups of the same clinical feature was evaluated. In the entire set, the low-risk group had a significantly better prognosis compared with the high-risk group in all age, sex and alcohol subgroups (P<0.05; Fig. 6A-C, respectively).
Figure 6.
Kaplan-Meier curves of recurrence free survival between high- and low-risk patients across the entire dataset. Survival across the entire dataset was divided and evaluated between (A) age, (B) sex and (C) alcohol assumption subgroups. High-risk and low-risk samples are marked in red and blue, respectively.
Functional enrichment for mRNAs co-expressed with signature lncRNAs
The top 100 mRNAs associated with the five signature lncRNAs were determined for the construction of signature lncRNA-mRNA co-expression networks, and gene-gene co-expression networks of the genes connected with signature lncRNAs (Fig. 7). WD repeat domain 5B (WDR5B) was determined to be a target gene of RP11-100L22.4 and RP11-104L21.3 lncRNAs. Functional enrichment analysis demonstrated that ‘nucleoside-triphosphatase regulator activity’, ‘GTPase regulator activity’, ‘enzyme binding’, ‘oxidation-reduction’ and ‘protein amino acid phosphorylation’ were the most significantly enriched functions, while ‘peroxisome proliferator-activated receptor (PPAR) signaling pathway’, ‘fatty acid metabolism’, ‘Fc-γ-R-mediated phagocytosis’, ‘leukocyte transendothelial migration’ and ‘adipocytokine signaling pathway’ were the most significantly enriched pathways (Table IV).
Figure 7.
Co-expression networks of signature lncRNA-gene and gene-gene interactions. Top mRNAs co-expressed with (A) DKFZP434L187, (B) DLX2-AS1. Top mRNAs co-expressed with (C) RP11-100L22.4 and (D) RP11-104L21.3. Top mRNAs co-expressed with (E) RP11-325L7.2 signature lncRNAs, and gene-gene co-expression networks of associated-mRNAs, are listed in this figure.
Table IV.
Significantly enriched functions and pathways of genes correlated with signature long noncoding RNAs.
| A, Enriched functions (GO terms) | |||
|---|---|---|---|
| Category | Term | Gene count | P-value |
| GOTERM_MF_FAT | GO: 0060589~nucleoside-triphosphatase regulator activity | 24 | 3.52×10-4 |
| GOTERM_MF_FAT | GO: 0030695~GTPase regulator activity | 23 | 6.35×10-4 |
| GOTERM_BP_FAT | GO: 0055114~oxidation reduction | 28 | 2.81×10-3 |
| GOTERM_BP_FAT | GO: 0006468~protein amino acid phosphorylation | 28 | 4.93×10-3 |
| GOTERM_MF_FAT | GO: 0019899~enzyme binding | 26 | 1.79×10-3 |
| B, Enriched pathways (KEGG analysis) | |||
| Category | Term | Gene count | P-value |
| KEGG_PATHWAY | hsa03320: PPAR signaling pathway | 9 | 7.24×10-4 |
| KEGG_PATHWAY | hsa00071: Fatty acid metabolism | 7 | 8.66×10-4 |
| KEGG_PATHWAY | hsa04666: Fc-γ-R-mediated phagocytosis | 9 | 5.62×10-3 |
| KEGG_PATHWAY | hsa04670: Leukocyte transendothelial migration | 10 | 6.48×10-3 |
| KEGG_PATHWAY | hsa04920: Adipocytokine signaling pathway | 7 | 1.20×10-2 |
GO, gene ontology; KEGG, Kyoto encyclopedia of genes and genomes; GTPase, guanosine triphosphatase; PPAR, peroxisome proliferator-activated receptor; MF, molecular function; BP, biological process.
Discussion
In the present study, a series of complex bioinformatics analyses identified five lncRNAs associated with HCC prognosis, namely RP11-325L7.2, DKFZP434L187, RP11-100L22.4, DLX2-AS1 and RP11-104L21.3. A risk-model comprising these five lncRNAs was able to distinguish low-risk and high-risk samples with a relatively high prognosis accuracy. The evaluation of the target genes of these lncRNAs further revealed that ‘nucleoside-triphosphatase regulator activity’, ‘GTPase regulator activity’, ‘enzyme binding’, ‘PPAR signaling pathway’ and ‘fatty acid metabolism’ were potentially the most affected pathways.
To the best of our knowledge, no studies have yet reported a direct correlation between RP11-325L7.2, RP11-100L22.4 or RP11-104L21.3 expression and HCC prognosis. Other RP11-associated lncRNAs have been reported to be implicated in HCC. For instance, high expression levels of RP11-589N15.2, RP11-343N15.5 and RP11-479G22.8 were reported to be associated with the malignant phenotypes of HCC (21). In addition, RP11.404P21.3 and RP11.488L18.10 were identified as two prognostic lncRNA biomarkers for HCC (22). The lncRNA RP11-513I15.6, a member of the three exosomal RNA-based panel, has high sensitivity and specificity in the identification of HCC cases among patients with chronic hepatitis C virus infection and healthy individuals (23). In the present study, the samples with higher expression of the three RP11-associated lncRNAs exhibited longer survival times compared with those which had lower expression, indicating that the high expression of these lncRNAs may be associated with a good HCC prognosis.
DLX2-AS1 is also termed DLX2 divergent transcript. To the best of our knowledge, no studies have confirmed a link between this lncRNA and HCC prognosis. However, the expression of distal-less homeobox 2 (DLX2) protein has been linked with HCC. In a previous study evaluating the response of HCC cells to Actinidia chinensis root (a traditional Chinese medicine) treatment, treatment with acRoots inhibited proliferation, invasion and migration, clonality and epithelial-to-mesenchymal transition, and promoted HCC cells apoptosis by downregulating DLX2 expression (24). Moreover, high expression level of DLX2 was linked to poor prognosis in patients with HCC (24). Another study revealed the association between DLX2 overexpression and poor prognosis in HCC (25). In the present study, samples with high DLX2-AS1 expression had shorter survival times compared with those with low expression. The present results are inconsistent with previous studies, and further experiments are required to understand the discrepancy between the present and previous results.
The function of DKFZP434L187 (also termed LINC02249) is yet unknown. Based on the present results, this lncRNA may serve as a novel prognostic lncRNA biomarker for HCC. However, further studies are required in order to understand the role of this lncRNA in HCC.
It is known that lncRNAs participate in various biological processes such as transcription, translation, cellular differentiation, chromatin modification, regulation of gene expression, cell cycle and nuclear-cytoplasmic trafficking (26–28). lncRNAs guide chromatin-modifying complexes, thus allowing epigenetic modifications in cancer (29). According to the results presented in this study, WDR5B is a target gene of both RP11-100L22.4 and RP11-104L21.3. WDR5 is associated with the lncRNA HOXA distal transcript antisense RNA (HOTTIP), and its expression is associated with disease progression and predictive outcomes in HCC (30). HOTTIP binds directly to WDR5, and targets WDR5/mixed linear leukemia complexes across the HOXA gene cluster, thereby driving histone H3 lysine 4 trimethylation and gene transcription (31). Downregulated expression of HOX is also reported to be involved in the development of HCC (31). Therefore, RP11-100L22.4 and RP11-104L21.3 may participate in the prognosis of HCC through interactions with WDR5.
Lipid metabolism is an essential function of the liver, and any anomalies in this function may cause liver diseases, including fibrosis (32). Defects and/or deregulation of fatty acid metabolism have also been reported to be associated with HCC (33,34). Fatty acid metabolism also serves an important role in determining the function of extra-mitochondrial pathways (35). In the present study, fatty acid metabolism was revealed as a potentially dysfunctional pathway associated with the signature lncRNA differentially expressed in the HCC samples studied. Thus, fatty acid metabolism may be a potential pathway influencing the prognosis of patients with HCC.
Although these predictive results are valuable, the present study has certain limitations. The expression of these lncRNAs should be experimentally validated. In addition, substantial experiments should be performed to confirm the prognostic accuracy of these five lncRNAs in HCC.
In conclusion, the signature lncRNAs screened in this study (RP11-325L7.2, DKFZP434L187, RP11-100L22.4, DLX2-AS1 and RP11-104L21.3) may serve as novel lncRNA biomarkers to predict the prognosis of HCC. Further studies of these lncRNAs and associated genes may contribute to a deeper understanding of the underlying mechanism of HCC development.
Acknowledgements
Not applicable.
Glossary
Abbreviations
- HCC
hepatocellular carcinoma
- lncRNA
long noncoding RNA
- DEL
differentially expressed lncRNA
- PPAR
peroxisome proliferator-activated receptor
- RFS
recurrence-free survival
- ROC
receiver operating characteristic
Funding
The present study was supported by the Presidential Foundation of the 302 Military Hospital, China (grant no. YNKT2014027).
Availability of data and materials
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
Authors' contributions
YQS analyzed the data and wrote the manuscript. LW, FFZ and XS collected and analyzed the data. JJ, FZ and SY drafted the manuscript and contributed to the experimental design. HHL conceived, designed the study and revised the manuscript. All authors read and approved the final manuscript.
Ethics approval and consent to participate
Not applicable.
Patient consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
References
- 1.Forner A, Llovet JM, Bruix J. Hepatocellular carcinoma. Lancet. 2012;379:1245–1255. doi: 10.1016/S0140-6736(11)61347-0. [DOI] [PubMed] [Google Scholar]
- 2.McGlynn KA, London WT. The global epidemiology of hepatocellular carcinoma: Present and future. Clin Liver Dis. 2011;15:223–243. doi: 10.1016/j.cld.2011.03.006. vii-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Yang Z, Zhou L, Wu LM, Lai MC, Xie HY, Zhang F, Zheng SS. Overexpression of long non-coding RNA HOTAIR predicts tumor recurrence in hepatocellular carcinoma patients following liver transplantation. Ann Surg Oncol. 2011;18:1243–1250. doi: 10.1245/s10434-011-1581-y. [DOI] [PubMed] [Google Scholar]
- 4.Llovet JM, Ricci S, Mazzaferro V, Hilgard P, Gane E, Blanc JF, de Oliveira AC, Santoro A, Raoul JL, Forner A, et al. Sorafenib in advanced hepatocellular carcinoma. N Engl J Med. 2008;359:378–390. doi: 10.1056/NEJMoa0708857. [DOI] [PubMed] [Google Scholar]
- 5.Zhou J, Li X, Wu M, Lin C, Guo Y, Tian B. Knockdown of long noncoding RNA GHET1 inhibits cell proliferation and invasion of colorectal cancer. Oncol Res. 2016;23:303–309. doi: 10.3727/096504016X14567549091305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Yang F, Huo XS, Yuan SX, Zhang L, Zhou WP, Wang F, Sun SH. Repression of the long noncoding RNA-LET by histone deacetylase 3 contributes to hypoxia-mediated metastasis. Mol Cell. 2013;49:1083–1096. doi: 10.1016/j.molcel.2013.01.010. [DOI] [PubMed] [Google Scholar]
- 7.Zhu Y, Yu M, Li Z, Kong C, Bi J, Li J, Gao Z, Li Z. NcRAN, a newly identified long noncoding RNA, enhances human bladder tumor growth, invasion, and survival. Urology. 2011;77:510.e1–5. doi: 10.1016/j.urology.2010.09.022. [DOI] [PubMed] [Google Scholar]
- 8.Prensner JR, Chinnaiyan AM. The emergence of lncRNAs in cancer biology. Cancer Discov. 2011;1:391–407. doi: 10.1158/2159-8290.CD-11-0209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Wang F, Ying HQ, He BS, Pan YQ, Deng QW, Sun HL, Chen J, Liu X, Wang SK. Upregulated lncRNA-UCA1 contributes to progression of hepatocellular carcinoma through inhibition of miR-216b and activation of FGFR1/ERK signaling pathway. Oncotarget. 2015;6:7899–7917. doi: 10.18632/oncotarget.3219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Li T, Xie J, Shen C, Cheng D, Shi Y, Wu Z, Deng X, Chen H, Shen B, Peng C, et al. Upregulation of long noncoding RNA ZEB1-AS1 promotes tumor metastasis and predicts poor prognosis in hepatocellular carcinoma. Oncogene. 2016;35:1575–1584. doi: 10.1038/onc.2015.223. [DOI] [PubMed] [Google Scholar]
- 11.Hua L, Wang CY, Yao KH, Chen JT, Zhang JJ, Ma WL. High expression of long non-coding RNA ANRIL is associated with poor prognosis in hepatocellular carcinoma. Int J Clin Exp Pathol. 2015;8:3076–3082. [PMC free article] [PubMed] [Google Scholar]
- 12.Ishibashi M, Kogo R, Shibata K, Sawada G, Takahashi Y, Kurashige J, Akiyoshi S, Sasaki S, Iwaya T, Sudo T, et al. Clinical significance of the expression of long non-coding RNA HOTAIR in primary hepatocellular carcinoma. Oncol Rep. 2013;29:946–950. doi: 10.3892/or.2012.2219. [DOI] [PubMed] [Google Scholar]
- 13.Tu ZQ, Li RJ, Mei JZ, Li XH. Down-regulation of long non-coding RNA GAS5 is associated with the prognosis of hepatocellular carcinoma. Int J Clin Exp Pathol. 2014;7:4303–4309. [PMC free article] [PubMed] [Google Scholar]
- 14.Anders S, Huber W. Differential expression analysis for sequence count data. Genome Biol. 2010;11:R106. doi: 10.1186/gb-2010-11-10-r106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Robinson MD, McCarthy DJ, Smyth GK. edgeR: A Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Wang P, Wang Y, Hang B, Zou X, Mao JH. A novel gene expression-based prognostic scoring system to predict survival in gastric cancer. Oncotarget. 2016;7:55343–55351. doi: 10.18632/oncotarget.10533. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Hanley JA. The robustness of the ‘binormal’ assumptions used in fitting ROC curves. Med Decis Making. 1988;8:197–203. doi: 10.1177/0272989X8800800308. [DOI] [PubMed] [Google Scholar]
- 18.Adler P, Kolde R, Kull M, Tkachenko A, Peterson H, Reimand J, Vilo J. Mining for coexpression across hundreds of datasets using novel rank aggregation and visualization methods. Genome Biol. 2009;10:R139. doi: 10.1186/gb-2009-10-12-r139. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Harrow J, Frankish A, Gonzalez JM, Tapanari E, Diekhans M, Kokocinski F, Aken BL, Barrell D, Zadissa A, Searle S, et al. GENCODE: The reference human genome annotation for The ENCODE Project. Genome Res. 2012;22:1760–1774. doi: 10.1101/gr.135350.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Huang da W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4:44–57. doi: 10.1038/nprot.2008.211. [DOI] [PubMed] [Google Scholar]
- 21.Wu Y, Wang PS, Wang BG, Xu L, Fang WX, Che XF, Qu XJ, Liu YP, Li Z. Genomewide identification of a novel six-LncRNA signature to improve prognosis prediction in resectable hepatocellular carcinoma. Cancer Med. 2018;7:6219–6233. doi: 10.1002/cam4.1854. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Wang J, Pu J, Yao T, Lu X, Deng Y. Four long noncoding RNAs as potential prognostic biomarkers for hepatocellular carcinoma. J Cell Physiol. 2018;234:8709–8716. doi: 10.1002/jcp.27530. [DOI] [PubMed] [Google Scholar]
- 23.Abd El Gwad A, Matboli M, El-Tawdi A, Habib EK, Shehata H, Ibrahim D, Tash F. Role of exosomal competing endogenous RNA in patients with hepatocellular carcinoma. J Cell Biochem. 2018;119:8600–8610. doi: 10.1002/jcb.27109. [DOI] [PubMed] [Google Scholar]
- 24.Fang T, Fang Y, Xu X, He M, Zhao Z, Huang P, Yuan F, Guo M, Yang B, Xia J. Actinidia chinensis Planch root extract attenuates proliferation and metastasis of hepatocellular carcinoma by inhibiting epithelial-mesenchymal transition. J Ethnopharmacol. 2019;231:474–485. doi: 10.1016/j.jep.2018.11.014. [DOI] [PubMed] [Google Scholar]
- 25.Liu J, Cui X, Qu L, Hua L, Wu M, Shen Z, Lu C, Ni R. Overexpression of DLX2 is associated with poor prognosis and sorafenib resistance in hepatocellular carcinoma. Exp Mol Pathol. 2016;101:58–65. doi: 10.1016/j.yexmp.2016.06.003. [DOI] [PubMed] [Google Scholar]
- 26.Kim ED, Sung S. Long noncoding RNA: Unveiling hidden layer of gene regulatory networks. Trends Plant Sci. 2012;17:16–21. doi: 10.1016/j.tplants.2011.10.008. [DOI] [PubMed] [Google Scholar]
- 27.Yoon JH, Abdelmohsen K, Gorospe M. Posttranscriptional gene regulation by long noncoding RNA. J Mol Biol. 2013;425:3723–3730. doi: 10.1016/j.jmb.2012.11.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Wilusz JE, Sunwoo H, Spector DL. Long noncoding RNAs: Functional surprises from the RNA world. Genes Dev. 2009;23:1494–1504. doi: 10.1101/gad.1800909. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Harries LW. Long non-coding RNAs and human disease. Biochem Soc Trans. 2012;40:902–906. doi: 10.1042/BST20120020. [DOI] [PubMed] [Google Scholar]
- 30.Quagliata L, Matter MS, Piscuoglio S, Arabi L, Ruiz C, Procino A, Kovac M, Moretti F, Makowska Z, Boldanova T, et al. Long noncoding RNA HOTTIP/HOXA13 expression is associated with disease progression and predicts outcome in hepatocellular carcinoma patients. Hepatology. 2014;59:911–923. doi: 10.1002/hep.26740. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Wang KC, Yang YW, Liu B, Sanyal A, Corces-Zimmerman R, Chen Y, Lajoie BR, Protacio A, Flynn RA, Gupta RA, et al. A long noncoding RNA maintains active chromatin to coordinate homeotic gene expression. Nature. 2011;472:120–124. doi: 10.1038/nature09819. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Moustafa T, Fickert P, Magnes C, Guelly C, Thueringer A, Frank S, Kratky D, Sattler W, Reicher H, Sinner F, et al. Alterations in lipid metabolism mediate inflammation, fibrosis, and proliferation in a mouse model of chronic cholestatic liver injury. Gastroenterology. 2012;142:140–151. doi: 10.1053/j.gastro.2011.09.051. e12. [DOI] [PubMed] [Google Scholar]
- 33.Li Z, Guan M, Lin Y, Cui X, Zhang Y, Zhao Z, Zhu J. Aberrant lipid metabolism in hepatocellular carcinoma revealed by liver lipidomics. Int J Mol Sci. 2017;18:2550. doi: 10.3390/ijms18122550. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Zhou L, Wang Q, Yin P, Xing W, Wu Z, Chen S, Lu X, Zhang Y, Lin X, Xu G. Serum metabolomics reveals the deregulation of fatty acids metabolism in hepatocellular carcinoma and chronic liver diseases. Anal Bioanal Chem. 2012;403:203–213. doi: 10.1007/s00216-012-5782-4. [DOI] [PubMed] [Google Scholar]
- 35.Ockner RK, Kaikaus RM, Bass NM. Fatty-acid metabolism and the pathogenesis of hepatocellular carcinoma: Review and hypothesis. Hepatology. 1993;18:669–676. doi: 10.1002/hep.1840180327. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.











