Skip to main content
PeerJ logoLink to PeerJ
. 2017 Jul 18;5:e3575. doi: 10.7717/peerj.3575

Identification of four prognostic LncRNAs for survival prediction of patients with hepatocellular carcinoma

Zhonghao Wang 1, Qian Wu 2, Shu Feng 1, Yanhua Zhao 1, Chuanmin Tao 1,
Editor: Alistair Forrest
PMCID: PMC5518732  PMID: 28729955

Abstract

Background

As the fifth most common cancer worldwide, Hepatocellular carcinoma (HCC) is also the third most common cause of cancer-related death in China. Several lncRNAs have been demonstrated to be associated with occurrence and prognosis of HCC. However, identification of prognostic lncRNA signature for HCC with expression profiling data has not been conducted yet.

Methods

With the reuse of public available TCGA data, expression profiles of lncRNA for 371 patients with HCC were obtained and analyzed to find the independent prognostic lncRNA. Based on the expression of lncRNA, we developed a risk score model, which was evaluated by survival analysis and ROC (receiver operating characteristic) curve. Enrichment analysis was performed to predict the possible role of the identified lncRNA in HCC prognosis.

Results

Four lncRNAs (RP11-322E11.5, RP11-150O12.3, AC093609.1, CTC-297N7.9) were found to be significantly and independently associated with survival of HCC patients. We used these four lncRNAs to construct a risk score model, which exhibited a strong ability to distinguish patients with significantly different prognosis (HR = 2.718, 95% CI [2.103–3.514], p = 2.32e−14). Similar results were observed in the subsequent stratification survival analysis for HBV infection status and pathological stage. The ROC curve also implied our risk score as a good indicator for 5-year survival prediction. Furthermore, enrichment analysis revealed that the four signature lncRNAs may be involved in multiple pathways related to tumorigenesis and prognosis.

Discussion

Our study recognized four lncRNAs to be significantly associated with prognosis of liver cancer, and could provide novel insights into the potential mechanisms of HCC progression. Additionally, CTC-297N7.9 may influence the downstream TMEM220 gene expression through cis-regualtion. Nevertheless, further well-designed experimental studies are needed to validate our findings.

Keywords: Prognostic, LncRNA, Survival, HCC

Introduction

Hepatocellular carcinoma, accounting for 90% of primary liver cancers, is one of the most common malignancies worldwide particularly in East Asia and sub-Saharan Africa. More than 250,000 new cases and approximately 500,000–600,000 deaths occur due to this disease annually (Forner, Llovet & Bruix, 2012). Despite the recent advances in early diagnosis, imaging techniques and therapeutic intervention for HCC, low overall 5-year survival rate of HCC patients remains unsatisfactory (Villanueva, Hernandez-Gea & Llovet, 2013).

LncRNAs, defined as ncRNAs longer than 200 nucleotides, play essential roles in almost every aspect of biological processes. Over the past decades, emerging evidences have associated the misexpression of lncRNAs with tumor initiation, metastasis, and prognosis (Spizzo et al., 2012; Zeng et al., 2017; Zhou et al., 2016). Many studies, up to now, have highlighted the molecular mechanism and biological characters of lncRNA in HCC occurrence and progression (Quagliata et al., 2014; Yang et al., 2011). Moreover, some lncRNAs can also be served as valuable prognostic predictor for HCC patients (Chen et al., 2016; Li et al., 2016; Zhang et al., 2016).

With the development of high-throughput technologies, such as RNA-seq and microarray, tremendous expression datasets of gene transcripts have been created for multiple cancer types. TCGA (The Cancer Genome Atlas), a project comprised of 33 cancer types, aiming to improve the prevention, diagnosis, and treatment of cancer (Cancer Genome Atlas Research, 2011). In our present study, expression data and related clinical information of 371 LIHC (liver hepatocellular carcinoma) patients were obtained from TCGA, then four lncRNAs were recognized as independent prognostic biomarkers in the survival analysis. In addition, we also investigated the possible biological mechanism uncovering the association of these prognostic lncRNAs with cancer occurrence and progression.

Materials and Methods

Collection of TCGA dataset and related clinical information

The RNA-Seq data of patients in TCGA-LIHC project were downloaded from Genomic Data Commons Data Portal, as well as the corresponding clinical information (Cancer Genome Atlas Research, 2011). To obtain lncRNA expression profile, we first got ensembl ID of lncRNAs which had been filtered by The Atlas of Noncoding RNAs in Cancer (TANRIC) (Li et al., 2015), then 12,309 lncRNA ID were used to retrieve the required expression information from above RNA-Seq data. Similarly, 19,817 mRNAs ID were downloaded from GENCODE (release 26) (Harrow et al., 2012), then we got the expression profiles of mRNAs from the RNA-Seq data.

Identification of potential prognostic signatures

After removing lncRNAs whose expression is zero in more than 50% of the samples, a total of 4,683 lncRNAs were utilized to do the differential expression analysis among normal and LIHC. To find the prognostic lncRNAs, these patients were analyzed by univariate and multivariate Cox proportional hazards regression methods. Briefly, univariate survival analysis was performed with expression value of lncRNAs as independent variable, while multivariate analysis was conducted with expression value of lncRNAs and other factors (including clinical features or other lncRNAs) as the independent variables. To construct the predictive risk score model, we only used those lncRNAs which were still independent of other factors after the multivariate Cox regression analysis. The coefficient of each prognostic lncRNA in risk score model was derived from corresponding multivariate analysis of these identified lncRNA markers (Zhou et al., 2015). All patients were divided into two groups according to the risk score. Then, for mRNAs differentially expressed between these two group, prognostic mRNAs were recognized with the same method of lncRNA, as well as the mRNA-based model. The combined transcripts based model was fitted with both above identified lncRNA and mRNA biomarkers.

Statistical analysis

All bioinformatics analysis and statistical tests were performed with R language (version 3.3.2): (i) limma package for differential expression analysis; (ii) survival package for univariate and multivariate Cox proportional hazards regression survival analysis; (iii) pROC package for ROC curve and the area under the ROC curves (AUC) analysis; (iv) Gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), Disease Ontology (DO) enrichment analysis were performed with clusterProfiler package (Yu et al., 2012). The co-expression correlation between CTC-297N7.9 and TMEM220 were evaluated with Pearson correlation coefficient. The expression profiles of TMEM220 in normal tissues and cancer cell lines were retrieved from Genotype-Tissue Expression (GTEx) project and Cancer Cell Line Encyclopedia (CCLE) project, respectively (Barretina et al., 2012; Consortium GT, 2013).

Validation for the expression values of lncRNAs

First, nucleotide sequences of each identified lncRNA were downloaded from GENCODE (release 26) (Harrow et al., 2012). The probe sequences of microarray were retrieved from matching platform annotation files deposited in Gene Expression Omnibus (GEO) database with R package GEOquery. Next, probes were re-mapped to lncRNA sequences using Magic-BLAST tool (version 1.2.0, from NCBI), and probes which uniquely mapped to the sequence with no mismatch were retained to represent the corresponding lncRNA. For each GEO dataset which focused on HCC, we used GEO2R web application supplied by GEO (Barrett et al., 2013) to validate differential lncRNA expression.

Results

Differential expression profile and identification of prognostic lncRNAs

From TCGA, RNA-Seq expression profiles of LIHC were downloaded, which covered 371 primary tumor samples and 50 (a subset of 371 HCC patients) paired adjacent normal liver tissue samples. After the initial analysis of all lncRNAs, a total of 1,074 lncRNAs were considered to be differentially expressed between tumor and normal (cutoff of absolute log2FC > 1 and adjusted p-value <0.01), the expression patterns of which were displayed in Fig. 1A. Then we conducted a univariate Cox proportional hazards regression analysis of the above 1,074 lncRNAs, which revealed a subset of 108 lncRNAs significantly associated with patients’ OS (overall survival, p-value <0.01, Fig. 1B). Given that OS can be affected by many factors, a multivariate Cox proportional hazards regression analysis was performed with lncRNAs and other clinical features (such as age of initial diagnosis, pathologic stage and TP53 mutation status) to further evaluate the independent prognostic value of these lncRNAs. Finally, four of the 108 lncRNAs were demonstrated as independent predictors of overall survival (Fig. 1C). The detailed description of these four identified lncRNAs was shown in Table 1.

Figure 1. Identification of prognostic lncRNA for patients with HCC.

Figure 1

(A) Heatmap of 1,074 differentially expressed lncRNAs between normal and tumor tissues. (B) Z score of 1,074 deregulated lncRNAs from univariate Cox aggression analysis (brown dots indicate lncRNAs with p value <0.01). (C) Z score of four identified prognostic lncRNAs from univariate and multivariate Cox aggression analysis.

Table 1. Information of the four identified prognostic lncRNAs for HCC.

Ensembl ID Gene name Chr. Coordinate Z scorea P valuea
ENSG00000267583 RP11-322E11.5 18 35,443,869–35,467,088 3.386 6.935e−04
ENSG00000254290 RP11-150O12.3 8 37,597,480–37,599,858 3.313 8.284e−04
ENSG00000230587 AC093609.1 2 43,097,746–43,102,691 −3.595 3.143e−04
ENSG00000264016 CTC-297N7.9 17 10,741,267–10,769,016 −5.151 2.376e−07

Notes.

a

Statistics derived from univariate Cox proportional hazards regression analysis.

Risk score model based on expression level of independent prognostic lncRNAs

Four lncRNAs screened from above analysis were used to construct a risk score model, risk score = (0.2567 * expression level of RP11-322E11.5) + (0.1307 * expression level of RP11-150O12.3) + (−0.2320 * expression level of AC093609.1) + (−0.1857 * expression level of CTC-297N7.9), then the risk score of each patient was calculated according to this model. With the median value of risk score as cutoff, patients in TCGA datasets were divided into high risk group (HRG, n = 186) and low risk group (LRG, n = 185). As shown in Fig. 2, the four lncRNAs displayed significantly different expression patterns between normal and tumor, LRG and HRG, which indicated possible protective roles of AC093609.1 and CTC-297N7.9, carcinogenic effects of RP11-322E11.5 and RP11-150O12.3 in HCC tumorigenesis and prognosis.

Figure 2. Expression patterns of four prognostic lncRNAs.

Figure 2

(A) Expression pattern of RP11-322E11.5. (B) Expression pattern of RP11-150O12.3. (C) Expression pattern of AC093609.1. (D) Expression pattern of CTC-297N7.9. Normal indicates paired adjacent tissue from a subset of 371 primary tumors; Low/High risk subgroups were divided by our risk score model; statistical significance between different subgroups were performed with the two-tailed Wilcoxon test.

Performance evaluation of risk score model for survival prediction

The Kaplan–Meier curves of the above four lncRNAs were presented in Fig. S1. The distribution of risk score, following-up time and survival status were shown in Figs. 3A and 3B. In the survival analysis, K–M plots exhibited a significant difference in patients’ OS between LRG and HRG (log rank p < 0.0001, Fig. 3C). Patients in LRG had significantly longer OS (mean 2.64 years) when compared with the HRG (mean 1.76 years). The univariate Cox proportional hazards regression analysis implied the significant association between signature lncRNAs based risk score and patients’ OS (HR = 2.718, 95% CI [2.103–3.514], p = 2.32e−14, Table 2). The comparison between prognostic lncRNAs, mRNAs and combined transcripts based risk score model were shown in Table S1.

Figure 3. Performance evaluation of risk score model for overall survival prediction.

Figure 3

(A) Bar plot of risk score for every patient and ordered by the value of risk score. (B) Distribution of following-up time and risk score for every patient. (C) Kaplan–Meier plots of overall survival for patients with high or low risk score. P value was calculated with two-sided log-rank test. (D) ROC curves of the risk score based on prognostic lncRNAs, mRNAs and combined transcripts for five-year survival prediction.

Table 2. Univariate and multivariate Cox proportional hazards regression analysis.

Univariate analysis Multivariate analysis
Hazard ratio (95% CI) P value Hazard ratio (95% CI) P value
Risk scorea 2.718 (2.103–3.514) 2.32E−14 2.513 (1.886–3.348) 3.12E−10
Age 1.01 (0.997–1.024) 0.139 1.009 (0.993–1.025) 0.2602
Gender (Male/Female) 0.816 (0.573–1.163) 0.26 0.968 (0.646–1.450) 0.8748
TP53 mutation (Y/N) 1.563 (1.078–2.266) 0.0184 1.395 (0.915–2.128) 0.1217
HBV infection (Y/N) 1.697 (1.199–2.402) 0.0029 1.656 (1.113–2.463) 0.0129
Pathologic stage (ii/i) 1.423 (0.872–2.323) 0.1583 0.864 (0.513–1.455) 0.5827
Pathologic stage (iii/i) 2.676 (1.754–4.083) 4.92E−06 1.794 (1.125–2.861) 0.0141
Pathologic stage (iv/i) 5.496 (1.695–17.821) 0.0045 2.836 (0.818–9.833) 0.1003

Notes.

a

Derived from risk score model based on the expression of prognostic lncRNAs.

Furthermore, ROC analysis were also performed to evaluate specificity and sensitivity of the risk score models for five-year survival prediction. The AUC derived from lncRNA risk score model was 0.701, suggesting that our risk score was a good indicator for survival prediction (Fig. 3D). Additionally, risk model developed with the combination of lncRNAs and mRNAs exhibited the best prediction power when compared with only lncRNAs or mRNAs (AUC = 0.73 vs 0.70 or 0.68, respectively). To evaluate the independent predictive performance of our model when considering the traditional clinical prognostic factors, a multivariate Cox regression analysis was conducted with risk score and other clinical features as independent variables in this TCGA dataset. The results revealed a significant correlation with survival time after adjusting for multiple clinical variates (HR = 2.513, 95% CI [1.886–3.348], p = 3.12e−10, Table 2).

Stratified Cox proportional hazards regression analysis

Moreover, the results also indicated HBV infection status and pathological stage iii as independent prognostic factors in the above multivariate analysis (p = 0.0129 and p = 0.0141, respectively), so that we performed stratification analysis for HBV infection status and pathological stage next. All patients were first stratified into group with (n = 140) or without HBV infection (n = 230). For patients with HBV infection, significant longer OS can still be observed in LRG than HRG (Fig. 4A, p < 0.0001, mean 3.04 years vs. 1.31 years), but not for patients without HBV infection (Fig. 4B, p = 0.18, mean 2.45 years vs. 2.11 years). Similarly, when 370 patients stratified according to different pathological stage, the Cox regression analysis also exhibited a strong classification power between LRG and HRG both in stage i–ii (Fig. 4C, n = 256, p = 0.00013, mean 2.75 years vs. 1.88 years) and stage iii–iv subgroup (Fig. 4D, n = 90, p = 0.0079, mean 2.39 years vs. 1.50 years).

Figure 4. Stratification survival analysis for patients with available HBV infection and pathological information.

Figure 4

(A–B) Stratification analysis of patients with or without HBV infection. (C) Stratification analysis of patients with pathological stage i and ii. (D) Stratification analysis of patients with pathological stage iii and iv. P value were all calculated with two-sided log-rank test.

Expression profiling of identified prognostic lncRNAs

After the examinations of HCC associated datasets in GEO, variations of lncRNA expression were summarized in Table 3. For AC093609.1, CTC-297N7.9 and RP11-150O12.3, many other studies manifested the same significant difference as our study in liver tumor tissue (p < 0.05). Despite no evidence of significant difference was seen for RP11-322E11.5 (p > 0.05), the expression was also up-regulated in GSE98269, which was the same as our result (lgFC > 0).

Table 3. Expression profiles of 4 lncRNAs in GEO datasets.

LncRNA Year Accession Normal Tumor lgFC P value
RP11-322E11.5 2017 GSE98269 3 3 0.41 4.50E–01
RP11-150O12.3 2008 GSE6222 2 10 1.41 1.60E–01
2013 GSE50579 10 67 1.93 6.05E–03
2014 GSE55092 91 49 0.28 3.78E–06
2014 GSE58043 7 7 1.29 7.16E–02
2015 GSE67260 5 5 3.17 3.09E–03
2016 GSE75271 5 50 0.47 2.30E–02
AC093609.1 2008 GSE6222 2 10 −1.88 4.15E–02
2011 GSE29721 10 10 −0.49 1.33E–03
2013 GSE17548 20 17 −0.33 4.69E–04
2013 GSE47197 63 61 −0.6 1.22E–12
2013 GSE50579 10 67 −1.17 5.68E–05
2014 GSE62232 10 81 −0.48 2.74E–06
2014 GSE55092 91 49 −0.54 4.79E–20
2014 GSE58043 7 7 −1.48 4.48E–02
2015 GSE67260 5 5 −2.06 6.63E–03
2017 GSE70880 16 16 −1.48 2.71E–06
2017 GSE98269 3 3 −1.72 1.11E–02
CTC-297N7.9 2017 GSE98269 3 3 −1.4 2.48E−02
2015 GSE67260 5 5 −2.24 4.44E–04

Functional implication of prognostic lncRNA signatures

To further explore the possible role of these four lncRNA in HCC tumorigenesis and prognosis, we performed enrichment analysis with GO, KEGG and DO databases. First we carried out the expression analysis of mRNA between LRG and HRG, 2,510 in total were recognized as differentially expressed genes. Then we conducted functional enrichment analysis for up-regulated and down-regulated mRNA respectively. The GO, KEGG and DO enrichment showed that 1,189 up-regulated mRNAs are mainly involved in chromosome function, cell cycle, p53 signal pathway and associated with a variety of cancers (Figs. 5A5B, Fig. S2A). For 1,321 down-regulated mRNAs, it revealed that these mRNAs are participant in many pathways or diseases (Figs. 5C5D, Fig. S2B), such as blood microparticle, biosynthesis of basic nutrients, drug metabolism and multiple metabolic diseases.

Figure 5. KEGG and DO enrichment analysis of mRNAs deregulated in high risk group compared with low risk group.

Figure 5

(A–B) KEGG and DO analysis of up-regulated mRNAs. (C–D) KEGG and DO analysis of down-regulated mRNAs.

Biological significance of the prognostic lncRNA CTC-297N7.9

We further examined whether there were any protein-coding genes which are neighborhoods of these lncRNAs in genomic location coordinates. Among these four prognostic lncRNAs, CTC-297N7.9 locates upstream of TMEM220, which is a transmembrane protein-coding gene (Fig. 6A). Then expression correlation of CTC-297N7.9 with TMEM220 were evaluated by Pearson correlation analysis, which implied a strong positive correlation between this paired lncRNA and mRNA (Fig. 6B, correlation coefficient = 0.795, p < 0.0001). Moreover, to explore the possible role of TMEM220 in HCC, we retrieved expression profile of this gene from several databases. From Fig. 6C, we can see a liver-specific expression of TMEM220 among different types of normal tissue. However, this tissue-specific expression pattern disappeared in liver cancer cell when compared with multiple tissue-derived cancer cell lines.

Figure 6. Biological features of prognostic lncRNA CTC-297N7.9.

Figure 6

(A) Relative genomic location of CTC-297N7.9 and TMEM220. (B) Expression correlation analysis of CTC-297N7.9 and TMEM220 in patients with HCC. (C) Expression patterns of TMEM220 in normal tissues and cancer cell lines. Relative expression was calculated with average expression level of all normal tissues or cancer cell lines as one, respectively.

Discussion

Nowadays, much is known about the cellular changes and etiological agents responsible for HCC, while the underlying molecular pathogenesis of HCC is still unclear. Although much progress has been made, many issues still remain unresolved, such as the prognosis evaluation of HCC cases (Llovet, Burroughs & Bruix, 2003). Traditional prognostic staging models, including BCLC, Okuda and CLIP scoring systems, have been used to stage HCC (Bruix & Sherman, 2011; Grieco et al., 2005). Mounts of novel prognostic biomarkers based on gene expression also have been carried out to predict HCC prognosis (Budhu et al., 2013; Govaere et al., 2014; Hoshida et al., 2013). However, neither of which can distinguish every situation. Up to now, mounts of reports have regarded lncRNAs as oncogenes or tumor suppressors. Previous studies on lncRNAs in HCC mainly focused on well-known cancer related transcripts. For example, overexpression of HOTAIR and MALAT-1 were found to be associated with tumor recurrence of HCC after liver transplantation (Lai et al., 2012; Yang et al., 2011). In addition, suppression of these two lncRNAs in HepG2 (liver cancer cell line) could reduce cell viability, invasiveness, and increase the sensitivity to apoptosis.

In the present study, we found 1,074 dysregulated lncRNAs in LIHC, some of which have been recognized to be associated with cancer. For example, LINC01419 was reported to be significantly overexpressed in HBV-related and HCV-related HCC (Zhang et al., 2015), which was consistent with our result. A single nucleotide polymorphism lies in HAGLR was identified as a risk site for ovarian cancer (Burghaus et al., 2017). Expression of HOTTIP is associated with HCC patients’ clinical progression and survival outcome (Quagliata et al., 2014). LINC01093 and FAM99A, both specific expression in liver tissue, was significantly down-regulated in HCC compared with normal livers, as well as lower in cirrhotic tissues relative to normal tissues (Degli Esposti et al., 2016). LINC00844 expression was also down-regulated in HCC and breast cancer (Fu et al., 2015; Jin et al., 2015).

Through univariate and multivariate survival analysis, four lncRNAs were identified to have the ability to predict patients’ overall survival independently. We then developed a risk score model based on expression of the four prognostic lncRNAs. With this risk model, patients were divided into LRG or HRG, which exhibited strong predictive power for patients’ outcome. Patients in LRG are more likely to survival longer than those in HRG. Moreover, prediction power could be enhanced when fitted lncRNA in the model based on mRNA signatures, which also indicate the significance of lncRNA as a valuable supplement to the regular mRNA markers. In addition, other prognostic factors for HCC, like HBV infection status and pathological stage, also showed significant correlation with survival expectations in our multivariate Cox regression analysis. Therefore, we performed stratification analysis for above clinical factors; the results suggested the ability of our risk score model to distinguish patients with significantly different prognostic outcomes (except for patients without HBV infection subgroup).

To confirm our results, we first explored the expression profiles of above lncRNAs in other datasets, which are similar with our findings. We then search some databases to find appropriate cohort to make a cross-validation of our survival prediction model. Considering the fact that many microarrays are not designed for lncRNA, we didn’t find suitable dataset with both expression profiles of the four lncRNAs and detailed clinical data for survival analysis in GEO database. In Fujimoto’s et al. (2016) study, RNAseq analysis were performed on 254 liver tumors, which may be helpful for our validation. However, the original sequencing data deposited in European Gnome-phenome Archive (EGA) database were not publicly accessible, and the processed expression matrix were also not supplied by the author. As a result, we only make a partial cross-validation of the expression profiles of these four prognostic lncRNAs.

As the risk score model was constructed on the basis of lncRNAs expression value and its powerful discrimination of expected survival, differentially expressed mRNAs between LRG and HRG were considered to share similar expression patterns with these signature lncRNAs. On the other hand, these differentially expressed mRNAs can also reflect the possible cause of the significantly different prognosis between these two groups. Enrichment analysis revealed that up-regulated genes are mainly involved in cell cycle pathway and associated with multiple cancers. Those down-regulated genes, linked with a variety of metabolic or cardiovascular diseases, are mainly enriched in drug metabolism pathway, which are consistent with the poor prognosis in HRG.

Although accumulating studies have demonstrated that lncRNAs involve in diverse cancers, the underlying mechanisms are still not well understood. In Panzitt’s et al. (2007) study, lncRNA HULC, acting as a post-transcriptional regulator of gene expression, was detected to be highly specific up-regulation in HCC. Moreover, subsequent research has shown that HULC can trigger protective autophagy and attenuate the sensitivity of HCC cells to chemotherapeutic agents (Xiong et al., 2017). Among the four prognostic lncRNAs, RP11-150O12.3 was recognized as an independent predictor of gastric cancer (GC) prognosis (Ren et al., 2016), which also related to survival time of patients with colorectal adenocarcinoma (Zeng et al., 2017). CTC-297N7.9, locating upstream of protein coding gene TMEM220, showed its protective role in tumorigenesis and progression. As a transmembrane protein, TMEM220 is specifically highly expressed in liver tissue compared with other normal tissues. Recently, a study find that TMEM220 was down-regulated and highly methylated in GC tissues compared to normal tissues. The demethylation experiment also implied that the methylation of TMEM220 was inversely correlated with its expression (Choi et al., 2017). Therefore, we suggest that CTC-297N7.9 may be able to regulate the methylation of TMEM220, or be involved in cell autophagy through the interaction with functional proteins, and then affect the prognosis of HCC patients.

Conclusions

In summary, we recognized four lncRNAs associated with OS of HCC patients, and constructed a risk score model based on lncRNAs expression profiles to make a survival prediction. The four lncRNAs could independently predict OS both in univariate and multivariate survival analysis. Our risk score model had a strong power to distinguish patients with different survival outcomes. These results shown the possibility of lncRNAs as novel biomarkers and therapy targets for HCC. Moreover, this study also revealed the possible role of lncRNAs in occurrence and progression of liver cancer.

Supplemental Information

Figure S1. Survival analysis of four independent prognostic lncRNA markers.

Kaplan-Meier survival curve for (A) RP11-322E11.5 (B) RP11-150O12.3 (C) AC093609.1 and (D) CTC-297N7.9. Patients were divided into high or low expression group with median expression value as the cutoff.

DOI: 10.7717/peerj.3575/supp-1
Figure S2. GO enrichment analysis of mRNAs deregulated in high risk group compared with low risk group.

(A) GO analysis of upregulated mRNAs. (B) GO analysis of downregulated mRNAs.

DOI: 10.7717/peerj.3575/supp-2
Table S1. Comparison between lncRNAs, mRNAs and combined transcripts based risk score model in the overall and stratification survival analysis.
DOI: 10.7717/peerj.3575/supp-3
Table S2. The corresponding probe name of four lncRNAs among different microarray platforms.
DOI: 10.7717/peerj.3575/supp-4

Abbreviations

HCC

Hepatocellular carcinoma

ROC

Receiver operating characteristic curve

HBV

Hepatitis B virus

HCV

Hepatitis C virus

lncRNA

Long non-coding RNA

TCGA

The Cancer Genome Atlas

LIHC

Liver hepatocellular carcinoma

TANRIC

The Atlas of Noncoding RNAs in Cancer

GO

Gene ontology

KEGG

Kyoto Encyclopedia of Genes and Genomes

DO

Disease ontology

AUC

Area under the ROC curve

GTEx

Genotype-Tissue Expression

CCLE

Cancer Cell Line Encyclopedia

OS

Overall survival

HRG

High risk group

LRG

Low risk group

BCLC

Barcelona Clinic Liver Cancer

CLIP

The Cancer of the Liver Italian Program

GC

Gastric cancer

Funding Statement

The authors received no funding for this work.

Additional Information and Declarations

Competing Interests

The authors declare there are no competing interests.

Author Contributions

Zhonghao Wang conceived and designed the experiments, performed the experiments, analyzed the data, contributed reagents/materials/analysis tools, prepared figures and/or tables.

Qian Wu performed the experiments, analyzed the data, wrote the paper, reviewed drafts of the paper.

Shu Feng performed the experiments, analyzed the data, wrote the paper.

Yanhua Zhao analyzed the data, contributed reagents/materials/analysis tools.

Chuanmin Tao conceived and designed the experiments, reviewed drafts of the paper.

Data Availability

The following information was supplied regarding data availability:

The origin RNAseq data used in our study were all downloaded from the TCGA data portal: https://portal.gdc.cancer.gov/.

References

  • Barretina et al. (2012).Barretina J, Caponigro G, Stransky N, Venkatesan K, Margolin AA, Kim S, Wilson CJ, Lehar J, Kryukov GV, Sonkin D, Reddy A, Liu M, Murray L, Berger MF, Monahan JE, Morais P, Meltzer J, Korejwa A, Jane-Valbuena J, Mapa FA, Thibault J, Bric-Furlong E, Raman P, Shipway A, Engels IH, Cheng J, Yu GK, Yu J, Aspesi Jr P, De Silva M, Jagtap K, Jones MD, Wang L, Hatton C, Palescandolo E, Gupta S, Mahan S, Sougnez C, Onofrio RC, Liefeld T, MacConaill L, Winckler W, Reich M, Li N, Mesirov JP, Gabriel SB, Getz G, Ardlie K, Chan V, Myer VE, Weber BL, Porter J, Warmuth M, Finan P, Harris JL, Meyerson M, Golub TR, Morrissey MP, Sellers WR, Schlegel R, Garraway LA. The Cancer Cell Line Encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature. 2012;483:603–607. doi: 10.1038/nature11003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Barrett et al. (2013).Barrett T, Wilhite SE, Ledoux P, Evangelista C, Kim IF, Tomashevsky M, Marshall KA, Phillippy KH, Sherman PM, Holko M, Yefanov A, Lee H, Zhang N, Robertson CL, Serova N, Davis S, Soboleva A. NCBI GEO: archive for functional genomics data sets–update. Nucleic Acids Research. 2013;41:D991–D995. doi: 10.1093/nar/gks1193. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Bruix & Sherman (2011).Bruix J, Sherman M. Management of hepatocellular carcinoma: an update. Hepatology. 2011;53:1020–1022. doi: 10.1002/hep.24199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Budhu et al. (2013).Budhu A, Roessler S, Zhao X, Yu Z, Forgues M, Ji J, Karoly E, Qin LX, Ye QH, Jia HL, Fan J, Sun HC, Tang ZY, Wang XW. Integrated metabolite and gene expression profiles identify lipid biomarkers associated with progression of hepatocellular carcinoma and patient outcomes. Gastroenterology. 2013;144:1066–1075. doi: 10.1053/j.gastro.2013.01.054. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Burghaus et al. (2017).Burghaus S, Fasching PA, Häberle L, Rübner M, Büchner K, Blum S, Engel A, Ekici AB, Hartmann A, Hein A, Beckmann MW, Renner SP. Genetic risk factors for ovarian cancer and their role for endometriosis risk. Gynecologic Oncology. 2017;145:142–147. doi: 10.1016/j.ygyno.2017.02.022. [DOI] [PubMed] [Google Scholar]
  • Cancer Genome Atlas Research (2011).Cancer Genome Atlas Research N Integrated genomic analyses of ovarian carcinoma. Nature. 2011;474:609–615. doi: 10.1038/nature10166. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Chen et al. (2016).Chen ZZ, Huang L, Wu YH, Zhai WJ, Zhu PP, Gao YF. LncSox4 promotes the self-renewal of liver tumour-initiating cells through Stat3-mediated Sox4 expression. Nature Communications. 2016;7:12598. doi: 10.1038/ncomms12598. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Choi et al. (2017).Choi B, Han T-S, Min J, Hur K, Lee S-M, Lee H-J, Kim Y-J, Yang H-K. MAL and TMEM220 are novel DNA methylation markers in human gastric cancer. Biomarkers. 2017;22:35–44. doi: 10.1080/1354750X.2016.1201542. [DOI] [PubMed] [Google Scholar]
  • Consortium GT (2013).Consortium GT The Genotype-Tissue Expression (GTEx) project. Nature Genetics. 2013;45:580–585. doi: 10.1038/ng.2653. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Degli Esposti et al. (2016).Degli Esposti D, Hernandez-Vargas H, Voegele C, Fernandez-Jimenez N, Forey N, Bancel B, Le Calvez-Kelm F, McKay J, Merle P, Herceg Z. Identification of novel long non-coding RNAs deregulated in hepatocellular carcinoma using RNA-sequencing. Oncotarget. 2016;7:31862–31877. doi: 10.18632/oncotarget.7364. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Forner, Llovet & Bruix (2012).Forner A, Llovet JM, Bruix J. Hepatocellular carcinoma. The Lancet. 2012;379:1245–1255. doi: 10.1016/S0140-6736(11)61347-0. [DOI] [PubMed] [Google Scholar]
  • Fu et al. (2015).Fu J, Khaybullin R, Zhang Y, Xia A, Qi X. Gene expression profiling leads to discovery of correlation of matrix metalloproteinase 11 and heparanase 2 in breast cancer progression. BMC Cancer. 2015;15:473. doi: 10.1186/s12885-015-1410-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Fujimoto et al. (2016).Fujimoto A, Furuta M, Totoki Y, Tsunoda T, Kato M, Shiraishi Y, Tanaka H, Taniguchi H, Kawakami Y, Ueno M, Gotoh K, Ariizumi S, Wardell CP, Hayami S, Nakamura T, Aikata H, Arihiro K, Boroevich KA, Abe T, Nakano K, Maejima K, Sasaki-Oku A, Ohsawa A, Shibuya T, Nakamura H, Hama N, Hosoda F, Arai Y, Ohashi S, Urushidate T, Nagae G, Yamamoto S, Ueda H, Tatsuno K, Ojima H, Hiraoka N, Okusaka T, Kubo M, Marubashi S, Yamada T, Hirano S, Yamamoto M, Ohdan H, Shimada K, Ishikawa O, Yamaue H, Chayama K, Miyano S, Aburatani H, Shibata T, Nakagawa H. Whole-genome mutational landscape and characterization of noncoding and structural mutations in liver cancer. Nature Genetics. 2016;48:500–509. doi: 10.1038/ng.3547. [DOI] [PubMed] [Google Scholar]
  • Govaere et al. (2014).Govaere O, Komuta M, Berkers J, Spee B, Janssen C, De Luca F, Katoonizadeh A, Wouters J, Van Kempen LC, Durnez A, Verslype C, De Kock J, Rogiers V, Van Grunsven LA, Topal B, Pirenne J, Vankelecom H, Nevens F, Van Den Oord J, Pinzani M, Roskams T. Keratin 19: a key role player in the invasion of human hepatocellular carcinomas. Gut. 2014;63:674–685. doi: 10.1136/gutjnl-2012-304351. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Grieco et al. (2005).Grieco A, Pompili M, Caminiti G, Miele L, Covino M, Alfei B, Rapaccini GL, Gasbarrini G. Prognostic factors for survival in patients with early-intermediate hepatocellular carcinoma undergoing non-surgical therapy: comparison of Okuda, CLIP, and BCLC staging systems in a single Italian centre. Gut. 2005;54:411–418. doi: 10.1136/gut.2004.048124. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Harrow et al. (2012).Harrow J, Frankish A, Gonzalez JM, Tapanari E, Diekhans M, Kokocinski F, Aken BL, Barrell D, Zadissa A, Searle S, Barnes I, Bignell A, Boychenko V, Hunt T, Kay M, Mukherjee G, Rajan J, Despacio-Reyes G, Saunders G, Steward C, Harte R, Lin M, Howald C, Tanzer A, Derrien T, Chrast J, Walters N, Balasubramanian S, Pei B, Tress M, Rodriguez JM, Ezkurdia I, Van Baren J, Brent M, Haussler D, Kellis M, Valencia A, Reymond A, Gerstein M, Guigo R, Hubbard TJ. GENCODE: the reference human genome annotation for the ENCODE project. Genome Research. 2012;22:1760–1774. doi: 10.1101/gr.135350.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Hoshida et al. (2013).Hoshida Y, Villanueva A, Sangiovanni A, Sole M, Hur C, Andersson KL, Chung RT, Gould J, Kojima K, Gupta S, Taylor B, Crenshaw A, Gabriel S, Minguez B, Iavarone M, Friedman SL, Colombo M, Llovet JM, Golub TR. Prognostic gene expression signature for patients with hepatitis C– related early-stage cirrhosis. Gastroenterology. 2013;144:1024–1030. doi: 10.1053/j.gastro.2013.01.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Jin et al. (2015).Jin B, Wang W, Du G, Huang G, Han L, Tang Z, Fan D, Li J, Zhang S. Identifying hub genes and dysregulated pathways in hepatocellular carcinoma. European Review for Medical and Pharmacological. 2015;19:592–601. [PubMed] [Google Scholar]
  • Lai et al. (2012).Lai MC, Yang Z, Zhou L, Zhu QQ, Xie HY, Zhang F, Wu LM, Chen LM, Zheng SS. Long non-coding RNA MALAT-1 overexpression predicts tumor recurrence of hepatocellular carcinoma after liver transplantation. Medical Oncology. 2012;29:1810–1816. doi: 10.1007/s12032-011-0004-z. [DOI] [PubMed] [Google Scholar]
  • Li et al. (2015).Li J, Han L, Roebuck P, Diao L, Liu L, Yuan Y, Weinstein JN, Liang H. TANRIC: an interactive open platform to explore the function of lncRNAs in cancer. Cancer Research. 2015;75:3728–3737. doi: 10.1158/0008-5472.CAN-15-0273. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Li et al. (2016).Li T, Xie J, Shen C, Cheng D, Shi Y, Wu Z, Deng X, Chen H, Shen B, Peng C, Li H, Zhan Q, Zhu Z. Upregulation of long noncoding RNA ZEB1-AS1 promotes tumor metastasis and predicts poor prognosis in hepatocellular carcinoma. Oncogene. 2016;35:1575–1584. doi: 10.1038/onc.2015.223. [DOI] [PubMed] [Google Scholar]
  • Llovet, Burroughs & Bruix (2003).Llovet JM, Burroughs A, Bruix J. Hepatocellular carcinoma. The Lancet. 2003;362:1907–1917. doi: 10.1016/S0140-6736(03)14964-1. [DOI] [PubMed] [Google Scholar]
  • Panzitt et al. (2007).Panzitt K, Tschernatsch MMO, Guelly C, Moustafa T, Stradner M, Strohmaier HM, Buck CR, Denk H, Schroeder R, Trauner M, Zatloukal K. Characterization of HULC, a novel gene with striking up-regulation in hepatocellular carcinoma, as noncoding RNA. Gastroenterology. 2007;132:330–342. doi: 10.1053/j.gastro.2006.08.026. [DOI] [PubMed] [Google Scholar]
  • Quagliata et al. (2014).Quagliata L, Matter MS, Piscuoglio S, Arabi L, Ruiz C, Procino A, Kovac M, Moretti F, Makowska Z, Boldanova T, Andersen JB, Hammerle M, Tornillo L, Heim MH, Diederichs S, Cillo C, Terracciano LM. Long noncoding RNA HOTTIP/HOXA13 expression is associated with disease progression and predicts outcome in hepatocellular carcinoma patients. Hepatology. 2014;59:911–923. doi: 10.1002/hep.26740. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Ren et al. (2016).Ren W, Zhang J, Li W, Li Z, Hu S, Suo J, Ying X. A Tumor-specific prognostic long non-coding rna signature in gastric cancer. Medical Science Monitor : International Medical Journal of Experimental and Clinical Research. 2016;22:3647–3657. doi: 10.12659/MSM.901190. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Spizzo et al. (2012).Spizzo R, Almeida MI, Colombatti A, Calin GA. Long non-coding RNAs and cancer: a new frontier of translational research? Oncogene. 2012;31:4577–4587. doi: 10.1038/onc.2011.621. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Villanueva, Hernandez-Gea & Llovet (2013).Villanueva A, Hernandez-Gea V, Llovet JM. Medical therapies for hepatocellular carcinoma: a critical view of the evidence. Nature Reviews. Gastroenterology & Hepatology. 2013;10:34–42. doi: 10.1038/nrgastro.2012.199. [DOI] [PubMed] [Google Scholar]
  • Xiong et al. (2017).Xiong H, Ni Z, He J, Jiang S, Li X, He J, Gong W, Zheng L, Chen S, Li B, Zhang N, Lyu X, Huang G, Chen B, Zhang Y, He F. LncRNA HULC triggers autophagy via stabilizing Sirt1 and attenuates the chemosensitivity of HCC cells. Oncogene. 2017;36:3528–3540. doi: 10.1038/onc.2016.521. [DOI] [PubMed] [Google Scholar]
  • Yang et al. (2011).Yang Z, Zhou L, Wu L-M, Lai M-C, Xie H-Y, Zhang F, Zheng S-S. Overexpression of long non-coding RNA HOTAIR predicts tumor recurrence in hepatocellular carcinoma patients following liver transplantation. Annals of Surgical Oncology. 2011;18:1243–1250. doi: 10.1245/s10434-011-1581-y. [DOI] [PubMed] [Google Scholar]
  • Yu et al. (2012).Yu G, Wang LG, Han Y, He QY. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012;16:284–287. doi: 10.1089/omi.2011.0118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Zeng et al. (2017).Zeng JH, Liang L, He RQ, Tang RX, Cai XY, Chen JQ, Luo DZ, Chen G. Comprehensive investigation of a novel differentially expressed lncRNA expression profile signature to assess the survival of patients with colorectal adenocarcinoma. Oncotarget. 2017;8:16811–16828. doi: 10.18632/oncotarget.15161. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Zhang et al. (2016).Zhang JY, Weng MZ, Song FB, Xu YG, Liu Q, Wu JY, Qin J, Jin T, Xu JM. Long noncoding RNA AFAP1-AS1 indicates a poor prognosis of hepatocellular carcinoma and promotes cell proliferation and invasion via upregulation of the RhoA/Rac2 signaling. International Journal of Oncology. 2016;48:1590–1598. doi: 10.3892/ijo.2016.3385. [DOI] [PubMed] [Google Scholar]
  • Zhang et al. (2015).Zhang H, Zhu C, Zhao Y, Li M, Wu L, Yang X, Wan X, Wang A, Zhang MQ, Sang X, Zhao H. Long non-coding RNA expression profiles of hepatitis C virus-related dysplasia and hepatocellular carcinoma. Oncotarget. 2015;6:43770–43778. doi: 10.18632/oncotarget.6087. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Zhou et al. (2016).Zhou M, Xu W, Yue X, Zhao H, Wang Z, Shi H, Cheng L, Sun J. Relapse-related long non-coding RNA signature to improve prognosis prediction of lung adenocarcinoma. Oncotarget. 2016;7:29720–29738. doi: 10.18632/oncotarget.8825. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • Zhou et al. (2015).Zhou M, Zhao H, Wang Z, Cheng L, Yang L, Shi H, Yang H, Sun J. Identification and validation of potential prognostic lncRNA biomarkers for predicting survival in patients with multiple myeloma. Journal of Experimental & Clinical Cancer Research. 2015;34:102–115. doi: 10.1186/s13046-015-0219-5. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Figure S1. Survival analysis of four independent prognostic lncRNA markers.

Kaplan-Meier survival curve for (A) RP11-322E11.5 (B) RP11-150O12.3 (C) AC093609.1 and (D) CTC-297N7.9. Patients were divided into high or low expression group with median expression value as the cutoff.

DOI: 10.7717/peerj.3575/supp-1
Figure S2. GO enrichment analysis of mRNAs deregulated in high risk group compared with low risk group.

(A) GO analysis of upregulated mRNAs. (B) GO analysis of downregulated mRNAs.

DOI: 10.7717/peerj.3575/supp-2
Table S1. Comparison between lncRNAs, mRNAs and combined transcripts based risk score model in the overall and stratification survival analysis.
DOI: 10.7717/peerj.3575/supp-3
Table S2. The corresponding probe name of four lncRNAs among different microarray platforms.
DOI: 10.7717/peerj.3575/supp-4

Data Availability Statement

The following information was supplied regarding data availability:

The origin RNAseq data used in our study were all downloaded from the TCGA data portal: https://portal.gdc.cancer.gov/.


Articles from PeerJ are provided here courtesy of PeerJ, Inc

RESOURCES