Skip to main content
BioMed Research International logoLink to BioMed Research International
. 2020 Dec 8;2020:4251761. doi: 10.1155/2020/4251761

Identification of Potential Hub Genes Related to Diagnosis and Prognosis of Hepatitis B Virus-Related Hepatocellular Carcinoma via Integrated Bioinformatics Analysis

Yuqin Tang 1,2, Yongqiang Zhang 3,4, Xun Hu 2,
PMCID: PMC7744201  PMID: 33376723

Abstract

Hepatocellular carcinoma (HCC) is a common malignant cancer with poor survival outcomes, and hepatitis B virus (HBV) infection is most likely to contribute to HCC. But the molecular mechanism remains obscure. Our study intended to identify the candidate potential hub genes associated with the carcinogenesis of HBV-related HCC (HBV-HCC), which may be helpful in developing novel tumor biomarkers for potential targeted therapies. Four transcriptome datasets (GSE84402, GSE25097, GSE94660, and GSE121248) were used to screen the 309 overlapping differentially expressed genes (DEGs), including 100 upregulated genes and 209 downregulated genes. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment were used to explore the biological function of DEGs. A PPI network based on the STRING database was constructed and visualized by the Cytoscape software, consisting of 209 nodes and 1676 edges. Then, we recognized 17 hub genes by CytoHubba plugin, which were further validated on additional three datasets (GSE14520, TCGA-LIHC, and ICGC-LIRI-JP). The diagnostic effectiveness of hub genes was assessed with receiver operating characteristic (ROC) analysis, and all hub genes displayed good performance in discriminating TNM stage I patient samples and normal tissue ones. For prognostic analysis, two prognostic key genes (TOP2A and KIF11) out of the 17 hub genes were screened and used to develop a prognostic signature, which showed good potential for overall survival (OS) stratification of HBV-HCC patients. Gene Set Enrichment Analysis (GSEA) was performed in order to better understand the function of this prognostic gene signature. Finally, the miRNA–mRNA regulatory relationships of all hub genes in human liver were predicted using miRNet. In conclusion, the current study gives further insight on the pathogenesis and carcinogenesis of HBV-HCC, and the identified DEGs provide a promising direction for improving the diagnostic, prognostic, and therapeutic outcomes of HBV-HCC.

1. Introduction

Liver cancer, with about 841,000 new cases diagnosed and 782,000 deaths in 2018, still represents a common lethal solid tumor and ranks fourth leading cause of cancer-related deaths worldwide [1]. In China, liver cancer was one of the first five life killers in 2017 [2]. Hepatocellular carcinoma (HCC), comprising 75%-85% of all primary liver cancer cases worldwide [1], is the primary histological subtype. The major causative etiological factors of HCC are considered as infection of endemic hepatitis B virus (HBV) or hepatitis C virus (HCV), followed by exposition to aflatoxin B1, alcohol abuse, and obesity [1, 3]. Particularly, HBV infection is considered the dominant cause of HCC, accounting for more than 80% of all HCC incidences in China and other developing countries [4]. Despite significant advances in early diagnosis, prevention and the standard therapeutic interventions such as surgery, radiation, chemotherapy, or personalized target therapeutic strategies developed during the last decade, and the cumulative 5-year overall survival rate of HCC remains unfavorable, probably due to its invasive behavior, as well as its histopathological and molecular heterogeneity that challenge molecular characterization and targeted therapeutic approaches. Moreover, the majority of patients are diagnosed in a more advanced stage, resulted in a much poorer prognosis. Thus, considerable work is still required to achieve a better understanding of the underlying mechanism at molecular level on the pathogenesis and carcinogenesis of HCC, which may be imperative for the development of robust biomarkers for early diagnosis and drug discovery.

In recent years, the rapid development of bioinformatics and emerging high-throughput techniques, such as microarray and next-generation sequencing (NGS), have enabled us to gain a comprehensive understanding of carcinogenesis and progression of various types of cancer. High-throughput platforms have been widely used in early diagnosis, histological identification, molecular classification, prognosis prediction, and drug resistance analysis of cancer [59]. Differentially expressed genes (DEGs), microRNAs (miRNAs), long noncoding RNAs (lncRNAs), and circular RNAs (circRNAs), as well as differentially methylated CpG sites have the potential to provide valuable clues to locate biomarkers in HCC. Nevertheless, data acquired from multiple studies may lead to false-positive results, considering the sample heterogeneity, different screening methods, different data mining approaches, and coupled effect of limited sample size in a single independent study. Integrated analysis based on collective datasets is a promising strategy to overcome these shortcomings. The microarray and sequencing data that deposited in the public databases such as The Cancer Genome Atlas (TCGA), Gene Expression Omnibus (GEO), and International Cancer Genome Consortium (ICGC) have generated valuable information to identify biomarkers or to explore molecular landscapes of cancers, especially at transcriptome level [1015]. Integrated transcriptome analyses have been employed to disclose the molecular mechanisms of cancers. However, few reports have been submitted to examine dysregulated genes and candidate biomarkers regarding HBV-related HCC (HBV-HCC) with combined datasets [16].

In the current study, seven datasets of gene expression profiles from GEO, TCGA, and ICGC were used to identify the hub genes that may play pivotal roles in the pathogenesis of HBV-HCC, comprising a total of 514 HBV-HCC tumor samples and 452 normal tissues. The common differentially expressed genes between tumors and adjacent normal liver tissues of four datasets (GSE84402 [17], GSE25097 [18], GSE94660 [19], and GSE121248 [20]) were screened by the R language software. The R package clusterProfiler [21] was utilized to conduct Gene ontology (GO) enrichment analysis and the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis to reveal the biological functions of DEGs. The protein-protein interaction (PPI) network of all common DEGs was constructed with information from the Search Tool for the Retrieval of Interacting Genes (STRING) database. Based on the entire network, hub gene identification and submodule analysis were performed using the CytoHubba plugin or the Molecular Complex Detection (MCODE) plugin of Cytoscape [2224], respectively. The expression levels and correlation analysis of hub genes were then validated with the aid of complementary three datasets (TCGA, ICGC, and GSE14520). Furthermore, the diagnostic effectiveness of the hub genes was evaluated from TCGA and GSE14520, and the prognostic value was also determined in the GSE14520 dataset using univariate and multivariate analyses. Finally, Gene Set Enrichment Analysis (GSEA) and mRNA–microRNA interactions were operated to further clarify their possible molecular mechanisms. Taken all together, the crucial hub genes and associated key pathways identified from our findings would provide a potential indication for the molecular mechanism of HBV-HCC development and progression, and the particular proposed genes may shed additional insight into the early diagnosis, prognosis prediction, and therapeutic targets of HBV-HCC in the near future.

2. Materials and Methods

2.1. Gene Expression Dataset Acquisition

We chose four datasets of gene expression profiling from the GEO (https://www.ncbi.nlm.nih.gov/geo/) database to fetch the DEGs between HBV-HCC tumor samples and normal liver tissues, with the accession numbers of GSE84402, GSE25097, GSE94660, and GSE121248. GSE84402 and GSE121248 were both based on GPL570 (Affymetrix Human Genome U133 Plus 2.0 Array), while GSE25097 was based on GPL10687 (Merck Human RSTA Affymetrix 1.0 microarray, Custom CDF), and GSE94660 was based on GPL16791 (Illumina Hiseq 2500 (Homo sapiens)). HBV-HCC cases or samples were carefully filtered from all of the above studies. The dataset of GSE84402 contained 13 pairs of HBV-related hepatocellular carcinoma tissues and corresponding noncancerous tissues. The dataset of GSE25097 comprised 73 HBV-HCC tissues and 67 paired adjacent nontumor samples. The dataset of GSE94660 consisted of 21 pairs of tumor and nonneoplastic liver tissues of HBV-HCC patients. The dataset of GSE121248 included 70 tissues from chronic hepatitis B-induced HCC and 37 adjacent normal tissues. In addition, we downloaded another three transcriptome datasets and corresponding clinical information for validation—GSE14520 from the GEO database including 213 HBV-HCC tissues and 220 normal samples, TCGA-LIHC data from The Cancer Genome Atlas (TCGA, http://www.tcga.org/) database including 70 HBV-HCC tissues and 49 normal samples, and ICGC-LIRI-JP data from the International Cancer Genome Consortium (ICGC, https://icgc.org/) including 53 HBV-HCC tissues and 45 paired normal samples. Table 1 summarized the detailed information of the selected datasets in this study.

Table 1.

Detailed information of selected datasets in this study.

Study Platforms Etiology Tumor Normal Total Technology
GSE84402 GPL570 HBV 13 13 26 Microarray
GSE25097 GPL10687 HBV 73 67 140 Microarray
GSE94660 Illumina Hiseq HBV 21 21 42 RNA-seq
GSE121248 GPL570 HBV 70 37 107 Microarray
GSE14520 GPL3921 HBV 213 220 433 Microarray
TCGA Illumina Hiseq HBV 71 49 120 RNA-seq
ICGC Illumina Hiseq HBV 53 45 98 RNA-seq

2.2. Data Preprocessing and DEG Identification

All data were analyzed by the R software (version3.6.0, https://http://www.r-project.org/). For the microarray data, we downloaded the raw data (.CELL files) and used the robust multiarray average algorithm [25] to conduct the background correction and quantile normalization; probes were subsequently matched with the corresponding gene name according to the platform annotation packages. When multiple probes were matched one the same gene, the highest expression value of all probes was selected to represent the final expression level. For RNA-seq data, we collected the normalized gene expression matrixes from the GEO database or HCCD database [26] directly. Meanwhile, the limma package [27] was applied to screen DEGs between HBV-HCC samples and noncancerous samples, with P value <0.05 and ∣log2 FC | ≥1 were set as the statistical cutoff criteria. The overlapping DEGs were identified by Venn plot. Besides, to minimize the chance for making type I error, we also adopted the integrating strategy by removing batch effect via “sva” [28] package for the three microarray datasets.

2.3. Functional Enrichment Analysis

To explore a further understanding of the potential biological function, GO and KEGG pathway enrichment analyses were performed on the overlapping dysregulated genes using the package “clusterProfiler” [21], and P < 0.05 was determined as a cutoff for significance.

2.4. PPI Network Construction

STRING database (version 11.0; http://string-db.org/), a free online tool to evaluate PPI information, was used to assess and integrate the interactive relationships among these DEGs. A combined confidence score of ≥0.7 (high confidence) was set as the threshold value. The Cytoscape software [24] (version 3.2.1; http://www.cytoscape.org) was then used to construct and visualize the PPI network. Besides, we also used a plugin from Cytoscape, the MCODE [23] (MCODE; version 1.5.1), to carry out the submodule analysis. For the identification of hub genes, we used the CytoHubba app [22] in Cytoscape with a combined method. Generally, nodes with top 60 scores of all the 11 algorithms in CytoHubba were picked up; then, the intersection based on more than 8 algorithms was taken as candidate hub genes, and two packages including UpSetR [29] and Venn detail [30] were used for visualization. Finally, the potential function of submodule genes or the hub genes was verified and displayed by the clusterProfiler [21] and the GOplot R packages [31].

2.5. Validation of Hub Gene Expression

As mentioned above, three datasets including GSE14520, TCGA-LIHC, and ICGC-LIRI-JP were used to verify the differential expression level of hub genes. The violin plots and heatmaps were drawn to compare the expression pattern between HBV-HCC tissues and noncancerous tissues, with the Wilcoxon test to measure the statistical significance. The same three datasets were also used to perform the coexpression analysis of the selected hub genes. The correlation of hub genes' expression and the clinical stages was investigated with the GSE14520 dataset and exhibited with boxplots.

2.6. Assessment of the Diagnostic and Prognostic Values

The receiver operating characteristic (ROC) analysis was conducted by means of R package pROC [32] to determine the power of potential hub genes in the diagnose of early phrase and whole phrase of HBV-HCC carcinogenesis, using their expression values from GSE14520 and TCGA-LIHC. The appropriate expression levels of the hub genes were served as cutoff values. For prognostic assessment, because of the sample size (n < 100) limitation of other datasets, only GSE14520 containing 213 HBV-HCC patients with complete OS data and sufficient clinicopathologic information were analyzed. In order to increase the robustness of the selection, we applied the “multi-split” strategy with log-rank test for 100 randomizations (75% portion of all samples were subsampled at each time) to evaluate the correlation between the OS and each hub gene expression level (Supplementary Figure 1). Those genes repeatedly showed significance for more than 75 times were considered as prognostic key genes and were further used to construct a linear combination as the risk signature with the following formula: risk score = sum (coef (k) × expression value of k), where k represents the candidate prognostic key genes. All patients were divided as high-risk or low-risk groups according to the median risk score. Kaplan-Meier method with a log-rank test was used to compare the survival curves by using the survival package [33]. The time-dependent ROC was depicted to estimate the predictive ability of the risk signature for patients' OS survival. The associations of clinicopathologic features and the risk signature were determined by Pearson chi-square test or Fisher's exact test. Univariable and multivariable Cox regression analyses were conducted to identify independent prognostic factors. For all statistical tests, P < 0.05 was set as significant cutoff.

2.7. Gene Set Enrichment Analysis

The genes that differentially expressed and associated with risk stratification of OS survival were analyzed by GSEA, which was employed to examine the statistical significance of a priori defined set of genes between different phenotypes. In the current study, we focused on the KEGG pathways (c2.cp.kegg. v7.1) and molecular function of GO gene sets (c5.mf.v7.1). 1000 times of permutations were performed to obtain the enrichment score (ES) and the normalization enrichment score (NES) for each gene set. Significant gene sets were identified with a nominal P value <0.05, combined with an FDR < 25% and normalized enrichment score (NES) > 1 or <-1.

2.8. miRNA-mRNA Interaction Prediction

We predicted the liver-specific miRNA-mRNA interaction of all hub genes using an online tool—miRNet [34], which integrated several well-annotated databases, including miRTarBase v7.0, TarBase v7.0, and miRecords. miRNA-mRNA correlation network was constructed with the Cytoscape software. In the network, diamond nodes denoted the mRNAs while the rectangle nodes represented the miRNAs.

3. Results

3.1. Identification of DEGs in HBV-HCC

With the filtering criteria mentioned above, DEGs in HBV-HCC carcinogenesis were achieved using a total of 315 clinical samples from four GEO datasets (GSE25097, GSE84402, GSE121248, and GSE94660). In all, there were 623 upregulated and 1144 downregulated genes identified from GSE25097, 319 upregulated and 775 downregulated genes from GSE84402, 319 upregulated and 564 downregulated genes from GSE121248, and 1329 upregulated and 534 downregulated genes from GSE94660. The volcano plots of all DEGs in each of the four datasets were shown in Figures 1(a)1(d). For overlapping analysis of the Venn diagram, 121 upregulated genes or 302 downregulated genes were firstly shared by Affymetrix biosystems (Figure 1(e)); then, 309 common DEGs were obtained by Affymetrix and Illumina platforms, consisting of 100 upregulated genes and 209 downregulated genes (Figures 1(f) and 1(g)). To increase the robustness of these 309 DEGs, we conducted the integrating analysis of the three microarray datasets. After removing the batch effect, all 309 DEGs were still showed significant (Supplementary Figure 2, Supplementary Table 1).

Figure 1.

Figure 1

Identification of overlapping DEGs among the screening datasets of HBV-HCC. (a–d) Volcano plot for GSE84402, GSE25097, GSE121248, and GSE94660. (e) Venn diagram for the overlapping DEGs from Affymetrix biosystems. (f) Common DEGs shared by Affymetrix and Illumina platforms. (g) Heatmap of 309 common DEGs. Blue represents downregulated genes while red represents upregulated genes. Each column represents one dataset, and each row represents one gene. DEGs: differentially expressed genes; HBV-HCC: HBV-related hepatocellular cancer.

3.2. DEG Function Analysis

Three categories of GO comprising of biological process (BP), molecular function (MF), and cellular component (CC), together with KEGG enrichment, were performed for the upregulated and downregulated genes. For GO functions analysis, upregulated genes were involved in multiple GO terms, such as mitotic nuclear division, chromosome segregation, nuclear division, and spindle. On the other hand, significant GO terms that associated with downregulated genes were organic acid catabolic process, carboxylic acid catabolic process, small molecule catabolic process, carboxylic acid biosynthetic process, and so on. For KEGG pathway analysis, the upregulated genes were mostly enriched in the cell cycle, DNA replication, oocyte meiosis, progesterone-mediated oocyte maturation, p53 signaling pathway, and mismatch repair. Meanwhile, the downregulated genes were mainly related to chemical carcinogenesis, retinol metabolism, tryptophan metabolism, bile secretion, and linoleic acid metabolism (Figure 2).

Figure 2.

Figure 2

Function enrichment analysis of the overlapping DEGs. (a) GO analysis for the upregulated DEGs (top 20 GO terms are shown). (b) KEGG pathway enrichment for the upregulated DEGs. (c) GO analysis for the downregulated DEGs (top 20 GO terms are shown). (d) KEGG pathway enrichment for the downregulated DEGs. BP: biological process; CC: cellular component; MF: molecular function.

3.3. PPI Network Construction

For the DEG interaction inspection, we constructed a PPI network of 209 nodes and 1676 edges by the STRING database and Cytoscape software, with a strict criterion (combined interaction score ≥ 0.7). The PPI network contained 84 upregulated DEGs and 125 downregulated DEGs (Supplementary Figure 3). The clustering coefficient was 0.525, and the average degree was 16.038. In addition, the MCODE app of Cytoscape detected the most significant module with a high network score (>40), which consisted of 47 nodes and 1017 edges (Figure 3(a)). Functional analysis revealed that mitotic nuclear division and cell cycle were the most significant GO and KEGG pathway enriched by the module (Figures 3(b) and 3(c)).

Figure 3.

Figure 3

Subnetwork analysis of the DEGs PPI. (a) The most significant module selected by MCODE plugin (MCODE score > 40), comprising 47 nodes. (b) Top five related GO terms enriched by the DEGs in the module. (c) KEGG enrichment analyses for the DEGs in the module.

For the hub gene identification, we used a combined method by CytoHubba. Interestingly, all of the 17 hub genes were upregulated DEGs, and most of them (UBE2C, RRM2, RFC4, TOP2A, MCM2, CDK1, CCNB1, HMMR, CDC20, CCNA2, NEK2, NDC80, DLGAP5, and KIF11) were involved in the significant module, while only 3 hub genes (MCM6, MCM3, and PRIM1) were not included (Figure 4(a)). We also employed the GO and KEGG analyses of all the hub genes for further validation of their biological functions, such as DNA replication and cell cycle regulators (Figures 4(b) and 4(c)).

Figure 4.

Figure 4

Hub gene identification and functional analysis. (a) The combination of Upset plot and Vennpie plot shows the 17 hub genes identified by CytoHubba plugin through DEG PPI network, with an overlapping strategy. (b) Top five related GO terms of the hub genes. (c) Result of KEGG pathway analysis of the hub genes.

3.4. Hub Gene Validation

To validate the differentially expressed levels of the selected hub genes, three other datasets were used in this study. Consistent with the screening cohorts from the GEO database, all of the 17 definitive hub genes showed the significantly higher expression trends between HBV-HCC cancer samples and adjacent liver normal ones (Figures 5(a)5(c)). Moreover, the hierarchical clustering of the hub genes and samples based on the three datasets revealed their good potential in discriminating the tumors from healthy tissues (Figure 6). Heatmaps of Pearson correlation suggested the high correlations of these hub genes in all three datasets, supporting the underlying hypothesis that hub genes may strongly interact with each other and play critical roles in the development of HBV-HCC (Figure 5(d)). To seek for the clinical relevance of these hub genes, we analyzed the transcription expression levels of the hub genes according to patients' TNM stages and their BCLC stages. Consequently, most of the hub genes showed no statistical difference between TNM stage I and other stages, which suggested their predominant roles in the initiation of carcinogenesis, but they may not good indicators for HBV-HCC progression (Figure 7). A similar result was reached based on the BCLC staging system, which implied that only MCM6, RRM2, and CCNB1 might be associated with HBV-HCV progression (Supplementary Figure 4).

Figure 5.

Figure 5

Validation of the aberrant expression levels for the selected hub genes and coexpression analysis. (a–c) Violin plots showing the significantly increased expression values for all of the 17 hub genes based on GSE14520, TCGA-LIHC, and ICGC-LIRI-JP. (d) Pearson correlation analysis among expression levels of the 17 hub genes for GSE14520, TCGA-LIHC, and ICGC-LIRI-JP. The color depth indicates the degree of correlation. The darker the color, the higher the correlation coefficient. ∗∗∗P < 0.001.

Figure 6.

Figure 6

Clustering heatmaps of 17 hub genes based on (a) TCGA-LIHC, (b) ICGC-LIRI-JP, and (c) GSE14520. Red denotes high expression levels while green denotes low expression levels.

Figure 7.

Figure 7

Boxplots showing the relative expression levels of 17 hub genes across normal liver tissues and cancer tissues with different TNM stages for HBV-HCC. P < 0.05, ∗∗P < 0.01, ∗∗∗P < 0.001, and ∗∗∗∗P < 0.0001.

3.5. Diagnostic Value Assessment

The above findings prompted us to speculate that these hub genes may have good diagnostic efficiency for HBV-HCC, which was verified by plotting ROC curves using TCGA-LIHC and GSE14520. As the result, the area under the curves (AUCs) of all hub genes ranged from 0.91 to 0.99 (Supplementary Figure 5), indicating their excellent diagnostic values for distinguishing tumor tissues and adjacent normal ones. Next, we were curious whether these hub genes also play a role in the early detection of HBV-HCC, which was even more crucial for clinical intervention. Thus, we especially focused on early stage (TNM stage I) cases, and ROC analysis of individual hub genes proved their great potential in the early diagnosis of HBV-HCC (Figure 8).

Figure 8.

Figure 8

The ROC curves and AUC (95% CI) for each of the selected hub genes to evaluate their efficiency in the early diagnosis of HBV-HCC based on (a–c) TCGA-LIHC cohort and (d–f) GSE14520 cohort. Colored lines denote sensitive curves for each hub gene, and grey line denotes the identify line. ROC: receiver operating characteristic; AUC: area under the curve.

3.6. Survival Analysis

To elucidate the prognostic values of the hub genes, we performed the OS survival analysis with the cohort from GSE14520, including 213 HBV-HCC patients. The resample-based log-rank test resulted in two robust prognostic hub genes (with high repentance frequency of >0.75 showing significant during resampling): TOP2A and KIF11. Then, a prognostic signature was built with these two hub genes, and all patients were assigned to high- or low-risk group based on the median risk score (0.997189). The result of risk score for each patient was shown in Figure 9(a). Figure 9(b) suggested the significant difference of risk scores between high- or low-risk group, and OS survival rate was significant higher in low-risk group by chi-square analysis (Figure 9(c)). As for prediction accuracy, we plotted the time-dependent roc curves by risk score, and the risk signature showed the AUC values at 1, 3, and 5 years was 0.626, 0.643, and 0.693, respectively (Figure 9(d)). For Kaplan-Meier survival analyses, both the OS survival rate and the recurrence-free survival rate were shown to be significantly higher in low-risk group than high-risk group (Figures 9(e) and 9(f)).

Figure 9.

Figure 9

Prognostic assessments of hub genes by GSE14520 cohort. (a) Risk score, survival outcome, and hub gene expression values for each patient in high- and low-risk groups. (b) Comparison of risk scores between high- and low-risk groups. ∗∗∗∗P < 0.0001. (c) Distribute of different survival status in high- and low-risk score groups. Statistical significance was determined by chi-square test. (d) The time-dependent ROC curves for the two-hub gene-based signature at 1, 3, and 5 years to assess accuracy of prognostic prediction. (e, f) Kaplan-Meier curves of (e) OS and (f) RFS in HBV-HCC patients based on the risk score classification. P was calculated by the log-rank test, and P < 0.05 was considered statistically significant. HBV-HCC: HBV-related HCC; OS: overall survival; RFS: relapse-free survival.

Furthermore, we stratified patients into different risk subgroups by several clinicopathologic parameters (age, gender, ALT, AFP, level, main tumor size, cirrhosis, BCLC stage, TNM stage, and CLIP stage) using the two-hub gene-based signature. Interestingly, results showed that our classifier was still statistically significant in most subgroups (Supplementary Figure 6), suggesting its good potential and possible application to add prognostic value to the existing staging systems.

Moreover, risk levels based on the two-hub gene-based signature also suggested to be significantly associated with other aggressive clinicopathological parameters, such as TNM stage (P = 0.002), BCLC stage (P = 0.013), CLIP stage (P = 0.003), and alpha fetal protein (AFP) level (P = 0.003) by chi-square test (Table 2). Multivariate Cox regression analysis indicated that after adjusting for main tumor size, cirrhosis, TNM stage, BCLC stage, CLIP stage, and AFP level, the risk signature was still significantly correlated with OS survival outcome (HR = 1.807, 95%CI = 1.126–2.899, and P = 0.014), implying that the two-hub gene signature served as an independent prognostic factor for HBV-HCC patients (Table 3).

Table 2.

Clinicopathological features of HBV-HCC patients according to the two-hub gene-based signature in the cohort of GSE14520.

Characteristics Low risk (n = 107) High risk (n = 106) P value
Gender
Female 17 13 0.447
Male 90 93
Age
<50 48 47 0.939
≥50 59 59
ALT level
High (>50 U/L) 41 47 0.518
Low (≤50 U/L) 66 59
Main tumor size
Large (>5 cm) 35 40 0.412
Small (≤5 cm) 72 65
Cirrhosis
Yes 96 100 0.213
No 11 6
TNM stage
I 56 33 0.002
II+III 51 73
BCLC stage
0-A 90 74 0.013
B-C 17 32
CLIP stage
0 58 36 0.003
≥1 49 70
AFP level
High (≥300 ng/ml) 37 58 0.007
Low (<300 ng/ml) 67 49

The information was missing for certain patients. P < 0.05 by χ2 test. HBV-HCC: HBV-related HCC; ALT: alanine aminotransferase; AFP: alpha fetoprotein; TNM: tumor-node-metastasis; BCLC: Barcelona Clinic Liver Cancer; CLIP: Cancer of the Liver Italian Program.

Table 3.

Univariate and multivariate analyses of the cohort GSE14520.

Variables Group Univariate analysis Multivariate analysis
HR 95% CI P value HR 95% CI P value
Risk level High/low 2.293 1.461-3.600 <0.001 1.807 1.126-2.899 0.014
Gender Male/female 0.599 0.289-1.242 0.168 - - -
Age ≥50/<50 1.190 0.771-1.837 0.432 - - -
ALT level (U/L) ≥50/<50 1.107 0.716-1.712 0.648 - - -
Main tumor size (cm) ≥5/<5 2.063 1.330-3.199 0.001 1.269 0.77-2.091 0.349
Cirrhosis Yes/no 0.224 0.055-0.910 0.036 0.349 0.084-1.440 0.145
TNM stage II+III/I 0.348 0.210-0.577 <0.001 0.579 0.326-1.027 0.061
BCLC stage B-C/0-A 0.283 0.18-0.445 <0.001 0.478 0.27-0.848 0.012
CLIP stage ≥1/0 2.133 1.338-3.398 0.001 1.129 0.549-2.322 0.741
AFP level (ng/ml) ≥300/<300 1.572 1.019-2.426 0.041 1.006 0.541-1.869 0.986

Significant in univariate Cox regression and were further enrolled for multivariable Cox regression analysis. P < 0.05 by both univariate and multivariate analyses. AFP: alpha fetoprotein; TNM: tumor-node-metastasis; ALT: alanine aminotransferase; BCLC: Barcelona Clinic Liver Cancer; CLIP: Cancer of the Liver Italian Program.

3.7. Gene Set Enrichment Analysis

GSEA is a powerful statistical approach to identify classes of genes that are significantly associated with different disease phenotypes. Thus, inspired by the results of the prognostic analysis, we operated the GSEA to investigate the molecular mechanisms between high- and low-risk groups divided by the risk signature. GSEA was performed by KEGG at first, and top significant pathways were identified as KEGG_spliceosome (P < 0.001 and NES = 2.149), KEGG_cell_cycle (P < 0.001 and NES = 1.959), and KEGG_oocyte_meiosis (P < 0.001 and NES = 1.934) (Figure 10(a)). For the gene set distribution from molecular function component of the GO database, GO_translational_initiation (P < 0.001 and NES = 2.233), GO_meiotic_cell_cycle (P < 0.001 and NES = 2.191), and GO_RNA_splicing_via_transesterification_reactions (P < 0.001 and NES = 2.149) were ranked as the most significant terms (Figure 10(b), Supplementary Tables 2–3). These results suggested that this risk signature may exert a poorer survival for HBV-HCC patient via known crucial cancer pathways.

Figure 10.

Figure 10

GSEA results of high- or low-risk groups divided by the two-hub gene-based signature in GSE14520 cohort. (a) Results of GO terms enriched in highly risk group vs. low-risk group. (b) Results of KEGG pathways enriched in highly risk group vs. low-risk group. GSEA: Gene Set Enrichment Analyses.

3.8. Prediction of miRNA-mRNA Interaction Network

MiRNAs have been extensively documented to regulate tumorigenesis at transcriptome level or posttranscriptional level in various cancers. Thus, we predicted the candidate miRNAs that may target these hub genes in human liver by using the miRNet online platform. Then, the miRNA-hub gene interaction network was established by Cytoscape (Supplementary Figure 7). There were 55 nodes and 77 edges involved in the network, including 16 hub genes (NDC80 was excluded) and 39 miRNAs. In the network, PRIM2, CCNA2, and RRM2 were recognized as the top 3 hub genes that had most neighbors of miRNAs, while hsa-mir-34a-5p, hsa-mir-192-5p, and hsa-mir-24-3p were the top 3 miRNAs with most targeted hub genes. The miRNA-hub gene network based on their regulatory relationships may provide a forceful basis for the further exploration of the molecular mechanisms of HBV-HCC.

4. Discussion

Despite the great advances in clinical management and remarkable progress in understanding the pathogenesis of HCC, the incidence and mortality rates of this malignant cancer remain unacceptably high. Chronic hepatitis B is the primary etiological factor for HCC in China and other parts of Asia [4]. With the identification of diagnostic and prognostic biomarkers of HBV-HCC, we have attempted to provide valuable insight into the molecular mechanism of HBV-HCC during tumorigenesis and development.

Public databases like GEO, TCGA, and ICGC that deposit massive datasets of high-throughput technologies like microarray and NGS platforms have facilitated the strategies for mining of integrated data, which could overcome the limitations of the small sample size in one individual cohort and heterogeneity among different studies. In the present study, we conducted DEG screening based on the transcription profiling data of GSE84402, GSE25097, GSE94660, and GSE121248 at first, and there were 309 overlapped DEGs were identified between the tumor and normal patients, comprising 100 upregulated genes and 209 downregulated genes. The result of GO analysis revealed that the upregulated DEGs were significantly enriched in the regulation of cell division activities (such as mitotic nuclear division, chromosome segregation, and nuclear division), while the downregulated DEGs were closely related to multiple cellular “catabolic process” and “biosynthetic processes” (such as organic acid catabolic process and carboxylic acid biosynthetic process). KEGG pathway analysis revealed that the upregulated DEGs were mainly involved in the cell cycle, DNA replication, oocyte meiosis, and others. Meanwhile, the downregulated DEGs were relevant to chemical carcinogenesis, retinol metabolism, tryptophan metabolism, bile secretion, and so forth.

After that, we utilized a combined strategy to identify the 17 hub genes (MCM6, MCM3, UBE2C, RRM2, RFC4, TOP2A, MCM2, CDK1, PRIM1, CCNB1, HMMR, CDC20, CCNA2, NEK2, NDC80, DLGAP5, and KIF11) from the PPI network established by the STRING database. Most of these hub genes were previously reported as oncogenes, therapeutic targets, or potential biomarkers in HCC [3551]. We then validate the dysregulated mRNA expression levels of these hub genes using GSE14520, TCGA-LIHC, and ICGC-LIRI-JP datasets. The positive coexpression relationships of the hub genes were proved by Pearson's correlation analysis, implying the highly active interactions during the tumorigenesis. Boxplots were used to demonstrate the relevance between gene expression levels and pathological stages; however, most of the hub genes showed no significant association between early stages and late stages, stimulating us to propose that they may be used in early diagnosis for HBV-HCC. As the consequence, the ROC curves of all the 17 hub genes showed high diagnostic values for TNM stage I patients and adjacent normal tissues, suggesting their good potential in further exploiting early diagnosis, including related miRNAs, circRNAs, and aberrantly methylation markers that based on these hub genes.

For the overall survival analysis, with the aid of cohort from GSE14520, we established a risk signature with two prognostic hub genes: TOP2A and KIF11, which was demonstrated to be an independent prognostic predictor for HBV-HCC patient by univariate and multivariate analyses and was significantly correlated with tumor staging systems and AFP levels. Our GSEA result also revealed key molecular functions and KEGG pathways (especially for gene sets related to cell cycle) that involved in carcinogenesis that may be associated with OS survival stratification by the risk signature for HBV-HCC patients.

TOP2A, encoded by TOP2A gene, is a DNA topoisomerase that participates in many processes during transcription and replication through altering DNA topological structure. Previous studies confirmed that the aberrant TOP2A expression was observed in various cancer subtypes, such as breast [52], colon [53], ovarian [54], gastric [55], prostate cancer [56], and HCC [40]. In the current study, TOP2A showed high expression value in HBV-HCC, which agreed with previous results. Recent studies substantiated its oncogenic role during the tumorigenesis and development of many malignancies [52, 53]. For example, Zhang et al. found that knockdown of TOP2A could induce apoptosis and suppress cell proliferation and invasion via Akt and ERK signaling pathways in colon cancer [53]. These findings strongly imply that TOP2A may be served as an anticancer therapeutic target for clinical treatment. Actually, several TOP2A inhibitors have been approved by the US Food and Drug Administration [57], and other compounds were tested in multiple trials [58, 59]. A study on adrenocortical carcinoma (ACC) manifested that aclarubicin was the best agent of 14 TOP2A inhibitors that can decrease proliferation and tumor spheroid size in locally advanced and metastatic ACC [58]. But candidate TOP2A inhibitors with high efficacy for HCC were still rare. Considerable effort was required to explore effective reagents for HCC. Furthermore, early discoveries revealed that the elevated TOP2A expression implicated the worse overall survival for multiple cancers [52, 54, 56]. In accordance with these findings, our present study demonstrated that upregulation of TOP2A was closely related to the poor outcome for HBV-HCC patients.

KIF11 or as BimC, Eg5, belonging to kinesin superfamily, which function as nanomotors to mediate various kinds of spindle dynamics, is well known to play an essential role during cell mitosis, including chromosome positioning, bipolar spindle formation and maintenance, and antiparallel microtubule sliding, as well as microtubule crosslinking [6062]. It may also increase translational efficiency by mediating the association of ribosomes and microtubules [63]. Recently, KIF11 has been reported to be a novel potential candidate prognostic biomarker or therapeutic target in human cancers including breast cancer [64], ovarian cancer [65], oral cancer [66], peripheral nerve sheath tumors [67, 68], and lung cancer [69]. In line with previous findings, we identified KIF11 as an oncogene during tumorigenesis of HBV-HCC, and its expression level was significantly higher in tumor samples compared with adjacent normal tissues. It was also proved to hold good potential for early detection of HBV-HCC. For prognostic analysis, KIF11 and TOP2A performed jointly well in predicting prognosis by multivariate regression. In fact, a growing body of well-known KIF11 inhibitors such as monastrol [70], S-trityl-L-cysteine (STLC) [71], HR22C16 [72], and CK0106023 [73] have been extensively studied, and small-molecule inhibitors such as Ispinesib (SB-715992) [74], Filanesib (ARRY-520) [75], and litronesib (LY2523355) [76] have entered clinical trials. However, although these inhibitors have demonstrated excellent efficacies in certain human cancer with no neurotoxicity [75, 77], none have been used as a marketed anticancer agent; thus, further investigation is warranted to in the development of KIF11-based anticancer drugs.

There are several limitations to our study. First, a larger cohort is required to further validate these results. Second, because of the cautious approach adopted in the study, we failed to enroll the adequate number of HBV-HCC cases with complete clinical characteristics and sufficient long-term follow-up, which did not allow us to conduct informative analyses of better risk stratification and validation. Third, further in-depth studies are necessary to confirm the oncogenic roles of the selected hub genes via in vitro and in vivo assays.

5. Conclusion

In summary, with the integrated bioinformatics analysis, 309 robust DEGs involved in HBV-HCC were screened, which is helpful for a better understanding of molecular pathogenesis and tumorigenesis of HBV-HCC. Based on a series of comprehensive downstream analysis, 17 potential hub genes were identified that may play critical roles in the development of HBV-HCC. TOP2A and KIF11 can be jointly used to predict overall survival for HBV-HCC, and all the hub genes may hold good potential in exploring early detection biomarkers and therapeutic targets for HBV-HCC.

Acknowledgments

This study was funded by a Research Project of a Pilot Study on Establishing Natural Population Cohorts in Western China (No. 2019YFS0039). We sincerely thank TCGA, ICGC, and GEO databases for their data sharing. In addition, we also acknowledge invaluable help from peer reviewers.

Data Availability

The data used to support our results are available at the GEO (https://www.ncbi.nlm.nih.gov/geo/), TCGA (https://portal.gdc.cancer.gov/), and ICGC (https://icgc.org/).

Ethical Approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Conflicts of Interest

The authors declare that they have no conflict of interest.

Authors' Contributions

Yuqin Tang and Yongqiang Zhang performed the research and wrote the manuscript. Xun Hu conceived, designed, and supervised the research and provided some useful suggestions. All authors read and approved the final version of the manuscript. Yuqin Tang and Yongqiang Zhang contributed equally to this work.

Supplementary Materials

Supplementary Materials

“Supplementary Figure 1: flow chart of the prognostic key gene identification with “multi-split” method. Supplementary Figure 2: validation of 309 overlapping DEGs by integrating analysis with batch removal. Supplementary Figure 3: PPI network of DEGs. Supplementary Figure 4: boxplots showing the relative expression levels of 17 hub genes across normal liver tissues and cancer tissues with different BCLC stages for HBV-HCC. Supplementary Figure 5: the ROC curves of the selected hub genes to evaluate the diagnostic efficiency for HBV-HCC based on TCGA-LIHC cohort (A-C) and GSE14520 (D-F) cohort. Supplementary Figure 6: stratification analysis of the two-hub gene-based classifier. Supplementary Figure 7: miRNA-mRNA interaction network predicted by miRNet.”. Supplementary Table 1: validation of 309 differentially expressed genes (DEGs) by integrating analysis. Supplementary Table 2: Gene Set Enrichment Analysis (GSEA) result of the two-hub gene-based prognostic signature by the KEGG database. Supplementary Table 3: Gene Set Enrichment Analysis (GSEA) result of the two-hub gene-based prognostic signature by molecular function component of the GO database.

References

  • 1.Bray F., Ferlay J., Soerjomataram I., Siegel R. L., Torre L. A., Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: a Cancer Journal for Clinicians. 2018;68(6):394–424. doi: 10.3322/caac.21492. [DOI] [PubMed] [Google Scholar]
  • 2.Zhou M., Wang H., Zeng X., et al. Mortality, morbidity, and risk factors in China and its provinces, 1990-2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet. 2019;394(10204):1145–1158. doi: 10.1016/S0140-6736(19)30427-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.de Martel C., Maucort-Boulch D., Plummer M., Franceschi S. World-wide relative contribution of hepatitis B and C viruses in hepatocellular carcinoma. Hepatology. 2015;62(4):1190–1200. doi: 10.1002/hep.27969. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Yang P., Li Q. J., Feng Y., et al. TGF-β-miR-34a-CCL22 signaling-induced Treg cell recruitment promotes venous metastases of HBV-positive hepatocellular carcinoma. Cancer Cell. 2012;22(3):291–303. doi: 10.1016/j.ccr.2012.07.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Feng W. W., Wilkins O., Bang S., et al. CD36-mediated metabolic rewiring of breast Cancer cells promotes resistance to HER2-targeted therapies. Cell Reports. 2019;29(11):3405–3420.e5. doi: 10.1016/j.celrep.2019.11.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Diaz-Lagares A., Mendez-Gonzalez J., Hervas D., et al. A novel epigenetic signature for early diagnosis in lung cancer. Clinical Cancer Research. 2016;22(13):3361–3371. doi: 10.1158/1078-0432.CCR-15-2346. [DOI] [PubMed] [Google Scholar]
  • 7.Hao X., Luo H., Krawczyk M., et al. DNA methylation markers for diagnosis and prognosis of common cancers. Proceedings of the National Academy of Sciences of the United States of America. 2017;114(28):7414–7419. doi: 10.1073/pnas.1703577114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Zhang Y. Q., Wang W. Y., Xue J. X., et al. MicroRNA expression profile on solid subtype of invasive lung adenocarcinoma reveals a panel of four miRNAs to be associated with poor prognosis in Chinese patients. Journal of Cancer. 2016;7(12):1610–1620. doi: 10.7150/jca.14923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Guinney J., Dienstmann R., Wang X., et al. The consensus molecular subtypes of colorectal cancer. Nature Medicine. 2015;21(11):1350–1356. doi: 10.1038/nm.3967. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Liu W. T., Wang Y., Zhang J., et al. A novel strategy of integrated microarray analysis identifies CENPA, CDK1 and CDC20 as a cluster of diagnostic biomarkers in lung adenocarcinoma. Cancer Letters. 2018;425:43–53. doi: 10.1016/j.canlet.2018.03.043. [DOI] [PubMed] [Google Scholar]
  • 11.Chen R., Khatri P., Mazur P. K., et al. A meta-analysis of lung cancer gene expression identifies PTK7 as a survival gene in lung adenocarcinoma. Cancer Research. 2014;74(10):2892–2902. doi: 10.1158/0008-5472.CAN-13-2775. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Zhang Y., He W., Zhang S. Seeking for correlative genes and signaling pathways with bone metastasis from breast cancer by integrated analysis. Frontiers in Oncology. 2019;9 doi: 10.3389/fonc.2019.00138. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Zhou Y., Layton O., Hong L. Identification of genes and pathways involved in ovarian epithelial cancer by bioinformatics analysis. Journal of Cancer. 2018;9(17):3016–3022. doi: 10.7150/jca.26133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Ma Q., Xu Y., Liao H., et al. Identification and validation of key genes associated with non-small-cell lung cancer. Journal of Cellular Physiology. 2019;234(12):22742–22752. doi: 10.1002/jcp.28839. [DOI] [PubMed] [Google Scholar]
  • 15.Li D., Hao X., Song Y. An integrated analysis of key microRNAs, regulatory pathways and clinical relevance in bladder cancer. Oncotargets and Therapy. 2018;11:3075–3085. doi: 10.2147/OTT.S166506. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Xie S., Jiang X., Zhang J., et al. Identification of significant gene and pathways involved in HBV-related hepatocellular carcinoma by bioinformatics analysis. PeerJ. 2019;7, article e7408 doi: 10.7717/peerj.7408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Wang H., Huo X., Yang X. R., et al. STAT3-mediated upregulation of lncRNA HOXD-AS1 as a ceRNA facilitates liver cancer metastasis by regulating SOX4. Molecular Cancer. 2017;16(1) doi: 10.1186/s12943-017-0680-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Ivanovska I., Zhang C., Liu A. M., et al. Gene signatures derived from a c-MET-driven liver cancer mouse model predict survival of patients with hepatocellular carcinoma. PLoS One. 2011;6(9, article e24582) doi: 10.1371/journal.pone.0024582. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Yoo S., Wang W., Wang Q., et al. A pilot systematic genomic comparison of recurrence risks of hepatitis B virus-associated hepatocellular carcinoma with low- and high-degree liver fibrosis. BMC Medicine. 2017;15(1) doi: 10.1186/s12916-017-0973-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Wang S. M., Ooi L. L., Hui K. M. Identification and validation of a novel gene signature associated with the recurrence of human hepatocellular carcinoma. Clinical Cancer Research. 2007;13(21):6275–6283. doi: 10.1158/1078-0432.CCR-06-2236. [DOI] [PubMed] [Google Scholar]
  • 21.Yu G., Wang L. G., Han Y., He Q. Y. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012;16(5):284–287. doi: 10.1089/omi.2011.0118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Chin C.-H., Chen S.-H., Wu H.-H., Ho C.-W., Ko M. T., Lin C. Y. cytoHubba: identifying hub objects and sub-networks from complex interactome. BMC Systems Biology. 2014;8(Supplement 4) doi: 10.1186/1752-0509-8-S4-S11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Bader G. D., Hogue C. W. An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinformatics. 2003;4(1) doi: 10.1186/1471-2105-4-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Shannon P., Markiel A., Ozier O., et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome research. 2003;13(11):2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Irizarry R. A., Hobbs B., Collin F., et al. Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics. 2003;4(2):249–264. doi: 10.1093/biostatistics/4.2.249. [DOI] [PubMed] [Google Scholar]
  • 26.Lian Q., Wang S., Zhang G., et al. HCCDB: a database of hepatocellular carcinoma expression atlas. Genomics, Proteomics & Bioinformatics. 2018;16(4):269–275. doi: 10.1016/j.gpb.2018.07.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Ritchie M. E., Phipson B., Wu D., et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Research. 2015;43(7, article e47) doi: 10.1093/nar/gkv007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Leek J. T., Johnson W. E., Parker H. S., Jaffe A. E., Storey J. D. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. 2012;28(6):882–883. doi: 10.1093/bioinformatics/bts034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Lex A., Gehlenborg N., Strobelt H., Vuillemot R., Pfister H. UpSet: visualization of Intersecting Sets. IEEE Transactions on Visualization and Computer Graphics. 2014;20(12):1983–1992. doi: 10.1109/TVCG.2014.2346248. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Hinder L. M., Park M., Rumora A. E., et al. Comparative RNA-Seq transcriptome analyses reveal distinct metabolic pathways in diabetic nerve and kidney disease. Journal of Cellular and Molecular Medicine. 2017;21(9):2140–2152. doi: 10.1111/jcmm.13136. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Walter W., Sánchez-Cabo F., Ricote M. GOplot: an R package for visually combining expression data with functional analysis. Bioinformatics. 2015;31(17):2912–2914. doi: 10.1093/bioinformatics/btv300. [DOI] [PubMed] [Google Scholar]
  • 32.Robin X., Turck N., Hainard A., et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics. 2011;12(1) doi: 10.1186/1471-2105-12-77. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Therneau T. M., Lumley T. L. Package ‘survival’. R Top Doc. 2015;128 [Google Scholar]
  • 34.Fan Y., Siklenka K., Arora S. K., Ribeiro P., Kimmins S., Xia J. miRNet - dissecting miRNA-target interactions and functional associations through network-based visual analysis. Nucleic Acids Research. 2016;44(W1):W135–W141. doi: 10.1093/nar/gkw288. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Liu M., Hu Q., Tu M., et al. MCM6 promotes metastasis of hepatocellular carcinoma via MEK/ERK pathway and serves as a novel serum biomarker for early recurrence. Journal of Experimental & Clinical Cancer Research. 2018;37(1) doi: 10.1186/s13046-017-0669-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Yang Q., Xie B., Tang H., et al. Minichromosome maintenance 3 promotes hepatocellular carcinoma radioresistance by activating the NF-κB pathway. Journal of Experimental & Clinical Cancer Research. 2019;38(1) doi: 10.1186/s13046-019-1241-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Xiong Y., Lu J., Fang Q., et al. UBE2C functions as a potential oncogene by enhancing cell proliferation, migration, invasion, and drug resistance in hepatocellular carcinoma cells. Bioscience Reports. 2019;39(4) doi: 10.1042/BSR20182384. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Satow R., Shitashige M., Kanai Y., et al. Combined functional genome survey of therapeutic targets for hepatocellular carcinoma. Clinical Cancer Research. 2010;16(9):2518–2528. doi: 10.1158/1078-0432.CCR-09-2214. [DOI] [PubMed] [Google Scholar]
  • 39.Arai M., Kondoh N., Imazeki N., et al. The knockdown of endogenous replication factor C4 decreases the growth and enhances the chemosensitivity of hepatocellular carcinoma cells. Liver International. 2009;29(1):55–62. doi: 10.1111/j.1478-3231.2008.01792.x. [DOI] [PubMed] [Google Scholar]
  • 40.Wong N., Yeo W., Wong W. L., et al. TOP2A overexpression in hepatocellular carcinoma correlates with early age onset, shorter patients survival and chemoresistance. International Journal of Cancer. 2009;124(3):644–652. doi: 10.1002/ijc.23968. [DOI] [PubMed] [Google Scholar]
  • 41.Sun M., Wu G., Li Y., et al. Expression profile reveals novel prognostic biomarkers in hepatocellular carcinoma. Frontiers in Bioscience (Elite Edition) 2010;2:829–840. doi: 10.2741/e144. [DOI] [PubMed] [Google Scholar]
  • 42.Sun M., Mo W., Fu X., et al. System biology analysis of cell cycle pathway involved in hepatocellular carcinoma. Frontiers in Bioscience (Scholar Edition) 2010;S2(3):1127–1144. doi: 10.2741/s122. [DOI] [PubMed] [Google Scholar]
  • 43.Wu C. X., Wang X. Q., Chok S. H., et al. Blocking CDK1/PDK1/β-catenin signaling by CDK1 inhibitor RO3306 increased the efficacy of sorafenib treatment by targeting cancer stem cells in a preclinical model of hepatocellular carcinoma. Theranostics. 2018;8(14):3737–3750. doi: 10.7150/thno.25487. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Gu J., Liu X., Li J., He Y. MicroRNA-144 inhibits cell proliferation, migration and invasion in human hepatocellular carcinoma by targeting CCNB1. Cancer Cell International. 2019;19(1) doi: 10.1186/s12935-019-0729-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Shen S., Kong J., Qiu Y., Yang X., Wang W., Yan L. Identification of core genes and outcomes in hepatocellular carcinoma by bioinformatics analysis. Journal of Cellular Biochemistry. 2018;120(6):10069–10081. doi: 10.1002/jcb.28290. [DOI] [PubMed] [Google Scholar]
  • 46.Li J., Gao J. Z., Du J., Huang Z. X., Wei L. X. Increased CDC20 expression is associated with development and progression of hepatocellular carcinoma. International Journal of Oncology. 2014;45(4):1547–1555. doi: 10.3892/ijo.2014.2559. [DOI] [PubMed] [Google Scholar]
  • 47.Yang F., Hu Y., Liu H. X., Wan Y. J. MiR-22-silenced cyclin A expression in colon and liver cancer cells is regulated by bile acid receptor. The Journal of Biological Chemistry. 2015;290(10):6507–6515. doi: 10.1074/jbc.M114.620369. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Fu S.-J., Chen J., Ji F., et al. MiR-486-5p negatively regulates oncogenic NEK2 in hepatocellular carcinoma. Oncotarget. 2017;8(32, article 17635):52948–52959. doi: 10.18632/oncotarget.17635. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Huang L. Y., Chang C. C., Lee Y. S., et al. Inhibition of Hec1 as a novel approach for treatment of primary liver cancer. Cancer Chemotherapy and Pharmacology. 2014;74(3, article 2540):511–520. doi: 10.1007/s00280-014-2540-7. [DOI] [PubMed] [Google Scholar]
  • 50.Liao W., Liu W., Yuan Q., et al. Silencing of DLGAP5 by siRNA significantly inhibits the proliferation and invasion of hepatocellular carcinoma cells. PLoS One. 2013;8(12, article e80789) doi: 10.1371/journal.pone.0080789. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Chen J., Li S., Zhou S., et al. Kinesin superfamily protein expression and its association with progression and prognosis in hepatocellular carcinoma. Journal of Cancer Research and Therapeutics. 2017;13(4):651–659. doi: 10.4103/jcrt.jcrt_491_17. [DOI] [PubMed] [Google Scholar]
  • 52.Deng J.-L., Xu Y.-H., Wang G. Identification of potential crucial genes and key pathways in breast cancer using bioinformatic analysis. Frontiers in Genetics. 2019;10 doi: 10.3389/fgene.2019.00695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Zhang R., Xu J., Zhao J., Bai J. H. Proliferation and invasion of colon cancer cells are suppressed by knockdown of TOP2A. Journal of Cellular Biochemistry. 2018;119(9):7256–7263. doi: 10.1002/jcb.26916. [DOI] [PubMed] [Google Scholar]
  • 54.Shen J., Yu S., Sun X., Yin M., Fei J., Zhou J. Identification of key biomarkers associated with development and prognosis in patients with ovarian carcinoma: evidence from bioinformatic analysis. Journal of ovarian research. 2019;12(1) doi: 10.1186/s13048-019-0578-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Liu H. Q., Zhang S. L., Song S. HER-2/neu and TOPIIa expression in gastric cancer reflect disease severity. Hepato-Gastroenterology. 2012;59(116):1290–1293. doi: 10.5754/hge11844. [DOI] [PubMed] [Google Scholar]
  • 56.He Z., Tang F., Lu Z., et al. Analysis of differentially expressed genes, clinical value and biological pathways in prostate cancer. American Journal of Translational Research. 2018;10(5):1444–1456. [PMC free article] [PubMed] [Google Scholar]
  • 57.Chen T., Sun Y., Ji P., Kopetz S., Zhang W. Topoisomerase IIα in chromosome instability and personalized cancer therapy. Oncogene. 2015;34(31):4019–4031. doi: 10.1038/onc.2014.332. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Jain M., Zhang L., He M., Zhang Y. Q., Shen M., Kebebew E. TOP2A is overexpressed and is a therapeutic target for adrenocortical carcinoma. Endocrine-Related Cancer. 2013;20(3):361–370. doi: 10.1530/ERC-12-0403. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Belluti S., Basile V., Benatti P., Ferrari E., Marverti G., Imbriano C. Concurrent inhibition of enzymatic activity and NF-Y-mediated transcription of Topoisomerase-IIα by bis-demethoxycurcumin in cancer cells. Cell Death & Disease. 2013;4(8, article e756) doi: 10.1038/cddis.2013.287. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Rapley J., Nicolas M., Groen A., et al. The NIMA-family kinase Nek6 phosphorylates the kinesin Eg5 at a novel site necessary for mitotic spindle formation. Journal of Cell Science. 2008;121(23):3912–3921. doi: 10.1242/jcs.035360. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Sharp D. J., Yu K. R., Sisson J. C., Sullivan W., Scholey J. M. Antagonistic microtubule-sliding motors position mitotic centrosomes in Drosophila early embryos. Nature Cell Biology. 1999;1(1):51–54. doi: 10.1038/9025. [DOI] [PubMed] [Google Scholar]
  • 62.Blangy A., Lane H. A., d'Hérin P., Harper M., Kress M., Niggt E. A. Phosphorylation by p34cdc2 regulates spindle association of human Eg5, a kinesin-related motor essential for bipolar spindle formation in vivo. Cell. 1995;83(7):1159–1169. doi: 10.1016/0092-8674(95)90142-6. [DOI] [PubMed] [Google Scholar]
  • 63.Bartoli K. M., Jakovljevic J., Woolford J. L., Jr., Saunders W. S. Kinesin molecular motor Eg5 functions during polypeptide synthesis. Molecular Biology of the Cell. 2011;22(18):3420–3430. doi: 10.1091/mbc.e11-03-0211. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Zhou J., Chen W. R., Yang L. C., et al. KIF11 functions as an oncogene and is associated with poor outcomes from breast cancer. Cancer Research and Treatment. 2019;51(3):1207–1221. doi: 10.4143/crt.2018.460. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Liu J., Meng H., Li S., et al. Identification of potential biomarkers in association with progression and prognosis in epithelial ovarian cancer by integrated bioinformatics analysis. Frontiers in Genetics. 2019;10 doi: 10.3389/fgene.2019.01031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Daigo K., Takano A., Thang P. M., et al. Characterization of KIF11 as a novel prognostic biomarker and therapeutic target for oral cancer. International Journal of Oncology. 2018;52(1):155–165. doi: 10.3892/ijo.2017.4181. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Jungwirth G., Yu T., Moustafa M., et al. Identification of KIF11 as a novel target in meningioma. Cancers. 2019;11(4) doi: 10.3390/cancers11040545. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Terribas E., Fernández M., Mazuelas H., et al. KIF11 and KIF15 mitotic kinesins are potential therapeutic vulnerabilities for malignant peripheral nerve sheath tumors. Neuro-Oncology Advances. 2020;2(Supplement 1):i62–i74. doi: 10.1093/noajnl/vdz061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Schneider M. A., Christopoulos P., Muley T., et al. AURKA, DLGAP5, TPX2, KIF11 and CKAP5: five specific mitosis-associated genes correlate with poor prognosis for non-small cell lung cancer patients. International Journal of Oncology. 2017;50(2):365–372. doi: 10.3892/ijo.2017.3834. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Mayer T. U., Kapoor T. M., Haggarty S. J., King R. W., Schreiber S. L., Mitchison T. J. Small molecule inhibitor of mitotic spindle bipolarity identified in a phenotype-based screen. Science. 1999;286(5441):971–974. doi: 10.1126/science.286.5441.971. [DOI] [PubMed] [Google Scholar]
  • 71.DeBonis S., Skoufias D. A., Lebeau L., et al. In vitro screening for inhibitors of the human mitotic kinesin Eg5 with antimitotic and antitumor activities. Molecular Cancer Therapeutics. 2004;3(9):1079–1090. [PubMed] [Google Scholar]
  • 72.Hotha S., Yarrow J. C., Yang J. G., et al. HR22C16: a potent small-molecule probe for the dynamics of cell division. Angewandte Chemie (International Ed. in English) 2003;42(21):2379–2382. doi: 10.1002/anie.200351173. [DOI] [PubMed] [Google Scholar]
  • 73.Sakowicz R., Finer J. T., Beraud C., et al. Antitumor activity of a kinesin inhibitor. Cancer Research. 2004;64(9):3276–3280. doi: 10.1158/0008-5472.CAN-03-3839. [DOI] [PubMed] [Google Scholar]
  • 74.Wakui H., Yamamoto N., Kitazono S., et al. A phase 1 and dose-finding study of LY2523355 (litronesib), an Eg5 inhibitor, in Japanese patients with advanced solid tumors. Cancer Chemotherapy and Pharmacology. 2014;74(1, article 2467):15–23. doi: 10.1007/s00280-014-2467-z. [DOI] [PubMed] [Google Scholar]
  • 75.Shah J. J., Zonder J. A., Cohen A. D., et al. ARRY-520 shows durable responses in patients with relapsed/refractory multiple myeloma in a phase 1 dose-escalation study. blood. 2011;118(21):1860–1860. doi: 10.1182/blood.V118.21.1860.1860. [DOI] [Google Scholar]
  • 76.Knight S. D., Parrish C. A. Recent progress in the identification and clinical evaluation of inhibitors of the mitotic kinesin KSP. Current Topics in Medicinal Chemistry. 2008;8(10):888–904. doi: 10.2174/156802608784911626. [DOI] [PubMed] [Google Scholar]
  • 77.Lee C. W., Bélanger K., Rao S. C., et al. A phase II study of ispinesib (SB-715992) in patients with metastatic or recurrent malignant melanoma: a National Cancer Institute of Canada Clinical Trials Group trial. Investigational New Drugs. 2008;26(3, article 9097):249–255. doi: 10.1007/s10637-007-9097-9. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Materials

“Supplementary Figure 1: flow chart of the prognostic key gene identification with “multi-split” method. Supplementary Figure 2: validation of 309 overlapping DEGs by integrating analysis with batch removal. Supplementary Figure 3: PPI network of DEGs. Supplementary Figure 4: boxplots showing the relative expression levels of 17 hub genes across normal liver tissues and cancer tissues with different BCLC stages for HBV-HCC. Supplementary Figure 5: the ROC curves of the selected hub genes to evaluate the diagnostic efficiency for HBV-HCC based on TCGA-LIHC cohort (A-C) and GSE14520 (D-F) cohort. Supplementary Figure 6: stratification analysis of the two-hub gene-based classifier. Supplementary Figure 7: miRNA-mRNA interaction network predicted by miRNet.”. Supplementary Table 1: validation of 309 differentially expressed genes (DEGs) by integrating analysis. Supplementary Table 2: Gene Set Enrichment Analysis (GSEA) result of the two-hub gene-based prognostic signature by the KEGG database. Supplementary Table 3: Gene Set Enrichment Analysis (GSEA) result of the two-hub gene-based prognostic signature by molecular function component of the GO database.

Data Availability Statement

The data used to support our results are available at the GEO (https://www.ncbi.nlm.nih.gov/geo/), TCGA (https://portal.gdc.cancer.gov/), and ICGC (https://icgc.org/).


Articles from BioMed Research International are provided here courtesy of Wiley

RESOURCES