Abstract
The purpose of this study was to identify the potential diagnostic biomarkers in hepatocellular carcinoma (HCC) by machine learning (ML) and to explore the significance of immune cell infiltration in HCC. From GEO datasets, the microarray datasets of HCC patients were obtained and downloaded. Differentially expressed genes (DEGs) were screened from five datasets of GSE57957, GSE84402, GSE112790, GSE113996, and GSE121248, totalling 125 normal liver tissues and 326 HCC tissues. In order to find the diagnostic indicators of HCC, the LASSO regression and the SVM-RFE algorithms were utilized. The prognostic value of VIPR1 was analyzed. Finally, the difference of immune cell infiltration between HCC tissues and normal liver tissues was evaluated by CIBERSORT algorithm. In this study, a total of 232 DEGs were identified in 125 normal liver tissues and 326 HCC tissues. 11 diagnostic markers were identified by LASSO regression and SVM-RFE algorithms. FCN2, ECM1, VIRP1, IGFALS, and ASPG genes with AUC>0.85 were regarded as candidate biomarkers with high diagnostic value, and the above results were verified in GSE36376. Survival analyses showed that VIPR1 and IGFALS were significantly correlated with the OS, while ASPG, ECM1, and FCN2 had no statistical significance with the OS. Multivariate assays indicated that VIPR1 gene could be used as an independent prognostic factor for HCC, while FCN2, ECM1, IGFALS, and ASPG could not be used as independent prognostic factors for HCC. Immune cell infiltration analyses showed that the expression of VIPR1 in HCC was positively correlated with the levels of several immune cells. Overall, VIPR1 gene can be used as a diagnostic feature marker of HCC and may be a potential target for the diagnosis and treatment of liver cancer in the future.
1. Introduction
Liver cancer is the sixth most common cancer in the world and the fourth leading cause of cancer death [1]. The main risk factors for primary liver cancer are viral infection (mainly HBV and HCV), alcoholic and nonalcoholic steatohepatitis, aflatoxin and parasitic infections, etc. Hepatocellular carcinoma (HCC) is the most common primary liver malignancy, accounting for about 75% of all liver cancers [2, 3]. Although some progresses have been made in the diagnosis and treatments of HCC [4], more than 70%-80% of patients are still diagnosed with liver cancer at a later stage, and more than 15% of patients have extrahepatic spread at the time of diagnosis, which leads to a poor outcome [5–7]. Up to now, early surgical resection of liver cancer is still the most important and effective treatment. If the tumor is detected at an early stage and surgically removed, the 5-year survival rate of patients can exceed 70% and the prognosis is better [8]. In recent decades, alpha-fetoprotein (AFP) has been applied for HCC diagnosis [9]. The abnormal level of AFP in plasma is closely related to the malignancy of liver cancer, but due to insufficient sensitivity and specificity, the effect of early diagnosis of liver cancer is still not ideal [8]. In addition, AFP also increases in other benign and malignant diseases, such as other forms of chronic liver disease, other malignant tumors, pregnancy, and so on [7, 9]. So far, specific biomarkers are still needed to improve the accurate detection of early or very early liver cancer [9]. Therefore, it is particularly important for the early diagnosis of HCC and the search for specific diagnostic markers.
In recent years, bioinformatics has developed rapidly in the field of medicine. Comprehensive bioinformatics analysis and microarray technology can be used to identify various disease-related genes and their biological functions, which is helpful to clarify the potential mechanisms of disease occurrence and development [10–13]. However, with the increase of the amount and complexity of cancer omics research data, cutting-edge technologies such as ML algorithms have been developed to deal with the increasingly large and complex cancer and other multiomics data [14, 15]. ML is a rapidly growing core subfield of artificial intelligence (AI), which enables computer technology to learn from data processing and selfimproves to predict results without explicit programming [16, 17]. There are two goals for the use of ML in medical biology: the first is to make accurate predictions in the absence of experimental data and to guide the follow-up research work through these predictions; the second is to use machine learning to deepen our understanding of medical biology [18]. Today, ML is widely recognized as a significant innovation and a pioneering method in the field of cancer multiomics data analysis. This method, which seeks to predict, diagnose, categorize, and identify biomarkers, plays a vital role in cancer research. The combination of ML and traditional bioinformatics is used to classify and identify diagnostic biomarkers of cancer, which can greatly improve the accuracy of identifying biomarkers of cancer and provide new guidance for the early diagnosis and treatment of cancer [14, 15, 19].
We downloaded several HCC microarray datasets from the GEO database for the identification of DEGs between HCC tissues and nontumor tissues and combined with ML algorithms to identify diagnosis biomarkers in DEGs for further research. Our findings suggested VIPR1 as a novel diagnostic and prognostic biomarker for HCC patients.
2. Materials and Methods
2.1. Data Download and Processing
The microarray datasets of HCC samples were downloaded from the GEO database (http://www.ncbi.nlm.nih.gov). After screening, GSE57957, GSE84402, GSE112790, GSE113996, GSE121248, and GSE36376 datasets were included in our research (Table 1). Among them, GSE57957 included 39 normal specimens and 39 liver cancer specimens, and the platform was from GPL10558; GSE84402 included 14 normal specimens and 14 liver cancer tissues, and the platform was from GPL570; GSE112790 included 15 normal specimens and 183 liver cancer tissues, and the platform was from GPL570; GSE113996 included 20 adjacent nontumor tissues and 20 liver cancer tissues, and the platform was from GPL16043; GSE121248 included 37 adjacent tissues and 70 liver cancer tissues, and the platform was from GPL570; and GSE36376 included 193 adjacent nontumor tissues and 240 liver cancer tissues, and the platform was from GPL10558. Following the instructions in the platform file, the probes were renamed to their corresponding gene terms, and the samples were separated into tumor and normal subgroups. Using the limma and SVA packages of R software, the data of GSE57957, GSE84402, GSE112790, GSE113996, and GSE121248 chips were collected into a metadata queue and corrected in batches. The experimental group consisted of 125 cases of nontumor liver tissues and 326 cases of HCC liver tissues. The verification group consisted of the GSE36376 dataset. The clinical data of HCC patients were downloaded from TCGA datasets.
Table 1.
2.2. Screening of Differentially Expressed Genes (DEGs)
Using the limma package of R software, the experimental group microarray datasets (GSE57957, GSE84402, GSE112790, GSE113996, and GSE121248) were filtered with |log FC| ≥ 1.0 and adj. P.val < 0.05 as the thresholds to obtain DEGs. DEGs were visually drawn with heat map and volcano map through the pheatmap and ggplot2 packages of R software.
2.3. Functional Enrichment Analyses of DEGs
Gene Ontology (GO) is the most comprehensive gene function database at present. To explore the biological pathways and functions of related genes, KEGG can be used for biological interpretation of genomic sequences and other high-throughput data [20] and can provide additional information about how genes interact in pathways [21]. Disease Ontology (DO) integrates data about human diseases. It can be used to annotate the human genome and better show the characteristics of current human diseases to see which diseases are enriched for differential genes [22–24]. In this study, we used the clusterProfiler package to carry out GO, KEGG, and DO enrichment analyses of DEGs under the conditions of P-value < 0.05 and q-value < 0.05 to understand the biological functions and involved diseases of DEGs.
2.4. Gene Set Enrichment Analysis (GSEA)
Enrichment analysis using GSEA was performed in order to identify the functional items that differed most significantly between the HCC group and the control group. The GSEA enrichment analysis of the gene set was performed by the use of the clusterProfiler package of the R software. The c2.cp.kegg.v7.4.symbols.gmt gene set was chosen as the enrichment analysis gene set.
2.5. Identification of Diagnostic Biomarkers by LASSO Regression and SVM-RFE Algorithms
LASSO is a regression-based algorithm performed through successive shrinking operations that minimize regression coefficients to reduce the possibility of overfitting [25], thereby reducing redundancy and eliminating irrelevant genes from these analyses [26]. SVM is one of the best choices for feature selection, and it is also the most commonly used classifier for the microarray data [27]. SVM-RFE is a feature selection algorithm based on SVM [28]. In order to define the minimum classification error and avoid overfitting, the SVM-RFE algorithm is used to select the optimal genes [27, 29]. Therefore, two ML algorithms have been widely used to identify biomarkers and predict accurate and interpretable models. In our research, we used glmnet package to run LASSO regression algorithm and e1071 package to build SVM model. The value with the least cross-validation error was found as the feature markers of HCC by two ML algorithms of LASSO regression and SVM-RFE.
2.6. Diagnostic Value of Feature Biomarkers in HCC
The AUC value was analyzed to determine how valuable each feature gene was as a diagnostic tool. The ROC curve of each feature gene was generated from the mRNA expression data of 125 normal liver tissues and 326 HCC tissues in the experimental group by using the pROC package of R software. The accuracy of disease diagnosis was judged by AUC value, and genes with AUC > 0.85 were identified as high diagnostic value genes for further research.
2.7. Validation of the Differential Expression and Diagnostic Value
We used the GSE36376 dataset to verify expressions and diagnostic values of the candidate genes in order to further investigate whether the candidate genes exhibited a diagnostic significance for HCC patients.
2.8. The Correlations of Diagnostic Genes with OS and Clinicopathological Characteristics of Patients
Analyses of the associations of diagnostic genes with overall survival and clinicopathological characteristics of patients were performed using data from the TCGA-LIHC transcriptome as well as clinical information. Kaplan–Meier methods were applied to investigate the connections between diagnostic genes and OS, and the survminer package of the R software was applied to develop the survival curves. Both of these analyses were performed using the survival package. The univariate and multivariate assays were applied to investigate the predictive power of each diagnostic gene in HCC.
2.9. Evaluation of Immune Cell Infiltration
In recent years, we have come to the realization that immune cell infiltration was involved in tumor progression. The percentage of infiltrating immune cells that can be found in malignant tumors has a direct bearing on the growth and spread of tumors, as well as the development of cancer and the patients' overall prognoses [30, 31]. CIBERSORT is a bioinformatics analysis tool that can evaluate the proportions of immune cells [32]. The content of immune cell infiltration can be obtained from each sample, and then the correlations between immune cells can be analyzed. When comparing the amounts of immune cell infiltration seen in liver cancer tissues and normal liver tissues, we employed the CIBERSORT algorithm to do our comparisons. Following the exclusion of the data containing the value 0, the corrplot package of the R software was utilized to generate the pertinent heat map in order to identify correlations between the immune cells contained within the samples.
2.9.1. Correlations between Diagnostic Genes and Immune Cells
Spearman rank correlation analysis was applied for the study of the relationships between the identified diagnostic markers and immune cells.
2.10. Statistical Analysis
All the above analyses were performed using R (4.1.3) and Perl software. Comparisons between two independent groups were analyzed by Student's t-test. The survival curves were calculated by the Kaplan–Meier method and the difference by the log-rank test. Moreover, the prognostic significance of the related genes was valued by Cox regression analysis. A P < 0.05 was considered statistically significant.
3. Results
3.1. Identification of the DEGs in HCC Datasets
In this study, the data of 125 normal liver tissues and 326 HCC tissues in the experimental group (GSE57957, GSE84402, GSE112790, GSE113996, and GSE121248) were analyzed by the use of the limma package. A total of 232 DEGs were screened, of which 58 genes were significantly upregulated and 174 genes were significantly downregulated (Figures 1(a) and 1(b)).
3.2. Functional Enrichment Analyses of DEGs
The biological functions of DEGs were analyzed by GO, KEGG, and DO enrichment analyses. GO enrichment analysis showed that BP of DEGs was enriched in terpenoid metabolic process, olefinic compound metabolic process, amino acid metabolic process, and small molecule catabolic process. The CC is mainly enriched in collagen-containing extracellular matrix; MF was remarkably enriched in oxidoreductase activity (Figure 2(a)). KEGG assays revealed that DEGs were mainly concentrated in retinol metabolism, cytochrome P450, chemical carcinogenesis-DNA adducts, various amino acids, and other biological metabolic activities, etc (Figure 2(b)). DO assays indicated that DEGs were involved in cancer-related diseases such as hepatitis, nonsmall cell lung carcinoma, liver cirrhosis, biliary tract cancer, cholangiocarcinoma, esophageal cancer, and so on (Figure 2(c)).
3.3. GSEA Enrichment Analysis
The GSEA enrichment analysis showed that complement and coagulation cascades, cytochrome P450, glycine serine and threonine metabolism, retinol metabolism, and tryptophan metabolism were highly active in normal liver tissues, while spliceosome, ribosome, proteasome, DNA replication, and cell cycle were highly active in HCC samples (Figures 3(a) and 3(b)).
3.4. Identification of Feature Biomarkers
We used two different ML algorithms of LASSO regression and SVM-RFE to identify potential biomarkers of HCC. 26 genes were obtained as diagnostic markers of HCC by using LASSO regression algorithm to narrow the range of DEGs (Figure 4(a)). 31 feature genes in DEGs were identified by the SVM-RFE algorithm (Figure 4(b)). Then, 11 diagnostic feature genes were obtained by intersecting the two sets of algorithms (Figure 4(c)).
3.5. Diagnostic Value of Feature Biomarkers in HCC
With AUC > 0.85 as the threshold, five diagnostic feature genes of FCN2, ECM1, VIPR1, IGFALS, and ASPG were identified for further research. As shown in the Figures 5(a)–5(e), the AUC values of FCN2, ECM1, VIPR1, IGFALS, and ASPG were 0.877 (95% CI 0.832-0.915), 0.870 (95% CI 0.827-0.908), 0.871 (95% CI 0.827-0.912), 0.856 (95% CI 0.811-0.899), and 0.857 (95% CI 0.813-0.898), which indicated that these five feature genes had a high diagnostic ability.
3.6. Validation of the Differential Expressions and Diagnostic Values
In order to obtain more reliable results, we used the GSE36376 dataset to validate our results. The results showed that the expressions of FCN2, ECM1, VIPR1, IGFALS, and ASPG were significantly downregulated in HCC (P < 0.05, Figures 6(a)–6(e)), and all of them had high diagnostic values (AUC > 0.85) (Figures 6(f)–6(j)).
3.7. The Correlations of Diagnostic Genes with OS and Clinicopathological Characteristics of Patients
First of all, we analyzed the relationships between diagnostic genes and OS of patients. The results showed that the patients with lower expression of VIPR1 and IGFALS predicted shorter OS (P < 0.05), while the expressions of FCN2, ECM1, and ASPG had no statistical significance with OS (Figure 7(a)). To further screen the diagnostic genes with clinical prognostic value, we analyzed the relationships between the diagnostic genes and clinicopathological characteristics of HCC patients and performed Cox regression analyses. The results showed that the lower expression of VIPR1 was associated with the poor differentiation of tumor grade classification and malignant progression of clinical stage (P < 0.05) (Figures 7(b) and 7(c)). Multivariate assays demonstrated that VIPR1 could be used as an independent prognostic factor for HCC (P < 0.05), while other diagnostic genes could not be used as independent prognostic factors for HCC (Figures 8(a)–8(e)). Finally, we identified VIPR1 as a prognostic feature biomarker gene for diagnosing HCC.
3.8. Evaluation of Immune Cell Infiltration
We used the CIBERSORT algorithm to calculate the proportions of immune cells in the data set of normal liver tissues and HCC tissues (Figure 9(a)). Then, the correlations between different immune cells were evaluated. The heat map showed that T cells CD8 was positively correlated with T cells CD4 memory activated (R = 0.35), T cells follicular helper (R = 0.33), and macrophages M1 (R = 0.30); mast cells activated was positively correlated with neutrophils (R = 0.33); and monocytes were positively related to dendritic cells activated and NK cells resting (R = 0.33). The heat map also showed that T cells CD4 memory resting was negatively correlated with T cells CD8 (R = −0.52) and T cells follicular helper(R = −0.41); mast cells activated was negatively related to mast cells resting (R = −0.42); and macrophages M0 was negatively correlated with macrophages M1(R = −0.41) (Figure 9(b)). In addition, the results of the CIBERSORT algorithm showed that the proportions of T cell regulatory(Tregs) and macrophages M0 in HCC tissues were significantly higher than that in normal tissues (P < 0.05), while the proportions of T cell gamma delta and macrophages M1 in HCC tissues were significantly lower than that in normal tissues (P < 0.05) (Figure 9(c)).
3.9. Correlation between VIPR1 and Immune-Infiltrating Cells
As shown, VIPR1 was positively correlated with T cell gamma delta (R = 0.33), T cell CD4 memory resting (R = 0.25), macrophages M2 (R = 0.2), and monocytes (R = 0.2) (P < 0.05). VIPR1 was negatively correlated with macrophages M0 (R = −0.41), NK cells activated (R = −0.29), T cell regulatory (Tregs) (R = −0.25), and T cells follicular helper (R = −0.23) (P < 0.05) (Figures 10 and 11).
4. Discussion
Despite the significant leaps forward that have been achieved in both the diagnosis and treatment of HCC, there are still a significant number of patients who are diagnosed with the disease at a more advanced stage, which results in a low rate of patient survival. AFP is the most commonly used biomarker for monitoring liver cancer, but it is still not ideal for early diagnosis of liver cancer due to its lack of sensitivity and specificity. Therefore, it is of great significance to find and study specific biomarkers for early diagnosis of HCC. In this study, we identified VIPR1 as a diagnostic feature biomarker for HCC based on a combination of the ML algorithms and traditional bioinformatics. In addition, we compared the immune cell infiltration seen in HCC tissues to that shown in normal liver tissues, and we also investigated the relationship between VIPR1 and immune cell infiltration.
We downloaded multiple HCC microarray datasets from the GEO database. 232 DEGs were identified in 125 normal liver tissues and 326 HCC tissues. The DEGs were mainly enriched in small molecule catabolic process, amino acid metabolic process, and other biological processes. Small molecules are natural compounds with relatively small molecular weight, usually referring to biological molecules with relative molecular weight less than 1000 Dalton (especially less than 400 Dalton), which can participate in many biological processes including metabolic reactions. Studying the small molecules in metabolic pathway will help people design drugs for human diseases more effectively [33]. In addition, small molecular metabolites are sensitive to endogenous and exogenous changes in the body and have great potential and value in identifying the state and phenotype of liver cancer cells [34, 35].
In cancer, malignant cells usually exhibit greater proliferative capacity and metabolism than nonmalignant cells. Due to the increased demand for growth and metabolism, an adequate supply of amino acids is necessary for cancer cells to maintain their ability to proliferate. In addition, this rapid growth and metabolism may also exhibit a vulnerability specific to cancer, which is an increase in the demand for amino acids [36, 37]. KEGG assays revealed that DEGs were mainly enriched in retinol metabolism, cytochrome P450, chemical carcinogenesis-DNA adducts, various amino acids, and other biological metabolic activities and other tumor-related pathways. DO assays suggested that DEGs were involved in various cancers and related diseases such as hepatitis, nonsmall cell lung carcinoma, liver cirrhosis, biliary tract cancer, cholangiocarcinoma, and esophageal cancer. These results suggest that DESs are activated in many cancer-related pathways.
The retinoid metabolites are involved in a wide range of biological processes, such as cell differentiation, apoptosis, and inflammatory reaction [38]. In the human body, more than 70% of retinol metabolites are stored in the liver, so changes in their content may be involved in the occurrence and development of liver cancer [39]. Cytochrome P450 (CYP450) refers to an enzyme that is abundant in the smooth endoplasmic reticulum of hepatocytes and small intestinal epithelial cells. It is involved in the synthesis of various hormones and affects hormone-related cancer, which plays an important role in the metabolism of many anticancer drugs [40]. Long-term exposure to chemical carcinogens has been linked in certain studies to the formation of DNA adducts, and experts believe that a higher concentration of these adducts may raise the risk of tumorigenesis [41].
GSEA enrichment results showed that cell cycle, DNA replication, proteasome, ribosome, and spliceosome were highly active in hepatocellular carcinoma, which may be closely related to the occurrence and development of hepatocellular carcinoma. In order to improve the diagnostic and clinical availability of HCC diagnostic markers, we used the LASSO regression algorithm to minimize regression coefficients to reduce overfitting and the SVM-RFE algorithm to generate the minimal classification error, and the two ML algorithms took the intersection to select the optimal feature genes. Then, the feature genes were analyzed for the correlations with survival prognosis and clinicopathological characteristics for further screening. Finally, VIPR1 was identified as a diagnostic feature marker of HCC.
VIPR1 is a G protein-coupled receptor that is primarily found in normal tissues and plays a vital role in a variety of physiological tasks, including the metabolism of glycogen and the regulation of the immune system [42]. According to the findings of earlier research, VIPR1 has been found to have a variety of expressions and functions, depending on the specific type of malignant tumor. For example, VIPR1 was highly expressed in breast, gastric, and colon cancers, while it was significantly low expressed in lung, liver, and other cancers [42–45]. In breast cancer, VIP or VPAC1 receptor antagonists can enhance the killing ability of chemotherapy on breast cancer cells [46]. Functionally, elevated VIPR1 expression in gastric cancer promotes the malignant progression of gastric cancer by increasing the potential of gastric cancer cells to metastasis to distant regions. Ca2+ signaling is required for the carcinogenesis and progression of gastric cancer, and activation of VIPR1 by VIP can stimulate TRPV4-mediated Ca2+ entry [43]. It has been shown that the overexpression of VIPR1 in colon cancer may be related to the activation of EGFR, which can lead to poor differentiation of colon cancer, thereby promoting cancer progression. In addition, the overexpression of VIPR1 in tumor vessels and macrophages may play an important role in cancer invasion [44]. However, the expression of VIPR1 is lower in lung cancer tissues than in adjacent tissues, and overexpression of VIPR1 in lung cancer cells can inhibit cell proliferation, invasion, and migration [45, 47]. In HCC, VIPR1 mRNA expression is negatively correlated with DNA methylation, and the transcriptional silencing of VIPR1 caused by DNA methylation may contribute to the development of HCC [42]. The above studies have shown that VIPR1 plays different roles in different cancers, which is involved in the proliferation, invasion, migration, and differentiation of cancer cells.
In recent years, it has been abundantly obvious that the infiltration of immune cells plays a key role in both the beginning and the progression of malignancies. This information was uncovered as a direct consequence of recent research. Therefore, investigating the components of immune infiltration cells in HCC and locating diagnostic biomarkers for the diagnosis of HCC could potentially impact the clinical results of HCC patients. We used a bioinformatics technique known as CIBERSOTR to determine the relative amounts of immune cell infiltration in normal liver tissues and HCC tissues so that we could further investigate the function that immune cell infiltration plays in the development of HCC. We found that the proportions of T cell regulatory (Tregs) and macrophages M0 in HCC tissues were significantly higher than that in normal tissues.
In addition, the expression of VIPR1 in HCC was positively correlated with the levels of T cell gamma delta (0.33), T cell CD4 memory resting (0.25), macrophages M2 (0.2), and monocytes (0.2), while the expression of VIPR1 was negatively correlated with macrophages M0 (0.41), NK cells activated (0.29), T cell regulatory (Tregs), and T cell follicular helper (0.23). According to the previous studies, macrophages are central players in liver fibrosis and play a bidirectional role in the regulation of matrix deposition and catabolism [48]. Monocytes can influence the tumor microenvironment through mechanisms such as induction of immune tolerance, angiogenesis, and increased tumor cell dissemination [49]. In addition, peritumoral monocytes can induce autophagy of tumor cells and promote the occurrence and development of liver cancer [50]. In hepatocellular carcinoma (HCC), T cell gamma delta shows potent antitumor efficacy and has played an important role in tumor monitoring and antitumor immunity [51]. Tregs have the ability to produce an immunosuppressive tumor environment by releasing a variety of inhibitory cytokines. Additionally, Tregs have the potential to lead to immunological dysfunction in HCC through a number of different mechanisms [52]. The above research evidence and our findings suggested that various types of immune cell infiltrations played an important role in the pathogenesis of HCC.
5. Conclusions
In summary, VIPR1 can be used as a diagnostic feature marker of HCC, which is distinctly related to the occurrences, developments, and immune cell infiltration of HCC. It can also be used as an independent prognostic factor of HCC and may become a potential target for the early diagnosis and treatment of HCC in the future.
Acknowledgments
This study was supported by Natural Science Foundation of Jiangsu Province (No. BK20161176) and Six Talent Peaks Project in Jiangsu Province (No.: WSW-073).
Data Availability
The data sets used in this study can be obtained from TCGA and GEO public databases.
Conflicts of Interest
The authors declare that there are no conflicts of interest regarding the publication of this paper.
Authors' Contributions
Song Ge designed and analyzed the research study and prepared the original draft. Chenrui Xu, Yanming Li, and Yulin Zhang contributed to conception and design. Na Li, Feitong Wang, and Liang Ding were responsible for study supervision. Jian Niu provided financial support and revised the paper.
References
- 1.Bray F., Ferlay J., Soerjomataram I., Siegel R. L., Torre L. A., Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: A Cancer Journal for Clinicians . 2018;68(6):394–424. doi: 10.3322/caac.21492. [DOI] [PubMed] [Google Scholar]
- 2.Muppala S. Significance of the tumor microenvironment in liver cancer progression. Critical Reviews in Oncogenesis . 2020;25(1):1–9. doi: 10.1615/critrevoncog.2020034987. [DOI] [PubMed] [Google Scholar]
- 3.Kew M. C. Hepatocellular carcinoma: epidemiology and risk factors. Journal of Hepatocellular Carcinoma . 2014;1:115–125. doi: 10.2147/jhc.s44381. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Yang J. D., Heimbach J. K. New advances in the diagnosis and management of hepatocellular carcinoma. Bmj . 2020;371:p. m3544. doi: 10.1136/bmj.m3544. [DOI] [PubMed] [Google Scholar]
- 5.Demir T., Lee S. S., Kaseb A. O. Mechanisms and Therapy of Liver Cancer 2021 . United state: Pubmed; 2021. Systemic therapy of liver cancer; pp. 257–294. [DOI] [PubMed] [Google Scholar]
- 6.Anwanwan D., Singh S. K., Singh S., Saikam V., Singh R. Challenges in Liver Cancer and Possible Treatment Approaches. Biochimica Et Biophysica Acta-Reviews on Cancer . 2020;(1):p. 1873. doi: 10.1016/j.bbcan.2019.188314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Hartke J., Johnson M., Ghabril M. The diagnosis and treatment of hepatocellular carcinoma. Seminars in Diagnostic Pathology . 2017;34(2):153–159. doi: 10.1053/j.semdp.2016.12.011. [DOI] [PubMed] [Google Scholar]
- 8.Wang W., Wei C. Advances in the early diagnosis of hepatocellular carcinoma. Genes & Diseases . 2020;7(3):308–319. doi: 10.1016/j.gendis.2020.01.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Pinero F., Dirchwolf M., Pessoa M. G. Biomarkers in Hepatocellular Carcinoma: Diagnosis, Prognosis and Treatment Response Assessment. Cells . 2020;9(1):p. 1370. doi: 10.3390/cells9061370. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Bednár M. DNA microarray technology and application. Medical Science Monitor: International Medical Journal of Experimental and Clinical Research . 2000;6(4):796–800. [PubMed] [Google Scholar]
- 11.Marzancola M. G., Sedighi A., Li P. C. H. DNA microarray-based diagnostics. Methods in Molecular Biology . 2016;1368:161–178. doi: 10.1007/978-1-4939-3136-1_12. [DOI] [PubMed] [Google Scholar]
- 12.Li T., Fan J., Wang B., et al. TIMER: a web server for comprehensive analysis of tumor-infiltrating immune cells. Cancer Research . 2017;77(21):e108–e110. doi: 10.1158/0008-5472.can-17-0307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Chen M., Hofestädt R., Taubert J. Integrative bioinformatics: history and future. Journal of Integrative Bioinformatics . 2019;16(3) doi: 10.1515/jib-2019-2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Auslander N., Gussow A. B., Koonin E. V. Incorporating machine learning into established bioinformatics frameworks. International Journal of Molecular Sciences . 2021;22(6):p. 2903. doi: 10.3390/ijms22062903. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Arjmand B., Hamidpour S. K., Tayanloo-Beik A., et al. Machine learning: a new prospect in multi-omics data analysis of cancer. Frontiers in Genetics . 2022;13:p. 824451. doi: 10.3389/fgene.2022.824451. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Goodswen S. J., Barratt J. L. N., Kennedy P. J., Kaufer A., Calarco L., Ellis J. T. Machine learning and applications in microbiology. Fems Microbiology Reviews . 2021;45(5):p. fuab015. doi: 10.1093/femsre/fuab015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Bastanlar Y., Ozuysal M. Introduction to machine learning. Methods in Molecular Biology (Clifton, N.J.) . 2014;1107:105–128. doi: 10.1007/978-1-62703-748-8_7. [DOI] [PubMed] [Google Scholar]
- 18.Greener J. G., Kandathil S. M., Moffat L., Jones D. T. A guide to machine learning for biologists. Nature Reviews Molecular Cell Biology . 2022;23(1):40–55. doi: 10.1038/s41580-021-00407-0. [DOI] [PubMed] [Google Scholar]
- 19.Zhao S., Zhang Y., Lu X., et al. CDC20 regulates the cell proliferation and radiosensitivity of P53 mutant HCC cells through the Bcl-2/Bax pathway. International Journal of Biological Sciences . 2021;17(13):3608–3621. doi: 10.7150/ijbs.64003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Kanehisa M., Sato Y., Kawashima M., Furumichi M., Tanabe M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Research . 2016;44(D1):D457–D462. doi: 10.1093/nar/gkv1070. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Kanehisa M., Sato Y. KEGG Mapper for inferring cellular functions from protein sequences. Protein Science . 2020;29(1):28–35. doi: 10.1002/pro.3711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Bello S. M., Shimoyama M., Mitraka E., et al. Augmenting the disease ontology improves and unifies disease annotations across species. Disease Models & Mechanisms . 2018;11(3):p. dmm032839. doi: 10.1242/dmm.032839. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Schriml L. M., Arze C., Nadendla S., et al. Disease Ontology: a backbone for disease semantic integration. Nucleic Acids Research . 2012;40(D1):D940–D946. doi: 10.1093/nar/gkr972. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Schriml L. M., Mitraka E., Munro J., et al. Human Disease Ontology 2018 update: classification, content and workflow expansion. Nucleic Acids Research . 2019;47(D1):D955–D962. doi: 10.1093/nar/gky1032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.McEligot A. J., Poynor V., Sharma R., Panangadan A. Logistic LASSO Regression for Dietary Intakes and Breast Cancer. Nutrients . 2020;12 doi: 10.3390/nu12092652. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Friedman J., Hastie T., Tibshirani R. Regularization paths for generalized linear models via coordinate descent. Journal of Statistical Software . 2010;33(1):1–22. doi: 10.18637/jss.v033.i01. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Li Z., Xie W., Liu T. Efficient feature selection and classification for microarray data. Plos One . 2018;13(8):p. e0202167. doi: 10.1371/journal.pone.0202167. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Lin X., Li C., Zhang Y., Su B., Fan M., Wei H. Selecting feature subsets based on SVM-RFE and the overlapping ratio with applications in bioinformatics. Molecules . 2017;23(1):p. 52. doi: 10.3390/molecules23010052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Huang M.-L., Hung Y.-H., Lee W. M., Li R. K., Jiang B.-R. SVM-RFE Based Feature Selection and Taguchi Parameters Optimization for Multiclass SVM Classifier. Scientific World Journal . 2014;2014 doi: 10.1155/2014/795624.795624 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Newman A. M., Liu C. L., Green M. R., et al. Robust enumeration of cell subsets from tissue expression profiles. Nature Methods . 2015;12(5):453–457. doi: 10.1038/nmeth.3337. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Chen B., Khodadoust M. S., Liu C. L., Newman A. M., Alizadeh A. A. Profiling tumor infiltrating immune cells with CIBERSORT. Methods in Molecular Biology . 2018;1711:243–259. doi: 10.1007/978-1-4939-7493-1_12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Li X., Huang Y., Ye N., He J. Analysis of immune-related genes in idiopathic pulmonary fibrosis based on bioinformatics and experimental verification. Annals of Palliative Medicine . 2021;10(11):11598–11614. doi: 10.21037/apm-21-2676. [DOI] [PubMed] [Google Scholar]
- 33.Lu J., Niu B., Liu L., Lu W.-C., Cai Y.-D. Prediction of small molecules metabolic pathways based on functional group composition. Protein & Peptide Letters . 2009;16(8):969–976. doi: 10.2174/092986609788923374. [DOI] [PubMed] [Google Scholar]
- 34.Nicholson J. K., Holmes E., Kinross J. M., Darzi A. W., Takats Z., Lindon J. C. Metabolic phenotyping in clinical and surgical environments. Nature . 2012;491(7424):384–392. doi: 10.1038/nature11708. [DOI] [PubMed] [Google Scholar]
- 35.Wang X., Zhang A., Sun H. Power of metabolomics in diagnosis and biomarker discovery of hepatocellular carcinoma. Hepatology . 2013;57(5):2072–2077. doi: 10.1002/hep.26130. [DOI] [PubMed] [Google Scholar]
- 36.Endicott M., Jones M., Hull J. Amino acid metabolism as a therapeutic target in cancer: a review. Amino Acids . 2021;53(8):1169–1179. doi: 10.1007/s00726-021-03052-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Vettore L., Westbrook R. L., Tennant D. A. New aspects of amino acid metabolism in cancer. British Journal of Cancer . 2020;122(2):150–156. doi: 10.1038/s41416-019-0620-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Shirakami Y., Sakai H., Shimizu M. Retinoid roles in blocking hepatocellular carcinoma. Hepatobiliary Surgery and Nutrition . 2015;4:222–228. doi: 10.3978/j.issn.2304-3881.2015.05.01. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.O’Byrne S. M., Blaner W. S. Retinol and retinyl esters: biochemistry and physiology. Journal of Lipid Research . 2013;54(7):1731–1743. doi: 10.1194/jlr.r037648. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Mittal B., Tulsyan S., Kumar S., Mittal R. D., Agarwal G. Cytochrome P450 in cancer susceptibility and treatment. Advances in Clinical Chemistry . 2015;71:77–139. doi: 10.1016/bs.acc.2015.06.003. [DOI] [PubMed] [Google Scholar]
- 41.Poirier M. C. Linking DNA adduct formation and human cancer risk in chemical carcinogenesis. Environmental and Molecular Mutagenesis . 2016;57(7):499–507. doi: 10.1002/em.22030. [DOI] [PubMed] [Google Scholar]
- 42.Lu S., Lu H., Jin R., Mo Z. Promoter methylation and H3K27 deacetylation regulate the transcription of VIPR1 in hepatocellular carcinoma. Biochemical and Biophysical Research Communications . 2019;509(1):301–305. doi: 10.1016/j.bbrc.2018.12.129. [DOI] [PubMed] [Google Scholar]
- 43.Tang B., Wu J., Zhu M. X., et al. VPAC1 couples with TRPV4 channel to promote calcium-dependent gastric cancer progression via a novel autocrine mechanism. Oncogene . 2019;38(20):3946–3961. doi: 10.1038/s41388-019-0709-6. [DOI] [PubMed] [Google Scholar]
- 44.Liu S., Zeng Y., Li Y., Guo W., Liu J., Ouyang N. VPAC1 overexpression is associated with poor differentiation in colon cancer. Tumor Biology . 2014;35(7):6397–6404. doi: 10.1007/s13277-014-1852-x. [DOI] [PubMed] [Google Scholar]
- 45.Zhao L., Yu Z., Zhao B. Mechanism of VIPR1 gene regulating human lung adenocarcinoma H1299 cells. Medical Oncology . 2019;36(11):p. 91. doi: 10.1007/s12032-019-1312-y. [DOI] [PubMed] [Google Scholar]
- 46.Moody T. W., Leyton J., Chan D., et al. VIP receptor antagonists and chemotherapeutic drugs inhibit the growth of breast cancer cells. Breast Cancer Research and Treatment . 2001;68(1):55–64. doi: 10.1023/a:1017994722130. [DOI] [PubMed] [Google Scholar]
- 47.Mlakar V., Stražišar M., Sok M., Glavač D. Oligonucleotide DNA microarray profiling of lung adenocarcinoma revealed significant downregulation and deletions of vasoactive intestinal peptide receptor 1. Cancer Investigation . 2010;28(5):487–494. doi: 10.3109/07357900903476752. [DOI] [PubMed] [Google Scholar]
- 48.Ma P.-F., Gao C.-C., Yi J., et al. Cytotherapy with M1-polarized macrophages ameliorates liver fibrosis by modulating immune microenvironment in mice. Journal of Hepatology . 2017;67(4):770–779. doi: 10.1016/j.jhep.2017.05.022. [DOI] [PubMed] [Google Scholar]
- 49.Ugel S., Cane S., De Sanctis F., Bronte V. Monocytes in the tumor microenvironment. Annual Review of Pathology: Mechanisms of Disease . 2021;16:93–122. doi: 10.1146/annurev-pathmechdis-012418-013058. [DOI] [PubMed] [Google Scholar]
- 50.Chen D.-P., Ning W.-R., Li X.-F., et al. Peritumoral monocytes induce cancer cell autophagy to facilitate the progression of human hepatocellular carcinoma. Autophagy . 2018;14(8):1335–1346. doi: 10.1080/15548627.2018.1474994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Makkouk A., Yang X. C., Barca T., et al. Off-the-shelf Vδ1 gamma delta T cells engineered with glypican-3 (GPC-3)-specific chimeric antigen receptor (CAR) and soluble IL-15 display robust antitumor efficacy against hepatocellular carcinoma. Journal for Immunotherapy of Cancer . 2021;9(12) doi: 10.1136/jitc-2021-003441.e003441 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Langhans B., Nischalke H. D., Kraemer B., et al. THU-492-Role of regulatory T cells and checkpoint inhibition in hepatocellular carcinoma. Journal of Hepatology . 2019;70(1):p. E377. doi: 10.1016/s0618-8278(19)30739-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The data sets used in this study can be obtained from TCGA and GEO public databases.