Skip to main content
Heliyon logoLink to Heliyon
. 2023 Nov 19;9(11):e22458. doi: 10.1016/j.heliyon.2023.e22458

Prediction model for hepatocellular carcinoma recurrence after hepatectomy: Machine learning-based development and interpretation study

Rongqiang Liu a,b,1, Shinan Wu a,c,1, Hao yuan Yu a,1, Kaining Zeng a,1, Zhixing Liang a, Siqi Li a, Yongwei Hu a, Yang Yang a,∗∗, Linsen Ye a,
PMCID: PMC10687050  PMID: 38034691

Abstract

Background

Identifying patients with hepatocellular carcinoma (HCC) at high risk of recurrence after hepatectomy can help to implement timely interventional treatment. This study aimed to develop a machine learning (ML) model to predict the recurrence risk of HCC patients after hepatectomy.

Methods

We retrospectively collected 315 HCC patients who underwent radical hepatectomy at the Third Affiliated Hospital of Sun Yat-sen University from April 2013 to October 2017, and randomly divided them into the training and validation sets at a ratio of 7:3. According to the postoperative recurrence of HCC patients, the patients were divided into recurrence group and non-recurrence group, and univariate and multivariate logistic regression were performed for the two groups. We applied six machine learning algorithms to construct the prediction models and performed internal validation by 10-fold cross-validation. Shapley additive explanations (SHAP) method was applied to interpret the machine learning model. We also built a web calculator based on the best machine learning model to personalize the assessment of the recurrence risk of HCC patients after hepatectomy.

Results

A total of 13 variables were included in the machine learning models. The multilayer perceptron (MLP) machine learning model was proved to achieve optimal predictive value in test set (AUC = 0.680). The SHAP method displayed that γ-glutamyl transpeptidase (GGT), fibrinogen, neutrophil, aspartate aminotransferase (AST) and total bilirubin (TB) were the top 5 important factors for recurrence risk of HCC patients after hepatectomy. In addition, we further demonstrated the reliability of the model by analyzing two patients. Finally, we successfully constructed an online web prediction calculator based on the MLP machine learning model.

Conclusion

MLP was an optimal machine learning model for predicting the recurrence risk of HCC patients after hepatectomy. This predictive model can help identify HCC patients at high recurrence risk after hepatectomy to provide early and personalized treatment.

Keywords: Hepatocellular carcinoma, Machine learning, Multilayer perceptron, Shapley additive explanations, Risk factor

1. Introduction

Hepatocellular carcinoma is the most common tumor and the second most common cause of cancer-related death worldwide [1]. The major risk factors for liver cancer include hepatitis B virus, hepatitis C virus, and alcoholic and nonalcoholic liver disease [2]. The annual death rate of liver cancer in China accounts for more than 50 % of the global death rate of liver cancer each year [3]. At present, surgical resection is still the main method for liver cancer patients in clinical practice. Due to the high malignancy and aggressive nature of HCC, patients still have a high recurrence rate even after radical hepatectomy [4]. Research has demonstrated that the majority of patients with hepatocellular carcinoma (HCC) who undergo hepatectomy can experience recurrence within one year, with a median survival time from the point of recurrence to death being 21 months [5]. Effective evaluation of postoperative recurrence of HCC patients can timely help doctors to implement individualized treatment for patients and improve postoperative survival time.

Machine learning (ML) is a mathematical method that generalizes and analyzes data to achieve artificial intelligence. It has been widely used in many different sciences. Research has shown that ML is more accurate than traditional statistical methods, and it can detect interactions among variables [6]. In the medical field, ML is widely used to build models to increase the understanding of the diagnosis or prognosis of diseases [7,8]. Currently, ML has been widely used in a variety of tumors due to its good predictive performance. For example, Liu et al. used ML to successfully develop a network predictor to predict the risk of bone metastases in patients with prostate cancer [9]. Li et al. successfully constructed a model to assess the risk of lung metastases in patients with Ewing's sarcoma [10]. In a novel prognostic prediction model constructed by Hasnain et al. they found that the sensitivity and specificity of predicting tumor recurrence after bladder cancer resection exceeded 70 % [11]. Liang et al. constructed the support vector machine model to predict patient prognosis by analyzing 83 patients with liver cancer after radiofrequency ablation [12]. We recently developed a risk stratification model for the prognosis of acute ischemic stroke using machine learning methods and achieved excellent discrimination, diagnosis, and prediction performance [13]. However, studies about the application of ML to predict the recurrence risk of HCC patients after hepatectomy were still limited [14,15]. In the past, there have been prediction models for post-hepatectomy recurrence in liver cancer using CT-based radiomics deep learning techniques [16]. However, research on predicting its recurrence using various serum markers and tumor-related indicators and providing visual explanations for black-box models like machine learning is still lacking [17]. ML has not been fully explored in predicting the recurrence risk of HCC patients after hepatectomy.

In the study, we identified potential risk factors for the recurrence risk of liver cancer patients after hepatic resection by using preoperative indicators, including general demographic data, serological indicators, and tumor pathological characteristics, and constructed prediction models based on a machine learning algorithm. By comparing different models, we selected the optimal ML model. Finally, a web calculator was developed to achieve convenient and personalized prediction for liver cancer patients.

2. Methods

2.1. Subjects

We recruited a total of 315 patients undergoing radical hepatectomy at the Third Affiliated Hospital of Sun Yat-sen University from April 2013 to October 2017. All patients were randomly divided into a training set and a validation set at a ratio of 7:3. To mitigate the potential impact of imbalanced data on model training, we first applied the Synthetic Minority Over-sampling Technique (SMOTE) method to oversample the training set after splitting [18]. Additionally, we removed the samples that contained missing value.; As a result, the training set included 212 cases, and the testing set included 59 cases. The inclusion criteria were: (1) histopathologically confirmed HCC after hepatectomy, (2) no other tumors or extrahepatic metastases, (3) no preoperative anticancer treatment. (4) no other underlying diseases and chronic diseases. All clinicopathological data were retrieved electronically from the hospital or handwritten medical records, including patient factors, laboratory parameters, and histological features of the tumor. During the follow-up period, the patients were divided into recurrence and non-recurrence groups. All patients signed the informed consent form. The Ethics Committee of the Third Affiliated Hospital of Sun Yat-sen University approved the study.

2.2. Selection of machine learning feature variable

This study collected clinical variables at the baseline data collection for liver cancer hepatectomy patients. These included gender, tumor size, number of tumors, presence of vascular invasion by the tumor, BCLC staging (Barcelona Clinic Liver Cancer staging classification), AFP (alpha-fetoprotein), HBV DNA, tumor differentiation degree, neutrophils (N), macrophages (M), fibrinogen, age, platelets (Plt), lymphocytes (L), albumin, the ratio of neutrophils to lymphocytes (NLR), aspartate aminotransferase (AST), total bilirubin (TB), γ-glutamyl transpeptidase (GGT), alanine aminotransferase (ALT), and Total cholesterol. The criteria for selecting feature variables for subsequent machine learning algorithms were: feature importance ranking based on the random forest machine learning algorithm and Sequential Forward Selection [19,20]. On this basis, all the aforementioned feature variables were included for feature importance ranking and screening. Subsequently, these variables were inputted into a hierarchical clustering algorithm to remove feature variables with multicollinearity. Then, we re-ordered the pre-selected variables. Finally, the optimal machine learning feature variables were determined based on when the area under the curve (AUC) of the receiver operating characteristic reached a stable value. We chose the top 13 feature variables for subsequent machine learning algorithm development, as there was no observed significant improvement in the AUC after the 14th iteration (Fig. 1A). The feature variables included in the subsequent machine learning model construction were GGT, N, Fibrinogen, Albumin, TB, M, AST, Plt, ALT, Total Cholesterol, L, NLR, and Age.

Fig. 1.

Fig. 1

Machine learning feature variable extraction and internal ten-fold cross-validation were conducted, resulting in ROC curves for both the training and testing sets.

Notes:Fig. 1A displays feature importance ranking and forward sequential feature selection based on random forests, with features selected highlighted in red font. Fig. 1B shows the results of internal ten-fold cross-validation on the training set, with the best machine learning model being MLP, achieving an AUC of 0.723 ± 0.068. Fig. 1C and D respectively depict the ROC curve results for the training and testing sets under different machine learning algorithms. Notably, the MLP machine learning model achieves an AUC value of 1 in the training set and 0.680 in the testing set.

Abbreviation: ML, machine learning; AUC, the area under the curve; AB, adaptive boosting; MLP, multilayer perceptron; BAG, bootstrapped aggregating; LR, logistic regression; GBM, gradient boosting machine; XGB, extreme gradient boost.

2.3. Model building

Univariate analysis was performed to screen recurrence risk factors in HCC patients after hepatectomy. In this study, we used six different machine learning algorithms including multilayer perceptron (MLP) [21], adaptive boosting (AB), bootstrapped aggregating (BAG) [22], logistic regression (LR) [23], gradient boosting machine (GBM) [22], and extreme gradient boost (XGB) [24]. A 10-fold cross-validation was performed to analyze the prediction performance of different ML models in the training set. The receiver operating characteristic curve (ROC) of the six ML models was plotted using the validation cohort. The model with the best prediction performance was selected as the final model. The prediction performance of the final model was evaluated using the confusion matrix. Finally, an online risk calculator was constructed based on the final model.

2.4. Model interpretation

Shapley additive explanations (SHAP) is a method for interpreting the results of predictive models based on cooperative game theory [25]. This method can quantify the SHAP value of each characteristic variable, representing the contribution of different factors to postoperative recurrence in HCC patients. The SHAP method demonstrates each factor's positive or negative effect on the predicted outcome. When the SHAP value is positive, it indicates that the corresponding feature leads to a higher probability of recurrence risk. When the SHAP value is negative, it indicates that the corresponding feature leads to a lower recurrence risk. We used the SHAP method to establish the importance ranking of recurrence risk factors in HCC patients undergoing hepatectomy and made individual interpretations of the model.

2.5. Statistical analysis

The data were analyzed by Python (version 3.8, Python Software Foundation) and R software (version 4.0.2). Models were constructed using the training set, and internal test sets were used for model validation and evaluation. The ratio of the training set to the internal test set is 7:3. Continuous numerical variables were assessed for normality using the Kolmogorov-Smirnov test [26]. Data that follows a normal distribution is represented as mean ± standard deviation (mean ± SD), while non-normally distributed data is represented using the median (Q1, Q3) quartiles. Categorical variables are represented using frequency distribution. Two independent samples t-tests were used for continuous variables. For categorical variables, the Chi-square test was used. Logistic regression was used to analyze the recurrence risk factors in patients with liver cancer after hepatectomy. Python SHAP package was used to perform the SHAP. Python programming language (version 3.8) was applied to build and evaluate ML models and design the online risk calculator. P-Values<0.05 were considered statistically significant with 95 % confidence intervals (CIs) applied for all logistic regression analyses.

3. Result

3.1. Clinical characteristics

315 patients with HCC after hepatectomy were enrolled. 178 patients experienced recurrence and 137 patients did not experience recurrence during the follow-up time. There were significant differences in tumor size, tumor number, BCLC stage, HBV DNA, macrophages, alanine aminotransferase (ALT), aspartate aminotransferase (AST), γ-glutamyl transpeptidase (GGT) and fibrinogen between the recurrence and non-recurrence groups (P < 0.05). The demographic and clinicopathological characteristics of the patients were detailed in Table 1.

Table 1.

Demographic and clinical characteristics.

Variables Total (n = 315) No recurrence (n = 137) Recurrence (n = 178) P value
Gender, n (%) 0.127
male 288 (91) 121 (88) 167 (94)
female 27 (9) 16 (12) 11 (6)
Tumor size n (%) <0.001
≤3 cm 85 (27) 52 (38) 33 (19)
>3 cm 230 (73) 85 (62) 145 (81)
Tumor number, n (%) <0.001
1 229 (73) 115 (84) 114 (64)
>1 86 (27) 22 (16) 64 (36)
Vascular invasion, n (%) 0.133
No 198 (63) 93 (68) 105 (59)
Yes 117 (37) 44 (32) 73 (41)
BCLC, n (%) <0.001
0 55 (17) 38 (28) 17 (10)
1 115 (37) 48 (35) 67 (38)
2 34 (11) 10 (7) 24 (13)
3 111 (35) 41 (30) 70 (39)
AFP, n (%) 0.195
≤400 219 (70) 101 (74) 118 (66)
>400 96 (30) 36 (26) 60 (34)
HBV DNA, n (%) 0.005
≤1000 97 (31) 54 (39) 43 (24)
>1000 218 (69) 83 (61) 135 (76)
Differentiation, n (%) 0.907
poor 39 (12) 18 (13) 21 (12)
moderate 236 (75) 101 (74) 135 (76)
well 40 (13) 18 (13) 22 (12)
Age, Mean ± SD 50.09 ± 11.45 50.35 ± 11.72 49.88 ± 11.27 0.721
Plt, Median (Q1,Q3) 171 (128, 216.5) 172 (123, 209) 169.5 (130, 222.5) 0.711
N, Median (Q1,Q3) 3.25 (2.46, 4.28) 3.12 (2.41, 3.98) 3.37 (2.49, 4.69) 0.087
L, Median (Q1,Q3) 1.62 (1.25, 2.08) 1.64 (1.3, 2.03) 1.61 (1.22, 2.1) 0.662
M, Median (Q1,Q3) 0.42 (0.32, 0.56) 0.38 (0.3, 0.52) 0.44 (0.33, 0.6) 0.011
NLR, Median (Q1,Q3) 1.94 (1.54, 2.69) 1.91 (1.52, 2.55) 1.98 (1.58, 3.01) 0.205
TB, Median (Q1,Q3) 14.1 (10.45, 18.1) 14.2 (10.6, 18.7) 14.05 (10.4, 17.95) 0.766
Albumin, Mean ± SD 40.09 ± 4.24 40.47 ± 4.48 39.79 ± 4.04 0.16
ALT, Median (Q1,Q3) 38 (28, 52) 36 (24, 48) 40 (30, 56.75) 0.042
AST, Median (Q1,Q3) 38 (29, 50) 34 (27.5, 47.5) 40 (30, 50.75) 0.041
Total cholesterol, Median (Q1,Q3) 4.45 (3.76, 5.26) 4.44 (3.72, 5.18) 4.49 (3.8, 5.32) 0.379
GGT, Median (Q1,Q3) 61 (38, 107) 57 (32, 85.5) 65 (42.25, 118.5) 0.004
Fibrinogen, Median (Q1,Q3) 2.98 (2.47, 3.65) 2.88 (2.39, 3.37) 3.12 (2.54, 3.89) 0.031

Abbreviation: Plt: platelet; N: neutrophils; L: lymphocyte; M: macrophages; NLR: neutrophils to lymphocytes ratio; TB: total bilirubin; ALT: alanine aminotransferase; AST: aspartate aminotransferase; GGT: γ-glutamyl transpeptidase; BCLC: Barcelona Clinic Liver Cancer staging classification.

3.2. Univariate and multivariate logistic regression

Variables with P value < 0.1 in univariate analysis were screened for multivariate logistic regression analysis to determine the risk factors for postoperative recurrence in HCC patients after hepatectomy (Table 2) [27,28]. In univariate analysis, gender, tumor size, tumor number, BCLC, HBV-DNA, neutrophil, macrophages, fibrinogen, and neutrophils to lymphocytes ratio (NLR) were risk factors for postoperative recurrence. Multivariate logistic regression showed that tumor number was an independent risk factor for HCC recurrence (P value < 0.05).

Table 2.

Univariate and multivariate logistic regression.

Characteristics Category Univariate analysis
Multivarite analysis
OR (95 % CI) P value OR (95 % CI) P value
Gender male Ref Ref Ref Ref
female 0.498 (0.223–1.111) 0.089 0.894 (0.3–2.66) 0.84
Tumor size ≤3 cm Ref Ref Ref Ref
>3 cm 2.688 (1.611–4.485) <0.001 1.417 (0.488–4.111) 0.521
Tumor number 1 Ref Ref Ref Ref
>1 2.935 (1.694–5.083) <0.001 4.723 (1.924–11.598) 0.001
Vascular invasion No Ref Ref Ref Ref
Yes 1.469 (0.921–2.343) 0.106 \ \
BCLC 0 Ref Ref Ref Ref
1 3.12 (1.578–6.168) 0.001 1.841 (0.497–6.822) 0.361
2 5.365 (2.109–13.644) <0.001 0.721 (0.124–4.18) 0.715
3 3.816 (1.915–7.606) <0.001 1.34 (0.335–5.367) 0.679
AFP ≤400 Ref Ref Ref Ref
>400 1.427 (0.873–2.332) 0.156 \ \
HBV DNA ≤1000 Ref Ref Ref Ref
>1000 2.043 (1.258–3.318) 0.004 1.747 (0.861–3.547) 0.123
Differentiation poor Ref Ref Ref Ref
moderate 1.146 (0.58–2.262) 0.695 \ \
well 1.048 (0.432–2.54) 0.918 \ \
N \ 1.178 (1.012–1.37) 0.034 0.931 (0.684–1.266) 0.647
M \ 4.84 (1.472–15.909) 0.009 2.367 (0.461–12.151) 0.302
Fibrinogen \ 1.34 (1.027–1.749) 0.031 1.178 (0.855–1.623) 0.317
Age \ 0.996 (0.977–1.016) 0.719 \ \
Plt \ 1.001 (0.998–1.004) 0.663 \ \
L \ 0.946 (0.674–1.327) 0.746 \ \
Albumin \ 0.962 (0.912–1.015) 0.155 \ \
NLR \ 1.185 (1–1.405) 0.05 1.219 (0.909–1.633) 0.185
AST \ 1.01 (0.998–1.023) 0.1 \ \
TB \ 0.994 (0.979–1.009) 0.403 \ \
GGT \ 1.003 (0.999–1.006) 0.118 \ \
ALT \ 1.004 (0.995–1.012) 0.375 \ \
Total cholesterol \ 1.098 (0.876–1.375) 0.417 \ \

Abbreviation: Plt: platelet; N: neutral granulocyte; L: lymphocyte; M: macrophages; NLR: neutrophils to lymphocyte ratio; TB: total bilirubin; ALT: alanine aminotransferase; AST: aspartate aminotransferase; GGT: γ-glutamyl transpeptidase; BCLC: Barcelona Clinic Liver Cancer staging classification.

3.3. Model performance

We evaluated the recurrence risk of HCC patients after hepatectomy by six different machine learning models. A 10-fold cross-validation was performed to evaluate the predictive performance of all models in both the training and internal validation sets. Fig. 1B showed the average AUC of the six models, with the MLP model achieving the best performance (AUC = 0.723 ± 0.068) (Fig. 1B). The AUC value of the MLP machine learning algorithm, which predicts the recurrence after hepatectomy in liver cancer patients in the training set, is 1, and its performance in the internal test set reached 0.680 (Fig. 1C). According to the Precision-Recall Curve results for the training and testing sets, the area under the curve (AUC) for the MLP machine learning model is 1 in the training set (Fig. 2A) and 0.762 in the testing set (Fig. 2B). The calibration curve results demonstrate that the predictive performance of the MLP machine learning model is closer to the actual real-world results compared to other machine learning models (Fig. 2C and D). The detailed results of the validation set are presented in Table 3. Furthermore, we plotted the confusion matrix of the model, In the training set after SMOTE processing, all predictions were correct (Fig. 3A). In the original distribution of the testing set, there were 37 correct predictions and 22 incorrect predictions, resulting in an accuracy of 0.627 (Fig. 3B). Using the best machine learning model, MLP, and performing internal five-fold cross-validation, the average AUC value is 0.66 with a standard deviation of 0.08 (Fig. 3C). For the above six ML models, we also plotted radar plots to evaluate the performance of different models. Compared with other ML models, MLP has the best value in the evaluation of various indicators including sensitivity, F1 score, AUC and accuracy (Fig. 3D).

Fig. 2.

Fig. 2

Precision-Recall Curves and Calibration Curves for six different machine learning algorithms on both the training and testing sets.

Notes:Fig. 2A displays the Precision-Recall (PR) curve for the training set, while Fig. 2B shows the PR curve for the testing set. MLP performs the best in both the training and testing sets, with an Area under the Curve (AUC) of 1 in the training set and 0.762 in the testing set. Fig. 2C and D represent the calibration curves for the training and testing sets, respectively. In these curves, the predictions from the MLP machine learning algorithm are closest to the actual values, indicating superior calibration performance.

Abbreviation: AB, adaptive boosting; MLP, multilayer perceptron; BAG, bootstrapped aggregating; LR, logistic regression; GBM, gradient boosting machine; XGB, extreme gradient boost.

Table 3.

Performance comparison of six ML models.

Model F1 AUC Accuracy Sensitivity Specificity
AB 0.519 0.483 0.559 0.625 0.421
LR 0.568 0.617 0.576 0.525 0.684
BAG 0.579 0.624 0.610 0.650 0.526
MLP 0.601 0.680 0.627 0.650 0.579
GBM 0.482 0.497 0.525 0.600 0.368
XGB 0.515 0.511 0.542 0.575 0.474

Abbreviation: ML, machine learning; AUC, area under the curve; AB, adaptive boosting; MLP, multilayer perceptron; BAG, bootstrapped aggregating; LR, logistic regression; GBM, gradient boosting machine; XGB, extreme gradient boost.

Fig. 3.

Fig. 3

The confusion matrices for the training and testing sets, the five-fold cross-validated ROC curve for the best machine learning model MLP, and a comparative radar chart of different machine learning methods under various evaluation metrics.

Notes:Fig. 3A and B displays the confusion matrices for the training and testing sets. In the testing set's confusion matrix, the accuracy is observed to be 0.63. Fig. 3C shows the five-fold cross-validated ROC curve results for the best machine learning model, MLP, with an AUC of 0.66 ± 0.08. Fig. 3D presents a radar chart visualization of different machine learning algorithms under various evaluation metrics. In this chart, the MLP machine learning algorithm exhibits the best performance in terms of F1-score, sensitivity, accuracy, and AUC value.

Abbreviation: MLP, multilayer perceptron; ROC, receiver operating characteristic; AUC, the area under the ROC curve.

3.4. Model interpretability

We used SHAP to establish the importance ranking of recurrence risk factors based on MLP (Fig. 4.). The variable importance ranking results show that GGT, fibrinogen, neutrophils, AST, TB, and lymphocytes are important factors influencing postoperative recurrence in liver cancer patients. In Fig. 4A, the factors highlighted in red are significant risk factors for recurrence, while those in blue are protective factors against recurrence. Corresponding to this, in the samples used in this study, there are violin plots depicting the feature importance rankings for each sample in Fig. 4B. In addition, according to SHAP values, we selected two subjects respectively, including members of the recurrence group and non-recurrence group. In the recurrent samples, N = 4.11, AST = 46.0, Albumin = 43.1, TB = 15, Total cholesterol = 5.24, and L = 0.79 are important risk factors for recurrence (Fig. 4C). Conversely, factors like Fibrinogen = 2.21 and NLR = 5.2 are important protective factors against recurrence. In the non-recurrent samples, Albumin = 34.7, Fibrinogen = 3.42, GGT = 202.0, and NLR = 0.86 are important risk factors for recurrence, while factors like TB = 13.1, AST = 71.0, N = 2.82, L = 3.28, and Age = 67 are important protective factors against recurrence (Fig. 4D).

Fig. 4.

Fig. 4

SHAP summary plot and SHAP explanation of two patients.

Notes:Fig. 4A and B displays the importance ranking of feature variables based on the MLP machine learning model. Red represents variables that are risk factors for postoperative recurrence in hepatocellular carcinoma, while blue represents variables that act as protective factors. Fig. 4C presents a high-risk SHAP interpretation model for postoperative recurrence in patients with hepatocellular carcinoma after hepatectomy, and Fig. 4D shows a low-risk SHAP interpretation model for postoperative recurrence in the same patient group.

Abbreviation: SHAP, Shapley additive explanations; MLP, multilayer perceptron; N: neutrophil; M: macrophages; NLR: neutrophils to lymphocytes ratio; GGT: γ-glutamyl transpeptidase; Plt: platelets, L: lymphocytes, AST: aspartate aminotransferase, TB: total bilirubin, ALT: alanine aminotransferase.

3.5. Web calculator

Based on the MLP model, we developed a web predictor to predict the risk of postoperative recurrence in HCC patients after hepatectomy. We can predict the postoperative recurrence risk in HCC patients after hepatectomy by setting variables in the sidebar of the website, and provide the importance contributions of each indicator for the occurrence of postoperative recurrence in the respective participants (https://livercancerrecurrence-jpbrmbtbsfn4jojos4anta.streamlit.app/) (Fig. 5). In addition, readers can also access the code used in our research, as well as replicate the figures in the article, by visiting (https://github.com/Wu-Shi-Nan/Liver_cancer/blob/main/code.py). The parameters for each machine learning model and the versions of the libraries used are also available in the GitHub repository.

Fig. 5.

Fig. 5

Web calculator for predicting the recurrence of patients with hepatocellular carcinoma after hepatectomy.

4. Discussion

Studies revealed that the recurrence risk within 5 years remained as high as 60 % even in patients with early-stage hepatocellular carcinoma after hepatectomy [29]. The high recurrence risk of HCC greatly reduced the survival rate of patients. Multiple preoperative indicators influenced cancer patient recurrence after hepatectomy. Zhang et al. found that GGT can serve as an independent factor for assessing postoperative survival prognosis in liver cancer patients, and it is closely associated with postoperative overall survival (OS) and disease-free survival (DFS) [17]. In our current study, the results also indicate that GGT is a positive factor influencing the risk of postoperative recurrence in liver cancer patients. Dai et al. demonstrated that preoperative high fibrinogen was associated with rapid recurrence of HCC [30]. Similarly, based on our feature variable importance ranking using MLP machine learning, it is evident that fibrinogen significantly contributes as a positive factor influencing its recurrence. Recent studies have shown that the immune microenvironment plays a crucial role in the invasive and malignant tendencies of tumors. Neutrophils, as important stress cells in cellular inflammation, have been previously shown to be an important indicator for assessing tumor staging and invasive characteristics in liver cancer when considering the neutrophil-to-lymphocyte ratio [31]. In our current study, the abnormally elevated levels of neutrophils were also significantly correlated with postoperative recurrence in liver cancer patients. We employed the MLP machine learning algorithm to predict postoperative recurrence in liver cancer patients and analyzed the feature variables that significantly impact recurrence. Some indicators emerged as significant risk factors, such as GGT, fibrinogen, neutrophils, and TB, while others served as significant protective factors, including AST and lymphocytes. Through visual explanations of the black-box machine learning model, we were able to provide insights into the effects of abnormal serum value elevations or reductions on predicting postoperative recurrence in liver cancer patients.

Researchers have proposed several methods to assess the prognosis of patients with liver cancer, such as tumor-node-metastasis, cancer of the liver Italian program (CLIP) score, the Singapore liver cancer recurrence (SLICER) score, and china liver cancer (CNLC) staging [[32], [33], [34]]. However, due to inherent defects, the prognostic value of these methods was still not widely recognized. In addition, many prognostic models based on cox regression have also been developed, such as ERASL and ALBI grade [35,36]. However, the prediction methods based on cox regression are not very accurate due to their statistical drawbacks. ML can be adapted to more complex nonlinear relationships because it can assess the nonlinear impact of risk factors on patient survival [37]. Therefore, more and more predictive models based on ML have been developed to predict the prognosis of tumor patients and show a bright application prospect.

In the present study, we compared six different machine learning models to assess the recurrence risk of HCC patients after hepatectomy and finally determined that the MLP model was the best predictive model. We further developed a web calculator based on the MLP model. Clinicians can assess the patient's recurrence risk by entering the patient's indicators on the web page. Furthermore, we conducted a comprehensive assessment of the various indicators for each participant. If an indicator was identified as a risk factor for recurrence, it was displayed in red on the web calculator. Conversely, if an indicator was deemed a protective factor for recurrence, it was represented in blue. MLP model is a machine learning algorithm based on an artificial neural network, which can effectively analyze linear and nonlinear characteristic variables to make effective and accurate predictions [38]. Previous studies used MLP to preprocess input images and achieve an accurate diagnosis of bladder cancer [39]. However, although machine learning models are more powerful and accurate than traditional statistical models, the interpretation of the models is correspondingly more complex. They like a black box, which limits their further clinical application. In this study, we further explained the MLP model through the SHAP method. SHAP is an independent machine learning model interpretation technique, which can explain the global and individual sample black box models and help understand the relationship between prediction indicators and results in the MLP model [40]. We applied SHAP to enhance the global interpretation of the MLP model applied to the recurrence risk prediction of HCC patients after liver resection and help to increase clinicians' trust in the clinical application of the machine learning model.

There were still some limitations in the current study. Firstly, this study was a single-center retrospective study, and the performance of the ML model may vary depending on the characteristics of patients in different regions and the datasets of different institutions. Moreover, as this is a retrospective study, it introduces the possibility of selection bias and limits the establishment of causal relationships. Secondly, because our sample size is relatively small, there is a significant margin of error in the AUC values during the ten-fold cross-validation process. However, multiple cross-validation results show that the optimal machine learning model is still MLP. We will further increase the sample size in our subsequent studies to obtain stable cross-validation results. Lastly, the specificity of our best machine learning model, MLP, in this study is relatively low. One potential reason might be the limited number of feature variables included. In subsequent research, we will collect more pre-operative, intra-operative, and post-operative indicators to establish a comprehensive and standardized decision-making system.

5. Conclusion

We successfully established an MLP model to predict the recurrence risk of HCC patients after hepatectomy. This predictive model can help identify patients at high recurrence risk and can provide patients with early and personalized treatment to further improve the prognosis and quality of life.

Supportive foundations

None.

Data availability statement

All data was in the manuscript and can be obtained from the corresponding author.

Authors’ contributions

Yang Yang and Linsen Ye contributed to the study's inception and design. Rongqiang Liu, Shinan Wu, Hao yuan Yu and Kaining Zeng equally contributed to the literature search, analysis and writing of the manuscript. Other authors contributed to the study design and study supervision. All authors approved the final version of the manuscript.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Contributor Information

Yang Yang, Email: yysysu@163.com.

Linsen Ye, Email: ye_linsen@163.com.

Abbreviation

HCC

hepatocellular carcinoma

ML:

machine learning

MLP

multilayer perceptron

SHAP

Shapley additive explanations

AB

adaptive boosting

BAG

bootstrapped aggregating;

LR

logistic regression

GBM

gradient boosting machine;

XGB

extreme gradient boost

ROC:

receiver operating characteristic curve

ALT

aminotransferase

AST

aspartate aminotransferase

GGT

γ-glutamyl transpeptidase

NLR

neutrophils to lymphocytes ratio

References

  • 1.Vogel A., Meyer T., Sapisochin G., Salem R., Saborowski A. Hepatocellular carcinoma. Lancet. 2022;400(10360):1345–1362. doi: 10.1016/S0140-6736(22)01200-4. [DOI] [PubMed] [Google Scholar]
  • 2.El-Serag H.B. Hepatocellular carcinoma. N. Engl. J. Med. 2011;365(12):1118–1127. doi: 10.1056/NEJMra1001683. [DOI] [PubMed] [Google Scholar]
  • 3.Sung H., Ferlay J., Siegel R.L., Laversanne M., Soerjomataram I., Jemal A., Bray F. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. Ca - Cancer J. Clin. 2021;71(3):209–249. doi: 10.3322/caac.21660. [DOI] [PubMed] [Google Scholar]
  • 4.Maki H., Hasegawa K. Advances in the surgical treatment of liver cancer. Biosci. Trends. 2022;16(3):178–188. doi: 10.5582/bst.2022.01245. [DOI] [PubMed] [Google Scholar]
  • 5.Tabrizian P., Jibara G., Shrager B., Schwartz M., Roayaie S. Recurrence of hepatocellular cancer after resection: patterns, treatments, and prognosis. Ann. Surg. 2015;261(5):947–955. doi: 10.1097/SLA.0000000000000710. [DOI] [PubMed] [Google Scholar]
  • 6.Deo R.C. Machine learning in medicine. Circulation. 2015;132(20):1920–1930. doi: 10.1161/CIRCULATIONAHA.115.001593. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Fleuren L.M., Klausch T.L.T., Zwager C.L., Schoonmade L.J., Guo T., Roggeveen L.F., Swart E.L., Girbes A.R.J., Thoral P., Ercole A., et al. Machine learning for the prediction of sepsis: a systematic review and meta-analysis of diagnostic test accuracy. Intensive Care Med. 2020;46(3):383–400. doi: 10.1007/s00134-019-05872-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Montazeri M., ZahediNasab R., Farahani A., Mohseni H., Ghasemian F. Machine learning models for image-based diagnosis and prognosis of COVID-19: systematic review. JMIR Med. Inform. 2021;9(4) doi: 10.2196/25181. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Liu W.C., Li M.X., Qian W.X., Luo Z.W., Liao W.J., Liu Z.L., Liu J.M. Application of machine learning techniques to predict bone metastasis in patients with prostate cancer. Cancer Manag. Res. 2021;13:8723–8736. doi: 10.2147/CMAR.S330591. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Li W., Zhou Q., Liu W., Xu C., Tang Z.R., Dong S., Wang H., Li W., Zhang K., Li R., et al. A machine learning-based predictive model for predicting lymph node metastasis in patients with Ewing's sarcoma. Front. Med. 2022;9 doi: 10.3389/fmed.2022.832108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Hasnain Z., Mason J., Gill K., Miranda G., Gill I.S., Kuhn P., Newton P.K. Machine learning models for predicting post-cystectomy recurrence and survival in bladder cancer patients. PLoS One. 2019;14(2) doi: 10.1371/journal.pone.0210976. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Liang J.D., Ping X.O., Tseng Y.J., Huang G.T., Lai F., Yang P.M. Recurrence predictive models for patients with hepatocellular carcinoma after radiofrequency ablation using support vector machines with feature selection methods. Comput. Methods Progr. Biomed. 2014;117(3):425–434. doi: 10.1016/j.cmpb.2014.09.001. [DOI] [PubMed] [Google Scholar]
  • 13.Wang K., Hong T., Liu W., Xu C., Yin C., Liu H., Wei X., Wu S.N., Li W., Rong L. Development and validation of a machine learning-based prognostic risk stratification model for acute ischemic stroke. Sci. Rep. 2023;13(1) doi: 10.1038/s41598-023-40411-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Huang Y., Chen H., Zeng Y., Liu Z., Ma H., Liu J. Development and validation of a machine learning prognostic model for hepatocellular carcinoma recurrence after surgical resection. Front. Oncol. 2020;10 doi: 10.3389/fonc.2020.593741. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Zeng J., Zeng J., Lin K., Lin H., Wu Q., Guo P., Zhou W., Liu J. Development of a machine learning model to predict early recurrence for hepatocellular carcinoma after curative resection. Hepatobiliary Surg. Nutr. 2022;11(2):176–187. doi: 10.21037/hbsn-20-466. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Ji G.W., Zhu F.P., Xu Q., Wang K., Wu M.Y., Tang W.W., Li X.C., Wang X.H. Machine-learning analysis of contrast-enhanced CT radiomics predicts recurrence of hepatocellular carcinoma after resection: a multi-institutional study. EBioMedicine. 2019;50:156–165. doi: 10.1016/j.ebiom.2019.10.057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Ekanayake I., Meddage D., Rathnayake U. A novel approach to explain the black-box nature of machine learning in compressive strength predictions of concrete using Shapley additive explanations (SHAP) Case Stud. Constr. Mater. 2022;16 [Google Scholar]
  • 18.Fernández A., Garcia S., Herrera F., Chawla N.V. SMOTE for learning from imbalanced data: progress and challenges, marking the 15-year anniversary. J. Artif. Intell. Res. 2018;61:863–905. [Google Scholar]
  • 19.Liu Y., Wang Y., Zhang J. Information Computing and Applications: Third International Conference, ICICA 2012, Chengde, China, September 14-16, 2012 Proceedings 3: 2012. Springer; 2012. New machine learning algorithm: random forest; pp. 246–252. [Google Scholar]
  • 20.Marcano-Cedeño A., Quintanilla-Domínguez J., Cortina-Januchs M., Andina D. IECON 2010-36th Annual Conference on IEEE Industrial Electronics Society: 2010. IEEE; 2010. Feature selection using sequential forward selection and classification applying artificial metaplasticity neural network; pp. 2845–2850. [Google Scholar]
  • 21.Pal S.K., Mitra S. classifiaction; 1992. Multilayer Perceptron, Fuzzy Sets. [DOI] [PubMed] [Google Scholar]
  • 22.Natekin A., Knoll A. Gradient boosting machines, a tutorial. Front. Neurorob. 2013;7 doi: 10.3389/fnbot.2013.00021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Stoltzfus J.C. Logistic regression: a brief primer. Acad. Emerg. Med. 2011;18(10):1099–1104. doi: 10.1111/j.1553-2712.2011.01185.x. [DOI] [PubMed] [Google Scholar]
  • 24.Chen T., He T., Benesty M., Khotilovich V., Tang Y., Cho H., Chen K., Mitchell R., Cano I., Zhou T. Xgboost: extreme gradient boosting. R package version 04-2. 2015;1(4):1–4. [Google Scholar]
  • 25.Rodríguez-Pérez R., Bajorath J. Interpretation of compound activity predictions from complex machine learning models using local approximations and Shapley values. J. Med. Chem. 2020;63(16):8761–8777. doi: 10.1021/acs.jmedchem.9b01101. [DOI] [PubMed] [Google Scholar]
  • 26.Massey F.J., Jr. The Kolmogorov-Smirnov test for goodness of fit. J. Am. Stat. Assoc. 1951;46(253):68–78. [Google Scholar]
  • 27.López-Espejo M.A., Chávez M.H., Huete I. Short-term outcomes after a neonatal arterial ischemic stroke. Childs Nerv. Syst. 2021;37(4):1249–1254. doi: 10.1007/s00381-020-04931-w. [DOI] [PubMed] [Google Scholar]
  • 28.Alawneh A., Anshasi H. Place of death for patients treated at a tertiary cancer center in Jordan. Support. Care Cancer. 2021;29(4):1837–1842. doi: 10.1007/s00520-020-05677-6. [DOI] [PubMed] [Google Scholar]
  • 29.Roayaie S., Obeidat K., Sposito C., Mariani L., Bhoori S., Pellegrinelli A., Labow D., Llovet J.M., Schwartz M., Mazzaferro V. Resection of hepatocellular cancer ≤2 cm: results from two Western centers. Hepatology. 2013;57(4):1426–1435. doi: 10.1002/hep.25832. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Dai T., Peng L., Lin G., Li Y., Yao J., Deng Y., Li H., Wang G., Liu W., Yang Y., et al. Preoperative elevated plasma fibrinogen level predicts tumor recurrence and poor prognosis in patients with hepatocellular carcinoma. J. Gastrointest. Oncol. 2019;10(6):1049–1063. doi: 10.21037/jgo.2019.09.11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Wong L., Bozhilov K., Hernandez B., Kwee S., Chan O., Ellis L., LeMarchand L. Underlying liver disease and advanced stage liver cancer are associated with elevated neutrophil-lymphocyte ratio. Clin. Mol. Hepatol. 2019;25(3):305–316. doi: 10.3350/cmh.2019.0004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Marrero J.A., Fontana R.J., Barrat A., Askari F., Conjeevaram H.S., Su G.L., Lok A.S. Prognosis of hepatocellular carcinoma: comparison of 7 staging systems in an American cohort. Hepatology. 2005;41(4):707–716. doi: 10.1002/hep.20636. [DOI] [PubMed] [Google Scholar]
  • 33.Ang S.F., Ng E.S., Li H., Ong Y.H., Choo S.P., Ngeow J., Toh H.C., Lim K.H., Yap H.Y., Tan C.K., et al. The Singapore Liver Cancer Recurrence (SLICER) Score for relapse prediction in patients with surgically resected hepatocellular carcinoma. PLoS One. 2015;10(4) doi: 10.1371/journal.pone.0118658. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Xie D.Y., Ren Z.G., Zhou J., Fan J., Gao Q. Chinese clinical guidelines for the management of hepatocellular carcinoma: updates and insights. Hepatobiliary Surg. Nutr. 2020. 2019;9(4):452–463. doi: 10.21037/hbsn-20-480. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Chan A.W.H., Zhong J., Berhane S., Toyoda H., Cucchetti A., Shi K., Tada T., Chong C.C.N., Xiang B.D., Li L.Q., et al. Development of pre and post-operative models to predict early recurrence of hepatocellular carcinoma after surgical resection. J. Hepatol. 2018;69(6):1284–1293. doi: 10.1016/j.jhep.2018.08.027. [DOI] [PubMed] [Google Scholar]
  • 36.Lin C.Y., Lin C.C., Wang C.C., Chen C.L., Hu T.H., Hung C.H., Huang P.Y., Tsai M.C. The ALBI grade is a good predictive model for very late recurrence in patients with hepatocellular carcinoma undergoing primary resection. World J. Surg. 2020;44(1):247–257. doi: 10.1007/s00268-019-05197-3. [DOI] [PubMed] [Google Scholar]
  • 37.Kourou K., Exarchos T.P., Exarchos K.P., Karamouzis M.V., Fotiadis D.I. Machine learning applications in cancer prognosis and prediction. Comput. Struct. Biotechnol. J. 2015;13:8–17. doi: 10.1016/j.csbj.2014.11.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Khan R.U., Almakdi S., Alshehri M., Kumar R., Ali I., Hussain S.M., Haq A.U., Khan I., Ullah A., Uddin M.I. Probabilistic approach to COVID-19 data analysis and forecasting future outbreaks using a multi-layer perceptron neural network. Diagnostics. 2022;12(10) doi: 10.3390/diagnostics12102539. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Lorencin I., Anđelić N., Španjol J., Car Z. Using multi-layer perceptron with Laplacian edge detector for bladder cancer diagnosis. Artif. Intell. Med. 2020;102 doi: 10.1016/j.artmed.2019.101746. [DOI] [PubMed] [Google Scholar]
  • 40.Lundberg S.M., Erion G., Chen H., DeGrave A., Prutkin J.M., Nair B., Katz R., Himmelfarb J., Bansal N., Lee S.I. From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell. 2020;2(1):56–67. doi: 10.1038/s42256-019-0138-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

All data was in the manuscript and can be obtained from the corresponding author.


Articles from Heliyon are provided here courtesy of Elsevier

RESOURCES