Skip to main content
Gastro Hep Advances logoLink to Gastro Hep Advances
. 2022 Feb 3;1(1):29–37. doi: 10.1016/j.gastha.2021.09.003

Machine Learning–Based Personalized Prediction of Hepatocellular Carcinoma Recurrence After Radiofrequency Ablation

Masaya Sato 1,2,, Ryosuke Tateishi 2, Makoto Moriyama 2, Tsuyoshi Fukumoto 2, Tomoharu Yamada 2, Ryo Nakagomi 2, Mizuki Nishibatake Kinoshita 2, Takuma Nakatsuka 2, Tatsuya Minami 2, Koji Uchino 2, Kenichiro Enooku 2, Hayato Nakagawa 2, Shuichiro Shiina 3, Kota Ninomiya 4, Satoshi Kodera 4, Yutaka Yatomi 1, Kazuhiko Koike 2
PMCID: PMC11308827  PMID: 39129938

Abstract

Background and Aims

Radiofrequency ablation (RFA) is a widely accepted, minimally invasive treatment for hepatocellular carcinoma (HCC). This study aimed to develop a machine learning (ML) model to predict the risk of HCC recurrence after RFA treatment for individual patients.

Methods

We included a total of 1778 patients with treatment-naïve HCC who underwent RFA. The cumulative probability of overall recurrence after the initial RFA treatment was 78.9% and 88.0% at 5 and 10 years, respectively. We developed a conventional Cox proportional hazard model and 6 ML models—including the deep learning–based DeepSurv model. Model performance was evaluated using Harrel’s c-index and was validated externally using the split-sample method.

Results

The gradient boosting decision tree (GBDT) model achieved the best performance with a c-index of 0.67 from external validation, and it showed a high discriminative ability in stratifying the external validation sample into 2, 3, and 4 different risk groups (P < .001 among all risk groups). The c-index of DeepSurv was 0.64. In order of significance, the tumor number, serum albumin level, and des-gamma-carboxyprothrombin level were the most important variables for the prediction of HCC recurrence in the GBDT model. Also, the current GBDT model enabled the output of a personalized cumulative recurrence prediction curve for each patient.

Conclusion

We developed a novel ML model for the personalized risk prediction of HCC recurrence after RFA treatment. The current model may lead to the personalization of effective follow-up strategies after RFA treatment according to the risk stratification of HCC recurrence.

Keywords: Hepatocellular Carcinoma, Radiofrequency Ablation, Machine Learning, DeepSurv

Introduction

Hepatocellular carcinoma (HCC) is the second most common cause of cancer-related deaths worldwide, and its incidence is expected to rise further.1 Only 20% of patients with HCC are candidates for resection.2 Image-guided minimally invasive techniques have been widely used for the treatment of HCC in the past 2 decades, and radiofrequency ablation (RFA) has yielded promising clinical results, with survival rates comparable with those of hepatectomy.3 During the last decade, there has been growing interest in the use of RFA in patients with liver tumors considered to be unresectable owing to impaired hepatic functions or associated comorbidities.4,5 Improvement of surveillance programs—that have reduced the number of HCC cases detected at an already advanced stage—has also increased the use of RFA.2

As many as 70% of patients have HCC recurrence within 5 years after RFA treatment; this includes distant recurrence due to intrahepatic metastasis or de novo primary cancer development and local tumor progression.3 The HCC recurrence risk varied widely among patients and was shown to be affected by tumor factors—including tumor size, tumor number, or tumor markers—as well as underlying chronic liver diseases—such as fibrosis or inflammation. These contribute to the development of HCC.3,6 Although current guidelines recommend regular follow-ups—using imaging and serum tumor marker studies of patients who received curative treatment for HCC7, 8, 9, 10, 11, 12—surveillance intervals after RFA is preferred to be customized based on the recurrence risk. Objective individual risk assessments and stratifications are warranted for the establishment of personalized surveillance strategies.

Machine learning (ML) is a multidisciplinary field that draws on the fields of computer science and mathematics for developing and implementing computer algorithms, which can maximize predictive accuracy through the development of probabilistic or statistical models, based on existing data.13 Deep learning (DL) has been a major topic and has achieved great success in various fields—especially in the field of image recognition.14 Also recently; nonlinear ML algorithms such as DeepSurv, a DL generalization of the Cox proportional hazard (CPH) model which can handle survival or cumulative incidence, were developed.15, 16, 17 Although these models have been implemented successfully in the survival or cumulative hazard analysis of various diseases,18,19 there have been no reports found on HCC recurrence prediction after RFA using ML-based survival or cumulative incidence models.

This study aimed to develop an ML model for hazard identification to predict the risk of HCC recurrence after RFA treatment for individual patients. This may lead to personalized settings of RFA follow-up intervals.

Materials and Methods

Patients

From February 1999 to December 2019, 1855 patients with treatment-naïve HCC underwent RFA as an initial treatment at the Department of Gastroenterology, the University of Tokyo Hospital. For all 1855 patients, information on patient age at initial treatment; sex; levels of serum albumin, total bilirubin (TB), aspartate aminotransferase (AST), alanine aminotransferase (ALT), serum creatinine, and hepatitis B surface (HBs) antigen; platelet count; prothrombin time (PT); and hepatitis C virus (HCV) antibody status were available. Finally, we enrolled 1778 patients in the present study after excluding 49 patients in whom RFA was performed with a noncurative intent to reduce tumor burden. Owing to warfarin administration, information on serum des-gamma-carboxyprothrombin (DCP) levels regarding 28 patients was not available. The inclusion criteria for RFA were as follows: platelet count of no less than 50 × 103/mm3, TB level < 3 mg/dL, and prothrombin activity level > 50%. Patients with macroscopic vascular invasion, refractory ascites, or extrahepatic metastasis were excluded. The clinical data on each patient who underwent RFA in our department were stored in a database designed and maintained prospectively to assess the short- and long-term efficacy and safety of the RFA treatment.

The current study was conducted in accordance with the ethical guidelines for epidemiological research provided by the Japanese Ministry of Education, Culture, Sports, Science and Technology and the Ministry of Health, Labor, and Welfare. The study design was approved by the University of Tokyo Medical Research Center Ethics Committee (Registration No. 2058).

Diagnosis of HCC

HCC was diagnosed using dynamic computed tomography (CT) or magnetic resonance imaging (MRI); hyperattenuation during the arterial phase and washout during the late phase was regarded as a definite sign of HCC.20 When a definite diagnosis of HCC could not be made using CT, an ultrasound-guided tumor biopsy was performed and the pathological diagnosis was based on the Edmondson-Steiner criteria.21

RFA Procedure and Follow-Up

A session was defined as a single intervention episode that consisted of one or more ablations performed on one or more tumors, and the RFA procedure was defined as the completed effort to ablate all tumors that consisted of one or more sessions.22 The details of the RFA technique are described elsewhere.23 Briefly, RFA was performed under real ultrasound guidance using a single needle pass (Cool-tip®, Covidien, Boulder, CO, USA/Medtronic, Minneapolis, MN, USA, or VIVARF, STARmed®, Gyeonggi-do, Korea). After treatment, we monitored HCC recurrence using dynamic CT or MRI every 4 months. Furthermore, measurements of the serum alpha-fetoprotein (AFP), DCP, and Lens culinaris agglutinin-reactive fraction of AFP (AFP-L3) levels were also monitored. HCC recurrence was diagnosed using the same criteria as applied to the diagnosis of HCC. Liver biochemistry tests were also performed every 4 months to evaluate liver function.

Model Development

We developed a conventional CPH model and 6 ML models: a DL-based model (DeepSurv), neural multitask logistic regression model (MTLR), random forest (RF), gradient boosting decision tree (GBDT), elastic net penalized regression, and support vector machine (SVM). For model development, we used the following 21 candidate variables: patient age; sex; body mass index; alcoholic consumption; tumor number; maximum tumor size; levels of serum albumin, TB, AST, ALT, serum creatinine, AFP, DCM, AFP-L3, and HBs antigen; platelet count and PT; presence or absence of ascites and encephalopathy; HCV antibody status; and the number of RFA sessions required for treatment. A glossary of terms in the ML models is shown in Table A1.

DeepSurv is a multilayer feedforward network that is inspired by the biological nervous systems for information processing, and it predicts the effects of variables on their hazard rate parameterized by the weight of the network.17

The MTLR model is an ML method for survival prediction which is an alternative to the CPH model. This model casts the outcome as a multitask binary classification on a discretized time axis and uses a sequence of dependent logistic regressions to ensure consistency of predictions.24 The neural MTLR is an approach for survival prediction based on a neural network and is an extension of the MTLR framework.25

Random forests and GBDTs are decision tree (DT) algorithms and are structures for classification. They include a root node, branches, and leaf nodes. RF and GBDT algorithms are ensembles of multiple DT models through bagging and boosting, respectively.26 RFs build multiple DTs in parallel, whereas GBDTs build a sequential series of trees—where each tree tries to complement each other and correct for the residuals in the predictions made by all previous trees.27,28 Elastic net penalized regression is a hybrid of the 2 approaches; it has a penalty structure that is a mixture of the LASSO29 and ridge30 penalties, controlled by a specified mixing parameter.31

SVM, a supervised ML technique, is an optimal margin-based algorithm,32 which transforms the original data into a higher dimensional space using nonlinear mapping. In the new higher dimension, the SVM finds the appropriate hyperplane which separates the original data according to an annotation profile with the help of support vectors.33 All models were implemented in Python using PySurvival34 and scikit-survival.35 The latter builds directly on the scikit-learn library to implement survival models.

Statistical Analysis

Continuous variables were expressed as medians with the first and third quartiles, while categorical variables were expressed as numbers and frequencies (%). To evaluate the predictive performance of the model, we randomly split a total of 1778 patients into a training set (90% of patients), which was used to build the model, and a test set (10% of patients), which was used to evaluate the performance of each model. The discrimination performances among the models in both the training set (internal validation) and test set (external validation) were assessed using Harrell`s c-index.36 The variable importance for class discrimination in the DT predictive model was assessed using the permutation-based variable importance scores37 The discriminative ability of each model was assessed using Kaplan-Meier curves and log-rank tests for various risk groups using R3.6.3 (http://www.R-project.org).38 The threshold of P-values for significance was set at <.05.

Results

Patient Characteristics

We included a total of 1778 patients with treatment-naïve HCC who received RFA as initial treatment at our institution. Table 1 shows the clinical characteristics of the patients. Approximately 60% were male, and the median age was 70 years. The median maximum tumor size and tumor number are 2.2 cm and 1, respectively. The cumulative probability of overall recurrence after the initial RFA treatment was 78.9% and 88.0% at 5 and 10 years, respectively.

Table 1.

Baseline Characteristics of the 1778 Patients Undergoing Radiofrequency Ablation for Primary Hepatocellular Carcinoma

Variable Value (n = 1778)
Sex, n (%)
 Female 670 (37.7)
 Male 1108 (62.3)
Age (y) 70 (64–76)
Body mass index 23.4 (21.3–36.7)
Viral infection, n (%)
 HBs antigen positive 200 (11.2)
 HCV antibody positive 1219 (68.7)
 Both positive 15 (0.8)
 Both negative 344 (19.3)
Alcoholic consumption, n (%)
 <50 g/d 1346 (75.7)
 50–80 g/d 171 (9.6)
 >80 g/d 261 (14.7)
Albumin (g/dL) 3.7 (3.4–4.0)
Creatinine (mg/dL) 0.7 (0.6–0.9)
Total bilirubin (mg/dL) 0.8 (0.6–1.1)
Prothrombin time (%) 86 (74–100)
Platelet count (×103/mm3) 110 (79–150)
AST (IU/l) 50 (34.25–70.75)
ALT (IU/l) 42 (27–66)
Tumor size (cm) 2.2 (1.7–2.9)
Tumor number 1 (1–2)
Serum AFP (ng/dL) 15 (6–56)
Serum DCP (mAU/mL) 22 (15–49)
Serum AFP-L3 (%) 0.5 (0.5–6.8)
Ascites, n (%)
 Absent 1563 (87.9)
 Present 215 (12.1)
Encephalopathy, n (%)
 Absent 1741 (97.9)
 Present 37 (2.1)
RFA sessions required for treatment, n (%)
 1 1028 (57.8)
 2 590 (33.2)
 3 or more 160 (9.0)

Data are expressed as median (1–3rd quartile).

Comparison of the Predictive Performance of Each Model

The discriminatory performance of the predictive models assessed using Harrell’s c-index is shown in Table 2. The c-index of the CPH model in the test set was 0.64. Among all models, the GBDT model showed the highest c-index of 0.67 (95% confidence interval, 0.61–0.72) in the test set (external validation.)

Table 2.

Predictive Performance (c-Index) of the Models

Machine learning model Training set (internal validation) Test set (external validation)
Cox proportional hazard model 0.64 0.64
DeepSurv 0.67 0.65
Neural multi-task logistic regression model 0.62 0.63
Random forest model 0.74 0.66
Gradient boosting model 0.72 0.67
Elastic net penalized regression 0.62 0.62
Support vector machine 0.63 0.65

Predictive Models for HCC Recurrence Using the GBDT Model

Figure 1 shows the Kaplan-Meier curves of HCC recurrence after the initial RFA; they are categorized into 2 (Figure 1A), 3 (Figure 1B), and 4 (Figure 1C) risk groups in the test set using the GBDT model—which is the model that showed the highest c-index among all models. The GBDT model showed high discriminative ability in all different risk divisions (P < .001 among risk groups in all risk divisions—see Figure 1A–C). We then investigated the variable importance of the predictive model using the GBDT model developed in the current study. Figure 2 shows the relative importance of each variable assessed using the permutation-based variable importance scores of this model. The most important variables for HCC recurrence were the tumor number, serum albumin level, and DCP level.

Figure 1.

Figure 1

Kaplan-Meier curves for different risk groups of HCC recurrence after initial RFA—categorized into 2 (A), 3 (B), and 4 (C) risk groups using the gradient boosting model.

Figure 2.

Figure 2

Relative variable importance of the gradient boosting model for the prediction of HCC recurrence assessed using the permutation-based variable importance scores.

Personalized Prediction of HCC Recurrence in Individual Patients Using the GBDT Model

Finally, we applied the GBDT model for the prediction of HCC recurrence after RFA in individual patients. Figure 3 shows plots of the cumulative recurrence rate for 9 individual patients, 3 each from the high-, intermediate-, and low-risk groups, in the test set predicted by the GBDT model. The predicted HCC recurrence rate varies widely depending on a variety of factors, including patient background or tumor factors. We also applied the current model to predict HCC recurrence after RFA in 2 hypothetical patients: a high-risk case of recurrence in a man in his 70s with large multiple lesions and high tumor markers (Patient A) and a low-risk case in a woman in her 40s with a small single lesion and normal tumor markers (Patient B) (Figure 4). The GBDT model made a clear distinction between the high-risk Patient A and low-risk Patient B.

Figure 3.

Figure 3

Plots of the predicted cumulative recurrence rate of 178 individual patients in the test set as demonstrated by the gradient boosting model.

Figure 4.

Figure 4

Personalized prediction of HCC recurrence after RFA in 2 hypothetical patients: a high-risk case of recurrence in a man in his 70s with large multiple lesions and high tumor markers (Patient A) and a low-risk case in a woman in her 40s with a small single lesion and normal tumor markers (Patient B). Details of the 2 hypothetical patients are shown in the table on the right in the figure.

Discussion

In the current study, we developed CPH and ML models to predict the risk of HCC recurrence after an initial RFA treatment. We compared the conventional CPH model based on the linear fitting of variables39 and nonlinear ML models. Model fitting is important for a successful predictive method. If the data are linearly separable, a linear model will fit the data.40,41 Variables often have nonlinear effects with cancer recurrence in clinical settings.42,43 In actual fact, the nonlinear GBDT model showed the best predictive performance assessed using the c-index in the present study. The GBDT model showed a higher predictive value compared with DeepSurv which has a DL architecture. Although DeepSurv is a promising and powerful model for the survival or time-to-event prediction,17 this model did not always show the best performance for event prediction in the medical field.19,44 Identifying an optimal classifier depending on the available data is important to obtain precise predictions of HCC recurrence.

Each ML model has its strengths or weaknesses. DL has enabled major breakthroughs in the processing of images, video, speech, or audio.14 Although DL is a powerful technique, it requires a large polynomial sample size in terms of the depth of the network to obtain ideal convergence boundaries.45 This may be difficult to achieve in clinical settings for ethical or methodological reasons. The main advantage of DT models (GBDT and RF) which showed the highest predictive performance in the current study is their capability to break down a complex structure into a collection of simpler structures.46 The major advantage of the RF model is that it is robust to noise and overfitting,47 whereas those of GBDT models are its state-of-the-art predictive performance on tabular data and the customizability of loss of function.26,48 The elastic net model ensures sparsity in the feature weights and promotes a grouping effect of the highly correlated features.49 In addition, the SVM can, with relative ease, overcome the high dimensionality problem (ie, the problem that arises when there is a large number of input variables relative to the number of available observations).50 However, the SVM is computationally expensive because it involves quadratic optimization problems.51

In the current study, we also analyzed feature importance for the prediction of HCC recurrence using the GBDT model. The tumor number, serum albumin level, and DCP level, which are well-known risk factors for HCC recurrence,3,52 were found to be the most important variables. In line with our results, serum albumin level was shown to be a significant predictor of recurrence or recurrence-free survival after a curative treatment for HCC.53, 54, 55 Huang et al19 developed a predictive ML model for HCC recurrence after surgical resection. In this study, the GBDT model was selected as the optimal model having a c-index of 0.697 in the external validation, and tumor factors such as tumor number, tumor size, or vascular invasion were identified as important predictors. Yamashita et al56 developed a DL-based system (HCC-SurvNet) that predicts the risk of HCC recurrence using histological digital images of paraffin-embedded HCC samples and demonstrated a c-index of 0.683. In this study, tumor factors such as tumor size and microvascular invasion were shown to be candidate predictors of HCC recurrence, which were significantly associated with the HCC-SurvNet score. Because the treatment options and data sets were different, a quantitative comparison of c-indices may not be practical. The c-index of our model, 0.67, is comparable or slightly lower than the aforementioned models.19,56 Because tumor biopsies are not usually performed in RFA procedures—owing to concerns about the risk of tumor dissemination57—we could not obtain the histological information. The lack of inclusion of important factors, such as microvascular invasion, may be one of the probable reasons for the slightly lower predictive performance of our ML model than the models in previous reports.

Personalization is one of the ultimate goals of modern medicine.58 To realize personalized medicine, the individualized prediction of risk for each patient is essential. In the current study, we utilized the developed ML model for the individual prediction of HCC recurrence for each patient. To the best of our knowledge, the present study is the first demonstration of the personalized prediction of HCC recurrence—after RFA—in individual patients using an ML model. This may lead to the individualization of follow-up intervals after RFA. However, the accuracy for the prediction of HCC recurrence is not satisfactory in the current study. The accurate predictions of recurrent HCC after curative treatment were limited to fairly poor performances with c-indices less than 0.759 in the current and previous studies using clinical data or histological images of resected tumors.19,56 This is likely because of its biological heterogeneity, variety of recurrent patterns (ie, de novo recurrence, local tumor progression, or intrahepatic metastasis),3 or technical complexity of treatment procedures. Because the predictive accuracy of ML models depends on the number of samples for training,60 further studies for the improvement of predictive models using larger sample size are needed for the clinical use of such models. The integration of multiple heterogenous information (ie, image modality and clinical data information) using a multimodal representation model 61 may be a useful approach to obtain greater predictive performance.

Current guidelines recommend regular follow-ups using imaging and serum tumor markers studies of patients who received curative treatments for HCC.7, 8, 9, 10, 11, 12 The American Association for the Study of Liver Diseases provides a guideline that recommends a 3- to 4-month imaging interval, after initial treatment, of up to 2 years; and thereafter, the interval can be at less frequent intervals.7 The European Association for the Study of the Liver and the Japanese Society of Hepatology also recommend 3- to 4-month follow-up intervals according to the surveillance approaches used in extremely high-risk cases at the time of onset.8,11 CT and MRI were used for post-RFA surveillance; therefore, individualization of the follow-up interval according to the risk of recurrence is desirable with regards to radiation exposure and medical economics. Predictive models provide a personalized assessment of the probability of a clinical event using patient- or tumor-specific characteristics—and will be further incorporated into the field of cancer medicine in the future.

The limitation of the current study is its single-center design that limited the generalizability of the findings, and therefore, our predictive model might not be generalizable to other practice settings. Further validation in a multicenter setting is required to confirm the general feasibility of the proposed ML model in the present study.

In conclusion, we developed a novel ML model for the personalized risk prediction of HCC recurrence after RFA treatment. The current model may lead to the personalization of follow-up strategies after RFA treatment according to the risk stratification of HCC recurrence.

Acknowledgments:

The authors would like to thank Editage (www.editage.com) for English language editing.

Authors' Contributions:

Masaya Sato: conception, design, analysis and interpretation of data, drafting of the manuscript, final approval given. Ryosuke Tateishi: data collection, critical revision of manuscript, final approval given. Makoto Moriyama: data collection, final approval given. Tsuyoshi Fukumoto: data collection, final approval given. Tomoharu Yamada: data collection, final approval given. Ryo Nakagomi: data collection, final approval given. Mizuki Nishibatake Kinoshita: data collection, final approval given. Takuma Nakatsuka: data collection, final approval given. Tatsuya Minami: data collection, final approval given. Koji Uchino: data collection, final approval given. Kenichiro Enooku: data collection, final approval given. Hayato Nakagawa: data collection, final approval given. Shuichiro Shiina: data collection, final approval given. Kota Ninomiya: analysis of data, final approval given. Satoshi Kodera: analysis of data, final approval given. Yutaka Yatomi: supervision of research, critical revision of manuscript, final approval given. Kazuhiko Koike: supervision of research, critical revision of manuscript, final approval given.

Footnotes

Conflicts of interest: The authors disclose no conflicts.

Funding: This work was supported by the Health, Labour and Welfare Policy Research Grants from The Ministry of Health, Labour, and Welfare of Japan (Policy Research for Hepatitis Measures [H30-Kansei-Shitei-003]).

Ethical Statement: The corresponding author, on behalf of all authors, jointly and severally, certifies that their institution has approved the protocol for any investigation involving humans or animals and that all experimentation was conducted in conformity with ethical and humane principles of research.

Data Transparency Statement: The data that support the findings of this study are available from the corresponding author, MS, upon reasonable request.

Material associated with this article can be found in the online version at https://doi.org/10.1016/j.gastha.2021.09.003.

Supplementary Materials

Table A1
mmc1.docx (15.9KB, docx)

References

  • 1.Singal A.G., El-Serag H.B. Hepatocellular carcinoma from epidemiology to prevention: translating knowledge into practice. Clin Gastroenterol Hepatol. 2015;13:2140–2151. doi: 10.1016/j.cgh.2015.08.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Bruix J., Sherman M. Management of hepatocellular carcinoma. Hepatology. 2005;42:1208–1236. doi: 10.1002/hep.20933. [DOI] [PubMed] [Google Scholar]
  • 3.Shiina S., Tateishi R., Arano T., et al. Radiofrequency ablation for hepatocellular carcinoma: 10-year outcome and prognostic factors. Am J Gastroenterol. 2012;107:569–577. doi: 10.1038/ajg.2011.425. quiz 578. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Livraghi T., Goldberg S.N., Lazzaroni S., et al. Small hepatocellular carcinoma: treatment with radio-frequency ablation versus ethanol injection. Radiology. 1999;210:655–661. doi: 10.1148/radiology.210.3.r99fe40655. [DOI] [PubMed] [Google Scholar]
  • 5.Tateishi R., Shiina S., Teratani T., et al. Percutaneous radiofrequency ablation for hepatocellular carcinoma. An analysis of 1000 cases. Cancer. 2005;103:1201–1209. doi: 10.1002/cncr.20892. [DOI] [PubMed] [Google Scholar]
  • 6.Lee S.H., Kim S.U., Jang J.W., et al. Use of transient elastography to predict de novo recurrence after radiofrequency ablation for hepatocellular carcinoma. OncoTargets Ther. 2015;8:347. doi: 10.2147/OTT.S75077. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Bruix J., Sherman M. Management of hepatocellular carcinoma: an update. Hepatology. 2011;53:1020–1022. doi: 10.1002/hep.24199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.European Association for the Study of the Liver. European Organisation for Research and Treatment of Cancer EASL-EORTC clinical practice guidelines: management of hepatocellular carcinoma. J Hepatol. 2012;56:908–943. doi: 10.1016/j.jhep.2011.12.001. [DOI] [PubMed] [Google Scholar]
  • 9.Benson A.B., 3rd, D'Angelica M.I., Abbott D.E., et al. NCCN guidelines insights: hepatobiliary cancers, version 1.2017. J Natl Compr Canc Netw. 2017;15:563–573. doi: 10.6004/jnccn.2017.0059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Omata M., Lesmana L.A., Tateishi R., et al. Asian Pacific Association for the Study of the Liver consensus recommendations on hepatocellular carcinoma. Hepatol Int. 2010;4:439–474. doi: 10.1007/s12072-010-9165-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Kokudo N., Hasegawa K., Akahane M., et al. Evidence-based Clinical Practice Guidelines for Hepatocellular Carcinoma: The Japan Society of Hepatology 2013 update (3rd JSH-HCC Guidelines) Hepatol Res. 2015;45 doi: 10.1111/hepr.12464. [DOI] [PubMed] [Google Scholar]
  • 12.Korean Liver Cancer Study Group (KLCSG) National Cancer Center, Korea (NCC) 2014 KLCSG-NCC Korea practice guideline for the management of hepatocellular carcinoma. Gut Liver. 2015;9:267–317. doi: 10.5009/gnl14460. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Sato M., Morimoto K., Kajihara S., et al. Machine-learning approach for the development of a novel predictive model for the diagnosis of hepatocellular carcinoma. Sci Rep. 2019;9:7704. doi: 10.1038/s41598-019-44022-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.LeCun Y., Bengio Y., Hinton G. Deep learning. Nature. 2015;521:436–444. doi: 10.1038/nature14539. [DOI] [PubMed] [Google Scholar]
  • 15.Cox D.R. Regression models and life-tables. J R Stat Soc B Methodol. 1972;34:187–202. [Google Scholar]
  • 16.Babajide Mustapha I., Saeed F. Bioactive molecule prediction using extreme gradient boosting. Molecules. 2016;21:983. doi: 10.3390/molecules21080983. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Katzman J.L., Shaham U., Cloninger A., et al. DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network. BMC Med Res Methodol. 2018;18:1–12. doi: 10.1186/s12874-018-0482-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Audureau E., Carrat F., Layese R., et al. Personalized surveillance for hepatocellular carcinoma in cirrhosis - using machine learning adapted to HCV status. J Hepatol. 2020;73:1434–1445. doi: 10.1016/j.jhep.2020.05.052. [DOI] [PubMed] [Google Scholar]
  • 19.Huang Y., Chen H., Zeng Y., et al. Development and validation of a machine learning prognostic model for hepatocellular carcinoma recurrence after surgical resection. Front Oncol. 2020;10:593741. doi: 10.3389/fonc.2020.593741. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Torzilli G., Minagawa M., Takayama T., et al. Accurate preoperative evaluation of liver mass lesions without fine-needle biopsy. Hepatology. 1999;30:889–893. doi: 10.1002/hep.510300411. [DOI] [PubMed] [Google Scholar]
  • 21.Edmondson H.A., Steiner P.E. Primary carcinoma of the liver: a study of 100 cases among 48,900 necropsies. Cancer. 1954;7:462–503. doi: 10.1002/1097-0142(195405)7:3<462::aid-cncr2820070308>3.0.co;2-e. [DOI] [PubMed] [Google Scholar]
  • 22.Ahmed M., Solbiati L., Brace C.L., et al. Image-guided tumor ablation: standardization of terminology and reporting criteria--a 10-year update. J Vasc Interv Radiol. 2014;25:1691–1705.e4. doi: 10.1016/j.jvir.2014.08.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Shiina S., Teratani T., Obi S., et al. A randomized controlled trial of radiofrequency ablation with ethanol injection for small hepatocellular carcinoma. Gastroenterology. 2005;129:122–130. doi: 10.1053/j.gastro.2005.04.009. [DOI] [PubMed] [Google Scholar]
  • 24.Yu C.-N., Greiner R., Lin H.-C., et al. Learning patient-specific cancer survival distributions as a sequence of dependent regressors. Adv Neural Inf Process Syst. 2011;24:1845–1853. [Google Scholar]
  • 25.Fotso S. Deep neural networks for survival analysis based on a multi-task framework. arXiv. 2018 preprint arXiv:1801.05512. [Google Scholar]
  • 26.Natekin A., Knoll A. Gradient boosting machines, a tutorial. Front Neurorobot. 2013;7:21. doi: 10.3389/fnbot.2013.00021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Qi Y. Ensemble machine learning. Springer; Boston, MA: 2012. Random forest for bioinformatics; pp. 307–323. [Google Scholar]
  • 28.Sheridan R.P., Wang W.M., Liaw A., et al. Extreme gradient boosting as a method for quantitative structure–activity relationships. J Chem Inf Model. 2016;56:2353–2360. doi: 10.1021/acs.jcim.6b00591. [DOI] [PubMed] [Google Scholar]
  • 29.Tibshirani R. Regression shrinkage and selection via the lasso. J R Stat Soc B Methodol. 1996;58:267–288. [Google Scholar]
  • 30.Hoerl A., Kennard R., Kotz S., et al. Wiley; New York: 1988. Encyclopedia of statistical sciences. [Google Scholar]
  • 31.Zou H., Hastie T. Regularization and variable selection via the elastic net. J R Stat Soc B Stat Methodol. 2005;67:301–320. [Google Scholar]
  • 32.Vapnik V. Springer-Verlag; New York: 2013. The nature of statistical learning theory. [Google Scholar]
  • 33.Hearst M.A., Dumais S.T., Osuna E., et al. Support vector machines. IEEE Intell Syst Appl. 1998;13:18–28. [Google Scholar]
  • 34.Fotso S. 2019. PySurvival: open source package for survival analysis modeling. Available at: https://www.pysurvival.io/. Accessed November 22, 2021. [Google Scholar]
  • 35.Pölsterl S. Scikit-survival: a library for time-to-event analysis built on top of scikit-learn. J Mach Learn Res. 2020;21:1–6. [Google Scholar]
  • 36.Harrell F.E., Jr., Lee K.L., Mark D.B. Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat Med. 1996;15:361–387. doi: 10.1002/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO;2-4. [DOI] [PubMed] [Google Scholar]
  • 37.Altmann A., Toloşi L., Sander O., et al. Permutation importance: a corrected feature importance measure. Bioinformatics. 2010;26:1340–1347. doi: 10.1093/bioinformatics/btq134. [DOI] [PubMed] [Google Scholar]
  • 38.Royston P., Altman D.G. External validation of a Cox prognostic model: principles and methods. BMC Med Res Methodol. 2013;13:1–15. doi: 10.1186/1471-2288-13-33. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Burkholder I., Edler L. Wiley StatsRef: statistics reference online. John Wiley & Sons, Inc; Hoboken, New Jersey: 2014. Linear model: overview. [Google Scholar]
  • 40.Wu X., Zhu X., Wu G.-Q., et al. Data mining with big data. IEEE Trans Knowl Data Eng. 2014;26:97–107. [Google Scholar]
  • 41.Hengl S., Kreutz C., Timmer J., et al. Data-based identifiability analysis of non-linear dynamical models. Bioinformatics. 2007;23:2612–2618. doi: 10.1093/bioinformatics/btm382. [DOI] [PubMed] [Google Scholar]
  • 42.Rajkomar A., Dean J., Kohane I. Machine learning in medicine. N Engl J Med. 2019;380:1347–1358. doi: 10.1056/NEJMra1814259. [DOI] [PubMed] [Google Scholar]
  • 43.Austin P.C., Pencinca M.J., Steyerberg E.W. Predictive accuracy of novel risk factors and markers: a simulation study of the sensitivity of different performance measures for the Cox proportional hazards regression model. Stat Methods Med Res. 2017;26:1053–1077. doi: 10.1177/0962280214567141. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.De Vries B., Hegeman J., Nijmeijer W., et al. Comparing three machine learning approaches to design a risk assessment tool for future fractures: predicting a subsequent major osteoporotic fracture in fracture patients with osteopenia and osteoporosis. Osteoporos Int. 2021;32:437–449. doi: 10.1007/s00198-020-05735-z. [DOI] [PubMed] [Google Scholar]
  • 45.Zhang C., Bengio S., Hardt M., et al. Understanding deep learning (still) requires rethinking generalization. Commun ACM. 2021;64:107–115. [Google Scholar]
  • 46.Saragih T.H., Fajri D.M.N., Rakhmandasari A. Comparative study of decision tree, k-nearest neighbor, and modified k-nearest neighbor on jatropha curcas plant disease identification. Kinet Game Technol Inf Syst Comput Network Comput Electron Control. 2020;5 [Google Scholar]
  • 47.Fawagreh K., Gaber M.M., Elyan E. Random forests: from early developments to recent advancements. Syst Sci Control Eng. 2014;2:602–609. [Google Scholar]
  • 48.Fernández-Delgado M., Cernadas E., Barro S., et al. Do we need hundreds of classifiers to solve real world classification problems? J Mach Learn Res. 2014;15:3133–3181. [Google Scholar]
  • 49.Cui L., Bai L., Wang Y., et al. Internet financing credit risk evaluation using multiple structural interacting elastic net feature selection. Pattern Recognit. 2021;114:107835. [Google Scholar]
  • 50.Kim M.-J. Building a cardiovascular disease prediction model for smartwatch users using machine learning: based on the Korea National Health and Nutrition Examination Survey. Biosensors. 2021;11:228. doi: 10.3390/bios11070228. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Kaushik A., Singh G., Bhatia A. SVM classification in multiclass letter recognition system. Glob J Comput Sci Technol. 2013;13 [Google Scholar]
  • 52.Nojiri S., Kusakabe A., Shinkai N., et al. Factors influencing distant recurrence of hepatocellular carcinoma following combined radiofrequency ablation and transarterial chemoembolization therapy in patients with hepatitis C. Cancer Manag Res. 2011;3:267. doi: 10.2147/CMR.S22073. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Cho H.J., Kim B., Kim H.J., et al. Liver stiffness measured by MR elastography is a predictor of early HCC recurrence after treatment. Eur Radiol. 2020;30:4182–4192. doi: 10.1007/s00330-020-06792-y. [DOI] [PubMed] [Google Scholar]
  • 54.Du Z.-G., Wei Y.-G., Chen K.-F., et al. Risk factors associated with early and late recurrence after curative resection of hepatocellular carcinoma: a single institution's experience with 398 consecutive patients. Hepatobiliary Pancreat Dis Int. 2014;13:153–161. doi: 10.1016/s1499-3872(14)60025-4. [DOI] [PubMed] [Google Scholar]
  • 55.Imai K., Takai K., Nishigaki Y., et al. Insulin resistance raises the risk for recurrence of stage I hepatocellular carcinoma after curative radiofrequency ablation in hepatitis C virus-positive patients: a prospective, case series study. Hepatol Res. 2010;40:376–382. doi: 10.1111/j.1872-034X.2009.00616.x. [DOI] [PubMed] [Google Scholar]
  • 56.Yamashita R., Long J., Saleem A., et al. Deep learning predicts postsurgical recurrence of hepatocellular carcinoma from digital histopathologic images. Sci Rep. 2021;11:1–14. doi: 10.1038/s41598-021-81506-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Imamura J., Tateishi R., Shiina S., et al. Neoplastic seeding after radiofrequency ablation for hepatocellular carcinoma. Am J Gastroenterol. 2008;103:3057–3062. doi: 10.1111/j.1572-0241.2008.02153.x. [DOI] [PubMed] [Google Scholar]
  • 58.Blaha M.J., Blumenthal R.S. Risk factors: new risk-assessment guidelines—more or less personalized? Nat Rev Cardiol. 2014;11:136. doi: 10.1038/nrcardio.2013.216. [DOI] [PubMed] [Google Scholar]
  • 59.Fischer J.E., Bachmann L.M., Jaeschke R. A readers' guide to the interpretation of diagnostic test properties: clinical example of sepsis. Intensive Care Med. 2003;29:1043–1051. doi: 10.1007/s00134-003-1761-8. [DOI] [PubMed] [Google Scholar]
  • 60.Zacksenhouse M., Braun S., Feldman M., et al. Toward helicopter gearbox diagnostics from a small number of examples. Mech Syst Signal Process. 2000;14:523–543. [Google Scholar]
  • 61.Pandey G., Dukkipati A. 2017 International Joint Conference on Neural Networks (IJCNN) IEEE; Manhattan, New York: 2017. Variational methods for conditional multimodal deep learning. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Table A1
mmc1.docx (15.9KB, docx)

Articles from Gastro Hep Advances are provided here courtesy of Elsevier

RESOURCES