CT-radiomics and clinical risk scores for response and overall survival prognostication in TACE HCC patients

Simon Bernatz; Oleg Elenberger; Jörg Ackermann; Lukas Lenga; Simon S Martin; Jan-Erik Scholtz; Vitali Koch; Leon D Grünewald; Yannis Herrmann; Maximilian N Kinzler; Angelika Stehle; Ina Koch; Stefan Zeuzem; Katrin Bankov; Claudia Doering; Henning Reis; Nadine Flinner; Falko Schulze; Peter J Wild; Renate Hammerstingl; Katrin Eichler; Tatjana Gruber-Rouh; Thomas J Vogl; Daniel Pinto dos Santos; Scherwin Mahmoudi

doi:10.1038/s41598-023-27714-0

. 2023 Jan 11;13:533. doi: 10.1038/s41598-023-27714-0

CT-radiomics and clinical risk scores for response and overall survival prognostication in TACE HCC patients

Simon Bernatz ^1,^2,^3,^✉, Oleg Elenberger ¹, Jörg Ackermann ⁴, Lukas Lenga ¹, Simon S Martin ¹, Jan-Erik Scholtz ¹, Vitali Koch ¹, Leon D Grünewald ¹, Yannis Herrmann ¹, Maximilian N Kinzler ⁵, Angelika Stehle ⁵, Ina Koch ⁴, Stefan Zeuzem ⁵, Katrin Bankov ², Claudia Doering ², Henning Reis ², Nadine Flinner ², Falko Schulze ², Peter J Wild ^2,^3,⁶, Renate Hammerstingl ¹, Katrin Eichler ¹, Tatjana Gruber-Rouh ¹, Thomas J Vogl ¹, Daniel Pinto dos Santos ^1,⁷, Scherwin Mahmoudi ¹

PMCID: PMC9834236 PMID: 36631548

Abstract

We aimed to identify hepatocellular carcinoma (HCC) patients who will respond to repetitive transarterial chemoembolization (TACE) to improve the treatment algorithm. Retrospectively, 61 patients (mean age, 65.3 years ± 10.0 [SD]; 49 men) with 94 HCC mRECIST target-lesions who had three consecutive TACE between 01/2012 and 01/2020 were included. Robust and non-redundant radiomics features were extracted from the 24 h post-embolization CT. Five different clinical TACE-scores were assessed. Seven different feature selection methods and machine learning models were used. Radiomics, clinical and combined models were built to predict response to TACE on a lesion-wise and patient-wise level as well as its impact on overall-survival prognostication. 29 target-lesions of 19 patients were evaluated in the test set. Response rates were 37.9% (11/29) on the lesion-level and 42.1% (8/19) on the patient-level. Radiomics top lesion-wise response prognostications was AUC 0.55–0.67. Clinical scores revealed top AUCs of 0.65–0.69. The best working model combined the radiomic feature LargeDependenceHighGrayLevelEmphasis and the clinical score mHAP_II_score_group with AUC = 0.70, accuracy = 0.72. We transferred this model on a patient-level to achieve AUC = 0.62, CI = 0.41–0.83. The two radiomics-clinical features revealed overall-survival prognostication of C-index = 0.67. In conclusion, a random forest model using the radiomic feature LargeDependenceHighGrayLevelEmphasis and the clinical mHAP-II-score-group seems promising for TACE response prognostication.

Subject terms: Cancer imaging, Hepatocellular carcinoma, Chemotherapy, Hepatocellular carcinoma, Biomarkers, Translational research

Introduction

In 2020 primary liver cancer ranked as the third leading cause of cancer death world-wide¹. Hepatocellular carcinoma (HCC) comprises around 75–85% of primary liver cancers and over the last 20 years its incidence has been rising^1,2. The diagnostic work-up of HCC-suspicious observations includes among others clinical examinations, laboratory analysis, imaging studies and often tumor biopsy². The treatment of HCC is complex and depends on the tumor stage. Potential curative treatments include liver resection, transplantation or local ablative methods like microwave ablation². HCC is predominantly arterially vascularized enabling the intra-arterial application of chemotherapy and embolization². These methods, like transarterial chemoembolization (TACE) are mainly palliative but may enable the complete destruction of the tumour or size-reduction to enable subsequent resection or transplantation (bridging therapy) in selected cases^2,3. TACE can prolong patient’s overall survival (OS) but it may also harm patients with reduction of OS depending on patient selection². A multitude of scores was developed to identify patients who will most likely benefit from TACE^2,4–8. Nevertheless, the scores’ validity is scarce and the use for treatment decision making is not recommended outside clinical trials². Consequently, patients are generally individually discussed in interdisciplinary tumor board meetings to define the appropriate therapy based on expert consensus. Recent emerges in the field of quantitative computational image analysis, termed radiomics, provide promising opportunities. Images are transformed in mineable data with subsequent bioinformatic analysis allowing lesion characterization beyond visual perception⁹. Radiomics’ prognostic and predictive potential was demonstrated in numerous cancer entities^9,10. Only scarce evidence is available for TACE in HCC patients and most studies examined the pre-TACE contrast-enhanced MRI or CT though variant contrast agents or injection protocols might alter the results^11–14. Lipiodol accumulation patterns after TACE might be used for response prognostication^15,16 but to the best of our knowledge a high dimensional pattern quantification by means of radiomics was not performed yet.

We hypothesized that lipiodol retention patterns from the post-embolization CT after the first TACE can be quantified by means of radiomics to serve as imaging biomarkers for TACE response prediction. The aim of this study was to develop a predictive model for HCC patients on a (I) lesion-wise level, (II) patient-wise level and (III) for overall survival. Further, we aimed to stratify the best working model by comparing CT-derived features with clinical scores and a holistic combined model.

Methods

Written informed consent was obtained from all patients and the study was approved by the institutional Review Boards of the University Cancer Center and the Ethical Committee at the University Hospital Frankfurt (project-number: SGI-10-2020). The patient population was not reported previously.

Study design

In this retrospective study we consecutively enrolled 61 HCC-patients (female, 12; mean age, 65.3 ± 10.0 years) who were treated with conventional TACE between 01/2012 and 01/2020. Inclusion criteria were: (1) Histologically confirmed HCC, (2) three consecutive TACE exclusively with the therapeutics Mitomycin C (Medac®, Hamburg, Germany) and Lipiodol (Guerbet GmbH, France) ± degradable starch microspheres (EmboCept®S, PharmaCept GmbH, Berlin, Germany) and injected in the same liver region, (3) all mRECIST target lesions (TL) were treated with each TACE, (4) post-TACE unenhanced CT 24 h after TACE, (5) contrast-enhanced arterial and portal-venous/ delayed phase MRI or CT prior to the first and after the third TACE. Exclusion criteria: (1) Consecutive TACE applied in different liver regions, (2) time interval between first and last TACE > 6 months, (3) prior local therapy of TLs, (4) no TLs, (5) insufficient image quality, (6) other chemotherapeutic agents. 61 patients met the criteria and were evaluated. In Fig. 1 we depict the flow-chart of patient inclusion following STARD. A scheme of the study’s workflow is shown in Supplementary Data S1.

STARD Flowchart of patient inclusion into the study. STARD, Standards for Reporting Diagnostic Accuracy Studies.

Conventional TACE

Patients were treated with TACE in clinical routine as described in prior studies¹⁷ and in Supplementary Data S2. Imaging acquisition and examination parameters of the post-TACE CT are summarized in Supplementary Data S2.

Assessment of tumor response

Response to TACE was assessed by mRECIST¹⁸. Lesion-wise response was defined as complete (CR) or partial response (PR) of TLs. Patient-wise response (CR or PR) was equivalent to the mRECIST overall response assessment¹⁸.

Image segmentation and preprocessing

The image stack was visualized and processed using the 3D Slicer software platform (http://slicer.org, version 4.9.0)^19,20. We resampled the images to a spacing of 1 mm × 1 mm × 1 mm prior to features extraction. One blinded investigator (OE, board-certified radiologist, 10 years of experience) tagged and segmented a maximum of two TLs per patient using the 24 h post-embolization CT after the first TACE. The tagged TLs were independently segmented by a second blinded investigator (SB, radiologist-in-training, 3.5 years of experience). Segmentation was performed as follows: a three-dimensional volume of interest (VOI) was manually drawn in the HCC-lesion, sparing equivocal border zones. The semi-automatic grow from seeds algorithm was used to augment the VOI to match the whole tumor habitat^20–22. Clear foci of segmentation error were manually erased using the brush-erase tool. A representative segmentation is shown in Fig. 2.

Workflow of the image analysis. (a) Baseline arterial-phase MRI showing mildly enhancing hepatocellular carcinoma. The 24 h post-TACE CT (b) was used to semi-automatically segment the lipiodol retention-pattern in three dimensions (c–d).

Radiomic analysis

We used PyRadiomics within 3D Slicer for radiomics features extraction^20,23. With default settings, all original standard features were extracted (n = 107) as described in prior studies²⁴. The radiomics quality score was 14 (https://radiomics.world/rqs, Supplementary Data S3)²⁵.

Inter-observer robustness and feature redundancy

The intra-class correlation coefficient (ICC) was calculated for each feature using ICC3 of the Pingouin package^24,26. ICC values were interpreted with thresholds commonly used in ICC-analysis, i.e. ICC 0.75–1 = excellent²⁴. We discarded all features with ICC < 0.75 (n = 8) (Supplementary Data S4 and S5). We inter-correlated the robust features by Pearson method and excluded all highly correlated (Pearson > 0.95) features (n = 52) (Supplementary Data S6).

Clinical benchmark

We calculated five different clinical scores for the assessment of the liver function in HCC and for TACE response prediction as described in Supplementary Data S7. The degree of TL’s hypervascularization was visually assessed by three independent raters (see Supplementary Data S7).

Imaging biomarker selection and model development

We describe the workflow of feature selection and model development in a scheme in Supplementary Data S1 and in detail in Supplementary Data S8. We performed all analysis in Python 3.7.6. We used StandardScaler²⁷ to scale the data to uniform variance. We used t-distributed stochastic neighbor embedding (t-SNE) plots to explore cluster distributions (scikit-learn²⁷). We split our dataset into 70% training and 30% testing on a patient level using GroupShuffleSplit²⁷. Fist, we assessed the lesion-wise response using seven different feature selection strategies and seven different machine learning models with hyperparameter optimization using Hyperopt²⁸ (see supplementary Data S8). Feature selection and model development was individually done for radiomics features, clinical scores and their combination. This approach ensured that the radiomics model was benchmark against clinical and combined models. The best working model was locked and transferred to predict the response on the patient-level. The selected features were used to train a random survival forest for overall survival prediction using Scikit-survival 0.16.1²⁹. The performance was assessed by the concordance-index. We used the lifelines package ³⁰ to build and compare Kaplan Meier curves. The WORC.statistics package³¹ was used for the DeLong’s test. For graphical illustrations Python 3.7.6. and Affinity Designer (Serif (Europe) Ltd) was used.

Ethical approval

Patient data used in this study was provided by the University Cancer Center Frankfurt (UCT). Written informed consent was obtained from all patients and the study was approved by the institutional Review Boards of the UCT and the Ethical Committee at the University Hospital Frankfurt (project-number: SGI-10–2020). All analysis were performed in accordance with relevant guidelines and regulations.

Results

Study population

Our dataset comprised 61 patients (mean age, 65.3 years ± 10.0 [SD]; 12 women) with 94 HCC mRECIST TLs. 38.3% (36/94) of the TLs and 41.0% (25/61) of the patients had response to TACE. We randomly drew 70% of the patients (n = 42, mean age, 66.1 years ± 10.3 [SD]) with 65 TLs as training and 30% of the patients (n = 19, mean age, 63.5 years ± 9.2 [SD]) with 29 TLs as independent testing set. Response to therapy was seen in 40.5% (17/42) training-patients (38.5% (25/65) training-TLs) and in 42.1% (8/19) testing-patients (37.9% (11/29) testing-TLs). We depict the overall survival for the complete cohort, training and testing sets in Supplementary Data S9. Patient demographic characteristics are shown in Table 1. Flow diagram of patient inclusion is shown in Fig. 1.

Table 1.

Clinical and epidemiological characteristics.

Demographic variables	All	Train	Test	p-value
Patients (n)	61	42	19
Sex, male (%)	49 (80.3)	35 (83.3)	14 (73.7)	0.380
Median age at diagnosis (years)	66 (37–86)	67 (37–86)	63 (50–86)	0.323
Median time diagnosis to TACE (days)	43 (3–978)	42 (3–874)	59 (17–978)	0.252
Median size of Target lesions (cm)	2.9 (1.0–10.1)	2.2 (1.0–9.8)	3.4 (1.0–10.1)	0.143
Cause of HCC (%)				0.968
Hepatitis B	9 (14.8)	6 (14.3)	3 (15.8)
Hepatitis C	14 (23.0)	10 (23.8)	4 (21.1)
Alcohol	19 (31.1)	14 (33.3)	5 (26.3)
NASH	5 (8.2)	3 (7.1)	2 (10.5)
Alcohol + viral Hepatitis	6 (9.8)	4 (9.5)	2 (10.5)
Alcohol + NASH	1 (1.6)	1 (2.4)	0 (0)
Others (cryptogenic cirrhosis, AIH)	7 (11.5)	4 (9.5)	3 (15.8)
BCLC prior to TACE				0.092
A	18 (29.5)	14 (33.3)	4 (21.1)
B	33 (54.1)	24 (57.1)	9 (47.4)
C	10 (16.4)	4 (9.5)	6 (31.6)
D	0 (0.0)	0 (0.0)	0 (0.0)
Child Pugh Score prior to TACE				0.644
A	36 (59.0)	25 (59.5)	11 (57.9)
B	6 (9.8)	5 (11.9)	1 (5.3)
C	0 (0.0)	0 (0.0)	0 (0.0)
N/A	19 (31.1)	12 (28.6)	7 (36.8)
MELD-Score prior to TACE				0.141
< 6	1 (1.6)	0 (0.0)	1 (5.3)
< 10	23 (37.7)	13 (31.0)	10 (52.6)
< 15	11 (18.0)	10 (23.8)	1 (5.3)
< 20	1 (1.6)	1 (2.4)	0 (0.0)
N/A	25 (41.0)	18 (42.9)	7 (36.8)
Median albumin (g/dl)	3.8 (1.8–7.2)	3.8 (1.8–7.2)	3.9 (3.0–4.5)	0.686
Median bilirubin (mg/dl)	0.8 (0.3–2.2)	1.1 (0.3–2.2)	0.9 (0.3–1.3)	0.540
Median INR	1.1 (0.9–3.3)	1.1 (0.9–3.3)	1.1 (1.0–1.8)	0.204
Median CRP (mg/dl)	0.4 (0.03–4.4)	0.5(0.03- 4.4)	0.3 (0.1–2.9)	0.632
Median AFP (ng/ml)	12.9 (2.1–60,500.0)	9.2 (2.1–60,500.0)	17.3 (2.2–9276.0)	0.615

Open in a new tab

The train and test set were statistically analyzed using the Pearson Chi-Square test or two-sided t-test for ordinal or continuous outcomes.

Interobserver robustness and feature redundancy

The mean intra-class correlation coefficient was 0.90 for all feature classes combined, ranging from 0.76 (± 0.41, ngtdm) to 0.98 (± 0.03, firstorder) (Supplementary Data S4). A set of 8 features (marked in bold in Supplementary Data S5) revealed ICC values < 0.75 (range: 0.04–0.74) and were excluded for further analysis. We intercorrelated the remaining robust features with Pearson metric to exclude 52 features due to redundancy. The final robust and non-redundant feature set consisted of 47 features (Supplementary Data S6).

Lesion-wise response characterization using dimensionality reduction

To assess the variance of radiomics and clinical features regarding the individual TL response to TACE, we used low dimensional embedding via t-SNE plots for each feature subset (radiomics, clinical features and their combination). Neither feature subset showed clear clusters of response (Supplementary Data S10). Therefore, we pursued our analysis with models of higher complexity.

Lesion-wise response prognostication: feature selection, model development and clinical benchmarking

The feature selection and model training were applied independently on three different feature subsets: (I) radiomics features, (II) clinical features or (III) their combination. We identified prognostic signatures for each subset (Table 2, Supplementary Data S8). For each subset, we validated the model on our hold-out test set to stratify the best working model using ROC AUC metrics. If models showed equal performance, we ranked models higher the less features they needed for the prediction. The best working radiomics model revealed a test AUC of 0.60 (train AUC = 1.00). The best clinical model reflected bias with a better test than train performance (train/ test AUC = 0.61/ 0.69). The combined clinical and radiomics model showed the best performance with test AUCs of 0.70 (train AUC = 0.96) (Fig. 3a, Table 3). This best performing combined model was a Random Forest Classifier which included the CT-derived radiomics feature LargeDependenceHighGrayLevelEmphasis and the clinical score mHAP_II_score_group. This final prognostic model was locked (Supplementary Data S8).

Table 2.

Feature subsets of different selection strategies.

Feature selection	Selected features
	Radiomics
RFE	Flatness, Sphericity, 10Percentile, Maximum, Skewness, Imc1, LargeDependenceHighGrayLevelEmphasis, LargeAreaEmphasis, LargeAreaLowGrayLevelEmphasis
RFA	LargeDependenceHighGrayLevelEmphasis
LASSO	Flatness, Minimum, Skewness, LargeAreaLowGrayLevelEmphasis
	clinical scores
RFE & RFA	mHAP_II_score_group
LASSO	mHAP_II_score_group, 6_and_12_group
	Combined features (clinical and radiomics)
Best combined	LargeDependenceHighGrayLevelEmphasis, mHAP_II_score_group

Open in a new tab

LASSO, least absolute shrinkage and selection operator; RFA, recursive feature addition; RFE, recursive feature elimination. See Supplementary Data S8 for more information.

Prediction of response and overall survival. (a, b) Receiver operating characteristics (ROC) curves trained and tested using the final combined feature set of the radiomics feature LargeDependenceHighGrayLevelEmphasis and the clinical score mHAP_II_score_group. (a) Lesion-wise prediction with class 1 describing the individual responding lesions according to mRECIST. (b) Patient-wise prediction with class 1 describing the overall response on the patient-level according to mRECIST, including the impact of non-target lesions and potential new-lesions. (c) Patient-wise overall survival prediction. Kaplan–Meier plot of two test-patients who showed the shortest (102 days) or longest (censored at 2043 days) survival. Kaplan–Meier estimator was based on our final model. Logrank-Test was used.

Table 3.

Classifier, feature selection strategy and performance of the best lesion-wise models.

Classifier	Selection	Train		Test
Classifier	Selection	Accuracy	AUC	Accuracy	AUC
Radiomics
GradientBoostingClassifier	RFE	1.000	1.000	0.517	0.596
ExtraTreesClassifier	RFA	0.738	0.881	0.586	0.674
AdaBoostClassifier	LASSO	1.000	1.000	0.586	0.551
Clinical scores
SVC	RFE & RFA	0.615	0.606	0.621	0.689
SGDClassifier	LASSO	0.615	0.618	0.621	0.649
Combined features
RandomForestClassifier	Best combined	0.862	0.957	0.724	0.705

Open in a new tab

AUC, area under the curve; LASSO, least absolute shrinkage and selection operator; RFA, recursive feature addition; RFE, recursive feature elimination. See Supplementary Data S8 for more information.

Patient-wise response prognostication: model transferability and prognostication of overall survival

We transferred our locked lesion-wise model on a patient-wise level. Response to TACE was defined according to mRECIST including the effect of non-target or potentially new lesions. The model demonstrated a prognostic performance of AUC test = 0.62, CI = 0.41–0.83 (AUC train = 0.77, CI = 0.68–0.91) (Fig. 3b). We transferred the combined CT-derived and clinical two-feature set to test the prognostication of overall survival. The model yielded a C-index of 0.67 (C-index train = 0.71) for overall survival prognostication compared to a C-index of 0.58 (train: 0.70) or 0.60 (train: 0.60) using only the single clinical or single radiomic feature. Finally, we selected two test-patients who showed the shortest (102 days) or longest (last living contact at 2043 days) survival to estimate their individual survival using the Kaplan–Meier estimator based on our final model. We computed the risk score that represents the expected number of events for a particular terminal node in the forest for the respective test patients. The patient with short survival yielded a higher risk score (26.89) than the patient with long overall survival (23.55). We depict the predicted Kaplan–Meier plot of the two patients in Fig. 3c which revealed significant difference in the logrank-Test (p = 0.006).

Discussion

In this study, we assessed the utility of machine learning models in predicting response to repetitive TACE in HCC patients. We used Lipiodol-retention radiomics of the first post-TACE control CT as imaging biomarker. We applied multiple feature selection strategies to train a multitude of machine learning models with exhaustive hyperparameter optimization to stratify tumor lesions’ response to TACE. We transferred our lesion-wise model to a patient-level and corroborated our findings by overall survival prognostication. We demonstrated the model’s ability to denote tumor risk scores associated with shorter or longer overall survival. CT-derived features were benchmarked against clinical risk scores and the best working model consisted of the combination of the single radiomics feature LargeDependenceHighGrayLevelEmphasis and the single clinical risk score mHAP_II_score_group.

HCC hallmark imaging characteristics (arterial hyperenhancement with portal venous/ delayed wash-out) and mRECIST assessment of viable tumour components are well established, especially in patients treated with TACE². Recent studies aimed to stratify imaging biomarkers extracted from pre-treatment contrast-enhanced imaging to build predictive models for HCC TACE response^11–14. The studies tended to build holistic nomograms including imaging and clinical features and yielded promising predictive performances of overall survival ranging from C-indices of 0.70 to 0.77 which are in a similar range to our results^11,13,14. Kuang et al. yielded lesion-wise mRECIST response predictions of AUC approx. 0.81 using pre-treatment MRI and clinical data¹². No patient-wise or survival analysis was done and it remained unclear how many TACE were applied prior to the analysis¹². We followed a more stringent approach by building a model starting at a lesion-wise prediction, transferring the model to a patient-wise level and finally to overall survival. Further, arterial-phase imaging might suffer from reduced image quality due to artifacts or poor arterial phase capture. This might limit the development of robust AI models as they add noise to a system which already suffers from robustness deficiencies even in an experimental setting^24,32,33. In line with prior studies ^15,16, our results promote the potential of lipiodol deposits to serve as imaging biomarker. Miszczuk et al. ¹⁶ prospectively enrolled 39 liver cancer patients (n = 22, HCC) treated with TACE and they could show, that high Lipiodol coverage on the 24-h post-TACE CT was associated with response to therapy. Lipiodol retention may serve as a surrogate for arterial hyperenhancement ¹⁶, the vascularization pattern of HCC lesions might have prognostic impact ³⁴ and our results provide quantitative corroboration of these findings. In our model, the GLDM feature LargeDependenceHighGrayLevelEmphasis, which depends on higher gray-level values (https://pyradiomics.readthedocs.io/), had the highest predictive impact. This is in line with Brancato et al.³⁵ who predicted histological HCC grade by means of radiomics. The feature LargeDependenceHighGrayLevelEmphasis was contributing to the most powerful model to differentiate histological grade 1 versus grade 3 tumors³⁵ emphasizing the feature’s potential to serve as imaging biomarker for HCC aggressiveness. The current ESMO clinical practice guidelines for hepatocellular carcinoma² do not recommend the use of prognostic scores for treatment algorithms outside clinical trials and they describe only the hepatoma arterial-embolisation prognostic (HAP) score as potential stratification tool for TACE in the future². This is in line with the results of our study as the best performing clinical scores revealed biased train-/test results. Nevertheless, the holistic model combining LargeDependenceHighGrayLevelEmphasis with the modified HAP-II⁶ score improved the models’ performances and established the best working model. Our study has limitations that warrant discussion. The retrospective nature of our study might impose selection bias. With 61 patients and 94 lesions our study population is rather small which might lower generalizability, but our cohort is very homogenous only including patients with histologically confirmed HCC, a total of three TACE prior to response assessment and usage of the same chemotherapeutic agent in each patient. In approx. 20% of patients additional degradable starch microspheres (EmboCept®S, PharmaCept GmbH, Berlin, Germany) were given which might have altered the retention in our standard-of-care real-world population. We leveraged a multitude of feature selection and classification strategies, nevertheless various degrees of overfitting were present in some models. Though we resampled the images to a spacing of 1 × 1 × 1 mm, we used standard-of-care imaging to develop our models with post-embolization CTs with originally 5 mm slice thickness and availability of true 1 mm reconstructions would have been favorable.

In conclusion, radiographic features derived from standard-of-care 24 h post-embolization CT have the potential to serve as imaging biomarkers for prognostication of response to TACE in HCC patients. Imaging biomarkers and clinical risk scores seem to incorporate complementary prognostic information and a combined final model of a clinical risk score and a single radiomics feature revealed the best performance. This emerging approach might pave the way to aiding clinical decision making in a clinical domain currently dominated by subjective expert consensus. Such tools might enable the more accurate stratification of patients for personalized healthcare avoiding potential adverse events in patients who most likely won’t respond to TACE.

Supplementary Information

Supplementary Information.^{(2.6MB, docx)}

Abbreviations

CR: Complete response
DART: Dropouts meet multiple additive regression trees
DICOM: Digital Imaging and Communications in Medicine
GBDT: Gradient boosting decision tree
GLCM: Gray level co-occurrence matrix
GLRLM: Gray level run length matrix
GLSZM: Gray level size zone matrix
GLDM: Gray level dependence matrix
GOSS: Gradient-based one-side sampling
HCC: Hepatocellular carcinoma
ICC: Intra-class correlation coefficient
LASSO: Least absolute shrinkage and selection operator
mRECIST: Modified Response Evaluation Criteria in Solid Tumors
NGTDM: Neighboring gray tone difference matrix
NTL: Non-target lesion
PD: Progressive disease
PR: Partial response
OS: Overall survival
RF: Random forest
RFA: Recursive feature addition
RFE: Recursive feature elimination
SD: Stable disease
SGD: Stochastic gradient descent
STARD: Standards for Reporting Diagnostic Accuracy Studies
SVC: Support vector classifier
TACE: Transarterial chemoembolization
TL: Target lesion
T-SNE: T-distributed stochastic neighbor embedding
VOI: Volume of interest

Author contributions

All authors contributed to the study conception and design. Material preparation, data collection and analysis were performed by S.B., O.E., J.A., J.E.S., Y.H., I.K., K.B., K.E., T.G.R., T.J.V., D.P.dS. and S.M. Interpretation of data was performed by S.B., O.E., J.A., J.E.S., M.N.K., A.S., I.K., K.E., T.G.R., T.J.V., D.P.dS. and S.M. The first draft of the manuscript was written by S.B., O.E., J.A., T.J.V. and SM and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript and agreed to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated, resolved and the resolution documented in the literature. Patients signed informed consent regarding publishing their data and photographs.

Funding

Open Access funding enabled and organized by Projekt DEAL. The authors declare that no funds, grants, or other support were received during the preparation of this manuscript.

Data availability

The datasets generated and/or analysed during the current study are not publicly available due to privacy regulations but are available from the corresponding author on reasonable request.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-023-27714-0.

References

1.Sung H, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA. Cancer J. Clin. 2021;71:209–249. doi: 10.3322/caac.21660. [DOI] [PubMed] [Google Scholar]
2.Vogel A, et al. Hepatocellular carcinoma: ESMO Clinical Practice Guidelines for diagnosis, treatment and follow-up. Ann. Oncol. 2018;29:iv238–iv55. doi: 10.1093/annonc/mdy308. [DOI] [PubMed] [Google Scholar]
3.Lurje I, et al. Treatment strategies for hepatocellular carcinoma—A multidisciplinary approach. Int. J. Mol. Sci. 2019;20:1–27. doi: 10.3390/ijms20061465. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Johnson PJ, et al. Assessment of liver function in patients with hepatocellular carcinoma: A new evidence-based approach—The ALBI grade. J. Clin. Oncol. 2015;33:550–558. doi: 10.1200/JCO.2014.57.9151. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Kadalayil L, et al. A simple prognostic scoring system for patients receiving transarterial embolisation for hepatocellular cancer. Ann. Oncol. 2013;24:2565–2570. doi: 10.1093/annonc/mdt247. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Park Y, et al. Addition of tumor multiplicity improves the prognostic performance of the hepatoma arterial-embolization prognostic score. Liver Int. 2016;36:100–107. doi: 10.1111/liv.12878. [DOI] [PubMed] [Google Scholar]
7.Hucke F, et al. How to STATE suitability and START transarterial chemoembolization in patients with intermediate stage hepatocellular carcinoma. J. Hepatol. 2014;61:1287–1296. doi: 10.1016/j.jhep.2014.07.002. [DOI] [PubMed] [Google Scholar]
8.Wang Q, et al. Development of a prognostic score for recommended TACE candidates with hepatocellular carcinoma: A multicentre observational study. J. Hepatol. 2019;70:893–903. doi: 10.1016/j.jhep.2019.01.013. [DOI] [PubMed] [Google Scholar]
9.Gillies RJ, Kinahan PE, Hricak H. Radiomics: Images are more than pictures. They Are Data. Radiol. 2016;278:563–577. doi: 10.1148/radiol.2015151169. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Trebeschi S, et al. Predicting response to cancer immunotherapy using noninvasive radiomic biomarkers. Ann. Oncol. 2019;30:998–1004. doi: 10.1093/annonc/mdz108. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Dai Y, et al. noninvasive imaging evaluation based on computed tomography of the efficacy of initial transarterial chemoembolization to predict outcome in patients with hepatocellular carcinoma. J. Hepatocell. Carcinoma. 2022;9:273. doi: 10.2147/JHC.S351077. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Kuang Y, et al. MRI-based radiomics: Nomograms predicting the short-term response after transcatheter arterial chemoembolization (TACE) in hepatocellular carcinoma patients with diameter less than 5 cm. Abdom. Radiol. 2021;46:3772–3789. doi: 10.1007/s00261-021-02992-2. [DOI] [PubMed] [Google Scholar]
13.Li L, et al. Radiomics signature: A potential biomarker for the prediction of survival in advanced hepatocellular carcinoma. Int. J. Med. Sci. 2021;18:2276–2284. doi: 10.7150/ijms.55510. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Meng XP, et al. Radiomics analysis on multiphase contrast-enhanced CT: A survival prediction tool in patients with hepatocellular carcinoma undergoing transarterial chemoembolization. Front. Oncol. 2020;10:1–12. doi: 10.3389/fonc.2020.01196. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Chen CS, et al. Tumor vascularity and lipiodol deposition as early radiological markers for predicting risk of disease progression in patients with unresectable hepatocellular carcinoma after transarterial chemoembolization. Oncotarget. 2016;7(6):7241. doi: 10.18632/oncotarget.6892. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Miszczuk MA, et al. Lipiodol as an imaging biomarker of tumor response after conventional transarterial chemoembolization: Prospective clinical validation in patients with primary and secondary liver cancer. Transl. Oncol. 2020;13:100742. doi: 10.1016/j.tranon.2020.01.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Vogl TJ, et al. Evaluation of two different transarterial chemoembolization protocols using Lipiodol and degradable starch microspheres in therapy of hepatocellular carcinoma: A prospective trial. Hepatol. Int. 2021;15:685–694. doi: 10.1007/s12072-021-10193-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Lencioni R, Llovet MJ. Modified RECIST (mRECIST) assessment for hepatocellular carcinoma. Semin. Liver Dis. 2010;30:52–60. doi: 10.1055/s-0030-1247132. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Kumar V, et al. Radiomics: The process and the challenges. Magn. Reson. Imaging. 2012;30:1234–1248. doi: 10.1016/j.mri.2012.06.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Fedorov A, et al. 3D slicer as an image computing platform for the quantitative imaging network. Magn. Reson. Imaging. 2012;30:1323–1341. doi: 10.1016/j.mri.2012.05.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Parmar C, et al. Robust radiomics feature quantification using semiautomatic volumetric segmentation. PLoS ONE. 2014;9:1–8. doi: 10.1371/journal.pone.0102107. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Velazquez ER, et al. Volumetric CT-based segmentation of NSCLC using 3D-Slicer. Sci. Rep. 2013;3:1–7. doi: 10.1038/srep03529. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Van Griethuysen JJM, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res. 2017;77:e104–e107. doi: 10.1158/0008-5472.CAN-17-0339. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Bernatz S, et al. Impact of rescanning and repositioning on radiomic features employing a multi-object phantom in magnetic resonance imaging. Sci. Rep. 2021;11:1–13. doi: 10.1038/s41598-021-93756-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Lambin P, et al. Radiomics: The bridge between medical imaging and personalized medicine. Nat. Rev. Clin. Oncol. 2017;14:749–762. doi: 10.1038/nrclinonc.2017.141. [DOI] [PubMed] [Google Scholar]
26.Vallat R. Pingouin: Statistics in Python. J. Open Source Softw. 2018;3:1026. doi: 10.21105/joss.01026. [DOI] [Google Scholar]
27.Pedregosa F, et al. Scikit-learn : Machine learning in Python. J. Mach. Learn. Res. 2011;12:2825–2830. [Google Scholar]
28.Bergstra, J., Yamis, D. & Cox, D. D. Making a science of model search: Hyperparameter optimization in hundreds of dimensions for vision architectures. In TProc. 30th Int. Conf. Mach. Learn. (ICML 2013) I-115–I–23.
29.Pölsterl S. scikit-survival: A library for time-to-event analysis built on top of scikit-learn. J. Mach. Learn. Res. 2020;21(212):1–6. [Google Scholar]
30.Davidson-Pilon C. lifelines: Survival analysis in Python. J. Open Source Softw. 2019;4:1317. doi: 10.21105/joss.01317. [DOI] [Google Scholar]
31.Starmans, M. P. A. et al. Reproducible radiomics through automated machine learning validated on twelve clinical applications. arXiv (2021). doi:10.48550/arXiv.2108.08618.
32.Berenguer R, Pastor-juan MR, Canales-vázquez J. Radiomics of CT features may be nonreproducible and redundant: Influence of CT acquisition parameters. Radiology. 2018;288:407–415. doi: 10.1148/radiol.2018172361. [DOI] [PubMed] [Google Scholar]
33.Baeßler B, Weiss K, Dos Santos DP. Robustness and reproducibility of radiomics in magnetic resonance imaging: A phantom study. Invest. Radiol. 2019;54:221–228. doi: 10.1097/RLI.0000000000000530. [DOI] [PubMed] [Google Scholar]
34.Hasdemir DB, Schweitzer N, Meyer BC, Wacker F, Rodt T. Evaluation of CT vascularization patterns for survival prognosis in patients with hepatocellular carcinoma treated by conventional TACE. Diagn. Interv. Radiol. 2017 doi: 10.5152/dir.2016.16006. [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Brancato V, Garbino N, Salvatore M, Cavaliere C. MRI-based radiomic features help identify lesions and predict histopathological grade of Hepatocellular carcinoma. Diagnostics. 2022;12(5):1085. doi: 10.3390/diagnostics12051085. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information.^{(2.6MB, docx)}

Data Availability Statement

The datasets generated and/or analysed during the current study are not publicly available due to privacy regulations but are available from the corresponding author on reasonable request.

[CR1] 1.Sung H, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA. Cancer J. Clin. 2021;71:209–249. doi: 10.3322/caac.21660. [DOI] [PubMed] [Google Scholar]

[CR2] 2.Vogel A, et al. Hepatocellular carcinoma: ESMO Clinical Practice Guidelines for diagnosis, treatment and follow-up. Ann. Oncol. 2018;29:iv238–iv55. doi: 10.1093/annonc/mdy308. [DOI] [PubMed] [Google Scholar]

[CR3] 3.Lurje I, et al. Treatment strategies for hepatocellular carcinoma—A multidisciplinary approach. Int. J. Mol. Sci. 2019;20:1–27. doi: 10.3390/ijms20061465. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR4] 4.Johnson PJ, et al. Assessment of liver function in patients with hepatocellular carcinoma: A new evidence-based approach—The ALBI grade. J. Clin. Oncol. 2015;33:550–558. doi: 10.1200/JCO.2014.57.9151. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR5] 5.Kadalayil L, et al. A simple prognostic scoring system for patients receiving transarterial embolisation for hepatocellular cancer. Ann. Oncol. 2013;24:2565–2570. doi: 10.1093/annonc/mdt247. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR6] 6.Park Y, et al. Addition of tumor multiplicity improves the prognostic performance of the hepatoma arterial-embolization prognostic score. Liver Int. 2016;36:100–107. doi: 10.1111/liv.12878. [DOI] [PubMed] [Google Scholar]

[CR7] 7.Hucke F, et al. How to STATE suitability and START transarterial chemoembolization in patients with intermediate stage hepatocellular carcinoma. J. Hepatol. 2014;61:1287–1296. doi: 10.1016/j.jhep.2014.07.002. [DOI] [PubMed] [Google Scholar]

[CR8] 8.Wang Q, et al. Development of a prognostic score for recommended TACE candidates with hepatocellular carcinoma: A multicentre observational study. J. Hepatol. 2019;70:893–903. doi: 10.1016/j.jhep.2019.01.013. [DOI] [PubMed] [Google Scholar]

[CR9] 9.Gillies RJ, Kinahan PE, Hricak H. Radiomics: Images are more than pictures. They Are Data. Radiol. 2016;278:563–577. doi: 10.1148/radiol.2015151169. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR10] 10.Trebeschi S, et al. Predicting response to cancer immunotherapy using noninvasive radiomic biomarkers. Ann. Oncol. 2019;30:998–1004. doi: 10.1093/annonc/mdz108. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR11] 11.Dai Y, et al. noninvasive imaging evaluation based on computed tomography of the efficacy of initial transarterial chemoembolization to predict outcome in patients with hepatocellular carcinoma. J. Hepatocell. Carcinoma. 2022;9:273. doi: 10.2147/JHC.S351077. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR12] 12.Kuang Y, et al. MRI-based radiomics: Nomograms predicting the short-term response after transcatheter arterial chemoembolization (TACE) in hepatocellular carcinoma patients with diameter less than 5 cm. Abdom. Radiol. 2021;46:3772–3789. doi: 10.1007/s00261-021-02992-2. [DOI] [PubMed] [Google Scholar]

[CR13] 13.Li L, et al. Radiomics signature: A potential biomarker for the prediction of survival in advanced hepatocellular carcinoma. Int. J. Med. Sci. 2021;18:2276–2284. doi: 10.7150/ijms.55510. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR14] 14.Meng XP, et al. Radiomics analysis on multiphase contrast-enhanced CT: A survival prediction tool in patients with hepatocellular carcinoma undergoing transarterial chemoembolization. Front. Oncol. 2020;10:1–12. doi: 10.3389/fonc.2020.01196. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR15] 15.Chen CS, et al. Tumor vascularity and lipiodol deposition as early radiological markers for predicting risk of disease progression in patients with unresectable hepatocellular carcinoma after transarterial chemoembolization. Oncotarget. 2016;7(6):7241. doi: 10.18632/oncotarget.6892. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR16] 16.Miszczuk MA, et al. Lipiodol as an imaging biomarker of tumor response after conventional transarterial chemoembolization: Prospective clinical validation in patients with primary and secondary liver cancer. Transl. Oncol. 2020;13:100742. doi: 10.1016/j.tranon.2020.01.003. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR17] 17.Vogl TJ, et al. Evaluation of two different transarterial chemoembolization protocols using Lipiodol and degradable starch microspheres in therapy of hepatocellular carcinoma: A prospective trial. Hepatol. Int. 2021;15:685–694. doi: 10.1007/s12072-021-10193-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR18] 18.Lencioni R, Llovet MJ. Modified RECIST (mRECIST) assessment for hepatocellular carcinoma. Semin. Liver Dis. 2010;30:52–60. doi: 10.1055/s-0030-1247132. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR19] 19.Kumar V, et al. Radiomics: The process and the challenges. Magn. Reson. Imaging. 2012;30:1234–1248. doi: 10.1016/j.mri.2012.06.010. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR20] 20.Fedorov A, et al. 3D slicer as an image computing platform for the quantitative imaging network. Magn. Reson. Imaging. 2012;30:1323–1341. doi: 10.1016/j.mri.2012.05.001. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR21] 21.Parmar C, et al. Robust radiomics feature quantification using semiautomatic volumetric segmentation. PLoS ONE. 2014;9:1–8. doi: 10.1371/journal.pone.0102107. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR22] 22.Velazquez ER, et al. Volumetric CT-based segmentation of NSCLC using 3D-Slicer. Sci. Rep. 2013;3:1–7. doi: 10.1038/srep03529. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR23] 23.Van Griethuysen JJM, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res. 2017;77:e104–e107. doi: 10.1158/0008-5472.CAN-17-0339. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR24] 24.Bernatz S, et al. Impact of rescanning and repositioning on radiomic features employing a multi-object phantom in magnetic resonance imaging. Sci. Rep. 2021;11:1–13. doi: 10.1038/s41598-021-93756-x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR25] 25.Lambin P, et al. Radiomics: The bridge between medical imaging and personalized medicine. Nat. Rev. Clin. Oncol. 2017;14:749–762. doi: 10.1038/nrclinonc.2017.141. [DOI] [PubMed] [Google Scholar]

[CR26] 26.Vallat R. Pingouin: Statistics in Python. J. Open Source Softw. 2018;3:1026. doi: 10.21105/joss.01026. [DOI] [Google Scholar]

[CR27] 27.Pedregosa F, et al. Scikit-learn : Machine learning in Python. J. Mach. Learn. Res. 2011;12:2825–2830. [Google Scholar]

[CR28] 28.Bergstra, J., Yamis, D. & Cox, D. D. Making a science of model search: Hyperparameter optimization in hundreds of dimensions for vision architectures. In TProc. 30th Int. Conf. Mach. Learn. (ICML 2013) I-115–I–23.

[CR29] 29.Pölsterl S. scikit-survival: A library for time-to-event analysis built on top of scikit-learn. J. Mach. Learn. Res. 2020;21(212):1–6. [Google Scholar]

[CR30] 30.Davidson-Pilon C. lifelines: Survival analysis in Python. J. Open Source Softw. 2019;4:1317. doi: 10.21105/joss.01317. [DOI] [Google Scholar]

[CR31] 31.Starmans, M. P. A. et al. Reproducible radiomics through automated machine learning validated on twelve clinical applications. arXiv (2021). doi:10.48550/arXiv.2108.08618.

[CR32] 32.Berenguer R, Pastor-juan MR, Canales-vázquez J. Radiomics of CT features may be nonreproducible and redundant: Influence of CT acquisition parameters. Radiology. 2018;288:407–415. doi: 10.1148/radiol.2018172361. [DOI] [PubMed] [Google Scholar]

[CR33] 33.Baeßler B, Weiss K, Dos Santos DP. Robustness and reproducibility of radiomics in magnetic resonance imaging: A phantom study. Invest. Radiol. 2019;54:221–228. doi: 10.1097/RLI.0000000000000530. [DOI] [PubMed] [Google Scholar]

[CR34] 34.Hasdemir DB, Schweitzer N, Meyer BC, Wacker F, Rodt T. Evaluation of CT vascularization patterns for survival prognosis in patients with hepatocellular carcinoma treated by conventional TACE. Diagn. Interv. Radiol. 2017 doi: 10.5152/dir.2016.16006. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR35] 35.Brancato V, Garbino N, Salvatore M, Cavaliere C. MRI-based radiomic features help identify lesions and predict histopathological grade of Hepatocellular carcinoma. Diagnostics. 2022;12(5):1085. doi: 10.3390/diagnostics12051085. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

CT-radiomics and clinical risk scores for response and overall survival prognostication in TACE HCC patients

Simon Bernatz

Oleg Elenberger

Jörg Ackermann

Lukas Lenga

Simon S Martin

Jan-Erik Scholtz

Vitali Koch

Leon D Grünewald

Yannis Herrmann

Maximilian N Kinzler

Angelika Stehle

Ina Koch

Stefan Zeuzem

Katrin Bankov

Claudia Doering

Henning Reis

Nadine Flinner

Falko Schulze

Peter J Wild

Renate Hammerstingl

Katrin Eichler

Tatjana Gruber-Rouh

Thomas J Vogl

Daniel Pinto dos Santos

Scherwin Mahmoudi

Abstract

Introduction

Methods

Study design

Figure 1.

Conventional TACE

Assessment of tumor response

Image segmentation and preprocessing

Figure 2.

Radiomic analysis

Inter-observer robustness and feature redundancy

Clinical benchmark

Imaging biomarker selection and model development

Ethical approval

Results

Study population

Table 1.

Interobserver robustness and feature redundancy

Lesion-wise response characterization using dimensionality reduction

Lesion-wise response prognostication: feature selection, model development and clinical benchmarking

Table 2.

Figure 3.

Table 3.

Patient-wise response prognostication: model transferability and prognostication of overall survival

Discussion

Supplementary Information

Abbreviations

Author contributions

Funding

Data availability

Competing interests

Footnotes

Supplementary Information

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases