Skip to main content
The British Journal of Radiology logoLink to The British Journal of Radiology
. 2021 Jul 8;94(1124):20210525. doi: 10.1259/bjr.20210525

A prediction model for degree of differentiation for resectable locally advanced esophageal squamous cell carcinoma based on CT images using radiomics and machine-learning

Daisuke Kawahara 1,, Yuji Murakami 1, Shigeyuki Tani 2, Yasushi Nagata 1,3,1,3
PMCID: PMC8764921  PMID: 34235955

Abstract

Objective:

To propose the prediction model for degree of differentiation for locally advanced esophageal cancer patients from the planning CT image by radiomics analysis with machine learning.

Methods:

Data of 104 patients with esophagus cancer, who underwent chemoradiotherapy followed by surgery at the Hiroshima University hospital from 2003 to 2016 were analyzed. The treatment outcomes of these tumors were known prior to the study. The data were split into 3 sets: 57/16 tumors for the training/validation and 31 tumors for model testing. The degree of differentiation of squamous cell carcinoma was classified into two groups. The first group (Group I) was a poorly differentiated (POR) patients. The second group (Group II) was well and moderately differentiated patients. The radiomics feature was extracted in the tumor and around the tumor regions. A total number of 3480 radiomics features per patient image were extracted from radiotherapy planning CT scan. Models were built with the least absolute shrinkage and selection operator (LASSO) logistic regression and applied to the set of candidate predictors. The radiomics features were used for the input data in the machine learning. To build predictive models with radiomics features, neural network classifiers was used. The precision, accuracy, sensitivity by generating confusion matrices, the area under the curve (AUC) of receiver operating characteristic curve were evaluated.

Results:

By the LASSO analysis of the training data, we found 13 radiomics features from CT images for the classification. The accuracy of the prediction model was highest for using only CT radiomics features. The accuracy, specificity, and sensitivity of the predictive model were 85.4%, 88.6%, 80.0%, and the AUC was 0.92.

Conclusion:

The proposed predictive model showed high accuracy for the classification of the degree of the differentiation of esophagus cancer. Because of the good prediction ability of the method, the method may contribute to reducing the pathological examination by biopsy and predicting the local control.

Advances in knowledge:

For esophageal cancer, the differentiation of degree is the import indexes reflecting the aggressiveness. The current study proposed the prediction model for the differentiation of degree with radiomics analysis.

Introduction

Esophageal cancer has been high mortality and incidence rate for many countries.1 In Asia, the most common type of esophageal cancer is differentiated squamous cell carcinoma (SCC).2 Neoadjuvant chemoradiotherapy (NCRT) followed by surgery has become the standard treatment modality for patients with resectable locally advanced esophageal cancer.3 The past clinical studies of NCRT for esophagus cancer reported that the local control predicts better disease control and survival outcomes.4 For NCRT, the predictive models of treatment response can provide crucial information before surgery. Liu et al reported the TNM stage and differentiation degree has been the import indexes reflecting the aggressiveness of the esophagus cancer.5

The important indexes reflecting the aggressiveness of esophageal cancer have been evaluated with endoscopic ultrasonography (EUS), and endoscopic biopsy preoperatively. The EUS was useful for the classification of the T staging, and endoscopic biopsy pre-operatively could evaluate differentiation degree.6 Levine et al investigated the safety experience of systematic endoscopic biopsy protocols for patients with Barrett’s esophagus.7 The rate of complication was 1.6% for all patients, which was higher than approximate 0.5% of complication rate for routine diagnostic endoscopy.8 Recently, the introduction of radiomics and artificial intelligence (AI) into medicine has become a major topic. Radiomics is the science that systematically handles a large amount of image information in radiology. By combining radiomics and AI, attempts have been made to automatically perform image diagnosis and predict the malignancy of cancer or treatment results.9 Thus, radiomics can be utilized to avoid undesirable complications caused by biopsy.

The purpose of this study is to propose the prediction model for the degree of differentiation for locally advanced esophageal cancer patients from the planning CT image by radiomics analysis with machine learning.

Methods and materials

The process of the radiomics analysis with machine learning is shown in

A) Patients

All patients who underwent the surgery after the NCRT from 2003 to 2016 at Hiroshima University hospital were used for the analysis; a histologically confirmed thoracic esophageal or esophagogastric junction (EGJ) cancer; age ≤75 years; a performance status of 0 to 2 according to the World Health Organization scale; a histologically confirmed thoracic esophageal or esophagogastric junction (EGJ) cancer. All patients provided written informed consent for treatment. Our institutional review board approved this retrospective study (E-1656–1). The characteristics of the patients and their tumors are presented in Table 1.

Table 1.

Patient characteristics

Gender M/F 88/16
Age Median (range) 66 (34–78)
PS 0/1/2 24/49/31
Site Ut/Mt/Lt-Ae 24/49/27
Stage IB/II/III/IV 6/24/59/15

B) Image acquisition

The workflow of the predicting model was shown in Figure 1. CT imaging of a patient was performed during free breathing on a CT scanner (Lightspeed RT16, GE Healthcare, Little Chalfont, UK). The slice thickness and slice interval were 2.5 mm. A tube voltage of 120 kVp and a current of 100–600 mA was used for imaging acquisition.

Figure 1.

Figure 1.

The process of the radiomics analysis and create prediction model.

C) Contouring

Radiotherapy was performed with three-dimensional radiotherapy treatment planning for all patients. The primary tumor was defined as “tumor”, and the tumor with a 5 mm margin in all directions was defined as “tumor + 5 mm”. The shell feature was extracted from the voxels around the tumor boundaries, which was defined as the region between tumor + 5 mm and tumor the current study. The inner tumor feature was extracted from the voxels excluded the tumor boundaries. It was defined as the region of the tumor excluding outer 2 mm. All of the segmentations for radiomics analysis except for primary tumor were extended or reduced from the primary tumor. The primary tumor was segmented by one or two radiation oncologists who include one expert radiation oncologist at least. After the segmentation, the other expert radiation oncologist confirmed the segmentation. Moreover, two analyzers confirmed the segmentation before and after the deformable image registration.

F) Radiomics analysis

The extracted radiomics feature was standardized using z-score normalization so that each feature has the same mean of 0 and a standard deviation of 1 as follows:

fx=x-μxσx (1)

where f(x) and x are the standardized and original voxel intensities, respectively, and μx is the mean standard deviation (SD) and σx is the SD of the image intensity values per case, respectively.

The entire data set of the CT was analyzed to extract a number of textural features from the segmentations for the radiotherapy plans. The feature extraction was performed using an open-source package in Python, Pyradiomics software.10 A detailed list of the radiomics features is shown in Supplementary Material 1.

Supplementary Material 1.

A list of 50 quantitative features including first-order features: 21 features, shape features: 13 features, texture analysis features such as Gray Level Size Zone Matrix (GLSZM) features, and Features Gray Level Run Length Matrix (GLRLM) features: 93 features. were extracted. Moreover, we performed the radiomics analysis for the CT image with the imaging filters. These are preprocessed with a wavelet imaging filter. The wavelet filter has low-pass (L) and high-pass (H) filters. The decompositions are constructed in x, y, and z-direction. For example, HLL is then interpreted as the high-pass sub band, resulting from directional filtering of X with a high-pass filter along the x-direction, a low-pass filter along y-direction and a low-pass filter along the z-direction. In the current study, eight decompositions wavelet-HLL (wavelet-LHL, wavelet-LHH, wavelet-LLH, wavelet-HLH, wavelet-HHH, wavelet-HHL, wavelet-LLL) were performed. Each feature was computed with each of the above-mentioned preprocessing steps separately. From above, a total of 837 features were analyzed for each segmentation in this study.

G) Prediction model

To prevent overfitting, the least absolute shrinkage and selection operator (LASSO) regression model, which is suitable for the regression of high-dimensional data, was used with MATLAB code.11,12 The LASSO performs feature selection during model construction by penalizing the respective regression coefficients. As this penalty is increased, more regression coefficients shrink to zero resulting in a more regularized model. The most significant predictive features were selected from among all the candidate features in the training set.

The objective of this study was to stratify patients into two labeled classes using different ML classifiers. In this regard, POR patients were labeled as 1, and well and moderately differentiated patients were labeled as 0. This issue also was repeated to classify patients with stages. Machine-learning based classification was performed with neural-network (NN) method. The NN method was built with 10 hidden layers and rectified linear unit activation (ReLU), as implemented in the MATLAB program. Moreover, the performance of the prediction with NN method was evaluated by comparing with conventionally used models: Classification and Regression Tree (CART), Support Vector Machine (SVM), and K-nearest Neighbor (KNN), as implemented in the MATLAB program.

The radiomics analysis can deal with the texture features with the wavelet filter in addition to the traditional semantic features such as the shape and histogram analysis. The usability of the prediction model using all radiomics features was evaluated by comparing with the prediction model using the traditional semantic features. The traditional semantic features used the shape and histogram methods, which were listed in Supplementary Material 1. The prediction model was built without the feature selection and the machine learning was used the NN method.

Here, all patients were randomly partitioned into a training/validation data set (57/16 patients) for generation of the model, and a test data set (31 patients) for the testing model performance. As shown Figure 2, the training–validation–testing processes were repeated five times for the fivefold cross-validation for all ML-methods. The predictive performance of all models was evaluated using the area under the curve (AUC) of receiver operator characteristic (ROC).

Figure 2.

Figure 2.

Generation of the prediction model with the training/validation dataset and testing with the test data set. The fivefold cross-validation was applied for robust estimation.

Results

A total of 3480 features were extracted from CT image and were finally reduced to 11 features with the LASSO regression model, as shown in Figure 3. Table 2 shows the list of the selected features by LASSO regression. 21 features were selected with the wavelet filters and one feature was selected without an imaging filter. The features were extracted from various segmentations: three features from the tumor, two features from the tumor + 5 mm, three features from the shell region, two features from the inner region. nine features were selected from the texture analysis of GLCM, NGTDM, GLRLM, and GLDM. two features were selected from the first -order analysis.

Figure 3.

Figure 3.

Radiomics feature selection using the Lasso logistic regression model. (a) Tuning penalization parameter λ (λ) and minimum criterion in the LASSO model. The binomial deviance was plotted vs log (λ). (b) Lasso coefficient profiles of the 3480 radiomics features. The green line showed the optimal λ in the LASSO method with the least partial likelihood deviance. LASSO, least absolute shrinkage and selection operator.

Table 2.

Selected the features by the LASSO regression

Region Filter Category Item
Tumor wevelet-LHL NGTDM Contrast
Tumor wavelet-HLH GLDM Large Dependence Low Gray Level Emphasis
Tumor wavelet-HHL Firstorder Mean
Tumor +5 mm wavelet-HLL GLRLM Run Variance
Tumor +5 mm original GLCM Idn
Inner wavelet-HLH GLDM Large Dependence Low Gray Level Emphasis
Inner wavelet-HLH GLCM MaximumProbability
Shell wavelet-HLL GLCM Joint Average
Shell Wavelet-LHL Firstorder Mean
Shell Wavelet-LHH GLCM Idn
Shell Wavelet-HLH GLDM Small Dependence Emphasis

LASSO, least absolute shrinkage and selection operator.

Those models were evaluated with the training and test data sets with fivefold cross-validation, as shown in Table 3. Figure 4 shows the performance of the predictive model was tested according to the ROC metrics for the test dataset on fivefold cross-validation. The average accuracy of the five models was 87.5%, with the training data set. The average accuracy, sensitivity, and specificity with the test data were 85.4%, 88.6%, and 80.0%, respectively. Figure 4 shows the performance of the classifier which was validated according to the ROC metrics in the test data set with fivefold cross-validation. The AUC was 0.90 for first model, 0.92 for second model, 0.93 for third model, 0.91 for fourth mode, and 0.92 for fifth model. The average and standard deviation AUC with fivefold cross-validation was 0.92 and 0.0.

Table 3.

Model performance of NN with fivefold cross-validation in the accuracy, sensitivity, and specificity for the training and test data set

Model I Model II Model III Model IV Model V Average SD
Training Accuracy (%) 89.1 94.5 82.8 83.6 87.3 87.5 3.8
Test Accuracy (%) 88.0 82.6 87.0 82.6 87.0 85.4 2.1
Specificity (%) 89.5 84.6 92.9 82.4 93.8 88.6 4.1
Sensitivity (%) 87.5 80.0 77.8 83.3 71.4 80.0 4.9
AUC 0.90 0.92 0.93 0.91 0.92 0.92 0.0

AUC, area under the curve; SD, standard deviation.

Figure 4.

Figure 4.

The performance of the predictive model with NN was tested according to the ROC metrics for the test dataset on fivefold cross-validation. ROC, receiver operator characteristic.

Figure 5 shows average ROC metrics with the performance of the predictive model of CART, SVM, and KNN for the test data set. Table 4 shows the average accuracy, sensitivity, specificity, and AUC for the model performance of NN, CART, SVM, and KNN for the test dataset. The accuracy, sensitivity, specificity, and AUC were significantly highest for the model of NN.

Figure 5.

Figure 5.

Average ROC metrics with the performance of the predictive model of CART, SVM, and KNN for the test data set. CART, Classification and Regression Tree; KNN, K-nearest Neighbor; ROC, receiver operator characteristic; SVM, Support Vector Machine.

Table 4.

Model performance of the NN, CART, SVM, and KNN with fivefold cross-validation in the accuracy, sensitivity, specificity, and AUC for the test data set

NN CART SVM KNN
Accuracy (%) 85.4 76.1 78.2 77.1
Specificity (%) 88.6 68.9 74.1 82.7
Sensitivity (%) 80.0 79.3 80.0 67.6
AUC 0.92 0.69 0.85 0.76

AUC, area under the curve; CART, Classification and Regression Tree; KNN, K-nearest Neighbor; NN, neural network; SVM, Support Vector Machine.

Figure 6 shows average ROC metrics with the performance of the predictive model using all radiomics features and traditional semantic features for the test data set. Table 5 shows the comparison of the prediction performance of NN using all radiomics features and traditional semantic features with fivefold cross-validation for the test data set. The performance of the prediction model using all radiomics features was superior to that using the traditional semantic features.

Figure 6.

Figure 6.

Average ROC metrics for the performance of the predictive model of NN with the traditional semantic features for the test data set. NN, neural network; ROC, receiver operator characteristic.

Table 5.

Model performance of the NN with all radiomics analysis features and traditional semantic features with fivefold cross-validation in the accuracy, sensitivity, specificity, and AUC for the test data set

All radiomics features Traditional semantic features
Accuracy (%) 85.4 55.7
Specificity (%) 88.6 61.8
Sensitivity (%) 80.0 36.0
AUC 0.92 0.50

AUC, area under the curve; NN, neural network.

Discussion

Liu et al investigated the correlation of the texture parameters and T, N staging, and differentiation degrees with CT image.5 The parameters of kurtosis and entropy were significantly correlated with T stage, and some parameters (kurtosis, skew, mean, 10th and 90th percentile) are correlated to N stage. On the other hand, the correlation of the texture parameters and differentiation degree was weak.

The traditional semantic features can extract shape, physical characteristics, and statistical information. The accuracy of the prediction performance was significantly improved by adding the texture features. Radiomics analysis extracts a large number of quantitative image features, which leads to comprehensive quantification of tumor phenotypes.13 In the current study, the predictive model demonstrated good predictive power in both training (accuracy: 85.4%) and test (accuracy: 88.6%, AUC: 0.92) datasets. The current study proposed the predictive model for the differentiation degree with the CT-based radiomics feature in the shell region which is the outer of the tumor region and the inner tumor region which eliminated the peripheral of the tumor in addition to the tumor region. Hao et al proposed a predictive model for a distant failure by shell analysis that allows us to detect its associations with metastasis within the microenvironment.14 Usually, the segmentation of the tumor in treatment planning for radiotherapy was defined by a physician with visual assessment. The current study showed radiomics feature within the microenvironment could also be useful for predicting the differentiation degrees. Moreover, the current study used the imaging filter before extracting the radiomics features. For the feature selection with LASSO regression, the radiomics features with the wavelet filter and that extracted in texture analysis were mostly selected. Nie et al performed radiomics analysis in the liver for CT image to differentiating focal nodular hyperplasia (FNH) from hepatocellular carcinoma.15 10 features which were high-order filter and wavelet features were selected as the significant predictors to construct the radiomics signature. The imaging filter could reduce noise, enhance the edge, and extract or eliminate a constant frequency. It leads to eliminate the redundant and extract effective factors for prediction.

The prediction performance with the NN-based machine learning was higher than the other models of the CART, SVM, and KNN. The NN-based machine learning uses the weights expressing the importance of the respective inputs which is the radiomics features to the output which is the group of the differentiation degree. Thus, the NN-based machine learning can combine the radiomics features.

Because of the good prediction ability of the method, the method could contribute to reducing the pathological examination by biopsy. Moreover, the predictive model for the differentiation of degree of the locally advanced esophagus cancer may use for the prediction of the local control without examination of bioscopy. There were two limitations in the current study. One was that the current study was performed in the single institution. The other was low sample size. Future investigations will use the data of large sample size collected from other institutions and validate the prediction model in a multi institutional study.

Conclusion

The proposed predictive model showed high accuracy for the classification of the degree of the differentiation of esophagus cancer. Because of the good prediction ability of the method, the method may contribute to reducing the pathological examination by biopsy and predicting the local control.

Footnotes

Patient consent: Informed consent was obtained from all individual participants included in the study.

Ethics approval: All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.

Contributor Information

Daisuke Kawahara, Email: daika99@hiroshima-u.ac.jp.

Yuji Murakami, Email: daika999@hiroshima-u.ac.jp.

Shigeyuki Tani, Email: daika9999@hiroshima-u.ac.jp.

Yasushi Nagata, Email: daika99999@hiroshima-u.ac.jp.

REFERENCES

  • 1.Gupta B, Kumar N. Worldwide incidence, mortality and time trends for cancer of the oesophagus. Eur J Cancer Prev 2017; 26: 107–18. doi: 10.1097/CEJ.0000000000000249 [DOI] [PubMed] [Google Scholar]
  • 2.Domper Arnal MJ, Ferrández Arenas Ángel, Lanas Arbeloa Ángel. Esophageal cancer: risk factors, screening and endoscopic treatment in Western and eastern countries. World J Gastroenterol 2015; 21: 7933–43. doi: 10.3748/wjg.v21.i26.7933 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Lordick F, Mariette C, Haustermans K, Obermannová R, Arnold D, .ESMO Guidelines Committee . Oesophageal cancer: ESMO clinical practice guidelines for diagnosis, treatment and follow-up. Ann Oncol 2016; 27(suppl 5): v50–7. doi: 10.1093/annonc/mdw329 [DOI] [PubMed] [Google Scholar]
  • 4.Donahue JM, Nichols FC, Li Z, Schomas DA, Allen MS, Cassivi SD, et al. Complete pathologic response after neoadjuvant chemoradiotherapy for esophageal cancer is associated with enhanced survival. Ann Thorac Surg 2009; 87: 392–9. doi: 10.1016/j.athoracsur.2008.11.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Liu S, Zheng H, Pan X, Chen L, Shi M, Guan Y, et al. Texture analysis of CT imaging for assessment of esophageal squamous cancer aggressiveness. J Thorac Dis 2017; 9: 4724–32. doi: 10.21037/jtd.2017.06.46 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Sun F, Chen T, Han J, Ye P, Hu J. Staging accuracy of endoscopic ultrasound for esophageal cancer after neoadjuvant chemotherapy: a meta-analysis and systematic review. Dis Esophagus 2015; 28: 757–71. doi: 10.1111/dote.12274 [DOI] [PubMed] [Google Scholar]
  • 7.Levine DS, Blount PL, Rudolph RE, Reid BJ. Safety of a systematic endoscopic biopsy protocol in patients with Barrett's esophagus. Am J Gastroenterol 2000; 95: 1152–7. doi: 10.1111/j.1572-0241.2000.02002.x [DOI] [PubMed] [Google Scholar]
  • 8.Fennerty MB. Complications of endoscopy. In: Brandt L, ed. ., Clinical practice of gastroenterology. Philadelphia: Current Medicine; 1999et al. [Google Scholar]
  • 9.Kawahara D, Tang X, Lee CK, et al. Predicting the local response of metastatic brain tumor to gamma knife radiosurgery by radiomics with a machine learning method. Frontier Oncology. Accepted. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.van Griethuysen JJM, Fedorov A, Parmar C, Hosny A, Aucoin N, Narayan V, et al. Computational Radiomics system to decode the radiographic phenotype. Cancer Res 2017; 77: e104. doi: 10.1158/0008-5472.CAN-17-0339 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Tibshirani R. The LASSO method for variable selection in the COX model. Stat Med 1997; 16: 385–95. doi: [DOI] [PubMed] [Google Scholar]
  • 12.Zhang J-X, Song W, Chen Z-H, Wei J-H, Liao Y-J, Lei J, et al. Prognostic and predictive value of a microRNA signature in stage II colon cancer: a microRNA expression analysis. Lancet Oncol 2013; 14: 1295–306. doi: 10.1016/S1470-2045(13)70491-1 [DOI] [PubMed] [Google Scholar]
  • 13.Gillies RJ, Kinahan PE, Hricak H. Radiomics: images are more than pictures, they are data. Radiology 2016; 278: 563–77. doi: 10.1148/radiol.2015151169 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Hao H, Zhou Z, Li S, Maquilan G, Folkert MR, Iyengar P, et al. Shell feature: a new radiomics descriptor for predicting distant failure after radiotherapy in non-small cell lung cancer and cervix cancer. Phys Med Biol 2018; 63: 095007. doi: 10.1088/1361-6560/aabb5e [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Nie P, Yang G, Guo J, Chen J, Li X, Ji Q, et al. A CT-based radiomics nomogram for differentiation of focal nodular hyperplasia from hepatocellular carcinoma in the non-cirrhotic liver. Cancer Imaging 2020; 20: 20.20: 20. doi: 10.1186/s40644-020-00297-z [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material 1.

Articles from The British Journal of Radiology are provided here courtesy of Oxford University Press

RESOURCES