Skip to main content
Lippincott Open Access logoLink to Lippincott Open Access
. 2016 Nov 23;41(3):455–460. doi: 10.1097/RCT.0000000000000555

Support Vector Machines Model of Computed Tomography for Assessing Lymph Node Metastasis in Esophageal Cancer with Neoadjuvant Chemotherapy

Zhi-Long Wang *, Zhi-Guo Zhou , Ying Chen *, Xiao-Ting Li *, Ying-Shi Sun *
PMCID: PMC5457826  PMID: 27879527

Abstract

Objective

The aim of this study was to diagnose lymph node metastasis of esophageal cancer by support vector machines model based on computed tomography.

Materials and Methods

A total of 131 esophageal cancer patients with preoperative chemotherapy and radical surgery were included. Various indicators (tumor thickness, tumor length, tumor CT value, total number of lymph nodes, and long axis and short axis sizes of largest lymph node) on CT images before and after neoadjuvant chemotherapy were recorded. A support vector machines model based on these CT indicators was built to predict lymph node metastasis.

Results

Support vector machines model diagnosed lymph node metastasis better than preoperative short axis size of largest lymph node on CT. The area under the receiver operating characteristic curves were 0.887 and 0.705, respectively.

Conclusions

The support vector machine model of CT images can help diagnose lymph node metastasis in esophageal cancer with preoperative chemotherapy.

Key Words: support vector machine, computed tomography, esophageal cancer, lymph node metastasis


The prognosis of patients with resectable esophageal cancer remains poor. The reported 5-year survival rates range from 20 to 36% after intentionally curative surgery.13 Median survival is only 9–24 months in patients with surgical treatment.49 Prospective randomized trials demonstrated an improved survival after neoadjuvant therapy compared to surgery alone in patients with esophageal cancer. Some important studies include the CROSS trial, which analyzed neoadjuvant chemoradiation for patients with esophageal adenocarcinoma or squamous cell carcinoma, and the MAGIC and French trials analyzing neoadjuvant chemotherapy for adenocarcinoma.1012 Data from the FFCD9901 study suggested preoperative chemoradiotherapy increases complication incidence and mortality.13 Therefore, preoperative chemotherapy in treating esophageal carcinoma is gradually accepted by surgeons.

As reported by Worldwide Esophageal Cancer Collaboration, survival decreases with the presence of lymph node metastases (LNM).14 Indeed, the lymph node category was shown to be an independent prognostic factor in lymph node positive patients with resectable thoracic esophageal cancer.15 Imaging examinations are the most commonly used tools for lymph node status evaluation in esophageal cancer. Wakelin et al compared computed tomography (CT), laparoscopic ultrasound, and endoscopic ultrasound (EUS) in the preoperative staging of esophagogastric carcinoma; the accuracy of CT in diagnosing the N stage of esophageal cancer was 17 of 29 (59%).16 These authors concluded that the nodal status remains the most difficult area to assess using all three modalities. The main hurdle appeared to be the differentiation between benign and malignant enlarged nodes, with lymph node size alone not being a good criterion for assessing malignancy.

In recent years, machine-learning methods have been used to predict complex biological problems. Support vector machines (SVMs) are supervised machine learning techniques widely used in pattern recognition and classification problems. An SVM algorithm performs a classification by constructing a multidimensional hyperplane that optimally discriminates between two classes, by maximizing the margin between two data clusters. This algorithm achieves high discriminative power by using special nonlinear functions called kernels to transform the input space into a multidimensional space.17 SVMs have been used in medical applications.1820 Given a set of training cases, each marked as belonging to one of two categories, a SVM training algorithm builds a model that predicts whether a new case falls into one category or the other.

Therefore, we used CT imaging data before and after neoadjuvant chemotherapy to establish a SVM mathematical model. In addition, the diagnostic power of the SVM method for differentiating LNM in patients with esophageal cancer was assessed. Because squamous cell carcinomas are significantly more common than adenocarcinomas and other malignant esophageal cancers in Asians, only patients with squamous cell carcinomas were evaluated in this study.

MATERIALS AND METHODS

Patients

This retrospective study was approved by the Ethics Committee of our hospital, with a waiver of requirement for informed consent. The clinical data were collected from the prospective database of our hospital. All patients in this database who had pathologically confirmed esophageal squamous cell carcinoma and received preoperative chemotherapy from January 2006 to January 2012 were included. All patients underwent gastroscopy to acquire pathological information, and received baseline and preoperative enhanced CT examinations.

Exclusion criteria were (a) pathologically proven adenocarcinoma, small cell carcinoma, mixed cancer, or other diseases; (b) other preoperative therapies (e.g., radiotherapy) simultaneously; (c) esophageal multiple primary carcinoma; (d) death within 30 days after surgery; (e) enhanced CT data before preoperative chemotherapy not obtained or images not interpretable; and (f) non-suitability for radical esophagectomy because of tumor progression or patient’s physical condition.

CT Protocol

MDCT was performed using a 64–detector row CT scanner (LightSpeed 64; GE Healthcare, Milwaukee, WI). Chest unenhanced CT scans were acquired with 0.625 mm collimation, 120–140 kVp, and 300–350 mAs. Subsequently, a total of 100 ml iopromide (Ultravist; Schering, Berlin, Germany) was administered intravenously via an 18-gauge angiographic catheter inserted into an antecubital vein, at 3 mL/sec with an automatic injector. Contrast-enhanced CT scans were performed at 60 seconds after iopromide injection. Sagittal and coronal reconstructions were carried out with contrast-enhanced images.

Image Analysis

Baseline and preoperative CT images were analyzed using the picture archiving communication system (PACS) by two independent radiologists blinded to patients’ clinical history. The following CT indicators were measured:

  • Tumor length: longest diameter obtained from the sagittal CT image.

  • Tumor thickness: lesion thickness obtained from the axial CT image.

  • Tumor CT value: region of Interest (ROI) placed on the lesion with maximum cross-section at the cross-sectional CT image.

  • Total LN number: number of all visible regional lymph nodes on Chest CT image.

  • Long axis size of largest regional LN (LSDL): long axis diameter of the largest regional lymph node in the axial CT image.

  • Short axis size of largest regional LN (SSDL). Diameter perpendicular to the long axis of the largest regional lymph node in the axial CT image.

  • The average results from the two radiologists were used for continuous variable analysis. Changes of CT image indicators between baseline and preoperative CT were assessed.

Statistical Analysis

LNM Assessment

All patients were divided into positive-LNM and negative-LNM groups, respectively. Node metastasis was confirmed by postoperative pathological results. A univariate statistical analysis with the SPSS software version 17.0 (SPSS Inc., Chicago, IL) was performed to evaluate the differences in various imaging indicators between the positive-LNM and negative-LNM groups. Group comparison was carried out by independent-samples T test. P <0.05 was considered statistically significant.

The CT indicators significantly different between positive-LNM and negative-LNM groups were selected to build the SVM model. Receiver operating characteristic (ROC) curves were used to evaluate these indicators in diagnosing LNM. The MedCalc software version 11.2 (MedCalc; MedCalc Software, Ghent, Belgium) was used to generate and compare the ROC curves.

Least Squares Support Vector Machine (SVM)

Least squares support vector machine (LS-SVM) was proposed by Suykens and Vandewalle.21 Compared with other SVMs, LS-SVM utilizes quadratic sum of the slack variables as the penalty factor which ensures that LS-SVM can obtain a small training error. Specifically, LS-SVM minimizes the follow optimization problem, that is:

graphic file with name rct-41-455-i001.jpg

where xi is the i-th training sample, yi{−1,1} is the label of xi, ϕ is a feature map which maps xi to the feature space, w is the weight parameter vector, b is the bias parameter, ei is i-th slack variable and e = (e1,⋯,eN)T, y is a tuning parameter which makes a trade-off between the slack variable penalty and the margin. The Lagrangian function of Eq. (1) is

graphic file with name rct-41-455-i002.jpg

where αi is i-th Lagrange multiplier (i = 1, ⋯, N). According to the optimal conditions, we have

graphic file with name rct-41-455-i003.jpg

graphic file with name rct-41-455-i004.jpg

graphic file with name rct-41-455-i005.jpg

and

graphic file with name rct-41-455-i006.jpg

By eliminating w and e, we can obtain the system of linear equations

graphic file with name rct-41-455-i007.jpg

where α = (α1,⋯,αN)T, y = (y1,⋯,yN)T, 1 = (1,⋯,1)T is a length-N vector, I is N × N identity matrix, and Ω = (ϕ(xi)Tϕ(xj))N×N is Gram matrix. Solving Eq. (7), we obtain the solution b* and α*. Then the optimal weight parameter vector w* can be computed by Eq. (3) and ei by Eq. (5). For a testing sample x, its label can be estimated by

graphic file with name rct-41-455-i008.jpg

LS-SVM can easily be extended to K-class (K > 2) classification problem by one-versus-one strategy.22

In this study, a free LS-SVM software package in MATLAB (version 7.0; MathWorks, Inc., Natick, MA) was applied to generate the SVM model. The default kernel function is used. A LS-SVM model for assessing LNM was established. Input indexes were the indicators obtained by the above univariate statistical analysis. The output index was lymph node metastasis in the patient. Confirmation was carried out by surgery and histopathology. Positive LNM was defined as 1 and negative LNM as −1. Fifty percent of cases were randomly selected to constitute the training sample. The remaining 50% of cases formed the testing sample. The training sample was used to establish the LS-SVM model. Finally, the ability of the model to predict LNM in the testing set and all cases was evaluated by ROC curves. ROC curves can be created automatically by the MATLAB software.

RESULTS

A total of 131 patients (102 males and 29 females; mean age of 58.0 years, ranging from 42 to 75 years) were included in the study (Fig. 1). There were 51 cases with lymph node metastasis and 80 without. The clinicopathological features of the patients are detailed in Table 1. The majority of patients (97%; 127/131) received a platinum-based two-drug combination, mainly paclitaxel (175 mg/m2, IV, d1 Q21) and cisplatin (25 mg/m2 IV, d1–3 Q21); the remaining patients received nedaplatin (80 mg/m2) combined with paclitaxel. A total of one to four chemotherapy cycles were administered before surgery at 3–6 weeks after neoadjuvant chemotherapy.

FIGURE 1.

FIGURE 1

Flow chart of the study.

TABLE 1.

Patient Characteristics

graphic file with name rct-41-455-g002.jpg

In the univariate analysis, preoperative tumor thickness, preoperative long axis and short axis sizes of largest lymph node, total numbers of lymph nodes in baseline and preoperative CT, and change of tumor thickness in second CT showed statistically significant differences between the LNM positive and negative groups (Table 2). Of these six CT indicators, preoperative short axis size of largest lymph node yielded the highest power for diagnosing LNM in ROC curves (Table 3, Fig. 2), with an area under the ROC curve (AUC) of 0.705.

TABLE 2.

The Results of Univariate Statistical Analysis for CT Indicators of Baseline and Preoperative CT

graphic file with name rct-41-455-g003.jpg

TABLE 3.

AUC of CT Indicators

graphic file with name rct-41-455-g004.jpg

FIGURE 2.

FIGURE 2

Receiver operating characteristic (ROC) curve for lymph node metastasis with six CT indicators. The highest AUC of these six CT indicators was 0.705 which was performed by the short axis size of maximum lymph node (SSLN) of preoperative CT. Figure 2 can be viewed online in color at www.jcat.org.

After the random sampling by the SPSS statistical software, 66 cases were randomly selected to constitute the training sample. The other 65 cases were defined as testing sample. The training sample was used to establish the LS-SVM model. When we use this model to predict the training sample and testing sample, the AUCs of the SVM model were 0.955 and 0.553, respectively (Fig. 3a, b). The AUC of the SVM model predicting all samples reached 0.887 (Fig. 3c).

FIGURE 3.

FIGURE 3

(A–C) Receiver operating characteristic (ROC) curve for lymph node metastasis with LS-SVM model. A, The AUC of the model predicting training sample was 0.955. B, The AUC of the model predicting testing sample was 0.553. C, The AUC of the SVM model predicting all samples reached 0.887. Figure 3 can be viewed online in color at www.jcat.org.

By comparing the ROC curves, the SVM model performed significantly better than preoperative short axis size of largest lymph node (P < 0.05).

DISCUSSION

Lymph node metastasis affects the surgical treatment of patients with esophageal cancer, and is also an important prognostic factor. Currently, preoperative diagnosis mainly depends on various imaging methods. Yokota et al indicated that clinical node diagnosis has low specificity and negative predictive value for predicting pathological nodes in the preoperative diagnosis of lymph node metastasis for patients with locally advanced resectable esophageal cancer.23

Because metastatic lymph node detection on CT images mainly depends on size, the sensitivity and specificity of CT vary with the definition of an abnormally enlarged node. In general, intrathoracic and abdominal lymph nodes greater than 1 cm in diameter are considered to be enlarged, while supraclavicular lymph nodes with a short axis exceeding 5 mm are considered to be pathologic.24

Most studies using the common size criterion of 1 cm to define enlarged nodes report CT sensitivity of 30–60%, whereas specificity tends to be somewhat higher (60–80%).25,26 In a study by Picus et al, nearly all metastatic peri-esophageal lymph nodes measuring less than 7 mm were indistinguishable from non-metastatic lymph nodes by CT.27 In addition, the presence of benign enlarged and inflammatory lymph nodes in esophageal cancer reduces the specificity of CT for detecting lymph node metastases. The lack of uniform criteria is the main constraint in predicting lymph node metastasis preoperatively.

The biological behavior of esophageal cancer reflects the histopathological performance of tumor malignancy and invasion. It affects lymph node metastasis directly or indirectly. The concrete manifestations of cancer biological behavior include, for example, tumor size, tumor invasion of other organs, lymph node metastasis, and distant metastasis. Therefore, MDCT imaging can accurately reflect the biological behavior of esophageal cancer histopathology. Univariate analysis in this study showed that all six indicators obtained from CT images were associated with LNM in esophageal cancer. Therefore, these biological behavior factors should be taken into account in comprehensively predicting LNM.

Other machine-learning methods have been used in medical studies. The mainly used method is artificial neural network (ANN), which is considered an appropriate tool for medical data analysis.28 Bollschweiler et al applied a single-layer perceptron, an ANN, to predict lymph node metastasis in esophageal cancer, with an accuracy of 79%.29 However, the ANN has some disadvantages: (1) the model is prone to overfitting, (2) it requires lengthy development and optimization time, and (3) it is more difficult to use in the field because of computational requirements.30 Considering the above reasons, this study instead selected the SVM model, which could produce lower prediction error compared with classifiers based on other methods like artificial neural networks.31 Compared with ANN, SVM may have the same or even better predictive ability.32,33 Few reports are available regarding the application of SVM in esophageal cancer lymph node metastasis. In this preliminary study, the results indicated that the SVM model has better diagnostic capability for LNM than the traditional LN size criteria, with AUC achieving a good diagnostic power. With further improvement, SVM may become an effective tool in predicting lymph node staging in esophageal cancer.

Our study has some limitations. First, although a relatively large sample size was used, this was a single-center retrospective study. Further prospective studies are warranted to confirm the diagnostic power of the SVM model. In addition, the majority of patients were male (77.9%). Gender factors may influence the external validity of these findings. Finally, the AUC obtained for the LS-SVM model in the testing sample was relatively lower compared with the training sample value. This indicates a need for improvement of the model’s ability to assess new cases.

CONCLUSIONS

The least squares support vector machine model based on CT images can help diagnose lymph node metastasis in esophageal cancer with preoperative chemotherapy.

ACKNOWLEDGMENTS

We thank Jie Li, Kun Cao, Lei Tang, Yong Cui, Li-Ping Qi, and Shun-Yu Gao for editorial support and Jun Shan, Ning Wang, Ying Li, Xiao-Yan Zhang, and Yan-Ling Li for reviewing the manuscript.

Footnotes

Funding: This work was supported by the National Natural Science Foundation of China (grant no. 81471640), the National Basic Research Program of China (973 Program) (grant no. 2011CB707705), and Beijing Health System High Level Health Technical Personnel Training Plan (No. 2013-3-083).

The authors declare no conflict of interest.

Authors’ contributions: ZLW and YSS were guarantors of integrity of the entire study. ZLW carried out the study design and manuscript editing. ZGZ and YC carried out the data analysis. YSS participated in the manuscript preparation. XTL carried out the statistical analysis. All authors read and approved the final manuscript.

REFERENCES

  • 1.Lerut T, De Leyn P, Coosemans W, et al. Surgical strategies in esophageal carcinoma with emphasis on radical lymphadenectomy. Ann Surg. 1992;216:583–590. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Siewert JR, Bartels H, Bollschweiler E, et al. [Squamous cell cancer of the esophagus. Treatment concept at the surgical clinic of the Munich Technical University]. Chirurg. 1992;63:693–700. [PubMed] [Google Scholar]
  • 3.Watanabe H. [Squamous cell cancer of the esophagus. Treatment concept at the National Cancer Center in Tokyo]. Chirurg. 1992;63:689–692. [PubMed] [Google Scholar]
  • 4.Roth JA, Pass HI, Flanagan MM, et al. Randomized clinical trial of preoperative and postoperative adjuvant chemotherapy with cisplatin, vindesine, and bleomycin for carcinoma of the esophagus. J Thorac Cardiovasc Surg. 1988;96:242–248. [PubMed] [Google Scholar]
  • 5.Swisher SG, Hunt KK, Holmes EC, et al. Changes in the surgical management of esophageal cancer from 1970 to 1993. Am J Surg. 1995;169:609–614. [DOI] [PubMed] [Google Scholar]
  • 6.Müller JM, Erasmi H, Stelzner M, et al. Surgical therapy of oesophageal carcinoma. Br J Surg. 1990;77:845–857. [DOI] [PubMed] [Google Scholar]
  • 7.Hulscher JB, van Sandick JW, de Boer AG, et al. Extended transthoracic resection compared with limited transhiatal resection for adenocarcinoma of the esophagus. N Engl J Med. 2002;347:1662–1669. [DOI] [PubMed] [Google Scholar]
  • 8.Hagen JA, DeMeester SR, Peters JH, et al. Curative resection for esophageal adenocarcinoma: analysis of 100 en bloc esophagectomies. Ann Surg. 2001;234:520–530. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Hofstetter W, Swisher SG, Correa AM, et al. Treatment outcomes of resected esophageal cancer. Ann Surg. 2002;236:376–384. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.van Hagen P, Hulshof MC, van Lanschot JJ, et al. Preoperative chemoradiotherapy for esophageal or junctional cancer. N Engl J Med. 2012;366:2074–2084. [DOI] [PubMed] [Google Scholar]
  • 11.Ychou M, Boige V, Pignon JP, et al. Perioperative chemotherapy compared with surgery alone for resectable gastroesophageal adenocarcinoma: an FNCLCC and FFCD multicenter phase III trial. J Clin Oncol. 2011;29:1715–1721. [DOI] [PubMed] [Google Scholar]
  • 12.Cunningham D, Allum WH, Stenning SP, et al. Perioperative chemotherapy versus surgery alone for resectable gastroesophageal cancer. N Engl J Med. 2006;355:11–20. [DOI] [PubMed] [Google Scholar]
  • 13.Mariette C, Dahan L, Mornex F, et al. Surgery alone versus chemoradiotherapy followed by surgery for stage I and II esophageal cancer: final analysis of randomized controlled phase III trial FFCD 9901. J Clin Oncol. 2014;32:2416–2422. [DOI] [PubMed] [Google Scholar]
  • 14.Rice TW, Rusch VW, Apperson-Hansen C, et al. Worldwide esophageal cancer collaboration. Dis Esophagus. 2009;22:1–8. [DOI] [PubMed] [Google Scholar]
  • 15.Xu Y, Jiang Y, Yu X, et al. Analysis of new N-category on prognosis of oesophageal cancer with positive lymph nodes in a Chinese population. Radiol Oncol. 2013;47:63–70. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Wakelin SJ, Deans C, Crofts TJ, et al. A comparison of computerised tomography, laparoscopic ultrasound and endoscopic ultrasound in the preoperative staging of oesophago-gastric carcinoma. Eur J Radiol. 2002;41:161–167. [DOI] [PubMed] [Google Scholar]
  • 17.Yu W, Liu T, Valdez R, et al. Application of support vector machine modeling for prediction of common diseases: the case of diabetes and pre-diabetes. BMC Med Inform Decis Mak. 2010;10:16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Klöppel S, Stonnington CM, Barnes J, et al. Accuracy of dementia diagnosis: a direct comparison between radiologists and a computerized method. Brain. 2008;131(pt 11):2969–2974. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Das K, Giesbrecht B, Eckstein MP. Predicting variations of perceptual performance across individuals from neural activity using pattern classifiers. Neuroimage. 2010;51:1425–1437. [DOI] [PubMed] [Google Scholar]
  • 20.Mourão-Miranda J, Bokde AL, Born C, et al. Classifying brain states and determining the discriminating activation patterns: Support Vector Machine on functional MRI data. Neuroimage. 2005;28:980–995. [DOI] [PubMed] [Google Scholar]
  • 21.Suykens J, Vandewalle J. Least squares support vector machine classifiers. Neural Process Lett. 1999;9:293–300. [Google Scholar]
  • 22.Bishop C. Pattern recognition and machine learning. Springer; 2006. http://www.springer.com/cn/book/9780387310732. [Google Scholar]
  • 23.Yokota T, Igaki H, Kato K, et al. Accuracy of preoperative diagnosis of lymph node metastasis for thoracic esophageal cancer patients from JCOG9907 trial. Int J Clin Oncol. 2016;21:283–288. [DOI] [PubMed] [Google Scholar]
  • 24.Kim TJ, Kim HY, Lee KW, et al. Multimodality assessment of esophageal cancer: preoperative staging and monitoring of response to therapy. Radiographics. 2009;29:403–421. [DOI] [PubMed] [Google Scholar]
  • 25.Dorfman RE, Alpern MB, Gross BH, et al. Upper abdominal lymph nodes: criteria for normal size determined with CT. Radiology. 1991;180:319–322. [DOI] [PubMed] [Google Scholar]
  • 26.Fultz PJ, Feins RH, Strang JG, et al. Detection and diagnosis of nonpalpable supraclavicular lymph nodes in lung cancer at CT and US. Radiology. 2002;222:245–251. [DOI] [PubMed] [Google Scholar]
  • 27.Picus D, Balfe DM, Koehler RE, et al. Computed tomography in the staging of esophageal carcinoma. Radiology. 1983;146:433–438. [DOI] [PubMed] [Google Scholar]
  • 28.Patel JL, Goyal RK. Applications of artificial neural networks in medical science. Curr Clin Pharmacol. 2007;2:217–226. [DOI] [PubMed] [Google Scholar]
  • 29.Bollschweiler EH, Mönig SP, Hensler K, et al. Artificial neural network for prediction of lymph node metastases in gastric cancer: a phase II diagnostic study. Ann Surg Oncol. 2004;11:506–511. [DOI] [PubMed] [Google Scholar]
  • 30.Ahmed FE. Artificial neural networks for diagnosis and survival prediction in colon cancer. Mol Cancer. 2005;4:29. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Byvatov E, Schneider G. Support vector machine applications in bioinformatics. Appl Bioinformatics. 2003;2:67–77. [PubMed] [Google Scholar]
  • 32.McQuisten KA, Peek AS. Comparing artificial neural networks, general linear models and support vector machines in building predictive models for small interfering RNAs. PLoS One. 2009;4:e7522. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Lee HJ, Hwang SI, Han SM, et al. Image-based clinical decision support for transrectal ultrasound in the diagnosis of prostate cancer: comparison of multiple logistic regression, artificial neural network, and support vector machine. Eur Radiol. 2010;20:1476–1484. [DOI] [PubMed] [Google Scholar]

Articles from Journal of Computer Assisted Tomography are provided here courtesy of Wolters Kluwer Health

RESOURCES