Development and External Validation of a Machine Learning Model to Predict Pathological Complete Response After Neoadjuvant Chemotherapy in Breast Cancer

Ji-Jung Jung; Eun-Kyu Kim; Eunyoung Kang; Jee Hyun Kim; Se Hyun Kim; Koung Jin Suh; Sun Mi Kim; Mijung Jang; Bo La Yun; So Yeon Park; Changjin Lim; Wonshik Han; Hee-Chul Shin

doi:10.4048/jbc.2023.26.e14

2023 Mar 28;26(4):353–362. doi: 10.4048/jbc.2023.26.e14


PMCID: PMC10475713 PMID: 37272242

Abstract

Purpose

Several predictive models have been developed to predict the pathological complete response (pCR) after neoadjuvant chemotherapy (NAC); however, few are broadly applicable owing to radiologic complexity and institution-specific clinical variables, and none have been externally validated. This study aimed to develop and externally validate a machine learning model that predicts pCR after NAC in patients with breast cancer using routinely collected clinical and demographic variables.

Methods

The electronic medical records of patients with advanced breast cancer who underwent NAC before surgical resection between January 2017 and December 2020 were reviewed. Patient data from Seoul National University Bundang Hospital were divided into training and internal validation cohorts. Five machine learning techniques, including gradient boosting machine (GBM), support vector machine, random forest, decision tree, and neural network, were used to build predictive models, and the area under the receiver operating characteristic curve (AUC) was compared to select the best model. Finally, the model was validated using an independent cohort from Seoul National University Hospital.

Results

A total of 1,003 patients were included in the study: 287, 71, and 645 in the training, internal validation, and external validation cohorts, respectively. Overall, 36.3% of the patients achieved pCR. Among the five machine learning models, the GBM showed the highest AUC for pCR prediction (AUC, 0.903; 95% confidence interval [CI], 0.833–0.972). External validation confirmed an AUC of 0.833 (95% CI, 0.800–0.865).

Conclusion

Commonly available clinical and demographic variables were used to develop a machine learning model for predicting pCR following NAC. External validation of the model demonstrated good discrimination power, indicating that routinely collected variables were sufficient to build a good prediction model.

Keywords: Breast Neoplasms, Machine Learning, Neoadjuvant Therapy

INTRODUCTION

Neoadjuvant chemotherapy (NAC) can render unresectable tumors operable and can downstage operable tumors to allow breast conservation [1,2]. After NAC, pathological complete response (pCR), defined as the absence of invasive or in situ residual disease in the breast and nodes, is associated with more favorable outcomes than residual disease [3,4,5]. For patients who achieve pCR, less invasive surgical approaches, including breast-conserving surgery and de-escalation of axillary lymph node dissection, may be feasible while achieving comparable clinical outcomes.

Depending on the tumor subtype, approximately 20%–40% of patients receiving NAC achieve pCR; pCR rates are highest in human epidermal growth factor receptor 2 (HER2)-positive cancers and lowest in luminal A tumors [6,7]. Several predictive models have been developed to predict pCR [8,9,10,11,12,13]; however, few are broadly applicable because of radiologic complexity and variables that require additional tests. Moreover, most models focus on identifying patients who may benefit from NAC before initiating chemotherapy.

This study aimed to develop a machine learning model using simple variables to predict pCR after NAC. This model will help surgeons identify patients eligible for less-invasive surgical interventions and eventually reduce the treatment burden. We also aimed to externally validate the model using patient data from an independent hospital to demonstrate that this model could be generally applied to all patients with breast cancer who received NAC and were awaiting surgery.

METHODS

This multicenter retrospective study was performed at Seoul National University Bundang Hospital (SNUBH), a tertiary, academic, three-site hospital system. The study protocol was approved by the Institutional Review Boards of SNUBH and Seoul National University Hospital (SNUH), which waived the need for informed consent for the use of identifiable data.

Study cohort and data collection

Patients with histologically proven invasive breast carcinoma who underwent NAC followed by surgery between January 2017 and December 2019 at SNUBH and between January 2015 and December 2018 at SNUH were included in this study. The exclusion criteria were bilateral breast cancer and a personal history of breast cancer. A total of 358 patients from SNUBH and 645 patients from SNUH were included, and their clinical data were obtained from electronic medical systems.

Based on previous studies, clinical and pathological variables known to have a predictive value for NAC response in breast cancer were collected [8,9,13,14]. The clinical data included age, body mass index (BMI), clinical T stage, N stage before the initiation of NAC, and serum carbohydrate antigen 15-3 (CA15-3). Tumor size was defined as the largest tumor size measured on ultrasound or magnetic resonance imaging (MRI), and nodal staging was based on the results of physical examination, imaging tests (ultrasound, MRI, and chest computed tomography), and biopsy. cN0 was defined as negative for metastases on physical and radiological examinations. Pathological data on histological grade, estrogen receptor (ER), progesterone receptor (PR), HER2, and Ki-67 status were collected from prechemotherapy biopsy reports. The status of ER, PR, and HER2 was evaluated by immunohistochemistry and fluorescence in situ hybridization when needed. For ER and PR, ≥ 1% positive tumor cells with nuclear staining were considered positive. Finally, the mass size on post-NAC MRI was used to assess changes in tumor size after NAC [15]. Non-mass enhancement without definite features of the mass was not included in the tumor size.

Model development and validation

Before developing the prediction models, patients from SNUBH were randomly divided into training and test cohorts in a ratio of 80:20. The training set was used to develop prediction models using five machine learning algorithms: gradient boosting machine (GBM), support vector machine (SVM), random forest (RF), decision tree (DT), and neural network (NN). The outcome variable was pCR, and 10 variables selected from the literature review were used as predictors. As missing data were present in less than 2% of the total records, a simple imputation technique was used: missing values were replaced with the mean for continuous variables and the median for categorical variables. Feature importance statistics were used to rank the variables; however, no feature selection was performed, so all 10 variables were retained in the models. To ensure the robustness of each machine learning method, 10-fold cross-validation was applied to the training set, and the hyperparameters were optimized by random search to maximize the area under the receiver operating characteristic curve (AUC) and accuracy of each model [16].
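The splitting, imputation, and cross-validation steps described above can be sketched in plain Python. This is an illustrative sketch only and not the authors' code: the function names (`split_80_20`, `impute`, `kfold_indices`), the fixed seed, and the floor rounding of the test-set size are assumptions, the last chosen so that 358 records yield the reported 287/71 split.

```python
import random
import statistics


def split_80_20(records, seed=0):
    """Shuffle records and return (training, test) lists in an 80:20 ratio."""
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    n_test = len(shuffled) // 5  # assumption: floor, so 358 -> 71 test, 287 training
    return shuffled[n_test:], shuffled[:n_test]


def impute(column, kind):
    """Fill None with the mean (continuous) or the median (categorical codes)."""
    observed = [v for v in column if v is not None]
    if kind == "continuous":
        fill = statistics.mean(observed)
    else:
        fill = statistics.median(observed)
    return [fill if v is None else v for v in column]


def kfold_indices(n, k=10):
    """Return k (train_idx, val_idx) pairs whose validation folds partition range(n)."""
    folds = [list(range(i, n, k)) for i in range(k)]
    pairs = []
    for i, val in enumerate(folds):
        train = [j for f, fold in enumerate(folds) if f != i for j in fold]
        pairs.append((train, val))
    return pairs


train, test = split_80_20(range(358))
print(len(train), len(test))
```

In practice, the fill values would typically be computed on the training set and reused for the test set to avoid information leakage; the text does not state how this was handled.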

Internal validation of each model with optimal hyperparameters was performed on the test set. The performances of the predictive models were evaluated using the AUC, accuracy, precision, recall, and F1 score. The AUCs of the different machine learning models were compared using DeLong’s test [17], and the cut-off values were calculated using the Youden index. Finally, the model with the highest AUC was selected and further tested using an external validation cohort from SNUH.
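For readers less familiar with these metrics, the AUC and the Youden index cut-off can be computed as follows. This is a minimal sketch under stated assumptions: the function names and the toy labels and scores are hypothetical, a prediction is counted as positive when the score is greater than or equal to the threshold, and the software used in the study may handle ties and thresholds differently.

```python
def roc_auc(y_true, scores):
    """AUC as the probability that a positive case outscores a negative one (ties count 0.5)."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def youden_cutoff(y_true, scores):
    """Return (threshold, J) maximizing J = sensitivity + specificity - 1."""
    n_pos = sum(y_true)
    n_neg = len(y_true) - n_pos
    best_threshold, best_j = None, -1.0
    for t in sorted(set(scores)):
        # Assumption: a case is predicted positive when its score is >= t.
        tp = sum(1 for y, s in zip(y_true, scores) if y == 1 and s >= t)
        tn = sum(1 for y, s in zip(y_true, scores) if y == 0 and s < t)
        j = tp / n_pos + tn / n_neg - 1
        if j > best_j:
            best_threshold, best_j = t, j
    return best_threshold, best_j


# Hypothetical toy data (1 = pCR, 0 = non-pCR), not study data.
labels = [0, 0, 1, 1]
predicted = [0.1, 0.4, 0.35, 0.8]
print(roc_auc(labels, predicted), youden_cutoff(labels, predicted))
```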

Statistical analysis

Qualitative data were presented as numbers and percentages, and quantitative data were presented as mean ± standard deviation. Student’s t-test, Fisher’s exact test, and the χ² test were used to compare continuous and discrete variables between the two groups. Statistical significance was set at p < 0.05. R software version 3.6.3 (The R Foundation for Statistical Computing, Vienna, Austria) and Python programming language (version 3.5.2; Python Software Foundation, Wilmington, USA) were used for statistical analyses and model development. The scikit-learn package was used for DT, RF, and SVM; XGBoost and CatBoost for GBM; and Keras and neuralnet for NN.

RESULTS

In total, 1,003 patients including 358 from SNUBH (287 in the training cohort and 71 in the internal validation cohort) and 645 from SNUH were included in our analyses. Overall, a pCR was observed in 348 (36.3%) patients. The incidence of pCR was 37.4% (134/358) for SNUBH and 33.2% (214/645) for SNUH. Patient demographic and clinicopathological characteristics in both the training and test sets are presented in Table 1.

Table 1. Patient characteristics of the training and test cohorts.

Characteristics		Training set (n = 287)	Test set (n = 71)	p-value
Age (yr)		49.7 ± 10.4	48.8 ± 9.9	0.507
BMI (kg/m²)		23.8 ± 3.7	23.6 ± 3.6	0.687
Clinical tumor stage^*				0.607
	T1	24 (8.4)	5 (7.0)
	T2	184 (64.1)	43 (60.6)
	T3	57 (19.9)	19 (26.8)
	T4	22 (7.7)	4 (5.6)
Clinical node stage^*				0.619
	N0	71 (24.7)	14 (19.7)
	N1	121 (42.2)	36 (50.7)
	N2	51 (17.8)	11 (15.5)
	N3	44 (15.3)	10 (14.1)
Histologic grade				0.527
	Low	6 (2.1)	1 (1.4)
	Intermediate	99 (35.2)	30 (42.3)
	High	176 (62.6)	40 (56.3)
ER status				0.737
	Negative	165 (57.5)	43 (60.6)
	Positive	122 (42.5)	28 (39.4)
HER2 status				0.440
	Negative	162 (57.4)	45 (63.4)
	Positive	120 (42.6)	26 (36.6)
Ki-67 index (%)		38.4 ± 18.5	39.4 ± 19.8	0.700
CA15-3 (U/mL)		14.2 ± 15.4	16.6 ± 15.8	0.245
Lesion size at post-NAC MRI (cm)		1.7 ± 1.8	1.6 ± 1.8	0.663


Data are presented as means ± standard deviations or number (%).

BMI = body mass index; ER = estrogen receptor; HER2 = human epidermal growth factor receptor 2; CA15-3 = carbohydrate antigen 15-3; NAC = neoadjuvant chemotherapy; MRI = magnetic resonance imaging.

^*Stratified according to the American Joint Committee on Cancer (AJCC) 7th TNM stage.

The associations between pCR and the 10 variables were assessed (Table 2). Univariate analysis showed that negative ER status, positive HER2 status, lower serum CA15-3 level, and smaller lesion size on post-NAC MRI were significantly associated with pCR. A variable importance plot for the GBM is shown in Figure 1. Lesion size on post-NAC MRI ranked first, followed by HER2 status, ER status, and CA15-3 level.

Table 2. Univariate analysis of pathological complete response in relation to clinicopathological characteristics (entire SNUBH cohort, n = 358).

Characteristics		Non-pCR (n = 224)	pCR (n = 134)	p-value
Age (yr)		49.4 ± 10.5	49.7 ± 10.0	0.812
BMI (kg/m²)		24.0 ± 3.9	23.5 ± 3.2	0.173
Clinical tumor stage^*				0.194
	T1	15 (6.7)	14 (10.4)
	T2	138 (61.6)	89 (66.4)
	T3	51 (22.8)	25 (18.7)
	T4	20 (8.9)	6 (4.5)
Clinical node stage^*				0.200
	N0	48 (21.4)	37 (27.6)
	N1	99 (44.2)	58 (43.3)
	N2	37 (16.5)	25 (18.7)
	N3	40 (17.9)	14 (10.4)
Histologic grade				0.104
	Low	6 (2.7)	1 (0.8)
	Intermediate	87 (39.7)	42 (31.6)
	High	126 (57.5)	90 (67.7)
ER status				< 0.001
	Negative	106 (47.3)	102 (76.1)
	Positive	118 (52.7)	32 (23.9)
HER2 status				< 0.001
	Negative	160 (72.7)	47 (35.3)
	Positive	60 (27.3)	86 (64.7)
Ki-67 index (%)		37.5 ± 18.8	40.4 ± 18.7	0.166
CA15-3 (U/mL)		17.0 ± 18.8	10.8 ± 5.1	< 0.001
Lesion size at post-NAC MRI (cm)		2.1 ± 1.7	0.9 ± 1.7	< 0.001


Data are means ± standard deviations or number (%).

pCR = pathological complete response; BMI = body mass index; ER = estrogen receptor; HER2 = human epidermal growth factor receptor 2; CA15-3 = carbohydrate antigen 15-3; NAC = neoadjuvant chemotherapy; MRI = magnetic resonance imaging.

^*Stratified according to the American Joint Committee on Cancer (AJCC) 7th TNM stage.
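As a worked check of one row of Table 2, the association between ER status and pCR can be tested with a Pearson χ² test on the reported counts. The sketch below is an assumption-laden illustration: the text does not state whether the χ² test or Fisher’s exact test was applied to this variable, no continuity correction is used here, and the function name `chi_square_2x2` is hypothetical.

```python
import math


def chi_square_2x2(table):
    """Pearson chi-square statistic and p-value (1 df) for a 2x2 count table."""
    (a, b), (c, d) = table
    n = a + b + c + d
    row = (a + b, c + d)
    col = (a + c, b + d)
    chi2 = 0.0
    for i, counts in enumerate(table):
        for j, observed in enumerate(counts):
            expected = row[i] * col[j] / n
            chi2 += (observed - expected) ** 2 / expected
    # With 1 degree of freedom, P(X > chi2) = P(|Z| > sqrt(chi2)) = erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Counts from Table 2: rows = ER negative, ER positive; columns = non-pCR, pCR.
er_table = [[106, 102], [118, 32]]
chi2, p = chi_square_2x2(er_table)
print(f"chi2 = {chi2:.2f}, p = {p:.2e}")
```

Under these assumptions the p-value falls well below 0.001, in line with the value reported for ER status in the table.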


Performance of machine learning models

The optimal hyperparameters found by 10-fold cross-validation and the performance of each machine learning model in the test set are shown in Table 3 and Figure 2. GBM showed the largest AUC (0.903; 95% confidence interval [CI], 0.833–0.972) and the highest accuracy (81.7%) in the test cohort. The AUC was 0.898 (95% CI, 0.824–0.972) for RF, 0.801 (95% CI, 0.696–0.906) for SVM, and 0.870 (95% CI, 0.746–0.956) for NN. DT showed the smallest AUC (0.758; 95% CI, 0.640–0.876), and SVM showed the lowest accuracy (71.8%). DeLong’s test comparing the AUC of each model with that of GBM showed that GBM performed better than RF, although the difference was not statistically significant (p = 0.406). In contrast, the AUC of GBM was significantly higher than those of the DT (p = 0.009), SVM (p = 0.002), and NN (p = 0.037) models. The details of the model performance in terms of accuracy, precision, recall, and F1 score are provided in Supplementary Table 1.
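DeLong’s test for two models scored on the same patients can be sketched as follows. This is a minimal pure-Python illustration of the nonparametric approach in [17], not the implementation used in the study; the function names and the toy scores are hypothetical, and labels are assumed to be coded 1 (pCR) and 0 (non-pCR) with at least two cases in each class.

```python
import math
import statistics


def _psi(x, y):
    """Pairwise kernel: 1 if the positive outscores the negative, 0.5 on ties, else 0."""
    return 1.0 if x > y else 0.5 if x == y else 0.0


def _components(pos, neg):
    """Structural components V10 (one per positive) and V01 (one per negative)."""
    v10 = [sum(_psi(x, y) for y in neg) / len(neg) for x in pos]
    v01 = [sum(_psi(x, y) for x in pos) / len(pos) for y in neg]
    return v10, v01


def delong_test(y_true, scores_a, scores_b):
    """Return (auc_a, auc_b, z, two-sided p) for two correlated ROC curves."""
    pos = [i for i, y in enumerate(y_true) if y == 1]
    neg = [i for i, y in enumerate(y_true) if y == 0]
    v10_a, v01_a = _components([scores_a[i] for i in pos], [scores_a[i] for i in neg])
    v10_b, v01_b = _components([scores_b[i] for i in pos], [scores_b[i] for i in neg])
    auc_a, auc_b = statistics.mean(v10_a), statistics.mean(v10_b)
    # Var(AUC_a - AUC_b) equals the sample variance of the paired component
    # differences, scaled by the number of positives and negatives.
    d10 = [a - b for a, b in zip(v10_a, v10_b)]
    d01 = [a - b for a, b in zip(v01_a, v01_b)]
    var = statistics.variance(d10) / len(pos) + statistics.variance(d01) / len(neg)
    if var == 0:
        same = auc_a == auc_b
        z = 0.0 if same else math.copysign(math.inf, auc_a - auc_b)
        return auc_a, auc_b, z, 1.0 if same else 0.0
    z = (auc_a - auc_b) / math.sqrt(var)
    p = math.erfc(abs(z) / math.sqrt(2))
    return auc_a, auc_b, z, p


# Hypothetical toy scores from two models on the same six cases.
y = [1, 1, 1, 0, 0, 0]
model_a = [0.9, 0.8, 0.7, 0.3, 0.2, 0.1]
model_b = [0.9, 0.2, 0.7, 0.3, 0.8, 0.1]
print(delong_test(y, model_a, model_b))
```

Because both models are evaluated on the same cases, the test works on paired differences of the per-case components rather than treating the two AUCs as independent.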

Table 3. Optimal hyperparameters of each machine learning model and DeLong’s test comparing area under the receiver operating characteristic curve to gradient boosting machine in the test set.

Model	Optimal hyperparameter	AUC	Accuracy	p-value
Decision tree	Maximum depth: 5	0.758	0.746	0.009
Support vector machine	Kernel: radial basis function	0.801	0.718	0.002
	C: 1.0
	Gamma: 0.05
Random forest	No. of trees: 150	0.898	0.803	0.406
	Maximum depth: 5
Neural network	No. of hidden layers: 3	0.870	0.746	0.037
	No. of nodes in a layer: 5
Gradient boosting machine	No. of trees: 500	0.903	0.817	-
	Interaction depth: 3


AUC = area under the receiver operating characteristic curve.

External validation of the final model

Internal validation on the test set demonstrated the superior performance of the GBM compared with the other machine learning models. Therefore, we selected the GBM model to assess external generalizability and real-world performance. The results of external validation in the SNUH cohort are shown in Figure 3. The model achieved an AUC of 0.833 (95% CI, 0.800–0.865), with a sensitivity of 72.8% and a specificity of 77.7% at a cut-off value of 0.318. The AUC was slightly lower than that of the internal validation set; however, the difference was not significant (p = 0.387).
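Applying the reported cut-off to predicted probabilities gives the threshold-dependent metrics directly. The sketch below is illustrative only: the function name, the toy inputs, and the convention that a probability greater than or equal to the cut-off counts as predicted pCR are assumptions, and both classes are assumed to be present.

```python
def metrics_at_cutoff(y_true, probs, cutoff=0.318):
    """Sensitivity, specificity, accuracy, precision, and F1 at a probability cut-off."""
    # Assumption: probability >= cutoff is classified as predicted pCR.
    preds = [1 if p >= cutoff else 0 for p in probs]
    tp = sum(1 for y, q in zip(y_true, preds) if y == 1 and q == 1)
    tn = sum(1 for y, q in zip(y_true, preds) if y == 0 and q == 0)
    fp = sum(1 for y, q in zip(y_true, preds) if y == 0 and q == 1)
    fn = sum(1 for y, q in zip(y_true, preds) if y == 1 and q == 0)
    return {
        "sensitivity": tp / (tp + fn),  # also called recall
        "specificity": tn / (tn + fp),
        "accuracy": (tp + tn) / len(y_true),
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "f1": 2 * tp / (2 * tp + fp + fn) if tp else 0.0,
    }


# Hypothetical toy data (1 = pCR, 0 = non-pCR), not study data.
result = metrics_at_cutoff([1, 1, 0, 0, 1], [0.9, 0.2, 0.4, 0.1, 0.35])
print(result)
```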


DISCUSSION

We developed five machine learning models and compared their ability to predict pCR after NAC in patients with breast cancer. Ten routinely available clinicopathological variables were used as predictors, and the GBM model achieved the largest AUC (0.903) and the highest accuracy (81.7%). External validation using patient data from an independent hospital demonstrated the generalizability of our prediction model.

Previous studies have attempted to predict pathological responses to NAC in patients with breast cancer. Several researchers proposed nomograms using preoperative clinicopathological variables (e.g., age, NAC cycles, ER, HER2, and status of lymphovascular invasion) or simple laboratory indices (e.g., clinical tumor stage, lymphocyte-to-monocyte ratio, fibrinogen level, and D-dimer level) to predict pCR after NAC [10,12]. The AUCs ranged from 0.77 to 0.90 [8,11,18], and these nomograms helped stratify patients who would benefit more or less from NAC. However, small sample sizes limited the validation power, and performance declined when the nomograms were tested at other institutions.

Recently, machine learning-based prediction models have been developed with better performance (AUCs ranging from 0.83 to 0.93) [14,19,20]. For example, Kim et al. [14] compared six machine learning algorithms (logistic regression, LASSO regression, SVM-linear, SVM-rbf, RF, and LightGBM) and demonstrated that LightGBM had the highest AUC for predicting pCR in patients with breast cancer treated with NAC followed by curative surgery. Asri et al. [21] compared four machine learning algorithms (SVM, DT, naïve Bayes, and k-nearest neighbors) on the Wisconsin Breast Cancer dataset to assess the best algorithm for breast cancer risk prediction and diagnosis. Although the SVM was the best in terms of precision and error rate, all four algorithms showed an accuracy above 90%. Delen et al. [22] compared two data mining methods (artificial NN and DT) with a statistical method (logistic regression) to develop a prediction model for breast cancer survivability and demonstrated that artificial NN and DT performed better than logistic regression models. Machine learning algorithms can incorporate a large number of input variables, and the set of variables used can be refined through various training methods. Because they can include clinicopathological variables that might be omitted when developing logistic regression-based nomograms, machine learning models tend to achieve better predictive performance. However, machine learning models are more complex and therefore harder to apply at other institutions. Therefore, we intended to develop a prediction model using a machine learning algorithm to increase accuracy while using routinely available variables to maximize generalizability.

We performed a literature review to select clinical and demographic variables known to be associated with pCR. To create an accurate but generalizable model, we excluded institution-specific clinical or radiologic variables and finally selected 10 variables: age, BMI, clinical T and N stages, histologic grade, ER and HER2 status, Ki-67 index, CA15-3, and MRI lesion size after NAC. Previous studies have reported that age, BMI, tumor stage at diagnosis, histologic grade, tumor subtype, and Ki-67 expression are associated with pCR [8,13,23]. Most studies showed that molecular subtype was the most important predictive marker for pCR [24,25], and another study showed that patients with higher BMI more often had unfavorable tumor characteristics, such as hormone receptor-negative status and higher TNM stage, and were less likely to achieve pCR after NAC [26]. The baseline CA15-3 level before NAC also correlated with pCR and was added to our model as an independent predictor [14]. Our study showed that HER2-positive and ER-negative cancers tended to have higher pCR rates. Breast cancer subtypes are categorized based on ER, PR, HER2, and Ki-67 expression, and chemotherapeutic agents are selected according to the cancer subtype. These results reflect a higher response to trastuzumab in the HER2-positive subtype and a lower response to chemotherapy in hormone receptor-positive breast cancer. Feature importance in the GBM model showed that, in addition to radiologic tumor size after NAC and tumor subtype, CA15-3 and BMI contributed more than other tumor characteristics to predicting pCR. Unfortunately, it is difficult to provide a statistical interpretation of the feature importance score, which is a limitation of this study.

To our knowledge, this is the first study to externally validate a machine learning model for predicting pCR following NAC. We demonstrated that when using machine learning algorithms, simple clinical variables were sufficient to create a good prediction model. With the aid of this prediction model, surgeons can perform more breast-conserving surgeries and avoid unnecessary wider excisions. However, to allow real-time predictions, the model should be deployed as a web service in the near future.

Our study had some limitations. Many studies have demonstrated that the response to NAC is largely dependent on the breast cancer subtype. Triple-negative or HER2-positive breast cancer is more sensitive to chemotherapy, with pCR rates of up to 70%, whereas luminal A breast cancer is less sensitive to chemotherapy and less likely to achieve pCR. In our cohort, approximately half of the patients had luminal-type disease, and their pCR rate was less than one-fourth, whereas more than half of the patients with triple-negative or HER2-positive breast cancer achieved pCR. Although we included ER and HER2 statuses as predictive variables, it would be more useful to develop independent prediction models for different subtypes. Finally, our predictive model did not provide information on axillary pCR. However, previous studies have shown that breast pCR strongly correlates with axillary pCR [27]. Although this correlation is more evident in HER2-positive or triple-negative breast cancer than in hormone receptor-positive breast cancer, axillary surgery may be de-escalated in patients with predicted breast pCR.

In conclusion, we demonstrated that when using machine learning algorithms, routinely available clinical and demographic variables were sufficient to predict pCR following NAC. This simple model enables surgeons to minimize the extent of surgery and allows for more breast-conserving surgery. External validation of the model using independent validation samples confirmed its good discriminatory power, demonstrating that this model is applicable in different clinical settings.

Footnotes

Conflict of Interest: The authors declare that they have no competing interests.

Author Contributions:

Conceptualization: Jung JJ, Kim EK, Kang E, Park SY, Shin HC.
Data curation: Jung JJ, Kim SM, Jang M, Yun BL.
Formal analysis: Jung JJ.
Investigation: Shin HC.
Methodology: Jung JJ.
Project administration: Shin HC.
Resources: Kim JH, Kim SH, Suh KJ.
Supervision: Shin HC.
Validation: Jung JJ, Lim C, Han W.
Visualization: Jung JJ.
Writing - original draft: Jung JJ.
Writing - review & editing: Jung JJ, Park SY, Han W, Shin HC.

SUPPLEMENTARY MATERIAL

Supplementary Table 1

Comparison of machine learning models to predict pathologic complete response following neoadjuvant chemotherapy

jbc-26-353-s001.xls (27 KB, xls)

References

1. Schott AF, Hayes DF. Defining the benefits of neoadjuvant chemotherapy for breast cancer. J Clin Oncol. 2012;30:1747–1749. doi: 10.1200/JCO.2011.41.3161.
2. Fisher B, Brown A, Mamounas E, Wieand S, Robidoux A, Margolese RG, et al. Effect of preoperative chemotherapy on local-regional disease in women with operable breast cancer: findings from National Surgical Adjuvant Breast and Bowel Project B-18. J Clin Oncol. 1997;15:2483–2493. doi: 10.1200/JCO.1997.15.7.2483.
3. Huang M, O’Shaughnessy J, Zhao J, Haiderali A, Cortes J, Ramsey S, et al. Evaluation of pathologic complete response as a surrogate for long-term survival outcomes in triple-negative breast cancer. J Natl Compr Canc Netw. 2020;18:1096–1104. doi: 10.6004/jnccn.2020.7550.
4. Huang M, O’Shaughnessy J, Zhao J, Haiderali A, Cortés J, Ramsey SD, et al. Association of pathologic complete response with long-term survival outcomes in triple-negative breast cancer: a meta-analysis. Cancer Res. 2020;80:5427–5434. doi: 10.1158/0008-5472.CAN-20-1792.
5. Cortazar P, Zhang L, Untch M, Mehta K, Costantino JP, Wolmark N, et al. Pathological complete response and long-term clinical benefit in breast cancer: the CTNeoBC pooled analysis. Lancet. 2014;384:164–172. doi: 10.1016/S0140-6736(13)62422-8.
6. Boughey JC, McCall LM, Ballman KV, Mittendorf EA, Ahrendt GM, Wilke LG, et al. Tumor biology correlates with rates of breast-conserving surgery and pathologic complete response after neoadjuvant chemotherapy for breast cancer: findings from the ACOSOG Z1071 (Alliance) Prospective Multicenter Clinical Trial. Ann Surg. 2014;260:608–614. doi: 10.1097/SLA.0000000000000924.
7. von Minckwitz G, Untch M, Blohmer JU, Costa SD, Eidtmann H, Fasching PA, et al. Definition and impact of pathologic complete response on prognosis after neoadjuvant chemotherapy in various intrinsic breast cancer subtypes. J Clin Oncol. 2012;30:1796–1804. doi: 10.1200/JCO.2011.38.8595.
8. Colleoni M, Bagnardi V, Rotmensz N, Viale G, Mastropasqua M, Veronesi P, et al. A nomogram based on the expression of Ki-67, steroid hormone receptors status and number of chemotherapy courses to predict pathological complete remission after preoperative chemotherapy for breast cancer. Eur J Cancer. 2010;46:2216–2224. doi: 10.1016/j.ejca.2010.04.008.
9. Rouzier R, Pusztai L, Delaloge S, Gonzalez-Angulo AM, Andre F, Hess KR, et al. Nomograms to predict pathologic complete response and metastasis-free survival after preoperative chemotherapy for breast cancer. J Clin Oncol. 2005;23:8331–8339. doi: 10.1200/JCO.2005.01.2898.
10. Zhang J, Xiao L, Pu S, Liu Y, He J, Wang K. Can we reliably identify the pathological outcomes of neoadjuvant chemotherapy in patients with breast cancer? Development and validation of a logistic regression nomogram based on preoperative factors. Ann Surg Oncol. 2021;28:2632–2645. doi: 10.1245/s10434-020-09214-x.
11. Kim SY, Cho N, Choi Y, Lee SH, Ha SM, Kim ES, et al. Factors affecting pathologic complete response following neoadjuvant chemotherapy in breast cancer: development and validation of a predictive nomogram. Radiology. 2021;299:290–300. doi: 10.1148/radiol.2021203871.
12. Zhang F, Huang M, Zhou H, Chen K, Jin J, Wu Y, et al. A nomogram to predict the pathologic complete response of neoadjuvant chemotherapy in triple-negative breast cancer based on simple laboratory indicators. Ann Surg Oncol. 2019;26:3912–3919. doi: 10.1245/s10434-019-07655-7.
13. Hwang HW, Jung H, Hyeon J, Park YH, Ahn JS, Im YH, et al. A nomogram to predict pathologic complete response (pCR) and the value of tumor-infiltrating lymphocytes (TILs) for prediction of response to neoadjuvant chemotherapy (NAC) in breast cancer patients. Breast Cancer Res Treat. 2019;173:255–266. doi: 10.1007/s10549-018-4981-x.
14. Kim JY, Jeon E, Kwon S, Jung H, Joo S, Park Y, et al. Prediction of pathologic complete response to neoadjuvant chemotherapy using machine learning models in patients with breast cancer. Breast Cancer Res Treat. 2021;189:747–757. doi: 10.1007/s10549-021-06310-8.
15. Hylton NM, Blume JD, Bernreuter WK, Pisano ED, Rosen MA, Morris EA, et al. Locally advanced breast cancer: MR imaging for prediction of response to neoadjuvant chemotherapy--results from ACRIN 6657/I-SPY TRIAL. Radiology. 2012;263:663–672. doi: 10.1148/radiol.12110748.
16. Bergstra J, Bengio Y. Random search for hyper-parameter optimization. J Mach Learn Res. 2012;13:281–305.
17. DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44:837–845.
18. Rouzier R, Pusztai L, Garbay JR, Delaloge S, Hunt KK, Hortobagyi GN, et al. Development and validation of nomograms for predicting residual tumor size and the probability of successful conservative surgery with neoadjuvant chemotherapy for breast cancer. Cancer. 2006;107:1459–1466. doi: 10.1002/cncr.22177.
19. Tahmassebi A, Wengert GJ, Helbich TH, Bago-Horvath Z, Alaei S, Bartsch R, et al. Impact of machine learning with multiparametric magnetic resonance imaging of the breast for early prediction of response to neoadjuvant chemotherapy and survival outcomes in breast cancer patients. Invest Radiol. 2019;54:110–117. doi: 10.1097/RLI.0000000000000518.
20. Basmadjian RB, Kong S, Boyne DJ, Jarada TN, Xu Y, Cheung WY, et al. Developing a prediction model for pathologic complete response following neoadjuvant chemotherapy in breast cancer: a comparison of model building approaches. JCO Clin Cancer Inform. 2022;6:e2100055. doi: 10.1200/CCI.21.00055.
21. Asri H, Mousannif H, Al Moatassime H, Noel T. Using machine learning algorithms for breast cancer risk prediction and diagnosis. Procedia Comput Sci. 2016;83:1064–1069.
22. Delen D, Walker G, Kadam A. Predicting breast cancer survivability: a comparison of three data mining methods. Artif Intell Med. 2005;34:113–127. doi: 10.1016/j.artmed.2004.07.002.
23. Von Minckwitz G, Untch M, Nüesch E, Loibl S, Kaufmann M, Kümmel S, et al. Impact of treatment characteristics on response of different breast cancer phenotypes: pooled analysis of the German neo-adjuvant chemotherapy trials. Breast Cancer Res Treat. 2011;125:145–156. doi: 10.1007/s10549-010-1228-x.
24. Houssami N, Macaskill P, von Minckwitz G, Marinovich ML, Mamounas E. Meta-analysis of the association of breast cancer subtype and pathologic complete response to neoadjuvant chemotherapy. Eur J Cancer. 2012;48:3342–3354. doi: 10.1016/j.ejca.2012.05.023.
25. Haque W, Verma V, Hatch S, Suzanne Klimberg V, Brian Butler E, Teh BS. Response rates and pathologic complete response by breast cancer molecular subtype following neoadjuvant chemotherapy. Breast Cancer Res Treat. 2018;170:559–567. doi: 10.1007/s10549-018-4801-3.
26. Litton JK, Gonzalez-Angulo AM, Warneke CL, Buzdar AU, Kau SW, Bondy M, et al. Relationship between obesity and pathologic response to neoadjuvant chemotherapy among women with operable breast cancer. J Clin Oncol. 2008;26:4072–4077. doi: 10.1200/JCO.2007.14.4527.
27. Samiei S, van Nijnatten TJ, de Munck L, Keymeulen KB, Simons JM, Kooreman LF, et al. Correlation between pathologic complete response in the breast and absence of axillary lymph node metastases after neoadjuvant systemic therapy. Ann Surg. 2020;271:574–580. doi: 10.1097/SLA.0000000000003126.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Table 1

Comparison of machine learning models to predict pathologic complete response following neoadjuvant chemotherapy

jbc-26-353-s001.xls^{(27KB, xls)}
