The impact of chemotherapy and survival prediction by machine learning in early Elderly Triple Negative Breast Cancer (eTNBC): a population based study from the SEER database

Kaiyan Huang; Jie Zhang; Yushuai Yu; Yuxiang Lin; Chuangui Song

doi:10.1186/s12877-022-02936-5

. 2022 Apr 1;22:268. doi: 10.1186/s12877-022-02936-5

The impact of chemotherapy and survival prediction by machine learning in early Elderly Triple Negative Breast Cancer (eTNBC): a population based study from the SEER database

Kaiyan Huang ^1,^#, Jie Zhang ^1,^2,^3,^#, Yushuai Yu ^1,^#, Yuxiang Lin ^1,^2,³, Chuangui Song ^1,^2,^3,^✉

PMCID: PMC8973884 PMID: 35361134

Abstract

Purpose

We aimed to analysis the impact of chemotherapy and establish prediction models of prognosis in early elderly triple negative breast cancer (eTNBC) by using machine learning.

Methods

We enrolled 4,696 patients in SEER Database who were 70 years or older, diagnosed with primary early TNBC(larger than 5 mm), from 2010 to 2016. The propensity-score matched method was utilized to reduce covariable imbalance. Univariable and multivariable analyses were used to compare breast cancer-specific survival(BCSS) and overall survival(OS). Nine models were developed by machine learning to predict the 5-year OS and BCSS for patients received chemotherapy.

Results

Compared to matched patients in no-chemotherapy group, multivariate analysis showed a better survival in chemotherapy group. Stratified analyses by stage demonstrated that patients with stage II and stage III other than stage I could benefit from chemotherapy. Further investigation in stage II found that chemotherapy was a better prognostic indicator for patients with T2N0M0 and stage IIb, but not in T1N1M0. Patients with grade III could achieve a better survival by receiving chemotherapy, but those with grade I and II couldn’t. With 0.75 in 5-year BCSS and 0.81 in 5-year OS for AUC, the LightGBM outperformed other algorithms.

Conclusion

For early eTNBC patients with stage I, T1N1M0 and grade I-II, chemotherapy couldn’t improve survival. Therefore, de-escalation therapy might be appropriate for selected patients. The LightGBM is a trustful model to predict the survival and provide precious systemic treatment for patients received chemotherapy.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12877-022-02936-5.

Keywords: Elderly triple negative breast cancer, Breast cancer-specific survival, Overall survival, SEER database, Machine learning

Introduction

Breast cancer is presently the commonest malignant tumor [1]. With the extension of average life expectancy of women, nearly 30%-40% of breast cancer patients were over 70 years old when they were initially diagnosed [2]. Moreover, by 2030, the number of elderly breast cancer might increase 57% in expectation [3]. However, due to the lack of clinical trial data for aged patients (enrollment rate not more than 20%), the management of senile breast cancer is still controversial, especially chemotherapy [4].

According to previous reports, the proportion of the elders who receive chemotherapy is much lower than that of the young patients, which may influence the prognosis in elders, particularly in triple negative breast cancer [5, 6]. A study based on SEER data-base showed higher breast cancer specific mortality in elderly triple negative breast cancer (eTNBC) when compared to younger cohort [7]. However, this difference is not significant when both of them received adjuvant treatment. Although the tumor biological characteristics in elders is higher hormone receptor(HR) expression, and less human epidermal growth factor receptor 2(Her2) expression, nearly 5% of them are eTNBC and account for 10%-20% TNBC in all ages [7–9]. Therefore, it is urgent to investigate the effect of chemotherapy in eTNBC and explore who could benefit from chemotherapy. There were two studies focused on this topic by using National Cancer Database (Jennifer A Crozier) and population from Swedish(Slavica Janeva) [9, 10]. Both of them recommended chemotherapy in general population. Nevertheless, there were still some insufficiencies in both studies. Neither of them compared the efficacy of chemotherapy in subgroups other than lymph node stage, for example, tumor size, different age groups, clinical stage and so on. Hence, we could not know who is more needed to receive chemotherapy. Even for the analysis of different nodal status, there is a contradiction. Jennifer draw a conclusion that chemotherapy should be recommended without considering nodal status. Yet, Slavica only found better results in nodal negative patients. Besides disease stage which is also an important predictor in elderly patients, the benefit of chemotherapy in elders depends on several added factors, such as comorbidities, tolerance to toxicities, heterogeneity in health, and expected life expectancy [11–14]. The discrete-time stochastic state transition simulation model showed that the benefit of chemotherapy is more common in the patients with higher risk, fewer comorbidities and longer expected survival [15]. Since life expectancy continues to improving in recent years, more patients might benefit from chemotherapy [16].

In order to make up for the insufficient evidence, we use the data from SEER data-base to conduct an analysis of the efficacy of chemotherapy in elderly TNBC by using propensity score matched(PSM). In addition, we investigated it in different subgroups according to stage, tumor size, lymph node status and histological grade. We also establish prediction models of the survival time of eTNBC by using machine learning. We believe our study may be helpful to predict the population benefit from chemotherapy in elderly breast cancer by combining the above methods.

Materials and methods

Ethics approval and consent to participate

Considering SEER database is publicly available and does not require informed patient consent. So, we did not need to get patient consent and exempt from Institutional Review Board approval. We signed a Data-Use Agreement for the SEER 1973–2016 Research Data File to get access conditions.

Data source and study population

We used SEER*Stat version 8.3.8 to generate a case list. We enrolled 4,696 patients according to the following inclusion criteria: female; year of diagnosis from 2010 to 2016; age of diagnosis ≥ 70 years; breast carcinoma as the only primary malignant cancer diagnosis; American Joint Committee on Cancer (AJCC) sixth edition stage I-III; tumor larger than 5 mm in diameter; triple negative subtype. Patients who present with distant metastasis, in situ disease were expelled from the study. We defined two patient groups according to the Chemotherapy recode in SEER database: chemotherapy group or no-chemotherapy group. We calculated follow-up durations from January 1, 2010 to December 31, 2016. Patient characteristics and treatment courses in our study were identified, including age, race, marital status, surgery approach, chemotherapy status and radiotherapy status. Tumor characteristics included grade, AJCC stage, tumor status and nodal status.

Outcome measurement

In our study, breast cancer-specific survival (BCSS) was used as a primary study outcome. It was calculated from the date of diagnosis to the date of death due to breast cancer. However, we should notice that if a patient died from other causes, the end date of her being followed-up in a BCSS analysis was the day of the last contact, the date of death from other causes, or the end of this study. Overall survival (OS), served as a secondary outcome, was defined as from the date of diagnosis to the date of death or was censored at the last follow-up date. Patients being lost to follow-up or survived at the end of the follow-up period were censored. If a patient still alive at the end of the follow-up period, the follow-up duration was calculated from the date of diagnosis to the end of this study. If a patient was lost to follow-up, the follow-up duration was calculated from the date of diagnosis to the day of the last contact.

Statistical analysis

The chi-square test was conducted to describe the demographic and clinical characteristics of the chemotherapy and no-chemotherapy cases, in both the whole groups and 1:1 PSM groups. The Kaplan–Meier method was utilized to generate the survival curves. The log-rank test was conducted to identify whether the differences in BCSS or OS rates between chemotherapy patients and no-chemotherapy patients was statistically significant. Hazard ratio (HR) with 95% confidence intervals (CI) was calculated by using a Cox proportional hazard regression model to determine the outcome-related factors. Factors with a P-value of 0.05 or less in univariate analysis were included as candidate variables in the multivariate analysis. Proportional hazard assumptions were examined the by Schoenfeld residuals test. For the variables fail to meet the proportional hazards assumption, we conducted time-dependent covariate analysis to minimize the potential bias. In order to reduce the influences of baseline differences in demographic and clinical characteristics on outcome differences, 1:1 PSM method was performed to match patients in chemotherapy group and no-chemotherapy group. Covariables included in propensity score matching were age, race, marital status, grade, AJCC stage, tumor status, nodal status, surgery approach and radiation status. The two groups of patients were matched one to one by nearest-neighbor matching with a 0.1 caliper distance.

Before building machine learning models, all patients in the chemotherapy group were randomly divided to 2 sets, a training set and a testing set, at a 8:2 ratio. In the training set, K-nearest neighbor, CatBoost, decision tree, random forest method, Gradient Boost, LightGBM, neural network models, support vector machine and XGBoost models were developed to predict the 5-year BCSS and OS for patients in the chemotherapy group. The performance of these models was evaluated by ten‐fold cross validation.

These statistical analyses were conducted by using R software version 3.6.1 and Python Version 3.8. All statistical analyses were two-sided, and a P value of less than 0.05 was considered as a significance level.

Results

Demographics and clinical characteristics of the study population

Overall, 4,696 eligible patients were enrolled in our study, including 2,122 patients belonged to chemotherapy group and 2,574 patients belonged to no-chemotherapy group. The median follow-up time was 27 months. The baseline characteristics of the chemotherapy group and no-chemotherapy group were summarized in Table 1. There were significant differences in characteristics between two groups, including age, marital status, grade, AJCC stage, tumor status, nodal status and radiation status. The patients treated with chemotherapy presented a higher proportion of younger age (70–79 years old, 86.5% vs. 48.8%, p < 0.001), married status (married, 51.1% vs. 37.3%, p < 0.001), and grade III (81.7% vs. 72.5%, p < 0.001). A lower proportion of patients in chemotherapy group presented AJCC stage I, T1 stage and N0 stage (AJCC stage I, 33.3% vs. 51.2%, p < 0.001; T1 stage, 42.9% vs. 55.7%, p < 0.001; N0 stage, 60.8% vs. 79.2%, p < 0.001, respectively). In addition, the chemotherapy group were inclined to accept radiotherapy than no-chemotherapy group (55.6% vs. 39.4%, p < 0.001). Other characteristics, including race and surgery approach, were similarly distributed between two groups.

Table 1.

Baseline characteristics of patients with chemotherapy and no-chemotherapy

Characteristics		No-Chemotherapy (n = 2574)		Chemotherapy (n = 2122)		Total (n = 4696)		P^c
Characteristics		No	%	No	%	No	%	P^c
Median follow-up (months) (IQR)		27(12–49)		27(12–47)		27(12–48)
Age (years)	70–79	1257	48.8	1836	86.5	3093	65.9	< 0.001
Age (years)	80 +	1317	51.2	286	13.5	1603	34.1
Race	White	2003	77.8	1636	77.1	3639	77.5	0.238
	Black	383	14.9	348	16.4	731	15.6
	Other^a	188	7.3	138	6.5	326	6.9
Marital status	Married	959	37.3	1085	51.1	2044	43.5	< 0.001
Marital status	Not married^b	1615	62.7	1037	48.9	2652	56.5
Grade	I and II	707	27.5	388	18.3	1095	23.3	< 0.001
Grade	III	1867	72.5	1734	81,7	3601	76.7
AJCC stage	I	1317	51.2	706	33.3	2023	43.1	< 0.001
	II	960	37.3	999	47.1	1959	41.7
	III	297	11.5	417	8.9	714	15.2
Tumor status	T1	1435	55.7	910	42.9	2345	49.9	< 0.001
	T2	876	34.0	919	43.3	1795	38.2
	T3	150	5.8	148	7.0	298	6.3
	T4	113	4.4	145	6.8	258	5.5
Nodal status	N0	2039	79.2	1290	60.8	3329	70.9	< 0.001
	N1	357	13.9	556	26.2	913	19.4
	N2	104	4.0	172	8.1	276	5.9
	N3	74	2.9	104	4.9	178	3.8
Surgery approach	No surgery	150	5.8	107	5.0	257	5.5	0.239
	Mastectomy and BCS	2424	94.2	2015	95.0	4439	94.5
Radiation status	Yes	1013	39.4	1179	55.6	2192	46.7	< 0.001
	No	1561	60.6	943	44.4	2504	53.3

Open in a new tab

Abbreviation: AJCC American Joint Committee on Cancer, BCS Breast-conserving surgery, IQR Interquartile range

^aOther includes American Indian/Alaskan native and Asian/Pacific Islander and Unknown

^bNot married includes divorced, separated, single (never married), unmarried or domestic partner, and widowed

^cThe P value of the Chi-square test was calculated between the chemotherapy and no-chemotherapy groups, and bold type indicates significance

Comparison of survival between chemotherapy group and no-chemotherapy group

The univariate Cox regression analysis for each variable was shown in Table S1. Compared to the survival of overall patients in no-chemotherapy group, the result of multivariate analysis shown in Table 2 revealed a better survival in patients received chemotherapy, according to BCSS and OS (HR = 0.656, 95% CI = 0.553–0.779, p < 0.001; HR = 0.561, 95% CI = 0.488–0.644, p < 0.001, respectively). We conducted 1:1 PSM analysis between patients in two groups to lower the effects of bias. Finally, we obtained a group with 2,660 patients, and each subgroup included 1,330 patients. As shown in Table 3, we performed the chi-square test for matched dataset. The P values for each covariables are more than 0.05, which indicates the propensity score overlapped well between the two groups of patients.

Table 2.

Multivariate Cox proportional hazard model of breast cancer-specific survival (BCSS) and overall survival (OS) in all patients

Variables		BCSS		OS
Variables		HR (95% CI)	P	HR (95% CI)	P
Age (years)	70–79	Reference		Reference
Age (years)	80 +	1.315(1.111–1.557)	0.001	1.629(1.429–1.856)	< 0.001
Marital status	Married	Reference		Reference
Marital status	Not married^a	1.062(0.902–1.250)	0.472	1.120 (0.985–1.272)	0.084
Race	White	Reference		Reference
	Black	1.068 (0.875–1.305)	0.516	1.070 (0.914–1.252)	0.399
	Other^b	0.706 (0.507–0.983)	0.039	0.709 (0.549–0.916)	0.008
Grade	I and II	Reference		Reference
Grade	III	1.510 (1.222–1.865)	< 0.001	1.344(1.152–1.568)	< 0.001
Stage	I	Reference		Reference
	II	3.982 (3.137–5.055)	< 0.001	2.602(2.221–3.048)	< 0.001
	III	11.609(9.015–14.949)	< 0.001	6.528(5.468–7.793)	< 0.001
Surgery approach	No surgery	Reference		Reference
	Mastectomy and BCS	0.246(0.198–0.304)	< 0.001	0.293(0.245–0.351)	< 0.001
Radiation	No	Reference		Reference
Radiation	Yes	0.626 (0.529–0.741)	< 0.001	0.565 (0.495–0.645)	< 0.001
Chemotherapy	No	Reference		Reference
Chemotherapy	Yes	0.656 (0.553–0.779)	< 0.001	0.561(0.488–0.644)	< 0.001

Open in a new tab

Abbreviation: 70–79 70–79 years old, 80 + More than 80 years old, BCS Breast Conserving Surgery, HR Hazard ratio

^aNot married includes divorced, separated, single (never married), unmarried or domestic partner, and widowed

^bOther includes American Indian/Alaskan native and Asian/Pacific Islander and Unknown. Bold type indicates significance

Table 3.

Baseline characteristics of patients with chemotherapy and no-chemotherapy in PSM group

Characteristics		No- Chemotherapy (n = 1330)		Chemotherapy (n = 1330)		Total (n = 2660)		P^c
Characteristics		No	%	No	%	No	%	P^c
Median follow-up (months) (IQR)		25(9–49)		26(11–48)		26(10–48.75)
Age (years)	70–79	1046	78.6	1055	79.3	2101	79.0	0.703
Age (years)	80 +	284	21.4	275	20.7	559	21.0
Race	White	982	73.8	982	73.8	1964	73.8	1.000
	Black	248	18.7	248	18.7	496	18.7
	Other^a	100	7.5	100	7.5	200	7.5
Marital status	Married	608	45.7	632	47.5	1240	46.6	0.371
Marital status	Not married^b	722	54.3	698	52.5	1420	53.4
Grade	I and II	259	19.5	287	21.6	546	20.5	0.195
Grade	III	1071	80.5	1043	78.4	2114	79.5
AJCC stage	I	647	48.6	665	50.0	1312	49.3	0.645
	II	494	37.1	491	36.9	985	37.0
	III	189	7.1	174	13.1	363	13.6
Tumor status	T1	727	54.7	723	54.4	1450	54.5	0.258
	T2	453	34.1	436	32.8	889	33.4
	T3	83	6.2	80	6.0	163	6.1
	T4	67	5.0	91	6.8	158	5.9
Nodal status	N0	990	74.4	1030	77.4	2020	75.9	0.146
	N1	218	16.4	197	14.8	415	15.6
	N2	67	5.0	66	4.9	133	5.0
	N3	55	4.1	37	2.9	92	3.5
Surgery approach	No surgery	70	5.3	67	5.0	137	5.2	0.861
	Mastectomy and BCS	1260	94.7	1263	95.0	2523	94.8
Radiation	Yes	610	45.9	638	48.0	1248	46.9	0.294
Radiation	No	720	54.1	692	52.0	1412	53.1

Open in a new tab

Abbreviation: AJCC American Joint Committee on Cancer; BCS, breast-conserving surgery, IQR Interquartile range

^aOther includes American Indian/Alaskan native and Asian/Pacific Islander and Unknown

^bNot married includes divorced, separated, single (never married), unmarried or domestic partner, and widowed

^cThe P value of the Chi-square test was calculated between the chemotherapy and no-chemotherapy groups, and bold type indicates significance

In matched population, patients could significantly benefit from chemotherapy (BCSS, HR = 0.612, 95% CI = 0.493–0.759, p < 0.001; OS HR = 0.549, 95% CI = 0.459–0.655, p < 0.001, shown in Table 4). To investigate the effects of chemotherapy on patients with different subgroups, we stratified the patients by specific clinical features. We examined the proportional hazard assumptions for all subgroups. The results of the Schoenfeld residuals test for each subgroup were shown in Table S2 -Table S5. For the variables that fail to meet the proportional hazards assumption, we conducted time-dependent covariate analysis to minimize the potential bias. The subgroups, in which we conducted time-dependent covariate analysis, were specifically marked with asterisks.

Table 4.

Comparison of breast cancer-specific survival (BCSS) and overall survival (OS) between matched patients with chemotherapy and no-chemotherapy in specific stage

Stage	BCSS			OS
Stage	Events No	HRs (95%CI)	P ^a	Events No	HRs (95%CI)	P ^a
Stage I ( n = 1,312)	53		0.932	103		0.111
Chemotherapy		1.024(0.595–1.764)			0.723(0.485–1.078)*
No-Chemotherapy		Reference			Reference
Stage II ( n = 985)	164		0.001	247		< 0.001
Chemotherapy		0.564(0.408–0.779)			0.522(0.400–0.682)*
No-Chemotherapy		Reference			Reference
Stage III ( n = 363)	142		0.001	192		< 0.001
Chemotherapy		0.549(0.386–0.781)			0.537(0.395–0.728)*
No-Chemotherapy		Reference			Reference
Stage I-III ( n = 2,660)	359		< 0.001	542		< 0.001
Chemotherapy		0.612(0.493–0.759)*			0.549(0.459–0.655)*
No-Chemotherapy		Reference			Reference

Open in a new tab

Abbreviation: HR Hazard ratio, CI Confidence interval, BCSS Breast cancer-specific survival, OS Overall survival, Events No Number of events

^aP value was adjusted by a multivariate Cox proportional hazard regression model or a time-dependent covariate analysis. Bold type indicates significance

^bThe groups using time-dependent covariate analysis were specifically marked with asterisks(*)

For the purpose of investigating the effects of chemotherapy on patients with different stages, we categorized the patients into stage I, stage II and stage III. The survival curves and results are shown in Fig. 1 and Table 4. As expected, chemotherapy didn’t lower the risk of cancer-specific mortality and all-cause mortality in stage I cohor (BCSS, HR = 1.024, 95% CI = 0.595–1.764, p = 0.932; OS HR = 0.723, 95% CI = 0.485–1.078, p = 0.111, respectively). But patients diagnosed with stage II can benefit from chemotherapy (BCSS, HR = 0.564, 95% CI = 0.408–0.779, p = 0.001; OS HR = 0.522, 95% CI = 0.400–0.682, p < 0.001, respectively). We observed similar phenomena in the stage III cohort(BCSS, HR = 0.549, 95% CI = 0.386–0.781, p < 0.001; OS HR = 0.537, 95% CI = 0.395–0.728, p < 0.001, respectively). But what made us curious was whether chemotherapy could be skipped in part of patients in stage II. We further investigated the effects of tumor status and nodal status in stage II cases between the two groups. As showcased in Fig. 2 and Table 5, we found that chemotherapy was a better prognostic indicator for patients with T2N0M0 (BCSS, HR = 0.420, 95% CI = 0.261–0.675, p < 0.001; OS HR = 0.361, 95% CI = 0.243–0.536, p < 0.001, respectively), but not for T1N1M0 (BCSS, HR = 0.778, 95% CI = 0.249–2.432, p = 0.666; OS HR = 1.072, 95% CI = 0.458–2.508, p = 0.872, respectively). The OS of stage IIb patients who were treated with chemotherapy was better than that without (HR = 0.640, 95% CI = 0419–0.978, p = 0.039). However, no significant difference in BCSS level was detected in patients who received chemotherapy compared with those who did not (HR = 0.767, 95% CI = 0.454–1.296, p = 0.321).

Fig. 1 — Kaplan–Meier survival curves of the effect of chemotherapy on BCSS (A–D) and OS (E–H) stratified by stage

Fig. 2 — Kaplan–Meier survival curves of the effect of chemotherapy on BCSS (A-E) and OS (F-J) stratified by T stage, N stage and tumor grade

Table 5.

Comparison of breast cancer-specific survival (BCSS) and overall survival (OS) between matched patients with chemotherapy and no-chemotherapy in specific clinical variables

Variables	BCSS			OS
Variables	Events No	HRs (95%CI)	P^a	Events No	HRs (95%CI)	P^a
T1N1M0 (n = 109)	21		0.666	34		0.872
Chemotherapy		0.778(0.249–2.432)			1.072(0.458–2.508)
No-Chemotherapy		Reference			Reference
T2N0M0 (n = 595)	80		< 0.001	123		< 0.001
Chemotherapy		0.420(0.261–0.675)			0.361(0.243–0.536)
No-Chemotherapy		Reference			Reference
Stage IIb (n = 283)	63		0.321	90		0.039
Chemotherapy		0.767(0.454–1.296)*			0.640(0.419–0.978)
No-Chemotherapy		Reference			Reference
Grade I&II (n = 546)	53		0.387	81		0.306
Chemotherapy		0.781(0.445–1.368)			0.790(0.503–1.240)*
No-Chemotherapy		Reference			Reference
Grade III (n = 2,114)	306		< 0.001	461		< 0.001
Chemotherapy		0.559(0.441–0.708)*			0.505(0.415–0.615)*
No-Chemotherapy		Reference			Reference

Open in a new tab

Abbreviation: HR Hazard ratio, CI Confidence interval, BCSS Breast cancer-specific survival, OS Overall survival, Events No Number of events

^aP value was adjusted by a multivariate Cox proportional hazard regression model or a time-dependent covariate analysis. Bold type indicates significance

^bThe groups using time-dependent covariate analysis were specifically marked with asterisks(*)

Histological grade is one of the fundamental features to describe breast cancer. For patients with grade I and grade II, no statistical survival differences were identified between chemotherapy and no-chemotherapy patients (BCSS, HR = 0.781, 95% CI = 0.445–1.368, p = 0.387; OS HR = 0.790, 95% CI = 0.503–1.240, p = 0.306). While for patients with grade III, the chemotherapy patients demonstrated a better prognosis than no-chemotherapy patients in terms of both BCSS and OS (HR = 0.559, 95% CI = 0.441–0.708, p < 0.001; HR = 0.505, 95% CI = 0.415–0.615, p < 0.001, respectively).

Machine-learning based outcome prediction in patients received chemotherapy

With respect to the nine algorithms for 5-year BCSS and 5-year OS, the performance metrics of the algorithms are presented in Table 6. The Table 7 showed the resulting confusion matrix. On average, the accuracy was 0.886 on 5-year BCSS and 0.857 on 5-year OS. The average precision of the examined ten algorithms was 0.888 on 5-year BCSS and 0.863 on 5-year OS. Similarly, the average sensitivity was 0.981 on 5-year BCSS and 0.969 on 5-year OS. There was average F1 score of 0.932 on 5-year BCSS and 0.913 on 5-year OS. In terms of the area under receiving operating characteristics curve (AUC), the highest AUC was observed in LightGBM. For predicting the 5-year BCSS, LightGBM achieved 0.882 accuracy, 0.887 precision, 0.991 sensitivity, 0.936 F1 score and 0.75 AUC. For 5-year OS, the parameters were 0.851, 0.859, 0.983,0.916 and 0.81 for accuracy, precision, sensitivity, F1 score and AUC, respectively. Considering all the parameters above, the LightGBM outperformed all other algorithms. The score of importance of each variable used in LightGBM was illustrated in Fig. 3, which demonstrated that the stage was the most relevant variables to explain the BCSS and OS. This model could provide more precious systemic treatments guidance and support for reducing overtreatment that may be present for patients with early eTNBC.

Table 6.

Model performance for 5-year BCSS and 5-year OS

Algorithms	Accuracy	Precision	Sensitivity	F1 score	AUC
5-year BCSS
K-nearest neighbor	0.879	0.882	0.98	0.928	0.70
Catboost	0.905	0.892	0.974	0.932	0.69
Decision tree	0.908	0.901	0.949	0.924	0.61
Random forest	0.869	0.889	0.971	0.929	0.70
Gradient booster	0.882	0.887	0.991	0.936	0.75
LightGBM	0.882	0.887	0.991	0.936	0.75
Neural network model	0.886	0.877	1.0	0.934	0.75
Support vector machine	0.882	0.887	0.991	0.936	0.51
XGBoost	0.879	0.892	0.98	0.934	0.70
5-year OS
K-nearest neighbor	0.844	0.857	0.952	0.902	0.73
Catboost	0.877	0.86	0.977	0.915	0.76
Decision tree	0.882	0.869	0.940	0.903	0.69
Random forest	0.837	0.864	0.954	0.907	0.72
Gradient booster	0.849	0.855	0.985	0.916	0.80
LightGBM	0.851	0.859	0.983	0.916	0.81
Neural network model	0.86	0.877	0.949	0.911	0.79
Support vector machine	0.854	0.854	0.994	0.919	0.70
XGBoost	0.865	0.868	0.988	0.924	0.79

Open in a new tab

Abbreviation: AUC Area Under Curve

Table 7.

Confusion matrix of nine algorithms for 5-year BCSS and 5-year OS

Algorithms		Predictions		Algorithms		Predictions
Algorithms		Dead	Alive	Algorithms		Dead	Alive
5-year BCSS				5-year OS
K-nearest neighbor	Dead	3	47	K-nearest neighbor	Dead	15	56
K-nearest neighbor	Alive	7	350	K-nearest neighbor	Alive	17	337
Catboost	Dead	8	42	Catboost	Dead	15	56
Catboost	Alive	9	348	Catboost	Alive	8	346
Decision tree	Dead	13	37	Decision tree	Dead	21	50
Decision tree	Alive	18	339	Decision tree	Alive	21	333
Random forest	Dead	7	43	Random forest	Dead	18	53
Random forest	Alive	10	347	Random forest	Alive	16	338
Gradient booster	Dead	5	45	Gradient booster	Dead	12	59
Gradient booster	Alive	3	354	Gradient booster	Alive	5	349
LightGBM	Dead	5	45	LightGBM	Dead	14	57
LightGBM	Alive	3	354	LightGBM	Alive	6	348
Neural network model	Dead	0	50	Neural network model	Dead	24	47
Neural network model	Alive	0	357	Neural network model	Alive	18	336
Support vector machine	Dead	5	45	Support vector machine	Dead	11	60
Support vector machine	Alive	3	354	Support vector machine	Alive	2	352
XGBoost	Dead	8	42	XGBoost	Dead	18	53
XGBoost	Alive	7	350	XGBoost	Alive	4	350

Open in a new tab

Fig. 3 — The importance score of predictor variables in predicting 5-year BCSS (A) and 5-year OS (B)

Discussion

eTNBC is a special group of patients who are considered to have more indolent tumor behavior but higher risk of disease specific mortality when compared with younger patients [7, 17–20]. Insufficient treatment is considered as one of the reasons for this phenomenon. Surgery, chemotherapy and radiotherapy are the three primary treatments for eTNBC. Using PSM methods and multivariable regression in our study, we found all of them could reduce the disease-specific mortality and all-cause mortality in entire cohort. Compared with surgery and radiotherapy, chemotherapy leads to the most controversial. It was regarded as a double-edge sword in eTNBC due to its exclusive anti-tumor effect and high incidence of side-effects. Therefore, understanding how to optimally manage chemotherapy in eTNBC is increasingly important.

Several studies have focused on this topic. In accordance with us, most of them confirmed that chemotherapy could increase the survival rate in the general population of eTNBC [9, 10, 21, 22]. Meanwhile, they also pointed out the benefit was mainly observed in individuals who have low competing risks and who have high recurrent risks. However, presently there is no clear definition of high recurrent risk which can indicate the need of chemotherapy in eTNBC. Based on previous research conclusions, the status of lymph node was once regarded as a key point, among all clinical risk factors. Nearly all of those studies demonstrated the benefit of chemotherapy was only achieved in lymph nodes positive group [9, 10, 21, 22]. However, other clinicopathological factors have rarely been evaluated in those studies. Could different loads of lymph node metastasis, from only one involvement to more than nine, be put in the same planning, not to mention the difference of tumor burden, histological grades or other factors? In order to explore the value of chemotherapy in eTNBC with different clinicopathological characteristics, we investigated the efficacy of chemotherapy in more detailed subgroups.

Firstly, we found chemotherapy was necessary in patients with stage II and stage III, but not in stage I. Our conclusion is slightly different from that of Margaret M. Kozak who also worked based on SEER database [7]. They announced that people with stage II obtained greatest benefit from chemotherapy but not in stage III. And they believed this phenomenon was related to less intensive chemotherapy which was not effective enough in stage III TNBC. However, we could not agree with this conclusion. At first, SEER did not provide the detail of chemotherapy regimens, so we certainly could not know real intensity of chemotherapy for each patient. Secondly, Margaret M. Kozak compared the survival differences between the older group and younger group in different stages all receiving chemotherapy. They found chemotherapy could help to reduce the death risk in the elders to a level similar to that in the young in stage II but not in stage III. We thought they confused the effect of chemotherapy on the survival difference between the elders and young for the effect of chemotherapy on the survival of elders. After directly compared the risk between chemotherapy group and no-chemotherapy group, we found the effectiveness of reducing death risk by chemotherapy is consistent in stage II and stage III eTNBC. Therefore, we strongly recommended chemotherapy to stage III eTNBC. But what interests us is that does all patients in stage II need chemotherapy? What about the patients with negative lymph node or positive lymph node with small tumor? Further stratified analysis of the patients with stage II based on different N stage and T stage were conducted. We found that the patients with T1N1M0 could exempt chemotherapy, since no significant improvements were observed both in BCSS and OS after chemotherapy. But the patients with T2N0M0 and stage IIb could still benefit from chemotherapy. Besides, the relative risks of breast cancer specific mortality and overall mortality were reduced by about 25% ~ 35% in stage IIb. Therefore, we regarded that lymph node status should not be the only determinant. More detailed stratification could help us to identify candidates who really need chemotherapy which is particularly important in eTNBC.

Histological grade is an important pathological determinant for chemotherapy[23]. High grade always means high proliferation, poor prognosis and strong chemotherapy recommendation level [24]. In our study, we found that the risk of death decreased dramatically in grade III cohort after chemotherapy when compared with grade II and grade I cohorts. Considering chemotherapy is more effective in killing tumor with high proliferation, it could not be omitted in grade III eTNBC. Since there were no significant differences for their outcomes in grade I and grade II groups, we did not recommend chemotherapy to them. However, as showed in previous studies and our study, nearly 80% of eTNBC tumors presented as grade III, chemotherapy is still an important treatment modality.

At present, machine learning model can be considered as a model that automatically adjusts the weights of the factors. In addition, it can constructs a model that does not reduce the predictive effectiveness by fully exploiting data. On the contrary, some factors could not be incorporated into the model due to a lack of statistical significance in traditional statistics (for example, Cox proportional hazard regression model). In terms of performance, machine learning algorithms are more accurate than traditional statistical methods in predicting survival outcome in the fifth year. This is one of the purposes of our study. While Cox proportional hazards model is more appropriate for investigating the associations between covariates and end-point events. In terms of speed, machine learning algorithms can produce results within milliseconds. This strengths allows the system to react in real time. Delen and colleagues is the first to established a prediction model based on machine learning for patients of breast cancer [25]. Subsequently, machine learning is widely used in breast cancer. But, there is a paucity in machine learning algorithms predicting the impact of chemotherapy in early eTNBC. In our study, nine models were built to predict the 5-year BCSS and OS for patients received chemotherapy. Taken together, the results showed that the performance of LightGBM method exceeded that of all the other models in prediction of OS and BCSS. To the best of our knowledge, this is the first available predictive model for predicting survival impact of chemotherapy in early eTNBC, based on machine learning algorithms. We established the prediction model with the excellent performance. It could provide doctors with an easily accessible prediction tool and lead to more individualized and tailored chemotherapy for patients of early eTNBC.

In our study, we enrolled the largest number of participants to evaluate the value of chemotherapy in eTNBC by using the SEER database. After researching by PSM and investigating in more detailed subgroups, we believed we could offer a more helpful reference for clinical practice. However, we admit there are still several inevitable limitations. Since the SEER database did not provide information about comorbidity, we could not evaluate the impact of comorbidity on the results between the two groups. This might lead to minor bias. However, it is also a common inadequacy of those studies based on SEER databases. In general, breast cancer remains the most predominant cause of death in such groups of patients. By using an efficient modelling approach, we could effectively evaluate the prognostic impact of chemotherapy in elderly TNBC patients. Nevertheless, in view of the complexity of different comorbidity and the risk of it could not be quantified, we need to weight pros and cons carefully and individually. Another limitation is due to the SEER database itself, the information about the details of chemotherapy including drug, dose and number of cycles are unavailable. In this study, competing risk model might be an alternative statistical method, which can reduce the impact of competitive events on the results to a certain extent. But it might not be the best option. Because it may led to some inaccuracy and caused difficulties interpreting the results. Competing risk model is not necessarily better than the Cox proportional hazard model [26]. Thus, we didn't choose the competing risk model as the main statistical method. In contrast, Cox proportional hazard model is a more mature method. It could produce accurate results and it is more easier for Cox proportional hazard model to interpret the results than competing risk model. In addition, the models developed in this study have not been verified in the external validation cohort.

Conclusion

In our study, chemotherapy improved survival in patients with grade III, T2N0M0, stage IIb and stage III early eTNBC. For patients diagnosed with stage I, T1N1M0, grade I and grade II, chemotherapy could not improve OS and BCSS. So, chemotherapy might be skipped. The nine models developed by machine learning performed well in survival prediction of early eTNBC patients and the LightGBM model have a best performance. The LightGBM is practical and trustful model to predict the survival and provide precious systemic treatment for patients in the chemotherapy group.

Supplementary Information

12877_2022_2936_MOESM1_ESM.docx^{(21.3KB, docx)}

Additional file 1: Table S1. Univariate Cox proportional hazard model of breast cancer-specific survival (BCSS) and overall survival (OS) in all patients.

12877_2022_2936_MOESM2_ESM.docx^{(16.6KB, docx)}

Additional file 2: Table S2. The test of the proportional hazards assumption in subgroups sorted by specific stage (BCSS).

12877_2022_2936_MOESM3_ESM.docx^{(16.7KB, docx)}

Additional file 3: Table S3. The test of the proportional hazards assumption in subgroups sorted by specific stage (OS).

12877_2022_2936_MOESM4_ESM.docx^{(16.9KB, docx)}

Additional file 4: Table S4. The test of the proportional hazards assumption in subgroups sorted by specific clinical variables (BCSS).

12877_2022_2936_MOESM5_ESM.docx^{(17KB, docx)}

Additional file 5: Table S5. The test of the proportional hazards assumption in subgroups sorted by specific clinical variables(OS).

Acknowledgements

We thank Prof. Baochang He from Fujian Medical University for statistics consultation.

Authors’ contributions

C.G.S. and J.Z. contributed to conception and design; K.Y.H. and Y.S.Y. contributed to the development of methodology; K.Y.H., Y.S.Y., J.Z., Y.X.L. and C.G.S. contributed to acquisition of data and analysis of data; K.Y.H., J.Z. and C.G.S. wrote, reviewed, and/or revised the manuscript; C.G.S. and J.Z. did study supervision. The author(s) read and approved the final manuscript.

Funding

This research was not supported by any grant funding.

Availability of data and materials

The datasets generated and analysed during the current study are available in the Surveillance, Epidemiology, and End Results (SEER) database. The URL of the database is https://seer.cancer.gov/.

Declarations

Ethics approval and consent to participate

Considering SEER database is publicly available. We signed a Data-Use Agreement for the SEER 1973–2016 Research Data File to get access conditions. Data extraction and usage has been approved by SEER Program. We confirm that all methods were carried out in accordance with relevant guidelines and regulations.

Consent for publication

The manuscript is approved by all authors for publication.

Competing interests

The authors have declared that no competing interests exist.

Footnotes

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Kaiyan Huang, Jie Zhang and Yushuai Yu contributed equally to this work.

References

1.Siegel RL, Miller KD, Fuchs HE, Jemal A. Cancer Statistics, 2021. CA: Cancer J Clin. 2021;71(1):7–33. doi: 10.3322/caac.21654. [DOI] [PubMed] [Google Scholar]
2.DeSantis CE, Ma J, Gaudet MM, Newman LA, Miller KD, Goding Sauer A, Jemal A, Siegel RL. Breast cancer statistics, 2019. CA: Cancer J Clin. 2019;69(6):438–451. doi: 10.3322/caac.21583. [DOI] [PubMed] [Google Scholar]
3.Smith BD, Smith GL, Hurria A, Hortobagyi GN, Buchholz TA. Future of cancer incidence in the United States: burdens upon an aging, changing nation. J Clin Oncol. 2009;27(17):2758–2765. doi: 10.1200/JCO.2008.20.8983. [DOI] [PubMed] [Google Scholar]
4.Freedman RA, Foster JC, Seisler DK, Lafky JM, Muss HB, Cohen HJ, Mandelblatt J, Winer EP, Hudis CA, Partridge AH, et al. Accrual of older patients with breast cancer to alliance systemic therapy trials over time: protocol A151527. J Clin Oncol. 2017;35(4):421–431. doi: 10.1200/JCO.2016.69.4182. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Bastiaannet E, Liefers GJ, de Craen AJ, Kuppen PJ, van de Water W, Portielje JE, van der Geest LG, Janssen-Heijnen ML, Dekkers OM, van de Velde CJ, et al. Breast cancer in elderly compared to younger patients in the Netherlands: stage at diagnosis, treatment and survival in 127,805 unselected patients. Breast Cancer Res Treat. 2010;124(3):801–807. doi: 10.1007/s10549-010-0898-8. [DOI] [PubMed] [Google Scholar]
6.Ring A, Harder H, Langridge C, Ballinger RS, Fallowfield LJ. Adjuvant chemotherapy in elderly women with breast cancer (AChEW): an observational study identifying MDT perceptions and barriers to decision making. Ann Oncol. 2013;24(5):1211–1219. doi: 10.1093/annonc/mds642. [DOI] [PubMed] [Google Scholar]
7.Kozak MM, Xiang M, Pollom EL, Horst KC. Adjuvant treatment and survival in older women with triple negative breast cancer: a surveillance, epidemiology, and end results analysis. Breast J. 2019;25(3):469–473. doi: 10.1111/tbj.13251. [DOI] [PubMed] [Google Scholar]
8.Gennari R, Curigliano G, Rotmensz N, Robertson C, Colleoni M, Zurrida S, Nolè F, de Braud F, Orlando L, Leonardi MC, et al. Breast carcinoma in elderly women: features of disease presentation, choice of local and systemic treatments compared with younger postmenopasual patients. Cancer. 2004;101(6):1302–1310. doi: 10.1002/cncr.20535. [DOI] [PubMed] [Google Scholar]
9.Crozier JA, Pezzi TA, Hodge C, Janeva S, Lesnikoski BA, Samiian L, Devereaux A, Hammond W, Audisio RA, Pezzi CM. Addition of chemotherapy to local therapy in women aged 70 years or older with triple-negative breast cancer: a propensity-matched analysis. Lancet Oncol. 2020;21(12):1611–1619. doi: 10.1016/S1470-2045(20)30538-6. [DOI] [PubMed] [Google Scholar]
10.Janeva S, Zhang C, Kovács A, Parris TZ, Crozier JA, Pezzi CM, Linderholm B, Audisio RA, Olofsson Bagge R. Adjuvant chemotherapy and survival in women aged 70 years and older with triple-negative breast cancer: a Swedish population-based propensity score-matched analysis. Lancet Health Longev. 2020;1(3):e117–e124. doi: 10.1016/S2666-7568(20)30018-0. [DOI] [PubMed] [Google Scholar]
11.Klepin HD, Pitcher BN, Ballman KV, Kornblith AB, Hurria A, Winer EP, Hudis C, Cohen HJ, Muss HB, Kimmick GG. Comorbidity, chemotherapy toxicity, and outcomes among older women receiving adjuvant chemotherapy for breast cancer on a clinical trial: CALGB 49907 and CALGB 361004 (alliance) J Oncol Pract. 2014;10(5):e285–292. doi: 10.1200/JOP.2014.001388. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Edwards BK, Noone AM, Mariotto AB, Simard EP, Boscoe FP, Henley SJ, Jemal A, Cho H, Anderson RN, Kohler BA, et al. Annual Report to the Nation on the status of cancer, 1975–2010, featuring prevalence of comorbidity and impact on survival among persons with lung, colorectal, breast, or prostate cancer. Cancer. 2014;120(9):1290–1314. doi: 10.1002/cncr.28509. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Mohile SG, Dale W, Somerfield MR, Hurria A. Practical assessment and management of vulnerabilities in older patients receiving chemotherapy: ASCO Guideline for geriatric oncology summary. J Oncol Pract. 2018;14(7):442–446. doi: 10.1200/JOP.18.00180. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Lee SJ, Boscardin WJ, Kirby KA, Covinsky KE. Individualizing life expectancy estimates for older adults using the gompertz law of human mortality. PloS One. 2014;9(9):e108540. doi: 10.1371/journal.pone.0108540. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Chandler Y, Jayasekera JC, Schechter CB, Isaacs C, Cadham CJ, Mandelblatt JS. Simulation of chemotherapy effects in older breast cancer patients with high recurrence scores. J Natl Cancer Inst. 2020;112(6):574–581. doi: 10.1093/jnci/djz189. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Walter LC, Covinsky KE. Cancer screening in elderly patients: a framework for individualized decision making. JAMA. 2001;285(21):2750–2756. doi: 10.1001/jama.285.21.2750. [DOI] [PubMed] [Google Scholar]
17.Kaplan HG, Malmgren JA, Atwood MK. Triple-negative breast cancer in the elderly: Prognosis and treatment. Breast J. 2017;23(6):630–637. doi: 10.1111/tbj.12813. [DOI] [PubMed] [Google Scholar]
18.Syed BM, Green AR, Nolan CC, Morgan DA, Ellis IO, Cheung KL. Biological characteristics and clinical outcome of triple negative primary breast cancer in older women - comparison with their younger counterparts. PloS One. 2014;9(7):e100573. doi: 10.1371/journal.pone.0100573. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Liedtke C, Hess KR, Karn T, Rody A, Kiesel L, Hortobagyi GN, Pusztai L, Gonzalez-Angulo AM. The prognostic impact of age in patients with triple-negative breast cancer. Breast Cancer Res Treat. 2013;138(2):591–599. doi: 10.1007/s10549-013-2461-x. [DOI] [PubMed] [Google Scholar]
20.Schwartzberg LS, Blair SL. Strategies for the management of early-stage breast cancer in older women. JNCCN. 2016;14(5 Suppl):647–650. doi: 10.6004/jnccn.2016.0182. [DOI] [PubMed] [Google Scholar]
21.Giordano SH, Duan Z, Kuo YF, Hortobagyi GN, Goodwin JS. Use and outcomes of adjuvant chemotherapy in older women with breast cancer. J Clin Oncol. 2006;24(18):2750–2756. doi: 10.1200/JCO.2005.02.3028. [DOI] [PubMed] [Google Scholar]
22.Elkin EB, Hurria A, Mitra N, Schrag D, Panageas KS. Adjuvant chemotherapy and survival in older women with hormone receptor-negative breast cancer: assessing outcome in a population-based, observational cohort. J Clin Oncol. 2006;24(18):2757–2764. doi: 10.1200/JCO.2005.03.6053. [DOI] [PubMed] [Google Scholar]
23.Loibl S, Poortmans P, Morrow M, Denkert C, Curigliano G. Breast cancer. Lancet (London, England) 2021;397(10286):1750–1769. doi: 10.1016/S0140-6736(20)32381-3. [DOI] [PubMed] [Google Scholar]
24.NCCN guideline :Breast Cancer. Version 7.2021; http://www.nccn.org.
25.Delen D, Walker G, Kadam A. Predicting breast cancer survivability: a comparison of three data mining methods. Artif Intell Med. 2005;34(2):113–127. doi: 10.1016/j.artmed.2004.07.002. [DOI] [PubMed] [Google Scholar]
26.Zhou ZR, Wang WW, Li Y, Jin KR, Wang XY, Wang ZW, Chen YS, Wang SJ, Hu J, Zhang HN, et al. In-depth mining of clinical data: the construction of clinical prediction model with R. Ann Trans Med. 2019;7(23):796. doi: 10.21037/atm.2019.08.63. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

12877_2022_2936_MOESM1_ESM.docx^{(21.3KB, docx)}

Additional file 1: Table S1. Univariate Cox proportional hazard model of breast cancer-specific survival (BCSS) and overall survival (OS) in all patients.

12877_2022_2936_MOESM2_ESM.docx^{(16.6KB, docx)}

Additional file 2: Table S2. The test of the proportional hazards assumption in subgroups sorted by specific stage (BCSS).

12877_2022_2936_MOESM3_ESM.docx^{(16.7KB, docx)}

Additional file 3: Table S3. The test of the proportional hazards assumption in subgroups sorted by specific stage (OS).

12877_2022_2936_MOESM4_ESM.docx^{(16.9KB, docx)}

Additional file 4: Table S4. The test of the proportional hazards assumption in subgroups sorted by specific clinical variables (BCSS).

12877_2022_2936_MOESM5_ESM.docx^{(17KB, docx)}

Additional file 5: Table S5. The test of the proportional hazards assumption in subgroups sorted by specific clinical variables(OS).

Data Availability Statement

The datasets generated and analysed during the current study are available in the Surveillance, Epidemiology, and End Results (SEER) database. The URL of the database is https://seer.cancer.gov/.

[CR1] 1.Siegel RL, Miller KD, Fuchs HE, Jemal A. Cancer Statistics, 2021. CA: Cancer J Clin. 2021;71(1):7–33. doi: 10.3322/caac.21654. [DOI] [PubMed] [Google Scholar]

[CR2] 2.DeSantis CE, Ma J, Gaudet MM, Newman LA, Miller KD, Goding Sauer A, Jemal A, Siegel RL. Breast cancer statistics, 2019. CA: Cancer J Clin. 2019;69(6):438–451. doi: 10.3322/caac.21583. [DOI] [PubMed] [Google Scholar]

[CR3] 3.Smith BD, Smith GL, Hurria A, Hortobagyi GN, Buchholz TA. Future of cancer incidence in the United States: burdens upon an aging, changing nation. J Clin Oncol. 2009;27(17):2758–2765. doi: 10.1200/JCO.2008.20.8983. [DOI] [PubMed] [Google Scholar]

[CR4] 4.Freedman RA, Foster JC, Seisler DK, Lafky JM, Muss HB, Cohen HJ, Mandelblatt J, Winer EP, Hudis CA, Partridge AH, et al. Accrual of older patients with breast cancer to alliance systemic therapy trials over time: protocol A151527. J Clin Oncol. 2017;35(4):421–431. doi: 10.1200/JCO.2016.69.4182. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR5] 5.Bastiaannet E, Liefers GJ, de Craen AJ, Kuppen PJ, van de Water W, Portielje JE, van der Geest LG, Janssen-Heijnen ML, Dekkers OM, van de Velde CJ, et al. Breast cancer in elderly compared to younger patients in the Netherlands: stage at diagnosis, treatment and survival in 127,805 unselected patients. Breast Cancer Res Treat. 2010;124(3):801–807. doi: 10.1007/s10549-010-0898-8. [DOI] [PubMed] [Google Scholar]

[CR6] 6.Ring A, Harder H, Langridge C, Ballinger RS, Fallowfield LJ. Adjuvant chemotherapy in elderly women with breast cancer (AChEW): an observational study identifying MDT perceptions and barriers to decision making. Ann Oncol. 2013;24(5):1211–1219. doi: 10.1093/annonc/mds642. [DOI] [PubMed] [Google Scholar]

[CR7] 7.Kozak MM, Xiang M, Pollom EL, Horst KC. Adjuvant treatment and survival in older women with triple negative breast cancer: a surveillance, epidemiology, and end results analysis. Breast J. 2019;25(3):469–473. doi: 10.1111/tbj.13251. [DOI] [PubMed] [Google Scholar]

[CR8] 8.Gennari R, Curigliano G, Rotmensz N, Robertson C, Colleoni M, Zurrida S, Nolè F, de Braud F, Orlando L, Leonardi MC, et al. Breast carcinoma in elderly women: features of disease presentation, choice of local and systemic treatments compared with younger postmenopasual patients. Cancer. 2004;101(6):1302–1310. doi: 10.1002/cncr.20535. [DOI] [PubMed] [Google Scholar]

[CR9] 9.Crozier JA, Pezzi TA, Hodge C, Janeva S, Lesnikoski BA, Samiian L, Devereaux A, Hammond W, Audisio RA, Pezzi CM. Addition of chemotherapy to local therapy in women aged 70 years or older with triple-negative breast cancer: a propensity-matched analysis. Lancet Oncol. 2020;21(12):1611–1619. doi: 10.1016/S1470-2045(20)30538-6. [DOI] [PubMed] [Google Scholar]

[CR10] 10.Janeva S, Zhang C, Kovács A, Parris TZ, Crozier JA, Pezzi CM, Linderholm B, Audisio RA, Olofsson Bagge R. Adjuvant chemotherapy and survival in women aged 70 years and older with triple-negative breast cancer: a Swedish population-based propensity score-matched analysis. Lancet Health Longev. 2020;1(3):e117–e124. doi: 10.1016/S2666-7568(20)30018-0. [DOI] [PubMed] [Google Scholar]

[CR11] 11.Klepin HD, Pitcher BN, Ballman KV, Kornblith AB, Hurria A, Winer EP, Hudis C, Cohen HJ, Muss HB, Kimmick GG. Comorbidity, chemotherapy toxicity, and outcomes among older women receiving adjuvant chemotherapy for breast cancer on a clinical trial: CALGB 49907 and CALGB 361004 (alliance) J Oncol Pract. 2014;10(5):e285–292. doi: 10.1200/JOP.2014.001388. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR12] 12.Edwards BK, Noone AM, Mariotto AB, Simard EP, Boscoe FP, Henley SJ, Jemal A, Cho H, Anderson RN, Kohler BA, et al. Annual Report to the Nation on the status of cancer, 1975–2010, featuring prevalence of comorbidity and impact on survival among persons with lung, colorectal, breast, or prostate cancer. Cancer. 2014;120(9):1290–1314. doi: 10.1002/cncr.28509. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR13] 13.Mohile SG, Dale W, Somerfield MR, Hurria A. Practical assessment and management of vulnerabilities in older patients receiving chemotherapy: ASCO Guideline for geriatric oncology summary. J Oncol Pract. 2018;14(7):442–446. doi: 10.1200/JOP.18.00180. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR14] 14.Lee SJ, Boscardin WJ, Kirby KA, Covinsky KE. Individualizing life expectancy estimates for older adults using the gompertz law of human mortality. PloS One. 2014;9(9):e108540. doi: 10.1371/journal.pone.0108540. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR15] 15.Chandler Y, Jayasekera JC, Schechter CB, Isaacs C, Cadham CJ, Mandelblatt JS. Simulation of chemotherapy effects in older breast cancer patients with high recurrence scores. J Natl Cancer Inst. 2020;112(6):574–581. doi: 10.1093/jnci/djz189. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR16] 16.Walter LC, Covinsky KE. Cancer screening in elderly patients: a framework for individualized decision making. JAMA. 2001;285(21):2750–2756. doi: 10.1001/jama.285.21.2750. [DOI] [PubMed] [Google Scholar]

[CR17] 17.Kaplan HG, Malmgren JA, Atwood MK. Triple-negative breast cancer in the elderly: Prognosis and treatment. Breast J. 2017;23(6):630–637. doi: 10.1111/tbj.12813. [DOI] [PubMed] [Google Scholar]

[CR18] 18.Syed BM, Green AR, Nolan CC, Morgan DA, Ellis IO, Cheung KL. Biological characteristics and clinical outcome of triple negative primary breast cancer in older women - comparison with their younger counterparts. PloS One. 2014;9(7):e100573. doi: 10.1371/journal.pone.0100573. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR19] 19.Liedtke C, Hess KR, Karn T, Rody A, Kiesel L, Hortobagyi GN, Pusztai L, Gonzalez-Angulo AM. The prognostic impact of age in patients with triple-negative breast cancer. Breast Cancer Res Treat. 2013;138(2):591–599. doi: 10.1007/s10549-013-2461-x. [DOI] [PubMed] [Google Scholar]

[CR20] 20.Schwartzberg LS, Blair SL. Strategies for the management of early-stage breast cancer in older women. JNCCN. 2016;14(5 Suppl):647–650. doi: 10.6004/jnccn.2016.0182. [DOI] [PubMed] [Google Scholar]

[CR21] 21.Giordano SH, Duan Z, Kuo YF, Hortobagyi GN, Goodwin JS. Use and outcomes of adjuvant chemotherapy in older women with breast cancer. J Clin Oncol. 2006;24(18):2750–2756. doi: 10.1200/JCO.2005.02.3028. [DOI] [PubMed] [Google Scholar]

[CR22] 22.Elkin EB, Hurria A, Mitra N, Schrag D, Panageas KS. Adjuvant chemotherapy and survival in older women with hormone receptor-negative breast cancer: assessing outcome in a population-based, observational cohort. J Clin Oncol. 2006;24(18):2757–2764. doi: 10.1200/JCO.2005.03.6053. [DOI] [PubMed] [Google Scholar]

[CR23] 23.Loibl S, Poortmans P, Morrow M, Denkert C, Curigliano G. Breast cancer. Lancet (London, England) 2021;397(10286):1750–1769. doi: 10.1016/S0140-6736(20)32381-3. [DOI] [PubMed] [Google Scholar]

[CR24] 24.NCCN guideline :Breast Cancer. Version 7.2021; http://www.nccn.org.

[CR25] 25.Delen D, Walker G, Kadam A. Predicting breast cancer survivability: a comparison of three data mining methods. Artif Intell Med. 2005;34(2):113–127. doi: 10.1016/j.artmed.2004.07.002. [DOI] [PubMed] [Google Scholar]

[CR26] 26.Zhou ZR, Wang WW, Li Y, Jin KR, Wang XY, Wang ZW, Chen YS, Wang SJ, Hu J, Zhang HN, et al. In-depth mining of clinical data: the construction of clinical prediction model with R. Ann Trans Med. 2019;7(23):796. doi: 10.21037/atm.2019.08.63. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

The impact of chemotherapy and survival prediction by machine learning in early Elderly Triple Negative Breast Cancer (eTNBC): a population based study from the SEER database

Kaiyan Huang

Jie Zhang

Yushuai Yu

Yuxiang Lin

Chuangui Song

Abstract

Purpose

Methods

Results

Conclusion

Supplementary Information

Introduction

Materials and methods

Ethics approval and consent to participate

Data source and study population

Outcome measurement

Statistical analysis

Results

Demographics and clinical characteristics of the study population

Table 1.

Comparison of survival between chemotherapy group and no-chemotherapy group

Table 2.

Table 3.

Table 4.

Fig. 1.

Fig. 2.

Table 5.

Machine-learning based outcome prediction in patients received chemotherapy

Table 6.

Table 7.

Fig. 3.

Discussion

Conclusion

Supplementary Information

Acknowledgements

Authors’ contributions

Funding

Availability of data and materials

Declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Footnotes

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases