Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2024 Sep 14;14:21475. doi: 10.1038/s41598-024-72730-3

Development and validation of a prognostic nomogram for esophageal cancer patients based on SEER Asian population

Xinwei Guo 1,, Lang Qin 2, Jie Tian 2, Pengcheng Li 3, Zhenling Dou 2, Yu Gong 2, Haobiao Wang 3
PMCID: PMC11401934  PMID: 39277664

Abstract

This study aims to develop and validate a nomogram for predicting overall survival (OS) in Asian patients with Esophageal Cancer (EC). Data from Asian EC patients were collected from the Surveillance, Epidemiology, and End Results (SEER) database. The patients were randomly divided into training and validation cohorts in a 7:3 ratio. The Least Absolute Shrinkage and Selection Operator (LASSO) regression was used for initial variable selection, followed by multivariate Cox regression analysis to identify independent prognostic factors. A nomogram was subsequently constructed based on these factors. The predictive performance of the nomogram was evaluated using receiver operating characteristic (ROC) curves and calibration curves, while the clinical utility of the nomogram was assessed through decision curve analysis (DCA). The LASSO regression and multivariate Cox regression analysis identified age, sex, marital status, tumor size, M stage, surgery, and chemotherapy as independent prognostic factors. The ROC curve results demonstrated that the area under the curve (AUC) values for predicting 1-year, 3-year, and 5-year OS in the training cohort were 0.770, 0.756, and 0.783, respectively. In the validation cohort, the AUC values were 0.814, 0.763, and 0.771, respectively. Calibration curves indicated a high concordance between predicted and actual OS. The DCA demonstrated that the nomogram has significant clinical applicability. This nomogram provides reliable predictions and valuable guidance for personalized survival estimates and high-risk patient identification.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-024-72730-3.

Keywords: Esophageal cancer, Prognosis, Asian, Nomogram, SEER

Subject terms: Diseases, Gastroenterology, Medical research, Oncology, Risk factors

Introduction

Esophageal cancer (EC) ranks among the primary causes of cancer-related mortality worldwide, placing seventh in incidence and sixth in mortality rates, estimates suggest that in 2020, approximately 1 in every 18 cancer-related deaths was attributed to EC1. The two predominant histological subtypes of EC are esophageal squamous cell carcinoma (ESCC) and esophageal adenocarcinoma (EAC). ESCC prevails predominantly among Asian populations, with alcohol consumption and smoking identified as significant risk factors, while EAC is more prevalent among Caucasians and closely associated with obesity and smoking2,3. Substantial disparities in the incidence and mortality rates of EC are observed across different geographic regions. According to GLOBOCAN 2020 estimates, the incidence and mortality rates among Asian populations are the highest globally, accounting for 79.7% and 79.8%, respectively, additionally, the mortality-to-incidence ratio (MIR) for EC in Asia is 0.89, surpassing that of Europe (0.82) and North America (0.83)4. A spectrum of lifestyle and environmental factors contributes to EC risk or protection. Studies have identified factors such as rubber production, pickled vegetable consumption, hot beverage consumption, and exposure to tetrachloroethylene as etiological factors for EC5,6, with certain Asian populations facing heightened exposure risks due to dietary habits and occupational attributes. Despite notable advancements in modern radiotherapy techniques and updated chemotherapy agents, the efficacy of EC treatment remains modest, with low survival rates and high rates of local recurrence persisting among EC patients7, particularly in Asian populations where the predominant subtype is ESCC, with a 5-year overall survival (OS) rate falling below 30%.8 Consequently, the urgent need to identify prognostic factors conducive to risk stratification and clinical management for Asian EC populations, characterized by elevated incidence and mortality rates, is underscored.

The tumor, lymph node, metastasis (TNM) staging system has garnered widespread recognition and utilization in prognostic analysis of cancer. However, with advancements in treatment modalities and multimodal approaches, prognostic disparities persist even among patients with identical TNM staging9. In recent years, nomograms have emerged as a more efficient predictive tool for forecasting the prognosis of various cancer types, serving as a complementary approach to the TNM staging system. Additionally, some researchers argue for the consideration of other significant risk factors such as age, tumor location, tumor size, and treatment modalities in predicting cancer survival rates1012. Various prediction models related to cancer have been established utilizing the Surveillance, Epidemiology, and End Results (SEER) database. For instance, Qian et al. developed a nomogram to predict the prognosis of EAC13, while Fan et al. compared the prognostic disparities between endoscopic and surgical treatments in T1b-stage EC and formulated a nomogram for survival prediction14. Similarly, Tang et al. devised a nomogram to predict cancer-specific survival in patients with initial distant metastases in EC15. However, to our knowledge, there has been no relevant study focusing on the Asian EC population. Hence, we intend to develop a nomogram for predicting the prognosis of Asian EC patients based on the SEER database.

Methods

Patient selection

The SEER database (http://seer.cancer.gov/seerstat/) contains extensive cancer case data spanning from 1973 to the present day, encompassing populations from various regions of the United States. These data comprise individual information about cancer patients, including demographics (such as age, gender, and race), pathological details (such as cancer type, differentiation, and tumor size), treatment information (such as radiation therapy, surgery, and chemotherapy), and patient survival status. Necessary personal and organizational information is provided on the SEER database’s official website, and upon completion of an online registration form, approval for access is granted along with corresponding access credentials. Data retrieval is performed online using SEER*Stat software version 8.4.0.1. In the SEER database, the smallest unit for the “survival months” variable is one month, meaning that all patients with a survival time of less than one month are recorded as having a survival time of “0 month.” Some of these patients include cases of intraoperative or in-hospital death. To ensure the accuracy and reliability of the data analysis and to avoid the influence of extreme data on the overall statistical results, these cases were excluded.

Inclusion criteria encompass (1) EC with clear pathological diagnosis; (2) primary tumor; (3) initial diagnosis occurring between 2004 and 2015. Exclusion criteria include (1) demographic characteristics, including unknown age, race, and marital status; (2) clinical pathological features, including unknown tumor size, histological type, tumor grade, and tumor staging; (3) treatment information, including unknown surgery, radiation therapy, and chemotherapy; (4) survival time less than 1 month. The specific process can be seen in Supplementary Fig. 1.

Variable selection and transformation

This study collected demographic characteristics of patients, including age, sex, race, and marital status; clinical pathological features such as histological type, primary site of tumor, tumor size, grade, and TNM staging; treatment information including surgery, radiation therapy, and chemotherapy; and follow-up data including survival status and survival time. The primary observation endpoint of the study was OS, defined as the time from the beginning of the study until death for any reason. To ensure reliable follow-up data, patients included in the study were diagnosed between 2004 and 2015, with the last follow-up conducted in November 2020, ensuring a minimum follow-up period of five years for patients still alive at inclusion. Regarding tumor site, “Cervical esophagus” and “Upper third of esophagus” were categorized as upper segment, “Middle third of esophagus” as middle segment, “Lower third of esophagus” and “Abdominal esophagus” as lower segment, “Overlapping lesion of esophagus” and patients with unspecified tumor site as “Esophagus, NOS” and “Thoracic esophagus” were defined as others. For histological types, International Classification of Diseases, the third edition (ICD-O-3) codes 8140–8145, 8210, 8211, 8255, 8260, 8261, 8263, 8310, 8480, 8481, 8490, and 8574 were defined as EAC; codes 8050–8052, 8070–8076, and 8083–8084 were defined as ESCC, and the remaining codes were defined as other.

Additionally, as age and tumor size were discrete variables in this study, for further analysis, X-tile (Yale University, version 3.6.1) was used to determine the optimal cutoff values for age and tumor size in prognostic analysis. Using these optimal cutoff values as thresholds, patients were divided into groups with the most significant differences in survival.

Statistical analysis

To analyze the differences in variable distribution between Asian EC patients and those of other ethnicities, the Pearson chi-square test was employed. Asian EC patients were randomly divided into a training cohort and a validation cohort at a ratio of 7:3. The Pearson chi-square test was also used to examine the differences in variable distribution between these two cohorts. In the training cohort, the Least Absolute Shrinkage and Selection Operator (Lasso) regression was utilized for the preliminary selection of variables related to the prognosis of EC patients. A 10-fold cross-validation was conducted to avoid overfitting, and the selected variables were included in a multivariable Cox regression analysis. Variables with a p-value of less than 0.05 in the multivariable analysis were considered independent prognostic factors, which were then used to construct a nomogram. The accuracy of the nomogram was assessed using calibration curves, which compared the predicted OS rates to the actual OS rates. To reduce overfitting bias, Bootstrap resampling was performed 1,000 times. The discriminative ability of the nomogram, which refers to its capability to distinguish between surviving and deceased patients, was evaluated using the Concordance Index (C-index) and the area under the curve (AUC) of the receiver operating characteristic (ROC). The clinical utility of the nomogram was assessed through decision curve analysis (DCA), determining whether the nomogram-assisted decisions improved patient prognosis evaluation.

All statistical analyses were performed using R statistical software (version 4.2.1, http://www.R-project.org). Specifically, the “rio” package was utilized for data output, the “corrplot” and “glmnet” packages for lasso regression, the “survival,” “forestplot,” and “survminer” packages for univariate and multivariable Cox regression analysis, nomogram construction, and calculation of C-index with corresponding 95% confidence intervals (CI). The “rms” package was utilized for calibration curve construction, while the “riskRegression” package and “survival” package were employed for ROC curve construction. Furthermore, the “ggDCA,” “ggprism,” and “survival” packages were used for DCA. A significance level of p < 0.05 was considered statistically significant.

Results

Patient characteristics

A total of 13,789 EC patients were selected from the SEER database, comprising 678 Asian individuals and 13,111 individuals from other racial groups. Demographic, clinicopathological, and treatment information for all patients were summarized (Table 1). Compared to other racial groups, Asian individuals exhibited a higher proportion of being married (70.06% vs. 59.81%, p < 0.001), a higher percentage of tumors located in the upper segment (9.00% vs. 4.76%, p < 0.001), and middle segment (29.06% vs. 15.32%, p < 0.001), a higher prevalence of tumors larger than 27 mm (80.38% vs. 76.52%, p = 0.023), a higher frequency of histological type ESCC (70.94% vs. 30.04%, p < 0.001), a higher incidence of lymph node metastasis (61.95% vs. 55.36%, p < 0.001), a higher proportion of patients not undergoing surgical treatment (69.32% vs. 57.68%, p < 0.001), and a higher rate of receiving radiotherapy (71.83% vs. 65.91%, p < 0.001). Among Asian EC patients, 51 did not receive any treatment, accounting for approximately 7.52% (51/678). There were no significant differences between Asian EC patients and those of other ethnicities regarding age, gender, differentiation grade, T stage, M stage, and chemotherapy (p > 0.05). During the follow-up period, there were 11,674 deaths in the total cohort. The 1-year, 3-year, and 4-year OS rates were 56.11%, 30.28%, and 22.42%, respectively. Among Asian EC patients, there were 572 deaths, with 1-year, 3-year, and 5-year OS rates of 55.90%, 30.09%, and 20.65%, respectively. In contrast, 11,102 deaths occurred among patients of other races, with 1-year, 3-year, and 5-year OS rates of 56.12%, 29.96%, and 22.51%, respectively. Subsequently, the Asian patient cohort was randomly divided into a training cohort and a validation cohort. There were no statistically significant differences in the distribution of variables between these two cohorts (p > 0.05), as shown in Table 2.

Table 1.

Comparison of baseline characteristics between Asian and other ethnic populations.(chi square test).

Variables All Asian Others p
n = 13,789 n = 678 n = 13,111
Age > 75 2669 (19.36) 147 (21.68) 2522 (19.24) 0.128
≤ 75 11,120 (80.64) 531 (78.32) 10,589 (80.76)
Sex Female 2723 (19.75) 140 (20.65) 2583 (19.70) 0.579
Male 11,066 (80.25) 538 (79.35) 10,528 (80.30)
Marriage Married 8317 (60.32) 475 (70.06) 7842 (59.81) < 0.001
Unmarried 5472 (39.68) 203 (29.94) 5269 (40.19)
Site Upper 685 (4.97) 61 (9.00) 624 (4.76) < 0.001
Middle 2206 (16.00) 197 (29.06) 2009 (15.32)
Lower 8899 (64.54) 286 (42.18) 8613 (65.69)
Others 1999 (14.50) 134 (19.76) 1865 (14.22)
Size(mm) > 27 10,578 (76.71) 545 (80.38) 10,033 (76.52) 0.023
≤ 27 3211 (23.29) 133 (19.62) 3078 (23.48)
Histology EAC 8409 (60.98) 159 (23.45) 8250 (62.92) < 0.001
ESCC 4419 (32.05) 481 (70.94) 3938 (30.04)
Others 961 (6.97) 38 (5.60) 923 (7.04)
Grade Well 740 (5.37) 30 (4.42) 710 (5.42) 0.225
Moderately 5127 (37.18) 269 (39.68) 4858 (37.05)
Poorly 5813 (42.16) 264 (38.94) 5549 (42.32)
Undifferentiated 211 (1.53) 13 (1.92) 198 (1.51)
Unknown 1898 (13.76) 102 (15.04) 1796 (13.70)
T T1 4114 (29.84) 205 (30.24) 3909 (29.81) 0.116
T2 1764 (12.79) 77 (11.36) 1687 (12.87)
T3 5836 (42.32) 274 (40.41) 5562 (42.42)
T4 2075 (15.05) 122 (17.99) 1953 (14.90)
N N0 6111 (44.32) 258 (38.05) 5853 (44.64) 0.001
N1 7678 (55.68) 420 (61.95) 7258 (55.36)
M M0 10,212 (74.06) 483 (71.24) 9729 (74.20) 0.094
M1 3577 (25.94) 195 (28.76) 3382 (25.80)
Surgery No 8032 (58.25) 470 (69.32) 7562 (57.68) < 0.001
Yes 5757 (41.75) 208 (30.68) 5549 (42.32)
Radiation No 4660 (33.80) 191 (28.17) 4469 (34.09) 0.002
Yes 9129 (66.20) 487 (71.83) 8642 (65.91)
Chemotherapy No 4036 (29.27) 184 (27.14) 3852 (29.38) 0.227
Yes 9753 (70.73) 494 (72.86) 9259 (70.62)

EAC: esophageal adenocarcinoma, ESCC: esophageal squamous cell carcinoma.

Table 2.

Comparison of baseline characteristics between training cohort and validation cohort.(chi square test) EAC: esophageal adenocarcinoma, ESCC: esophageal squamous cell carcinoma.

Variables All cohort Training cohort Validation cohort p
n = 678 n = 475 n = 203
Age >75 147 (21.68) 106 (22.32) 41 (20.20) 0.609
≤ 75 531 (78.32) 369 (77.68) 162 (79.80)
Sex Female 140 (20.65) 95 (20.00) 45 (22.17) 0.593
Male 538 (79.35) 380 (80.00) 158 (77.83)
Marriage Married 475 (70.06) 326 (68.63) 149 (73.40) 0.250
Unmarried 203 (29.94) 149 (31.37) 54 (26.60)
Site Upper 61 (9.00) 44 (9.26) 17 (8.37) 0.773
Middle 197 (29.06) 133 (28.00) 64 (31.53)
Lower 286 (42.18) 205 (43.16) 81 (39.90)
Others 134 (19.76) 93 (19.58) 41 (20.20)
Size (mm) >27 545 (80.38) 383 (80.63) 162 (79.80) 0.886
≤ 27 133 (19.62) 92 (19.37) 41 (20.20)
Histology EAC 159 (23.45) 105 (22.11) 54 (26.60) 0.253
ESCC 481 (70.94) 340 (71.58) 141 (69.46)
Others 38 (5.60) 30 (6.32) 8 (3.94)
Grade Well 30 (4.42) 19 (4.00) 11 (5.42) 0.561
Moderately 269 (39.68) 187 (39.37) 82 (40.39)
Poorly 264 (38.94) 191 (40.21) 73 (35.96)
Undifferentiated 13 (1.92) 7 (1.47) 6 (2.96)
Unknown 102 (15.04) 71 (14.95) 31 (15.27)
T T1 205 (30.24) 137 (28.84) 68 (33.50) 0.505
T2 77 (11.36) 53 (11.16) 24 (11.82)
T3 274 (40.41) 194 (40.84) 80 (39.41)
T4 122 (17.99) 91 (19.16) 31 (15.27)
N N0 258 (38.05) 190 (40.00) 68 (33.50) 0.131
N1 420 (61.95) 285 (60.00) 135 (66.50)
M M0 483 (71.24) 338 (71.16) 145 (71.43) 1.000
M1 195 (28.76) 137 (28.84) 58 (28.57)
Surgery No 470 (69.32) 336 (70.74) 134 (66.01) 0.258
Yes 208 (30.68) 139 (29.26) 69 (33.99)
Radiation No 191 (28.17) 133 (28.00) 58 (28.57) 0.954
Yes 487 (71.83) 342 (72.00) 145 (71.43)
Chemotherapy No 184 (27.14) 128 (26.95) 56 (27.59) 0.939
Yes 494 (72.86) 347 (73.05) 147 (72.41)

Selection of independent prognostic factors

Based on the training cohort, dimensionality reduction was initially performed using Lasso-Cox regression, selecting variables with non-zero coefficients (Fig. 1). These variables included age, gender, marital status, tumor location, tumor size, degree of differentiation, T stage, N stage, M stage, surgery, radiotherapy, and chemotherapy. Subsequently, a multivariate Cox regression model was constructed using these variables (Table 3). The results indicated that age (p = 0.005), sex (p = 0.010), marital status (p = 0.003), tumor size (p = 0.002), M stage (p < 0.001), surgery (p < 0.001), and chemotherapy (p < 0.001) are independent prognostic factors for Asian patients with EC.

Fig. 1.

Fig. 1

Correlation between coefficient and log lambda (A) and sifting of variables via cross-validation (B) in the LASSO model. LASSO, least absolute shrinkage and selection operator.

Table 3.

Multivariate Cox analysis of the variables filtered by Lasso regression.(based on the training cohort).

Variables Multivariable p
HR (95% CI)
Age > 75 Reference
≤ 75 0.707 (0.554, 0.903) 0.005
Sex Female Reference
Male 1.435 (1.092, 1.885) 0.010
Marriage Married Reference
Unmarried 1.409 (1.127, 1.761) 0.003
Site Upper 0.713 (0.484, 1.050) 0.087
Middle 0.933 (0.722, 1.206) 0.598
Lower Reference
Others 1.088 (0.829, 1.427) 0.545
Size (mm) > 27 Reference
≤ 27 0.622 (0.459, 0.844) 0.002
Grade Well 0.779 (0.448, 1.353) 0.375
Moderately Reference
Poorly 1.136 (0.901, 1.432) 0.282
Undifferentiated 0.494 (0.155, 1.574) 0.233
Unknown 0.724 (0.530, 0.989) 0.043
T T1 Reference
T2 0.931 (0.642, 1.348) 0.703
T3 1.296 (0.987, 1.702) 0.062
T4 1.473 (1.078, 2.013) 0.015
N N0 Reference
N1 1.155 (0.917, 1.455) 0.220
M M0 Reference
M1 1.649 (1.295, 2.098) < 0.001
Surgery No Reference
Yes 0.391 (0.291, 0.526) < 0.001
Radiation No Reference
Yes 0.801 (0.596, 1.077) 0.141
Chemotherapy No Reference
Yes 0.589 (0.438, 0.792) < 0.001

Construction and validation of nomogram

A nomogram was developed based on the independent prognostic factors identified through multivariate Cox regression (Fig. 2). Subsequently, cross-validation of the nomogram was performed using training and validation cohorts derived from random splits. The C-index of the nomogram was 0.700 (95% CI: 0.673–0.727) in the training cohort and 0.709 (95% CI: 0.668–0.750) in the validation cohort. The calibration curves demonstrated a high concordance between the predicted and actual OS rates in both the training and validation cohorts (Fig. 3). The discriminative ability of the nomogram was evaluated using the AUC (Fig. 4). In the training cohort, the AUC for predicting 1-year, 3-year, and 5-year OS rates were 0.770 (95% CI: 0.728–0.881), 0.756 (95% CI: 0.709–0.803), and 0.783 (95% CI: 0.732–0.834), respectively. In the validation cohort, the AUC for predicting 1-year, 3-year, and 5-year OS rates were 0.814 (95% CI: 0.754–0.875), 0.763 (95% CI: 0.697–0.830), and 0.771 (95% CI: 0.705–0.837), respectively. DCA further indicated that the nomogram exhibited good clinical utility in predicting the 1-year, 3-year, and 5-year OS rates (Fig. 5).

Fig. 2.

Fig. 2

Nomogram for predicting 1-, 3- and 5-yr overall survival of Asian patients with esophageal cancer.

Fig. 3.

Fig. 3

Calibration curves of the nomogram. (A) Calibration curves of 1-, 3- and 5-yr overall survival in the training cohort; (B) Calibration curves of 1-, 3- and 5-yr overall survival in the validation cohort.

Fig. 4.

Fig. 4

Receiver operating characteristic curves of the nomogram (AC) For 1-, 3- and 5-yr overall survival in the training cohort; (DF) for 1-, 3- and 5-yr overall survival in the validation cohort.

Fig. 5.

Fig. 5

Decision curve analysis for survival prediction. (A) For 1-, 3- and 5-yr overall survival in the training cohort; (B) for 1-, 3- and 5-yr overall survival in the validation cohort.

Risk stratification

The scores assigned to each variable in the nomogram (Fig. 1) are derived from the regression coefficients corresponding to each categorical variable in the multivariate Cox regression risk model. Higher scores indicate greater risk. Based on the risk scores obtained from the nomogram, patients are assigned points. Specifically, for age, patients aged over 75 years are assigned 31 points, while those aged 75 years or younger are assigned 0 points. Regarding gender, females are assigned 0 points, whereas males are assigned 58 points. Marital status is scored as 0 for married individuals and 44 for unmarried individuals. Tumor length scores 0 if less than 27 mm and 76 if greater than 27 mm. For the M stage, patients classified as M0 receive 0 points, while those classified as M1 receive 49 points. Patients who underwent surgery and chemotherapy received 0 points, while those who did not undergo either surgery or chemotherapy received 100 and 64 points, respectively. Scores based on independent prognostic factors from the nomogram are assigned to all patients in the Asian patient cohort, and the total score is calculated. The optimal cutoff value for risk scores is determined using X-tile software, dividing patients into low-risk (total score ≤ 291) and high-risk (total score > 291) groups. Kaplan-Meier survival curves are plotted (Fig. 6), and a log-rank test reveals a significant difference in OS between the two groups (p < 0.001).

Fig. 6.

Fig. 6

Kaplan–Meier method estimate of overall survival in the whole cohort. Low risk refers to a total score of ≤ 291, high risk refers to a total score of > 291.

Discussion

This study established a nomogram to predict the 1-year, 3-year, and 5-year OS rates of Asian EC patients using data from the SEER database, and it underwent rigorous validation. In the modeling process, Lasso regression was initially employed to select variables with non-zero coefficients corresponding to the lambda value that minimized the average error during cross-validation. While this approach often yields the highest predictive accuracy, it may also result in the selection of numerous variables, leading to increased model complexity. Subsequently, a multivariate Cox regression model was utilized to further refine the variables and identify independent prognostic factors, including age, gender, marital status, tumor size, M stage, surgery, and chemotherapy. Although the multivariate Cox regression model revealed that tumor grade as unknown and T stage as T4 had p-values less than 0.05, they were not considered independent prognostic factors to enhance the statistical reliability of the nomogram. The established nomogram demonstrated robust predictive performance and clinical utility in both the training and validation cohorts, with C-index and AUC values exceeding 0.7. Subsequently, risk stratification was performed based on the risk scores, enabling clinicians to assess the prognosis of Asian EC patients and implement appropriate interventions.

The treatment modalities for EC are continuously advancing. Rooted in clinical trials such as RTOG85-01 and RTOG94-0516,17, definitive chemoradiotherapy emerged as a non-surgical treatment method initially recognized and applied in locally advanced EC cases. The CROSS study further solidified the treatment standards for EC over the past decade18. Its follow-up data, released in 2021, demonstrated sustained OS benefits with neoadjuvant chemoradiotherapy (NCRT). With a median follow-up time of 147 months, the 10-year OS rate was higher in the NCRT group compared to the surgery-only group (38% vs. 25%, p = 0.004). While NCRT was associated with a reduced local recurrence rate compared to surgery alone, the rates of distant recurrence were similar between the two groups (27% vs. 28%). In our study, chemotherapy emerged as a protective factor postoperatively for Asian EC patients (HR = 0.589, 95% CI: 0.438–0.792, p < 0.001), while radiotherapy was not a protective factor (HR = 0.801, 95% CI: 0.596–1.077, p = 0.141). Some studies suggest that NCRT, compared to neoadjuvant chemotherapy alone, may improve local-regional control rates and rates of curative surgical resection19,20 However, no significant differences in long-term survival have been observed between these approaches. Additionally, research by Justin Rucker and colleagues found that for patients with postoperative pathological staging of T2-4aN0M0 esophageal adenocarcinoma who have undergone perioperative adjuvant chemotherapy, the addition of concurrent radiotherapy does not significantly impact OS21. The value of radiotherapy remains to be further discussed.

In the nomogram developed within this study, individuals who did not undergo surgery exhibited the highest risk scores. Despite the rapid advancements in therapies such as chemotherapy, radiotherapy, immunotherapy, and cellular treatments in recent years, comprehensive treatment modalities centered around surgery remain the preferred approach. Conventional surgical techniques involving direct excision or ablation of tumors, along with minimally invasive procedures such as laparoscopic surgery, endoscopic resection, and non-resectional methods, continue to serve as fundamental strategies in the management of EC2224. It is noteworthy that the proportion of Asian EC patients undergoing surgery in this study was significantly lower compared to other ethnic groups (30.68% vs. 42.32%, p < 0.001). Paulson et al. found a close association between racial disparities and surgical utilization, after adjusting for all other independent variables, multivariable logistic regression revealed that Caucasian patients were over twice as likely to undergo surgical treatment compared to non-Caucasian patients (p < 0.001).25 Similar findings were corroborated by Steyerberg et al., indicating significant disparities in surgical acceptance among different racial groups26. The underlying reasons for these substantial discrepancies may be attributed to inequalities in socioeconomic status and access to healthcare2729. Future research endeavors should further investigate the extent to which these factors influence surgical decision-making among EC patients.

Advanced age is considered one of the independent prognostic factors for Asian EC patients. Findings from other studies on independent prognostic factors among various Asian subgroups are largely consistent with the results obtained in this study13,15,3032. Generally, elderly patients often present with comorbidities such as hypertension and diabetes, leading to poorer baseline health conditions. Consequently, they exhibit increased rates of in-hospital mortality and postoperative complications following surgery. This inclination towards conservative management is further influenced by traditional cultural and ideological perceptions among elderly Asian populations, resulting in lower acceptance rates of surgical interventions. Markar et al.‘s review study revealed that elderly EC patients undergoing esophagectomy face higher risks, with increased in-hospital mortality (7.83% vs. 4.21%) and higher rates of respiratory (21.77% vs. 19.49%) and cardiovascular (18.7% vs. 13.17%) complications compared to younger patients33. Furthermore, the 5-year OS rate (21.23% vs. 29.01%, p < 0.05) and 5-year disease-free survival rate (34.4% vs. 41.8%, p < 0.05) were lower in the elderly group than in the younger group. Mantziari et al. conducted a more refined analysis, demonstrating that compared to younger patients undergoing esophagectomy, elderly patients exhibited higher rates of respiratory complications (20% vs. 16%), cardiovascular complications (15.6% vs. 7.0%), and postoperative mortality (7.9% vs. 3.4%).34

Marital status is one of the socio-economic factors, where being married can serve as a surrogate indicator of social support, contributing to improving the survival status of patients with various diseases, including malignant tumors. The study by Krajc et al. demonstrated that married cancer patients, compared to unmarried ones, had a lower likelihood of tumor metastasis, a greater chance of receiving definitive treatment, and a reduced probability of dying from cancer35. Similarly, Du et al. conducted a study based on the SEER database among EC patients, revealing that the risk of death from various causes for singles (HR = 1.14, 95% CI 1.11–1.17, p < 0.001), divorced or separated individuals (HR = 1.16, 95% CI 1.13–1.19, p < 0.001), and widowed individuals (HR = 1.22, 95% CI 1.19–1.26, p < 0.001) was higher than that for married patients36. This is generally consistent with the viewpoint of our study. However, since the focus of this study is on comprehensive prognostic factors among Asian populations, the unmarried population was not specifically categorized. Aizer et al. found that married patients were less likely to experience distant tumor metastasis, more likely to undergo curative treatment, and had a lower risk of cancer-related mortality compared to unmarried patients after adjusting for demographic characteristics, staging, and treatment modalities (HR = 0.80, 95% CI: 0.79–0.81, p<0.001)37. In our study, unmarried patients had a higher risk compared to married patients (HR = 1.409, 95% CI: 1.127, 1.761, p<0.001). Gender differences are one of the factors affecting the incidence and mortality rates of EC. In Asia, the incidence rate of EC in males and females differs by 2.37 times4. According to the SEER data report, female patients with ESCC exhibit a higher relative survival rate38. The mechanisms underlying these gender differences are not fully understood and seem to be multifactorial, primarily involving hormonal and genomic factors39,40. A meta-analysis showed that larger tumors are independent prognostic factors leading to poorer OS and disease-free survival (DFS) in EC patients41. However, some studies have overlooked the impact of tumor size on prognosis24,4247. We believe that the reason for this discrepancy is that we established the optimal cutoff values using X-tile software, greatly increasing the accuracy of grouping.

To date, several studies have constructed nomograms to predict long-term survival in EC patients32,48. However, to the best of our knowledge, close to 80% of EC patients are of Asian descent, with a high incidence rate and short survival time, yet there has been no prior research specifically targeting this population. This study, for the first time, analyzed prognostic factors in Asian EC patients based on big data and established a nomogram. For such patients, relying solely on TNM staging provides extremely limited predictive efficacy for individualized survival, highlighting the urgent need for additional methods for auxiliary assessment. This study extensively incorporated demographic, clinicopathological, and treatment data of patients and used X-tile to determine the optimal cutoff values for age and tumor size. It established highly accurate visual predictive models, with C-indices all exceeding 0.70 and AUCs all surpassing 0.75. Calibration curves demonstrated high consistency between predicted OS rates by the nomogram and actual OS rates, while DCA indicated excellent clinical value of the model. Furthermore, the established risk stratification can provide recommendations for clinicians to comprehensively assess the prognosis and individualized treatment of Asian EC patients.

Although this study has established a reliable nomogram, there are still certain limitations. Firstly, the SEER database does not contain specific information on chemotherapy regimens and radiotherapy doses, nor does it record detailed information on other prognostic factors, such as immunotherapy, Performance Status (PS) scores, postoperative complications, nutritional status, targeted drug therapy, genetic molecular markers, etc. Secondly, although this study was based on analysis of a large sample population and obtained good cross-validation results, external data were not used to validate the accuracy of this study. Finally, this study is a retrospective analysis based on a database, with OS as the primary endpoint. Causes of death unrelated to EC (e.g., accidental death or death from external causes) may be partially included, and the data lack randomization. Prospective data are needed to further validate the conclusions.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1 (1.6MB, xlsx)
Supplementary Material 2 (188.1KB, docx)

Author contributions

X.G. and L.Q. designed the study. X.G. wrote the manuscript. P.L., Z.D., Y.G., H.W. collected, analyzed and interpreted the data. X.G. critically reviewed, edited and approved the manuscript. All authors read and approved the final manuscript.

Funding

This work was supported by the grant from Science and Technology Project of Natural Science Approval in Bengbu Medical College (2022byzd180), Guiding Project for Social Development in Taizhou City (SSF20230012), and China Health Promotion Foundation.

Data availability

The dataset analysed during the current study are available in the Surveillance, Epidemiology, and End Results (SEER) database (https://seer.cancer.gov/). All data analysed during this study are included in the supplementary file.

Declarations

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Sung, H. et al. Global Cancer statistics 2020: GLOBOCAN estimates of incidence and Mortality Worldwide for 36 cancers in 185 countries. CA Cancer J. Clin.71, 209–249. 10.3322/caac.21660 (2021). 10.3322/caac.21660 [DOI] [PubMed] [Google Scholar]
  • 2.Pennathur, A., Gibson, M. K., Jobe, B. A. & Luketich, J. D. Oesophageal carcinoma. Lancet. 381, 400–412. 10.1016/s0140-6736(12)60643-6 (2013). 10.1016/s0140-6736(12)60643-6 [DOI] [PubMed] [Google Scholar]
  • 3.Wang, R. H. From reflux esophagitis to Barrett’s esophagus and esophageal adenocarcinoma. World J. Gastroenterol.21, 5210–5219. 10.3748/wjg.v21.i17.5210 (2015). 10.3748/wjg.v21.i17.5210 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Zhu, H. et al. Epidemiological landscape of esophageal cancer in Asia: Results from GLOBOCAN 2020. Thorac. Cancer. 14, 992–1003. 10.1111/1759-7714.14835 (2023). 10.1111/1759-7714.14835 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Huang, F. L. & Yu, S. J. Esophageal cancer: Risk factors, genetic association, and treatment. Asian J. Surg.41, 210–215. 10.1016/j.asjsur.2016.10.005 (2018). 10.1016/j.asjsur.2016.10.005 [DOI] [PubMed] [Google Scholar]
  • 6.Lander, S., Lander, E. & Gibson, M. K. Esophageal cancer: Overview, risk factors, and reasons for the rise. Curr. Gastroenterol. Rep.25, 275–279. 10.1007/s11894-023-00899-0 (2023). 10.1007/s11894-023-00899-0 [DOI] [PubMed] [Google Scholar]
  • 7.Lagergren, J., Smyth, E., Cunningham, D. & Lagergren, P. Oesophageal cancer. Lancet. 390, 2383–2396. 10.1016/s0140-6736(17)31462-9 (2017). 10.1016/s0140-6736(17)31462-9 [DOI] [PubMed] [Google Scholar]
  • 8.Huang, R. J. et al. Disaggregated mortality from gastrointestinal cancers in Asian americans: Analysis of United States death records. Int. J. Cancer. 148, 2954–2963. 10.1002/ijc.33490 (2021). 10.1002/ijc.33490 [DOI] [PubMed] [Google Scholar]
  • 9.Rice, T. W. et al. Recommendations for clinical staging (cTNM) of cancer of the esophagus and esophagogastric junction for the 8th edition AJCC/UICC staging manuals. Dis. Esophagus. 29, 913–919. 10.1111/dote.12540 (2016). 10.1111/dote.12540 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Yin, H., Li, D., Zhu, C., Wang, M. & Wei, N. Factors relevant to the prognosis of patients with esophageal cancer who received intensity-modulated radiotherapy. Thorac. Cancer. 9, 1215–1219. 10.1111/1759-7714.12800 (2018). 10.1111/1759-7714.12800 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.He, C., Cai, Z., Zhang, Y. & Lin, X. Prognostic model to predict cancer-specific survival for patients with gallbladder carcinoma after surgery: A population-based analysis. Front. Oncol.9, 1329. 10.3389/fonc.2019.01329 (2019). 10.3389/fonc.2019.01329 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Chen, B. et al. The influence of adjuvant radiation therapy after endoscopic resection on survival for early stage EC: An analysis of the surveillance epidemiology and end results (SEER) database. J. Thorac. Dis.11, 3427–3434. 10.21037/jtd.2019.07.78 (2019). 10.21037/jtd.2019.07.78 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Qian, H. et al. Clinical characteristics, prognosis, and nomogram for esophageal cancer based on adenosquamous carcinoma: A SEER database analysis. Front. Oncol.11, 603349. 10.3389/fonc.2021.603349 (2021). 10.3389/fonc.2021.603349 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Fan, X. et al. Efficacy of endoscopic therapy for T1b esophageal cancer and construction of prognosis prediction model: A retrospective cohort study. Int. J. Surg.109, 1708–1719. 10.1097/js9.0000000000000427 (2023). 10.1097/js9.0000000000000427 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Tang, X. et al. A novel nomogram and risk classification system predicting the cancer-specific survival of patients with initially diagnosed metastatic esophageal cancer: A SEER-based study. Ann. Surg. Oncol.26, 321–328. 10.1245/s10434-018-6929-0 (2019). 10.1245/s10434-018-6929-0 [DOI] [PubMed] [Google Scholar]
  • 16.Cooper, J. S. et al. Chemoradiotherapy of locally advanced esophageal cancer: Long-term follow-up of a prospective randomized trial (RTOG 85 – 01). Radiat. Therapy Oncol. Group. Jama. 281, 1623–1627. 10.1001/jama.281.17.1623 (1999). 10.1001/jama.281.17.1623 [DOI] [PubMed] [Google Scholar]
  • 17.Minsky, B. D. et al. INT 0123 (Radiation Therapy Oncology Group 94 – 05) phase III trial of combined-modality therapy for esophageal cancer: High-dose versus standard-dose radiation therapy. J. Clin. Oncol.20, 1167–1174. 10.1200/jco.2002.20.5.1167 (2002). 10.1200/jco.2002.20.5.1167 [DOI] [PubMed] [Google Scholar]
  • 18.Eyck, B. M. et al. Ten-year outcome of neoadjuvant chemoradiotherapy plus surgery for esophageal cancer: The randomized controlled CROSS trial. J. Clin. Oncol.39, 1995–2004. 10.1200/jco.20.03614 (2021). 10.1200/jco.20.03614 [DOI] [PubMed] [Google Scholar]
  • 19.Ando, N. et al. A randomized trial comparing postoperative adjuvant chemotherapy with cisplatin and 5-fluorouracil versus preoperative chemotherapy for localized advanced squamous cell carcinoma of the thoracic esophagus (JCOG9907). Ann. Surg. Oncol.19, 68–74. 10.1245/s10434-011-2049-9 (2012). 10.1245/s10434-011-2049-9 [DOI] [PubMed] [Google Scholar]
  • 20.Ott, K., Lordick, F., Blank, S. & Büchler, M. Gastric cancer: Surgery in 2011. Langenbecks Arch. Surg.396, 743–758. 10.1007/s00423-010-0738-7 (2011). 10.1007/s00423-010-0738-7 [DOI] [PubMed] [Google Scholar]
  • 21.Rucker, A. J. et al. The impact of adjuvant therapy on survival after esophagectomy for node-negative esophageal adenocarcinoma. Ann. Surg.275, 348–355. 10.1097/sla.0000000000003886 (2022). 10.1097/sla.0000000000003886 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Weksler, B. & Sullivan, J. L. Survival after esophagectomy: A propensity-matched study of different surgical approaches. Ann. Thorac. Surg.104, 1138–1146. 10.1016/j.athoracsur.2017.04.065 (2017). 10.1016/j.athoracsur.2017.04.065 [DOI] [PubMed] [Google Scholar]
  • 23.Worrell, S. G. & Molena, D. Controversies in the surgical management of esophageal adenocarcinoma. J. Gastrointest. Oncol.14, 1919–1926. 10.21037/jgo-22-713 (2023). 10.21037/jgo-22-713 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Yoshida, N. et al. Can minimally invasive esophagectomy replace open esophagectomy for esophageal cancer? Latest analysis of 24,233 esophagectomies from the Japanese National Clinical Database. Ann. Surg.272, 118–124. 10.1097/sla.0000000000003222 (2020). 10.1097/sla.0000000000003222 [DOI] [PubMed] [Google Scholar]
  • 25.Paulson, E. C. et al. Underuse of esophagectomy as treatment for resectable esophageal cancer. Arch. Surg.143, 1198–1203. 10.1001/archsurg.143.12.1198 (2008). discussion 1203. 10.1001/archsurg.143.12.1198 [DOI] [PubMed] [Google Scholar]
  • 26.Steyerberg, E. W., Earle, C. C., Neville, B. A. & Weeks, J. C. Racial differences in surgical evaluation, treatment, and outcome of locoregional esophageal cancer: A population-based analysis of elderly patients. J. Clin. Oncol.23, 510–517. 10.1200/jco.2005.05.169 (2005). 10.1200/jco.2005.05.169 [DOI] [PubMed] [Google Scholar]
  • 27.Virnig, B. A., Baxter, N. N., Habermann, E. B., Feldman, R. D. & Bradley, C. J. A matter of race: Early-versus late-stage cancer diagnosis. Health Aff (Millwood). 28, 160–168. 10.1377/hlthaff.28.1.160 (2009). 10.1377/hlthaff.28.1.160 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Schwartz, K. L., Crossley-May, H., Vigneau, F. D., Brown, K. & Banerjee, M. Race, socioeconomic status and stage at diagnosis for five common malignancies. Cancer Causes Control. 14, 761–766. 10.1023/a:1026321923883 (2003). 10.1023/a:1026321923883 [DOI] [PubMed] [Google Scholar]
  • 29.Herget, K., Stroup, A., Smith, K., Wen, M. & Sweeney, C. Unstaged cancer: Long-term decline in incidence by site and by demographic and socioeconomic characteristics. Cancer Causes Control. 28, 341–349. 10.1007/s10552-017-0874-2 (2017). 10.1007/s10552-017-0874-2 [DOI] [PubMed] [Google Scholar]
  • 30.Li, J. et al. Novel nomogram predicting cancer-specific survival and overall survival in patients with primary esophageal small-cell carcinoma: a surveillance, epidemiology, and end results-based study. J. Cancer Res. Ther.17, 630–637. 10.4103/jcrt.JCRT_1612_20 (2021). 10.4103/jcrt.JCRT_1612_20 [DOI] [PubMed] [Google Scholar]
  • 31.Wu, X. X. et al. Nomogram predicting cancer-specific mortality in patients with esophageal adenocarcinoma: A competing risk analysis. J. Thorac. Dis.11, 2990–3003. 10.21037/jtd.2019.07.56 (2019). 10.21037/jtd.2019.07.56 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Du, F. et al. Development and validation of an individualized nomogram for predicting survival in patients with esophageal carcinoma after resection. J. Cancer. 11, 4023–4029. 10.7150/jca.40767 (2020). 10.7150/jca.40767 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Markar, S. R. et al. Systematic review and pooled analysis assessing the association between elderly age and outcome following surgical resection of esophageal malignancy. Dis. Esophagus. 26, 250–262. 10.1111/j.1442-2050.2012.01353.x (2013). 10.1111/j.1442-2050.2012.01353.x [DOI] [PubMed] [Google Scholar]
  • 34.Mantziari, S. et al. Esophageal cancer in elderly patients, current treatment options and outcomes: A systematic review and pooled analysis. Cancers (Basel). 10.3390/cancers13092104 (2021). [DOI] [PMC free article] [PubMed]
  • 35.Krajc, K. et al. Marital status and survival in cancer patients: A systematic review and meta-analysis. Cancer Med.12, 1685–1708. 10.1002/cam4.5003 (2023). 10.1002/cam4.5003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Du, L., Kim, J. J., Chen, B., Zhu, S. & Dai, N. Marital status is associated with superior survival in patients with esophageal cancer: A surveillance, epidemiology, and end results study. Oncotarget. 8, 95965–95972. 10.18632/oncotarget.21609 (2017). 10.18632/oncotarget.21609 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Aizer, A. A. et al. Marital status and survival in patients with cancer. J. Clin. Oncol.31, 3869–3876. 10.1200/jco.2013.49.6489 (2013). 10.1200/jco.2013.49.6489 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Bohanes, P. et al. Influence of sex on the survival of patients with esophageal cancer. J. Clin. Oncol.30, 2265–2272. 10.1200/jco.2011.38.8751 (2012). 10.1200/jco.2011.38.8751 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Lopes-Ramos, C. M., Quackenbush, J. & DeMeo, D. L. Genome-wide sex and gender differences in cancer. Front. Oncol.10, 597788. 10.3389/fonc.2020.597788 (2020). 10.3389/fonc.2020.597788 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Mathieu, L. N., Kanarek, N. F., Tsai, H. L., Rudin, C. M. & Brock, M. V. Age and sex differences in the incidence of esophageal adenocarcinoma: Results from the surveillance, epidemiology, and end results (SEER) Registry (1973–2008). Dis. Esophagus. 27, 757–763. 10.1111/dote.12147 (2014). 10.1111/dote.12147 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Wang, Z. Y. et al. Prognostic impact of tumor length in esophageal cancer: A systematic review and meta-analysis. BMC Cancer. 21, 988. 10.1186/s12885-021-08728-1 (2021). 10.1186/s12885-021-08728-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Gao, H. J. et al. Adjuvant radiotherapy for patients with pathologic node-negative esophageal carcinoma: A population based propensity matching analysis. Thorac. Cancer. 11, 243–252. 10.1111/1759-7714.13235 (2020). 10.1111/1759-7714.13235 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Deng, X. et al. The impact of adjuvant therapy on survival for node-negative esophageal squamous cell carcinoma: A propensity score-matched analysis. Ann. Transl Med.9, 998. 10.21037/atm-21-2539 (2021). 10.21037/atm-21-2539 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Shao, Y. et al. Survival benefit of perioperative chemotherapy for T1-3N0M0 stage esophageal cancer: A SEER database analysis. J. Thorac. Dis.13, 995–1004. 10.21037/jtd-20-2877 (2021). 10.21037/jtd-20-2877 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Gao, H. J. et al. Neoadjuvant chemoradiotherapy plus surgery versus surgery alone for clinical node-negative esophageal carcinoma. Thorac. Cancer. 11, 2618–2629. 10.1111/1759-7714.13586 (2020). 10.1111/1759-7714.13586 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Gabriel, E. et al. Association between clinically staged node-negative esophageal adenocarcinoma and overall survival benefit from neoadjuvant chemoradiation. JAMA Surg.151, 234–245. 10.1001/jamasurg.2015.4068 (2016). 10.1001/jamasurg.2015.4068 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Lin, G., Liu, H. & Li, J. Pattern of recurrence and prognostic factors in patients with pT1-3 N0 esophageal squamous cell carcinoma after surgery: Analysis of a single center experience. J. Cardiothorac. Surg.14, 58. 10.1186/s13019-019-0883-1 (2019). 10.1186/s13019-019-0883-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Zheng, Y. et al. Predicting prognosis in resected esophageal squamous cell carcinoma using a clinical nomogram and recursive partitioning analysis. Eur. J. Surg. Oncol.44, 1199–1204. 10.1016/j.ejso.2018.04.011 (2018). 10.1016/j.ejso.2018.04.011 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material 1 (1.6MB, xlsx)
Supplementary Material 2 (188.1KB, docx)

Data Availability Statement

The dataset analysed during the current study are available in the Surveillance, Epidemiology, and End Results (SEER) database (https://seer.cancer.gov/). All data analysed during this study are included in the supplementary file.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES