Skip to main content
Springer logoLink to Springer
. 2023 Aug 28;149(17):15395–15406. doi: 10.1007/s00432-023-05323-8

Construction of the survival nomograms for colon cancer patients of different ages based on the SEER database

Yuzhou Yang 1,2, Peng Xu 1,, Cheng Zhang 1,2,
PMCID: PMC10620318  PMID: 37639008

Abstract

Introduction

Three nomograms for predicting the outcomes of early- and late-onset colon cancer (COCA) among patients not stratified by age were constructed using data in the Epidemiology and End Results (SEER) database (1975–2019). The accuracy of the nomogram was then assessed.

Method

Clinical data of 6107 patients with COCA were obtained from the SEER database. The patients were randomly divided into training and validation cohorts in a ratio of 7:3. Univariate and multivariate COX analyses of factors that could independently impact the prognosis of COCA were performed, and the corresponding nomograms for early-onset and late-onset COCA were constructed. Calibration curves, ROC curves, and C-index were used to determine the predictive accuracy. The discriminatory ability of the nomograms to assess their clinical utility, which was compared with the TNM staging system of the 8th edition of AJCC, was verified using survival analysis.

Result

Tumor primary site, ethnicity, and serum carcinoembryonic antigen (CEA) level significantly impacted the prognosis of colon cancer. Race, brain metastasis, and CEA were independent factors for predicting COCA prognosis. C-index, ROC, and calibration curves demonstrated that the three nomograms were accurate and superior to the traditional TNM staging system. Among the three nomograms, the early-onset COCA nomogram had the highest predictive accuracy, followed by that of colon cancer not stratified by age.

Conclusion

Three nomograms for patients not stratified by age, early-onset colon cancer, and late-onset colon cancer were constructed. The accuracies of the nomograms were good and were all superior to the conventional TNM staging system. The early- and late-onset COCA nomograms are useful for clinical management and individualized treatment of COCA patients at different ages.

Keywords: Colon cancer, SEER, Nomogram, Early-onset colon cancer, Late-onset colon cancer

Introduction

Colorectal cancer (CRC) is the fourth most fatal cancer globally, killing nearly 9 million people annually. In addition to an aging population and dietary habits, several factors, such as obesity, lack of physical activity, and smoking, increase the risk of CRC (Dekker et al. 2019). The USA had 147,950 new cases and 53,200 deaths in 2020 (Siegel et al. 2020). Patients with colon cancer (COCA) mainly present with small cell anemia, rectal bleeding, chronic abdominal pain, and changes in bowel habits (Benson et al. 2018). The median age of COCA onset is 67 years. Although only 12% of COCA patients are aged under 50 years, the incidence of CRC in individuals younger than 50 has increased by approximately 2% per year in the USA since the 1990s (Siegel et al. 2020).

Early-onset COCA refers to COCA diagnosed at < 50 years of age. The incidence of early-onset COCA is increasing worldwide, and its pathogenesis is still unclear. Early-onset COCA has different clinical, pathological, and molecular features. Compared with late-onset COCA, which refers to COCA diagnosed at age > 50 years, early-onset COCA mostly occurs in the descending colon, is mostly diagnosed in the late stage, and is poorly differentiated. The risk factors of early-onset COCA are not the same as those of late-onset COCA (Zaborowski et al. 2021). Comparative analysis of gene expression in early-onset and late-onset COCA identified 88 genes specific to early-onset COCA (Jandova et al. 2016). CLC and IFNAR1 differ in somatic gene expression between younger and older COCA patients, highlighting the genomic complexity of COCA in patients of different ages (Ågesen et al. 2011).

The tumor-node-metastasis (TNM) staging system, proposed by the International Union Against Cancer (UICC) and the American Joint Committee on Cancer (AJCC), is the standard method for staging malignant tumors and is widely used to assess cancer prognosis (Hari et al. 2013). The TNM staging system plays an important role in formulating treatment strategies, assessing treatment outcomes, predicting the survival time of patients after surgery, and the management of COCA patients (Delattre et al. 2022). In addition, studies have shown that other factors such as age, serum carcinoembryonic antigen (CEA) level, race, and tumor site are also strongly associated with tumor development in individual cases (Wang et al. 2015; Liang et al. 2018; Biller and Schrag 2021). Clinically, COCA can easily metastasize to other parts of the body, with the lung and liver being the most common metastatic organs, followed by bone and brain (Wang et al. 2020). The TNM staging system has two main limitations; it assesses the risk of individual patients by incorporating only three variables (T-stage, N-stage, and M-stage) and other risk factors such as age, gender, race, and tumor size, among others, cannot be incoperated (Duijster et al. 2021).

Nomograms have been applied in clinical studies related to prognosis and have gained wide acceptance (Song et al. 2018a; He et al. 2018). However, studies on the differences in the predictive accuracy of nomograms for COCA patients at different ages remain scanty. Herein, we analyzed a large amount of colon cancer data from the Surveillance, Epidemiology and End Results (SEER) program (1975–2019) and constructed three nomograms: not stratified-by-age nomograms, early-onset colon cancer (COCA) nomogram, and late-onset COCA nomogram to predict individualized survival time of colon cancer patients at any age. The accuracy of each nomogram was assessed.

Materials and methods

Patients selection

This was a retrospective cohort study in which data were obtained from the surveillance, epidemiology, and final results database (www.seer.cancer.gov). The SEER database, updated every year since 1973, comprises the incidence, prevalence, and mortality data of patients with various tumors and can be used to analyze tumor trends. The data for patients diagnosed with colon cancer between 1975 and 2019 were retrieved using SEER * Stat software version 8.4.0.1. The database contained data for 333,496 COCA patients. Patients with invalid or missing data were excluded from the study. Patients were randomly divided into training and validation cohorts in a ratio of 7:3, and patients in each group were divided into two groups based on age (> 50 and < 50). The procedure for data retrieval is shown in Fig. 1.

Fig. 1.

Fig. 1

Flow chart for selecting COCA patients included in the present study

Clinical variables and outcomes

The clinical variables analyzed in this study include age, race, gender, tumor primary site, tumor size, TNM stage based on the 8th edition of AJCC guidelines, serum CEA level, nerve invasion, lymph node metastasis, liver metastasis, lung metastasis, bone metastasis, and brain metastasis. The primary outcome analyzed was the overall survival (OS), which refers to the time from diagnosis to death from any cause or the end of the follow-up period.

Univariate and multivariate COX regression analyses

The 6107 MCC patients included in this study were randomly divided into two groups in the ratio of 7:3 using R software (version 4.1.3). Data for patients in the training set were used to build the nomograms (n = 4276), while that of those in the validation set were used to verify the predictive accuracy of the nomograms (n = 1831). Univariate Cox proportional risk regression analysis was performed to evaluate the contribution of each clinical variable in predicting COCA prognosis. These variables included tumor characteristics (T stage, N stage, M stage, tumor size, primary tumor location, other organ metastases, etc.), demographic variables (race, gender, and age), and serum carcinoembryonic antigen level. Demographic variables, TNM stage, and statistically significant factors were then included in the multivariate COX regression analysis to calculate hazard ratios (HRs) at 95% confidence intervals (CIs). AJCC staging was excluded from the Cox regression because combining T, N, and M stages could interfere with AJCC staging.

Construction of prognostic nomogram

Demographic variables (race, gender, age) and indicators with p less than 0.05 in the univariate and multivariate analyses were included in constructing a predictive nomogram, and the no-model plots were constructed using “Survival,” “Foreign,” and “RMS” packages in the R software. RMS program package is to construct the nomogram. Each variable in the nomogram was assigned a score, and the final multiple scores were summed to give an overall score for predicting OS at months 6 and 18, and the performance of the nomogram was measured by the consistency index (C-index). The median risk score was used as the cutoff value to divide the patients into high-risk and low-risk groups. KaplanMeier (KM) survival curves were used to fit the correlation between survival time and predicted scores in the high-risk and low-risk groups. Models were also constructed based on the TNM scoring system of the 8th AJCC edition. The C-index was calculated, and the ROC, correction, and survival curves for early-onset and late-onset colon cancer were plotted. The reliability and applicability of the models were compared.

Result

Patient characteristics

A total of 6107 COCA patients were obtained from the SEER database. Of these, 4276 were classified into the training cohort and 1831 in the validation group. Of the patients in the training cohort, 488 patients (11.4%) in the training cohort were < 50 years old, the majority (75.9%) were white, while the rest were black or other races, and 2154 (50.3%) were males. For the validation cohort, 916 patients (50%) were males, and 192 patients (48.0%) were < 50 years of age (Table 1).

Table 1.

The demographics and pathological characteristics of patients included in the present study

Characteristic Train cohort (n = 4276) Test cohort (n = 1831)
Age
 < 30 24 (0.5%) 6 (0.3%)
 30–44 249 (5.8%) 95 (5.1%)
 45–59 1003 (23.4%) 432 (23.5%)
 60–74 1563 (36.5%) 685 (37.4%)
 ≥ 75 1437 (33.6%) 613 (33.4%)
Race
 Black 422 (9.8%) 190 (10.3%)
 White 3248 (75.9%) 1375 (75%)
 Other 606 (14.1%) 266 (14.5%)
Sex
 Male 2122 (49.6%) 915 (49.9%)
 Female 2154 (50.3%) 916 (50%)
Primary site
 Cecum (180) 1005 (23.5%) 405 (22.1%)
 Colon ascendens (182) 994 (23.2%) 413 (22.5%)
 Flexura hepatica coli (183) 233 (5.5%) 94 (5.1%)
 Colon transversum (184) 450 (10.5%) 205 (11.1%)
 Splenic flexure of colon (185) 145 (3.3%) 72 (3.9%)
 Colon descendens (186) 250 (5.8%) 106 (5.7%)
 Colon sigmoideum (187) 1115 (26.0%) 496 (27%)
 Cross-overlap span (188) 67 (1.5%) 29 (1.5%)
 Other (189) 17 (0.3%) 11 (0.6%)
T stage
 Tis 40 (0.9%) 10 (0.5%)
 T0 3 (< 0.1%) 0
 T1 474 (11.0%) 218 (11.9%)
 T2 548 (12.8%) 265 (14.4%)
 T3 2261 (52.8%) 963 (52.5%)
 T4a 661 (15.4%) 254 (13.8%)
 T4b 289 (6.7%) 121 (6.6%)
N stage
 N0 2353 (55.0%) 1046 (57.1%)
 N1a 502 (11.7%) 228 (12.4%)
 N1b 584 (13.6%) 239 (13%)
 N1c 141 (3.2%) 51 (2.7%)
 N2a 353 (8.2%) 135 (7.3%)
 N2b 343 (8%) 132 (7.2%)
M Stage
 M0 3708 (86.7%) 1590 (86.8%)
 M1a 309 (7.2%) 145 (7.9%)
 M1b 128 (2.9%) 47 (2.5%)
 M1c 131 (3%) 49 (2.6%)
CEA
 Negative 2538 (59.3%) 1065 (58.1%)
 Positive 1738 (40.6%) 766 (41.8%)
Perineural invasion
 No 3644 (85.2%) 1550 (84.6%)
 Yes 632 (14.7%) 281 (15.3%)
Liver metastasis
 No 3902 (91.2%) 1664 (90.8%)
 Yes 374 (8.7%) 167 (9.1%)
Lung metastasis
 No 4175 (97.6%) 1796 (98%)
 Yes 101 (2.3%) 35 (1.9%)
Bone metastasis
 No 4262 (99.6%) 1824 (99.6%)
 Yes 14 (0.3%) 7 (0.3%)
Brain metastasis
 No 4271 (99.8%) 1829 (99.8%)
 Yes 5 (0.1%) 2 (0.1%)
LN metastasis
 No 4203 (98.2%) 1800 (98.3%)
 Yes 73 (1.7%) 31 (1.6%)
Tumor size
 < 5 cm 2264 (52.9%) 989 (54%)
 5–10 cm 1654 (38.6%) 695 (37.9%)
 ≥ 10 cm 358 (8.3%) 147 (8%)
Status
 Alive 3850 (90%) 1667 (91%)
 Dead 426 (10%) 164 (8.9%)

Univariate and multivariate COX regression analysis

Firstly, one-way COX regression analysis for age, race, gender, tumor primary site, tumor size, TNM stage of the 8th edition of AJCC, serum carcinoembryonic antigen, nerve invasion, lymph node metastasis, liver metastasis, lung metastasis, bone metastasis, and brain metastasis were performed. Several factors, including primary tumor site, tumor size, ethnicity, serum carcinoembryonic antigen, nerve invasion, lymph node metastasis, liver metastasis, lung metastasis, bone metastasis, and brain metastasis, were significantly associated with COCA prognosis (p < 0.05). Primary tumor site, ethnicity, brain metastasis, serum carcinoembryonic antigen, and nerve invasion were independent factors associated with COCA prognosis. The univariate and multivariate COX regression analysis results are shown in Table 2.

Table 2.

Univariate and multivariate analyses of factors linked to COCA prognosis

Characteristic Univariate analysis Multivariate analysis
Hazard rate (95%CI) p value Hazard rate (95%CI) p value
Age
 < 30 1
 30–44 0.72 (0.09–5.77) 0.758
 45–59 0.99 (0.14–7.20) 0.994
 60–74 1.80 (0.25–12.88) 0.559
 ≥ 75 3.98 (0.56–28.36) 0.168
Race
 Black 1 1
 White 0.97 (0.71–1.32) 0.864 1.06 (0.77–1.46) 0.708
 Other 0.58 (0.38–0.89) 0.012* 0.64 (0.42–1.00) 0.046*
Sex
 Male 1
 Female 1.00 (0.83–1.21) 0.982
Primary site
 Cecum (180) 1 1
 Colon ascendens (182) 0.86 (0.66–1.12) 0.258 1.14 (0.87–1.49) 0.335
 Flexura hepatica coli (183) 0.63 (0.39–1.02) 0.059 0.78 (0.48–1.28) 0.328
 Colon transversum (184) 0.96 (0.69–1.33) 0.818 1.11 (0.79–1.54) 0.551
 Splenic flexure of colon (185) 0.68 (0.37–1.22) 0.197 0.76 (0.42–1.38) 0.361
 Colon descendens (186) 0.80 (0.52–1.25) 0.382 0.99 (0.63–1.54) 0.947
 Colon sigmoideum (187) 0.55 (0.42–0.74)  < 0.001*** 0.64 (0.48–0.86) 0.002***
 Cross-overlap span (188) 1.23 (0.64–2.34) 0.531 1.12 (0.57–2.20) 0.736
Other (189) 2.17 (0.80–5.87) 0.128 2.52 (0.84–7.58) 0.101
T stage
 Tis 1 1
 T0 9.20 (0.58–147.15) 0.117 1.40 (0.07–26.45) 0.823
 T1 2.48 (0.34–18.21) 0.371 2.72 (0.37–20.00) 0.325
 T2 1.88 (0.26–13.82) 0.535 1.89 (0.26–13.95) 0.530
 T3 3.42 (0.48–24.38) 0.220 2.43 (0.34–17.44) 0.377
 T4a 7.21 (1.01–51.67) 0.049* 3.55 (0.49–25.72) 0.210
 T4b 8.51 (1.18–61.44) 0.034* 3.39 (0.46–24.80) 0.229
N stage
 N0 1 1
 N1a 1.40 (1.01–1.93) 0.041* 1.03 (0.74–1.45) 0.854
 N1b 1.40 (1.02–1.89) 0.032 0.87 (0.62–1.21) 0.423
 N1c 1.82 (1.07–3.09) 0.027 1.08 (0.62–1.87) 0.782
 N2a 2.01 (1.46–2.77)  < 0.001*** 1.05 (0.74–1.49) 0.788
 N2b 4.36 (3.38–5.62)  < 0.001*** 1.63 (1.19–2.23) 0.003**
M stage
 M0 1 1
 M1a 3.54 (2.73–4.60)  < 0.001*** 2.02 (1.33–3.06)  < 0.001***
 M1b 4.97 (3.58–6.91)  < 0.001*** 2.52 (1.56–4.08)  < 0.001***
 M1c 6.96 (5.14–9.43)  < 0.001*** 3.19 (2.12–4.81) < 0.001***
CEA
 Negative 1 1
 Positive 2.71 (2.22–3.30)  < 0.001*** 1.78 (1.43–2.21)  < 0.001***
Perineural invasion
 No 1 1
 Yes 2.07 (1.66–2.57)  < 0.001*** 1.19 (0.94–1.52) 0.155
Liver metastasis
 No 1 1
 Yes 3.74 (3.00–4.66)  < 0.001*** 1.23 (0.86–1.76) 0.256
Lung metastasis
 No 1 1
 Yes 2.92 (1.95–4.38)  < 0.001*** 0.83 (0.53–1.31) 0.432
Bone metastasis
 No 1 1
 Yes 7.04 (3.49–14.17)  < 0.001*** 1.98 (0.94–4.19) 0.075
Brain metastasis
 No 1 1
 Yes 8.98 (2.88–27.96)  < 0.001*** 4.24 (1.27–14.22) 0.019*
LN metastasis
 No 1 1
 Yes 5.59 (2.34–5.51) < 0.001*** 0.88 (0.55–1.40) 0.581
Tumor size
 < 5 cm 1 1
 5–10 cm 1.53 (1.25–1.87)  < 0.001*** 1.11 (0.90–1.38) 0.332
 ≥ 10cm 1.72 (1.26–2.36) < 0.001*** 1.04 (0.75–1.45) 0.821

*p < 0.05; **p < 0.01; ***p < 0.001

Prognostic nomogram construction

Nomograms for predicting 6 and 18 months of OS of COCA were constructed using data for patients in the training set (Fig. 2). The risk factors included in the not stratified by age nomogram contained significant demographic variables (race, sex, and age; p < 0.05) obtained by univariate and multivariate analyses. The nomogram for predicting the early-onset COCA and late-onset COCA comprised all of the above indicators except age. The nomograms contained scores for each risk factor and the total scores. The nomogram could generally predict the 6-month and 18-month OS of COCA patients (Fig. 2).

Fig. 2.

Fig. 2

Nomograms for predicting the OS of COCA patients. A Nomogram for predicting the prognosis of non-age-stratified COCA patients; B nomogram for predicting the prognosis of early-onset COCA cancer; C nomograms for predicting the prognosis of late-onset COCA

Nomogram calibration and validation

The C-index was used to compare the performance of the nomograms. Particularly, the C-index of the training cohort without stratification by age and the validation cohort was 0.79 (95% CI 0.77–0.81) and 0.81 (95% CI 0.77–0.84), respectively. In the early-onset COCA training set cohort, validation cohort C-index was 0.95 (95% CI 0.91–0.99), 1. In the late-onset COCA training cohort, the validation cohort C-index was 0.72 (95% CI 0.69–0.75), 0.75 (95% CI 0.71–0.79), respectively. The C-index of the nomogram for predicting the OS of patients grouped by the TNM staging based on the 8th edition of AJCC guidelines was 0.68 (95% CI 0.65–0.71). ROC curves were plotted to assess the performance of the models (Fig. 3), and the area under the ROCs of the three for predicting the 6- and 18-month prognosis of COCA patients was greater than 0.7, which was higher than that of the TNM staging AJCC system. In addition, calibration plots of the three prediction models and the TNM staging system for the training cohort were plotted (Fig. 4). We found a high consistency between the predicted survival probabilities and the observed outcomes. The above results demonstrate that the better performance of our models is superior to that of the TNM staging system.

Fig. 3.

Fig. 3

ROC for the 6- and 18-month prognosis prediction of the nomograms and the TNM staging system. A ROC for the 6- and 18-month OS prediction of the models for the training cohort of COCA patients not stratified by age; B ROC curves for the 6- and 18-months OS prediction of the models for the validation cohort of COCA patients not stratified by age; C ROC for the 6 and 18 months’ of OS prediction of the models for the early-onset COCA patients in the training cohort; D ROC for the 6- and 18-months OS prediction of the models for the early-onset COCA patients in the validation cohort; E ROC for the 6 and 18 months’ of OS prediction of the models for the late-onset COCA patient in the training set; F ROC for the 6 and 18 months’ of OS prediction of the models for the late-onset COCA patient in the validation set; G ROC for 6 and 18 months for the OS prediction of the TNM staging system

Fig. 4.

Fig. 4

Calibration curves of the nomogram and TNM staging system. A Six-month calibration curves for the nomogram not stratified by age; B the 18-month calibration curves for nomogram not stratified by age; C the 6-month calibration curve for early-onset COCA's nomogram; D the 18-month calibration curve for early-onset COCA's nomogram; E the 6-month calibration curve for late-onset COCA’s nomogram; F the 18-month calibration curve for late-onset COCA’s nomogram; G the 6-month calibration curve for TNM staging system; H the 18-month calibration curve for TNM staging system

Survival analyses

The overall risk scores (with the median score as the critical value) of each prognostic factor included in the three nomograms were calculated, and patients were divided into low-risk and high-risk groups. The KM curves were plotted (Fig. 5A–C), which suggested that the prognosis of the low-risk group individuals was better than that of the high-risk group individuals. We also plotted KM curves for the conventional TNM staging system, which still revealed a better prognosis for the low-risk group (Fig. 5D).

Fig. 5.

Fig. 5

The KM curves for OS prediction of the TNM staging system. A KM curves of the models for patients not stratified by age; B KM curve of early-onset COCA model; C KM curve of late-onset COCA model; D KM curve of TNM staging system

Discussion

COCA is the fifth leading cause of cancer-related deaths worldwide (Sung et al. 2020). A total of 13 factors, including age, gender, and metastatic status, have been associated with COCA (Ge et al. 2019). COCA is prone to distant metastasis, and the site of metastasis is important to COCA prognosis. COCA is highly metastatic to the liver, lungs, brain, and bone, and the risk of death is higher in patients with these metastases than those without (Wang et al. 2020; Chang et al. 2017; Nakamura et al. 2021). In this study, we constructed a non-age stratification, early-onset COCA, and late-onset COCA nomograms using data in the SEER database. The nomograms were validated to have had good discriminatory ability, accuracy, and positive predictive power. The clinical significance of indicators such as race, distant metastasis, and CEA in COCA patients was investigated.

The dataset not stratified by age in this study is one in which the study cohort was not grouped based on age. Many previous studies have demonstrated that for cancer patients, older adults have a shorter OS survival than their younger opposites (Tai et al. 2022; Chen et al. 2020; Badic et al. 2021; Pilleron et al. 2021). In addition, Kuai et al. constructed a nomogram of patients with liver metastases COCA and found that age was one of the most important variables in predicting the prognosis of liver cancer patients (Kuai et al. 2021). Pei et al. (2020) also found that age impacted the survival of patients with non-metastatic COCA. Herein, although univariate analysis revealed that age in the dataset not stratified by age was not significantly linked to COCA prognosis, we included it in constructing the nomogram. The incidence of early-onset COCA is increasing every year (Tanaka et al. 2023), which has increased by 2% per year since 1994 (Mauri et al. 2019). Compared with late-onset COCA, early-onset COCA, a new subtype of COCA, has unique molecular mechanisms of development, genetic characteristics, and histopathological features (Zaborowski et al. 2021). Most early-onset COCAs are diagnosed in the later stage, it is characterized by poor cell differentiation, and the primary tumor is located in the descending colon. Furthermore, a detailed comparative analysis of gene expression between early-onset COCA and late-onset COCA revealed 88 gene expression changes specific to early-onset COCA (Jandova et al. 2016). In summary, early-onset COCA presents with specific clinical features and has unique molecular development mechanisms. Thus, it was crucial to investigate the unique clinical features and molecular blueprint of early-onset COCA, both of which are important in the clinical diagnosis and treatment of early-onset COCA. The COCA patients in the SEER database were divided into the early-onset COCA and the late-onset COCA groups, and the same factors that impact COCA prognosis were used to construct corresponding nomograms. Based on the area under the ROC, our nomogram accurately predicted COCA prognosis. Notably, the nomogram for the early-onset COCA was more accurate in COCA prognosis prediction. Our findings illustrate differences exist in the factors that impact the prognosis of early-onset and late-onset COCA.

In this study, the relationship between sex and prognosis of COCA patients was not significant. However, demographic variables such as sex and race are one of essential variables in cancer treatment (Goggins and Lo 2012; Shavers and Brown 2002; Zeng et al. 2015). The incidence of COCA is about 20% lower in women than in men (Hultcrantz 2021). Numerous studies have been conducted on the tumor sites of COCA (Siegel et al. 2012; Meguid et al. 2008). The COCA biologies, such as microsatellite instability and differences in gene expression, vary with the cancer site (Papagiorgis et al. 2012; Sun 2021). A study by Ge et al. found significantly different prognoses for COCA at different sites. The OS of patients with right-sided COCA was shorter than those with left-sided COCA (Ge et al. 2019). The precise reason for this difference is not known. However, only a few studies have investigated the specific tumor sites. Specific tumor sites were among the variables incorporated in the constructed nomogram, which increased the accuracy of the nomogram. Nomograms provide a reasonable and reproducible algorithm for individualized prognostic assessment. They have been used for predicting the prognosis of several cancer types, such as pancreatic cancer, gastrointestinal mesenchymal tumor, and gastric cancer, among others (Song et al. 2018a, b; Liu et al. 2016). Combining many clinical variables related to prognosis into the nomogram can reveal comparable staging systems and more disease-specific characteristics. Only significant variables in the univariate and multifactorial analyses were included in constructing the nomograms in the present study. Furthermore, important variables considered in cancer treatment, such as gender, were also included because they are relevant in clinical practice.

The TNM staging system is the standard method for staging COCA (Hari et al. 2013). However, the inherent limitations of the TNM staging system are unavoidable because it only recognizes the T-stage, N-stage, and M-stage when assessing patient prognosis but does not consider other factors that impact patient prognosis (Guevara-Cuellar et al. 2021; Feng et al. 2021). Herein, more variables that impact the OS of COCA patients were included. The ROC curve revealed that the accuracy of the constructed nomogram was higher than that of the conventional TNM staging system.

This study had several limitations. First, given that it was a retrospective study, selection bias in the patient selection process cannot be ruled out. In addition, due to the limited clinical information on patients in the SEER database, more valuable clinical factors, such as specific radiation treatment regimens, surgical access, were not considered in the analysis. Finally, the accuracy of our nomogram was not validated with external data. To override this limitation, our study cohort was divided into the training and validation cohort in a ratio of 7:3, in which the 30% of the population was used for internal validation. The results of the internal validation demonstrated the robustness of the model. Despite these drawbacks, our study has some clear advantages. First, this study included sufficient data, which increased the reliability of our findings. Second, we constructed three nomograms simultaneously, which not only made the results more comparable but also illustrated some differences in the factors that impact prognosis between early-onset COCA and late-onset COCA. Third, the model's accuracy was validated, further demonstrating the stability and reliability of the constructed models. Finally, the dynamic nomograms could predict the OS of COCA patients at all ages.

Conclusion

Herein, we constructed highly accurate nomograms for predicting the outcomes of early-onset COCA and late-onset COCA. The accuracy of the nomograms was higher than that of the TNM staging system. Among the three nomograms, the nomogram for the early-onset COCA prognosis prediction was the most accurate, followed by the non-age-stratified COCA prognosis prediction nomogram. The nomograms for predicting early-onset and late-onset COCA are useful for clinical management and individualized treatment of patients at different ages.

Author contributions

All authors contributed to the research design. YZY designed and prepared the manuscript; YZY and PX performed the analysis, and CZ read and approved the final manuscript.

Funding

This study was supported by Minsheng Science and Technology Program Joint Program (2021JH2/10300106).

Data availability statement

The dataset supporting the conclusions of this paper is included in the material of this paper.

Declarations

Conflict of interest

The authors declare that the research was conducted without any commercial or financial relationships that could be construed as a potential conflict of interest.

Ethics approval and consent to participate

Not applicable.

Consent for publication

All authors consent to the publication of this study.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Peng Xu, Email: pppengxu@163.com.

Cheng Zhang, Email: zhangc1109@163.com.

References

  1. Ågesen TH, Berg M, Clancy T, Thiis-Evensen E, Cekaite L, Lind GE, Nesland JM, Bakka A, Mala T, Hauss HJ, Fetveit T, Vatn MH, Hovig E, Nesbakken A, Lothe RA, Skotheim RI. CLC and IFNAR1 are differentially expressed and a global immunity score is distinct between early- and late-onset colorectal cancer. Genes Immun. 2011;12(8):653–662. doi: 10.1038/gene.2011.43. [DOI] [PubMed] [Google Scholar]
  2. Badic B, Oguer M, Cariou M, Kermarrec T, Bouzeloc S, Nousbaum JB, Robaszkiewicz M, Queneherve L. Prognostic factors for stage III colon cancer in patients 80 years of age and older. Int J Colorectal Dis. 2021;36(4):811–819. doi: 10.1007/s00384-021-03861-6. [DOI] [PubMed] [Google Scholar]
  3. Benson AB, Venook AP, Al-Hawary MM, Cederquist L, Chen YJ, Ciombor KK, Cohen S, Cooper HS, Deming D, Engstrom PF, Garrido-Laguna I, Grem JL, Grothey A, Hochster HS, Hoffe S, Hunt S, Kamel A, Kirilcuk N, Krishnamurthi S, Messersmith WA, Meyerhardt J, Miller ED, Mulcahy MF, Murphy JD, Nurkin S, Saltz L, Sharma S, Shibata D, Skibber JM, Sofocleous CT, Stoffel EM, Stotsky-Himelfarb E, Willett CG, Wuthrick E, Gregory KM, Freedman-Cass DA. NCCN guidelines insights: colon cancer, Version 2.2018. J Natl Compr Cancer Netw JNCCN. 2018;16(4):359–369. doi: 10.6004/jnccn.2018.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Biller LH, Schrag D. Diagnosis and treatment of metastatic colorectal cancer: a review. JAMA. 2021;325(7):669–685. doi: 10.1001/jama.2021.0106. [DOI] [PubMed] [Google Scholar]
  5. Chang YC, Su CY, Chen MH, Chen WS, Chen CL, Hsiao M. Secretory RAB GTPase 3C modulates IL6-STAT3 pathway to promote colon cancer metastasis and is associated with poor prognosis. Mol Cancer. 2017;16(1):135. doi: 10.1186/s12943-017-0687-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Chen F, Wang F, Bailey CE, Murff HJ, Berlin JD, Shu XO, Zheng W. Evaluation of determinants for age disparities in the survival improvement of colon cancer: results from a cohort of more than 486,000 patients in the United States. Am J Cancer Res. 2020;10(10):3395–3405. [PMC free article] [PubMed] [Google Scholar]
  7. Dekker E, Tanis PJ, Vleugels JLA, Kasi PM, Wallace MB. Colorectal cancer. Lancet (london, England) 2019;394(10207):1467–1480. doi: 10.1016/S0140-6736(19)32319-0. [DOI] [PubMed] [Google Scholar]
  8. Delattre JF, Selcen Oguz-Erdogan A, Cohen R, Shi Q, Emile JF, Taieb J, Tabernero J, André T, Meyerhardt JA, Nagtegaal ID, Svrcek M. A comprehensive overview of tumour deposits in colorectal cancer: towards a next TNM classification. Cancer Treat Rev. 2022;103:102325. doi: 10.1016/j.ctrv.2021.102325. [DOI] [PubMed] [Google Scholar]
  9. Duijster J, Mughini-Gras L, Neefjes J, Franz E. Occupational exposure and risk of colon cancer: a nationwide registry study with emphasis on occupational exposure to zoonotic gastrointestinal pathogens. BMJ Open. 2021;11(8):e050611. doi: 10.1136/bmjopen-2021-050611. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Feng H, Lyu Z, Zheng J, Zheng C, Wu Q, Liang W, Li Y. Association of tumor size with prognosis in colon cancer: a surveillance, epidemiology, and end results (SEER) database analysis. Surgery. 2021;169(5):1116–1123. doi: 10.1016/j.surg.2020.11.011. [DOI] [PubMed] [Google Scholar]
  11. Ge H, Yan Y, Xie M, Guo L, Tang D. Construction of a nomogram to predict overall survival for patients with M1 stage of colorectal cancer: a retrospective cohort study. Int J Surg (london, England) 2019;72:96–101. doi: 10.1016/j.ijsu.2019.10.021. [DOI] [PubMed] [Google Scholar]
  12. Goggins WB, Lo FF. Racial and ethnic disparities in survival of US children with acute lymphoblastic leukemia: evidence from the SEER database 1988–2008. Cancer Causes Control CCC. 2012;23(5):737–743. doi: 10.1007/s10552-012-9943-8. [DOI] [PubMed] [Google Scholar]
  13. Guevara-Cuellar CA, Soto-Rojas VE, Echeverry-Molina MI, Gómez M, Martínez P. Optimal allocation of chemotherapy schemes for metastatic colon cancer in Colombia. Value Health Reg Issues. 2021;26:105–112. doi: 10.1016/j.vhri.2021.01.006. [DOI] [PubMed] [Google Scholar]
  14. Hari DM, Leung AM, Lee JH, Sim MS, Vuong B, Chiu CG, Bilchik AJ. AJCC Cancer Staging Manual 7th edition criteria for colon cancer: do the complex modifications improve prognostic assessment? J Am Coll Surg. 2013;217(2):181–190. doi: 10.1016/j.jamcollsurg.2013.04.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. He C, Zhang Y, Cai Z, Lin X, Li S. Overall survival and cancer-specific survival in patients with surgically resected pancreatic head adenocarcinoma: a competing risk nomogram analysis. J Cancer. 2018;9(17):3156–3167. doi: 10.7150/jca.25494. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Hultcrantz R. Aspects of colorectal cancer screening, methods, age and gender. J Intern Med. 2021;289(4):493–507. doi: 10.1111/joim.13171. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Jandova J, Xu W, Nfonsam V. Sporadic early-onset colon cancer expresses unique molecular features. J Surg Res. 2016;204(1):251–260. doi: 10.1016/j.jss.2016.04.068. [DOI] [PubMed] [Google Scholar]
  18. Kuai L, Zhang Y, Luo Y, Li W, Li XD, Zhang HP, Liu TY, Yin SY, Li B. Prognostic nomogram for liver metastatic colon cancer based on histological type, tumor differentiation, and tumor deposit: a TRIPOD compliant large-scale survival study. Front Oncol. 2021;11:604882. doi: 10.3389/fonc.2021.604882. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Liang L, Zeng JH, Qin XG, Chen JQ, Luo DZ, Chen G. Distinguishable prognostic signatures of left- and right-sided colon cancer: a study based on sequencing data. Cell Physiol Biochem. 2018;48(2):475–490. doi: 10.1159/000491778. [DOI] [PubMed] [Google Scholar]
  20. Liu J, Geng Q, Liu Z, Chen S, Guo J, Kong P, Chen Y, Li W, Zhou Z, Sun X, Zhan Y, Xu D. Development and external validation of a prognostic nomogram for gastric cancer using the national cancer registry. Oncotarget. 2016;7(24):35853–35864. doi: 10.18632/oncotarget.8221. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Mauri G, Sartore-Bianchi A, Russo AG, Marsoni S, Bardelli A, Siena S. Early-onset colorectal cancer in young individuals. Mol Oncol. 2019;13(2):109–131. doi: 10.1002/1878-0261.12417. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Meguid RA, Slidell MB, Wolfgang CL, Chang DC, Ahuja N. Is there a difference in survival between right- versus left-sided colon cancers? Ann Surg Oncol. 2008;15(9):2388–2394. doi: 10.1245/s10434-008-0015-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Nakamura Y, Hokuto D, Koyama F, Matsuo Y, Nomi T, Yoshikawa T, Kamitani N, Sadamitsu T, Takei T, Matsumoto Y, Iwasa Y, Fukuoka K, Obara S, Nakamoto T, Kuge H, Sho M. The prognosis and recurrence pattern of right- and left-sided colon cancer in stage II, stage III, and liver metastasis after curative resection. Ann Coloproctol. 2021;37(5):326–336. doi: 10.3393/ac.2020.09.14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Papagiorgis PC, Zizi AE, Tseleni S, Oikonomakis IN, Nikiteas NI. The pattern of epidermal growth factor receptor variation with disease progression and aggressiveness in colorectal cancer depends on tumor location. Oncol Lett. 2012;3(5):1129–1135. doi: 10.3892/ol.2012.621. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Pei JP, Zhang CD, Liang Y, Zhang C, Wu KZ, Zhao ZM, Dai DQ. Novel nomograms individually predicting overall survival of non-metastatic colon cancer patients. Front Oncol. 2020;10:733. doi: 10.3389/fonc.2020.00733. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Pilleron S, Maringe C, Charvat H, Atkinson J, Morris EJA, Sarfati D. The impact of timely cancer diagnosis on age disparities in colon cancer survival. J Geriatr Oncol. 2021;12(7):1044–1051. doi: 10.1016/j.jgo.2021.04.003. [DOI] [PubMed] [Google Scholar]
  27. Shavers VL, Brown ML. Racial and ethnic disparities in the receipt of cancer treatment. J Natl Cancer Inst. 2002;94(5):334–357. doi: 10.1093/jnci/94.5.334. [DOI] [PubMed] [Google Scholar]
  28. Siegel RL, Ward EM, Jemal A. Trends in colorectal cancer incidence rates in the United States by tumor location and stage, 1992–2008. Cancer Epidemiol Biomark Prev. 2012;21(3):411–416. doi: 10.1158/1055-9965.EPI-11-1020. [DOI] [PubMed] [Google Scholar]
  29. Siegel RL, Miller KD, Goding Sauer A, Fedewa SA, Butterly LF, Anderson JC, Cercek A, Smith RA, Jemal A. Colorectal cancer statistics, 2020. CA Cancer J Clin. 2020;70(3):145–164. doi: 10.3322/caac.21601. [DOI] [PubMed] [Google Scholar]
  30. Song W, Miao DL, Chen L. Nomogram for predicting survival in patients with pancreatic cancer. Onco Targets Ther. 2018;11:539–545. doi: 10.2147/OTT.S154599. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Song W, Lv CG, Miao DL, Zhu ZG, Wu Q, Wang YG, Chen L. Development and validation of a nomogram for predicting survival in patients with gastrointestinal stromal tumours. Eur J Surg Oncol. 2018;44(10):1657–1665. doi: 10.1016/j.ejso.2018.07.004. [DOI] [PubMed] [Google Scholar]
  32. Sun BL. Current microsatellite instability testing in management of colorectal cancer. Clin Colorectal Cancer. 2021;20(1):e12–e20. doi: 10.1016/j.clcc.2020.08.001. [DOI] [PubMed] [Google Scholar]
  33. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, Bray F. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2021;71(3):209–249. doi: 10.3322/caac.21660. [DOI] [PubMed] [Google Scholar]
  34. Tai Q, Xue W, Li M, Zhuo S, Zhang H, Fang F, Zhang J. Survival nomogram for metastasis colon cancer patients based on SEER database. Front Genet. 2022;13:832060. doi: 10.3389/fgene.2022.832060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Tanaka LF, Figueroa SH, Popova V, Klug SJ, Buttmann-Schweiger N. The rising incidence of early-onset colorectal cancer. Dtsch Arztebl Int. 2023;120(Forthcoming):59–64. doi: 10.3238/arztebl.m2022.0368. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Wang J, Sun Y, Bertagnolli MM. Comparison of gastric cancer survival between Caucasian and Asian patients treated in the United States: results from the Surveillance Epidemiology and End Results (SEER) database. Ann Surg Oncol. 2015;22(9):2965–2971. doi: 10.1245/s10434-015-4388-4. [DOI] [PubMed] [Google Scholar]
  37. Wang J, Li S, Liu Y, Zhang C, Li H, Lai B. Metastatic patterns and survival outcomes in patients with stage IV colon cancer: a population-based analysis. Cancer Med. 2020;9(1):361–373. doi: 10.1002/cam4.2673. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Zaborowski AM, Abdile A, Adamina M, Aigner F, d'Allens L, Allmer C, Álvarez A, Anula R, Andric M, Atallah S, Bach S, Bala M, Barussaud M, Bausys A, Bebington B, Beggs A, Bellolio F, Bennett MR, Berdinskikh A, Bevan V, Biondo S, Bislenghi G, Bludau M, Boutall A, Brouwer N, Brown C, Bruns C, Buchanan DD, Buchwald P, Burger JWA, Burlov N, Campanelli M, Capdepont M, Carvello M, Chew HH, Christoforidis D, Clark D, Climent M, Cologne KG, Contreras T, Croner R, Daniels IR, Dapri G, Davies J, Delrio P, Denost Q, Deutsch M, Dias A, D'Hoore A, Drozdov E, Duek D, Dunlop M, Dziki A, Edmundson A, Efetov S, El-Hussuna A, Elliot B, Emile S, Espin E, Evans M, Faes S, Faiz O, Fleming F, Foppa C, Fowler G, Frasson M, Figueiredo N, Forgan T, Frizelle F, Gadaev S, Gellona J, Glyn T, Gong J, Goran B, Greenwood E, Guren MG, Guillon S, Gutlic I, Hahnloser D, Hampel H, Hanly A, Hasegawa H, Iversen LH, Hill A, Hill J, Hoch J, Hoffmeister M, Hompes R, Hurtado L, Iaquinandi F, Imbrasaite U, Islam R, Jafari MD, Kanemitsu Y, Karachun A, Karimuddin AA, Keller DS, Kelly J, Kennelly R, Khrykov G, Kocian P, Koh C, Kok N, Knight KA, Knol J, Kontovounisios C, Korner H, Krivokapic Z, Kronberger I, Kroon HM, Kryzauskas M, Kural S, Kusters M, Lakkis Z, Lankov T, Larson D, Lázár G, Lee KY, Lee SH, Lefèvre JH, Lepisto A, Lieu C, Loi L, Lynch C, Maillou-Martinaud H, Maroli A, Martin S, Martling A, Matzel KE, Mayol J, McDermott F, Meurette G, Millan M, Mitteregger M, Moiseenko A, Monson JRT, Morarasu S, Moritani K, Möslein G, Munini M, Nahas C, Nahas S, Negoi I, Novikova A, Ocares M, Okabayashi K, Olkina A, Oñate-Ocaña L, Otero J, Ozen C, Pace U, São Julião GP, Panaiotti L, Panis Y, Papamichael D, Park J, Patel S, Patrón Uriburu JC, Pera M, Perez RO, Petrov A, Pfeffer F, Phang PT, Poskus T, Pringle H, Proud D, Raguz I, Rama N, Rasheed S, Raval MJ, Rega D, Reissfelder C, Reyes Meneses JC, Ris F, Riss S, Rodriguez-Zentner H, Roxburgh CS, Saklani A, Salido AJ, Sammour T, Saraste D, Schneider M, Seishima R, Sekulic A, Seppala T, Sheahan K, Shine R, Shlomina A, Sica GS, Singnomklao T, Siragusa L, Smart N, Solis A, Spinelli A, Staiger RD, Stamos MJ, Steele S, Sunderland M, Tan KK, Tanis PJ, Tekkis P, Teklay B, Tengku S, Jiménez-Toscano M, Tsarkov P, Turina M, Ulrich A, Vailati BB, van Harten M, Verhoef C, Warrier S, Wexner S, de Wilt H, Weinberg BA, Wells C, Wolthuis A, Xynos E, You N, Zakharenko A, Zeballos J, Winter DC. Characteristics of early-onset vs late-onset colorectal cancer: a review. JAMA Surg. 2021;156(9):865–874. doi: 10.1001/jamasurg.2021.2380. [DOI] [PubMed] [Google Scholar]
  39. Zeng C, Wen W, Morgans AK, Pao W, Shu XO, Zheng W. Disparities by race, age, and sex in the improvement of survival for major cancers: results from the national cancer institute surveillance, epidemiology, and end results (SEER) program in the United States, 1990 to 2010. JAMA Oncol. 2015;1(1):88–96. doi: 10.1001/jamaoncol.2014.161. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The dataset supporting the conclusions of this paper is included in the material of this paper.


Articles from Journal of Cancer Research and Clinical Oncology are provided here courtesy of Springer

RESOURCES