Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2024 Oct 29;19(10):e0311540. doi: 10.1371/journal.pone.0311540

Using a Bayesian analytic approach to identify county-level ecological factors associated with survival among individuals with early-onset colorectal cancer

Sunny Siddique 1,*, Laura V M Baum 2, Nicole C Deziel 3, Jill R Kelly 3, Joshua L Warren 4,, Xiaomei Ma 1,
Editor: Zhuo Chen5
PMCID: PMC11521299  PMID: 39471191

Abstract

Background

In the United States (US), incidence of early age of onset colorectal cancer (EOCRC, diagnosed <50 years of age) has been increasing. Using a Bayesian analytic approach, we evaluated the association between county-level ecological factors and survival among individuals with EOCRC and identified hotspot and coldspot counties with unexplained low and high survival, respectively.

Methods

Principal component (PC) analysis was used to reduce dimensionality of 36 county-level social, behavioral, and preventive factors from the Centers for Disease Control and Prevention data. Survival information was derived from the Surveillance, Epidemiology, and End Results Program data from January 1, 2000 to December 31, 2019. The association between the identified PCs and survival was evaluated using multivariable spatial generalized linear mixed models. Counties with residual low and high survival (i.e., unexplained by the PCs) were classified as hotspots and coldspots, respectively.

Results

Four PCs were used to explain the spatial variability in 5-year survival among 75,215 individuals with EOCRC: PC1) poverty, chronic disease, health risk behaviors (β = -0.03, 95% credible interval (CrI): -0.04, -0.03); PC2) younger age, chronic disease-free, minority status (β = -0.01, 95% CrI: -0.02, 0.00); PC3) urban environment, preventive services (β = 0.02, 95% CrI: 0.00, 0.03); and PC4) older age (-0.04, 95% CrI: -0.06, -0.02). Among individuals with distant malignancies, the residual spatial variability remained high for two US counties: 1) Salt Lake County, UT residents experiencing 26.5% (95% CrI: 1.5%, 47.8%) lower odds of survival [hotspot], and 2) Riverside County, CA residents experiencing 37% (95% CrI: 7.97%, 78.8%) higher odds survival [coldspot] after adjustment for county-level factors.

Conclusions

County-level ecological factors are strongly associated with survival among individuals with EOCRC. Yet there is some evidence of survival disparities among individuals with distant malignancies that remain unexplained by the included factors.

Introduction

In the United States (US), incidence of early age of onset colorectal cancer (EOCRC), defined as colorectal cancer (CRC) diagnosed among individuals <50 years of age, has increased approximately 2% per year since 2011 [1]. Compared to later-onset CRC, EOCRC is characterized by distinct genetic, molecular, and clinical characteristics and survival trajectories. For example, early-onset tumors show a higher prevalence of microsatellite instability, a lower prevalence of BRAF mutations, and more often present in women [2]. EOCRC patients are more likely to present with hematochezia and abdominal pain, in part because of the predominance of left-sided tumors [3, 4]. They are more often diagnosed with advanced disease and experience longer time to diagnosis [3]. Compared to older individuals, individuals with EOCRC receive more aggressive treatment, including surgery, multiagent chemotherapy, and adjuvant or neoadjuvant chemotherapy and/or radiotherapy [59]. However, those with EOCRC also experience a survival benefit upon adjusting for stage at diagnosis, compared to those diagnosed with CRC at later ages [10].

In CRC broadly, disparities in survival are observed based on race, ethnicity, sex, and geography [1]. However, due to the distinct etiology, clinical characteristics, and survival trajectories of individuals with EOCRC, it is important to assess their survival disparities separately. Previously, a small number of studies have examined EOCRC survival disparities and reported that non-Hispanic (NH) Black men and individuals living in Southern states experience the lowest survival [1114]. A limited set of county-level factors have been assessed, and prevalence of smoking was found to be a strong predictor of disparities in survival [11]. Factors such as county-level prevalence of smoking may be highly correlated with previously unexplored social and structural factors (i.e., housing, health care, and screening access) with complex implications on survival among individuals with EOCRC. Furthermore, county-level social and structural factors are often spatially correlated due to the tendency of geographically proximal units to share similar characteristics [15]. These social and structural predictors of survival and the spatial correlation among them have been unaccounted for in prior studies, potentially impacting the findings (e.g., confidence intervals may be too narrow).

In this study, we evaluated the association between county-level social, behavioral, and preventive factors and survival among individuals with EOCRC, stratified by stage at diagnosis. We also identified hotspot and coldspot counties where the survival disparities remained unexplained by the county-level factors that were assessed. By including a comprehensive set of county-level social, behavioral, and preventive factors in our analysis and by utilizing a hierarchical Bayesian spatial analytic approach, we accounted for correlation in spatial data that was unaccounted for in prior studies.

Methods

Study population

The study population included all individuals who were diagnosed with colorectal cancer (ICD-O-3 Sties: C180-189, C260, C199, C209) under 50 years of age between January 1, 2000 and December 31, 2019 in the 17 population-based registries covered by the National Cancer Institute’s Surveillance, Epidemiology, and End Results (SEER) Program (November 2021 submission) [16]. SEER covers nearly one-third of the US population and includes data on the number of individuals with EOCRC who were alive 5 years after diagnosis in each county [16, 17]. Since EOCRC mortality is reported to have increased approximately 1% per year since the mid-2000s, we chose a study period from the year 2000 to 2019 to capture this shift in the epidemiology of EOCRC [18]. We included only the first primary cancer and removed recurrent malignancies (n = 2,102). Residents of Alaska, Hawaii, and other island counties (n = 220) were excluded because our spatial modeling required all counties to have non-missing covariates and at least one “neighbor” (i.e., bordering county). Remote counties in Alaska had missing covariates, while Hawaii and other islands had no bordering counties.

Predictors of survival

To identify county-level predictors of survival, aggregated 5-year survival data from SEER were linked with two publicly available datasets using county FIPS codes: 1) 2020 Social Vulnerability Index (SVI) data from the Centers for Disease Control and Prevention’s (CDC) Agency for Toxic Substances and Disease Registry (ATSDR), 2) CDC’s 2020 PLACES data [1921]. The CDC/ATSDR SVI database and mapping tool was created to assist state, local, and tribal disaster management officials in identifying the locations of their most socially vulnerable populations [19, 22]. Previously utilized in disparity focused studies assessing CRC outcomes, the CDC/ATSDR SVI database is comprised of 16 census variables, representing four domains: socioeconomic status, household composition, racial and ethnic minority status, and housing and transportation [23, 24]. The CDC’s PLACES database uses small area estimation methods to obtain 29 chronic disease measures for the entire US at the county level and has been utilized to assess CRC prevention [25, 26]. Drawing upon Behavioral Risk Factor Surveillance System’s (BRFSS) annual telephone surveys, the PLACES database also includes factors related to health outcomes, prevention, health risk behaviors, and health status [20]. Factors such as prevalence of older individuals, as well as chronic conditions and preventive initiatives among older individuals, were included in our study because they correlate with cancer survival and serve as markers of the social and built environment in which young individuals reside [2729]. For example, in a county with a high prevalence of older adults receiving CRC screening, it may be challenging to allocate resources to screen younger individuals.

To address the collinearity in the 36 predictor variables that were ascertained from the CDC/ATSDR SVI and PLACES databases (S1 Table), we conducted principal component analysis (PCA). PCA reduces dimensionality in large datasets and allows us to construct a smaller number of variables, called principal components (PCs) that are linear combinations or mixtures of the original variables [30]. Scree plots were used to evaluate the number of PCs to be included in the final model. The optimal number of PCs were chosen based on this plot, interpretability, and Kaiser’s rule (i.e., dropping the components with eigenvalues <1) [31].

Statistical modeling

Upon identification, PCs were included as predictors in a series of multivariable spatial generalized linear mixed models for areal unit data, with inference carried out in a Bayesian setting using a Markov Chain Monte Carlo sampling algorithm (S1 Appendix) [32, 33]. The response variable was the number of individuals with EOCRC in a county who were alive 5 years after diagnosis, accounting for the total number of individuals diagnosed with EOCRC in the county. This modeling approach ensures that counties with higher number of individuals with EOCRC are weighed more heavily but also improves the estimation of survival in counties with smaller number of cancer cases by taking into consideration the disease burden of neighboring counties. Convergence of the model was assessed using trace plots for individual parameters and analyzing the potential scale reduction factor results, along with the Geweke diagnostic (S1 Appendix) [34, 35].

Hotspot and Coldspot identification

We extracted posterior samples from the spatial random effect parameters and calculated the posterior means and 95% quantile-based equal-tailed credible intervals (CrI) as summaries for each county. These random effects represent the residual (i.e., after adjusting for the PCs) risk of the outcome and can be useful in understanding spatial patterns that have not been accounted for in the model and/or which may be currently unknown. Counties with negative random effect values (upper 95% CrIs < 0) were considered hotspots, which had generally lower survival among individuals with EOCRC that could not be explained by the PCs. Inversely, counties with positive random effect values (lower 95% CrIs > 0) were considered coldspots (i.e., areas that had generally higher survival among individuals with EOCRC that could not be explained by the PCs) [36, 37].

All spatial models were conducted using the CARBayes package in R Statistical Software (v4.2.4; R Core Team 2022) [32, 38]. Data used in this study were accessed on July 15, 2024. This study was deemed non-human subjects research by the Institutional Review Board of Yale University (Protocol ID: 2000035862) because the data used are publicly available and non-identifiable. Therefore, informed consent was not required.

Results

We evaluated the association between 36 social, behavioral, and preventive factors and survival among individuals with EOCRC (Table 1). These 36 factors clustered into four PCs, explaining 67.7% of the variance in the original data. Interpretation of the resulting factors was determined based on variables with factor loadings ≥ |0.20|. Trace plots, scale reduction factors, and the Geweke diagnostics showed no obvious signs that the model failed to converge.

Table 1. Rotated factor patterns for first four factors from principal component analysis of county-level prevalence of social, behavioral, chronic disease, and preventive factors.

Principal Component
1 2 3 4
Poverty, chronic disease, health risk behaviors Young, chronic disease-free, minority status Urban environment and preventive services Older age
Eigenvalue 14.48 5.14 2.86 1.87
Percent Variance Explained 40.20% 14.30% 8.00% 5.19%
Cumulative Variance 40.20% 54.50% 62.50% 67.70%
Regression Coefficient (95% Crl)
    Overall -0.03 (-0.04, -0.03) -0.01 (-0.02, 0.00) 0.02 (0.00, 0.03) -0.04 (-0.06, -0.02)
    In-Situ -0.15 (-0.23, -0.06) 0.06 (-0.04, 0.15) 0.01 (-0.17, 0.17) -0.16 (-0.36, 0.04)
    Localized -0.07 (-0.09, -0.05) 0.01 (-0.02, 0.03) 0.04 (-0.01, 0.09) -0.05 (-0.11, -0.01)
    Regional -0.05 (-0.06, -0.04) -0.01 (-0.03, 0.00) 0.01 (-0.04, 0.02) -0.02 (-0.05, 0.01)
    Distant -0.04 (-0.06, -0.03) 0.01 (-0.02, 0.03) 0.01 (-0.04, 0.04) -0.01 (-0.05, 0.04)
Sociodemographic Factors
    % Below 150% of the poverty line 0.24 0.01 0.03 0.15
    % Unemployment 0.16 0.03 0.17 0.14
    % Housing burdened 0.04 0.13 0.36 0.27
    % No HS diploma 0.21 0.12 -0.12 0.00
    % Uninsured 0.13 0.18 -0.18 -0.03
    % >65 years of age -0.03 -0.22 -0.20 0.38
    % <17 years of age 0.04 0.21 -0.09 -0.47
    % Disabled 0.16 -0.21 -0.09 0.21
    % Single parent households 0.14 0.17 0.22 -0.15
    % with limited English 0.03 0.34 -0.08 0.04
    % Minority 0.12 0.34 0.11 0.04
    % Multiunit homes -0.09 0.18 0.36 0.10
    % Mobile homes 0.17 -0.07 -0.18 0.08
    % Living in crowded housing 0.09 0.28 -0.07 0.04
    % with no vehicle 0.11 0.05 0.30 0.19
    % Living in group quarters 0.04 0.02 0.07 0.22
Chronic Diseases
    % Arthritis 0.19 -0.24 0.11 -0.11
    % Cancer -0.10 -0.35 0.03 -0.07
    % Kidney disease 0.25 0.06 0.02 0.07
    % COPD 0.22 -0.20 -0.01 -0.02
    % Coronary heart disease 0.25 -0.09 -0.06 0.00
    % Asthma 0.16 -0.17 0.18 0.05
    % Depression 0.12 -0.25 0.01 -0.11
    % Diabetes 0.25 0.10 0.03 -0.03
    % Obesity 0.20 -0.06 -0.03 -0.21
    % Stroke 0.25 -0.02 0.06 0.04
Adverse Health Behaviors
    % Binge drinking -0.16 -0.02 -0.17 0.12
    % Smoking 0.21 -0.20 -0.06 -0.04
    % With no leisure time physical activity 0.24 0.00 -0.04 -0.10
    % Not sleeping 7 hours or more 0.21 -0.07 0.14 -0.09
Preventative Services
    % Aged ≥ 18 years ranking their health poorly 0.26 0.03 -0.02 -0.01
    % Men aged ≥ 65 years receiving preventive services -0.15 -0.11 0.20 -0.19
    % Women aged ≥ 65 years receiving preventive services -0.16 -0.11 0.20 -0.17
    % Aged ≥ 18 receiving routine checkup 0.11 -0.12 0.34 -0.23
    % Ever had colonoscopy (ages 50–75) -0.15 -0.08 0.27 -0.11
    % Using fecal occult blood test (ages 50–75) -0.06 -0.06 0.19 0.31

Boldface indicates factor loadings ≥ |0.20|

a 95% CrI = 95% Credible Interval

Patterns in county-level attributes

The four PCs were dominated by: 1) PC1: poverty, chronic disease, and health risk behaviors; 2) PC2: younger age, chronic disease-free, and minority status; 3) PC3: urban environment and preventive services; and 4) PC4: older age (Table 1).

PC1: Poverty, chronic disease, health risk behaviors

PC1 included county-level prevalence of extreme poverty, lack of high school education, as well as prevalence of several chronic conditions, including obesity, coronary heart disease, stroke, and diabetes (factor loadings ≥ 0.20). Additionally, prevalence of several health risk behaviors such as smoking, lack of leisure time physical activity and lack of sleeping 7 hours or more loaded heavily on this PC. Furthermore, prevalence of the adult population ranking their health poorly loaded heavily on this PC (factor loading = 0.26). Overall, this PC, containing markers of poverty, chronic disease, and health risk behaviors, showed significant negative association with survival (β = -0.03, 95% CrI: -0.04, -0.03). This PC was also negatively associated with 5-year survival when stratified by stage at diagnosis (Table 1).

PC2: Young, chronic disease-free, minority status

On PC2, county-level prevalence of individuals <17 years of age, limited English language proficiency, and minority status loaded positively (factor loadings ≥ 0.20). Prevalence of individuals >65 years of age, disability, having a history of arthritis, cancer, depression, other chronic diseases, and smoking, loaded negatively on this PC (factor loadings ≤ -0.20). This PC was not significantly associated with EOCRC survival (β = -0.01, 95% CrI: -0.02, 0.00).

PC3: Urban environment and preventive services

PC3 included factors related to urban living conditions and prevalence of preventive services utilized by individuals living in the county. Factors indicative of urban living conditions included: prevalence of multiunit homes, households that spend 30% or more of annual income on housing costs, and households with no vehicle. Furthermore, factors indicative of preventive services included: prevalence of the population receiving routine primary care checkups, prevalence of the eligible population receiving colonoscopy and fecal occult blood test or fecal immunochemical tests, and men and women up to date with preventive services such as flu vaccine, pneumococcal polysaccharide vaccine, and mammograms. These factors all loaded heavily on PC3 (factor loadings ≥ 0.20), and this PC had significant positive association with survival in both crude (β = 0.02, 95% CrI: 0.00, 0.03) and stage stratified analysis (Table 1).

PC4: Older age

Finally, PC4 included prevalence of factors indicative of a high prevalence of older adults: prevalence of individuals > 65 years of age, having disability, and living in group quarters. These factors showed significant negative association with 5-year survival (β = -0.04, 95% CrI: -0.06, -0.02). This PC was also negatively associated with survival upon stratifying by stage, though the association was not statistically significant (except for those with localized disease, β = -0.05, 95% CrI: -0.11, -0.01) (Table 1).

Overall, PC2 and PC4 contained indicators of positive and negative health outcomes, respectively. For example, high prevalence of population <17 years of age loaded positively on PC2 (factor loading = 0.21) and negatively on PC4 (factor loading = -0.47). Furthermore, PC2 contained low prevalence of chronic diseases such as cancer and arthritis (factor loadings < -0.20), whereas PC4 included factors indicative of an aging population, such as high prevalence of population aged 65 years or older and having a disability (factor loadings > 0.20).

Survival disparities

A total of 75,215 individuals were diagnosed with EOCRC between January 1, 2000 and December 31, 2019 in the following SEER covered states: California, Connecticut, Georgia, Iowa, Kentucky, Louisiana, New Jersey, New Mexico, Utah, and Washington. Among the 609 contiguous counties included in our analysis, county-level ecological factors explained some of the spatial variability in 5-year survival among individuals with in-situ, localized, and regional disease across all included counties. Among individuals with distant disease, we identified two counties where residual spatial variability remained high. Individuals with distant malignancies residing in Riverside County, CA [coldspot] had 37% higher odds of survival (95% CrI: 7.97%, 78.8%) after adjustment for county-level factors. On the other hand, individuals with distant malignancies residing in Salt Lake County, UT [hotspot] experienced 26.5% (95% CrI: 1.5%, 47.8%) lower odds of 5-year survival after adjustment for county-level factors.

Discussion

This study evaluated the association between county-level social, behavioral, and preventive factors and 5-year survival among individuals with EOCRC and identified 4 PCs that are associated with survival in all included US counties. Furthermore, evidence of unexplained survival disparities was observed among individuals with distant malignancies residing in two US counties. Analysis of these county-level factors is important in understanding how ecological factors contribute to potential survival disparities, elucidating regional patterns not captured by individual-level data, and informing state and national policy geared to reduce disparities and improve health for all.

Previously, disparity-focused studies have observed that individuals with EOCRC residing in Southern and Midwestern states, and rural counties, including the lower Mississippi delta and west central Appalachia, experience low survival following their diagnosis [11, 3943]. Much of this disparity has been attributable to well established factors. For example, one study identified prevalence of diabetes as the most important risk factor in predicting EOCRC outcomes [12]. Other studies have established prevalence of smoking as a strong contributor to EOCRC survival disparities [11, 43]. Findings from our study are novel, because we accounted for these factors and additionally identified a potential hotspot in the state of Utah and coldspot in the state of California. Survival patterns among individuals with distant malignancies residing in these two counties remain largely unexplained by our included ecological factors. Notably, Salt Lake County is the most populous county in Utah and Riverside County is the fourth most populous county in California. Therefore, future research is warranted to elucidate the individual-level clinical, demographic, and socioeconomic factors as well as the diverse healthcare landscapes in these counties that may be contributing to the survival disparities observed.

The PCs identified in this study showed strong correlations among county-level social, behavioral, and preventive factors whose disproportionate burden may drive survival disparities–offering a window into ways in which novel interventions may be developed to target complex, multidimensional risk factors. The first PC, accounting for over 40% of variance, included poverty, chronic diseases, and health risk behaviors and revealed several modifiable factors that may be targeted by health promotion initiatives. For example, lack of leisure time physical activity, lack of sleeping 7 hours or more, and smoking loaded heavily on this PC, along with prevalence of several chronic diseases, including diabetes and coronary heart disease. Interventions to reduce one or more of these risk behaviors and management of chronic conditions may ultimately improve clinical outcomes of individuals with EOCRC and improve survival.

This study has several limitations. To protect individual identity and privacy, geographic information from SEER was aggregated and made publicly available at the county level. Therefore, the survival disparities that we observed may be attributable to individual level factors, such as proximity to cancer care and insurance status [44]. Although several prior studies have established county-level factors as adequate measures to identify geographic disparities, our study may be prone to the modifiable areal unit problem–a statistical biasing effect that occurs when observed associations between explanatory factors and outcomes vary based on the scale of the chosen geographic unit [11, 12, 45, 46]. Therefore, conducting this analysis using smaller geographic units, such as census tracts, may reveal further insights into specific areas within counties that are experiencing survival disparities. Next, as with all self-reported sample surveys, the covariate measures utilized in this study might be subject to systematic error resulting from noncoverage, nonresponse, or measurement bias due to small population size of counties. The CDC uses a multilevel regression and poststratification (MRP) approach that links geocoded health surveys and high spatial resolution demographic and socioeconomic data. CDC’s internal and external validation studies confirm the strong consistency in the MRP approach to estimating county-level prevalence of the covariates utilized in this study [1921]. Furthermore, the potential for residential mobility has not been considered in this study. During the time period included in this study (from January 1, 2000 to December 31, 2019), individuals with EOCRC may have changed counties and their residential environment may have improved or declined [4749]. Changing the residential environment may have a separate effect on health outcomes, possibly due to negative factors, such as stress related to moving and disruption of established social connections or behavioral routines, and positive factors such as moving to a more walkable neighborhood [49, 50]. Lastly, since our study covered nearly two decades of data, changes in the overall healthcare landscape that may have impacted healthcare access among individuals with EOCRC, such as the implementation of the Affordable Care Act, are not accounted for in our study.

Conclusion

In the majority of SEER covered counties in the US, county-level social, behavioral, and preventive factors are associated with 5-year survival among individuals with EOCRC and help explain some of the spatial variability in survival, after accounting for stage at diagnosis. Yet in two US counties, the residual spatial variability remained high among individuals with distant EOCRC, likely due to their large heterogenous populations and differences in individual level clinical, demographic, and socioeconomic characteristics. The rising burden of EOCRC warrants the incorporation of ecological data in future studies, as ecological factors encapsulate a person’s multiple interactions with the social and physical environment that can result in lasting health impact. Results from this study may be utilized to help identify geographic areas with lower than expected survival among individuals with EOCRC, and design community-level interventions to address survival disparities. A similar approach may also be applicable to the study of other chronic diseases.

Supporting information

S1 Appendix. Supplementary methods.

Descriptions of spatial modeling approach.

(DOCX)

pone.0311540.s001.docx (23.4KB, docx)
S1 Table. Detailed descriptions of data source and covariates used to identify hotspots and coldspots.

(DOCX)

pone.0311540.s002.docx (17.7KB, docx)

Data Availability

The data underlying the results presented in the study are available from the National Cancer Insitute Surveillance, Epidemiology, and End Results Program (https://seer.cancer.gov/data-software/documentation/seerstat/nov2021/).

Funding Statement

Sunny Siddique (SS) was supported by Clinical and Translational Science Awards Grant Number UL1 TR001863 from the National Center for Advancing Translational Science, a component of the National Institutes of Health (NIH). SS was also supported by the National Cancer Institute of the NIH under the Ruth L. Kirschstein National Research Service Award Individual Predoctoral Fellowship (award number 1F31CA281335-01). Contents of this manuscript are solely the responsibility of the authors and do not necessarily represent the official view of NIH. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Siegel RL, Wagle NS, Cercek A, Smith RA, Jemal A. Colorectal cancer statistics, 2023. CA Cancer J Clin. 2023;73(3):233–54. doi: 10.3322/caac.21772 [DOI] [PubMed] [Google Scholar]
  • 2.Ugai T, Haruki K, Harrison TA, Cao Y, Qu C, Chan AT, et al. Molecular Characteristics of Early-Onset Colorectal Cancer According to Detailed Anatomical Locations: Comparison With Later-Onset Cases. Am J Gastroenterol. 2023;118(4):712–26. doi: 10.14309/ajg.0000000000002171 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Chen FW, Sundaram V, Chew TA, Ladabaum U. Advanced-Stage Colorectal Cancer in Persons Younger Than 50 Years Not Associated With Longer Duration of Symptoms or Time to Diagnosis. Clin Gastroenterol Hepatol. 2017;15(5):728–37 e3. doi: 10.1016/j.cgh.2016.10.038 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Cercek A, Chatila WK, Yaeger R, Walch H, Fernandes GDS, Krishnan A, et al. A Comprehensive Comparison of Early-Onset and Average-Onset Colorectal Cancers. J Natl Cancer Inst. 2021;113(12):1683–92. doi: 10.1093/jnci/djab124 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Abraham A, Habermann EB, Rothenberger DA, Kwaan M, Weinberg AD, Parsons HM, et al. Adjuvant chemotherapy for stage III colon cancer in the oldest old: results beyond clinical guidelines. Cancer. 2013;119(2):395–403. doi: 10.1002/cncr.27755 [DOI] [PubMed] [Google Scholar]
  • 6.Murphy CC, Harlan LC, Lund JL, Lynch CF, Geiger AM. Patterns of Colorectal Cancer Care in the United States: 1990–2010. J Natl Cancer Inst. 2015;107(10). doi: 10.1093/jnci/djv198 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Kneuertz PJ, Chang GJ, Hu CY, Rodriguez-Bigas MA, Eng C, Vilar E, et al. Overtreatment of young adults with colon cancer: more intense treatments with unmatched survival gains. JAMA Surg. 2015;150(5):402–9. doi: 10.1001/jamasurg.2014.3572 [DOI] [PubMed] [Google Scholar]
  • 8.Fontana E, Meyers J, Sobrero A, Iveson T, Shields AF, Taieb J, et al. Early-Onset Colorectal Adenocarcinoma in the IDEA Database: Treatment Adherence, Toxicities, and Outcomes With 3 and 6 Months of Adjuvant Fluoropyrimidine and Oxaliplatin. J Clin Oncol. 2021;39(36):4009–19. doi: 10.1200/JCO.21.02008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Lumish MA, Cercek A. Practical Considerations in Diagnosing and Managing Early-Onset GI Cancers. J Clin Oncol. 2022;40(24):2662–80. doi: 10.1200/JCO.21.02708 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Cheng E, Blackburn HN, Ng K, Spiegelman D, Irwin ML, Ma X, et al. Analysis of Survival Among Adults With Early-Onset Colorectal Cancer in the National Cancer Database. JAMA Netw Open. 2021;4(6):e2112539. doi: 10.1001/jamanetworkopen.2021.12539 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Rogers CR, Moore JX, Qeadan F, Gu LY, Huntington MS, Holowatyj AN. Examining factors underlying geographic disparities in early-onset colorectal cancer survival among men in the United States. Am J Cancer Res. 2020;10(5):1592–607. [PMC free article] [PubMed] [Google Scholar]
  • 12.Dong W, Kim U, Rose J, Hoehn RS, Kucmanic M, Eom K, et al. Geographic Variation and Risk Factor Association of Early Versus Late Onset Colorectal Cancer. Cancers (Basel). 2023;15(4). doi: 10.3390/cancers15041006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Zaki TA, Liang PS, May FP, Murphy CC. Racial and Ethnic Disparities in Early-Onset Colorectal Cancer Survival. Clin Gastroenterol Hepatol. 2023;21(2):497–506 e3. doi: 10.1016/j.cgh.2022.05.035 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Abualkhair WH, Zhou M, Ochoa CO, Lacayo L, Murphy C, Wu XC, et al. Geographic and intra-racial disparities in early-onset colorectal cancer in the SEER 18 registries of the United States. Cancer Med. 2020;9(23):9150–9. doi: 10.1002/cam4.3488 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Jackson MC, Huang L, Xie Q, Tiwari RC. A modified version of Moran’s I. Int J Health Geogr. 2010;9:33. doi: 10.1186/1476-072X-9-33 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Surveillance E, and End Results (SEER) Program (www.seer.cancer.gov). SEER*Stat Database: Incidence—SEER Research Data, 17 Registries, Nov 2021 Sub (2000–2019)—Linked To County Attributes—Time Dependent (1990–2019) Income/Rurality, 1969–2020 Counties. 2023. [Google Scholar]
  • 17.National Cancer Institute Surveillance E, and End Results Program. Survival Calculation [cited 2024 April 18]. Available from: https://seer.cancer.gov/help/seerstat/survival-session/survival-parameters-tab/survival-calculation-1#:~:text=When%20creating%20a%20life%20table,somewhere%20within%20the%20current%20interval. [Google Scholar]
  • 18.Society AC. Colorectal Cancer Facts & Figures 2023–20252023. Available from: https://www.cancer.org/content/dam/cancer-org/research/cancer-facts-and-statistics/colorectal-cancer-facts-and-figures/colorectal-cancer-facts-and-figures-2023.pdf. [Google Scholar]
  • 19.Centers for Disease Control and Prevention. CDC/ATSDR Social Vulnerability Index. 2022. [Google Scholar]
  • 20.Centers for Disease Control and Prevention. PLACES: Local Data for Better Health [Web]. 2022. [updated March 31, 2022; cited 2022 April 3]. Available from: https://www.cdc.gov/places. [Google Scholar]
  • 21.Environmental Protection Agency. Air Data: Air Quality Data Collected at Outdoor Monitors Across the US [updated Nov. 2, 2021; cited 2002 April 4]. Available from: https://www.epa.gov/outdoor-air-quality-data. [Google Scholar]
  • 22.Flanagan BE, Hallisey EJ, Adams E, Lavery A. Measuring Community Vulnerability to Natural and Anthropogenic Hazards: The Centers for Disease Control and Prevention’s Social Vulnerability Index. J Environ Health. 2018;80(10):34–6. [PMC free article] [PubMed] [Google Scholar]
  • 23.Diaz A, Hyer JM, Barmash E, Azap R, Paredes AZ, Pawlik TM. County-level Social Vulnerability is Associated With Worse Surgical Outcomes Especially Among Minority Patients. Ann Surg. 2021;274(6):881–91. doi: 10.1097/SLA.0000000000004691 [DOI] [PubMed] [Google Scholar]
  • 24.Diaz A, Dalmacy D, Hyer JM, Tsilimigras D, Pawlik TM. Intersection of social vulnerability and residential diversity: Postoperative outcomes following resection of lung and colon cancer. J Surg Oncol. 2021;124(5):886–93. doi: 10.1002/jso.26588 [DOI] [PubMed] [Google Scholar]
  • 25.Sauer AG, Liu B, Siegel RL, Jemal A, Fedewa SA. Comparing cancer screening estimates: Behavioral Risk Factor Surveillance System and National Health Interview Survey. Prev Med. 2018;106:94–100. doi: 10.1016/j.ypmed.2017.10.019 [DOI] [PubMed] [Google Scholar]
  • 26.Charkhchi P, Schabath MB, Carlos RC. Breast, Cervical, and Colorectal Cancer Screening Adherence: Effect of Low Body Mass Index in Women. J Womens Health (Larchmt). 2020;29(7):996–1006. doi: 10.1089/jwh.2019.7739 [DOI] [PubMed] [Google Scholar]
  • 27.Barberan Parraga C, Singh R, Lin R, Tamariz L, Palacio A. Colorectal Cancer Screening Disparities Among Race: A Zip Code Level Analysis. Clin Colorectal Cancer. 2023;22(2):183–9. doi: 10.1016/j.clcc.2023.01.001 [DOI] [PubMed] [Google Scholar]
  • 28.Tu H, Wen CP, Tsai SP, Chow WH, Wen C, Ye Y, et al. Cancer risk associated with chronic diseases and disease markers: prospective cohort study. BMJ. 2018;360:k134. doi: 10.1136/bmj.k134 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Fitzpatrick KM, Willis D. Chronic Disease, the Built Environment, and Unequal Health Risks in the 500 Largest U.S. Cities. Int J Environ Res Public Health. 2020;17(8). doi: 10.3390/ijerph17082961 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Groth D, Hartmann S, Klie S, Selbig J. Principal components analysis. Methods Mol Biol. 2013;930:527–47. doi: 10.1007/978-1-62703-059-5_22 [DOI] [PubMed] [Google Scholar]
  • 31.Jolliffe IT, Cadima J. Principal component analysis: a review and recent developments. Philos T R Soc A. 2016;374(2065). doi: 10.1098/rsta.2015.0202 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Lee D. CARBayes: An R Package for Bayesian Spatial Modeling with Conditional Autoregressive Priors. J Stat Softw. 2013;55(13):1–24. [Google Scholar]
  • 33.Leroux B, Lei X, Breslow N. Estimation of Disease Rates in Small Areas; A New Mixed Model for Spatial Dependence. Statistical Models in Epidemiology, the Environment and Clinical Trials. New York: Springer-Verlag; 2000. p. 179–91. [Google Scholar]
  • 34.Gelman A, Carlin J, Stern H, Rubin D. Bayesian Data Analysis. 2nd ed. London: Chapman and Hall; 2003. [Google Scholar]
  • 35.Geweke J. Evaluating the Accuracy of Sampling-Based Approaches to the Calculation of Posterior Moments. Oxford: Clarendon Press; 1992. 1992. [Google Scholar]
  • 36.Gunasekera KS, Zelner J, Becerra MC, Contreras C, Franke MF, Lecca L, et al. Children as sentinels of tuberculosis transmission: disease mapping of programmatic data. BMC Med. 2020;18(1):234. doi: 10.1186/s12916-020-01702-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Warren JL, Grandjean L, Moore DAJ, Lithgow A, Coronel J, Sheen P, et al. Investigating spillover of multidrug-resistant tuberculosis from a prison: a spatial and molecular epidemiological analysis. BMC Med. 2018;16(1):122. doi: 10.1186/s12916-018-1111-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Team RC. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2022. [Google Scholar]
  • 39.Society AC. Colorectal Cancer Facts & Figures 2020–2022. American Cancer Society. [Google Scholar]
  • 40.Monroy-Iglesias MJ, Dolly S, Sarker D, Thillai K, Van Hemelrijck M, Santaolalla A. Pancreatic Cancer Exposome Profile to Aid Early Detection and Inform Prevention Strategies. J Clin Med. 2021;10(8). doi: 10.3390/jcm10081665 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Soffian SSS, Nawi AM, Hod R, Chan HK, Hassan MRA. Area-Level Determinants in Colorectal Cancer Spatial Clustering Studies: A Systematic Review. Int J Environ Res Public Health. 2021;18(19). doi: 10.3390/ijerph181910486 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Holowatyj AN, Langston ME, Han Y, Viskochil R, Perea J, Cao Y, et al. Community Health Behaviors and Geographic Variation in Early-Onset Colorectal Cancer Survival Among Women. Clin Transl Gastroenterol. 2020;11(12):e00266. doi: 10.14309/ctg.0000000000000266 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Siegel RL, Sahar L, Robbins A, Jemal A. Where can colorectal cancer screening interventions have the most impact? Cancer Epidemiol Biomarkers Prev. 2015;24(8):1151–6. doi: 10.1158/1055-9965.EPI-15-0082 [DOI] [PubMed] [Google Scholar]
  • 44.Cardoso R, Guo F, Heisser T, De Schutter H, Van Damme N, Nilbert MC, et al. Overall and stage-specific survival of patients with screen-detected colorectal cancer in European countries: A population-based study in 9 countries. Lancet Reg Health Eur. 2022;21:100458. doi: 10.1016/j.lanepe.2022.100458 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Tuson M, Yap M, Kok MR, Boruff B, Murray K, Vickery A, et al. Overcoming inefficiencies arising due to the impact of the modifiable areal unit problem on single-aggregation disease maps. Int J Health Geogr. 2020;19(1):40. doi: 10.1186/s12942-020-00236-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Stone E. The Modifiable Areal Unit Problem: An Overview Web2017 [cited 2022 March 3]. Available from: https://www.metrans.org/news/the-modifiable-areal-unit-problem-an-overview. [Google Scholar]
  • 47.Hirsch JA, Moore KA, Barrientos-Gutierrez T, Brines SJ, Zagorski MA, Rodriguez DA, et al. Built environment change and change in BMI and waist circumference: Multi-ethnic Study of Atherosclerosis. Obesity (Silver Spring). 2014;22(11):2450–7. doi: 10.1002/oby.20873 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Wing JJ, August E, Adar SD, Dannenberg AL, Hajat A, Sanchez BN, et al. Change in Neighborhood Characteristics and Change in Coronary Artery Calcium: A Longitudinal Investigation in the MESA (Multi-Ethnic Study of Atherosclerosis) Cohort. Circulation. 2016;134(7):504–13. doi: 10.1161/CIRCULATIONAHA.115.020534 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Shvetsov YB, Shariff-Marco S, Yang J, Conroy SM, Canchola AJ, Albright CL, et al. Association of change in the neighborhood obesogenic environment with colorectal cancer risk: The Multiethnic Cohort Study. SSM Popul Health. 2020;10:100532. doi: 10.1016/j.ssmph.2019.100532 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Baum A, Wisnivesky J, Basu S, Siu AL, Schwartz MD. Association of Geographic Differences in Prevalence of Uncontrolled Chronic Conditions With Changes in Individuals’ Likelihood of Uncontrolled Chronic Conditions. JAMA. 2020;324(14):1429–38. doi: 10.1001/jama.2020.14381 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Appendix. Supplementary methods.

Descriptions of spatial modeling approach.

(DOCX)

pone.0311540.s001.docx (23.4KB, docx)
S1 Table. Detailed descriptions of data source and covariates used to identify hotspots and coldspots.

(DOCX)

pone.0311540.s002.docx (17.7KB, docx)

Data Availability Statement

The data underlying the results presented in the study are available from the National Cancer Insitute Surveillance, Epidemiology, and End Results Program (https://seer.cancer.gov/data-software/documentation/seerstat/nov2021/).


Articles from PLOS ONE are provided here courtesy of PLOS

RESOURCES