Background:
Epidemiologic evidence on the relationships between air pollution and the risks of primary cancers other than lung cancer remained largely lacking. We aimed to examine associations of 10-year exposures to fine particulate matter (PM2.5) and nitrogen dioxide (NO2) with risks of breast, prostate, colorectal, and endometrial cancers.
Methods:
For each cancer, we constructed a separate cohort among the national Medicare beneficiaries during 2000 to 2016. We simultaneously examined the additive associations of six exposures, namely, moving average exposures to PM2.5 and NO2 over the year of diagnosis and previous 2 years, previous 3 to 5 years, and previous 6 to 10 years, with the risk of first cancer diagnosis after 10 years of follow-up, during which there was no cancer diagnosis.
Results:
The cohorts included 2.2 to 6.5 million subjects for different cancers. Exposures to PM2.5 and NO2 were associated with increased risks of colorectal and prostate cancers but were not associated with endometrial cancer risk. NO2 was associated with a decreased risk of breast cancer, while the association for PM2.5 remained inconclusive. At exposure levels below the newly updated World Health Organization Air Quality Guideline, we observed substantially larger associations between most exposures and the risks of all cancers, which were translated to hundreds to thousands new cancer cases per year within the cohort per unit increase in each exposure.
Conclusions:
These findings suggested substantial cancer burden was associated with exposures to PM2.5 and NO2, emphasizing the urgent need for strategies to mitigate air pollution levels.
Keywords: Air pollution, Primary cancers, Causal modeling, Additive effects
What this study adds
Although air pollution has been identified as a carcinogen for lung cancer, the relation between air pollution and other primary cancers remained largely unquantified. We estimated the simultaneous associations of 10-year exposures to fine particulate matter (PM2.5) and nitrogen dioxide (NO2) with the risks of breast, prostate, colorectal, and endometrial cancers among the national Medicare cohort between 2000 and 2016. Our results showed that PM2.5 and NO2 exposures increased the risks of colorectal and prostate cancers. At low exposure levels, we observed substantially larger associations between most exposures and the risks of all cancers.
Introduction
Outdoor air pollution is a leading contributor to the global burden of disease.1 Outdoor air pollution is a complex mixture of particles and irritant gases such as organic aerosols, heavy metals, and nitrogen dioxide (NO2).2 Once inhaled, those pollutants can penetrate deeper into the lungs and are carried by the bloodstream throughout the body. A recent global review showed that chronic exposure to air pollution can affect almost every organ in the body, causing a variety of diseases and conditions.3
In 2013, the International Agency for Research on Cancer identified air pollution as a group 1 carcinogen for lung cancer.4 For other cancers, genotoxicity and molecular biomarker data from experimental studies demonstrated that air pollution is associated with altered gene expressions, inflammation, and oxidative stress in tissues other than the lung, indicating potential links between air pollution and the risk of developing various types of cancers.5–8 Epidemiologic evidence on air pollution and other cancers remained largely lacking but was growing substantially in recent years.9–12 Nevertheless, most existing studies had relatively short follow-up periods and evaluated exposures from one up to a few years before cancer diagnosis.13–16 Due to cancer’s long latency period, those studies might not be able to adequately identify critical windows of susceptibility or capture the full effect. Besides, previous studies were limited to small unrepresentative samples, which was further complicated by the heterogeneity in cancer.17–19 Additionally, previous studies had reported risks on a multiplicative scale, which obscured the actual size of the risk and are more difficult to interpret compared with the additive effect estimates.20 Further, the competing risk of death was pervasive but commonly overlooked, leading to potential overestimation of the effect size.21
We aimed to examine the simultaneous associations of 10-year exposures to fine particulate matter (PM2.5) and NO2, the two major primary pollutants emitted directly from fossil fuel and biomass combustion sources, on the risks of breast, prostate, colorectal, and endometrial cancers, for national Medicare beneficiaries ≥65 years of age with continuous enrollment in the Fee-for-Service (FFS) program and in the Medicare Parts A and B. To provide a better sense of the risk posed by the pollutants, we estimated additive associations and translated them to the numbers of new cancer cases per unit increase in each exposure, accounting for the competing risk of death.
Methods
Study population and design
We obtained the Medicare denominator file and the Medicare Chronic Conditions Warehouse (CCW) from the Centers for Medicare and Medicaid Services.22 The Medicare denominator file contained enrollment records and demographic information, including sex, race/ethnicity, age, Medicaid enrollment status (a marker of low socioeconomic status), ZIP code of residence, and date of death (if any), for each Medicare beneficiary. Age, Medicaid enrollment status, and ZIP code of residence were updated annually. The CCW contained the date of the first occurrence of cancer diagnosis for each cancer type across all available claims in the FFS and the Medicare Parts A (inpatient) and B (outpatient, physician service, home health, etc.). To reduce the chance of missing cancer diagnoses, we restricted the cohort to beneficiaries with continuous enrollment in FFS and in both Part A and Part B throughout the follow-up period. This was because if a beneficiary spent any time in the Medicare Advantage Plans (Part C), or only one of Parts A or B during the study period, there was the possibility that the cancer claims were not available for CCW.23
For each cancer, we constructed a separate cohort in which each beneficiary was followed since 2000 or entering the study. To allow for the assessment of the exposure effects over the 10 years before cancer diagnosis, we implemented an inclusion criterion that required a minimum follow-up period of 10 years after enrollment, during which no cancer diagnosis occurred. That is, we only included beneficiaries who were cancer-free for the initial 10 years of follow-up. By applying the 10-year cancer-free inclusion criterion, we also ensured that the first identified cancer diagnosis was an incidence case. If a beneficiary had been diagnosed with cancer before enrolling in Medicare, we believed they would have had a claim record for physician visit, treatment, hospitalization, etc. with cancer diagnosis within the first 10 years after study entry. After that, beneficiaries were censored at the first occurrence of cancer diagnosis, death, or the end of the study in 2016, whichever occurred earliest. We restricted the cohort to females for breast and endometrial cancers and males for prostate cancer.
This study was approved by the Institutional Review Boards at the Harvard T.H. Chan School of Public Health and Emory University.
Exposures
We predicted daily average outdoor PM2.5 and NO2 levels at 1-km2 grid cells across the contiguous US using geographically weighted regressions that ensembled predictions from random forests, gradient boosting, and neural network.24–27 The predictions integrated multiple data sources including satellite data, land-use variables, monitoring data, chemical transport model simulations, etc. The ensembled models demonstrated better predictive performance than that of the individual machine learners with 10-fold cross-validated R-squared values of 0.89 for annual PM2.5 predictions and 0.84 for annual NO2 predictions. This performance was in held-out monitors not used for model fitting. Based on these predictions at 1-km2 grid cells and ZIP code shapefiles provided by the Environmental Systems Research Institute (ESRI), we estimated concentrations at ZIP codes by averaging the predictions at grids whose centroids were inside the polygonal area for general ZIP codes, or assigning the prediction at the nearest grid for other ZIP Codes that did not have polygon representations, such as apartment buildings, military bases, or post offices.28
The ZIP code-level concentrations were considered as proxy exposure measurements and were linked to each beneficiary in each year based on their residential ZIP codes. For each beneficiary in each year, we examined three exposure windows: the moving average exposure over that year and the previous two years (lag 0–2 years), the moving average over the previous 3 to 5 years (lag 3–5 years), and the moving average over the previous 6 to 10 years (lag 6–10 years).
Community-level covariates
We included a variety of annual, ZIP code-level covariates in the analysis as potential confounders and effect modifiers. The percent of ever smokers, average body mass index (BMI), and number of active medical doctors per 1000 people were derived from the Behavioral Risk Factor Surveillance System and the Area Health Resources Files.29,30 The percent White, percent Black, percent below poverty, percent rental housing, percent education below high school, and median household income were derived from the Decennial Census and the American Community Survey.31–33 The distance from the centroid of each ZIP code to the nearest hospital was calculated based on the ESRI’s hospital distribution files, which was considered as a proxy for average distance to the nearest hospital.34 The percent of Medicare beneficiaries with diabetes who had a lipid-panel test in a year and the percent beneficiaries with at least one ambulatory doctor’s visit in a year were derived from the Dartmouth Health Atlas.35 Rurality, as measured by population density, was derived from the NASA Socioeconomic Data and Applications Center.36 The Normalized Difference Vegetation Index (NDVI), a measure of greenness that was associated with cancer incidence,37 was obtained from the NASA’s Moderate Resolution Imaging Spectroradiometer Land data products.38 The national Area Deprivation Index (ADI), a composite measure of neighborhood disadvantage level, was obtained from the Neighborhood Atlas website.39
Statistical analysis
We had six exposures of interest, specifically lag 0 to 2, lag 3 to 5, and lag 6 to 10 exposures to PM2.5 and NO2. For each cancer and each exposure, to obtain the independent effect estimate, we employed an inverse probability weighting (IPW)-based additive model in which the other five exposures were adjusted as confounders.20,40 The analysis consisted of two stages: a design stage where the inverse probability weight (IPW) of each person-year’s exposure was estimated to provide a scalar summary that can be used to eliminate confounding and competing risk through weighting, and an analysis stage where the additive association of the exposure with the cancer risk was estimated.
In the first stage, the IPW (swi) for person-year i was estimated by
where K (·) represented a kernel density estimator; xi the exposure level; the average exposure across all person-years; ci a set of confounders including the other five exposures, individual risk factors (sex [male or female], race/ethnicity [White, Black, or other], age group after initial 10-year cancer-free follow-up [75–84 or ≥85 years], and Medicaid enrollment status [yes or no]), and community-level covariates as delineated above; Di the indicator of death (0 or 1); and a gradient boosting machine (GBM) prediction model with a gaussian loss function.41 Overall, the above function incorporated the stabilized IPWs for both confounding and competing risk of death. The use of nonparametric algorithms of kernel density and GBM had superior performance over parametric models in terms of bias and flexibility: the kernel density function relaxed the strong assumption of distributional form of the exposure residuals, and the GBM captured all possible nonlinearity and interactions of covariates and was unaffected by the potential autocorrelation.42,43 If the assumptions underlying the propensity score were met, this would provide a causal model.40
In the analysis stage, we fitted the following linear probability model weighted by the IPWs obtained in the previous stage to estimate the additive association of the exposure with the absolute cancer risk,
where Yi was the outcome of cancer diagnosis (0 or 1). Given the estimated association β1, we further estimated the annual number of new cancer cases attributed to a unit increase in the exposure by
where N was the total number of person-years of follow-up (including those of beneficiaries with cancer diagnoses in the first 10 years of follow-up) and the duration of study was 17 years.
To examine whether the associations remained at low exposure levels, for PM2.5, we repeated the whole analysis for person-years with exposure levels at lag 0 to 2, lag 3 to 5, and lag 6 to 10 years all below 10 µg/m3; for NO2, we repeated the whole analysis for person-years with exposure levels at lag 0 to 2, lag 3 to 5, and lag 6 to 10 years all below 20 parts per billion (ppb). We further examined the associations at stricter exposure levels, specifically the newly updated World Health Organization air quality guidelines ([AQG]; 5 µg/m3 for PM2.5 and 10 ppb for NO2).44
To examine whether certain subpopulations and communities experienced higher cancer risk in the impact of air pollution, we conducted separate analysis for each subgroup by age, sex (for colorectal cancer only), race/ethnicity, ever enrollment in Medicaid, and upper or lower quartile of community-level percent of ever smokers, average BMI, population density, and ADI.
Results
The cohorts had 3.8 million participants for breast cancer, 6.5 million for colorectal cancer, 4.0 million for endometrial cancer, and 2.2 million for prostate cancer. The cohorts predominantly consisted of Whites (>90%) and beneficiaries aged between 75 and 84 years (>80%) after initial 10-year cancer-free follow-up (Table 1). Beneficiaries excluded from the cohort due to a cancer diagnosis, death, or end of the study within the first 10 years after the Medicare enrollment had larger population sizes, higher numbers of cancer diagnoses, and lower proportions of White (Table S1; http://links.lww.com/EE/A234). Over the study period between 2000 and 2016, the annual concentrations of PM2.5 averaged 9.8 µg/m3 and annual NO2 averaged 17.3 ppb (Table 2). For general ZIP Codes that had polygon representations, the average distance from a ZIP Code centroid to its boundary was ≈7.3 km; the fifth percentile corresponded to a distance of 1.2 km, while the 95th percentile represented a distance of 17.5 km.
Table 1.
Demographic characteristics of the constructed Medicare cohorts.
Breast cancer | Colorectal cancer | Endometrial cancer | Prostate cancer | |
---|---|---|---|---|
Number of individuals (%) | 3,756,831 (100) | 6,457,863 (100) | 4,044,008 (100) | 2,161,156 (100) |
Number of person-years | 55,133,844 | 94,243,120 | 59,429,929 | 31,113,440 |
Number of cancer diagnoses (%) | 90,421 (2.4) | 68,456 (1.0) | 21,660 (0.5) | 80,615 (3.7) |
Sex | ||||
Male (%) | 0 | 2,459,690 (38.1) | 0 | 2,161,156 (100) |
Female (%) | 3,756,831 (100) | 3,998,173 (61.9) | 4,044,008 (100) | 0 |
Race | ||||
White (%) | 3,416,371 (91.0) | 5,902,528 (91.4) | 3,682,920 (91.1) | 1,989,780 (92.1) |
Black (%) | 206,382 (5.5) | 328,257 (5.1) | 220,178 (5.4) | 90,702 (4.2) |
Othera (%) | 134,078 (3.5) | 227,078 (3.5) | 140,910 (3.5) | 80,674 (3.7) |
Age group after initial 10-year cancer-free follow-up (years) | ||||
75–84 (%) | 3,105,175 (82.7) | 5,511,808 (85.4) | 3,353,113 (82.9) | 1,918,776 (88.8) |
≥85 (%) | 651,656 (17.3) | 946,055 (14.6) | 690,895 (17.1) | 242,380 (11.2) |
Ever enrolled in Medicaid (%) | 486,759 (13.0) | 665,057 (10.3) | 513,226 (12.7) | 146,367 (6.8) |
aOther indicates Asian, Hispanic, Native American, Pacific Islander, and multiracial individuals.
Table 2.
Distributions of annual concentrations of PM2.5 and NO2 at ZIP codes among all years in the contiguous US.
PM2.5 (µg/m3) | NO2 (ppb) | |
---|---|---|
Minimum | 0.0 | 0.0 |
10th percentile | 5.8 | 7.1 |
25th percentile | 7.6 | 9.7 |
Median | 9.8 | 15.0 |
Mean | 9.8 | 17.3 |
75th percentile | 11.8 | 21.9 |
90th percentile | 14.0 | 31.1 |
Maximum | 30.9 | 127.6 |
Results of the main analysis and low-level analysis are shown in Table 3. For breast cancer, exposures to PM2.5 at lag 0 to 2 and lag 3 to 5 years were statistically and significantly associated with decreased cancer risk, while at lag 6 to 10 year, the association was insignificant. Exposures to NO2 at lag 0 to 2, lag 3 to 5, and lag 6 to 10 years were all associated with increased breast cancer risk. In the low-level analysis in which the analysis was restricted to exposure levels below 10 µg/m3 for PM2.5 and below 20 ppb for NO2, both pollutants across the lag periods were associated with increased cancer risk. Notably, in the below-AQG analysis in which the analysis was restricted to exposure levels below 5 µg/m3 for PM2.5 and below 10 ppb for NO2, substantially larger positive associations were observed for both pollutants across the lag periods. These additive associations were translated to hundreds to thousands of newly diagnosed breast cancer cases in each year within the cohort. As an example, each 1-µg/m3 increase in lag 0 to 2 PM2.5 was associated with 3,291 (95% confidence interval [CI] = 2,280, 4,302) new breast cancer diagnoses per year, and each 1-ppb increase in lag 6 to 10 NO2 was associated with 809 (95% CI = 548, 1,071) new breast cancer diagnoses per year among the cohort.
Table 3.
Absolute increases (and 95% CIs) in the risk of cancer diagnosis and the annual number of new cancer cases per unit increase in PM2.5 and NO2.
Breast cancer | Colorectal cancer | Endometrial cancer | Prostate cancer | |||||
---|---|---|---|---|---|---|---|---|
Absolute increase in the risk of cancer diagnosis | Annual number of new cancer cases | Absolute increase in the risk of cancer diagnosis | Annual number of new cancer cases | Absolute increase in the risk of cancer diagnosis | Annual number of new cancer cases | Absolute increase in the risk of cancer diagnosis | Annual number of new cancer cases | |
Main analysisa | ||||||||
PM2.5, lag 0–2 years | −0.0024% (−0.0038%, −0.0010%) | −78 (−125, −32) | 0.0047% (0.0040%, 0.0054%) | 261 (220, 301) | 0.0003% (−0.0004%, 0.0009%) | 10 (−13, 33) | 0.0072% (0.0048%, 0.0097%) | 132 (87, 178) |
PM2.5, lag 3–5 years | −0.0026% (−0.0038%, −0.0013%) | −83 (−125, −42) | 0.0040% (0.0033%, 0.0046%) | 219 (183, 255) | −0.0005% (−0.0011%, 0.0001%) | −18 (−38, 3) | 0.0067% (0.0045%, 0.0089%) | 123 (82, 163) |
PM2.5, lag 6–10 years | 0.0006% (−0.0004%, 0.0017%) | 21 (−14, 56) | 0.0041% (0.0035%, 0.0046%) | 225 (195, 255) | −0.0005% (−0.0010%, −0.0001%) | −19 (−36, −2) | 0.0112% (0.0094%, 0.0131%) | 205 (171, 240) |
NO2, lag 0–2 years | 0.0018% (0.0014%, 0.0021%) | 57 (45, 69) | 0.0005% (0.0003%, 0.0007%) | 29 (18, 39) | 0.0000% (−0.0001%, 0.0002%) | 1 (−5, 7) | 0.0035% (0.0029%, 0.0041%) | 64 (52, 75) |
NO2, lag 3–5 years | 0.0014% (0.0011%, 0.0017%) | 46 (35, 56) | 0.0009% (0.0007%, 0.0010%) | 48 (39, 58) | 0.0000% (−0.0002%, 0.0001%) | −2 (−7, 4) | 0.0041% (0.0035%, 0.0046%) | 74 (64, 85) |
NO2, lag 6–10 years | 0.0012% (0.0009%, 0.0015%) | 39 (29, 49) | 0.0003% (0.0002%, 0.0005%) | 18 (9, 26) | −0.0001% (−0.0003%, 0.0000%) | −5 (−10, 0) | 0.0047% (0.0041%, 0.0052%) | 85 (75, 95) |
Low-level analysisb | ||||||||
PM2.5, lag 0–2 years | 0.0177% (0.0136%, 0.0218%) | 575 (441, 708) | 0.0069% (0.0050%, 0.0089%) | 383 (276, 491) | 0.0031% (0.0012%, 0.0050%) | 109 (44, 174) | 0.0380% (0.0314%, 0.0446%) | 695 (575, 816) |
PM2.5, lag 3–5 years | 0.0075% (0.0036%, 0.0113%) | 242 (116, 368) | 0.0078% (0.0060%, 0.0097%) | 434 (332, 536) | 0.0006% (−0.0011%, 0.0024%) | 22 (−40, 84) | 0.0310% (0.0248%, 0.0371%) | 566 (453, 680) |
PM2.5, lag 6–10 years | 0.0158% (0.0121%, 0.0195%) | 512 (392, 632) | 0.0065% (0.0048%, 0.0083%) | 362 (265, 458) | 0.0000% (−0.0017%, 0.0017%) | 1 (−58, 61) | 0.0228% (0.0168%, 0.0287%) | 417 (308, 525) |
NO2, lag 0–2 years | 0.0079% (0.0065%, 0.0092%) | 255 (211, 299) | 0.0033% (0.0026%, 0.0040%) | 182 (144, 221) | 0.0030% (0.0023%, 0.0036%) | 104 (82, 125) | 0.0057% (0.0034%, 0.0080%) | 105 (63, 147) |
NO2, lag 3–5 years | 0.0043% (0.0030%, 0.0057%) | 141 (97, 185) | 0.0008% (0.0001%, 0.0014%) | 42 (4, 80) | 0.0009% (0.0002%, 0.0015%) | 30 (8, 52) | 0.0053% (0.0031%, 0.0076%) | 97 (56, 139) |
NO2, lag 6–10 years | 0.0052% (0.0040%, 0.0064%) | 169 (130, 208) | 0.0001% (−0.0005%, 0.0007%) | 8 (−26, 41) | −0.0005% (−0.0010%, 0.0001%) | −17 (−36, 2) | 0.0115% (0.0094%, 0.0135%) | 210 (173, 247) |
Below-AQG analysisc | ||||||||
PM2.5, lag 0–2 years | 0.1015% (0.0703%, 0.1326%) | 3,291 (2,280, 4,302) | 0.0631% (0.0490%, 0.0772%) | 3,499 (2,717, 4,281) | 0.0265% (0.0123%, 0.0407%) | 926 (430, 1,423) | 0.1986% (0.1525%, 0.2448%) | 3,635 (2,791, 4,480) |
PM2.5, lag 3–5 years | 0.0537% (0.0207%, 0.0868%) | 1,743 (672, 2,814) | 0.0140% (−0.0004%, 0.0285%) | 777 (−24, 1,579) | 0.0280% (0.0136%, 0.0425%) | 980 (475, 1,484) | 0.1583% (0.1100%, 0.2066%) | 2,897 (2,014, 3,780) |
PM2.5, lag 6–10 years | 0.0673% (0.0330%, 0.1017%) | 2,184 (1,070, 3,298) | 0.0521% (0.0367%, 0.0676%) | 2,890 (2,034, 3,746) | −0.0130% (−0.0282%, 0.0021%) | −455 (−985, 75) | 0.1040% (0.0540%, 0.1541%) | 1,904 (989, 2,819) |
NO2, lag 0–2 years | 0.0345% (0.0269%, 0.0420%) | 1,118 (872, 1,364) | 0.0180% (0.0142%, 0.0218%) | 997 (785, 1,209) | 0.0121% (0.0086%, 0.0156%) | 423 (300, 546) | 0.0568% (0.0442%, 0.0693%) | 1,039 (809, 1,268) |
NO2, lag 3–5 years | 0.0188% (0.0109%, 0.0267%) | 609 (353, 866) | 0.0000% (−0.0039%, 0.0039%) | 0 (−214, 215) | 0.0043% (0.0007%, 0.0080%) | 152 (24, 279) | 0.0192% (0.0065%, 0.0318%) | 351 (119, 583) |
NO2, lag 6–10 years | 0.0249% (0.0169%, 0.0330%) | 809 (548, 1,071) | 0.0067% (0.0026%, 0.0107%) | 371 (146, 595) | 0.0040% (0.0003%, 0.0078%) | 141 (10, 272) | 0.0409% (0.0276%, 0.0542%) | 748 (505, 992) |
aMain analysis was conducted among the entire cohort.
bFor PM2.5, the low-level analysis was conducted among person-years with exposure levels at lag 0–2 years, lag 3–5 years, and lag 6–10 years all below 10 µg/m3; for NO2, the low-concentration analysis was conducted among person-years with exposure levels at lag 0–2 years, lag 3–5 years, and lag 6–10 years all below 20 ppb.
cFor PM2.5, the below-guideline analysis was conducted among person-years with exposure levels at lag 0–2 years, lag 3–5 years, and lag 6–10 years all below 5 µg/m3; for NO2, the below-guideline analysis was conducted among person-years with exposure levels at lag 0–2 years, lag 3–5 years, and lag 6–10 years all below 10 ppb.
For colorectal cancer, both pollutants across the lag periods were associated with increased cancer risk. The effect sizes for PM2.5 were larger than those for NO2. In the low-level analysis, associations for PM2.5 were slightly larger than those for the main analysis, and for NO2 the associations attenuated with increasing lag periods. In the below-AQG analysis, substantially larger associations were observed for both pollutants, which were translated to hundreds to thousands of newly diagnosed colorectal cancer cases per year among the cohort.
For endometrial cancer, most associations were not statistically significant across the entire exposure ranges. However, the lag 6 to 10 PM2.5 exposure was marginally and significantly associated with decreased cancer risk. In the low-level and below-AQG analyses, the associations for both pollutants attenuated with increasing lag periods.
For prostate cancer, all the exposures were associated with increased cancer risk across the entire exposure ranges. In the low-level analysis, all the effect sizes were larger compared with those in the full exposure ranges. In the below-AQG analysis, the associations became considerably larger, which were translated to hundreds to thousands of newly diagnosed prostate cancer cases per year among the cohort.
Susceptibility among subpopulations was mostly inconsistent across exposures and cancer types (Figure 1). For breast cancer, those who ever enrolled in Medicaid experienced higher risk associated with PM2.5 exposures over lag 3 to 5 and lag 6 to 10 years, and those who lived in ZIP codes with higher BMI (>75th percentile) had higher risk for NO2 exposures at lag 0 to 2 and lag 3 to 5 years. For colorectal cancer, those who lived in ZIP codes with higher rate of ever smokers (>75th percentile) or average BMI (>75th percentile) had higher risk associated with NO2 exposures at lag 0 to 2 and lag 3 to 5 years. For endometrial cancer, those who lived in ZIP codes with higher average BMI (>75th percentile) were at a higher cancer risk associated with lag 0 to 2 PM2.5 and lag 0 to 2 NO2 exposures. For prostate cancer, Blacks were at a higher cancer risk associated with PM2.5 exposures across lag 0 to 2, lag 3 to 5, and lag 6 to 10 years, and those who lived in ZIP codes with higher average BMI (>75th percentile) had higher risk associated with NO2 exposures across lag 0 to 2, lag 3 to 5, and lag 6 to 10 years.
Figure 1.
Absolute increases (and 95% CIs) in the risk of cancer diagnosis and the annual number of new cancer cases per unit increase in PM2.5 and NO2 for each subgroup of individual and community-level characteristics. Subgroups of community-level characteristics were divided by upper or lower quartiles.
Discussion
In this cohort study of millions of Medicare beneficiaries across the contiguous US, we found clear evidence that chronic exposures to PM2.5 and NO2 increased the risks of colorectal and prostate cancers in a window of 10 years before diagnosis, consistent with the most recent epidemiological studies.11,13,14,16,17,45–47 Compared with previous studies, our study controlled for the competing risk of death, used propensity score and an additive model that provided marginal estimates of effect sizes, and found stronger associations at lower concentrations, including below the newly updated WHO guidelines. In the main analysis, when considering the mutually adjusted and independent additive effect estimates, the combined effects over three exposure windows were substantial: for colorectal cancer, an unit reduction in long-term PM2.5 and NO2 would prevent at least 705 and 95 annual incident cases within the cohort, respectively; for prostate cancer, an unit reduction in long-term PM2.5 and NO2 would prevent at least 460 and 223 annual cases within the cohort, respectively. Given that the effect sizes were considerably larger at exposure levels below the newly updated WHO guidelines, further reducing the pollution levels would achieve greater health and economic benefits by preventing more cancers. In contrast, there was little evidence observed in our main analysis supporting the relevance of PM2.5 and NO2 exposures to endometrial cancer risk, consistent with the Japan’s National Cancer Registry study.12 However, in the low-level and below-AQG analyses, the larger associations suggested that there may be stronger relations between lower levels of PM2.5 and NO2 exposures and the risk of endometrial cancer, and the concentration-response relations may follow a nonlinear pattern in an additive scale.
For breast cancer, the results can be interpreted as associations for postmenopausal breast cancer risk, considering that the Medicare cohort consisted of individuals aged 75 years or older at the time of diagnosis. In accord with Crouse et al.48 and Poulsen et al.13 who investigated the postmenopausal women, we found that NO2 was negatively associated with breast cancer incidence. Evidence for PM2.5 was less clear. Consistent with several large cohort studies,15,16,18,49–51 we detected both positive and negative association for PM2.5 across the entire exposure range. The mixed associations in our study and the previous studies may be due to the heterogeneous changes in chemical compositions and sources of PM2.5.19 The difference in PM2.5 compositions may also explain the more varying results for PM2.5 than those for NO2. However, when restricting the analysis to lower exposure levels, where the compositions of local PM2.5 remained fairly stable,52 we found strongly and consistently adverse impacts of PM2.5. Clearly, future studies with data on PM2.5 compositions are needed to explore the relations of specific chemical components with breast cancer risk. Similar to endometrial cancer, the substantially larger associations in the low-level and below-AQG analyses indicated that the effect of air pollution on breast cancer risk may be more pronounced at lower exposure levels.
Few studies had explored the potential impact of individual- and community-level factors on the susceptibility of cancer to air pollution.53 In this study, we found inconsistent subgroup differences across exposures and cancer types, possibly due to the complex etiology of different cancers.9 However, we found suggestive evidence indicating a disproportionately higher risk of all the cancers associated with NO2 exposures among individuals residing in communities with higher average BMI, suggesting that a lack of access to healthy food and limited opportunities for physical activities may contribute to an increased cancer susceptibility to the adverse effect of NO2.54 The finding of greater susceptibility of the Medicaid enrollees and Blacks to the PM2.5-induced cancer risks was also important in terms of improving health equity. Blacks had been exposed to higher levels of air pollution,55 so the larger effect sizes resulted in an even greater injustice.
Our study had several strengths. First, the inclusion of millions of participants with long-term follow-up allowed for investigating the exposures later in life, from up to 10 years before cancer diagnosis, with ample statistical power. Second, the independent and additive associations made it possible to combine them to obtain the overall estimates for a window of 10 years before cancer diagnosis. As a result, adding up the individual effect estimates for each pollutant was a straightforward and valid approach that provided a better sense of its long-term impact. Third, the use of GBM and kernel density estimator achieved a high level of flexibility and accuracy for estimating IPW, making the results robust against any observed confounding bias. Fourth, accounting for competing risk avoided the associations being overestimated.
This study also had limitations. First, restricted by available data sources, we could not capture individual-level risk factors such as smoking status and BMI, but used community-level smoking rate and average BMI as surrogates, which may lead to confounding bias. For breast cancer, it is important to note that reproductive factors and the use of hormone therapy were not available for the analysis.56 For colorectal cancer, the analysis did not account for alcohol consumption, a recognized risk factor.57 While these factors may not be directly associated with air pollution exposures and therefore may not confound the association between air pollution and cancer incidence, incorporating them in future research can still contribute to obtaining more robust effect estimates. Second, the use of ZIP code-level exposures was subject to measurement error, especially for NO2, which has greater spatial variability.58 However, the larger ZIP codes were generally in less populated and rural areas, where the spatial variability in NO2 exposure may be smaller than in urban areas, where the ZIP codes are smaller. This may limit, but not eliminate, the exposure error. A recent simulation study suggested that correcting for exposure measurement error would lead to larger effect estimates.59 Third, participants in our study were restricted to Medicare beneficiaries with continuous enrollment in both FFS program and Part A and B, which may limit generalizability to the total elderly population. Fourth, restricting the analysis to only beneficiaries who were cancer-free for the initial 10 years of follow-up may introduce selection bias, considering the demographic differences between the analysis group and excluded group. There was a possibility that the beneficiaries who developed cancer during the initial 10 years might be more susceptible to the effects of air pollution. In this case, overlooking those beneficiaries could lead to the associations being underestimated.
Conclusions
Our study of the national Medicare cohort investigated additive associations of 10-year exposures to PM2.5 and NO2 with the risks of breast, prostate, colorectal, and endometrial cancers. We found that exposures to PM2.5 and NO2 were associated with increased risks of colorectal and prostate cancers, but were not associated with endometrial cancer risk. NO2 was associated with a decreased risk of breast cancer, while the association for PM2.5 remained inconclusive. At lower exposure levels, in particular below the newly updated WHO AQG, we observed substantially larger associations between most exposures and the risks of all cancers. The observed additive associations resulted in a substantial number of newly diagnosed cancer cases per year within the cohorts, ranging from hundreds to thousands for each unit increase in exposure. These findings suggested substantial cancer burden associated with exposures to PM2.5 and NO2, emphasizing the urgent need for strategies to mitigate air pollution levels.
Conflicts of interest statement
The authors declare that they have no they have no conflicts of interest with regard to the content of this report.
This work was supported by the National Institutes of Health grants R01ES032418 and P30ES000002.
The Medicare data are available upon request to the Centers for Medicare and Medicaid Services. The grid-level PM2.5 data are available at https://doi.org/10.7927/0rvr-4538. The grid-level NO2 data are available at https://doi.org/10.7927/f8eh-5864. The ZIP code-level PM2.5 and NO2 are available at https://doi.org/10.7927/9yp5-hz11. Other covariate data are publicly available with sources described in the article.
ACKNOWLEDGMENTS
Y.W. and J.S. did conceptualization. Y.W., M.D.Y., E.C., C.S.L., and L.S. did data curation. Y.W. did formal analysis. Y.W., T.M., X.Q., J.H., B.N.V., C.W., L.S. did methodology. J.S. did supervision. Y.W. wrote the original draft. Y.W., M.D.Y., T.M., E.C., C.S.L., X.Q., J.H., B.N.V., C.W., L.S., and J.S. did review and editing.
Supplementary Material
Footnotes
Supplemental digital content is available through direct URL citations in the HTML and PDF versions of this article (www.environepidem.com).
References
- 1.Murray CJL, Aravkin AY, Zheng P, et al. Global burden of 87 risk factors in 204 countries and territories, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet. 2020;396:1223–1249. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Stanek LW, Brown JS, Stanek J, Gift J, Costa DL. Air pollution toxicology—a brief review of the role of the science in shaping the current understanding of air pollution health risks. Toxicol Sci. 2010;120(suppl_1):S8–27. [DOI] [PubMed] [Google Scholar]
- 3.Manisalidis I, Stavropoulou E, Stavropoulos A, Bezirtzoglou E. Environmental and health impacts of air pollution: a review. Front Public Health. 2020;8:14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Loomis D, Grosse Y, Lauby-Secretan B, et al. ; International Agency for Research on Cancer Monograph Working Group IARC. The carcinogenicity of outdoor air pollution. Lancet Oncol. 2013;14:1262–1263. [DOI] [PubMed] [Google Scholar]
- 5.Zhou W, Tian D, He J, et al. Repeated PM2.5 exposure inhibits BEAS-2B cell P53 expression through ROS-Akt-DNMT3B pathway-mediated promoter hypermethylation. Oncotarget. 2016;7:20691–20703. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Sanchez-Guerra M, Zheng Y, Osorio-Yanez C, et al. Effects of particulate matter exposure on blood 5-hydroxymethylation: results from the Beijing truck driver air pollution study. Epigenetics. 2015;10:633–642. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Gondalia R, Baldassari A, Holliday KM, et al. Methylome-wide association study provides evidence of particulate matter air pollution-associated DNA methylation. Environ Int. 2019;132:104723. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Brauer M, Avila-Casado C, Fortoul TI, Vedal S, Stevens B, Churg A. Air pollution and retained particles in the lung. Environ Health Perspect. 2001;109:1039–1043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Turner MC, Andersen ZJ, Baccarelli A, et al. Outdoor air pollution and cancer: an overview of the current evidence and public health recommendations. CA Cancer J Clin. 2020;70:460–479. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.White AJ. Invited perspective: air pollution and breast cancer risk: current state of the evidence and next steps. Environ Health Perspect. 2021;129:51302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Pritchett N, Spangler EC, Gray GM, et al. Exposure to outdoor particulate matter air pollution and risk of gastrointestinal cancers in adults: a systematic review and meta-analysis of epidemiologic evidence. Environ Health Perspect. 2022;130:36001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Hasegawa K, Tsukahara T, Nomiyama T. Association between air pollutants and cancer incidence rates in Japan: An ecological study. Med Sci Forum. 2021;4:15. [Google Scholar]
- 13.Poulsen AH, Hvidtfeldt UA, Sørensen M, et al. Air pollution with NO2, PM2.5, and elemental carbon in relation to risk of breast cancer– a nationwide case-control study from Denmark. Environ Res. 2023;216:114740. [DOI] [PubMed] [Google Scholar]
- 14.Chu H, Xin J, Yuan Q, et al. A prospective study of the associations among fine particulate matter, genetic variants, and the risk of colorectal cancer. Environ Int. 2021;147:106309. [DOI] [PubMed] [Google Scholar]
- 15.Villeneuve PJ, Goldberg MS, Crouse DL, et al. Residential exposure to fine particulate matter air pollution and incident breast cancer in a cohort of Canadian women. Environ Epidemiol. 2018;2:e021. Available at: https://journals.lww.com/environepidem/Fulltext/2018/09000/Residential_exposure_to_fine_particulate_matter.2.aspx. [Google Scholar]
- 16.Hvidtfeldt UA, Chen J, Rodopoulou S, et al. Breast cancer incidence in relation to long-term low-level exposure to air pollution in the ELAPSE pooled cohort. Cancer Epidemiol Biomarkers Prev. 2023;32:105–113. [DOI] [PubMed] [Google Scholar]
- 17.Goldberg MS, Villeneuve PJ, Crouse D, et al. Associations between incident breast cancer and ambient concentrations of nitrogen dioxide from a national land use regression model in the Canadian National Breast Screening Study. Environ Int. 2019;133:105182. [DOI] [PubMed] [Google Scholar]
- 18.Andersen ZJ, Stafoggia M, Weinmayr G, et al. Long-term exposure to ambient air pollution and incidence of postmenopausal breast cancer in 15 European cohorts within the ESCAPE Project. Environ Health Perspect. 2017;125:107005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Raaschou-Nielsen O, Andersen ZJ, Beelen R, et al. Air pollution and lung cancer incidence in 17 European cohorts: prospective analyses from the European Study of Cohorts for Air Pollution Effects (ESCAPE). Lancet Oncol. 2013;14:813–822. [DOI] [PubMed] [Google Scholar]
- 20.Wei Y, Coull B, Koutrakis P, et al. Assessing additive effects of air pollutants on mortality rate in Massachusetts. Environ Health. 2021;20:19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Berry SD, Ngo L, Samelson EJ, Kiel DP. Competing risk of death: an important consideration in studies of older adults. J Am Geriatr Soc. 2010;58:783–787. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Gorina Y, Kramarow EA. Identifying chronic conditions in medicare claims data: evaluating the chronic condition data warehouse algorithm. Health Serv Res. 2011;46:1610–1627. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Ma T, Yazdi MD, Schwartz J, et al. Long-term air pollution exposure and incident stroke in American older adults: A national cohort study. Global Epidemiol. 2022;4:100073. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Di Q, Amini H, Shi L, et al. An ensemble-based model of PM2.5 concentration across the contiguous United States with high spatiotemporal resolution. Environ Int. 2019;130:104909. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Di Q, Amini H, Shi L, et al. Assessing NO2 concentration and model uncertainty with high spatiotemporal resolution across the contiguous United States. Environ Sci Technol. 2020;54:1372–1384. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Di Q, Wei Y, Shtein A, et al. Daily and Annual PM2.5 Concentrations for the Contiguous United States, 1-km Grids, v1 (2000 - 2016). NASA Socioeconomic Data and Applications Center (SEDAC); 2021. [Google Scholar]
- 27.Di Q, Wei Y, Shtein A, et al. Daily and Annual NO2 Concentrations for the Contiguous United States, 1-km Grids, v1 (2000 - 2016). NASA Socioeconomic Data and Applications Center (SEDAC); 2022. [Google Scholar]
- 28.Wei Y, Xing X, Shtein A, et al. Daily and Annual PM2.5, O3, and NO2 Concentrations at ZIP Codes for the Contiguous U.S., 2000-2016, v1.0. NASA Socioeconomic Data and Applications Center (SEDAC); 2022. [Google Scholar]
- 29.CDC. Behavioral Risk Factor Surveillance System. BRFSS 2013 Survey Data and Documentation. 2013.
- 30.Best AE. Secondary data bases and their use in outcomes research: a review of the area resource file and the Healthcare Cost and Utilization Project. J Med Syst. 1999;23:175–181. [DOI] [PubMed] [Google Scholar]
- 31.Bureau UC. 2010 Census Demographic Profile Summary File.
- 32.Bureau UC. 2000 Summary File 3.
- 33.Council NR. Using the American Community Survey: Benefits and Challenges. The National Academies Press; 2007. [Google Scholar]
- 34.Institute ESR. Esri Data & Maps 10, An Esri White Paper. Redlands, CA; 2010. Available at: http://downloads.esri.com/support/whitepapers/ao_/Esri_Data_and_Maps_10.pdf. [Google Scholar]
- 35.Cronenwett JL, Birkmeyer JD. The Dartmouth Atlas of Vascular Health Care. Cardiovasc Surg. 2000;8:409–410. Available at: https://www.ncbi.nlm.nih.gov/pubmed/11184739. [PubMed] [Google Scholar]
- 36.Center for International Earth Science Information Network CCU. Gridded Population of the World, Version 4 (GPWv4): Population Density, Revision 11. NASA Socioeconomic Data and Applications Center (SEDAC); 2018. [Google Scholar]
- 37.Datzmann T, Markevych I, Trautmann F, Heinrich J, Schmitt J, Tesch F. Outdoor air pollution, green space, and cancer incidence in Saxony: a semi-individual cohort study. BMC Public Health. 2018;18:715. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Justice CO, Vermote E, Townshend JRG, et al. The Moderate Resolution Imaging Spectroradiometer (MODIS): land remote sensing for global change research. IEEE Trans Geosci Remote Sens. 1998;36:1228–1249. [Google Scholar]
- 39.2015 Area Deprivation Index. University of Wisconsin School of Medicine and Public Health. Available at: https://www.neighborhoodatlas.medicine.wisc.edu/.
- 40.Robins JM, Hernan MA, Brumback B. Marginal structural models and causal inference in epidemiology. Epidemiology (Cambridge, Mass). 2000;11:550–560. [DOI] [PubMed] [Google Scholar]
- 41.Wei Y, Yazdi MD, Di Q, et al. Emulating causal dose-response relations between air pollutants and mortality in the Medicare population. Environ Health. 2021;20:53. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Terrell GR, Scott DW. Variable kernel density estimation. Ann Stat. 1992;20: 1236–1265. [Google Scholar]
- 43.Friedman JH. Greedy function approximation: a gradient boosting machine. Ann Stat. 2001;29:1189–1232. Available at: http://www.jstor.org/stable/2699986. [Google Scholar]
- 44.Goshua A, Akdis CA, Nadeau KC. World Health Organization global air quality guideline recommendations: executive summary. Allergy. 2022;77:1955–1960. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Ma JW, Lai TJ, Hu SY, Lin TC, Ho WC, Tsan YT. Effect of ambient air pollution on the incidence of colorectal cancer among a diabetic population: a nationwide nested case-control study in Taiwan. BMJ Open. 2020;10:e036955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Youogo LMK, Parent ME, Hystad P, Villeneuve PJ. Ambient air pollution and prostate cancer risk in a population-based Canadian case-control study. Environ Epidemiol. 2022;6:e219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Vigneswaran HT, Jagai JS, Greenwald DT, et al. Association between environmental quality and prostate cancer stage at diagnosis. Prostate Cancer Prostatic Dis. 2021;24:1129–1136. [DOI] [PubMed] [Google Scholar]
- 48.Crouse DL, Goldberg MS, Ross NA, Chen H, Labrèche F. Postmenopausal breast cancer is associated with exposure to traffic-related air pollution in Montreal, Canada: a case-control study. Environ Health Perspect. 2010;118:1578–1583. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Andersen ZJ, Ravnskjær L, Andersen KK, et al. Long-term exposure to fine particulate matter and breast cancer incidence in the Danish nurse cohort study. Cancer Epidemiol Biomarkers Prev. 2017;26:428–430. [DOI] [PubMed] [Google Scholar]
- 50.Hart JE, Bertrand KA, DuPre N, et al. Long-term particulate matter exposures during adulthood and risk of breast cancer incidence in the nurses’ health study II prospective cohort. Cancer Epidemiol Biomarkers Prev. 2016;25:1274–1276. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Reding KW, Young MT, Szpiro AA, et al. Breast cancer risk in relation to ambient air pollution exposure at residences in the sister study cohort. Cancer Epidemiol Biomarkers Prev. 2015;24:1907–1909. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.van Donkelaar A, Martin RV, Li C, Burnett RT. Regional estimates of chemical composition of fine particulate matter using a combined geoscience-statistical method with information from satellites, models, and monitors. Environ Sci Technol. 2019;53:2595–2611. [DOI] [PubMed] [Google Scholar]
- 53.Hiatt RA, Beyeler N. Cancer and climate change. Lancet Oncol. 2020;21:e519–e527. [DOI] [PubMed] [Google Scholar]
- 54.Avgerinos KI, Spyrou N, Mantzoros CS, Dalamaga M. Obesity and cancer risk: Emerging biological mechanisms and perspectives. Metabolism. 2019;92:121–135. [DOI] [PubMed] [Google Scholar]
- 55.Jbaily A, Zhou X, Liu J, et al. Air pollution exposure disparities across US population and income groups. Nature. 2022;601:228–233. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Rosato V, Bosetti C, Negri E, et al. Reproductive and hormonal factors, family history, and breast cancer according to the hormonal receptor status. Eur J Cancer Prev. 2014;23:412–417. Available at: https://journals.lww.com/eurjcancerprev/Fulltext/2014/09000/Reproductive_and_hormonal_factors,_family_history,.11.aspx. [DOI] [PubMed] [Google Scholar]
- 57.Rossi M, Jahanzaib Anwar M, Usman A, Keshavarzian A, Bishehsari F. Colorectal cancer and alcohol consumption-populations to molecules. Cancers (Basel). 2018;10:38. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Wang Y, Bechle MJ, Kim S-Y, et al. Spatial decomposition analysis of NO2 and PM2.5 air pollution in the United States. Atmos Environ. 2020;241:117470. [Google Scholar]
- 59.Wei Y, Qiu X, Yazdi MD, et al. The impact of exposure measurement error on the estimated concentration–response relationship between long-term exposure to PM2.5 and mortality. Environ Health Perspect. 2022;130:77006. [DOI] [PMC free article] [PubMed] [Google Scholar]