Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2021 Jun 4;16(6):e0252719. doi: 10.1371/journal.pone.0252719

Urinary polycyclic aromatic hydrocarbon metabolites and mortality in the United States: A prospective analysis

Achal P Patel 1,#, Suril S Mehta 2,*,#, Alexandra J White 3, Nicole M Niehoff 3, Whitney D Arroyave 4, Amy Wang 2, Ruth M Lunn 2
Editor: Giovanni Signore5
PMCID: PMC8177506  PMID: 34086784

Abstract

Background

Polycyclic aromatic hydrocarbons (PAHs) are ubiquitous organic compounds associated with chronic disease in epidemiologic studies, though the contribution of PAH exposure on fatal outcomes in the U.S. is largely unknown.

Objectives

We investigated urinary hydroxylated PAH metabolites (OH-PAHs) with all-cause and cause-specific mortality in a representative sample of the U.S. population.

Methods

Study participants were ≥20 years old from the National Health and Nutrition Examination Survey 2001–2014. Concentrations (nmol/L) of eight OH-PAHs from four parent PAHs (naphthalene, fluorene, phenanthrene, pyrene) were measured in spot urine samples at examination. We identified all-cause, cancer-specific, and cardiovascular-specific deaths through 2015 using the National Death Index. We used Cox proportional hazards regression to estimate adjusted hazard ratios (HRs) and 95% confidence intervals (CIs) for the association between ΣOH-PAHs and mortality endpoints. We assessed potential heterogeneity by age, gender, smoking status, poverty, and race/ethnicity. Additionally, we examined the overall mixture effect using quantile g-computation.

Results

In 9,739 eligible participants, there were 934 all-cause deaths, 159 cancer-specific deaths, and 108 cardiovascular-specific deaths (median 6.75 years follow-up). A log10 increase in ΣOH-PAHs was associated with higher all-cause mortality (HRadj = 1.39 [95%CI: 1.21, 1.61]), and possibly cancer-specific mortality (HRadj = 1.15 [95%CI: 0.79, 1.69]), and cardiovascular-specific mortality (HRadj = 1.49 [95%CI: 0.94, 2.33]). We observed substantial effect modification by age, smoking status, gender, and race/ethnicity across mortality endpoints. Risk of cardiovascular mortality was higher for non-Hispanic blacks and those in poverty, indicating potential disparities. Quantile g-computation joint associations for a simultaneous quartile increase in OH-PAHs were HRadj = 1.15 [95%CI: 1.02, 1.31], HRadj = 1.41 [95%CI: 1.05, 1.90], and HRadj = 0.98 [95%CI: 0.66, 1.47] for all-cause, cancer-specific, and cardiovascular-specific mortalities, respectively.

Discussion

Our results support a role for total PAH exposure in all-cause and cause-specific mortality in the U.S. population.

Introduction

Polycyclic aromatic hydrocarbons (PAHs) are organic compounds with fused aromatic rings commonly produced via incomplete combustion of organic material. Major PAH sources in the United States include vehicle exhaust, tobacco smoke, coal tar, grilled and smoked foods, agricultural burning, and occupational sources [1]. PAHs are widely distributed in the environment and exposure to the U.S. population is ubiquitous, generally as a complex mixture of multiple PAHs [2, 3].

In human observational studies, PAH exposure has been associated with a range of chronic, and often fatal, diseases including ischemic heart disease, impaired respiratory function, lung cancer, and breast cancer [49]. Occupations with high levels of PAH exposure, including coke ovens, aluminum production, asphalt, and chimney sweeping are associated with excess mortality from lung and other cancers, cardiovascular diseases, and non-malignant respiratory diseases [4, 5, 1014]. The International Agency for Research on Cancer (IARC) [15] and the U.S. National Toxicology Program’s Report on Carcinogens (RoC) [16] have classified more than a dozen individual PAHs as known or likely carcinogens primarily based on mechanistic and toxicological data.

Methods for assessing exposure to PAHs include job-exposure matrices in occupational settings, exposure modeling, personal air monitor sampling, self-reported exposures, and biomarkers [5, 79]. Biomarkers reflect exposure from across multiple routes and thus may better represent internal or biological dose compared to methods that consider a specific exposure route (e.g., personal air sampling). PAH biomarkers reflect shorter term exposure, with a half-life of a week to months for PAH-DNA adducts and hours to days for urinary hydroxylated (OH) PAH metabolites (OH-PAHs) [1719]. Genetic polymorphisms in activation and detoxification pathways impact levels of both types of PAHs biomarkers, while PAH-DNA adduct levels are additionally influenced by polymorphisms in DNA repair pathways [20].

Commonly detected OH-PAHs include urinary metabolites of naphthalene, fluorene, phenanthrene, and pyrene. Importantly, naphthalene, fluorene, and phenanthrene are low-molecular weight PAHs with 2–3 aromatic rings, while pyrene is considered high-molecular weight with 4 aromatic rings [1, 21]. Evidence suggests that higher-molecular weight PAHs with ≥4 aromatic rings, such as benzo[a]pyrene, are primarily excreted through feces and are typically not detectable in urinary samples [2225]. 1-hydroxypyrene (1-PYR) has been widely used as a proxy for total PAHs exposure given its moderate-high correlation with both low and high-molecular weight PAHs [24, 26, 27]. However, 1-PYR alone may not be an adequate proxy given that the magnitude and type of PAHs vary by PAH sources. For instance, 1-PYR levels are strongly linked to dietary patterns, whereas naphthalene and fluorene levels are strongly linked to tobacco and wood smoke exposure [2831]. For this reason, several studies have highlighted the need for assessment of multiple OH-PAHs to obtain a more comprehensive representation of PAHs body burden across multiple exposure sources [30, 32].

Urinary metabolites of naphthalene, fluorene, phenanthrene, and pyrene have been highly detected in the U.S. population across two decades of the National Health and Nutrition Examination Survey (NHANES) biomonitoring [24]. Short half-lives coupled with high levels of detection indicate likely chronic exposure from multiple PAHs sources in the U.S. population. However, despite evidence for chronic exposure and prior evidence suggesting a link with chronic and often fatal disease, the impact of PAHs on mortality in the general population has received limited attention. To date, only one study has examined this relationship using a limited sample of the NHANES population [33]. This study did not account for urinary dilution of PAH concentrations and the number of mortality endpoints was sparse, resulting in highly imprecise estimates for several PAHs. Furthermore, Chen et al. did not account for the dense correlation structure between PAH metabolites.

In this study, we aimed to prospectively evaluate associations between a sum of multiple, commonly detected urinary OH metabolites, as a proxy for total PAH exposure from various sources, and mortality endpoints in the U.S. adult population, with consideration of differences by age, gender, race/ethnicity, smoking and socioeconomic status. Additionally, as a secondary aim, we leverage quantile g-computation, an analytic approach to evaluate the risk associated with environmental mixtures, to gain insight into the joint association of OH metabolites of commonly detected PAHs, individual OH-PAH contributions, and potential heterogeneity.

Methods

Study population and data collection

NHANES is conducted by the National Center for Health Statistics (NCHS) within the U.S. Centers for Disease Control and Prevention (CDC). Individuals are selected using a complex, multistep, probability-based sampling design and are considered representative of the civilian, non-institutionalized U.S. population. NHANES oversamples certain population sub-groups including African Americans, Mexican Americans, low-income white Americans, and 12-19-year-olds. Since 1999, the survey has been continuous, and each cycle covers two years. The survey comprises a household interview and a standardized physical examination in a mobile examination center. Household interviews collect detailed demographic, socioeconomic, dietary, and health-related information. Physical examinations comprise of physiological measurements, medical and dental examinations, biological sampling, and laboratory tests. We included participants from seven consecutive survey cycles (2001–2014), totaling 72,126 individuals. NCHS’ Institutional Review Board approved each NHANES cycle, and survey participants provided informed consent.

NHANES analyzes blood or urine environmental chemical data on a one-third random sub-sample of survey participants. There were 12,316 individuals at least 20 years old with spot urine sample OH-PAHs biomarker data across the 2001–2014 survey cycles.

Exclusion criteria

We excluded 6,763 individuals less than 20 years old at the time of interview or physical examination because mortality ascertainment data is not publicly-available for individuals less than 18 years of age, and children and adolescents differ from adults in physiological profile, which may influence exposure dynamics and response (S1 Fig). We subsequently excluded individuals (N = 21) missing information on vital status as ascertained through the National Death Index (NDI) and individuals (N = 982) missing data on any of the eight OH-PAHs or urinary creatinine. Lastly, we excluded individuals missing information on any covariates defined below under statistical analysis (N = 1,539), improbable follow-up time (N = 6) and accidental deaths (N = 29). Our final analytical sample for all causes of death was 9,739 participants. To assess for potential for selection bias due to exclusion criteria, we compared distribution of exposure and covariates in our study population by mortality status before and after application of exclusion criteria (S1 Table).

Exposure assessment

A single spot urine sample was collected from participants, stored at -20°C, and sent for analysis to the National Center for Environmental Health at CDC. Eight OH-PAHs from four parent PAHs (naphthalene, fluorene, phenanthrene, and pyrene) were measured across all seven survey cycles: 1-hydroxynaphthalene/naphthol (1-NAP); 2-hydroxynaphthtalene/naphthol (2-NAP); 2-hydroxyfluorene (2-FLUO); 3-hydroxyfluorene (3-FLUO); 1-hydroxyphenanthrene (1-PHEN); 2-hydroxyphenanthrene (2-PHEN); 3-hydroxyphenanthrene (3-PHEN); and 1-PYR.

OH-PAHs were measured using enzymatic deconjugation followed by automated solid phase extraction. Quantification of analytes was done through capillary isotope dilution gas chromatography (GC) and high-resolution mass spectroscopy (HRMS) for cycles 2001–2008 [34, 35], GC tandem mass spectrometry (GC-MS/MS) for cycles 2009–2012 [36], and online solid-phase extraction and high-performance liquid chromatography tandem MS (online SPE-HPLC-MS/MS) for 2013–2014 [37].

For the 2013–2014 cycle, online SPE-HPLC-MS/MS quantitated 2-PHEN and 3-PHEN together, which is equivalent to the sum of the two urinary metabolites from previous cycles. We present the two metabolites as Σ [2-PHEN & 3-PHEN].

For each OH-PAH, concentrations (ng/L) below the highest methodological LOD across survey cycles were assigned the value of that highest LOD divided by square root of two. To standardize across differential molecular weights of OH-PAHs, concentrations (ng/L) were converted to molar concentrations (nmol/L). All OH-PAHs were log-transformed following visual and quantitative ascertainment of right skewness. Upon log transformation, all OH-PAHs were normally distributed.

For assessment of mortality in relation to PAHs exposure, a Σ OH-PAHs (nmol/L) was created. This was calculated as the molar sum of 1-NAP, 2-NAP, 2-FLUO, 3-FLUO, 1-PHEN, 2&3 PHEN, and 1 PYR. Σ OH-PAHs was subsequently log-transformed.

Mortality outcomes

Mortality status for the study population was ascertained through the U.S. National Death Index (NDI), a compiled index of death record information from local and state vital statistics offices. NCHS periodically links NHANES surveys with death certificate records from the NDI. We utilized the publicly available Linked Mortality Files provided by NCHS, updated through December 31, 2015, which served as end of follow-up (i.e., point of administrative censoring) (https://www.cdc.gov/nchs/data-linkage/mortality-public.htm). All NDI data is based on death certificates coded to International Classification of Diseases version 10 codes (ICD-10).

We examined three main outcomes: all-cause mortality, cancer-specific mortality (ICD-10: C00-C97), and cardiovascular-specific mortality (ICD-10: I00-I09, I11, I13, I20-I51). Our cancer-specific mortality outcome aimed to identify deaths from incident cancer after baseline. Therefore, we excluded 877 additional individuals with self-reported history of cancer at baseline, resulting in an analytic sample of N = 8,862 for the cancer-specific mortality analyses. Analogously, for our cardiovascular mortality outcome, we excluded 764 individuals with self-reported history of coronary heart disease, heart attack, and congestive heart failure at baseline, resulting in an analytic sample of N = 8,975 for the cardiovascular-specific mortality analyses.

Statistical analysis

In descriptive analyses, we computed geometric means (GM) and standard errors of the geometric mean (GSE) for Σ H-PAHs (nmol/L) by mortality status and relevant covariates. Frequencies of categorical covariates and GM of continuous covariates were additionally computed by mortality status. To account for NHANES complex, multistage, probability sampling design in our analyses, we used PAH subsample weights, stratification variables, and primary sampling units provided by NCHS. A 14-year modified sample weight across seven NHANES cycles was created for each participant based on NCHS guidance (https://www.cdc.gov/nchs/tutorials/environmental/critical_issues/limitations/Info3.htm). We produced Pearson correlation coefficients for all OH-PAHs.

The minimally sufficient adjustment set (covariates) was determined a priori through a directed acyclic graph approach [38] (S2 Fig), including age at time of interview (years), sex (male, female), race/ethnicity (non-Hispanic White, non-Hispanic Black, Hispanic, other), smoking status (current active smoker or environmental tobacco smoke [ETS] exposed, not active smoker and ETS unexposed), urinary creatinine (g/L), body mass index (BMI) (underweight, normal weight, overweight, and obese according to WHO guidelines ((kg/m2); [39]), educational attainment (less than high school, high school graduate, some college or above), family income to poverty ratio (income greater than poverty level, income less than or equal to poverty level), and survey cycle years (cycles 1–7). Our primary method to account for urine dilution was to add urinary creatinine (g/L) as a covariate to all models, as recommended by Barr et al. [40]. Urinary creatinine was measured in urine spot samples through automated colorimetric determination on a Beckman Synchron CX3 clinical analyzer. Smoking status at baseline was derived from a combination of self-reported household interview questions about current smoking status, as well as a laboratory measurement of serum cotinine, indicating exposure to ETS. Serum cotinine (ng/mL) was measured using an isotope dilution HPLC coupled with atmospheric pressure chemical ionization tandem mass spectrometry. Individuals were categorized as current active smokers or ETS exposed based on self-report (Q1: “Smoked at least 100 cigarettes in life?”; Q2: “Age started smoking cigarettes regularly”; Q3: “Do you now smoke cigarettes?”) or serum cotinine concentrations >10 ng/mL.

For analyses of Σ OH-PAHs and mortality endpoints, we used multivariable survey-weighted Cox proportional hazards regression. Adjusted hazard ratios (HRadj) and 95% confidence intervals (CI) are reported. Our models use age, in years, as the timescale for analyses (age at NHANES interview was age of entry into study and age at mortality or age at administrative censoring was the age of exit from study). To verify that the proportional hazards assumption was met, we ran diagnostic tests, including an assessment of interaction between exposure and time in addition to visual inspection of degree of parallelism plotting log cumulative hazard versus log time. Σ OH-PAHs was modeled as a continuous variable in main and subgroup analyses and additionally modeled using a quartile categorization in main analyses. Linear trend was tested by assigning median value of Σ OH-PAH quartiles to the respective quartiles followed by a multivariable model incorporating assigned median values as a continuous exposure variable.

Subgroup analyses were performed for age (<60 years, ≥60 years), gender (male, female), smoking status (current active smoker or ETS exposed, not current active smoker or ETS exposed), family income to poverty ratio (family income at or below poverty level, family income above poverty level) and race/ethnicity (non-Hispanic White, non-Hispanic Black, Hispanic, other). Effect modification was assessed through interaction term(s) between Σ OH-PAHs and sub-group levels followed by assessment of interaction term p-values (two-level sub-group) or the group F-statistic for unequal slopes (multiple-level sub-group). A p-value of less than 0.05 was considered a statistically significant interaction, though we additionally considered differences in magnitude of association across strata as well as biological plausibility.

In sensitivity analyses, we excluded individuals who died within the first and second year of follow-up to examine for the possibility of mortality within these years driving effects. We additionally assessed the impact of urine dilution using both urinary creatinine-corrected OH-PAHs concentrations, as well as adjusting for urinary creatinine as a covariate, as recommended by Weinberg et al. [41]. Lastly, we conducted analyses including prevalent cases of CVD and cancer (i.e. self-reported history of CVD or cancer at baseline) to assess the impact exclusion of such cases may have had on our risk estimates.

Mixtures analysis

For analyses of the overall joint effect of the eight OH-PAHs and contributions of individual OH-PAHs, we used Cox proportional hazards quantile g-computation [42]. Quantile g-computation estimates the change in mortality risk for a one quantile simultaneous increase in the eight OH-PAHs. Importantly, quantile g-computation allows flexibility in direction and magnitude of effect across OH-PAHs, while controlling for mutual confounding among the OH-PAHs. In addition to the overall joint effect, quantile g-computation generates weights for each constituent of the mixture. Weights for OH-PAHs represent the proportion of the total effect for each OH-PAH when all OH-PAHs have an effect in the same direction or the proportion of the positive or negative partial effect when coefficients are in different directions. We chose a quartile categorization of exposure for internal consistency with other presented models. We were not able to survey-weight the quantile g-computation models because specification of sampling strata is a feature not yet developed for this method and NHANES weights are valid only when sampling cluster and strata are defined. We also computed multi-pollutant (mutually-adjusted) survey weighted Cox proportional hazard models for the eight OH-PAHs for each mortality endpoint for comparison to quantile g-computation results. Quantile g-computation and multi-pollutant models all included adjustment for covariates used in our main models.

All analyses were conducted in Statistical Analysis Software version 9.4 (Cary, North Carolina) and R software version 3.6.2 (cran.r-project.org).

Results

Of the 9,739 participants included in the analysis, 934 participants (9.60%) died (Table 1). Follow-up time among the deceased was shorter than those alive at time of censoring (5.63 and 7.56 years, respectively). Across the seven NHANES cycles, OH-PAHs were detected in nearly all participants (detection frequency (DF) > 98%), except for 1-PYR (DF = 60.95%). ΣOH-PAHs were higher among deceased participants compared to alive participants. Compared to living participants, those deceased participants were more likely to be older, have a lower educational attainment, be non-Hispanic white and be at or below the poverty level. Across survey cycles (2001–2014), PAHs exposure was highest in 2005–2006, although no clear trends were apparent. We did not find any differences in baseline characteristics of the study population prior to and after application of exclusion criteria (S1 Table).”

Table 1. Baseline descriptive characteristics in NHANES 2001–2014 (N = 9,739) by mortality status.

Alive (N = 8,805) Deceased (N = 934)
>LOD (%) GM (GSE)
Urinary OH-PAH (ng/L)
1-naphthalene 99.85 2287.34 (61.24) 3560.92 (258.57)
2-naphthalene 99.97 3759.81 (87.54) 3126.28 (172.27)
3-hydroxyfluorene 98.44 112.68 (2.92) 110.58 (7.72)
2-hydroxyfluorene 99.97 277.46 (6.64) 298.24 (18.11)
1-hydroxyphenanthrene 99.67 135.78 (2.24) 136.75 (5.55)
Σ 2- & 3-hydroxyphenanthrene 99.82 151.00 (2.61) 163.57 (8.20)
1-hydroxypyrene 60.95 115.05 (1.75) 89.69 (3.13)
Σ OH-PAHsa --- 8192.57 (187.10) 9134.58 (534.17)
Age at baseline (years) 45.03 (0.25) 66.01 (0.65)
Follow up time (years) 7.56 5.63
Urinary creatinine (g/L) 1.22 (0.01) 1.12 (0.03)
Frequency (SE)
Gender
Male 48.49 (0.58) 54.40 (2.11)
Female 51.51 (0.58) 45.60 (2.11)
Race/ethnicity
Non-Hispanic white 69.74 (1.33) 77.61 (2.00)
Non-Hispanic black 11.02 (0.70) 10.52 (1.04)
Hispanic 12.97 (0.98) 9.23 (1.47)
Other 6.28 (0.42) 2.64 (0.61)
Educational attainment
Less than high school graduate 16.44 (0.72) 30.19 (1.78)
High school graduate 22.82 (0.66) 29.57 (1.36)
Some college or above 60.74 (1.03) 40.24 (1.79)
Smoking statusb
Not active smoker or ETS exposed 71.79 (0.72) 72.27 (1.72)
Active smoker or ETS exposed 28.21 (0.72) 27.74 (1.72)
Body mass index (kg/m2)
Underweight (<18.5) 1.55 (0.17) 2.67 (0.78)
Normal (18.5–24.9) 30.59 (0.69) 29.53 (1.73)
Overweight (25–29.9) 33.36 (0.78) 33.32 (1.64)
Obese (≥30) 34.50 (0.67) 34.49 (1.90)
Family poverty statusc
Above the poverty threshold 86.15 (0.57) 82.17 (1.45)
At or below the poverty threshold 13.85 (0.57) 17.83 (1.45)

Abbreviations: ETS = environmental tobacco smoke; GM = geometric mean; GSE = standard error of the GM; LOD = limit of detection; SE = standard error.

aΣ OH-PAHs include all eight urinary hydroxylated PAH metabolites from four parent compounds (naphthalene, fluorene, phenanthrene, pyrene)

bSmoking status defined as current active smoker based on questionnaire, or serum cotinine concentrations >10 ng/mL, defined at ETS exposure

cPoverty status is calculated as the ratio of the family’s self-reported income to the family’s poverty threshold

OH-PAHs were moderately to highly correlated (r = 0.51–0.95), both within and across a OH-PAH’s parent compound (S2 Table). Participants with higher GM ΣOH-PAH concentrations were more likely to be <60 years old, male, current active smoker or ETS exposed, and underweight or obese. Higher GM ΣOH-PAH participants were also more likely to be non-Hispanic black, have lower educational attainment, and be living at or below the poverty threshold (Table 2). A log10 increase in ΣOH-PAHs was positively associated with increased all-cause mortality (HRadj: 1.39 [95%CI: 1.21, 1.61]; N = 934 deaths) (Table 3). In categorical analysis comparing the highest and lowest quartiles of log10 ΣOH-PAHs, all-cause mortality was elevated (HRadj for Q4 vs. Q1: 1.66 [95%CI: 1.32, 2.09]), with a positive exposure-response relationship (Ptrend < 0.001). An elevated but non-significant association was seen for ΣOH-PAHs and cancer-specific mortality when modeled continuously (HRadj: 1.15 [95%CI: 0.79, 1.69]; N = 159 deaths) and categorically (HRadj for Q4 vs. Q1: 1.56 [95%CI: 0.80, 3.04]). HRadj for cardiovascular-specific mortality (N = 108 deaths) were 1.49 (95%CI: 0.94, 2.33) and 1.79 (95%CI: 0.68, 4.71) when modeled continuously and as Q4 vs. Q1 exposure categories, respectively. While no significant exposure-response relationship was seen for cancer-specific or cardiovascular-specific mortality, there was evidence for a monotonic, positive exposure-response relationship for cancer-specific mortality, albeit limited in precision.

Table 2. Baseline Σ OH-PAHs (ng/L)a by participant characteristics (N = 9,739).

Participant characteristics GM (GSE) of Σ PAHs (ng/L)
Age at baseline
20 to <60 years 8808.11 (219.68)
≥60 years 6621.75 (216.29)
Race/ethnicity
Non-Hispanic white 7819.83 (239.93)
Non-Hispanic black 12305.00 (374.41)
Hispanic 8415.46 (273.22)
Other 7136.77 (463.27)
Gender
Male 9030.31 (229.93)
Female 7568.09 (224.39)
Educational attainment
<High school graduate 11040.00 (448.91)
High school graduate 10168.00 (383.73)
Some college or above 6983.47 (193.79)
Smoking statusb
Not active smoker or ETS exposed 5656.07 (106.50)
Active smoker or ETS exposed 21601.00 (710.26)
Body mass index (kg/m2)
Underweight (<18.5) 9516.43 (1418.73)
Normal (18.5–24.9) 7484.54 (298.65)
Overweight (25–29.9) 7792.44 (232.88)
Obese (≥30) 9440.58 (281.32)
Family poverty statusc
Above the poverty threshold 7773.32 (187.41)
At or below the poverty threshold 11858.00 (483.10)
NHANES survey cycle
2001–2002 6748.33 (511.10)
2003–2004 8615.59 (594.79)
2005–2006 10492.00 (622.53)
2007–2008 8862.80 (672.71)
2009–2010 7952.30 (392.26)
2011–2012 7950.09 (390.85)
2013–2014 7819.23 (301.47)

Abbreviations: ETS = environmental tobacco smoke; GM = geometric mean; GSE = standard error of the GM; NHANES = National Health and Nutrition Examination Survey

aΣ OH-PAHs include 8 urinary PAH metabolites from four parent compounds (naphthalene, fluorene, phenanthrene, pyrene)

bSmoking status defined as current active smoker based on questionnaire, or serum cotinine concentrations >10 ng/mL.

cPoverty status is calculated as the ratio of the family’s self-reported income to the family’s poverty threshold

Table 3. Association between log-transformed ΣOH-PAHs (nmol/L) and all-cause, cancer-specific, and CVD-specific mortality, adjusted for covariatesa.

Mortality type Continuous (per log10-increase) Quartile 1 Quartile 2 Quartile 3 Quartile 4 p-trendb
All causes (N = 9739) Cases 934 240 233 195 266
ΣOH-PAHs (nmol/L), median 1.72 1.17 1.56 1.89 2.40
HRadj (95% CI) 1.39 (1.21, 1.61) Ref. 1.16 (0.89, 1.51) 1.17 (0.89, 1.54) 1.66 (1.32, 2.09) <0.001
Cancer-specific (N = 8862) Cases 159 34 41 37 47
ΣOH-PAHs (nmol/L), median 1.72 1.18 1.56 1.90 2.40
HRadj (95% CI) 1.15 (0.79, 1.69) Ref. 1.14 (0.66, 1.96) 1.35 (0.74, 2.45) 1.56 (0.80, 3.04) 0.17
CVD-specific (N = 8975) Cases 108 32 25 22 29
ΣOH-PAHs (nmol/L), median 1.72 1.17 1.56 1.89 2.39
HRadj (95% CI) 1.49 (0.94, 2.33) Ref. 1.36 (0.55, 3.34) 1.11 (0.47, 2.59) 1.79 (0.68, 4.71) 0.28

Abbreviations: CVD = cardiovascular disease

aModels adjusted for age (years), gender (male/female), race/ethnicity (non-Hispanic white, non-Hispanic black, Hispanic, other race/ethnicity), smoking status (current, not-current), BMI (kg/m2), survey cycle (cycles 1–7), educational attainment (<high school, high school graduate, some college or above), family poverty status (above, at or below family poverty threshold), and urinary creatinine (g/L)

bComputed using a ’continuous’ exposure created out of medians of each quartile of log10 ΣOH-PAHs

In subgroup analyses (Table 4), substantial differences were observed by smoking (Pinteraction = 0.02) and age (Pinteraction = 0.06) for all-cause mortality, by gender (Pinteraction = 0.05) for cancer-specific mortality, and by race/ethnicity for cardiovascular-specific mortality (Pinteraction = 0.02). Mortality across all three endpoints was higher among active smokers and ETS exposed, participants younger than 60 years, and non-Hispanic blacks relative to non-active or passive smokers, participants at or above 60 years, and non-Hispanic white and other race/ethnicities, respectively. Women had an increased risk of cancer-specific mortality (HRadj = 1.53 [95%CI: 1.05, 2.23]) compared to men (HRadj = 0.80 [95%CI: 0.45, 1.42]). Participants living at or below the poverty level had an increased risk of cardiovascular-specific mortality (HRadj = 2.83 [95%CI: 1.14, 6.99]) compared to those living above the poverty level (HRadj = 1.28 [95%CI: 0.78, 2.11]).

Table 4. Assessment of effect heterogeneity or effect measure modification for log-transformed ΣOH-PAHs (nmol/L) concentrations and mortality by age, gender, family poverty status, smoking status, and race/ethnicitya.

All-cause mortality Cancer-specific mortality CVD-specific mortality
events/N HRadj (95%CI) pb events/N HRadj (95%CI) pb events/N HRadj (95%CI) pb
Age at baseline
<60 years 211/6709 1.59 (1.31, 1.93) 59/6423 1.35 (0.88, 2.07) 22/6507 1.60 (0.99, 2.60)
≥60 years 723/3030 1.32 (1.13, 1.54) 0.06 100/2439 1.04 (0.69, 1.56) 0.12 86/2468 1.44 (0.88, 2.37) 0.62
Gender
Male 554/4818 1.37 (1.12, 1.67) 99/4409 0.80 (0.45, 1.42) 61/4337 1.46 (0.82, 2.61)
Female 380/4921 1.41 (1.18, 1.69) 0.80 60/4453 1.53 (1.05, 2.23) 0.05 47/4638 1.50 (0.84, 2.67) 0.95
Family poverty status
>Federal poverty level 739/7709 1.36 (1.17, 1.58) 117/6955 1.20 (0.78, 1.83) 82/7129 1.28 (0.78, 2.11)
≤Federal poverty level 195/2030 1.56 (1.15, 2.11) 0.40 42/1907 0.99 (0.50, 1.98) 0.63 26/1846 2.83 (1.14, 6.99) 0.11
Smoking status
Nonsmoker/no ETS exposure 683/7068 1.27 (1.08, 1.50) 110/6369 1.02 (0.61, 1.72) 77/6501 1.28 (0.76, 2.15)
Active smoker/ETS exposure 251/2671 1.96 (1.43, 2.67) 0.02 49/2493 1.59 (0.86, 2.94) 0.29 31/2474 2.41 (1.09, 5.32) 0.14
Race/ethnicity
Non-Hispanic white 572/4703 1.39 (1.18, 1.63) 75/4095 1.10 (0.70, 1.74) 61/4225 1.58 (0.99, 2.50)
Non-Hispanic black 170/2006 1.57 (1.08, 2.29) 37/1865 1.30 (0.63, 2.68) 25/1865 1.96 (0.79, 4.88)
Other 192/3030 1.23 (0.89, 1.69) 0.58 47/2902 1.23 (0.76, 2.01) 0.88 22/2895 0.48 (0.18, 1.28) 0.02

Abbreviations: CVD = cardiovascular disease; ETS = environmental tobacco smoke

aAll models adjusted for BMI (kg/m2), NHANES survey cycle (cycles 1–7), educational attainment (<high school, high school graduate, some college or above), and urinary creatinine (g/L). Age-stratified models additionally adjusted for age (years), gender (male/female), race/ethnicity (non-Hispanic white, non-Hispanic black, Hispanic, other race/ethnicity), smoking status (current, not-current), and family poverty status (above, at or below family poverty threshold). Gender-stratified models additionally adjusted for age, race/ethnicity, smoking status, and family poverty status; smoking-stratified models additionally adjusted for age, race/ethnicity, gender, and family poverty status; family poverty status additionally adjusted for age, race/ethnicity, gender, and smoking status; race/ethnicity-stratified models additionally adjusted for age, gender, smoking status, and family poverty status; for race/ethnicity stratified models, Hispanic and other race/ethnicities were collapsed into “other” for purposes of heterogeneity analyses due to event sample size limitations.

bTest for heterogeneity was computed using either the p-value for the product term in the model (age, gender, smoking, poverty models) or through the F-test (race/ethnicity model)

In models excluding deaths occurring within 1- or 2-years of baseline (S3 Table), we observed similar estimates for each outcome of interest, except that risk of cardiovascular-specific mortality was elevated compared to models used in main analyses (Table 3). When ΣOH-PAHs were both creatinine-corrected (nmol/g Cre) and creatinine included as a covariate in models, results for our three mortality outcomes were generally similar (S4 Table). When including baseline cases, the adjusted risk estimates slightly increased for cancer-specific mortality, and decreased for CVD-specific mortality, but remained non-significant and elevated (S5 Table).

In our quantile g-computation analysis (Table 5), HRadj for the overall joint effect (simultaneous quartile increase) of the eight OH-PAHs were 1.15 (95%CI: 1.02, 1.31), 1.41 (95%CI: 1.05, 1.90), and 0.98 (95% CI: 0.66, 1.47) for all-cause, cancer-specific, and cardiovascular-specific mortality, respectively. In quantile g-computation, 1-NAP, 2-NAP, 2-FLUO, 2&3-PHEN, and 1-PYR were positively weighted while 3-FLUO and 1-PHEN were negatively weighted for all-cause mortality. All OH-PAHs but PHEN were positively weighted for cancer-specific mortality, while only 2&3-PHEN, 1-NAP, and 2-FLUO were positively weighted for cardiovascular-specific mortality (S3 Fig). There was some concordance between direction of weights from quantile g-computation and direction of effects in multi-pollutant models across endpoints (S4 Table).

Table 5. Joint effect of OH-PAHsa on mortality outcomesb using quantile g-computation.

Mortality type OH-PAHs categorization HRadj (95% CI) p-value
All-cause Quartile 1.15 (1.02, 1.31) 0.03
Cancer-specific Quartile 1.41 (1.05, 1.90) 0.02
CVD-specific Quartile 0.98 (0.66, 1.47) 0.93

a OH-PAHs were 1-hydroxynaphthalene, 2-hydroxynaphthalene, 2-hydroxyfluorene, 3-hydroxyfluorene, 1-hydroxyphenanthrene, 2&3-hydroxyphenanthrene, 1-hydroxypyrene

bModels adjusted for age (years), gender (male/female), race/ethnicity (non-Hispanic white, non-Hispanic black, Hispanic, other race/ethnicity), smoking status (current, not-current), BMI (underweight, normal, overweight, obese), survey cycle (cycles 1–7), educational attainment (<high school, high school graduate, some college or above), family poverty status (above, at or below family poverty threshold), and urinary creatinine (g/L)

Discussion

In a large representative sample of the U.S. adult population, our study found that an increase in ΣOH-PAHs urinary concentrations was associated with higher all-cause mortality. For cause-specific deaths, ΣOH-PAHs were associated with non-significant increases in cardiovascular-specific mortality, and to a lesser extent, cancer-specific mortality. We observed substantial differences in PAH deleterious effects by age and smoking status for all-cause mortality, gender for cancer-specific mortality, and race/ethnicity for cardiovascular-specific mortality. Our results also identified disparities in risk of cardiovascular-specific mortality for participants who were non-Hispanic black and those living at or below the poverty level. When examining the joint effect using quantile g-computation, a quartile simultaneous increase of all eight OH-PAHs was associated with increased all-cause and cancer-specific mortality; null effects were seen for cardiovascular-specific mortality. There was consistency in the direction and significance of association for all-cause mortality across our quantile g-computation and sum OH-PAHs approaches. In quantile g-computation, we observed mostly positive and some negative contributions of OH-PAHs for endpoints.

Our findings for all-cause mortality and cardiovascular-specific mortality support an emerging body of epidemiologic literature on the health impact of PAHs, including Chen et al. (although we note that the studies are not directly comparable due to lack of urinary dilution, differences in study population size, and analytic approach). All-cause mortality represents a broad range of disease categories. Though the apportionment of specific causes to overall mortality was not examined as such data is not publicly available, leading causes of death (excluding unintentional injury) in the U.S. include lower respiratory disease, cerebrovascular disease, cardiovascular disease, and cancer [43]. In occupational settings, PAHs have been linked to fatal ischemic heart disease and fatal respiratory disease [4, 5]; however, these PAHs are often found at higher concentrations compared to the general population. Evidence of PAH exposure and cardiovascular-specific mortality in non-occupational populations is not well-characterized. A Chinese population-based cohort reported an association between OH-PAHs and 10-year atherosclerotic cardiovascular disease risk score, but not coronary heart disease [44]. Previous NHANES studies have reported positive associations between PHEN and prevalence of self-reported cardiovascular disease [45] as well as positive associations between OH-PAHs and prevalence of self-reported CVD [46]. Similarly, we found no mortality studies examining fatal respiratory diseases and PAH biomarkers, though cross-sectional studies have reported significant decreases in respiratory measures associated with commonly measured OH-PAHs [9, 47]. Although understanding of mechanisms through which PAHs impact human health remains a work in progress, one important mechanism is thought to be PAH induced aryl hydrocarbon receptor (AhR) activation and ensuing inflammatory, oxidative, and genotoxic effects [4850]. AhR is highly expressed in liver, adipose, and bronchial tissue and is pivotal in cardiac development and function [5154].

PAH-related health disparities by race/ethnicity (non-Hispanic black) and poverty status (at or below the poverty level) were seen for cardiovascular-related mortality. In the US, non-Hispanic black and individuals below the poverty level have higher cardiovascular mortality compared to other race/ethnicities and individuals above the poverty level [55]. We also saw overall higher OH-PAH concentrations among these minority population groups, indicating higher overall PAH exposure. Trend analysis from non-smoking NHANES participants from 2001–2014 also revealed increasing concentrations of PAHs among minority groups, including Non-Hispanic blacks and Mexican Americans [56]. One possible explanation for the observed disparities for PAH exposure and cardiovascular mortality is the interplay of psychosocial stressors (e.g., allostatic load) on cardiovascular disease among these population groups [57]. PAHs may be linked to some allostatic biomarkers; an NHANES study found an association with C-reactive protein and urinary PAH biomarkers [46]. Findings by smoking status in relation to all-cause mortality highlighted that PAH-related mortality risk from non-tobacco smoke sources is substantial and that PAHs from tobacco smoke further elevate mortality risk, possibly through differential PAHs profile and/or synergy among PAHs and other deleterious components of tobacco smoke such as carbon monoxide and formaldehyde.

We observed weak elevated risk of cancer-specific mortality with increasing OH-PAH concentrations, with substantial differences where there was elevated risk among women and slightly reduced risk among men. Cancer is many diseases with different etiologies. Though tumor sites were not accessible in the publicly available NDI dataset, leading causes of mortality by cancer subtype in the US, including lung, breast, and colorectal cancer, have relevance to PAH exposure. Urinary biomarkers of PAH exposure (2-NAP, 1-PYR) have been associated with increased lung cancer, especially in the presence of biomarkers of oxidative stress [58]. For breast cancer, several case-control investigations have reported positive associations with breast cancer incidence [6, 7, 59], although Lee et al. reported slight negative associations [60]. As a check on the possibility of undetected, potentially advanced cancer driving effect patterns, we excluded cancer deaths within 1 and 2 years of cohort entry in sensitivity analyses and found that doing so had no influence on findings. Our pattern of findings supports some of the aforementioned literature, although it should be noted that these studies [6, 7, 59] reported on breast cancer incidence, while we report on cancer mortality from incident cancer. Even if breast cancer is a major contributor to cancer mortality in this representative sample of the US population, the effect of PAHs on breast cancer incidence and mortality may not be directly comparable. With regards to gender-specific differences in cancer outcomes in relation to PAH exposure in this study as well as in Chen et al. [33], Guo et al. [61] suggested that women are more susceptible than men to chromosomal damage and oxidative stress consequences of PAH (Σ [NAP, FLUO, PHEN, PYR]) exposure [61].

The lack of consistency in some results between ΣOH-PAHs and our mixtures analysis using quantile g-computation highlight differences in approaches reflective of their divergent purposes. ΣOH-PAHs approximate the total body burden of commonly detected OH-PAHs without further resolution by PAHs mixture composition. Environmental PAHs co-occur as complex mixtures of tens of PAHs, many of which vary in relative toxicity and are poorly detected in biological samples. Therefore, our ΣOH-PAH measure serves as a crude proxy for total PAH exposure, irrespective of individual contributions. By comparison, quantile g-computation in this study estimates the impact of simultaneous quantile increase of all measured OH-PAHs, and examines individual contributions of each metabolite to that joint effect. Quantile g-computation weights, a measure of individual metabolite contribution, are robust to collinearity among measured OH-PAHs where traditional multipollutant regression models tend to perform poorly [62]. From the parent set of examined OH-PAHs, naphthalene is classified by the NTP [16] as reasonably anticipated to be a human carcinogen and by IARC as probably carcinogenic to humans (Group 2B) while fluorene, phenanthrene, and pyrene are deemed non-classifiable as to their carcinogenicity in humans (Group 3) [15]. While multipollutant models showed conflicting negative and positive associations for 1-NAP and 2-NAP, respectively, quantile g-computation positive weights for both are consistent with IARC classification. Additionally, 2-FLUO, 3-FLUO, and 1-PYR were found to contribute positive weights for cancer mortality while 1-PHEN and 2&3-PHEN were found to contribute negative weights cancer mortality. Although Chen et al. previously reported associations for mortality endpoints by individual OH-PAHs that are considered in this study, we note that they don’t correct for mutual confounding among measured OH-PAHs (S2 Table); moreover, previously reported effect estimates are highly imprecise [33]. Given the paucity of human evidence on individual OH-PAHs effects, weights from our quantile g-computation results are intended to inform future analyses, with the aforementioned limitations and also the possibility of confounding by unmeasured factors, including unmeasured PAHs, in mind given that PAHs are a highly complex mixture.

Our study had several limitations. The use of a one-time spot urinary sample may not reflect PAH exposure over longer time frames. Biomonitoring studies demonstrate moderate temporal reliability of spot urinary measurements for PAHs; Dobraca et al. [63] reported intraclass correlation coefficients (ICC) ranging from 0.07 to 0.53 across PAH metabolites considered in this study and ICCs for most PAH metabolites ranged between 0.35 and 0.50 across multiple years. Importantly, concentration from the single spot sample reliably ranked exposure into quartiles consistent with study’s multi-year average [63]. Li et al. [18] reported an ICC of 0.55 for 1-PYR. Reliability was found to be higher in the case of first morning voids or 24-hour urine samples, neither of which are available for this particular environmental NHANES sample [18, 63]. While exposure misclassification is possible, it is unlikely to be differential by mortality status. We were also unable to control for specific dietary patterns (e.g., red meat consumption, grilled/smoked foods consumption) and ambient air pollution; therefore, the potential for residual confounding from non-PAHs components (e.g., PM2.5, heavy metals) of these major PAHs sources cannot be ruled out. Further residual confounding by genetic architecture is possible if genetic differences that impact PAH metabolism and levels also impact mortality endpoints. Though we attempted to account for both active and passive smoking in our analysis using a combination of questionnaire (e.g., “Do you now smoke cigarettes?”) and smoking biomarker (serum cotinine) data, the available NHANES did not allow for a more refined measure of smoking intensity and duration and thus residual confounding due to smoking is possible and may have biased our results. Another limitation is that we had a relatively short follow-up for longer latency period endpoints such as cancer-specific mortality (from incident cancer). Latency periods for common cancers such as lung and breast are on the order of 10 to 20 years [64]. PAHs are hypothesized to function as both initiators and promoters in carcinogenesis and it is likely our analyses capture more of the latter PAHs effect [4850]. Moreover, given that we removed individuals with self-reported history of cancer at baseline in cancer mortality analyses, our analysis also predisposed to incident cancers with relatively high mortality (e.g., lung cancer). Additionally, although we had a robust number of individuals with all-cause mortality (N = 934), we had a relatively small number of individuals with cancer mortality (N = 159) and cardiovascular mortality (N = 108). In mixtures analysis, our simultaneous assessment of quartiles of individual PAHs and covariates against relatively sparse cancer and cardiovascular mortality events could have led to overfitting and, therefore, poorer performance of the model.

Our study had several strengths. This is one of first studies on PAHs exposure and overall and cause-specific mortality in a large, representative sample of the U.S. population. Unlike many prior investigations, we were able to establish temporality through critical linkage of two national databases. NDI is the most comprehensive resource for mortality ascertainment in the U.S.; therefore, the possibility of outcome misclassification in our study is likely minimal. A similar NHANES investigation [33] examined PAH exposure and all-cause mortality in a subset of cycles, though with important distinctions. First, Chen et al. did not account for urinary dilution in PAH measures, which can lead to biased estimates. Secondly, our larger sample size (N = 9,739) compared to Chen et al. (N = 1,409) allows for more robust analyses, including to exploration of effect modification by important sociodemographic and lifestyle factors. Given our study population emanated from a nationally-representative sample, our results may have important public health implications for the U.S. population. In our analyses, we examined the effect of OH-PAHs on mortality in multiple ways, including as a ΣOH-PAHs proxy for PAHs exposure from multiple sources and also as an environmental mixture using quantile g-computation, the latter of which allowed insight into joint OH-PAH contributions.

Conclusions

Our prospective analysis of the U.S. adult population found ΣOH-PAHs to be associated with higher total and cause-specific mortality, confirming and expanding on some of the prior evidence in a population-based, representative sample. We found evidence of effect modification by smoking status, gender, and race/ethnicity across all-cause, cancer-specific, and cardiovascular-specific mortalities, notably potential disparities for cardiovascular-specific mortality for non-Hispanic black and poorer participants. Quantile g-computation allowed us to further assess the total mixtures effect of OH-PAHs, signaling possible OH-PAH effects that may be of interest in future investigations. Results may inform public health efforts aimed at PAH exposure mitigation.

Supporting information

S1 Fig. Schematic diagram of exclusion criteria for study population (NHANES 2001–2015).

(DOCX)

S2 Fig. Directed acyclic graph to determine the relationships between hydroxylated PAH metabolites (OH-PAHs) and mortality.

(DOCX)

S3 Fig. Weights for urinary OH-PAHs from quantile g-computation, by mortality endpoint.

a) Negative and positive weights for OH-PAHs associated with all-cause mortality. b) Negative and positive weights for OH-PAHs associated with cancer-specific mortality. c) Negative and positive weights for OH-PAHs associated with cardiovascular-specific mortality.

(DOCX)

S1 Table. Baseline descriptive characteristics in NHANES 2001–2014 by mortality status, prior to exclusion criteria and in the study population.

(DOCX)

S2 Table. Pearson correlation coefficients of eight urinary OH-PAHs (ng/L) from participants participating in NHANES 2001–2014 (N = 9739).

(DOCX)

S3 Table. Continuous final models of Σ OH-PAHs and all-cause and cause-specific mortality, excluding participants who died within one and two years of baseline.

(DOCX)

S4 Table. Associations between creatinine-corrected Σ OH-PAHs (nmol/g Cre) and mortality, including urinary creatinine (g/L) as an additional covariate.

(DOCX)

S5 Table. Continuous final models of ΣOH-PAHs and all-cause and cause-specific mortality, not excluding self-reported history of cancer or CVD at baseline.

(DOCX)

S6 Table. Multipollutant models examining the association between individual OH-PAHs and all-cause, cancer-specific, and CVD-specific mortality.

(DOCX)

Acknowledgments

We thank Alexander Keil for his guidance on the quantile g-computation, and Kyla Taylor, Paige Bommarito, John Bucher, and Scott Masten for their critical review of the manuscript.

Data Availability

All NHANES and NDI files are available for download from the National Center for Health Statistics database (https://www.cdc.gov/nchs/nhanes/index.htm).

Funding Statement

S.S.M., A.W., R.M.L., National Institute of Environmental Health Sciences, Intramural Research Program Project ES-103317-05, The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. A.P.P., National Institute of Environmental Health Sciences, Training Grant T32 ES007018, The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Boström CE, Gerde P, Hanberg A, Jernström B, Johansson C, Kyrklund T, et al. Cancer risk assessment, indicators, and guidelines for polycyclic aromatic hydrocarbons in the ambient air. Environ Health Perspect. 2002;110 Suppl 3:451–88. Epub 2002/06/13. doi: 10.1289/ehp.110-1241197 ; PubMed Central PMCID: PMC1241197. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Choi H, Harrison R, Komulainen H, Delgado Saborit JM. Polycyclic aromatic hydrocarbons. WHO Guidelines for Indoor Air Quality: Selected Pollutants. Geneva: World Health Organization; 2010. [PubMed] [Google Scholar]
  • 3.Kim KH, Jahan SA, Kabir E, Brown RJ. A review of airborne polycyclic aromatic hydrocarbons (PAHs) and their human health effects. Environment international. 2013;60:71–80. Epub 2013/09/10. doi: 10.1016/j.envint.2013.07.019 . [DOI] [PubMed] [Google Scholar]
  • 4.Burstyn I, Boffetta P, Heederik D, Partanen T, Kromhout H, Svane O, et al. Mortality from obstructive lung diseases and exposure to polycyclic aromatic hydrocarbons among asphalt workers. Am J Epidemiol. 2003;158(5):468–78. Epub 2003/08/26. doi: 10.1093/aje/kwg180 . [DOI] [PubMed] [Google Scholar]
  • 5.Burstyn I, Kromhout H, Partanen T, Svane O, Langård S, Ahrens W, et al. Polycyclic aromatic hydrocarbons and fatal ischemic heart disease. Epidemiology. 2005;16(6):744–50. Epub 2005/10/14. doi: 10.1097/01.ede.0000181310.65043.2f . [DOI] [PubMed] [Google Scholar]
  • 6.Gammon MD, Sagiv SK, Eng SM, Shantakumar S, Gaudet MM, Teitelbaum SL, et al. Polycyclic aromatic hydrocarbon-DNA adducts and breast cancer: a pooled analysis. Archives of environmental health. 2004;59(12):640–9. Epub 2006/06/23. doi: 10.1080/00039890409602948 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.White AJ, Bradshaw PT, Herring AH, Teitelbaum SL, Beyea J, Stellman SD, et al. Exposure to multiple sources of polycyclic aromatic hydrocarbons and breast cancer incidence. Environment international. 2016;89–90:185–92. Epub 2016/02/16. doi: 10.1016/j.envint.2016.02.009 ; PubMed Central PMCID: PMC4818720. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Zhang Y, Tao S, Shen H, Ma J. Inhalation exposure to ambient polycyclic aromatic hydrocarbons and lung cancer risk of Chinese population. Proc Natl Acad Sci U S A. 2009;106(50):21063–7. Epub 2009/12/10. doi: 10.1073/pnas.0905756106 ; PubMed Central PMCID: PMC2789752. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Zhou Y, Sun H, Xie J, Song Y, Liu Y, Huang X, et al. Urinary polycyclic aromatic hydrocarbon metabolites and altered lung function in Wuhan, China. Am J Respir Crit Care Med. 2016;193(8):835–46. Epub 2015/12/05. doi: 10.1164/rccm.201412-2279OC . [DOI] [PubMed] [Google Scholar]
  • 10.Bertrand JP, Chau N, Patris A, Mur JM, Pham QT, Moulin JJ, et al. Mortality due to respiratory cancers in the coke oven plants of the Lorraine coalmining industry (Houillères du Bassin de Lorraine). Br J Ind Med. 1987;44(8):559–65. Epub 1987/08/01. doi: 10.1136/oem.44.8.559 ; PubMed Central PMCID: PMC1007875. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Hogstedt C, Jansson C, Hugosson M, Tinnerberg H, Gustavsson P. Cancer incidence in a cohort of Swedish chimney sweeps, 1958–2006. Am J Public Health. 2013;103(9):1708–14. Epub 2013/01/19. doi: 10.2105/AJPH.2012.300860 ; PubMed Central PMCID: PMC3780661. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Miller BG, Doust E, Cherrie JW, Hurley JF. Lung cancer mortality and exposure to polycyclic aromatic hydrocarbons in British coke oven workers. BMC public health. 2013;13:962. Epub 2013/10/18. doi: 10.1186/1471-2458-13-962 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Redmond CK. Cancer mortality among coke oven workers. Environ Health Perspect. 1983;52:67–73. Epub 1983/10/01. doi: 10.1289/ehp.835267 ; PubMed Central PMCID: PMC1569361. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Vimercati L, Bisceglia L, Cavone D, Caputi A, De Maria L, Delfino MC, et al. Environmental monitoring of PAHs exposure, biomarkers and vital status in coke oven workers. Int J Environ Res Public Health. 2020;17(7). Epub 2020/03/29. doi: 10.3390/ijerph17072199 ; PubMed Central PMCID: PMC7178092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.IARC. Some Non-heterocyclic Polycyclic Aromatic Hydrocarbons and Some Related Exposures. Lyon, France: International Agency for Research on Cancer; 2010. pp. 1017–606 p. [Google Scholar]
  • 16.NTP. Report on Carcinogens, Fourteenth Edition. Research Triangle Park, NC: U.S. Department of Health and Human Services, Public Health Service, National Toxicology Program; 2016. [Google Scholar]
  • 17.Godschalk RWL, Moonen EJC, Schilderman PAEL, Broekmans WMR, Kleinjans JCS, Van Schooten FJ. Exposure-route-dependent DNA adduct formation by polycyclic aromatic hydrocarbons. Carcinogenesis. 2000;21(1):87–92. doi: 10.1093/carcin/21.1.87 [DOI] [PubMed] [Google Scholar]
  • 18.Li Z, Romanoff LC, Lewin MD, Porter EN, Trinidad DA, Needham LL, et al. Variability of urinary concentrations of polycyclic aromatic hydrocarbon metabolite in general population and comparison of spot, first-morning, and 24-h void sampling. J Expo Sci Environ Epidemiol. 2010;20(6):526–35. Epub 2009/08/27. doi: 10.1038/jes.2009.41 ; PubMed Central PMCID: PMC4615693. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Pratt MM, John K, MacLean AB, Afework S, Phillips DH, Poirier MC. Polycyclic aromatic hydrocarbon (PAH) exposure and DNA adduct semi-quantitation in archived human tissues. Int J Environ Res Public Health. 2011;8(7):2675–91. Epub 2011/08/17. doi: 10.3390/ijerph8072675 ; PubMed Central PMCID: PMC3155323. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Pavanello S, Lotti M. Biomonitoring exposures to carcinogens. In: Gupta R, editor. Biomarkers in Toxicology. Oxford: Elsevier; 2014. p. pp. 785–98. [Google Scholar]
  • 21.Srogi K. Monitoring of environmental exposure to polycyclic aromatic hydrocarbons: a review. Environ Chem Lett. 2007;5(4):169–95. Epub 2007/01/01. doi: 10.1007/s10311-007-0095-0 ; PubMed Central PMCID: PMC5614912. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Alhamdow A, Lindh C, Albin M, Gustavsson P, Tinnerberg H, Broberg K. Cardiovascular disease-related serum proteins in workers occupationally exposed to polycyclic aromatic hydrocarbons. Toxicol Sci. 2019;171(1):235–46. Epub 2019/06/23. doi: 10.1093/toxsci/kfz142 ; PubMed Central PMCID: PMC6735884. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Alhamdow A, Tinnerberg H, Lindh C, Albin M, Broberg K. Cancer-related proteins in serum are altered in workers occupationally exposed to polycyclic aromatic hydrocarbons: a cross-sectional study. Carcinogenesis. 2019;40(6):771–81. Epub 2019/02/13. doi: 10.1093/carcin/bgz022 ; PubMed Central PMCID: PMC6612054. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Li Z, Sandau CD, Romanoff LC, Caudill SP, Sjodin A, Needham LL, et al. Concentration and profile of 22 urinary polycyclic aromatic hydrocarbon metabolites in the US population. Environ Res. 2008;107(3):320–31. Epub 2008/03/04. doi: 10.1016/j.envres.2008.01.013 . [DOI] [PubMed] [Google Scholar]
  • 25.Ramesh A, Walker SA, Hood DB, Guillen MD, Schneider K, Weyand EH. Bioavailability and risk assessment of orally ingested polycyclic aromatic hydrocarbons. Int J Toxicol. 2004;23(5):301–33. Epub 2004/10/30. doi: 10.1080/10915810490517063 . [DOI] [PubMed] [Google Scholar]
  • 26.Castano-Viñyals G, D’Errico A, Malats N, Kogevinas M. Biomarkers of exposure to polycyclic aromatic hydrocarbons from environmental air pollution. Occup Environ Med. 2004;61(4):e12. Epub 2004/03/20. doi: 10.1136/oem.2003.008375 ; PubMed Central PMCID: PMC1740739. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Merlo F, Andreassen A, Weston A, Pan CF, Haugen A, Valerio F, et al. Urinary excretion of 1-hydroxypyrene as a marker for exposure to urban air levels of polycyclic aromatic hydrocarbons. Cancer epidemiology, biomarkers & prevention: a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology. 1998;7(2):147–55. [PubMed] [Google Scholar]
  • 28.Kato M, Loomis D, Brooks LM, Gattas GF, Gomes L, Carvalho AB, et al. Urinary biomarkers in charcoal workers exposed to wood smoke in Bahia State, Brazil. Cancer epidemiology, biomarkers & prevention: a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology. 2004;13(6):1005–12. [PubMed] [Google Scholar]
  • 29.Li Z, Mulholland JA, Romanoff LC, Pittman EN, Trinidad DA, Lewin MD, et al. Assessment of non-occupational exposure to polycyclic aromatic hydrocarbons through personal air sampling and urinary biomonitoring. J Environ Monit. 2010;12(5):1110–18. Epub 2011/04/16. doi: 10.1039/c000689k . [DOI] [PubMed] [Google Scholar]
  • 30.Nethery E, Wheeler AJ, Fisher M, Sjödin A, Li Z, Romanoff LC, et al. Urinary polycyclic aromatic hydrocarbons as a biomarker of exposure to PAHs in air: A pilot study among pregnant women. J Expo Sci Environ Epidemiol. 2011;22(1):70–81. doi: 10.1038/jes.2011.32 [DOI] [PubMed] [Google Scholar]
  • 31.St Helen G, Goniewicz ML, Dempsey D, Wilson M, Jacob P 3rd, Benowitz NL. Exposure and kinetics of polycyclic aromatic hydrocarbons (PAHs) in cigarette smokers. Chem Res Toxicol. 2012;25(4):952–64. Epub 2012/03/21. doi: 10.1021/tx300043k ; PubMed Central PMCID: PMC3330130. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Li Z, Trinidad D, Pittman EN, Riley EA, Sjodin A, Dills RL, et al. Urinary polycyclic aromatic hydrocarbon metabolites as biomarkers to woodsmoke exposure—results from a controlled exposure study. J Expo Sci Environ Epidemiol. 2016;26(3):241–8. Epub 2015/01/22. doi: 10.1038/jes.2014.94 ; PubMed Central PMCID: PMC4532642. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Chen YY, Kao TW, Wang CC, Chen YJ, Wu CJ, Lai CH, et al. Polycyclic aromatic hydrocarbon metabolites and mortality risk in an adult population. Environ Geochem Health. 2021;43(1):37–45. Epub 2020/07/28. doi: 10.1007/s10653-020-00663-7 . [DOI] [PubMed] [Google Scholar]
  • 34.Li Z, Romanoff LC, Trinidad DA, Hussain N, Jones RS, Porter EN, et al. Measurement of urinary monohydroxy polycyclic aromatic hydrocarbons using automated liquid-liquid extraction and gas chromatography/isotope dilution high-resolution mass spectrometry. Anal Chem. 2006;78(16):5744–51. Epub 2006/08/16. doi: 10.1021/ac0606094 . [DOI] [PubMed] [Google Scholar]
  • 35.Romanoff LC, Li Z, Young KJ, Blakely NC 3rd, Patterson DG Jr., Sandau CD. Automated solid-phase extraction method for measuring urinary polycyclic aromatic hydrocarbon metabolites in human biomonitoring using isotope-dilution gas chromatography high-resolution mass spectrometry. J Chromatogr B Analyt Technol Biomed Life Sci. 2006;835(1–2):47–54. Epub 2006/03/28. doi: 10.1016/j.jchromb.2006.03.004 . [DOI] [PubMed] [Google Scholar]
  • 36.Li Z, Romanoff LC, Trinidad DA, Pittman EN, Hilton D, Hubbard K, et al. Quantification of 21 metabolites of methylnaphthalenes and polycyclic aromatic hydrocarbons in human urine. Anal Bioanal Chem. 2014;406(13):3119–29. Epub 2014/04/10. doi: 10.1007/s00216-014-7676-0 ; PubMed Central PMCID: PMC4582777. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Wang Y, Meng L, Pittman EN, Etheredge A, Hubbard K, Trinidad DA, et al. Quantification of urinary mono-hydroxylated metabolites of polycyclic aromatic hydrocarbons by on-line solid phase extraction-high performance liquid chromatography-tandem mass spectrometry. Anal Bioanal Chem. 2017;409(4):931–7. Epub 2016/11/01. doi: 10.1007/s00216-016-9933-x ; PubMed Central PMCID: PMC5568775. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Greenland S, Pearl J, Robins JM. Causal diagrams for epidemiologic research. Epidemiology. 1999;10:37–48. doi: 10.1097/00001648-199901000-00005 [DOI] [PubMed] [Google Scholar]
  • 39.Weir CB, Jan A. BMI classification percentile and cut off points [Updated 2020 Jul 10]. StatPearls [Internet]. Treasure Island, FL: StatPearls Publishing; 2020. Jan; 2020. [PubMed] [Google Scholar]
  • 40.Barr DB, Wilder LC, Caudill SP, Gonzalez AJ, Needham LL, Pirkle JL. Urinary creatinine concentrations in the U.S. population: implications for urinary biologic monitoring measurements. Environ Health Perspect. 2005;113(2):192–200. Epub 2005/02/03. doi: 10.1289/ehp.7337 ; PubMed Central PMCID: PMC1277864. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Weinberg CR, Shi M, O’Brien KM, Umbach DM. Adjustment for urinary creatinine or serum lipids for analytes assayed in pooled specimens. Epidemiology. 2019;30(5):768–79. Epub 2019/06/18. doi: 10.1097/EDE.0000000000001053 ; PubMed Central PMCID: PMC6679734. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Keil AP, Buckley JP, O’Brien KM, Ferguson KK, Zhao S, White AJ. A quantile-based g-computation approach to addressing the effects of exposure mixtures. Environ Health Perspect. 2020;128(4):47004. Epub 2020/04/08. doi: 10.1289/EHP5838 ; PubMed Central PMCID: PMC7228100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Hoyert DL, Xu J. Deaths: preliminary data for 2011. Natl Vital Stat Rep. 2012;61(6):1–51. [PubMed] [Google Scholar]
  • 44.Hu C, Hou J, Zhou Y, Sun H, Yin W, Zhang Y, et al. Association of polycyclic aromatic hydrocarbons exposure with atherosclerotic cardiovascular disease risk: A role of mean platelet volume or club cell secretory protein. Environ Pollut. 2018;233:45–53. Epub 2017/10/21. doi: 10.1016/j.envpol.2017.10.042 . [DOI] [PubMed] [Google Scholar]
  • 45.Xu X, Cook RL, Ilacqua VA, Kan H, Talbott EO, Kearney G. Studying associations between urinary metabolites of polycyclic aromatic hydrocarbons (PAHs) and cardiovascular diseases in the United States. Sci Total Environ. 2010;408(21):4943–8. Epub 2010/08/13. doi: 10.1016/j.scitotenv.2010.07.034 . [DOI] [PubMed] [Google Scholar]
  • 46.Alshaarawy O, Elbaz HA, Andrew ME. The association of urinary polycyclic aromatic hydrocarbon biomarkers and cardiovascular disease in the US population. Environment international. 2016;89–90:174–8. Epub 2016/02/16. doi: 10.1016/j.envint.2016.02.006 ; PubMed Central PMCID: PMC4818691. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Cakmak S, Hebbern C, Cakmak JD, Dales RE. The influence of polycyclic aromatic hydrocarbons on lung function in a representative sample of the Canadian population. Environ Pollut. 2017;228:1–7. Epub 2017/05/14. doi: 10.1016/j.envpol.2017.05.013 . [DOI] [PubMed] [Google Scholar]
  • 48.Holme JA, Brinchmann BC, Refsnes M, Låg M, Øvrevik J. Potential role of polycyclic aromatic hydrocarbons as mediators of cardiovascular effects from combustion particles. Environ Health. 2019;18(1):74. Epub 2019/08/24. doi: 10.1186/s12940-019-0514-2 ; PubMed Central PMCID: PMC6704565. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Moorthy B, Chu C, Carlin DJ. Polycyclic aromatic hydrocarbons: From metabolism to lung cancer. Toxicol Sci. 2015;145(1):5–15. doi: 10.1093/toxsci/kfv040 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Nebert DW, Dalton TP, Okey AB, Gonzalez FJ. Role of aryl hydrocarbon receptor-mediated induction of the CYP1 enzymes in environmental toxicity and cancer. J Biol Chem. 2004;279(23):23847–50. doi: 10.1074/jbc.R400004200 [DOI] [PubMed] [Google Scholar]
  • 51.Carreira VS, Fan Y, Wang Q, Zhang X, Kurita H, Ko CI, et al. Ah receptor signaling controls the expression of cardiac development and homeostasis genes. Toxicol Sci. 2015;147(2):425–35. Epub 2015/07/04. doi: 10.1093/toxsci/kfv138 ; PubMed Central PMCID: PMC4707199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Eckers A, Sauerbier E, Anwar-Mohamed A, Hamann I, Esser C, Schroeder P, et al. Detection of a functional xenobiotic response element in a widely employed FoxO-responsive reporter construct. Arch Biochem Biophys. 2011;516(2):138–45. Epub 2011/10/25. doi: 10.1016/j.abb.2011.10.006 . [DOI] [PubMed] [Google Scholar]
  • 53.Tsay JJ, Tchou-Wong KM, Greenberg AK, Pass H, Rom WN. Aryl hydrocarbon receptor and lung cancer. Anticancer research. 2013;33:1247–56. [PMC free article] [PubMed] [Google Scholar]
  • 54.Zhang N. The role of endogenous aryl hydrocarbon receptor signaling in cardiovascular physiology. J Cardiovasc Dis Res. 2011;2(2):91–5. Epub 2011/08/05. doi: 10.4103/0975-3583.83033 ; PubMed Central PMCID: PMC3144625. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Van Dyke M, Greer S, Odom E, Schieb L, Vaughan A, Kramer M, et al. Heart disease death rates among blacks and whites aged >/ = 35 years—United States, 1968–2015. MMWR Surveill Summ. 2018;67(5):1–11. Epub 2018/03/30. doi: 10.15585/mmwr.ss6705a1 ; PubMed Central PMCID: PMC5877350. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Hudson-Hanley B, Smit E, Branscum A, Hystad P, Kile ML. Trends in urinary metabolites of polycyclic aromatic hydrocarbons (PAHs) in the non-smoking U.S. population, NHANES 2001–2014. Chemosphere. 2021;276:130211. Epub 2021/03/21. doi: 10.1016/j.chemosphere.2021.130211 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Duru OK, Harawa NT, Kermah D, Norris KC. Allostatic load burden and racial disparities in mortality. J Natl Med Assoc. 2012;104(1–2):89–95. doi: 10.1016/s0027-9684(15)30120-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Eom SY, Yim DH, Moon SI, Youn JW, Kwon HJ, Oh HC, et al. Polycyclic aromatic hydrocarbon-induced oxidative stress, antioxidant capacity, and the risk of lung cancer: a pilot nested case-control study. Anticancer research. 2013;33(8):3089–97. Epub 2013/07/31. . [PubMed] [Google Scholar]
  • 59.Shen J, Liao Y, Hopper JL, Goldberg M, Santella RM, Terry MB. Dependence of cancer risk from environmental exposures on underlying genetic susceptibility: an illustration with polycyclic aromatic hydrocarbons and breast cancer. British journal of cancer. 2017;116(9):1229–33. Epub 2017/03/30. doi: 10.1038/bjc.2017.81 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Lee KH, Shu XO, Gao YT, Ji BT, Yang G, Blair A, et al. Breast cancer and urinary biomarkers of polycyclic aromatic hydrocarbon and oxidative stress in the Shanghai Women’s Health Study. Cancer epidemiology, biomarkers & prevention: a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology. 2010;19(3):877–83. Epub 2010/02/18. doi: 10.1158/1055-9965.EPI-09-1098 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Guo H, Huang K, Zhang X, Zhang W, Guan L, Kuang D, et al. Women are more susceptible than men to oxidative stress and chromosome damage caused by polycyclic aromatic hydrocarbons exposure. Environ Mol Mutagen. 2014;55(6):472–81. Epub 2014/03/29. doi: 10.1002/em.21866 . [DOI] [PubMed] [Google Scholar]
  • 62.Vatcheva KP, Lee M, McCormick JB, Rahbar MH. Multicollinearity in regression analyses conducted in epidemiologic studies. Epidemiology (Sunnyvale). 2016;6(2):227. Epub 2016/06/09. doi: 10.4172/2161-1165.1000227 ; PubMed Central PMCID: PMC4888898. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Dobraca D, Lum R, Sjödin A, Calafat AM, Laurent CA, Kushi LH, et al. Urinary biomarkers of polycyclic aromatic hydrocarbons in pre- and peripubertal girls in Northern California: Predictors of exposure and temporal variability. Environ Res. 2018;165:46–54. doi: 10.1016/j.envres.2017.11.011 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Nadler DL, Zurbenko IG. Estimating cancer latency times using a Weibull model. Adv Epidemiol. 2014;2014:1–8. doi: 10.1155/2014/746769 [DOI] [Google Scholar]

Decision Letter 0

Giovanni Signore

11 Mar 2021

PONE-D-20-37390

Urinary polycyclic aromatic hydrocarbon metabolites and mortality in the United States: a prospective analysis

PLOS ONE

Dear Dr. Mehta,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

In particular, reviewers raised some concerns regarding how methods are explained and detailed. These issues are of extreme importance in allowing evaluation and replication of the results by external users, and should be fully addressed. Furthermore, statistical robustness should be better supported and discussed, as pointed out by the reviewers.

Please submit your revised manuscript by Apr 23 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Giovanni Signore

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: No

Reviewer #2: I Don't Know

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Title: Urinary polycyclic aromatic hydrocarbon metabolites and mortality in the United States: a prospective analysis.

The authors report on an analysis using NHANES data to investigate select urinary PAH metabolite concentrations and mortality. Individuals 20 years of age or older for seven consecutive survey cycles between 2001 and 2014 were included in the analysis. In current analysis is comprise of data from 9,739 individuals. Urine samples were analyzed at the National Center for Environmental Health, Centers for Disease Control and Prevention. In total, eight PAH metabolites were quantified, using different methods over the seven survey cycles. Vital status as of December 31, 2015 was ascertained by linkage with the National Death Index. All-cause, cancer-specific, and cardiovascular-specific mortality were reported on. The statistical analyses included Cox proportional hazards regression, using age (years) as the time scale. Covariates were determined a priori with a directed acyclic graph. PAH metabolite mixtures were analyzed with quantile g-computation, a novel statistical approach that has advantages over weighted quantile sum regression. The primary results reported were that the log 10 increase in the sum of Oh-PAH urinary metabolite concentrations were associated with all-cause mortality (HR= 1.39; 95% CI = 1.21-1.61), and possibility for cancer-specific mortality (HR= 1.15; 95 CI = 0.79-1.69) and cardiovascular-specific mortality (HR=1.49; 95% CI = 0.94- 2.33). There was evidence of effect measure modification for age, smoking status, gender and race/ethnicity. Quantile g-computation associations for all-cause (HR=1.15; 95% CI = 1.15; 95% CI = 1.02-1.31), cancer-specific (HR=1.41; 95%CI = 1.05-1.90), and cardiovascular-specific mortality (HR=0.98; 95%CI = 0.66-1.47) were also reported. The authors conclude that total PAH exposure has a role in all-cause and cause-specific mortality and reported that reducing exposure would result in a 11.33% reduction in all-cause mortality, 10.64% reduction in cancer-specific mortality, and a 13.49% reduction in cardiovascular mortality in the United States.

Comments:

Overall, the manuscript is very well written and addresses an important, ubiquitous exposure that at high occupational levels have been demonstrated to cause cancer of the lung, bladder and skin. It is less clear whether lower non-occupational levels are associated with increased risk. Specific comments are detailed below:

1) Neither the introduction nor the discussion section describes the substantial literature published on coke oven workers and mortality. Given that coke oven workers have some of the highest reported exposure to PAHs, the results of these studies seem particularly relevant to the current manuscript, especially the mortality studies. These studies clearly showed association with working on the tops of coke ovens with lung cancer, but increases in cardiovascular were not as clearly demonstrated. These data need to considered and discussed in the manuscript.

2) The methods section indicates that 2,577 excluded from the analysis due to missing information or improbable values for follow-up time or accidental deaths. It seems that 20% of the sample is missing, could this missingness have biased the associations, especially because the observed associations were generally small. In addition, the number of individuals 20 years of age and older who donated a urine samples is not reported. The reader has to calculate it from then text. It would be more useful to the reader to just report that there 12,316 individuals 20 years of age or older with urine samples.

3) Individuals with a history of cancer were excluded from the analysis of cancer-specific mortality as were those with a history of CVD for the cardiovascular-specific mortality analyses. It is customary to removed prevalent cases at baseline from analyses intended to assess incidence of a disease, but extending this rationale to analyses intended to assess mortality seems as if it could bias the hazard ratios. A sensitivity analysis would probably inform on this.

4) The authors indicate that they used a directed acyclic graph (DAG) to a priori determine potential confounders to adjust for in the analysis. This DAG should be presented as a figure either in the manuscript or as a supplemental figure.

5) Smoking is an important source of PAHs and other chemical carcinogens and a well-established lung carcinogen. It this analysis, smoking status was used to adjust for potential confounding. However, this is a crude assessment of smoking behavior and the potential for residual confounding seems a likely limitation. In addition, the results stratified by smoking status with residual confounding. For instance, in the nonsmoker/no ETS stratum, neither cancer-specific nor CVD-specific hazard ratios are associated with PAH levels and the association for all-cause mortality is substantially attenuated. This nonsmoker stratum is likely to be have little to no confounding by smoking. Its only in the active smokers that association are observed and this may be due to residual confounding. The NHANES data are limited in that more detailed and precise measure of smoking are not available, so adequate control of smoking is a potential issue. If this is the case, them many, if not all, the hazard ratios reported that adjust for smoking status may be biased towards strengthening the association.

6) The NHANES survey were designed to recruit a representative sample of the US population, resulting in a wide age distribution of potentially eligible individuals. As such, there may be a delayed entry problem and the regression analyses should use left truncation to avoid bias that might be introduced by delayed entry.

7) There were 934 deaths. Of which, 108 (11.5%) were CVD-specific and 159 (17%) were cancer-specific deaths. According to the CDC, however, the vast majority of deaths in the US are due to CVD and cancer. It does not appear as if this sample of individuals from NHANES is representative of the US population.

8) Percent Attributable Risk (PAR) due to exposure are presented (table 6) for all-cause, cancer-specific, and CVD-specific mortality. PARs assume that there is a causal established and that confounding is adequately controlled to have meaningful interpretations. Given that residual confounding by smoking has not been adequately addressed in this report, that assumption seems to be violated. In addition, all-cause, cancer-specific, and CVD-specific mortality are all measures of aggregate disease processes, and many specific diseases within each aggregate may not be caused by PAHs. This information should be dropped from the manuscript.

Reviewer #2: The authors used urinary metabolite data available from the NHANES and linked the data with a mortality database (NDI) to prospectively evaluate the risk for mortality based on PAH exposures. This was a grand endeavor and one of the few studies I know that sought to link such large databases. The research is novel, with the one exception noted below, and the statistical analyses appear robust, although I will acknowledge that some of the analyses are outside my knowledge area, and I am not familiar with their strengths and limitations. Therefore, I do recommend that a biostatistician examine the statistical models.

The discussion and conclusions are supported by the data and the authors appropriately acknowledge the limitations of the study. Overall, I recommend publication of the manuscript after considering the following revisions:

1. It seems like this research is very similar to a paper published in 2020 by Chen et al. The authors acknowledge this but point out that the Chen et al. study was limited by not accounting for urine dilution in their statistical analyses. I agree, urine dilution is an important factor to consider in the analyses. However, it’s still worth noting in this manuscript how the outcomes of the two studies compared. Did they observe similar associations or not? In the current study, the authors acknowledge that urine adjustments for creatinine, or using creatinine as an adjustment in the model did not impact the results. But what happens if you leave it out of the model completely, like Chen et al did. Are the results similar? This really should be addressed.

2. It would be helpful if the authors could edit some of their terminology and lexicon to suit a more general audience. Some of the phrases and terms I believe are very specific to the field of epidemiology/statistical science. For example, the authors use the term “heterogeneity” in many places throughout the manuscript. I assume they refer to differences in significance or model outputs based on either stratified analyses, or univariate analyses, but it’s not always clear to me. For example, on page 13, the first sentence of the last paragraph says “….substantial heterogeneity was observed by smoking and ae for all-cause mortality, by gender…”. Can you use more general terms than heterogeneity to help readers understand your meaning a bit better?

3. I recommend that the authors consider adding a column all the way to the left in Tables 1 and 2 that includes the p-value describing differences between the categories (alive vs deceased; and sum PAHs by category, respectively). I understand that p-values should not be emphasized as much as the coefficients and beta estimates, but all the same, it helps readers understand which variables are significantly different within a univariate analyses.

4. An effect modifier that appears to be missing from the analysis is stress. Stress can obviously impact health outcomes, including mortality. Is there any data available within NHANES that can be paired to the urinary metabolite levels and dataset to consider the impact of stress within these models? For example, did NHANES measure cortisol? Or is there survey data that reflects measures of social stress?

5. The references need to be edited. They are not in alphabetic order and it was hard to find the references that were cited in the manuscript. In addition, some references are included twice. For example, Burstyn et al. 2003 is found on page 23 and again on page 26. Please edit the references to remove duplicates and alphabetize.

6. A major limitation of this research is the use of spot urine samples. The authors acknowledge this limitation in the discussion, but it would be helpful to clarify the variability that could be expected from relying on spot urine samples. For example, please comment on the magnitude of intra class correlation coefficients that have been reported in urine for PAH metabolites over various time frames (over a day, week, month, etc). Furthermore, the PAH metabolites measured can be linked to multiple potential parent PAH compounds, making it difficult to know the exact PAH that contributed to the metabolite.

7. I’m curious to know how easy it is to match NHANES data with a large database like the NDI. And I’m sure others may have similar thoughts. This cannot have been easy. This would also require accessing personal identifiers from each person that participated in NHANES to track them through the NDI. Was special permission requested and granted to access personal information? Who did the linkage exactly? Can you clarify in the methods section?

8. The methods section indicates that all OH-PAHs were log-transformed prior to statistical analyses. Did the authors confirm that the values were normally distributed after log transformation? If they are not normal after log transformation, please include this information in the paper.

9. Is there any suggestion in the NHANES data that PAH exposures have decreased between 2001-2015? Are the geometric means increasing or decreasing with year of sample collection? Can you comment on this in the manuscript?

10. I believe there is older literature supporting a link between PAH exposure and scrotum cancer in chimney cleaners. The authors may wish to add this to their discussion as they only cite one paper on PAH exposures and lung cancer/smoking.

11. The authors reference the fact that PAHs are lipophilic on page 17, and suggest that tissue specific adiposity and BMI may somehow mediate these associations. This is a concern for lipophilic compounds that have long half-lives in the body (like PCBs, DDE, etc); however, PAHs are rapidly metabolized, and the metabolites are not lipophilic. I think this reference and discussion point should be removed, unless the authors can cite some papers showing an increase in PAH biomarkers following weight loss.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Jun 4;16(6):e0252719. doi: 10.1371/journal.pone.0252719.r002

Author response to Decision Letter 0


23 Apr 2021

Thank you for your very helpful comments, it has greatly improved our manuscript. Please see our response to comments document, which details in red our responses and provides tables for easy viewing. Thank you.

Response to Comments

Reviewer 1

Overall, the manuscript is very well written and addresses an important, ubiquitous exposure that at high occupational levels have been demonstrated to cause cancer of the lung, bladder and skin. It is less clear whether lower non-occupational levels are associated with increased risk. Specific comments are detailed below:

1) Neither the introduction nor the discussion section describes the substantial literature published on coke oven workers and mortality. Given that coke oven workers have some of the highest reported exposure to PAHs, the results of these studies seem particularly relevant to the current manuscript, especially the mortality studies. These studies clearly showed association with working on the tops of coke ovens with lung cancer, but increases in cardiovascular were not as clearly demonstrated. These data need to considered and discussed in the manuscript.

RESPONSE: We have expanded our introduction to include the literature on associations between high levels of PAH exposure through occupations (coke oven work, aluminum production, asphalt work etc.) and both respiratory (malignant, non-malignant) and cardiovascular mortality.

“Occupations with high levels of PAH exposure, including coke ovens, aluminum production, asphalt, and chimney sweeping are associated with excess mortality from lung and other cancers, cardiovascular diseases, and non-malignant respiratory diseases (Redmond 1983, Bertrand et al. 1987, Burstyn et al. 2003, Bursytn et al. 2005, Hogstedt et al. 2013, Miller et al. 2013, Vimercati et al. 2020).”

2) The methods section indicates that 2,577 excluded from the analysis due to missing information or improbable values for follow-up time or accidental deaths. It seems that 20% of the sample is missing, could this missingness have biased the associations, especially because the observed associations were generally small. In addition, the number of individuals 20 years of age and older who donated a urine samples is not reported. The reader has to calculate it from then text. It would be more useful to the reader to just report that there 12,316 individuals 20 years of age or older with urine samples.

RESPONSE: As part of our data analysis, we compared baseline characteristics in the study population prior to application of exclusion criteria (e.g., missingness) to baseline characteristics in the final (analytic) study population. Please see the table below. Compared to the study population prior to application of exclusion criteria, there was virtually no difference in terms of baseline characteristics in the final (analytic) population. Based on your comment, we have decided to include this table as a supplement, describe this in our Methods, and added this additional sentence to the Results section “We did not find any differences in baseline characteristics prior to and after application of exclusion criteria (Supplemental Table S1).”

Supplemental Table S1. (see Response to comments document)

Additionally, per your suggestion, we have amended language in the Methods section to indicate only those 20+ years old had available urine samples.

“There were 12,316 individuals at least 20 years old with spot urine sample OH-PAHs biomarker data across the 2001-2014 survey cycles.”

3) Individuals with a history of cancer were excluded from the analysis of cancer-specific mortality as were those with a history of CVD for the cardiovascular-specific mortality analyses. It is customary to removed prevalent cases at baseline from analyses intended to assess incidence of a disease, but extending this rationale to analyses intended to assess mortality seems as if it could bias the hazard ratios. A sensitivity analysis would probably inform on this.

RESPONSE: We have included an additional supplemental table below which does not exclude individuals with a self-reported history of cancer or CVD at baseline. Comparing to Table 3 in the manuscript, the median concentration of ΣOH-PAHs (nmol/L) does not change. When including individuals with a self-reported history of cancer or CVD at baseline, the adjusted risk estimates for ΣOH-PAHs increase slightly for cancer-specific mortality and decrease slightly for CVD-specific mortality, while remaining non-significant and elevated. Under quartile categorization of ΣOH-PAHs, P-trend for cancer-specific mortality was virtually identical and P-trend for CVD-specific mortality slightly higher when including individuals with a self-reported history of cancer or CVD at baseline.

We have expanded our manuscript to include this additional sensitivity analysis (see Methods, Results, and Supplemental Tables sections). See additions below.

Methods: “Lastly, we conducted analyses including prevalent cases of CVD and cancer (i.e. self-reported history of CVD or cancer at baseline) to assess the impact exclusion of such cases may have had on our risk estimates.”

Results: “When including baseline cases, the adjusted risk estimates slightly increased for cancer-specific mortality, and decreased for CVD-specific mortality, but remained non-significant and elevated (Supplemental Table S5).”

Supplemental Table is displayed in our Response to comments document.

4) The authors indicate that they used a directed acyclic graph (DAG) to a priori determine potential confounders to adjust for in the analysis. This DAG should be presented as a figure either in the manuscript or as a supplemental figure.

RESPONSE: We have included a directed acyclic graph as Supplemental Figure S2.

5) Smoking is an important source of PAHs and other chemical carcinogens and a well-established lung carcinogen. It this analysis, smoking status was used to adjust for potential confounding. However, this is a crude assessment of smoking behavior and the potential for residual confounding seems a likely limitation. In addition, the results stratified by smoking status with residual confounding. For instance, in the nonsmoker/no ETS stratum, neither cancer-specific nor CVD-specific hazard ratios are associated with PAH levels and the association for all-cause mortality is substantially attenuated. This nonsmoker stratum is likely to be have little to no confounding by smoking. Its only in the active smokers that association are observed and this may be due to residual confounding. The NHANES data are limited in that more detailed and precise measure of smoking are not available, so adequate control of smoking is a potential issue. If this is the case, them many, if not all, the hazard ratios reported that adjust for smoking status may be biased towards strengthening the association.

RESPONSE: Thank you for raising this point. We agree that residual confounding by smoking is possible given that data for confounding control were limited by availability of variables in NHANES.

We attempted to account for both active and passive smoking by using both reported smoking and a biomarker for tobacco exposure. Specifically, we used a combination of answers to three questions (Q1: “Smoked at least 100 cigarettes in life”; Q2: “Age started smoking cigarettes regularly”; Q3: “Do you now smoke cigarettes”) as well as serum cotinine levels to determine smoking status. We also highlight the complexities of interpreting findings for the combination of tobacco smoke and PAH exposure in our discussion of our smoking-stratified analysis: “Findings by smoking status in relation to all-cause mortality highlighted that PAH-related mortality risk from non-tobacco smoke sources is substantial and that PAHs from tobacco smoke further elevate mortality risk, possibly through differential PAHs profile and/or synergy among PAHs and other deleterious components of tobacco smoke such as carbon monoxide and formaldehyde.”

We recognize the broader limitation that confounding control could be further improved by better availability of data on smoking intensity (e.g., cigarettes per week) and duration (e.g., pack years). As such, we have amended language in the discussion to reflect this and also added further detail regarding construction of the smoking variable in the methods. Please see our language below.

Methods: “Individuals were categorized as current active smokers or ETS exposed based on self-report (Q1: “Smoked at least 100 cigarettes in life?”; Q2: “Age started smoking cigarettes regularly”; Q3: “Do you now smoke cigarettes?”) or serum cotinine concentrations >10 ng/mL.”

Discussion: “Though we attempted to account for both active and passive smoking in our analysis using a combination of questionnaire (e.g., “Do you now smoke cigarettes?”) and smoking biomarker (serum cotinine) data, the available NHANES did not allow for a more refined measures of smoking intensity and duration, and thus, residual confounding due to smoking is possible and may have biased our results.”

6) The NHANES survey were designed to recruit a representative sample of the US population, resulting in a wide age distribution of potentially eligible individuals. As such, there may be a delayed entry problem and the regression analyses should use left truncation to avoid bias that might be introduced by delayed entry.

RESPONSE: In our analyses, we use age as the time scale, where age at NHANES interview determines study entry (start of follow-up) and age where individuals either died or were censored determines study exit (end of follow-up). Such an approach addresses potential confounding by age on the PAH exposure and mortality relationship while also accounting for left-truncation/delayed entry (by comparison, if time since interview were used as the time scale in analyses, there would be bias due to left-truncation/delayed entry because those entering into study at more recent NHANES survey cycles would have to have survived up until those later study entry times) (Lamarca et al. 1998). We additionally control for potential temporal/period effects through adjustment for survey cycle in multivariable models.

Lamarca R, Alonso J, Gómez G, Muñoz A. Left-truncated data with age as time scale: an alternative for survival analysis in the elderly population. J Gerontol A Biol Sci Med Sci. 1998 Sep;53(5):M337-43. doi: 10.1093/gerona/53a.5.m337. PMID: 9754138.

7) There were 934 deaths. Of which, 108 (11.5%) were CVD-specific and 159 (17%) were cancer-specific deaths. According to the CDC, however, the vast majority of deaths in the US are due to CVD and cancer. It does not appear as if this sample of individuals from NHANES is representative of the US population.

RESPONSE: Thank you for making this point. As we highlight in the manuscript, we intended to capture mortality from incident cancers and CVD for those cause-specific analyses. To this end, in analyses for those endpoints, we excluded individuals with self-reported history of cancer or CVD (for each endpoint, respectively). Prior to this exclusion, we had 222 cancer-specific deaths and 173 CVD-specific deaths, which amounts to ~42% of the mortality events (~ 24% for cancer and ~ 19% for CVD). In 2007 (midpoint of follow-up in this study), cancer and CVD accounted for 49% of mortality events, although mortality from CVD was higher than that for cancer.

Therefore, our analytic sample for all-cause mortality analyses, which did not feature exclusions based on self-reported history of either cancer or CVD (N=934), is largely representative of the US population. However, we recognize that the analytic samples for the CVD and cancer-specific endpoints are not as representative and have amended language pertaining to discussion of results for the cause-specific endpoints accordingly.

8) Percent Attributable Risk (PAR) due to exposure are presented (table 6) for all-cause, cancer-specific, and CVD-specific mortality. PARs assume that there is a causal established and that confounding is adequately controlled to have meaningful interpretations. Given that residual confounding by smoking has not been adequately addressed in this report, that assumption seems to be violated. In addition, all-cause, cancer-specific, and CVD-specific mortality are all measures of aggregate disease processes, and many specific diseases within each aggregate may not be caused by PAHs. This information should be dropped from the manuscript.

RESPONSE: Based on the reviewer’s comments, we agree that the PAR analysis should be dropped. We have deleted all mention in the manuscript.

Reviewer #2: The authors used urinary metabolite data available from the NHANES and linked the data with a mortality database (NDI) to prospectively evaluate the risk for mortality based on PAH exposures. This was a grand endeavor and one of the few studies I know that sought to link such large databases. The research is novel, with the one exception noted below, and the statistical analyses appear robust, although I will acknowledge that some of the analyses are outside my knowledge area, and I am not familiar with their strengths and limitations. Therefore, I do recommend that a biostatistician examine the statistical models.

The discussion and conclusions are supported by the data and the authors appropriately acknowledge the limitations of the study. Overall, I recommend publication of the manuscript after considering the following revisions:

1. It seems like this research is very similar to a paper published in 2020 by Chen et al. The authors acknowledge this but point out that the Chen et al. study was limited by not accounting for urine dilution in their statistical analyses. I agree, urine dilution is an important factor to consider in the analyses. However, it’s still worth noting in this manuscript how the outcomes of the two studies compared. Did they observe similar associations or not? In the current study, the authors acknowledge that urine adjustments for creatinine, or using creatinine as an adjustment in the model did not impact the results. But what happens if you leave it out of the model completely, like Chen et al did. Are the results similar? This really should be addressed.

RESPONSE: We agree that accounting for urine dilution prior to conducting statistical analysis is critical, as the results would be substantially biased by inter-individual variability in urine volume. As Chen et al. did not account for urinary dilution, our results are not comparable in this sense, and a presentation of unadjusted estimates would be biased.

Apart from accounting for urine dilution in multiple ways (creatinine adjustment, creatinine correction), our study populations are also different. We expanded our population to include four additional survey cycles (corresponding to years 2007-2014 compared to Chen et al. 2001-2006), resulting in substantially higher mortality events. We also adopted a different analytic approach, where we examined total PAHs exposure and performed a mixtures analysis compared to single PAH exposure estimates (without correction for the dense correlation structure across single PAHs) presented in Chen et al. Therefore, direct comparability of results across these two analyses is difficult. However, we have added language regarding broad comparisons to Chen et al. while expanding on differences between the studies to highlight that they aren’t directly comparable.

“To date, only one study has examined this relationship using a limited sample of the NHANES population (Chen et al. 2020). This study did not account for urinary dilution of PAH concentrations and the number of mortality endpoints was sparse, resulting in highly imprecise estimates for several PAHs. Furthermore, Chen et al. did not account for the dense correlation structure between PAH metabolites.”

“Our findings for all-cause mortality and cardiovascular-specific mortality support an emerging body of epidemiologic literature on the health impact of PAHs, including Chen et al. (although we note that the studies are not directly comparable due to lack of urinary dilution, differences in study population size, and analytic approach).”

“With regards to gender-specific differences in cancer outcomes in relation to PAH exposure in this study as well as in Chen et al. (2010), Guo et al. (2014) suggested that women are more susceptible than men to chromosomal damage and oxidative stress consequences of PAH (�[NAP, FLUO, PHEN, PYR]) exposure (Guo et al. 2014).”

2. It would be helpful if the authors could edit some of their terminology and lexicon to suit a more general audience. Some of the phrases and terms I believe are very specific to the field of epidemiology/statistical science. For example, the authors use the term “heterogeneity” in many places throughout the manuscript. I assume they refer to differences in significance or model outputs based on either stratified analyses, or univariate analyses, but it’s not always clear to me. For example, on page 13, the first sentence of the last paragraph says “….substantial heterogeneity was observed by smoking and ae for all-cause mortality, by gender…”. Can you use more general terms than heterogeneity to help readers understand your meaning a bit better?

RESPONSE: The term “heterogeneity” refers to differences seen between results in our stratified analysis. We have gone through the manuscript to use more general terms (e.g., “differences between”) to help provide greater clarity in our phrasing.

3. I recommend that the authors consider adding a column all the way to the left in Tables 1 and 2 that includes the p-value describing differences between the categories (alive vs deceased; and sum PAHs by category, respectively). I understand that p-values should not be emphasized as much as the coefficients and beta estimates, but all the same, it helps readers understand which variables are significantly different within a univariate analyses.

RESPONSE: As we mention in the methods, we used a directed acyclic graph (DAG) approach for identification of a minimally-sufficient adjustment set for confounding control. Statistical significance of covariates in relation to the exposure and outcome variables in univariate analyses alone is likely to be biased given that this is an observational study. We do not believe it would be appropriate to make statistical inferences from crude differences, and given that substantive knowledge (i.e., prior literature) rather than statistical significance in univariate analyses was the criterion for identification of variables for confounding control, we err on the side of caution and describe differences in our manuscript results in Table 1 and 2. This decision to not include p-values on Table 1 has also been supported by multiple publications, including the STROBE guidelines (Hayes-Larson et al. 2019, Vandenbroucke et al. 2007).

Hayes-Larson E, Kezios KL, Mooney SJ, Lovasi G. Who is in this study, anyway? Guidelines for a useful Table 1. J Clin Epidemiol. 2019 Oct;114:125-132.

Vandenbroucke JP, von Elm E, Altman DG, Gøtzsche PC, Mulrow CD, Pocock SJ, Poole C, Schlesselman JJ, Egger M; STROBE Initiative. Strengthening the Reporting of Observational Studies in Epidemiology (STROBE): explanation and elaboration. Epidemiology. 2007 Nov;18(6):805-35.

4. An effect modifier that appears to be missing from the analysis is stress. Stress can obviously impact health outcomes, including mortality. Is there any data available within NHANES that can be paired to the urinary metabolite levels and dataset to consider the impact of stress within these models? For example, did NHANES measure cortisol? Or is there survey data that reflects measures of social stress?

RESPONSE: Thank you for this comment. Cortisol is not measured by NHANES, though measures of oxidative stress (e.g., IL-6) and allostatic load are available. Two variables we evaluate as effect modifiers (family poverty status, race/ethnicity) are well-established and strong predictors of allostatic load so we suspect findings stratified by allostatic load would mirror existing findings by family poverty status, for example. A more refined investigation of social stress with robust variables such as cortisol would be interesting for future investigations but beyond the scope of our research question.

5. The references need to be edited. They are not in alphabetic order and it was hard to find the references that were cited in the manuscript. In addition, some references are included twice. For example, Burstyn et al. 2003 is found on page 23 and again on page 26. Please edit the references to remove duplicates and alphabetize.

RESPONSE: Thank you. We have fixed references to be in alphabetical order, and have removed duplicates.

6. A major limitation of this research is the use of spot urine samples. The authors acknowledge this limitation in the discussion, but it would be helpful to clarify the variability that could be expected from relying on spot urine samples. For example, please comment on the magnitude of intra class correlation coefficients that have been reported in urine for PAH metabolites over various time frames (over a day, week, month, etc). Furthermore, the PAH metabolites measured can be linked to multiple potential parent PAH compounds, making it difficult to know the exact PAH that contributed to the metabolite.

RESPONSE: Thank you, we have expanded on this in the discussion and included estimates of intraclass correlation coefficients.

“Biomonitoring studies demonstrate moderate temporal reliability of spot urinary measurements for PAHs; Dobraca et al. (2018) reported intraclass correlation coefficients (ICC) ranging from 0.07 to 0.53 across PAH metabolites considered in this study and most PAH metabolites ranged between 0.35 and 0.50 across multiple years. Importantly, concentrations from a single spot sample reliably ranked exposure into quartiles consistent with study’s multi-year average (Dobraca et al. 2018). Li et al. (2010) reported an ICC of 0.55 for 1-PYR. Reliability was found to be higher in the case of first morning voids or 24-hour urine samples, neither of which are available for this particular environmental NHANES sample (Li et al. 2010, Dobraca et al. 2018).”

7. I’m curious to know how easy it is to match NHANES data with a large database like the NDI. And I’m sure others may have similar thoughts. This cannot have been easy. This would also require accessing personal identifiers from each person that participated in NHANES to track them through the NDI. Was special permission requested and granted to access personal information? Who did the linkage exactly? Can you clarify in the methods section?

RESPONSE: This data linkage has existed for some time (https://www.cdc.gov/nchs/data-linkage/mortality.htm), and linkage is done through CDC/NCHS. More information is available here (https://www.cdc.gov/nchs/data/datalinkage/LMF2015_Methodology_Analytic_Considerations.pdf).

We used all publicly-available files (as noted in the “Mortality outcomes” section of the Methods), though researchers can submit proposals to use restricted-use data, which includes specific tumor sites, etc. We have included additional clarification in the methods section that NCHS provided the publicly-available files.

“We utilized the publicly available Linked Mortality Files provided by NCHS, updated through December 31, 2015, which served as end of follow-up (i.e., point of administrative censoring) (https://www.cdc.gov/nchs/data-linkage/mortality-public.htm). All NDI data is based on death certificates coded to International Classification of Diseases version 10 codes (ICD-10).”

8. The methods section indicates that all OH-PAHs were log-transformed prior to statistical analyses. Did the authors confirm that the values were normally distributed after log transformation? If they are not normal after log transformation, please include this information in the paper.

RESPONSE: All OH-PAH metabolites were log-normally distributed. Prior to our statistical analysis, we performed analytical checks to confirm that log transformation normalized our data.

“All OH-PAHs were log-transformed following visual and quantitative ascertainment of right skewness. Upon log transformation, all OH-PAHs were normally distributed.”

9. Is there any suggestion in the NHANES data that PAH exposures have decreased between 2001-2015? Are the geometric means increasing or decreasing with year of sample collection? Can you comment on this in the manuscript?

RESPONSE: We have included language relevant to these trends and disparities in the discussion.

“Across survey cycles (2001-2014), PAHs exposure was highest in 2005-2006, although no clear trends were apparent”.

“In the US, non-Hispanic black and individuals below the poverty level have higher cardiovascular mortality compared to other race/ethnicities and individuals above the poverty level (Van Dyke et al. 2018). We also saw overall higher OH-PAH concentrations among these minority population groups, indicating higher overall PAH exposure. Trend analysis from non-smoking NHANES participants from 2001-2014 also revealed increasing concentrations of PAHs among minority groups, including Non-Hispanic blacks and Mexican Americans (Hudson-Hanley et al. 2021).”

10. I believe there is older literature supporting a link between PAH exposure and scrotum cancer in chimney cleaners. The authors may wish to add this to their discussion as they only cite one paper on PAH exposures and lung cancer/smoking.

RESPONSE: Based on comments by both reviewers, we have expanded our Introduction to include the literature on associations between high levels of PAH exposure through occupations (coke oven work, aluminum production, asphalt work etc.) and respiratory (malignant, non-malignant) and cardiovascular mortality.

“Occupations with high levels of PAH exposure, including coke ovens, aluminum production, asphalt, and chimney sweeping are associated with excess mortality from lung and other cancers, cardiovascular diseases, and non-malignant respiratory diseases (Redmond 1983, Bertrand et al. 1987, Burstyn et al. 2003, Bursytn et al. 2005, Hogstedt et al. 2013, Miller et al. 2013, Vimercati et al. 2020).”

11. The authors reference the fact that PAHs are lipophilic on page 17, and suggest that tissue specific adiposity and BMI may somehow mediate these associations. This is a concern for lipophilic compounds that have long half-lives in the body (like PCBs, DDE, etc); however, PAHs are rapidly metabolized, and the metabolites are not lipophilic. I think this reference and discussion point should be removed, unless the authors can cite some papers showing an increase in PAH biomarkers following weight loss.

RESPONSE: We have removed mention of this in the manuscript.

Decision Letter 1

Giovanni Signore

21 May 2021

Urinary polycyclic aromatic hydrocarbon metabolites and mortality in the United States: a prospective analysis

PONE-D-20-37390R1

Dear Dr. Mehta,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Giovanni Signore

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: (No Response)

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: (No Response)

Reviewer #2: I Don't Know

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: (No Response)

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: (No Response)

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: (No Response)

Reviewer #2: I am not a statistician and cannot comment on the rigor of the statistics. All my comments have been addressed, with the exception of comment #3. I still think the p-value would be of use in these tables; however, as I am not a statistician, I will go with the judgement of the editor and Reviewer #1.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Acceptance letter

Giovanni Signore

27 May 2021

PONE-D-20-37390R1

Urinary polycyclic aromatic hydrocarbon metabolites and mortality in the United States: a prospective analysis

Dear Dr. Mehta:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Giovanni Signore

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Schematic diagram of exclusion criteria for study population (NHANES 2001–2015).

    (DOCX)

    S2 Fig. Directed acyclic graph to determine the relationships between hydroxylated PAH metabolites (OH-PAHs) and mortality.

    (DOCX)

    S3 Fig. Weights for urinary OH-PAHs from quantile g-computation, by mortality endpoint.

    a) Negative and positive weights for OH-PAHs associated with all-cause mortality. b) Negative and positive weights for OH-PAHs associated with cancer-specific mortality. c) Negative and positive weights for OH-PAHs associated with cardiovascular-specific mortality.

    (DOCX)

    S1 Table. Baseline descriptive characteristics in NHANES 2001–2014 by mortality status, prior to exclusion criteria and in the study population.

    (DOCX)

    S2 Table. Pearson correlation coefficients of eight urinary OH-PAHs (ng/L) from participants participating in NHANES 2001–2014 (N = 9739).

    (DOCX)

    S3 Table. Continuous final models of Σ OH-PAHs and all-cause and cause-specific mortality, excluding participants who died within one and two years of baseline.

    (DOCX)

    S4 Table. Associations between creatinine-corrected Σ OH-PAHs (nmol/g Cre) and mortality, including urinary creatinine (g/L) as an additional covariate.

    (DOCX)

    S5 Table. Continuous final models of ΣOH-PAHs and all-cause and cause-specific mortality, not excluding self-reported history of cancer or CVD at baseline.

    (DOCX)

    S6 Table. Multipollutant models examining the association between individual OH-PAHs and all-cause, cancer-specific, and CVD-specific mortality.

    (DOCX)

    Data Availability Statement

    All NHANES and NDI files are available for download from the National Center for Health Statistics database (https://www.cdc.gov/nchs/nhanes/index.htm).


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES