Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Feb 28.
Published in final edited form as: Ann Appl Stat. 2020 Jun 29;14(2):850–871. doi: 10.1214/20-aoas1330

A causal exposure response function with local adjustment for confounding: Estimating health effects of exposure to low levels of ambient fine particulate matter

Georgia Papadogeorgou 1, Francesca Dominici 2
PMCID: PMC7914396  NIHMSID: NIHMS1670282  PMID: 33649709

Abstract

In the last two decades, ambient levels of air pollution have declined substantially. At the same time, the Clean Air Act mandates that the National Ambient Air Quality Standards (NAAQS) must be routinely assessed to protect populations based on the latest science. Therefore, researchers should continue to address the following question: is exposure to levels of air pollution below the NAAQS harmful to human health? Furthermore, the contentious nature surrounding environmental regulations urges us to cast this question within a causal inference framework. Several parametric and semi-parametric regression approaches have been used to estimate the exposure-response (ER) curve between long-term exposure to ambient air pollution concentrations and health outcomes. However, most of the existing approaches are not formulated within a formal framework for causal inference, adjust for the same set of potential confounders across all levels of exposure, and do not account for model uncertainty regarding covariate selection and the shape of the ER.

In this paper, we introduce a Bayesian framework for the estimation of a causal ER curve called LERCA (Local Exposure Response Confounding Adjustment), which a) allows for different confounders and different strength of confounding at the different exposure levels; and b) propagates model uncertainty regarding confounders’ selection and the shape of the ER. Importantly, LERCA provides a principled way of assessing the observed covariates’ confounding importance at different exposure levels, providing researchers with important information regarding the set of variables to measure and adjust for in regression models. Using simulation studies, we show that state of the art approaches perform poorly in estimating the ER curve in the presence of local confounding.

LERCA is used to evaluate the relationship between long-term exposure to ambient PM2.5, a key regulated pollutant, and cardiovascular hospitalizations for 5,362 zip codes in the continental U.S. and located near a pollution monitoring site, while adjusting for a potentially varying set of confounders across the exposure range. Our data set includes rich health, weather, demographic, and pollution information for the years of 2011–2013. The estimated exposure-response curve is increasing indicating that higher ambient concentrations lead to higher cardiovascular hospitalization rates, and ambient PM2.5 was estimated to lead to an increase in cardiovascular hospitalization rates when focusing at the low exposure range. Our results indicate that there is no threshold for the effect of PM2.5 on cardiovascular hospitalizations.

Keywords: air pollution, cardiovascular hospitalizations, causal inference, exposure response function, local confounding, low exposure levels, particulate matter

1. Introduction

The Clean Air Act, one of the most comprehensive and expensive air quality regulations in the world, mandates that the National Ambient Air Quality Standards (NAAQS) are routinely reviewed. If evidence of the adverse health effects of exposure to ambient air pollution at levels below the NAAQS is established based on the peer reviewed literature, then the NAAQS must be lowered, even at the cost of hundreds of million of dollars. For that reason, researchers routinely address the following question: is exposure to levels of air pollution, even below the NAAQS, harmful to human health? With the next review of the NAAQS for fine particulate matter (PM2.5) scheduled to be completed by the end of the year 2020, the determination of whether exposure levels of PM2.5 below the NAAQS is harmful to human health is subject to unprecedented level of scrutiny. More recently, because of the highly contentious nature surrounding air pollution regulations and the lowering of the NAAQS particularly, there is an increasing pressure to cast this question within a causal inference framework [Zigler and Dominici, 2014]. The method in this paper is motivated by the need to address this critically important question by flexibly estimating an exposure response curve while reliably eliminating confounding bias especially at low levels of exposure.

The literature on the harmful effects of air pollution is very extensive (see, for example, Dominici et al. [2002], Eftim et al. [2008], Zeger et al. [2008], Zanobetti and Schwartz [2007], Crouse et al. [2015, 2016], Di et al. [2017a,b], Berger et al. [2017], Makar et al. [2018], Lim et al. [2018]). However, significant methodological gaps remain in the context of estimating health effects at very low levels. Environmental research studying the health effects of exposure to low levels of ambient air pollution has either examined the relationship in the subset of the sample residing in areas with ambient concentrations below a pre-specified threshold [Lee et al., 2016, Shi et al., 2016, Di et al., 2017a,b, Schwartz et al., 2017, Makar et al., 2018, Wang et al., 2018, Schwartz et al., 2018], or has employed regression approaches for ER estimation across the observed range of pollution concentrations [Daniels et al., 2000, Dominici et al., 2002, Schwartz et al., 2002, Bell et al., 2006, Hart et al., 2015, Thurston et al., 2016, Jerrett et al., 2017, Weichenthal et al., 2017, Lim et al., 2018]. In either case, confounding adjustment in air pollution studies is most-often performed using either a pre-specified set of covariates, or a set of covariates which is decided upon using an ad-hoc variable selection procedure. Such procedure is often based on the statistical significance of covariates’ coefficients in an outcome regression, or the change in the pollution concentration’s coefficient in an outcome model including and excluding sets of covariates [Devries et al., 2016, Pinault et al., 2016, Garcia et al., 2016, Weichenthal et al., 2017].

Generally, regression and semi-parametric modeling approaches for ER estimation such as generalized linear models or generalized additive models [Hastie and Tibshirani, 1986, Daniels et al., 2004, Shaddick et al., 2008, Shi et al., 2016, Dominici et al., 2002] make the following assumptions: 1) the set of potential confounders that are included into the regression model among a potentially large set of available covariates is specified a priori; 2) uncertainty arising from the variable selection techniques is not accounted for; 3) the same potential confounders with constant confounding strength are considered when estimating the health effects across all exposure levels (we refer to this as global confounding adjustment); and 4) the shape of the ER function is modelled as a spline, a polynomial, or linear with a threshold.

Even though ER estimation in air pollution research has mostly remained outside the potential outcome framework, there has been substantial work in ER estimation within the causal inference literature. Hirano and Imbens [2004] introduced the generalized propensity score (GPS) in order to adjust for confounding when estimating the effects of a continuous exposure. Flores et al. [2012] estimated a causal ER function employing a weighted locally linear regression with weights defined based on the GPS. Kennedy et al. [2017] introduced a doubly robust approach for estimating the causal ER function using flexible machine learning tools.

These approaches are very promising and manifest the growing interest in principled causal inference methods for continuous exposures. However, none of the existing approaches explicitly accommodates that in ER estimation, and in contrast to binary treatments, confounding might differ across levels of the exposure. In fact, even though some of the approaches could be altered to allow for different set of confounders or different confounding strength across exposure levels, current implementations of causal methodology for ER estimation has assumed global confounding of pre-selected covariates. Furthermore, it is unclear how these approaches perform in the case of confounding that varies across exposure levels. To address this, confounding adjustment and confounder selection need to be meaningfully extended in the case of a continuous exposure to provide useful scientific guidance with regard to covariates’ confounding importance at different exposure levels.

In our exploratory analyses (§2), we report that the relationship between ambient PM2.5 concentrations and the rate of hospitalization for cardiovascular diseases might be confounded by a different set of covariates at the low versus at the high exposure levels, or by covariates with different confounding strength. We refer to this phenomenon as local confounding. We argue that –especially in the context of estimating causal effects at low levels– local confounding adjustment is deemed necessary.

To target local confounding, if exposure levels with different confounding were known, one could adopt a separate model at each level and adjust for all measured variables using one of the approaches described above. However, even if the number of covariates and local sample size rendered such approach computationally feasible, including unnecessary confounders in the regression model could lead to inefficient estimation of causal effects, especially at very low levels of exposure where data are sparse. Data driven methods to select a minimal necessary set of covariates to be included into an outcome model for estimation of causal effects of binary treatments have been proposed [Luna et al., 2011, Wang et al., 2012, Wilson and Reich, 2014], but to our knowledge, they have not been extended in the context of ER estimation with local confounding adjustment.

The goal of this paper is to overcome the challenges described above by introducing a Bayesian framework for the estimation of a causal ER curve called LERCA (Local Exposure Response Confounding Adjustment). We cast our approach within a causal inference framework by introducing the concept of experiment configuration s¯=(s0,s1,,sK+1), where [sk–1, sk) denotes a specific range of exposure values. We use the term experiment to mimic the hypothetical assignment of a unit to exposure value within [sk–1, sk). Within each experiment, i.e. locally in the exposure range [sk–1, sk), we assume that: 1) the ER is linear; 2) the potential confounders of the ER relationship are unknown but observed; and 3) the strength of the local confounding is also unknown. Across experiments, we require that the ER is continuous at the points s¯. Importantly, the internal points of the experiment configuration, s1, s2, …, sK, are themselves unknown and have to be estimated from the data.

Our work contributes to various components in the literature. First, we contribute to the estimation of causal effects of continuous treatments by extending our understanding of confounding in these settings. Second, our work has connections to the literature on Bayesian free-knot splines [Denison et al., 1998, Dimatteo et al., 2001]. The location of the knots (internal points of the experiment configuration) is informed by both the ER fit, and the necessity for local confounding adjustment. Lastly, our work contributes to the highly controversial and politically charged issue of estimating the causal effects of population exposure to low levels of ambient air pollution.

Even though our motivation and focus is the effects of air pollution, the statistical challenges related to ER estimation at low exposure levels are common across many fields, such as toxicology [Scholze et al., 2001], and clinical trials [Babb et al., 1998]. In fact, the methodology presented in this paper can be used to evaluate regulatory settings of potential harmful substances, and can be routinely used to assess health effects of low level exposures. Such applications include the effects of lead [Chiodo et al., 2004, Jusko et al., 2008], environmental contaminants [Van Der Oost et al., 2003], radiation [National Research Council, 2006, Fazel et al., 2009], and pesticides [Mackenzie Ross et al., 2010, Androutsopoulos et al., 2012].

In §2 we introduce our motivating data set, discuss the difference between personal exposures and ambient concentrations in air pollution studies, and illustrate that local confounding is likely to be present in our study. In §3, we introduce the notation and assumptions on which LERCA in §4 is based. In §5 we show through simulations that both off-the-shelf and state of the art approaches for ER estimation perform poorly when local confounding is present, and we compare LERCA to alternatives in the presence of global confounding. Finally, in §6, we use LERCA to estimate the causal ER function relating ambient PM2.5 concentrations with log cardiovascular hospitalization rates in the Medicare population of 5,362 zip codes. Limitations and potential extensions are discussed in §7.

2. Data description, ambient concentrations, local confounding

In this section we illustrate that, in our study, there might exist a different set of confounders at the low and the high levels of ambient pollution concentrations. LERCA is motivated to overcome this particular challenge.

2.1. Data description

We start by briefly describing our data set which is a collection of linked data from many sources. The unit of the observation is the zip code i, with sample size N = 5, 362. For each zip code, we acquired information on several potential confounders, denoted by Cij for j = 1, 2, …, p and p = 27, capturing socio-economic, demographic, climate, and risk factor information. The full set of zip code level covariates are described in Table A.1. We calculate the outcome Yi defined as log hospitalization rate for cardiovascular diseases (codes ICD-9 390 to 459) among Medicare beneficiaries residing in zip code i in the year 2013. Since Medicare beneficiaries are, in their plurality, individuals over the age of 65, our focus is on the health effect of PM2.5 on the elderly. For each zip code i, we assign exposure Xi defined as the average of daily levels of ambient PM2.5 concentrations for the years 2011 and 2012 recorded by EPA (U.S. Environmental Protection Agency) monitors within a 6 mile radius of zip code i’s centroid. The values of Xi range from 2.7 to 18.3μg/m3 (see Figure 1). We define Xi using the two years prior to the year whose outcome we analyze in order to respect the temporal ordering of treatment and outcome when drawing causal conclusions. Longer time lags could be considered, but, in such settings, our analysis would potentially be more susceptible to population mobility.

Figure 1:

Figure 1:

Average levels of PM2.5 for the years 2011–2012 for each zip code i included into the analysis.

Since our definition of Xi requires the presence of an EPA monitor within 6 miles of a zip code’s centroid, the zip codes included in our study are a subset of the full set of zip codes in the continental U.S. Supplement A includes a detailed description of data linkage (EPA monitors, Medicare, others), and descriptive statistics across zip codes in the whole continental U.S. and only those included in our study. Excluded zip codes resemble, in general, those included in our analysis, but are perhaps in more rural areas, with lower population density, and higher proportions of white population and unemployment.

2.2. Ambient concentrations versus personal exposures

In order to agree with existing causal inference literature for continuous treatments, we often refer to measurements Xi as zip code i’s exposure, and a range of ambient pollution concentration as an exposure level. However, a zip code’s measurement of ambient concentration Xi might be substantially different from the personal exposure of an individual residing in that zip code. Figure 2 shows a hypothetical DAG relating ambient and indoor pollution concentrations with individuals’ personal exposures and health outcomes. Ambient pollution concentrations act on an individual’s outcome only through the individuals’ personal exposures.

Figure 2:

Figure 2:

DAG relating exposure to ambient PM2.5 concentrations to personal exposures and health outcomes. Covariate sets C˜1, C˜2, C˜3 represent potentially different sets of confounders between the health outcome and ambient, personal or indoor PM2.5 concentrations or exposures. Arrows represent potential (but not necessarily present) relationships.

In this paper, we focus on estimating the causal effects of ambient PM2.5 on cardiovascular health outcomes. That is, potentially, the most interesting question from a policy perspective, since policy regulations (and the NAAQS) are set based on the knowledge for the effect of ambient concentrations. The implications of using ambient concentrations instead of personal exposures to study the effect of pollution concentrations are discussed in §7.

2.3. Potential presence of local confounding in our study

In the case of binary treatments, whether a covariate acts as a confounder is often evaluated by checking whether there exists significant imbalance in the covariate distribution of the treated and control groups. For continuous exposures, there is no direct counterpart to covariate balance since units are not separated into two groups. Instead, exploratory analyses for the presence of confounding are often based on covariates’ strength in predicting the exposure through regression models [Imai and Van Dyk, 2004]. Then, a covariate’s p-value in a model for the exposure is used to investigate whether it might be a confounding variable.

We use a related approach to illustrate the potential presence of local confounding in our data. We considered two subsets of zip codes: 1) zip codes with low ambient concentrations (<8μg/m3; 817 observations); and 2) zip codes with high ambient concentrations (>11.5μg/m3; 672 observations). Even though this definition of the low and high exposure levels is arbitrary for the purpose of our illustration, this choice ensures a similar number of observations and similar range of exposure values within the two levels. Within each exposure level separately, we considered a linear regression of ambient pollution concentration on each covariate, and evaluated the covariate’s predictive strength through its p-value.

Figure 3 shows the p-value of the covariates in the p regression models, and for the two exposure levels. We see that some variables such as population density (Population/SQM) and the percentage of the population with less than a high school education (% Below HS) are predictive of ambient concentrations in both low and high exposure levels. However, other variables, such as the median household value (in logarithm – House Value), are only predictive of ambient pollution concentrations at the high exposure levels. The opposite is true for the percentage of population that is white (% White). Such initial investigation indicates that different variables might act as predictors of the ambient exposure at different exposure levels.

Figure 3:

Figure 3:

Covariate p-values in predicting the exposure, separately at the low (blue: < 8μg/m3) and high (red: > 11.5μg/m3) exposure levels.

In Supplement B, we show the estimated covariates’ coefficients whose p-values are shown in Figure 3. Since the two exposure levels are relatively balanced in terms of number of observations and range of exposure values, the magnitude of the p-values in Figure 3 is directly comparable to the magnitude of the estimated coefficients. This indicates that initial investigation of local confounding could be equivalently performed in terms of estimated coefficients or p-values.

In Supplement B, we also consider a similar exploratory analysis to investigate which covariates are predictors of the health outcome at the low and high exposure levels separately. Combining the results presented there to the ones in Figure 3, there is evidence that the variables that confound the ER relationship might differ across levels of the exposure leading to local confounding. For example, the zip code median household value (House Value) is predictive of both ambient air pollution and cardiovascular hospitalization rates at the high exposure levels, but is not predictive of ambient air pollution at low exposure levels. Additionally, there is indication that the percentage of the population with less than a high school degree (% Below HS) is a confounder at the low exposure levels, whereas the same variable is not predictive of the health outcome at the high exposure levels.

3. Causal ER, the experiment configuration, and the local ignorability assumption

We follow the potential outcome framework [Neyman, 1923, Rubin, 1974, Hirano and Imbens, 2004], and under the stable unit value of treatment assumptions (SUTVA; no interference, no hidden versions of the treatment [Rubin, 1980]), we use Yi(x) to denote the potential outcome for observation i at exposure xX, where X is the interval including all possible exposure values. Then, {Yi(x),xX} is unit i’s ER curve, and {Y¯(x)=E[Yi(x)],xX} is the population average ER curve.

Assuming Y¯(x) is differentiable as a function of x, we define the instantaneous causal effect

Δ(x)=limh0Y¯(x+h)Y¯(x)h.

A Δ(x) ≠ 0 implies that variation in the exposure in a neighborhood of x has a causal effect on the expected outcome. We also define the population average causal effect of an exposure shift from x to x + δ, as CEδ(x)=Y¯(x+δ)Y¯(x)=xx+δΔ(t)dt. The observed outcome Yi is equal to the potential outcome at the observed exposure Yi(Xi).

Under the weak ignorability assumption which states that the treatment is as if randomized conditional on observed covariates, XY(x)C, and every subject in the population can experience any xX, Y¯(x) is identifiable using the observed data [Hirano and Imbens, 2004]. Then, a minimal confounding adjustment set C* ⊆ C is a set of covariates which satisfies XY(x)C, but XY(x)C** for any C** strict subset of C* [Luna et al., 2011, Wang et al., 2012, Vansteelandt et al., 2012].

In this paper, we are interested in addressing the possibility that the minimal sufficient adjustment set C* varies across exposure levels. We formalize this by introducing the experiment configuration. Let K denote a fixed positive integer, and min=infX and max=supX denote the known and fixed minimum and maximum values of the exposure range X. Then, s¯=(s0=min,s1,s2,,sK,sK+1=max) is the experiment configuration which defines a partition of the exposure range in K +1 experiments gk = [sk–1, sk). We use s to denote the internal points s1, s2, …, sK. In Figure 4, a hypothetical exposure response function is plotted where s¯ defines a total of 4 experiments (K = 3). Then, Ck* is a minimal sufficient adjustment set in experiment k if it satisfies

XY(x)Ck*,forallxgk, (1)

and (1) does not hold for any strict subset of Ck*. The sets Ck* can be disjoint, identical, or overlapping if the same variable is necessary for confounding adjustment in more than one experiment.

Figure 4:

Figure 4:

ER curve with exposure range partitioned by s¯ in 4 experiments.

4. ER estimation in the presence of local confounding

Motivated by the evidence of local confounding between ambient PM2.5 concentrations and cardiovascular hospitalizations discussed in §2, we introduce LERCA: Local Exposure Response Confounding Adjustment. In order to build intuition, we do so for a fixed experiment configuration in §4.1. LERCA with unknown s is presented in §4.2. The choice of K is discussed in §4.4.

4.1. Known experiment configuration

Assume for now a known experiment configuration s¯. Then, locally, that is for xgk = [sk–1, sk), we assume the following pair of exposure and outcome models:

p(xC=c,xgk)=ϕ(x;δk0X+j=1pαkjXδkjXcj,σk,X2)p(yX=x,C=c,xgk)=ϕ(y;δk0Y+βk(xsk1)+j=1pαkjYδkjYcj,σk,Y2) (2)

where ϕ(·; μ, σ2) denotes the normal density with mean μ and variance σ2, and αkjX{0,1} indicates that covariate Cj is included into the exposure model of the kth experiment (αkjX=1), or not (αkjX=0). The parameter αkjY has the same interpretation, but for the outcome model. The parameter βk denotes the instantaneous change in the expected outcome associated with a local variation in exposure for xgk, adjusted for the Cjs that have αkjY=1. Even though all parameters depend on which covariates are included in the corresponding model, we do not explicitly state this dependence for notational simplicity. Model (2) allows for a different set of variables and variables’ coefficients at the different experiments.

If a minimal confounding adjustment set for experiment k is included in the outcome model and the mean functional form is correctly specified, βk is an unbiased estimator of the instantaneous effect Δ(x), for xgk. Similarly, an unbiased estimator of Y¯(x) is EC{E[Yi|X = x, C]}, which can be estimated by taking the average over the units in our sample of the conditional expectation E[Yi|X = x, Ci].

In §4.1.3 we discuss how the prior distribution on the inclusion indicators is chosen to target confounding adjustment. In §4.1.4, we discuss prior specification for outcome model coefficients that ensures borrowing of information across experiments and ER continuity across the exposure range. But first we address two questions that naturally arise from the specification of model (2). First, we clarify the connection between LERCA and a model that specifies the ER relationship using linear splines in §4.1.1. Then, in §4.1.2, we discuss how LERCA compares to a model that is fit separately within each experiment gk.

4.1.1. Connection to linear splines.

In the outcome specification of model (2), the term βk(xsk−1) in the mean functional could be substituted by βkx and − βksk−1 could be absorbed in the intercept. However, specifying the model as to include βk(xsk−1) demonstrates the connection between model (2) and a model where the ER relationship is specified using linear splines with knots s. Furthermore, such specification significantly simplifies prior elicitation to ensure ER continuity (see §4.1.4), and posterior sampling satisfying the continuity condition (see Supplement E).

Even though the outcome model in (2) resembles a linear splines model, there is a key distinction between the two models. In model (2), different experiments gk are allowed to have a different slope for the exposure (βk), a different set of outcome predictors (covariates with αkjY=1), or the same set of predictors but with different coefficient (δkjY). Therefore, points s in (2) represent a change in the slope or a change in the outcome model covariate adjustment. On the other hand, a model that uses splines for the exposure-response relationship only allows βk to vary with k. In this sense, a splines model is a sub-case of model (2), that for αkjY and δkjY constant across k.

The assumption of local linearity (linear effect of the exposure on the outcome within each experiment) can lead to global non-linearity, and can be relaxed using higher order splines. However, for our study of the health effects of ambient air pollution at low concentrations, previous research indicates that the relationship between air pollution and cardiovascular outcomes is linear [Thurston et al., 2016, Lim et al., 2018] or supra-linear [Crouse et al., 2015, Pinault et al., 2017], situations that our model can adjust to.

4.1.2. Connection to a separate model across experiments.

A natural question that arises from the LERCA model specification in (2), is how LERCA compares to fitting a separate outcome model within each experiment gk. Doing so would still allow for different confounders and different confounding strength at different exposure levels.

However, a separate model within each experiment would not borrow any information across exposure levels, and could estimate an ER that is not continuous at the points s. In contrast, LERCA borrows information across exposure levels by ensuring that the estimated ER is continuous everywhere (see §4.1.4). If higher order polynomials are used within each experiment, LERCA, similarly to splines, could be altered to accommodate higher order smoothness across the exposure range.

4.1.3. Prior distribution on inclusion indicators for confounding adjustment.

We build upon the work by Wang et al. [2012, 2015] to assign an informative prior on covariates’ local inclusion indicators (αkjX,αkjY). This prior choice ensures that model averaging assigns high posterior weights to outcome models including a minimal confounding adjustment set separately for each exposure range, and specifies

P(αkjY=1αkjX=1)P(αkjY=0αkjX=1)=ωwhereω>1,iidj,k. (3)

By specifying (3), a variable Cj is assigned high prior probability to be included into the outcome model if it is also included in the exposure model (xgk & αkjX=1). Wang et al. [2012] and Antonelli et al. [2017b] show that, for binary treatments, this informative prior leads to outcome models that include the minimal set of true confounders with higher posterior weights than model selection approaches that are based solely on the outcome model. In our context, this experiment-specific prior specification ensures that, locally, covariates in the minimal set Ck* are included in the outcome model of experiment k with high posterior probability.

4.1.4. Ensuring ER continuity.

In most applications, it is expected that the causal ER relationship Y¯(x) is continuous in x. Therefore, estimates of Y¯(x), in our case EC{E[Y|X = x, C]}, should also be continuous. If the covariates Cj are centered, and under model (2), continuity of the estimated ER function is satisfied if

limxsk+EC{E[YX=x,C]}=limxskEC{E[YX=x,C]}δk0Y=δ(k1)0Y+βk1(sksk1). (4)

This is ensured by assuming a point-mass recursive prior on δk0Y, k ≥ 2. Then, conditional on s, the outcome model intercept of experiment k ≥ 2 is a deterministic function of the outcome model intercept of the first experiment δ10Y, and the slopes β1, β2, …, βk–1. These parameters are assigned independent non-informative normal prior distributions.

4.1.5. Prior distributions of the remaining coefficients.

Prior distributions on the remaining regression coefficients (exposure model coefficients, outcome model covariates’ coefficients) and variance terms are chosen such that they lead to known forms of the full conditional posterior distributions to simplify sampling. We use independent non-informative inverse gamma prior distributions on σk,X2, σk,Y2. Non-informative normal prior is chosen for the exposure model intercepts δk0Y. Conditional on the inclusion indicators, the prior on the regression coefficient δkjY is a point mass at 0, or a non-informative normal distribution when αkjY is equal to 0 or 1 accordingly. Similarly for the exposure model covariates’ coefficients δkjX. Default hyperparameter values are set to 0.001 for the inverse gamma distribution, and (0, 100) for the mean, and standard deviation of the normal distribution. Details on the prior specifications can be found in Supplement D.

4.2. Unknown experiment configuration

For a fixed experiment configuration s¯, each experiment is treated separately in terms of confounder selection and strength of the confounding adjustment. However, the configuration itself is a key component of the fitted exposure response curve, and fixing it a priori could lead to bias and uncertainty underestimation. Instead, we assume that, a priori, the internal points of the experiment configuration s are distributed as the even-numbered order statistics of 2K +1 samples from a uniform distribution on the interval (s0, sK+1). This prior choice of s discourages specifications of s that include values that are too close to each other [Green, 1995]. The prior is augmented by indicators that consecutive points sk, sk+1 cannot be closer than some distance dk. Conditional on s, we follow the model specification and prior distributions described in §4.1.

4.3. MCMC scheme and convergence diagnostics

Markov Chain Monte Carlo (MCMC) methods are used to acquire samples from the posterior distribution of model parameters. A detailed description of the MCMC scheme including computational challenges and contributions can be found in Supplement E. There, we also discuss MCMC convergence diagnostics based on the potential scale reduction factor (PSR; Gelman and Rubin [1992]) for quantities that do not directly depend on the experiment configuration.

4.4. Number of points in the experiment configuration

As presented previously, LERCA requires the specification of the internal number of points K in the experiment configuration. Since the number of parameters grows with K, possible values for K could be bounded by considering the maximum number of coefficients we are willing to entertain.

Cross validation methods to choose values of tuning parameters are often infeasible in the Bayesian framework due to time and computational resources constraints. In a comprehensive review, Gelman et al. [2014] discusses methods of estimating the expected out of sample prediction error for Bayesian methods. The widely-applicable information criterion (WAIC; Watanabe [2010]) provides an estimate of the out-of-sample prediction error based on one MCMC run. It is defined as W AIC = −2 (lppd – pW AIC), where lppd and pW AIC denote the log point-wise posterior predictive density and the penalty:

lppd=i=1nlogEpostp(xi,yiθ)pWAIC=i=1nvarpost(logp(xi,yiθ)).

Here, 9 denotes the full vector of parameters, and Epost, varpost denote the posterior mean and variance.

In order to choose K for LERCA, LERCA is fit once for different values of K, and K is chosen as the value that minimizes the estimate of the WAIC.

5. Simulation Studies

The main goal of our simulation study is to illustrate that local confounding is an important issue that both commonly-used and flexible approaches for ER estimation fail to adjust for and they return biased results. The results from our simulation study indicate that methodology that directly accommodates local confounding is necessary in order to correctly estimate the causal effect of a continuous exposure. An R package which can be used to generate data with local confounding and fit LERCA is available at https://github.com/gpapadog/LERCA.

Additionally, in §5.4 we discuss results from a simulation study under a generative model without local confounding. In this case, traditional approaches and global confounding adjustment suffice for ER estimation, and the question is how comparably LERCA performs.

The approaches we considered are:

  1. Generalized Additive Model (GAM): Regressing the outcome Y on flexible functions of the exposure X and all potential confounders (4 degrees of freedom for each predictor).

  2. Spline Model (SPLINE): Additive spline estimator described in Bia et al. [2014]. The generalized propensity score (gps) is modelled as a linear regression on all covariates. The ER function is estimated using additive spline bases of the exposure and gps.

  3. The Hirano and Imbens estimator [Hirano and Imbens, 2004] (HI–GPS): ER estimation is obtained by fitting an outcome regression model including quadratic terms for both the exposure and the gps, and the exposure-gps interaction. The gps is estimated as in SPLINE.

  4. Inverse Probability Weighting estimator (IPW): The generalized propensity score is used to weigh observations in an outcome regression model that includes linear and quadratic terms of exposure. The gps is estimated as in SPLINE.

  5. The doubly-robust approach of Kennedy et al. [2017] (KENNEDY): The gps and outcome models are estimated using the Super Learner algorithm [Van Der Laan et al., 2007] combining the sample mean, linear regression with and without two-way interactions, generalized additive models, multivariate adaptive regression splines, and random forests. Based on the gps and outcome model estimates, the pseudo-outcome is calculated and is regressed on the exposure using kernel smoothing. This approach is chosen to represent state-of-the-art methods in ER estimation that are based on flexible, machine-learning and non-parametric approaches.

5.1. Data generation with local confounding

We generate data with exposure values which range from 0 to 10 and are uniformly distributed over the exposure range. Even though a uniform distribution is not accurate for the exposure variable in our study (ambient air pollution concentrations), we consider a uniformly distributed exposure to ensure that methods’ performance is solely affected by the presence of local confounding, and not by the presence of limited sample size at some exposure levels. We consider a quadratic ER, and true experiment configuration s¯=(0,2,4,7,10). Table 1 summarizes which of the 8 potential confounders are predictive of the exposure and/or the outcome within each experiment (correlations and regression coefficients are summarized in Table C.1). Note that in this data generating mechanism the minimal set of confounders vary across the four experiments. We simulate 400 data sets of 800 observations each. Details on the data generating mechanism are in Supplement F.

Table 1:

Representation of which covariates are predictive of the exposure and / or the outcome within each experiment (denoted by a ✓). Covariates with ✓ in both models within the same experiment are local confounders.

Experiment Model C1 C2 C3 C4 C5 C6 C7 C8

1 X | C
Y | X, C

2 X | C
Y | X, C

3 X | C
Y | X, C

4 X | C
Y | X, C

5.2. Fitting the methods

The different methods are fit using the gam and causaldrf R packages [Hastie, 2017, Schafer, 2015], and the code available on Kennedy et al. [2017]. LERCA is fit for K ∈ {2, 3,4}, and for each data set the results shown correspond to the K that minimized the WAIC.

Using each method, we estimate the population average ER curve Y¯(x) over an equally spaced grid of points on the interval (0,10), and compare the root mean squared error (rMSE) as a function of x. We also assess whether LERCA can recover the correct location of the points s, identify the true confounders within each experiment, and choose the correct value for K.

5.3. Simulation Results

Figure 5 shows the estimated ER curves using the alternative methods. In Figure 6 we summarize the LERCA results including the estimated ER, the internal points of the experiment configuration and outcome model inclusion indicators of covariates C1, C4 as a function of exposure x ∈ (0,10). We choose C1 and C4 because, in this data generating mechanism, C1 is a confounder in experiment 1 (x < 2), and C4 is a confounder in experiment 2 only (2 < x < 4). Grey lines correspond to results for individual data sets, whereas black solid lines correspond to averages across simulated data sets.

Figure 5:

Figure 5:

The true mean ER function (dashed line), estimated ER functions from each simulated data set (gray), and the mean of the estimated ER functions (solid lines) using all alternative methods.

Figure 6:

Figure 6:

LERCA results. (Left) Mean ER estimates. (Center) Posterior distribution of the internal locations s. (Right) Outcome model posterior inclusion probability of C1 and C4. Gray lines correspond to simulated data sets separately, and black solid lines correspond to averages across data sets.

In Figure 5 we see that the alternative methods return biased results, especially at very low or very high levels of the exposure. These results indicate that neither commonly-used nor flexible approaches utilizing machine learning tools appropriately accommodate local confounding adjustment for ER estimation. In terms of root MSE, LERCA was consistently lower than the alternative methods at low exposure levels, all approaches performed similarly at middle exposure levels, and GAM slightly outperformed LERCA at high levels (Figure C.1). In Supplement C.2, we show that the relative performance of GAM and LERCA is reversed when the confounding structure is also reversed. These results indicate that local confounding is an issue across all exposure levels, and that, since the true confounding structure is never known for a real data set, LERCA should be preferred if local confounding is of concern.

As showed in Figure 6, even though the true ER is quadratic and LERCA is formulated as piece-wise linear, LERCA is able to identify the correct shape of the exposure-response function. We find that using WAIC to choose the value of K led to choosing the correct value of K = 3 40% of the times, and K = 2 58% of the times indicating that WAIC tends to over-penalize large values of K. Regardless, the correct internal locations s = {2, 4, 7} are located at the modes of the posterior distribution (second panel in Figure 6). By examining the posterior inclusion probabilities of C1, C4, we observe that instrumental variables (e.g., C1 in experiments 2 and 3) are often included in the outcome model. However, LERCA includes the minimal confounding set within each experiment with very high probability. On average (across the points in the exposure range and across all the simulated data sets) the minimal confounding set was included in the adjustment set 99% of the times (ranging from 89–100% across simulated data sets), indicating that the variables necessary for confounding adjustment are almost always included in the adjustment set. Lastly, the point-wise 95% and 50% credible intervals cover the true mean ER values 84% and 39% of the times accordingly. The observed under-coverage is largely due to the underestimation of K.

5.4. Simulation results in the absence of local confounding

The previous generative scenario compared methods’ performance in the presence of local confounding. In Supplement C.3, LERCA is compared to the alternative methods in the more traditional setting of global confounding, that is, in the setting more favorable to the other methods. In this context, LERCA with K = 3 (fixed) performed similarly in terms of root MSE compared to GAM and Kennedy’s doubly-robust estimator, but better than the remaining alternative methods. These results indicate that LERCA offers a protection against bias arising from local confounding, without sacrificing efficiency when local confounding is not present.

6. Estimating the effect of ambient PM2.5 concentrations on zip code cardiovascular hospitalization rates

We estimate the relationship between the average ambient PM2.5 concentrations for the years 2011–2012 and log cardiovascular hospitalization rates in 2013, using the data set introduced in §2, and allowing for local confounding adjustment. Here, a unit i from §3 corresponds to the areal unit of a zip code.

6.1. Plausibility of the causal assumptions in our study

The interpretation of estimated results as causal are bound by the plausibility of the causal assumptions within the study’s setting. Here, we examine these assumptions in the evaluation of the causal relationship between ambient PM2.5 and cardiovascular hospitalization rates.

One assumption discussed in §3 is SUTVA which states that a zip code’s potential outcomes are only a function of the zip code’s own PM2.5 levels. If Medicare beneficiaries residing within a zip code travel outside of it, then other zip codes’ ambient PM2.5 concentrations can affect the personal exposures of zip code i’s beneficiaries and, as a consequence, the zip code’s hospitalization rates, invalidating SUTVA. This phenomenon is referred to in the literature as interference. Since PM2.5 concentrations in nearby zip codes are similar, and Medicare beneficiaries are expected, at some level, to spend most of their time within a relatively close distance to their home, interference can be assumed to be limited (dashed arrow in Figure 7). When interference is limited, Savje et al. [2018] showed that ignoring it returns estimates that are close to an average treatment effect.

Figure 7:

Figure 7:

Subgraph relating zip code i and j’s covariates and ambient PM2.5 concentrations to i’s potential outcomes. Arrow from Xj to Yi is weaker than that from Xi.

The most commonly invoked causal assumption is that of ignorability. In our setting, ignorability implies that, conditional on measured covariates, a zip code’s “assignment” to a specific level of ambient PM2.5 does not depend on its potential outcomes (see (1)), and any zip code can experience any ambient PM2.5 within the observed range. One natural question is whether spatial correlation of ambient pollution concentrations invalidate ignorability, either through confounding or positivity. The no unmeasured confounding assumption is expected to hold if the set of measured covariates includes all confounders. In our study, we might expect that the covariates of nearby zip codes (such as nearby weather conditions) affect a zip code’s ambient PM2.5 concentrations (arrow from Cj to Xi in Figure 7). If, in addition, a zip code’s outcome directly depends on the covariates of other zip codes (arrow from Cj to Yi) then Cj has to be included in the model for Yi. We assume that such direct dependence does not exist in our study. For example, weather conditions near but not in zip code i only affect zip code i’s hospitalization rates through their effect on ambient PM2.5 concentrations. Even though ambient PM2.5 concentrations are spatially correlated, the positivity assumption requires that zip codes can experience any PM2.5 concentration level marginally, and the spatial correlation of PM2.5 does not further complicate the plausibility of the positivity assumption.

Lastly, the interpretation of our study results as causal are bound by the specification of the model in (2). If the mean functionals are not correctly specified, estimates of βk can be biased for the causal effect of PM2.5 within that exposure level. Even though model (2) assumes independence of PM2.5 concentrations, the spatial dependence structure is not expected to affect estimation of the model’s regression coefficients or variable selection.

The results presented next can only be interpreted as causal under the assumptions discussed here. If any of the assumptions is violated, the study results should be interpreted as associational.

6.2. Study results

We fit LERCA for K ∈ {2, 3, …, 6} and we report the results for K = 3 which corresponds to the model with the lowest WAIC. Figure 8 shows the posterior mean and the 95% credible intervals of the ER, the posterior distribution of the internal points of the experiment configuration, and the posterior mean and 95% credible interval of βk within each experiment. Positive values of βk imply that an increase in ambient PM2.5 concentrations leads to an increase in hospitalization rates.

Figure 8:

Figure 8:

Top: Mean ER curve of PM2.5 exposure (x-axis) and log all-cause cardiovascular hospitalizations (y-axis) –solid line– with 95% pointwise credible intervals. The rug of points shows the distribution of observed PM2.5 values. Bottom: The posterior mean and 95% credible interval of the β coefficient within the four experiments. The rug of points shows the posterior distribution for s for K = 3.

In Figure 8, we see that the estimated ER is supra-linear with steeper incline at low concentrations. Examining the 95% credible intervals for βk there is evidence that an increase in PM2.5 at the low levels (≤ 9.9μg/m3) leads to an increase in log hospitalization rates. However, 95% credible intervals for x ≥ 9.9μg/m3 include zero. Note that the current NAAQS for long term exposure to ambient PM2.5 is equal to 12μg/m3. These results indicate that there is no exposure threshold for the effect of PM2.5 on cardiovascular outcomes, which means that reductions in ambient PM2.5 would lead to further health improvements, even at the low levels. These results are consistent with other epidemiological studies which have found that the strength of the association between PM2.5 and health outcomes is larger at low concentration levels [Dominici et al., 2002, Shi et al., 2016, Di et al., 2017b]. Lastly, the posterior distribution of s, shows that observations below 8μg/m3 and over 11.5μg/m3 are always in the same experiment.

6.3. Variability of the covariates’ posterior inclusion across PM2.5 concentration levels

We investigated whether local confounding was present by examining the variability of the covariates’ inclusion probabilities in the exposure and outcome models as a function of PM2.5. Figure 9 shows the posterior inclusion probabilities for three covariates as a function of PM2.5 providing a measure of the covariates’ confounding importance across the PM2.5 concentration range.

Figure 9:

Figure 9:

Posterior inclusion probability of zip code population percentage with less than a high school education, population density, and median house value in the exposure and outcome model as a function of PM2.5.

The posterior inclusion probabilities vary substantially at different concentration levels indicating that local confounding is likely to be present. In Figure 3, the exploratory analysis showed that the zip code median household value (House Value) was predictive of both PM2.5 and hospitalization rates at the high ambient concentration levels, but only of the outcome at the low levels. The LERCA results in Figure 9 lead to the same conclusion. Similarly, the posterior inclusion probability for the variable representing the zip code’s percentage of the population with less than a high school education (% Below HS) indicates that this variable is an important confounder only at the low levels, in accordance to the exploratory analysis. LERCA returns a similar conclusion about the variable representing population density (Population/SQM), in disagreement with the analysis in §2 which showed that population density was predictive of both PM2.5 concentrations and the outcome at both low and high levels. Comparisons between the results in Figure 3 and the outcome model posterior inclusion probabilities were performed for all variables. LERCA tends to include in the outcome model a smaller number of variables than what one might have assumed based on the exploratory analysis. This is expected since LERCA considers the confounding importance of all variables simultaneously.

6.4. Variability of the covariates’ posterior inclusion within the low experiment

With the focus of our study being the evaluation of the effect of ambient PM2.5 at the low concentration levels, we studied the interpretation of β1 across MCMC samples. Since the interpretation of β1 as a causal effect requires that a sufficient adjustment set is included in the outcome model, we examined the variability of the covariates’ outcome model inclusion indicators within the low experiment across iterations of the MCMC.

Across MCMC samples, 174 combinations of the covariates were included in the outcome model (out of the 227 possible ones). Even though this is a large number of potential models, 51% of the posterior weight was given to the model with the following 9 covariates: the zip code’s median house value and percentage of the population with at most a high school education, as measured in the 2000 Census and its extrapolation between 2000 and 2013, the population rate that has been a smoker at some point in their lives, the zip code’s population density, the average dew point, the average age of Medicare beneficiaries and the percentage of them that are women. We refer to this model as Model 1. The model with the second highest posterior probability included the same covariates except for smoking rate, and accounted for 7% of the MCMC samples. Therefore, there is evidence that Model 1 outperformed the rest in confounding adjustment at low levels.

In order to evaluate the impact of model averaging on our final estimates, we compared the posterior distribution of β1 across all MCMC samples to its distribution based on the MCMC samples for which Model 1 was chosen. Across all samples, β1 was estimated to be equal to 0.035 with 95% credible interval 0.012 – 0.06, and posterior probability that it is greater than 0 equal to 99.7%. Among posterior samples for which Model 1 was chosen, β1 was estimated to be 0.034 with 95% credible interval 0.011 – 0.056 and posterior probability that β1 is greater than 0 also equal to 99.7%. The consistency of the Model 1 estimates and the model averaged estimates is an indication that model averaging, in this situation, did not lead to averaging over incompatible models.

7. Discussion

We have introduced an innovative Bayesian approach for flexible estimation of the ER curve in observational studies that has the following features: 1) it casts the formulation of the ER within a potential outcome framework, and mimics several randomized experiments across exposure levels; 2) it uses the data to inform the experiment configuration; and given the current experiment configuration 3) allows for the possibility which is a reality in our study (Figure 3 and Figure 9) that different sets of covariates are confounders at different exposure levels; 4) allows for varying confounding effect across levels of the exposure; 5) performs local covariate selection to increase efficiency; 6) propagates model uncertainty for the experiment configuration and covariate selection in the posterior inference on the whole ER curve; and finally, 7) provides important scientific guidance related to which covariates are confounders at different exposure levels.

Although non-parametric and varying coefficient approaches [Hastie and Tibshirani, 1993] for ER estimation could, in theory, allow for differential confounding across different exposure levels, none of the existing methods for ER estimation explicitly accommodates local confounding, nor provides guidance for which covariates are confounders of the effect of interest at different levels of the exposure. Furthermore, the use of non-parametric methods to estimate a generalized propensity score or model the outcome of interest could prove unfruitful in situations where most of the available data are over a specific range of the exposure variable, the number of potential confounders is large, and interest lies in the estimation of causal effects for a change in the exposure in the tails of the exposure distribution. In such situations, LERCA provides a way to model the outcome acknowledging that the exposure-response relationship might be confounded by different covariates at different exposure levels. Lastly, it is worth noting that LERCA shall not be seen as a direct competitor to the approach by Kennedy et al. [2017]. In fact, since the Super Learner algorithm combines different approaches for modeling the outcome, LERCA could be incorporated in the algorithm as an approach that allows for the presence of local confounding.

The main contribution of this paper is in addressing the issue of local confounding in ER estimation, and in providing guidance of covariates’ confounding importance at different exposure levels. In doing so, LERCA is based on several modeling decisions that can be easily altered. First, within each experiment and thus locally within a narrow exposure range, LERCA assumes linearity for both the outcome and exposure models. Local linearity could be relaxed by using higher order polynomials. Second, the informative prior on the inclusion indicators could lead to the inclusion of instrumental variables in the outcome model, which will not lead to bias, but will decrease the efficiency of our estimators. In the study of air pollution, strong instrumental variables are not expected to be present. Alternative strategies for local confounder selection can be accommodated here, extending, for example, work by Wilson and Reich [2014], Cefalu et al. [2017] and Antonelli et al. [2018]. An interesting line of research is to explore LERCA’s extensions to more flexible functional specifications, and to evaluate the performance of different approaches to model selection (via prior specifications or penalization techniques) for different confounding scenarios.

An alternative modification of LERCA could enforce that the ER curve is monotone, by assuming prior distributions on βk that are left (or right)-truncated at zero. This modification could be of explicit interest for environmental and toxicological research, and in studies of air pollution in particular where the ER relationship is often believed to be supra-linear [Pinault et al., 2017, Vodonos et al., 2018]. In risk assessment studies, the shape of the ER can greatly affect conclusions [Pope et al., 2015], and is often assumed to be linear, log-linear, log-log, or power function with or without threshold [Devos et al., 2016, Burnett et al., 2014]. Nasari et al. [2016] proposed a class of models that can capture various ER shapes and are easy to implement in large data sets. Even though LERCA can capture effectively any ER shape, development of a faster and computationally efficient estimation procedure is required for very large data sets. Future work could focus on incorporating local confounding adjustment in air pollution analyses including the whole United States and including zip codes that are not located near an air pollution monitor.

The results of our study are in agreement with a supra-linear ER shape, indicating that there is a larger health impact of ambient air pollution concentrations at low exposure levels, and that, if the causal assumptions hold, lowering ambient PM2.5 concentrations would lead to a reduction in cardiovascular hospitalization rates. Even though our analysis addresses a key question in air pollution epidemiology, it is also met with its own challenges.

First, even though focusing on ambient pollution concentrations is important from a policy perspective (since regulations can more directly control ambient concentrations), the results of this study are not directly interpretable for evaluating the health effects of personal exposure to PM2.5. The relationship between ambient concentrations and personal exposures is complicated, with studies showing that there is substantial variability in personal exposures among individuals of similar ambient exposure concentrations [Dockery and Spengler, 1981, Clayton et al., 1993], largely due to the individuals’ activities [Meng et al., 2009]. The correlation between ambient concentrations and personal exposures might differ by exposure levels if, for example, individuals residing in highly polluted areas are more likely to avoid outdoor activities. This further complicates interpreting the results of the relationship between ambient concentrations and health to personal exposures. Furthermore, variables that are expected to be confounders of personal exposures and health outcomes (such as an individual’s smoking habits, variables C˜2 in Figure 2) are not necessarily confounders of ambient PM2.5 concentrations and outcomes (variables C˜1 in Figure 2). Indication of confounding of the relationship between ambient concentrations and health outcomes by variables such as the median household value (see Figure 9) implies that these variables are not confounders in the classic sense (since they are not driving ambient PM2.5 concentrations) but are correlated with variables that are. Therefore, a variable’s confounding strength for ambient concentrations is not directly interpretable as its confounding strength for personal exposures.

Second, this analysis has used log event rates as the outcome of interest in a linear regression setting, with all zip codes contributing equally to the estimation of the models irrespective of their size. Linear regression for the analysis of rate data has been used in various settings, for example in Joshua and Garber [1990], Mohamedshah et al. [1993], Chua et al. [2009], Wang et al. [2012] and Liu-Smith et al. [2016]. A Poisson regression model where the number of hospitalizations is the response variable and the Medicare population size within a zip code is the offset would be more in agreement with the literature on count outcomes (and within air pollution epidemiology specifically). However, extending local confounding adjustment to Poisson regression is computationally complicated: (a) enforcing ER continuity for Poisson regression is not straightforward since EC{E[Y|X = x, C]} is not easily acquired (see §4.1.4), (b) posterior sampling of the experiment configuration and regression coefficient involves marginal densities of regression models, and (c) no conjugate prior distribution exists for regression coefficients in Poisson regression.

Acknowledgments

Funding for this work was provided by National Institutes of Health R01 ES024332, USEPA 83587201-0, and Health Effects Institute 4953-RFA14-3/16-4.

A. Data details

We constructed counts corresponding to the cardiovascular-specific (CVD) number of hospitalizations for Medicare enrollees aged at least 65 years during 2013 for zip codes across the continental US. Hospitalization rates were based on the total number of personal years for Medicare enrollees for a zip code on a given year. CVD hospitalizations were considered on the basis of primary diagnosis according to International Classification of Diseases, Ninth Revision (ICD-9) codes (ICD-9 390 to 459). The analysis was restricted to the continental US leading to 34,897 zip codes with hospitalization information.

Population demographic information was acquired using the 2000 Census with information on over 400 variables, although a lot of them are highly correlated. We further used linearly extrapolated Census variables for 2013. Census information is provided at a ZCTA level, and we use a crosswalk to map ZCTA to zip code. Weather information including temperature, relative humidity and dew point is acquired from the NOAA-ASOS (National Oceanic and Atmospheric Administration-Automated Surface Observing System) website, and is linked to zip codes within 150 kilometers.

Lastly, zip code PM2.5 exposure is assigned using the US EPA monitoring sites. By EPA recommendations, monitoring sites with less than 67% of scheduled measurements observed are excluded. For every monitor, the average of the 2011–2012 average annual value of PM2.5 is calculated, and the monitor is linked to all zip codes with centroids within 6 miles. Then, the zip code exposure is set equal to the average over all linked monitors. Since monitoring sites are preferentially located near populated areas or points of interest, many zip codes in remote areas are not linked to any monitor and are therefore dropped from the final data set.

Figure A.1 shows maps of zip code centroids before linkage to EPA monitoring sites, as well as maintained zip code centroids after 3 different linkage procedures corresponding to different specifications of the linkage distance, as well as whether a monitor can be linked to more than one zip code. We visualize how linkage can affect the final data set:

  • Distance: As the distance of allowed linked zip codes and monitors increases, we expect that more zip codes will be linked to at least one monitor. However, PM2.5 values in areas where monitors are located at long distances might suffer more from measurement error.

  • Number of links: Allowing a monitor to be linked to multiple zip codes increases the number of zip codes with PM2.5 information. However, this can lead to adjacent zip codes with very similar or identical PM2.5 measurements.

Linkage (b) in Figure A.1 corresponds to the linkage procedure we used to create our data set. Table A.1 includes the descriptive statistics and data source for the zip code covariates included in our analysis.

Table A.1:

Available demographic and weather information with mean (SD) in the sample and across all zip codes in the continental U.S.

Source Name Description Sample 5,362 zip codes Full population* 34,897 zip codes

2000 Census % White Percentage of White Population 0.71 (0.25) 0.84 (0.2)
% Hisp Percentage of Hispanic Population 0.12 (0.18) 0.07 (0.14)
% HS Percentage of population that attended high school 0.27 (0.1) 0.34 (0.11)
% Poor Percentage of impoverished population 0.14 (0.11) 0.13 (0.09)
% Female Percentage of female population 0.51 (0.04) 0.5 (0.04)
% Moved in 5 Percentage of population that has lived in the area for less than 5 years 0.50 (0.12) 0.43 (0.12)
Avg Commute Mean Travel Time to Work 24.26 (6.02) 26.11 (7.27)
Population/SQM Population per square mile (logarithm) 7.52 (1.54) 4.98 (2.22)
Total Population Total population (logarithm) 9.7 (1.13) 8 (2.25)
Low Occupied Indicator. “=1” if the percent of occupied population is at most 90%. 0.21 (0.41) 0.44 (0.5)
High Occupied Indicator. “=1” if the percent of occupied population is over 95%. 0.42 (0.49) 0.24 (0.43)
Low Hispanic Indicator. “=1” if the percent of Hispanic population is at most 0.02% 0.32 (0.47) 0.54 (0.5)
High Hispanic Indicator. “=1” if the percent of Hispanic population is over 20% 0.2 (0.4) 0.1 (0.3)

Census Extrapolation % Below HS Population percent with less than high school education (above age of 65) 23.65 (15.11) 23.4 (15.94)
% Own Households Percentage of occupied housing units in 2013 0.58 (0.2) 0.72 (0.16)
Low Poverty Indicator. “=1” if the percent of the population below the poverty line in 2013 is at most 5% 0.2 (0.4) 0.28 (0.45)
High Poverty Indicator. “=1” if the percent of the population below the poverty line in 2013 is over 15% 0.24 (0.43) 0.2 (0.4)

Census combination** House Value Median value of owner occupied housing (USD) (logarithm) 12.65 (0.64) 12.37 (0.62)
Household Income Median household income (USD) (logarithm) 11.41 (0.42) 11.37 (0.36)

BRFSS BMI Average BMI in 2013 27.67 (1.31) 29.52 (4.81)
Smoking Rate Ever smoke rate (2013) 0.44 (0.06) 0.46 (0.08)

Weather Avg Temp Average temperature (F) 55.03 (7.14) 54.34 (8.25)
Avg Dew Point Average Dew Point (F) 43.53 (6.96) 43.41 (8.31)
Avg Humidity Average Relative Humidity (%) 69.93 (8.53) 71.01 (8.21)

Medicare Avg Age Average Medicare Age 74.87 (1.62) 74.51 (1.56)
Female Rate Percentage of Female Beneficiaries 0.55 (0.06) 0.53 (0.06)
Dual Rate Percentage of Dual Eligible Beneficiaries 0.23 (0.15) 0.18 (0.12)
*

Missing values for covariates are excluded when calculating the mean and standard deviation in the full population.

**

The 2000 Census is combined with the 2013 extrapolated values by taking each covariate’s mean value across the two years.

B. Additional results for investigation of local confounding

B. 1. Covariates’ estimated coefficients in a model for the exposure at low and high exposure levels

In §2, we investigated the presence of local confounding by checking the p-values of each covariate in a model for the exposure, at low and high exposure levels separately. However, weak associations could have small p-values, if for example the sample size is large. Hence, investigations of local confounding could focus on the magnitude of estimated coefficients instead.

In Figure B.1, we illustrate the absolute value of the estimated regression coefficients for the covariates in the regression model for predicting the exposure. In order for estimated coefficients to be comparable, covariates are standardized within each exposure level. The horizontal line corresponds to estimated coefficient equal to 0.06, a value between the coefficient for High Hispanic and High Occupied, covariates with p-value below and above 0.05 respectively. Comparing the estimated coefficients to the p-values in Figure 3 we see that there is a clear correspondence between small p-values and magnitude of estimated coefficient. That is expected when the two exposure levels have similar range of exposure values and similar number of observations.

This comparison indicates that when exposure levels are balanced, investigations of local confounding could focus on p-values or magnitude of estimated coefficients and lead to equivalent conclusions.

B. 2. Covariates’ strength for predicting the outcome at the low and high exposure levels

In §2, we explored the possibility that different covariates are predictive of the exposure within different exposure levels. Here, we perform a similar analysis to investigate whether different variables act as predictors of the outcome at the low and the high exposure levels separately.

Considering still the two sets of zip codes: zip codes at the low exposure levels (<8μg/m3) and zip codes at the high exposure levels (>11.5μg/m3), we fit regressions of the outcome on the covariates, for each covariate and each exposure level separately. Figure B.2 shows the covariates’ p-value in those regressions. Again, we see that different variables act as predictors of the outcome at the two exposure levels. For example, a zip code’s median house value (in logarithm – House Value) is an outcome predictor at both low and high exposure levels, whereas the percentage of the population with less than high school education (% Below HS) was an outcome predictor only at the low levels.

Figure A.1:

Figure A.1:

(a) All zip codes with available Medicare information. (b) Zip codes with available exposure information after performing linkage within 6 miles and monitors are allowed to be linked to more than one zip code. (c) Zip codes with available exposure information after performing linkage within 6 miles where each monitor is only linked to up to one zip code. (d) Zip codes with available exposure information after linkage with monitors within 60 miles and every monitor can be linked to more than one zip code.

Figure B.1:

Figure B.1:

Absolute value of covariates’ estimated coefficient in a regression of the exposure on each covariate separately and at the low and the high exposure levels.

Figure B.2:

Figure B.2:

Covariates’ p-values from regressing the outcome on each covariate separately. A separate regression is fit at the low and the high exposure levels.

C. Additional simulation results

C. 1. Simulations in the presence of local confounding

Table C.1 shows the correlation of the covariates with the exposure and the coefficients of the covariates in the outcome model for the data simulating scenario with local confounding: different confounders at different levels of the exposure.

Figure C.1 shows the the root MSE (rMSE) as a function of the exposure value x ∈ (0,10). LERCA has the lowest rMSE at the low exposure levels followed by GAM. Root MSE across most methods seems to be comparable for the middle exposure values, and GAM performs slightly better than LERCA at high levels.

Table C.1:

Correlation between the covariates and exposure, and outcome coefficients in each experiment, for scenarios with local confounding.

Covariate - Exposure Covariate - Outcome

xg1 xg2 xg3 xg4 xg1 xg2 xg3 xg4

C1 0.423 0.525 0.402 0 0.641 0 0 0
C2 0.524 0.572 0 0.503 0.962 0.919 0.593 0.651
C3 0.522 0 0.447 0 0.646 0.643 0.616 0.58
C4 0 0.528 0 0 0 0.633 0 0
C5 0 0 0.533 0.539 0 0 0.658 0
C6 0 0 0 0.509 0 0 0 0.52
C7 0 0 0 0 0 0 0 0
C8 0 0 0 0 0 0 0 0

Figure C.1:

Figure C.1:

Mean Root MSE as a function of the exposure x ∈ (0,10).

C. 2. Simulations with reversed confounding structure

In Figure C.1, we see that GAM has slightly lower root MSE than LERCA at high exposure levels. A natural question that arises is whether the relative performance of GAM and LERCA is (1) a feature of the specific confounding structure used to generate the data in those simulations, or (2) specific to the high exposure range. For that reason, we generated data over the same exposure range (0, 10), the same experiment configuration (0, 2, 4, 7, 10), but reversed confounding structure. Specifically, the confounder set and confounding strength of experiment 1 in the simulations presented here are those of experiment 4 in the original simulations, those of experiment 2 here are those of experiment 3 in the original simulations, and so forth.

Figure C.2 show the estimated ER curves and root MSE for all methods. For computational simplicity, we only considered LERCA with K = 3. In these results we see that the estimated ER using LERCA follows the true ER curve very closely, whereas the remaining of the methods have deviations from the true ER curve at low or high exposure levels. When considering the root MSE of the methods, we see that GAM slightly outperforms LERCA at low exposure levels, but LERCA does much better at high exposure levels. Combining these results to those in Figure C.1, we see that (1) the performance of GAM relative to LERCA highly depends to the confounding structure, and (2) LERCA achieves the lowest or close to the lowest root MSE across exposure levels and under either specification of confounding structure.

Figure C.2:

Figure C.2:

Simulation results over 400 data sets with reversed confounding structure.

C.3. Simulations in the presence of global confounding

Briefly, data are generated with covariates C1, C2, C3 as predictors of exposure and C2, C3, C4 as predictors of the outcome and the adjusted R-squared of the true exposure and outcome models was 0.73 and 0.94 accordingly. Table C.2 shows the correlation of covariates with the exposure and the outcome model coefficients in the data simulating scenario with global confounding (same confounders with constant confounding strength across exposure levels) and true quadratic ER. Figure C.3 shows the estimated ER for each data set and the average estimated ER based on LERCA and alternative methods. In Figure C.4, the root MSE for all methods is plotted as a function of the exposure x ϵ (0, 10).

Table C.2:

Correlation between the covariates and exposure, and outcome coefficients in each experiment, for the scenario with global confounding.

C1 C2 C3 C4 C5 C6 C7 C8

Exposure 0.423 0.524 0.522 0 0 0 0 0
Outcome 0 0.812 0.93 0.82 0 0 0 0

Figure C.3:

Figure C.3:

Simulation results in the presence of global confounding. Grey lines correspond to estimated ER for each simulated data set, solid lines correspond to the mean ER over all simulated data sets, and the dashed line corresponds to the true ER.

Figure C.4:

Figure C.4:

Root MSE of all methods in the presence of global confounding as a function of the exposure x ϵ (0, 10).

D. Prior specifications for regression parameters and experiment configuration

D. 1. Regression coefficients and residual variance

Prior independences of all parameters are expressed in the following representation

p(α˜X,α˜Y,δ˜X,β˜k,δ˜Y,σ˜X2,σ˜Y2)=p(δ10Y)k=1K+1{[j=1pp(αkjX,αkjY)p(δkjXαkjX)p(δkjYαkjY)]p(δk0X)p(βk)p(σk,X2)p(σk,Y2)}×k=2KI(δk0Y=δ(k1)0Y+βk1(sksk1). (D.1)

We assume non-informative normal priors on βk, k = 1, 2, …, K + 1, and δ10Y The prior distribution on the regression coefficients for the covariates is a mixture of non-informative normal distribution and point-mass at 0. Non-informative inverse gamma prior distributions are assumed on σk,X2, σk,Y2. Specifically

  • βk~N(μ0,σ02),δ10~N(μ0,σ02).

  • δkjXαkjX~αkjXN(μ0,σ02)+(1αkjX)10(δkjX), where 10(δkjX) is a point-mass distribution at 0. Similarly for δkjYαkjY.

  • σk,X2~IG(a0,b0), and similarly for σk,Y2.

The hyper-parameters μ0, σ02, a0, b0 can be chosen differently for different variables.

D. 2. Experiment configuration

The prior on the points s = (s1, s2, …, sK) defining the experiment configuration is set as the even ordered statistics of (2K + 1) samples from a uniform distribution over the observed exposure range. Compared to a uniform prior distribution on s, this choice of a prior discourages the existence of points Si, Si in the experiment configuration that are very close to each other.

Let K and the exposure range (s0, sK+1) be fixed. Let Zi~U(s0,sK+1), i = 1, 2, …, 2K + 1 and denote the even ordered statistics as Wj = Z(2j), j = 1, 2, …, K. Then,

fW1,W2,,WK(w1,w2,,wK)=fW1(w1)fW2W1(w2w1)fWKW1,W2,,WK1(wkw1,w2,,wk=1)

Since W1 is th 2nd order statistic of 2K + 1 samples from U(s0, sK+1), we know that

fW1(w1)=(2K+1)!(2K1)!1sK+1s0w1s0sK+1s0(1w1s0sK+1s0)2K1=(2K+1)!(2K1)!(sK+1s0)(2K+1)(w1s0)(sK+1w1)2K1

Given W1 = w1, W2 acts like the second order statistic of 2K − 1 uniform samples from a uniform distribution over (w1, sK+1). Therefore, we similarly get that

fW2W1(w2w1)=(2K1)!(2K3)!(sK+1w1)(2K1)(w2w1)(sK+1w2)2K3.

Iteratively, we have that

fW1,W2,,WK(w1,w2,,wK)=(2K+1)!(sK+1s0)(2K+1)(w1s0)(w2w1)(wKwK1)(sK+1wK).

Therefore, the prior distribution on s with minimum distance of consecutive points sk, sk+1 being dk is defined as

fs(s1,s2,,sK)k=0K(sk+1sk)1(sk+1sk>dk) (D.2)

E. Sampling from the posterior distribution

The parameters included in the model are: s (the exposure values in the experiment configuration), α˜X, α˜Y (the vectors of length p including the covariates’ inclusion indicators in the exposure and the outcome model for each experiment), β˜={βk}k=1K+1(coefficients of exposure in the outcome model), δ˜X, δ˜Y (intercepts and coefficients of the covariates in the exposure and outcome model of each experiment), σ˜X2={σk,X2}k=1K+1, σ˜Y2={σk,Y2}k=1K+1 (residual variance of the exposure and outcome within each experiment).

E. 1. Likelihood factorization

We start by noting that the data likelihood (conditional on all parameters) factorizes to components for different experiments and the exposure and outcome models. If Y, X denote the vectors of outcomes and exposures for all units in the sample, and Yk, Xk denote the vectors of outcomes and exposures in experiment k, then

P(Y,Xs,α˜X,α˜Y,δ˜X,δ˜Y,β˜,σ˜X2,σ˜Y2,C)=k=1K+1igkpk(YiXi,α˜kX,δ˜kY,βk,σk,Y2,Ci)pk(Xiα˜kX,δ˜kX,σk,X2,Ci)=k=1K+1[pk(YkXk,α˜kY,δ˜kY,βk,σk,Y2,Ck)pk(Xkα˜kX,δ˜kX,σk,X2,Ck)], (E.1)

where we denote pk(12) as the density of 1 conditional on 2 in experiment k and δ˜kY includes the intercept δk0Y.

Next, we note that if we consider the marginal likelihood integrating out 1) exposure model regression coefficients including the intercept, 2) outcome model covariates’ regression coefficients, and 3) all variance terms, then the likelihood still factorizes in a similar manner. In fact1:

P(Y,Xs,α˜X,α˜Y,β˜,δ10Y,C)=P(Y,Xs,α˜X,α˜Y,δ˜X,δ˜Y,β˜,δ10Y,σ˜X2,σ˜Y2,C)×p(δ˜X,δ˜Y,σ˜X2,σ˜Y2s,α˜X,α˜Y)d(δ˜X,δ˜Y,σ˜X2,σ˜Y2)=k=1K+1pk(YkXk,s,α˜kY,δ˜kY,β˜,δ10Y,σk,Y2,Ck)pk(Xkα˜kX,δ˜kX,σk,X2,Ck)×p(δ˜kX,δ˜kY,σk,X2,σk,Y2s,α˜kX,α˜kY)d(δ˜kX,δ˜kY,σk,X2,σk,Y2)=k=1K+1pk(YkXk,s,α˜kY,δ˜kY,β˜,δ10Y,σk,Y2,Ck)p(δ˜kY,σk,Y2s,α˜kY)d(δ˜kY,σk,Y2)pk(Xkα˜kX,δ˜kX,σk,X2)p(δ˜kX,σk,X2s,α˜kX)d(δ˜kX,σk,X2)=k=1K+1pk(YkXk,s,α˜kY,δk0Y,βk,Ck)pk(Xkα˜kX,Ck) (E.2)

where the second equation holds from the factorization of the likelihood (when δ˜kY does not include the intercepts we need to condition on δ10Y and β˜) and the assumed prior independences.

E. 2. Sampling all model parameters using MCMC

E.2.1. Sampling the regression coefficients and residual variance terms.

The factorization of the full data likelihood over experiments and exposure/outcome models and the choice of the prior distributions lead to full conditional posterior distributions of coefficients δk0X, δkjX, δkjY, and variance terms σk,X2, σk,Y2 of known forms. The variance terms and exposure model intercepts have inverse Gamma and normal full conditional posterior distributions accordingly, whereas the distributions of δkjX, δkjY. are either point mass at 0 or normal, based on whether the corresponding α is 0 or 1.

We update coefficients δkjX for which αkjX=0 separately from the ones with αkjX=1. Parameters δkjX for which αkjX=0 are set to 0. Let j1,j2,,jNx be the indices such that αkjl=1,l=1,2,,Nx. Then,

(δk0X,δkj1X,δkj2X,,δkjNxX)TData,~MVNNx+1(μX,ΣX),whereΣX=(1σk,X2V˜TV˜+1σ02INx+1)1andμX=ΣX(1σk,X2V˜TXk+1σ02μ˜0)

where V˜=(1,Cj1k,Cj2k,,CjNxk) is the design matrix of data in experiment k based on the included covariates, and μ˜0 is a vector of length Nx+ 1 of repeated values μ0. (Update of the coefficients δkjY is performed conditional on δk0Y, βk and is similar to the updates of the coefficients in the exposure model and therefore omitted.)

The full conditional distribution of the variance term σk,X2 is also of known form

σk,X2Data,~IG(aX,bX),whereaX=a0+nk2,bX=b0+12(XKVδ˜kX)T(XkVδ˜kX),

where nk is the number of observations in experiment k, and V = (1, Ck). (The full conditional posterior distribution of σk,Y2 is very similar and is therefore omitted.)

It is worth noting that centering the covariates Cj allows the outcome model intercepts δk0 to depend solely on δ10, βk and s, and not on δkjY ·. This simplifies the form of the full conditional distribution for many coefficients. Since δk0Y, k ≥ 2 is a deterministic function of δ10, β1, β2, …, βk—1, and the points s0, s1, …, sk, the full conditional posterior distribution of δ10 depends on data across all experiments, and that of βk on data from experiment k and onwards. Since the data likelihood in all experiments is normal and we have assumed normal prior distributions, the full conditional posterior distributions are also normal. After each update, intercepts δk0Y, k ≥ 2 need to be updated from (4) to ensure ER continuity.

The parameter δk0Y is included in the mean structure of the outcome model for all experiments. Its full conditional posterior distribution is δ10YData,~N(μ,σ2) where

σ2=(1σ02+k=1K+1nkσk,Y2)1

and

μ=σ2[μ0σ02+k=1K+11σk,Y2igk(yil=1k1βl(slsl1)βk(xisk1)j=1pδkjYCij)],

where ab=0 if b < a. Similarly, the full conditional posterior distribution of βk uses data from experiments k, k + 1, …, K +1, and is βkData,~N(μ,σ2) where

σ2=(1σ02+1σk,Y2igk(xisk1)2+(sksk1)2l=k+1K+1nlσl,Y2)1

and

μ=σ2(μ0σ02+1σk,Y2igk(xisk1)(yiδk0YjδkjYCij)+l=1K+11σl,Y2igl(sksk1)(yiδk0Ye=k+1l1βe(sese1)βl(xisl1)jδljYCij)).

E.2.2. Sampling the experiment configuration and inclusion indicators.

The experiment configuration and inclusion indicators can be updated separately, or simultaneously. We first describe the separate update of s and (α˜X,α˜Y), and afterwards we will discuss why occasional simultaneous sampling was deemed necessary. One of the three moves (separate, jump over, jump within) depicted in Figure E.1 is performed at every iteration with probability 0.8, 0.1, and 0.1 accordingly.

(separate) The experiment configuration and inclusion indicators are updated separately and conditionally on each other. For the update of the experiment configuration s, a Metropolis-Hastings step is used [Metropolis et al., 1953, Hastings, 1970] based on the full conditional likelihood (E.1). k is chosen uniformly over {1, 2, …, K} and s*~U(sk1,sk+1) is drawn as shown in Figure E.1(b). Alternatively, s* could be sampled from a truncated normal distribution centered at sk. If s* violates sk+1s*dk or s*sk1dk1, the move is automatically rejected.

Otherwise, the move ss*=(s1,s2,,sk1,s*,sk+1,,sK) is proposed with all other parameters (excluding β˜) fixed to their current values. Proposing new values of β˜ is necessary to ensure that the ER is continuous at the proposed state. All coefficients but βk, βk+1 are fixed to their current values, and new values for βk, βk+1 are proposed such that the intercepts of the adjacent experiments are also fixed. If s* < sk, the proposed value βk+1* is sampled from a uniform distribution between the values βk+1 (current state) and

β˜k+1=(sk+1s*)1(δ(k+2)0Yδk0Yβk(s*sk1)),

where β˜k+1 is the slope that would connect the value of the ER at point sk+1 with the value of the ER at point s* at the current state. Figure E.2 shows the the limits of the proposed ER. Based on the sampled value for βk+1*, the proposed value for βk is

βk*=(s*sk1)1(δ(k+2)0Yδk0Yβk+1*(sk+1s*)).
Figure E.1:

Figure E.1:

Proposed state for the separate, jump within and jump over moves are depicted schematically for a hypothetical experiment configuration with K = 3. In all proposed states, new slopes are proposed to ensure continuity of the ER. (a) The current state of the MCMC. s3 is chosen to be updated. (b) A new point s* is proposed within (s2, s4) with the corresponding α parameters constant. (c) Simultaneous move of the experiment configuration and the corresponding α’s within (s2, s4). (d) The proposed point s* is located outside the interval (s2, s4) and new a’s are proposed for the experiment that was split (s0, s1), and the experiments that were combined (s0, s4).

Figure E.2:

Figure E.2:

Values of βk, βk+1 for the separate move are proposed such that the estimated ER are within the limits shown in dashed green lines. The black solid line correspond to the current state of the ER.

Similarly for s* > sk by sampling ßk from a uniform that has similar properties.

Since the likelihood factorizes as shown in (E.1) the likelihood ratio of the Metropolis-Hastings acceptance probability includes terms only for experiments k, k + 1. The prior ratio includes terms for the experiment configuration distribution in (D.2), and the prior for βk, βk+1. If a uniform distribution is used to sample s*, the proposal for the cutoffs is symmetric, and the proposal ratio corresponds to the proposal ratio for coefficients βk, βk+1. This is equal to |βk+1β˜k+1|/|βk*β˜k*|, where βk* is the proposed value and β˜k* is the one boundary of the proposal distribution for βk in the reverse move.

After we accept or reject the move s s*, we update the inclusion indicators based on their full conditional. Let A* be all parameters but αkjX, αkjY, ay and δkjY. For α ϵ {0,1}

p(αkjY=αData,A*,αkjX)=p(δkjY=0,αkjY=αData,A*,αkjX)p(δkjY=0αkjY=α,Data,A*,αkjX)=p(Data,A*δkjY=0,αkjY=α,αkjX)p(δkjY=0,αkjY=ααkjX)p(Data,A*αkjX)p(δkjY=0αkjY=α,Data,A*,αkjX)p(δkjY=0αkjY=α,αkjX)p(αkjY=ααkjX)p(δkjY=0αkjY=α,Data,A*,αkjX)=p(δkjY=0αkjY=α)p(αkjY=ααkjX)p(δkjY=0αkjY=α,Data,A*),α{0,1}, (E.3)

where the numerator consists of the product of two prior probabilities, and the denominator consists of the posterior probability that δkjY=0. This has been seen previously in a different context [Antonelli et al., 2017a], and consists a computational improvement over previous implementations of this prior distribution that utilized the MC3 algorithm [Madigan et al., 1995, Wang et al., 2012].

However, sampling the inclusion indicators and experiment configuration separately can lead to slow convergence. For example, consider our simulation scenario where the true experiment configuration is (2, 4, 7), and starting values randomly set to (0.5, 2, 7). Based on the separate move, point s1 is always proposed to be updated between s0, s2 = 2, which can lead to slow mixing. The jump over and jump within moves are meant to alleviate such issues.

In order to avoid the need of proposing values for the covariates’ coefficients and variance terms in the update of the experiment configuration through a simultaneous move, these parameters are integrated out from the data likelihood. Integrating all other parameters out allows us to perform sampling of the experiment configuration without heavy fine tuning of proposal distributions. In both situations, sampling of s, α˜X, α˜Y, β˜ is performed using the marginalized likelihood (E.2):

p(s,α˜X,α˜Y,β˜Data,δ10Y)p(Y,Xs,α˜X,α˜Y,β˜,δ10Y,C)p(s)p(α˜X,α˜Y)p(β˜).

Note that all likelihoods in (E.2) are marginal densities of linear regression models over the regression coefficients and variance terms with Normal-Inverse Gamma priors. Raftery et al. [1997] provided closed form calculations of this marginal likelihood. However, this calculation requires the inversion of a matrix with dimension equal to the number of observations, and is computationally intensive. Since the marginal likelihood is only used in the calculation of Bayes factors, we approximate the Bayes factors when necessary using the BIC [Raftery, 1995].

(jump over) This move is designed to alleviate the MCMC issue described above by proposing a simultaneous move of (s,α˜X,α˜Y,β˜). k ϵ {1, 2, …, K} is again chosen uniformly, but now a new location of the experiment configuration s* is generated uniformly over (s0, sK+1) \ [sk−1, sk+1] (necessarily not between sk, sk+1). The move ss*=(s1,s2,,sk1,sk+1,,sj1,s*,sj,,sK) proposes a combination of experiments k, k +1 and a split in some randomly chosen experiment j. For example, in Figure E.1(d), the proposed move splits the first experiment (s0, s1) in two (s0, s*), (s*, s1), and combines the experiments (s2, s3), (s3, s4).

The inclusion indicators of the unchanged experiments remain to their current values, but new values need to be proposed for the combined or split experiments. The assignment of proposed values for the inclusion indicators is probabilistic based on their current values, encouraging the inclusion of a covariate in the proposed state to resemble that of the current state. For example, in Figure E.1(d) α1*, α2* should resemble α1, and similarly for α4*. In this example, covariates are included in experiment 4 with very low, mediocre and very high probability if none, one or both of the original experiments include it. The values chosen for these probabilities were (0.01, 0.5,0.99) accordingly. Similarly, a variable is proposed to be included in the model of experiments 1 and 2 with low and high probability if the variable was included in the initial model or not. The values chosen were (0.2, 0.95).

Values for β˜ are proposed to ensure that the proposed state corresponds to a continuous ER. Unchanged experiments remain the same. Experiments are combined by connecting the edges of the two linear segments, and values of the split experiments are proposed using a normal perturbation of the current value with variance σtune2. Figure E.1(d) shows proposed states of the ER.

The move is accepted or rejected with probability equal to the product of the following:

  1. The likelihood ratio for split and combined experiments approximated using the BIC for the exposure model and the outcome model (regressing Yk(1,Xksk11)(δk0Y,βk)T on Ck without an intercept).

  2. The prior ratio for the experiment configuration (D.2), the inclusion indicators (3), and the coefficients βk for the combined and split experiments.

  3. The proposal ratio for s, βk and (α˜X,α˜Y)
    (sK+1s0)(sjsj1)(sK+1s0)(sk+1sk1)exp{u2u*22σtune2}l{0,1,2}m{0,1}(plmc)nmlsnlmcl{0,1}m{0,1,2}(plms)nmlcnlms,
    where plmc is the probability of proposing α = m ϵ {0,1} in the combined experiment when l ϵ {0,1, 2} of the two initial experiments had α =1, plms is the probability of proposing α = 1 in m ϵ {0,1, 2} of the two experiments when the initial experiment chosen to be split had α = l ϵ {0,1}, and nlmc, nlms is the number of times that each event occurred when moving from the current to the proposed state. Lastly, u is the difference of the slope for the experiment that was split from the slope of the first split experiment in the proposed state, and u* is the difference of the slope in the first of the experiment that is combined from the slope of the combined experiment in the proposed state.

(jump within) This move is similar to the “jump over” but maintaining the ordering of the locations in s. k ∈ {1, 2, …, K} is again chosen uniformly, and a new value s* is proposed within the interval (sk−1, sk+1). New values for the coefficients βk, βk+1 are proposed as in the separate move. New values of the inclusion indicators are also proposed for the experiments k, k + 1. In fact, Cj is proposed to be included in the outcome model of an experiment with high probability if both current models include it, mediocre probability if only one of the models include it, and low probability if none of the models include it. Similarly for the inclusion indicators of the exposure model. The acceptance probability of this move is similar to the one described above, and is omitted here. Figure E.1(c) depicts random draws for proposed ER states.

E. 3. MCMC convergence

Due to the update of the experiment configuration, commonly used convergence diagnostics such as trace plots are not appropriate since parameters (e.g., βk) may correspond to a different range of exposure values at different iterations. Therefore, convergence must be examined in the context of quantities that are detached from the experiment configuration.

One quantity that we use for convergence inspection is the mean exposure response curve calculated over a set of exposure values within the exposure range. Such a set might be an equally spaced grid of points over the interval (s0, sK+1), denoted by G. For each value xG and MCMC iteration t, identify the experiment k = kt(x) that x belongs to. Then, for observation i calculate the expected response at value x, by defining w˜i(x)=(1,x,Ci1,,Cip)T and calculating Y^it(x)=w˜i(x)Tγkt where γkt is the posterior sample of (δk0Y,βk,δk1Y,,δkpY)T in iteration t. Finally, the t-posterior sample of the mean response at point xG is the average of the expected responses over the individuals in the sample Y^t(x)=1ni=1nY^it(x).

Convergence could be examined by visual inspection of trace plots of Y^(x) for all xG. Based on multiple chains of the MCMC, we calculate the potential scale reduction factor (PSR) for the mean response at every point xG [Gelman and Rubin, 1992]. We consider that the MCMC has converged if |PSR − 1| < c for all xG. An alternative quantity based on which MCMC convergence can be examined is Δ^(x)=βkt(x).

F. Simulating data with differential confounding at different exposure levels

In simulation studies, data are most often simulated in the following order: covariates C1, C2, …, Cp, exposure X given a subset of C1, C2, …, Cp, and outcome Y given X and a potentially different subset of C1, C2, …, Cp. Data with differential confounding at different exposure levels could imply, in its most generality, that the exposure X is generated with different predicting variables at different exposure levels. Generating data with such structure is complicated since the actual X values define the exposure level that an observation belongs to, and the exposure level in which an observation belongs to defines the set of predictors. For that reason, instead of following the C, X|C approach to data simulation, we generate the exposure values X first, and C is generated conditional on X, ensuring that the target experiment-specific mean and variance of X, C, and correlation of all variables remain the same, as if the data were generated with the typical C, X|C order. Generating the outcome with different predictors at different exposure levels is straightforward by including terms of the form δj*CjI(Xsk), or by using a separate outcome model within each experiment. In all situations, one should ensure that data are generated in such a way that the true ER is continuous.

F. 1. The “target” data generating mechanism

Given K, s, we would like the exposure X to be generated such as E(X) and Var(X) are controllable quantities, since they are closely related to the exposure range of each experiment, and we would like to ensure that simulation results are not driven by the inherit variability in X. Furthermore, we would like to ensure that Var(Cj) is approximately the same across experiments and across covariates, such that the the magnitude of δkjY has similar interpretation in terms of correlation.

As discussed above, data (X, C) are usually generated in the order C followed by X|C, using a model for which E(XC)=δ0+j=1pδjCp. Instead of setting target values for δj, we set target correlations Cor(X, Cj) and calculate the δj’s that correspond to these correlations. (The reverse is also possible but requires ensuring that that Var(X)j=1pδj2Var(Cj).) We require that E(CjX=x) is continuous in x to ensure that the joint distribution (X,Cj) is realistic, and does not have “jumps” at the points of the experiment configuration.

Based on the above, the following represent target (controllable) quantities of our data generation:

  • Var(X), E(X) are fixed,

  • Within each experiment Cj are independent random variables with known variance,

  • Cor(X, Cj) are fixed and δj can be calculated, using Cor(X,Cj)=δjVar(Cj)Var(X)

  • The function E(Cj|X = x) is continuous in x.

Ensuring that E(Cj|X = x) is continuous in x across experiments is performed in the following way: Given Var(X), a model for C|X that gives rise to data with the target Var(Cj), Cor(X, Cj) is considered. The variance- covariance targets do not impose any restrictions on the model intercept. For the first experiment, the intercept can be chosen arbitrarily, and for the subsequent experiments intercepts are chosen to ensure that limtxE(CjX=t)=limtx+E(CjX=t) at all points x.

F. 2. Generating the data set maintaining target quantities

As discussed in §F.1, Cor(X, Cj), Var(Cj), and Var(X) are considered known, from which we can derive Cov(X, Cj). We generate data with the following order:

  1. X is generated from a distribution with mean E(X), and variance Var(X). In our simulations X is uniform over the exposure range.

  2. Taking advantage of the laws of the multivariate normal distribution we generate
    CX~MVNp(μ¯,Σ¯),whereμ¯=E(C)+Cov(C,X)Var(X)(XEX),andΣ¯=V(C)1Var(X)Cov(C,X)Cov(C,X)T,
    where Cov(C,X) = (Cov(C1,X), Cov(C2,X),…,Cov(Cp,X))T, and V(C) is a diagonal p × p matrix with entries Var(Cj),j = 1, 2, …,p.
  3. The marginal means of each variable Cj within each experiment is calculated by ensuring that the function E(Cj|X = x) which corresponds to the jth entry of the vector μ¯ is continuous at the points of experiment change.

  4. Covariates Cj are subtracted their overall mean.

A simple linear regression form is used to generate the outcome within each experiment. In experiment k, the outcome is generated from YX,C~N(ξk0+ξk1ϕ(X)+j=1pξk(j+1)Cj,σk,Y2), where ϕ() is a continuous function, and the residual variance σk,Y2 is set equal across k. We ensure that the true ER function E(Y|X) is continuous in X by appropriately setting the intercept values ξk0. The intercept in experiment 1 is decided, and for each experiment onwards we set ξk0 such that

limxskE[YX=x]=limxsk+E[YX=x]ξ(k+1)0=ξk0+(ξk1ξ(k+1)1)ϕ(sk).

Footnotes

1

In the following, δ˜X includes the exposure model intercepts, but δ˜Y includes only the coefficients of the covariates.

References

  1. Androutsopoulos Vasilis P., Hernandez Antonio F., Liesivuori Jyrki, and Tsatsakis Aristidis M.. A mechanistic overview of health associated effects of low levels of organochlorine and organophosphorous pesticides. Toxicology, 307:89–94, 2012. [DOI] [PubMed] [Google Scholar]
  2. Antonelli Joseph, Mazumdar Maitreyi, Bellinger David, Christiani David C, Wright Robert, and Coull Brent A. Bayesian variable selection for multi-dimensional semiparametric regression models. 2017a. [Google Scholar]
  3. Antonelli Joseph, Zigler Corwin, and Dominici Francesca. Guided Bayesian imputation to adjust for confounding when combining heterogeneous data sources in comparative effectiveness research. Biostatistics, 18(3):553–568, 2017b. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Antonelli Joseph, Parmigiani Giovanni, and Dominici Francesca. High-dimensional confounding adjustment using continuous spike and slab priors. Bayesian Analysis, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Babb James, Rogatko André, and Zacks Shelemyahu. Cancer phase I clinical trials: efficient dose escalation with overdose control. Statistics in Medicine, 17(10):1103–20, 1998. [DOI] [PubMed] [Google Scholar]
  6. Bell Michelle L, Peng Roger D, and Dominici Francesca. The exposure-response curve for ozone and risk of mortality and the adequacy of current ozone regulations. Environmental health perspectives, 114(4):532–6, April 2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Berger Rebecca E., Ramaswami Ramya, Solomon Caren G., and Drazen Jeffrey M.. Air Pollution Still Kills. New England Journal of Medicine, 376(26):2591–2592, 2017. [DOI] [PubMed] [Google Scholar]
  8. Bia Michela, Flores Carlos A., Alfonso Flores-Lagunes, and Mattei Alessandra. A Stata package for the application of semiparametric estimators of doseresponse functions. The Stata Journal, 14(3):580–604, 2014. [Google Scholar]
  9. Burnett Richard T., Pope C. Arden, Ezzati Majid, Olives Casey, Lim Stephen S., Mehta Sumi, Shin Hwashin H., Singh Gitanjali, Hubbell Bryan, Brauer Michael, Anderson H. Ross, Smith Kirk R., Balmes John R., Bruce Nigel G., Kan Haidong, Laden Francine, Prüss-Ustün Annette, Turner Michelle C., Gapstur Susan M., Diver W. Ryan, and Cohen Aaron. An integrated risk function for estimating the global burden of disease attributable to ambient fine particulate matter exposure. Environmental Health Perspectives, 122(4):397–403, 2014. ISSN 15529924. doi: 10.1289/ehp.1307049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Cefalu Matthew, Dominici Francesca, Arvold Nils, and Parmigiani Giovanni. Model averaged double robust estimation. Biometrics, 73(2):410–421, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Chiodo Lisa M, Jacobson Sandra W, and Jacobson Joseph L. Neurodevelopmental effects of postnatal lead exposure at very low levels. Neurotoxicology and Teratology, 26:359–371, 2004. [DOI] [PubMed] [Google Scholar]
  12. Chua Kun Lin, Soh Shu E, Ma Stefan, and Lee Bee Wah. Pediatric Asthma Mortality and Hospitalization Trends Across Asia Pacific. World Allergy Organization Journal, 2:77–82, 2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Clayton CA, Perritt RL, Pellizzari ED, Thomas KW, Whitmore RW, Wallace LA, Ozkaynak H, and Spengler JD. Particle Total Exposure Assessment Methodology (PTEAM) study: distributions of aerosol and elemental concentrations in personal, indoor, and outdoor air samples in a southern California community. Journal of exposure analysis and environmental epidemiology, 3(2):227–250, 1993. ISSN 1053–4245. [PubMed] [Google Scholar]
  14. Crouse Dan L, Peters Paul A, Hystad Perry, Brook Jeffrey R, van Donkelaar Aaron, Martin Randall V, Villeneuve Paul J, Jerrett Michael, Goldberg Mark S, Pope C Arden, Brauer Michael, Brook Robert D, Robichaud Alain, Menard Richard, Burnett Richard T, and Burnett Richard T.. Ambient PM$_{2.5}$, O$_3$, and NO$_2$ Exposures and Associations with Mortality over 16 Years of Follow-Up in the Canadian Census Health and Environment Cohort (CanCHEC). Environmental health perspectives, 123(11):1180–6, November 2015. ISSN 1552–9924. doi: 10.1289/ehp.1409276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Crouse Dan L, Philip Sajeev, Van Donkelaar Aaron, Martin Randall V, Jessiman Barry, Peters Paul A, Weichenthal Scott, Brook Jeffrey R, Hubbell Bryan, and Burnett Richard T. A New Method to Jointly Estimate the Mortality Risk of Long-Term Exposure to Fine Particulate Matter and its Components. Nature Publishing Group, (November 2015):1–10, 2016. doi: 10.1038/srep18916. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Daniels Michael J., Dominici Francesca, Samet Jonathan M., and Zeger Scott L.. Estimating Particulate Matter-Mortality Dose-Response Curves and Threshold Levels: An Analysis of Daily Time-Series for the 20 Largest US Cities. American Journal of Epidemiology, 152(5):397–406, 2000. [DOI] [PubMed] [Google Scholar]
  17. Daniels Michael J, Dominici Francesca, Zeger Scott L, and Samet Jonathan M. The National Morbidity, Mortality, and Air Pollution Study. Part III: PM10 concentration-response curves and thresholds for the 20 largest US cities. Research report (Health Effects Institute), (94 Pt 3):1–21; discussion 23–30, May 2004. [PubMed] [Google Scholar]
  18. Denison DGT, Mallick BK, and Smith AFM. Automatic Bayesian curve fitting. Journal of the Royal Statistical Society, 60(2):333–350, 1998. [Google Scholar]
  19. Devos Stefanie, Cox Bianca, Van Lier Tom, Nawrot Tim S, and Putman Koen. Effect of the shape of the exposure-response function on estimated hospital costs in a study on non-elective pneumonia hospitalizations related to particulate matter. Environment International, 94:525–530, 2016. [DOI] [PubMed] [Google Scholar]
  20. Devries Rebecca, Kriebel David, and Sama Susan. Low level air pollution and exacerbation of existing copd: a case crossover analysis. Environmental Health, 15(98):1–11, 2016. ISSN 1476–069X. doi: 10.1186/s12940-016-0179-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Di Qian, Dai Lingzhen, Wang Yun, Zanobetti Antonella, Choirat Christine, Schwartz Joel D, and Dominici Francesca. Association of Short-Term Exposure to Air Pollution with Mortality in Older Adults. Journal of the American Medical Association, 318(24):2446–2456, 2017a. doi: 10.1001/jama.2017.17923.Association. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Di Qian, Wang Yan, Zabonetti Antonella, Wang Yun, Koutrakis Petros, Choirat Christine, Dominici Francesca, and Schwartz Joel D. Air Pollution and Mortality in the Medicare Population. New England Journal of Medicine, 376(26):2513–2522, 2017b. doi: 10.1056/NEJMoa1702747.Air. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Dimatteo Ilaria, Genovese Christopher R, and Kass Robert E. Bayesian curve-fitting with free-knot splines. Biometrika, 88(4):1055–1071, 2001. [Google Scholar]
  24. Dockery Douglas W and Spengler John D. Personal Exposure to Respirable Particulates and Sulfates. Journal of the Air Pollution Control Association, 31(2):153–159, 1981. ISSN 0002–2470. doi: 10.1080/00022470.1981.10465205. [DOI] [PubMed] [Google Scholar]
  25. Dominici Francesca, Daniels Michael, Zeger Scott L, and Samet Jonathan M. Air Pollution and Mortality: Estimating Regional and National Dose-Response Relationships. Journal of the American Statistical Association, 97 (457):100–111, 2002. [Google Scholar]
  26. Eftim Sorina E., Samet Jonathan M., Janes Holly, McDermott Aidan, and Dominici Francesca. Fine Particulate Matter and Mortality: A Comparison of the Six Cities and American Cancer Society Cohorts With a Medicare Cohort. Epidemiology, 19(2):209–216, March 2008. [DOI] [PubMed] [Google Scholar]
  27. Fazel Reza, Krumholz Harlan M., Wang Yongfei, Ross Joseph S., Chen Jersey, Ting Henry H., Shah Nilay D., Nasir Khurram, Einstein Andrew J., and Nallamothu Brahmajee K.. Exposure to Low-Dose Ionizing Radiation from Medical Imaging Procedures. New England Journal of Medicine, 361(9):849–857, August 2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Flores Carlos A, Flores-Lagunes Alfonso, and Gonzalez Arturo. Estimating the effects of length of exposure to instruction in a training program: the case of job corps. The Review of Economics and Statistics, 94(1):153–171, 2012. [Google Scholar]
  29. Garcia Cynthia A, Yap Poh-sin, Park Hye-youn, and Weller Barbara L. Association of long-term PM2.5 exposure with mortality using different air pollution exposure models: impacts in rural and urban California. International Journal of Environmental Health Research, 26(2):145–157, 2016. ISSN 0960–3123. doi: 10.1080/09603123.2015.1061113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Gelman Andrew and Rubin Donald B.. Inference from iterative simulation using multiple sequences. Statistical Science, 7(4):457–511, 1992. [Google Scholar]
  31. Gelman Andrew, Hwang Jessica, and Vehtari Aki. Understanding predictive information criteria for Bayesian models. Statistics and Computing, 24(6):997–1016, November 2014. [Google Scholar]
  32. Green PJ. Reversible jump Markov chain Monte Carlo computation and Bayesian model determination. Biometrica, 82:711–732, 1995. [Google Scholar]
  33. Hart Jaime E, Liao Xiaomei, Hong Biling, Puett Robin C, Yanosky Jeff D, Suh Helen, Kioumourt-zoglou Marianthi-anna, Spiegelman Donna, and Laden Francine. The association of long-term exposure to PM 2.5 on all-cause mortality in the Nurses’ Health Study and the impact of measurement-error correction. Environmental health, 14(38):1–9, 2015. doi: 10.1186/s12940-015-0027-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Hastie Trevor. gam: Generalized Additive Models, 2017. [DOI] [PubMed]
  35. Hastie Ttevor and Tibshirani Robert. Generalized Additive Models. Statistical Science, 1(3):297–318, 1986. [DOI] [PubMed] [Google Scholar]
  36. Hastie Trevor and Tibshirani Robert. Varying-Coefficient Models. Journal of the Royal Statistical Society. Series B, 55(4):757–796, 1993. [Google Scholar]
  37. Hastings WK. Monte Carlo sampling methods using Markov chains and their applications. Biometrika, 57(1), 1970. doi: 10.1093/biomet/57.1.97/284580. [DOI] [Google Scholar]
  38. Hirano Keisuke and Imbens Guido W. The Propensity Score with Continuous Treatments *. 2004. [Google Scholar]
  39. Imai Kosuke and Van Dyk David A. Causal Inference With General Treatment Regimes: Generalizing the Propensity Score. Journal of the American Statistical Association, 99(467):854–866, 2004. doi: 10.1198/016214504000001187. [DOI] [Google Scholar]
  40. Jerrett Michael, Turner Michelle C, Beckerman Bernardo S, Pope C Arden Iii, and Van Donkelaar Aaron. Comparing the Health Effects of Ambient Particulate Matter Estimated Using Ground-Based versus Remote Sensing Exposure Estimates. Environmental Health Perspectives, 125(4):552–559, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Joshua Sarath C and Garber Nicholas J. Estimating truck accident rate and involvements using linear and Poisson regression models. Transportation planning and Technology, 15(1):41–58, 1990. [Google Scholar]
  42. Jusko Todd A, Henderson Charles R, Lanphear Bruce P, Cory-Slechta Deborah A, Parsons Patrick J, and Canfield Richard L.. Blood Lead Concentrations < 10 mug/dL and Child Intelligence at 6 Years of Age. Environmental health perspectives, 116(2):243–8, February 2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Kennedy Edward H., Ma Zongming, McHugh Matthew D., and Small Dylan S.. Non-parametric methods for doubly robust estimation of continuous treatment effects. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 79(4):1229–1245, September 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Lee Mihye, Koutrakis Petros, Coull Brent A, Kloog Itai, and Schwartz Joel D.. Acute effect of fine particulate matter on mortality in three southeastern states 20072011. Journal of Exposure Science and Environmental Epidemiology, 26(2):173–179, 2016. doi: 10.1038/jes.2015.47.Acute. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Lim Chris C, Hayes Richard B, Ahn Jiyoung, Shao Yongzhao, Silverman Debra T, Jones Rena R, Garcia Cynthia, and Thurston George D. Association between long-term exposure to ambient air pollution and diabetes mortality in the US. Environmental Research, 165(February):330–336, 2018. ISSN 0013–9351. doi: 10.1016/j.envres.2018.04.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Liu-Smith Feng, Farhat Ahmed Majid, Arce Anthony, Ziogas Argyrios, Taylor Thomas, Wang Zi, Yourk Vandy, Liu Jing, Wu Jun, Mceligot Archana J, Anton-Culver Hoda, and Meyskens Frank L. Gender differences in the association between the incidence of cutaneous melanoma and geographic UV exposure. Journal of the American Academy of Dermatology, 76(3):499–505, 2016. doi: 10.1016/j.jaad.2016.08.027. URL http://www.temis.nl/uvradiation/SCIA/stations{_}uv.html. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. De Luna Xavier, Waernbaum Ingeborg, and Richardson Thomas S.. Covariate selection for the nonparametric estimation of an average treatment effect. Biometrika, 98(4):861–875, 2011. [Google Scholar]
  48. Mackenzie Ross Sarah Jane, Brewin Chris Ray, Curran Helen Valerie, Furlong Clement Eugene, Abraham-Smith Kelly Michelle, and Harrison Virginia. Neuropsychological and psychiatric functioning in sheep farmers exposed to low levels of organophosphate pesticides. Neurotoxicology and teratology, 32(4):452–9, 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Madigan David, York Jeremy, and Allard Denis. Bayesian Graphical Models for Discrete Data. International Statistical Review, 63(2):215–232, 1995. ISSN 03067734. doi: 10.2307/1403615. [DOI] [Google Scholar]
  50. Makar Maggie, Antonelli Joseph, Di Qian, Cutler David, and Schwartz Joel. Estimating the Causal Effect of Fine Particulate Matter Levels on Death and Hospitalization: Are Levels Below the Safety Standards Harmful? Epidemiology, 28(5):627–634, 2018. doi: 10.1097/EDE.0000000000000690.Estimating. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Meng Qing Yu, Spector Dalia, Colome Steven, and Turpin Barbara. Determinants of Indoor and Personal Exposure to PM 2.5 of Indoor and Outdoor Origin during the RIOPA Study. Atmos Environ, 43(36):5750–5758, 2009. doi: 10.1016/j.atmosenv.2009.07.066. URL https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2842982/pdf/nihms159887.pdf. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Metropolis Nicholas, Rosenbluth Arianna W, Rosenbluth Marshall N, Teller Augusta H, and Teller Edward. Equation of State Calculations by Fast Computing Machines. The Journal of Chemical Physics, 21(6):1087–1092, 1953. doi: 10.1063/1.439486. [DOI] [Google Scholar]
  53. Mohamedshah Yusuf M, Paniati Jeffery F, and Hobeika Antoine G. Truck Accident Models for Interstates and Two-Lane Rural Roads. TRANSPORTATION RESEARCH RECORD, 1407:35–41, 1993. [Google Scholar]
  54. Nasari Masoud M., Szyszkowicz Mieczyslaw, Chen Hong, Crouse Daniel, Turner Michelle C., Jerrett Michael, Pope C. Arden, Hubbell Bryan, Fann Neal, Cohen Aaron, Gapstur Susan M., Diver W. Ryan, Stieb David, Forouzanfar Mohammad H., Kim Sun-Young, Olives Casey, Krewski Daniel, and Burnett Richard T.. A class of non-linear exposure-response models suitable for health impact assessment applicable to large cohort studies of ambient air pollution. Air Quality, Atmosphere & Health, 9(8):961–972, December 2016. ISSN 1873–9318. doi: 10.1007/s11869-016-0398-z. URL http://link.springer.com/10.1007/s11869-016-0398-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. National Research Council. Health Risks from Exposure to Low Levels of Ionizing Radiation: BEIR VII Phase 2. The National Academies Press, Washington, D.C., March 2006. [PubMed] [Google Scholar]
  56. Neyman Jerzy. On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9. Statistical Science, 5(4):465–480, 1923. [Google Scholar]
  57. Pinault Lauren, Tjepkema Michael, Crouse Daniel L., Weichenthal Scott, Aaron van Donkelaar Randall V. Martin, Brauer Michael, Chen Hong, and Burnett Richard T.. Risk estimates of mortality attributed to low concentrations of ambient fine particulate matter in the Canadian community health survey cohort. Environmental Health, 15 (1):18, December 2016. ISSN 1476–069X. doi: 10.1186/s12940-016-0111-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Pinault Lauren L., Weichenthal Scott, Crouse Daniel L., Brauer Michael, Erickson Anders, Aaron van Donkelaar Randall V. Martin, Hystad Perry, Chen Hong, Philippe Finès Jeffrey R. Brook, Tjepkema Michael, and Burnett Richard T.. Associations between fine particulate matter and mortality in the 2001 Canadian Census Health and Environment Cohort. Environmental Research, 159:406–415, November 2017. ISSN 0013–9351. doi: 10.1016/J.ENVRES.2017.08.037. [DOI] [PubMed] [Google Scholar]
  59. Arden Pope C, Cropper Maureen, Coggins Jay, and Cohen Aaron. Health benefits of air pollution abatement policy: Role of the shape of the concentration-response function. Journal of the Air & Waste Management Association, 65(5):516–522, 2015. ISSN 2162–2906. doi: 10.1080/10962247.2014.993004. URL https://www.tandfonline.com/action/journalInformation?journalCode=uawm20. [DOI] [PubMed] [Google Scholar]
  60. Raftery Adrian E.. Bayesian Model Selection in Social Research. Sociological Methodology, 25:111–163, 1995. [Google Scholar]
  61. Raftery Adrian E, Madigan David, and Hoeting J. Bayesian model averaging for linear regression models. Journal of the American Statistical Association, 92(437):179–191, 1997. ISSN 0162–1459. doi: 10.1080/01621459.1997.10473615. [DOI] [Google Scholar]
  62. Rubin Donald B.. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology, 66(5):688–701, 1974. [Google Scholar]
  63. Rubin Donald B. Randomization Analysis of Experimental Data: The Fisher Randomization Test Comment. Source Journal of the American Statistical Association, 75(371):591–593, 1980. [Google Scholar]
  64. Savje Fredrik, Aronow Peter M, and Hudgens Michael G. Average treatment effects in the presence of unknown interference. 2018. URL https://arxiv.org/pdf/1711.06399.pdf. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Schafer Joseph. causaldrf: Tools for Estimating Causal Dose Response Functions, 2015. [Google Scholar]
  66. Scholze Martin, Boedeker Wolfgang, Faust Michael, Backhaus Thomas, Altenburger Rolf, and L Horst. A general best-ft method for concentration-response curves and the estimation of low-effect concentrations. Environmental Toxicology and Chemistry, 20(2):448–457, 2001. [PubMed] [Google Scholar]
  67. Schwartz Joel, Laden Francine, and Zanobetti Antonella. The Concentration-Response Relation between PM2.5 and Daily Deaths. Environmental Health Perspectives, 110(10):1025–1029, 2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Schwartz Joel, Bind Marie-abele, and Koutrakis Petros. Estimating Causal Effects of Local Air Pollution on Daily Deaths: Effect of Low Levels. Environmental Health Perspectives, 125(1):23–29, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Schwartz Joel, Fong Kelvin, and Zanobetti Antonella. A National Multicity Analysis of the Causal Effect of Local Pollution, NO2, and PM2.5 on Mortality. Environmental Health Perspectives, 126(2):1–10, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Shaddick Gavin, Lee Duncan, Zidek James V., and Salway Ruth. Estimating exposure response functions using ambient pollution concentrations. Annals of Applied Statistics, 2(4):1249–1270, 2008. [Google Scholar]
  71. Shi Liuhua, Zanobetti Antonella, Kloog Itai, Brent A Coull Petros Koutrakis, Melly Steven J, and Schwartz Joel D. Low-Concentration PM2.5 and Mortality: Estimating Acute and Chronic Effects in a Population-Based Study. Environmental health perspectives, 124(1):46–52, January 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Thurston George D., Ahn Jiyoung, Cromar Kevin R., Shao Yongzhao, Reynolds Harmony R., Jerrett Michael, Lim Chris C., Shanley Ryan, Park Yikyung, and Hayes Richard B.. Ambient Particulate Matter Air Pollution Exposure and Mortality in the NIH-AARP Diet and Health Cohort. Environmental Health Perspectives, 124:484–90, September 2016. ISSN 0091–6765. doi: 10.1289/ehp.1509676. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Van Der Laan Mark J, Polley Eric C, and Hubbard Alan E. Super Learner. Statistical Applications in Genetics and Molecular Biology, 6(1), 2007. [DOI] [PubMed] [Google Scholar]
  74. Van Der Oost Ron, Beyer Jonny, and Vermeulen Nico P E. Fish bioaccumulation and biomarkers in environmental risk assessment: a review. Environmental Toxicology and Pharmacology, 13:57–149, 2003. [DOI] [PubMed] [Google Scholar]
  75. Vansteelandt Stijn, Bekaert Maarten, and Claeskens Gerda. On model selection and model misspecification in causal inference. Statistical methods in medical research, 21(1):7–30, 2012. [DOI] [PubMed] [Google Scholar]
  76. Vodonos Alina, Awad Yara Abu, and Schwartz Joel. The concentration-response between long-term PM 2 . 5 exposure and mortality; A meta-regression approach. Environmental Research, 166(December 2017):677–689, 2018. ISSN 0013–9351. doi: 10.1016/j.envres.2018.06.021. URL 10.1016/j.envres.2018.06.021. [DOI] [PubMed] [Google Scholar]
  77. Wang Chi, Parmigiani Giovanni, and Dominici Francesca. Bayesian Effect Estimation Accounting for Adjustment Uncertainty. Biometrics, 68(3):661–671, 2012. [DOI] [PubMed] [Google Scholar]
  78. Wang Chi, Dominici Francesca, Parmigiani Giovanni, and Zigler Corwin Matthew. Accounting for uncertainty in confounder and effect modifier selection when estimating average causal effects in generalized linear models. Biometrics, 71(3):654–665, 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Wang Yan, Shi Liuhua, Lee Mihye, Liu Pengfei, Di Qian, Zabonetti Antonella, and Schwartz Joel D.. Long-term exposure to PM2.5 and mortality among older adults in the Southeastern US. Epidemiology, 28(2):207–214, 2018. doi: 10.1097/EDE.0000000000000614.Long-term. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Watanabe Sumio. Asymptotic Equivalence of Bayes Cross Validation and Widely Applicable Information Criterion in Singular Learning Theory. Journal of Machine Learning Research, 11:3571–3594, 2010. [Google Scholar]
  81. Weichenthal Scott, Kulka Ryan, Lavigne Eric, David Van Rijswijk Michael Brauer, Paul J Villeneuve Dave Stieb, Joseph Lawrence, and Burnett Rick T. Biomass Burning as a Source of Ambient Fine Particulate Air Pollution and Acute Myocardial Infarction. Epidemiology, 28(3):329–337, 2017. doi: 10.1097/EDE.0000000000000636. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Wilson Ander and Reich Brian J.. Confounder selection via penalized credible regions. Biometrics, 70(4):852–861, 2014. [DOI] [PubMed] [Google Scholar]
  83. Zanobetti Antonella and Schwartz Joel. Particulate air pollution, progression, and survival after myocardial infarction. Environmental health perspectives, 115(5):769–75, May 2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Zeger Scott L, Dominici Francesca, McDermott Aidan, and Samet Jonathan M. Mortality in the Medicare population and chronic exposure to fine particulate air pollution in urban centers (2000–2005). Environmental health perspectives, 116(12):1614–9, December 2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Zigler Corwin M. and Dominici Francesca. Point: Clarifying Policy Evidence With Potential-Outcomes Thinking-Beyond Exposure-Response Estimation in Air Pollution Epidemiology. American Journal of Epidemiology, 180 (12):1133–1140, 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES