Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2020 Sep 1.
Published in final edited form as: Biometrics. 2019 Apr 13;75(3):778–787. doi: 10.1111/biom.13049

Causal inference with interfering units for cluster and population level treatment allocation programs

Georgia Papadogeorgou 1, Fabrizia Mealli 2, Corwin M Zigler 3
PMCID: PMC6784535  NIHMSID: NIHMS1053236  PMID: 30859545

Abstract

Interference arises when an individual’s potential outcome depends on the individual treatment level, but also on the treatment level of others. A common assumption in the causal inference literature in the presence of interference is partial interference, implying that the population can be partitioned in clusters of individuals whose potential outcomes only depend on the treatment of units within the same cluster. Previous literature has defined average potential outcomes under counterfactual scenarios where treatments are randomly allocated to units within a cluster. However, within clusters there may be units that are more or less likely to receive treatment based on covariates or neighbors’ treatment. We define new estimands that describe average potential outcomes for realistic counterfactual treatment allocation programs, extending existing estimands to take into consideration the units’ covariates and dependence between units’ treatment assignment. We further propose entirely new estimands for population-level interventions over the collection of clusters, which correspond in the motivating setting to regulations at the federal (vs. cluster or regional) level. We discuss these estimands, propose unbiased estimators and derive asymptotic results as the number of clusters grows. For a small number of observed clusters, a bootstrap approach for confidence intervals is proposed. Finally, we estimate effects in a comparative effectiveness study of power plant emission reduction technologies on ambient ozone pollution.

Keywords: air pollution, interference, inverse probability weighting, policy evaluation

1 |. INTRODUCTION

Most causal inference literature assumes that a unit’s potential outcome depends solely on its treatment, and not on the treatments of other units in the population. However, this assumption is often not reasonable. Perhaps the most classical example arises in vaccination studies (Ali et al., 2005; Hudgens and Halloran, 2008) where a unit’s disease status depends on their own vaccination status but also on the vaccination status of others. The presence of interference can lead to estimated quantities from familiar estimators that are misleading (Sobel, 2006) and lack clear causal interpretation (Tchetgen Tchetgen and VanderWeele, 2012), but can also introduce new estimands of intrinsic scientific interest.

Sobel (2006) defined estimands assuming partial interference referring to situations where the population can be partitioned into clusters (groups) for which a unit’s potential outcomes depend only on the treatment of units within the same cluster. Hudgens and Halloran (2008) formalized causal inference in this setting and in the context of two-stage randomization designs, extended to observational studies by Tchetgen Tchetgen and VanderWeele (2012).

In order to continue development in the context of observational studies, we highlight a key distinction that arises when formulating average potential outcomes in the presence of interference. We use the term treatment allocation strategy to refer to a process giving rise to either observed or hypothesized vectors of treatment assignments. The observed treatment allocation strategy refers to that which gives rise to observed treatments. The counterfactual treatment allocation strategy refers to how treatments may have been assigned in some hypothesized counterfactual world for which causal contrasts are considered. This distinction between observed and counterfactual treatment allocation programs helps illuminate that existing causal estimands are limited to counterfactual treatment allocation programs that remain agnostic with regard to covariate information. These estimands ignore the possible role of unit-level covariates that relate to treatment adoption, implicitly assuming an intervention manipulating each individual unit’s treatment propensity.

In many settings, however, treatment allocations corresponding to unit-level manipulation are difficult to conceive. For example, policy interventions may be designed to increase the regional prevalence of a treatment without direct control over the individual treatment propensity. In such settings, individual treatment adoption might generally depend on unit-level covariates or the treatment status of neighboring units. To address such settings, we develop new causal estimands anchored to counterfactual treatment allocations that correspond to realistic regional interventions. These interventions are conceived at the cluster level, where a particular allocation strategy dictates the cluster-average propensity of receiving treatment without directly specifying individual-level treatment propensities. Note that, in focusing on new estimands for covariate-dependent counterfactual treatment allocation programs, our work has commonalities with independent work in Barkley et al. (2017). Defining estimands for realistic treatment allocations has received limited attention in the causal inference literature with an exemption found in van der Laan and Petersen (2007).

In addition to extending existing estimands to accommodate realistic regional treatment allocations, this work provides entirely novel estimands motivated by the desire to investigate interventions which take place at the population (vs. regional) level. These estimands are useful for evaluating policies that change the distribution of cluster-average propensities of treatment by, for example, providing a population-wide incentive to adopt treatment.

Definition of the new causal estimands described above is accompanied here by new estimators, derivation of corresponding asymptotic properties as the number of clusters grows, and a bootstrap approach to inference when the number of observed clusters is small. Related work can be found in Ferracci et al. (2014), in Liu and Hudgens (2014) where asymptotic results are derived for growing number of clusters or number of individuals within clusters, in Perez-Heydrich et al. (2015) where large sample variance estimators for the estimator of Tchetgen Tchetgen and VanderWeele (2012) are derived, and in Liu et al. (2016) where estimands and estimators are extended to a network where partial interference does not hold, but asymptotic results are derived under the assumption of partial interference.

The motivating context for this work is the evaluation of interventions to limit harmful pollution from power plants. The movement of air pollution through space leads to interference: intervening on one power plant can affect the air pollution surrounding nearby power plants. Importantly, pollution regulations often work by incentivizing regions of power plants to adopt certain technologies without mandating treatment adoption at each individual plant, which, in reality, is heavily influenced by individual power plant characteristics (e.g., size). In addition to regional strategies, federal pollution regulations can provide national incentives for power plants to install technologies, motivating the new population-level estimands.

The new estimators are deployed here to an analysis of U.S. power plants investigating the comparative effectiveness of Selective Catalytic or non-Catalytic Reduction (SCR/SNCR) systems relative to other strategies for reducing ambient ozone pollution. A preliminary investigation of these same data in Papadogeorgou et al. (2018) ignored interference and indicated that these systems causally reduced NOx emissions (an important precursor to ozone pollution) but estimated an effect on ambient ozone very close to zero. The analysis here to address the possibility of interference produces meaningfully different results that are more consistent with the literature relating NOx emissions to ambient ozone pollution.

2 |. ESTIMANDS UNDER PARTIAL INTERFERENCE

We adopt the notation used in Tchetgen Tchetgen and VanderWeele (2012). Let N be the number of clusters, and ni the number of units in cluster i, i ∈ {1, 2, …, N}. Furthermore, let Ai=(Ai1,Ai2,,Aini)A(ni) denote the cluster treatment vector, and Ai,j=(Ai1,Ai2,,Aij1,Aij+1,,Aini)A(ni1) the treatment of all units in cluster i apart from unit j, where A(n)={0,1}n. Furthermore, let Lij be a vector of individual and cluster-level covariates, and Li=(Li1,Li2,,Lini) be the collection of covariates of all units within cluster i. Under the assumption of partial interference, the potential outcome of unit j in cluster i may depend on the treatment of units in cluster i, but not on the treatment of units in different clusters. For every i we postulate the existence of group i’s potential outcomes Yi()={Yi(ai),aiA(ni)}, where Yi(ai)=(Yi1(ai),Yi2(ai),,Yini(ai)).

2.1 |. Average potential outcome

We define the individual average potential outcome for a counterfactual treatment allocation strategy with two features: (1) treatment assignment for units within a cluster is unlikely to be independent, and (2) individual covariates can be predictive of a unit’s treatment probability. Let Pα,L represent the (arbitrarily specified) counterfactual treatment allocation program, specified intentionally to depend on covariates and/or allow correlated assignments, and whose important features are represented by parameters α. For the purpose of this paper, we consider α to represent the cluster-average propensity of treatment.

The individual average potential outcome for unit j in cluster i is defined as:

Y¯ijL(a;α)=sA(ni1)Yij(Aij=a,Ai,j=s)Pα,L(Ai,j=s|Aij=a,Li), (1)

representing the unit’s expected outcome in the counterfactual world where treatment is assigned according to Pα,L, but treatment of unit j is set to a. The group average potential outcome for cluster i is defined as Y¯iL(a;α)=1ninij=1Y¯ijL(a;α). Assuming that clusters are observed from a super-population according to a distribution G0, the population average potential outcome is defined as Y¯L(a;α)=EG0{Y¯iL(a;α)}. Note that these estimands are well-defined for any fixed choice of Pα,L. Definitions of the group and population average potential outcomes implicitly require that, when a cluster is observed, data are collected on all the units belonging to that cluster.

In Web Appendix E, we discuss the sample average potential outcome defined in the literature as the (weighted) average of the observed clusters’ group average potential outcomes. There, we provide an argument for a definition (and corresponding estimator) which gives equal-weight to all clusters irrespective of their cluster size.

2.2 |. The counterfactual treatment allocation in existing literature

The above development has left unspecified the term Pα,L in (1) providing relative weights to different cluster treatment vectors in the individual average potential outcomes. Specifying Pα,L(ai|Li)=nij=1αaij(1α)1aij would correspond to the estimands in Tchetgen Tchetgen and VanderWeele (2012) and Perez-Heydrich et al. (2015) that give equal probability to all cluster treatment vectors with the same number of treated units, irrespective of which those units are. For this choice of Pα,L, the estimands represent quantities in a counterfactual world where individual treatment propensity can be manipulated and units are assigned to treatment independently and with equal probability α.

2.3 |. Realistic counterfactual treatment allocation program

However, in some situations, counterfactual treatment allocations can only be realistically conceived to depend on covariates or incorporate correlation between treatment of units in the same cluster. In the study of power plant interventions, the decision of whether to “treat” a power plant is at the discretion of the power company and heavily influenced by power plant covariates. Therefore, a hypothesized counterfactual treatment allocation is realistic only when such covariates are incorporated. For example, consider the covariate ‘heat input’, a proxy for the size of the power plant, and let Lij be the heat input of power plant j in cluster i. One specification of a counterfactual treatment allocation strategy that would acknowledge that different-sized power plants are more or less likely to adopt treatment is:

logit Pα,L(Aij=1|Lij)=ξiα+δLij, (2)

for some fixed value of δ, and ξiα such that (ni1)j=1niexpit (ξiα+δLij)=α . The value δ could be specified according to knowledge of how the size of the power plant is expected to impact the propensity to adopt treatment. For δ = 0 the estimands coincide with the ones for independent Bernoulli counterfactual treatment allocations found in Tchetgen Tchetgen and VanderWeele (2012). Note that the covariates included in the counterfactual treatment allocation are at the discretion of the investigator based on judgments about which allocations might correspond to the policies of interest.

Based on (2), the probability of the cluster treatment vector under the counterfactual treatment allocation Pα,L (Ai = ai|Li) could be fully specified by hypothesizing that, in the counterfactual world, the Aij ‘s are conditionally independent given Li, and Aij is conditionally independent of Li,−j given Lij. Then, Pα,L(Ai=ai|Li)=j=1niPα,L(Aij=aij|Lij) which, in turn, specifies Pα,L (Ai, −j = s|Aij = a,Li) for all sA(ni1) giving relative weights in the specification of the individual average potential outcome (1). Based on this specification of Pα,L, the estimands of interest correspond to quantities in a hypothesized world where treatment within a cluster is assigned independently across units with treatment propensity that depends on Lij, but is on average equal to α.

Alternatively, a counterfactual treatment allocation strategy could incorporate dependence of treatments of nearby units by, for example, including a spatial random effect in Pα,L. A data-driven way to choose Pα,L is presented in Section 6.

2.4 |. Direct and indirect effects

Contrasts of average potential outcomes can characterize how treatment affects the outcome of interest. For counterfactual allocation strategy Pα,L, direct effects represent contrasts in average potential outcomes when only the individual treatment changes. On the other hand, indirect effects (also known as spillover effects) contrast average potential outcomes for a fixed level of individual treatment, but different specification of the parameter α governing Pα,L. For that reason, indirect effects represent expected changes in potential outcomes for changes only in the “treatment of neighbors”, and they constitute a measure of interference. Based on the average potential outcomes defined above, one can define the individual direct effect as DEijL(α)=Y¯ijL(1;α)Y¯ijL(0;α),, and the corresponding group and population direct effects as DEiL(α)=Y¯iL(1;α)Y¯iL(0;α), and DEL(α)=Y¯L(1;α)Y¯L(0;α)=EG0{DEiL(α)} accordingly. Similarly, the individual indirect effect is specified as IEijL(α1,α2)=Y¯ijL(0;α2)Y¯ijL(0;α1), based on which group and population indirect effects can be defined. Indirect effects could be alternatively specified for individual treatment assignment a = 1, but here our focus is on the effect of neighbors’ treatment in the areas surrounding untreated power plants. Further, total effects can be defined as the sum of direct and indirect effects, while similar development can lead to the definition of overall effects as contrasts of the individual average overall potential outcomes Y¯ijL,O(α)=sA(ni)Yij(s)Pα,L(s|Li). The overall effects (and corresponding estimators) are discussed in detail in Web Appendix G.

3 |. POPULATION-LEVEL COUNTERFACTUAL DISTRIBUTION OF CLUSTER-AVERAGE PROPENSITY OF TREATMENT

Development in Section 2 pertained to estimands that fix the cluster-average propensity of treatment, α, corresponding to counterfactual worlds were one intervenes at the level of the cluster but units within the cluster choose their own treatment. Here we develop completely new estimands for settings where the cluster average propensity of treatment is not fixed to a specific value α, but arises from a hypothesized distribution of cluster-level propensities.

Such estimands correspond to interventions that occur at an administrative level higher than the cluster. For example, a federal regulation targeting all clusters might incentivize increased treatment coverage without enforcing a specific average propensity of treatment for any particular geographic area, leading to an overall shift in the distribution of cluster average propensity of treatment from some distribution Fα1 to Fα2, like in Figure 1.

FIGURE 1.

FIGURE 1

Distributions of cluster average propensity of treatment. A federal intervention might target the increase of cluster treatment coverage targeting a shift in the distribution of cluster average treatment propensity from Fα1 to Fα2.

Let Fα (·) denote a hypothesized distribution of cluster-average propensity of treatment. Then, define the Fα-individual average potential outcome as Y¯ijL(a;Fα)=Y¯ijL(a;α)dFα(α). Thus, Y¯ijL(a;Fα) describes the average potential outcome of unit j in cluster i, for cluster average propensity of treatment arising from Fα and individual treatment a. Consequently, the Fα-group and population average potential outcomes are defined as

Y¯iL(a;Fα)=1nij=1niY¯ijL(a;Fα), (3)

and

Y¯L(a;Fα)=EG0{Y¯iL(a;Fα)}, (4)

accordingly. We focus on the contrast of Fα-population average potential outcomes for hypothesized distributions of cluster-average propensity of treatment Fα1 and Fα2,

IEL(Fα1,Fα2)=Y¯L(0;Fα2)Y¯L(0;Fα1). (5)

Such estimands are of primary interest in the evaluation of air pollution policies, since IEL(Fα1,Fα2) represents the expected change in air pollution concentration surrounding control units for a federal intervention to increase emission control technology coverage. Direct effect estimands based on the Fα-population average potential outcome could also be defined as DE(Fα)=Y¯L(1;Fα)Y¯L(0;Fα), but are not pursued here.

4 |. ESTIMATING THE POPULATION AVERAGE POTENTIAL OUTCOME

For a fixed choice of Pα,L, we provide estimators of the population average potential outcome Y¯L(a;α), unbiasedness and consistency results, and derive the asymptotic distribution when the number of clusters increases to infinity for a known or estimated (and correctly specified) parametric cluster-propensity score model (defined below). These results are used to derive estimators (and their asymptotic properties) for the estimands in Section 2.4 and the population level estimands in Section 3. Asymptotic results are derived to simplify inference, since analytic finite sample unbiased variance estimators cannot be derived without additional assumptions on the interference structure (Hudgens and Halloran, 2008). For finite sample confidence intervals, we discuss a bootstrap approach. Proofs are in Web Appendix C.

We start by making the consistency of potential outcomes assumption, and cluster-level positivity and conditional ignorability assumptions:

  • Assumption 1. Consistency of potential outcomes. Yij = Yij(Ai) with probability 1.

  • Assumption 2. Positivity. There exists ρ > 0 such that the probability of observing cluster treatment vector ai given cluster covariates Li, fA|L,i(Ai = ai|Li) > ρ with probability 1. fA|L,i is the cluster-propensity score.

  • Assumption 3. Conditional ignorability. For G0, AiYi(·)|Li.

These assumptions are adaptations of standard assumptions required for causal inference. A crucial difference is that in settings of partial interference these assumptions are made at the level of the cluster, rather than at the level of the individual. Note that the covariates Li in Assumption 3 do not need to coincide with the ones included in the counterfactual treatment allocation Pα,L.

The inclusion of cluster subscript i in the notation for fA|L,i depends on the assumed form of the propensity score. In the analysis of clustered data, a cluster-specific random intercept is often included in the propensity score model (as will be discussed in Section 6.1), making the dependence of fA|L,i on i explicit.

4.1 |. Estimators of the group and population average potential outcome

Let Y^iL(a;α)=1nij=1niPα,L(Ai,j|Aij=a,Li)fA|L,i(Ai|Li)I(Aij=a)Yij, and Y^L(a;α)=1Ni=1NY^iL(a;α). For the true cluster propensity score fA|L,i(·|Li) and if Assumptions 2 and 3 hold, Y^iL(a;α) is unbiased for Y¯iL(a;α). Unbiasedness is derived with respect to the distribution of the observed treatment assignment.

4.2 |. Asymptotic results for population average potential outcome estimators

We derive the asymptotic properties of the estimator Y^L(a;α) for an increasing number of clusters N, where G0 represents the joint distribution of the independent and identically distributed random vectors (Yi(·), Ai, Li). (For notational simplicity, ni is included in Li.)

We denote the estimator Y^L(a;α) based on N clusters as Y^NL(a;α). and derive the asymptotic distribution of μ^N01(α)=(Y^NL(0;α),Y^NL(1;α))T using the estimated correctly-specified parametric propensity score model. The cluster-propensity score for the observed treatment vector will be denoted by fA|L, i(Ai|Li; γ) where γ are the propensity score model parameters.

Theorem 1. Let μ001(α)=(Y¯L(0;α),Y¯L(1;α))T. Assume that Assumptions 2, 3 hold, the outcome is bounded (there exists M > 0 : |Yij| < M with probability 1), the parametric form of the propensity score model indexed by γ, fA|L,i(ai|li; γ), is correctly specified and differentiable with respect to γ with true parameters γ0, and μ^N01(α) is calculated using γ^, the MLE of γ. Let ψγ(li,ai;γ)=γTlogf(ai|li;γ) be the score functions. Assume that:

  1. γ0 is in an open subset of the Euclidean space,

  2. γψγ(li, ai; γ) is twice continuously differentiable ∀(li, ai),

  3. EG0ψγ(Li,Ai;γ0)22<,

  4. EG0{γTψγ(Li,[i;A)]|γ=γ0} exists and is non-singular, and

  5. ∃ measurable integrable function ψ¨γ(li,ai) fixed such that ψ¨γ dominates the second partial derivatives of ψγ for all γ in a neighborhood of γ0.

Then, N{μ^N01(α)μ001(α)}dN(0,W(γ0,μ001(α))), where

W(γ,μ)=V(γ,μ)+A21B111A21T+A21B111B12+(A21B111B12)T,
V(γ,μ)=EG0{ψ01,α(Yi,Li,Ai;μ,γ)ψ01,α(Yi,Li,Ai;μ,γ)T},
ψ01,α(Yi,Li,Ai;μ,γ)=(ψ0,α(Yi,Li,Ai;μ0,γ),ψ1,α(Yi,Li,Ai;μ1,γ))T
ψa,α(Yi,Li,Ai;μa,γ)=1nij=1niPα,L(Ai,j|Aij=a,Li)fA|L,i(Ai|Li;γ)I(Aij=a)YijμaA21=A21(γ,μ)=EG0{ψ0,α(Yi,Li,Ai;μ0,γ)/γψ1,α(Yi,Li,Ai;μ1,γ)/γ}T,
B11=B11(γ)=EG0{ψγ(Li,Ai;γ)ψγ(Li,Ai;γ)T},
B12=B12(γ,μ)=EG0{ψγ(Li,Ai;γ)ψ0,α(Yi,Li,Ai;μ0,γ),ψγ(Li,Ai;γ)ψ1,α(Yi,Li,Ai;μ1,γ)},

for μ = (μ0, μ1).

Even though the MLE γ^ and the score equations were used in Theorem 1, any consistent M-estimator of ψ-type can be used, as long as its objective function ψγ(li, ai; γ) satisfies the theorem’s conditions.

For a known propensity score, asymptotic results follow by considering fA|L,i(Ai|Li; γ) as fixed with respect to γ, leading to W(γ0,μ001(α))=V(γ0,μ001(α)). In both cases, Theorem 1 leads to the approximation μ^N01(α)~MVN2(μ001(α),N1W(γ0,μ001(α))) for large number of clusters. Even if assumptions about G0 are made, W(γ0,μ001(α)) is hard to calculate analytically. Instead, W(γ0,μ001(α)) can be estimated using W^(γ^,μ^N01(α)), where W^(γ,μ) is the matrix W(γ, μ) where all expectations are substituted with the empirical expectations. For example, B^11(γ)=1Ni=1Nψγ(Li,Ai;γ)ψγ(Li,Ai;γ)T. Under regularity conditions, discussed in Iverson and Randles (1989), W^(γ^,μ^N01(α)) will be consistent for W(γ0,μ001(α)). Using Theorem 1, one can obtain the asymptotic distribution of a contrast between Y^L(0;α),Y^L(1;α) specifying a direct effect by an application of the multivariate delta method.

Next, we derive the asymptotic distribution for μ^IE(α0,α1)=(Y^NL(0;α0),Y^NL(0;α1))T.

Theorem 2. Let μ0IE(α0,α1)=(Y¯L(0;α1),Y¯L(0;α2))T. If the assumptions of Theorem 1 hold, N{μ^IE(α0,α1)μ0IE(α0,α1)}N(0,Q(γ0,γ0IE(α0,α1))), where

Q(γ,μ)=D22(γ,μ)+C21B111C21T+C21B111D12+(C21B111D12)T
D22(γ,μ)=EG0{ψ0,α1,α2(Yi,Li,Ai;μ,γ)ψ0,α1,α2(Yi,Li,Ai;μ,γ)T}
ψ0,α1,α2(Yi,Li,Ai;μ,γ)=(ψ0,α1(Yi,Li,Ai;μ0,γ),ψ0,α2(Yi,Li,Ai;μ0,γ))T
D12=D12(γ,μ)=EG0{ψγ(Li,Ai;γ)×ψ0,α1(Yi,Li,Ai;μ0,γ),
ψγ(Li,Ai;γ)ψ0,α2(Yi,Li,Ai;μ0,γ)},
C21=C21(γ,μ)=E{ψ0,α1(Yi,Li,Ai;μ0,γ)/γψ0,α2(Yi,Li,Ai;μ0,γ)/γ},

and B11 as in Theorem 1.

4.3 |. Estimators and asymptotic results for the population-level estimands

Similar arguments lead to estimators of the Fα-group and population average potential outcomes in (3) and (4) as

Y^iL(a;Fα)=Y^iL(a;α)dFα(α)=1nij=1niPFα,L(Ai,j|Aij=a,Li)fA|L,i(Ai|Li)I(Aij=a)Yij,
Y^L(a;Fα)=Y^L(a;α)dFα(α)=1Ni=1niY^iL(a;Fα)

accordingly, where PFα,L(Ai,j|Aij=a,Li)=Pα,L(Ai,j|Aij=a,Li)dFα(α).

Assume that Fα1, Fα2 are discrete distributions taking values αk ∈ (0, 1), k = 1, 2, …, K, with probability p1k and p2k accordingly, such that k=1Kphk=1, h = 1, 2. Then, Y¯L(0;Fαh)=k=1KphkY¯L(0;αk), IEL(Fα1,Fα2)=k=1K(p2kp1k)Y¯L(0;αk) and a consistent estimator of IEL(Fα1,Fα2) is IE^L(Fα1,Fα2)=k=1K(p2kp1k)Y¯L(0;αk). The asymptotic distribution of IE^L(Fα1,Fα2) follows by arguments similar to the ones in Theorem 2 (Web Appendix D).

Although the estimands in Section 3 are well-defined for any distribution Fα, reliable estimation requires sufficiently overlapping support of Fα with the empirical distribution F^α.

4.4 |. Finite sample variance estimator

The asymptotic distributions derived above can be used to compute Wald-type confidence intervals for a sample with a large number of clusters. However, in settings with a small number of observed clusters (as in the study of Section 6 with 50 clusters), a bootstrap (Efron, 1979) approach to finite sample confidence intervals might be preferable.

Note that the population average potential outcome Y¯L(a;α) is defined as the mean of the cluster quantity Y¯iL(a;α). Thinking of the clusters as the unit of observation, uncertainty around Y^L(a;α) can be acquired employing a bootstrap approach commonly used for mean estimators. Here, B new data sets are created, each consisting of the same number of clusters as in the original data, sampled with replacement from the observed ones. Confidence intervals for Y^L(a;α) are constructed using the quantiles of the B bootstrap estimates, for large B. Confidence intervals for all estimators discussed here can be obtained similarly.

5 |. SIMULATIONS

Web Appendix A presents a simulation study to evaluate the operating characteristics of the estimators based on the true and estimated propensity score and using the asymptotic variance and bootstrap approach to confidence intervals. We construct a super-population of 2000 clusters, from which N ∈ {50, 300} clusters are sampled randomly. The simulations show that, when the propensity score is known, the estimators are close to unbiased, and the 95% confidence intervals achieve the nominal coverage for both the asymptotic and bootstrap approach. When the propensity score is estimated, 95% confidence intervals based on the asymptotic variance exhibit under-coverage. Bootstrap-based confidence intervals perform favorably achieving close to the 95% level in most cases, with apparent over coverage when estimating the indirect effect with N = 50.

6 |. EVALUATING THE EFFECTIVENESS OF POWER PLANT EMISSIONS CONTROLS FOR REDUCING AMBIENT OZONE POLLUTION

Limited literature exists evaluating U.S. air pollution regulations within a causal inference framework. Power plant regulations for the reduction of NOx emissions have been predicated on the knowledge that reducing NOx emissions would lead to a subsequent reduction in ambient ozone. Among various NOx emission reduction strategies, SCR and SNCR are believed to be the most effective in reducing emissions. While work in Papadogeorgou et al. (2018) corroborated this effectiveness of SCR and SNCR for reducing NOx emissions, the analysis of ambient ozone pollution in that paper ignores the possibility of interference and estimates an effect on ozone very close to zero. However, interference is a key component in the study of air pollution: ambient pollution concentrations near a power plant will depend on the treatment levels of other nearby power plants.

We use the same data as in Papadogeorgou et al. (2018) to estimate direct and indirect effects of SCR/SNCR against alternatives on ambient ozone under realistic counterfactual programs. The publicly-available data set includes 473 coal or gas burning power generating facilities in the U.S. operating during June, July and August 2004, with covariate information on power plant characteristics, weather and demographic information of the surrounding areas. For every power plant, the value of ozone is calculated as the average of the 4th highest ozone measurements across EPA monitoring locations within 100km. See Papadogeorgou et al. (2018) for a full description of the data set and linkage. Web Appendix B includes links to the publicly available data set, R package, and R scripts replicating the results.

Power plant facilities are grouped according to Ward’s agglomerative hierarchical clustering method (Ward, 1963), which starts by considering every unit as its own cluster and proceeds by merging clusters based on an objective function until all units form a single cluster. Clusters are meant to represent interference groups, so the objective function is based solely on geographic closeness to encourage grouping nearby power plants that are likely to emit over each other. Visual inspection is used to choose 50 clusters of power plants in the sequence of power plant partitions. The grouping and treatment of facilities are depicted in Figure 2.

FIGURE 2.

FIGURE 2

Treated (SCR/SNCR) and control (Other) power plant facilities during June, July, August of 2004. Shaded areas depict the interference clusters according to the agglomerative clustering method.

6.1 |. Plausibility of the ignorability and positivity assumption

While regulatory programs provide incentives to install emission-control technologies, power plants have latitude to select which (if any) technology to adopt. Such decisions are largely determined by the power plant’s characteristics such as its size and operating capacity, as well as by factors related to local or regional air pollution incentives that are influenced by area-level characteristics such as population density and urbanicity. To capture such factors, 18 covariates are included in the data set describing power plant, weather, and demographic characteristics, based on which ignorability is expected to hold. The variability in the observed proportion of treated power plants across clusters provides an additional indication that the positivity assumption is plausible. The propensity score was modeled as in Papadogeorgou et al. (2018) augmented with a cluster-specific random effect following common practice in the analysis of clustered data:

logit P(Aij=1|Lij)=γ0+βi+LijTγL,βi~N(0,σb2), (6)

where γ=(γ0,γLT,σb2)T in the notation of Theorem 1. Such model has been shown to perform well in the presence of unmeasured cluster-level confounders (Arpino and Mealli, 2011; Li et al., 2013) providing robustness to violations of the conditional ignorability assumption.

6.2 |. Counterfactual treatment allocation for the installation of SCR/SNCR technologies

To specify counterfactual treatment allocations that reflect realistic relationships between covariates and the propensity to adopt treatment, we specify Pα,L such that the log-odds of treatment installation related to individual covariates are as observed in the propensity score model for the observed treatment in (6). Even though this choice of Pα,L depends on the data through the estimated log-odds, the corresponding estimands are well-defined and the asymptotic results are valid for Pα,L fixed across replications of the sampling or an increasing number of clusters. Note that, in general, the covariates used to specify Pα,L need not be identical to those used in the observed propensity score model.

Values of α were considered between the 20th and 80th quantiles of the observed cluster treatment proportions, corresponding to α ∈ [0.073, 0.458]. Figure 3 shows the population direct effect DE(α), and population indirect effect IE(α1, α2) for a subset of values of α1 with bootstrap confidence intervals (asymptotic intervals are reported in Web Appendix B). The direct effect is estimated to be negative for all values of α and has a somewhat increasing trend, implying that in a world where the average propensity of SCR/SNCR among power plants in a cluster is fixed, the installation of SCR/SNCR at one power plant is estimated to reduce ozone concentrations in the surrounding area, but these reductions are smaller when the cluster average propensity of treatment is high (larger number of treated neighbors). Using the bootstrap, these reductions are statistically significant for α ∈ (0.22, 0.28), whereas asymptotic confidence intervals indicate significance for almost all values of α considered.

FIGURE 3.

FIGURE 3

Estimates of the direct effect of SCR/SNCR versus alternative strategies on ozone concentrations as a function of α, and estimates of the indirect effect for values α1 ∈ {0.1, 0.4}. Shaded areas represent 95% Bootstrap confidence intervals. Ozone is measured in parts per million.

The indirect effect is, in a way, a measure of pollution transport since it quantifies the effect of changes in the cluster average propensity of treatment on ozone concentrations near control power plants. For fixed α1, IEL(α1, α2) is almost always decreasing in α2, implying that higher cluster-average SCR/SNCR propensity leads to reductions in ambient ozone concentrations surrounding control power plants. Bootstrap confidence intervals indicate that the indirect effects were not significant. However, based on the asymptotic confidence intervals, the indirect effect is estimated to be significant for values of α1 > 0.15 and all α2.

Next, we considered estimating the effect of hypothesized federal regulations that would shift the distribution of cluster-average propensity of treatment. Fα1(Fα2) was assumed to be a discrete distribution within the 20th (50th) and 80th quantiles of the observed cluster-treatment proportions. In Figure 4, we show the empirical probability mass function, as well as the two counterfactual treatment allocations. IEL(Fα1,Fα2) was estimated to be −0.0036 parts per million (95% CI: Bootstrap −0.0208 to 0.005, Asymptotic −0.0059 to −0.0013) implying that federal regulations that encourage SCR/SNCR installation enough to bring the cluster average treatment propensity distribution from taking values between the 20th and 80th percentiles of the empirical cluster coverage distribution to taking values between the 50th and 80th percentiles of the empirical cluster coverage distribution, would lead to ambient ozone concentrations surrounding control power plants that are on average 0.0036 parts per million lower. Compare these estimates against the national ozone air quality standard of 0.07 parts per million.

FIGURE 4.

FIGURE 4

Observed cluster treatment proportions (“Observed”), and two discrete hypothesized distributions of cluster-average propensity of treatment. One corresponds to the empirical distribution restricted within the 20th and 80th quantiles of the observed cluster treatment proportions (“Observed-restricted”), and the other (“Counterfactual”) corresponds to the empirical (or the Observed-restricted) further restricted between the 50th and 80th quantiles of the observed cluster treatment proportions.

We explored the sensitivity of the results to the choice of clustering, and saw that the qualitative results for the effectiveness of SCR/SNCR emission reduction technologies are mostly consistent with negative estimated direct effects and decreasing indirect effect curves. The same holds using a propensity score that excluded the cluster-specific random effects. Estimates of the estimands in Tchetgen Tchetgen and VanderWeele (2012) that assume manipulation of individual power-plant treatment propensities indicated similar patterns and magnitudes to those in Figure 3. These results can be found in Web Appendix B.

7 |. DISCUSSION

Analyzing data accounting for interference disentangles the individual treatment effect from one’s neighbors’ treatment effect. We proposed new estimands for counterfactual strategies that depend on covariates and the treatment of one’s neighbors, manipulating treatment at the cluster, or the population of clusters level. Despite the focus on air pollution interventions, similar considerations could be construed in classical interference settings such as vaccine studies, where certain types of community members are more likely to receive the vaccine and vaccine programs are designed to increase coverage at the community, or national level.

For the estimands referring to population level interventions, Fα could be specified to depend on cluster-level covariates acting as predictors of cluster-average propensity of treatment. Alternative estimands can be defined for strategies that manipulate the relationship between covariates and treatment assignment to reflect, for example, interventions for which larger power plants receive higher penalties for over-emission.

Consistent estimators were proposed for which asymptotic distributions were derived. These estimators were employed in the comparative effectiveness of power plant emission control strategies on ambient ozone, returning results that are more in line with subject-matter knowledge than results from a previous study that assumed no interference. While the power plant analysis showed the potential for causal inference methods for interference to lead to important results in air pollution research, the assumption of partial interference– and the existence of distinct clusters– is a simplification since pollution from one power plant can travel long distances. The Web Appendix explores alternative clusterings with comparable results. Ongoing work is oriented towards an account of interference that does not rely on clusters (Forastiere et al., 2018). The existence of between-clusters interference should be viewed in the light of work by Sobel (2006), Tchetgen Tchetgen and VanderWeele (2012), Forastiere et al. (2017), and Sävje et al. (2018), where it is shown that ignoring interference can lead to estimated quantities that lack causal interpretation, unless further assumptions are made. Despite the limitations, the analysis of the air quality data entails a methodological advance for studies of air pollution interventions and formalizes interference in a realm where it has not, to our knowledge, been previously considered in detail.

Supplementary Material

Supp

ACKNOWLEDEGEMENTS

Funding was provided by National Institutes of Health R01ES026217, USEPA 83587201–0, and Health Effects Institute 4953-RFA14–3/16–4. The contents of this work are solely the responsibility of the grantee and do not necessarily represent the official views of the USEPA. Further, USEPA does not endorse the purchase of any commercial products or services mentioned in the publication. The authors thank Dr Christine Choirat for tools created to manage and link the disparate (publicly-available) data used for this analysis.

Footnotes

SUPPORTING INFORMATION

Additional supporting information referenced in Sections 2, 4, 5, and 6 may be found online in the Supporting Information section at the end of the article. Data and code to reproduce the study results are available at https://osf.io/7dp8c/. R package implementing the estimators in this paper can be found at https://github.com/gpapadog/Interference.

REFERENCES

  1. Ali M, Emch M, Von Seidlein L, Yunus M, Sack DA, Rao M, Holmgren J, and Clemens JD (2005). Herd immunity conferred by killed oral cholera vaccines in Bangladesh: A reanalysis. Lancet 366, 44–49. [DOI] [PubMed] [Google Scholar]
  2. Arpino B and Mealli F (2011). The specification of the propensity score in multilevel observational studies. Computational Statistics and Data Analysis 55, 1770–1780. [Google Scholar]
  3. Barkley BG, Hudgens MG, Clemens JD, Ali M, and Emch ME (2017). Causal Inference from Observational Studies with Clustered Interference.
  4. Efron B (1979). Bootstrap Methods: Another Look at the Jackknife. The Annals of Statistics 7, 1–26. [Google Scholar]
  5. Ferracci M, Jolivet G, and van den Berg GJ (2014). Evidence of Treatment Spillovers Within Markets. Review of Economics and Statistics 96, 812–823. [Google Scholar]
  6. Forastiere L, Airoldi EM, and Mealli F (2017). Identification and estimation of treatment and interference effects in observational studies on networks.
  7. Forastiere L, Mealli F, and Zigler CM (2018). Evaluating power plant regulations with methods for causal inference on bipartite networks with interference.
  8. Hudgens MG and Halloran ME (2008). Toward Causal Inference With Interference. Journal of the American Statistical Association 103, 832–842. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Iverson HK and Randles RH (1989). The Effects on Convergence of Substituting Parameter Estimates into U-Statistics and Other Families of Statistics. Probability Theory and Related Fields 81, 453–471. [Google Scholar]
  10. Li F, Zaslavsky AM, and Landrum MB (2013). Propensity score weighting with multilevel data. Statistics in Medicine 32, 3373–3387. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Liu L and Hudgens MG (2014). Large sample randomization inference of causal effects in the presence of interference. Journal of the American Statistical Association 109, 288–301. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Liu L, Hudgens MG, and Becker-Dreps S (2016). On inverse probability-weighted estimators in the presence of interference. Biometrika 103, 829–842. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Papadogeorgou G, Choirat C, and Zigler CM (2018). Adjusting for unmeasured spatial confounding with distance adjusted propensity score matching. Biostatistics. In Press. pages 1–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Perez-Heydrich C, Hudgens MG, Halloran ME, Clemens JD, Ali M, and Emch ME (2015). Assessing Effects of Cholera Vaccination in the Presence of Interference. Biometrics 33, 395–401. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Sävje F, Aronow PM, and Hudgens MG (2018). Average treatment effects in the presence of unknown interference. [DOI] [PMC free article] [PubMed]
  16. Sobel ME (2006). What Do Randomized Studies of Housing Mobility Demonstrate? Journal of the American Statistical Association 101, 1398–1407. [Google Scholar]
  17. Tchetgen Tchetgen EJ and VanderWeele TJ (2012). On causal inference in the presence of interference. Statistical Methods in Medical Research 21, 55–75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. van der Laan MJ and Petersen ML (2007). Causal Effect Models for Realistic Individualized Treatment and Intention to Treat Rules. International Journal of Biostatistics 3, Article3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Ward JHJ (1963). Hierarchical Grouping to Optimize an Objective Function. Journal of the American Statistical Association 58, 236–244. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supp

RESOURCES