Toward Causal Inference With Interference

Michael G Hudgens; M Elizabeth Halloran

doi:10.1198/016214508000000292

Author manuscript; available in PMC: 2008 Dec 10.

Published in final edited form as: J Am Stat Assoc. 2008 Jun;103(482):832–842. doi: 10.1198/016214508000000292

PMCID: PMC2600548 NIHMSID: NIHMS73860 PMID: 19081744

Abstract

A fundamental assumption usually made in causal inference is that of no interference between individuals (or units); that is, the potential outcomes of one individual are assumed to be unaffected by the treatment assignment of other individuals. However, in many settings, this assumption obviously does not hold. For example, in the dependent happenings of infectious diseases, whether one person becomes infected depends on who else in the population is vaccinated. In this article, we consider a population of groups of individuals where interference is possible between individuals within the same group. We propose estimands for direct, indirect, total, and overall causal effects of treatment strategies in this setting. Relations among the estimands are established; for example, the total causal effect is shown to equal the sum of direct and indirect causal effects. Using an experimental design with a two-stage randomization procedure (first at the group level, then at the individual level within groups), unbiased estimators of the proposed estimands are presented. Variances of the estimators are also developed. The methodology is illustrated in two different settings where interference is likely: assessing causal effects of housing vouchers and of vaccines.

Keywords: Group-randomized trials, Potential outcomes, Stable unit treatment value assumption, SUTVA, Vaccine

1. INTRODUCTION

1.1 Background and Outline

A fundamental assumption usually made in the potential outcomes approach to causal inference is that of no interference between individuals (Cox 1958), a critical component of the stable unit treatment value assumption (SUTVA) (Rubin 1980). Under the no-interference assumption, the potential outcomes of any individual are assumed to be unaffected by the treatment assignment of every other individual. However, in many settings, this assumption obviously does not hold. A classical example is given by the dependent happenings of infectious diseases (Ross 1916, p. 211), where whether one person becomes infected depends on who else in the population is vaccinated. In econometrics, a household’s decision whether to move may be affected by whether their neighbors receive a housing voucher to move (Sobel 2006). In education, interventions given to certain students may affect other students in the same class (Rubin 1990; Rosenbaum 2007). Sobel (2006) and Rosenbaum (2007) gave several other examples where interference is likely. In some settings, interference is a nuisance while in other settings it creates effects of interest. An example of the former includes agricultural experiments, where fallow rows between treatment plots can sometimes eliminate interference between plots. An example of the latter includes vaccinating against infectious diseases, where interference is an inherent result of the biology of transmission and is intrinsically of interest.

The assumption of no interference between individuals is often made without critical examination. Models not requiring this assumption have been considered in the context of plant variety evaluation (Kempton 1997) and cross-over trials (Senn 1993; Bailey and Kunert 2006). However, these methods typically assume a specific interference structure that is local in either space or time. Without making any such assumptions about the nature of interference, Struchiner, Halloran, Robins, and Spielman (1990) and Halloran and Struchiner (1991) conceptually defined several different types of causal effects of interventions that are possible in the presence of interference, namely, direct, indirect, total, and overall effects. To estimate the latter three effects, they noted one needs a population of groups as in group-randomized studies (Murray 1998). Several vaccination studies have been conducted or analyzed with the intent to estimate certain of these effects (Moulton et al. 2001; Longini, Halloran, and Nizam 2002; Ali et al. 2005; King et al. 2006).

Halloran and Struchiner (1995) delineated many of the complications of using potential outcomes to define causal estimands for the different types of effects possible in the presence of interference. They used Rubin’s (1978, 1990) suggestion for a general notation in the presence of interference to define individual direct, indirect, total, and overall effects by letting the potential outcomes for any individual depend on the vector of treatment assignments to the other individuals in the population. However, they found this approach impracticable because the number of possible potential outcomes becomes unwieldy for any reasonably sized population. More recently, Sobel (2006) proposed causal estimands for assessing housing voucher effects defined by averaging causal effects over all possible treatment assignments for a particular voucher allocation strategy compared to a benchmark allocation wherein all households receive no voucher. Rosenbaum (2007) developed nonparametric tests and confidence intervals for assessing treatment effect in the presence of interference.

In this article, we consider a population of groups of individuals where interference is possible between individuals within the same group. We propose causal estimands for direct, indirect, total, and overall causal effects of treatment assignment strategies based on Sobel’s approach of averaging over all possible treatment assignments (Sec. 3). Relations among the estimands are established and inference concerning the estimands is considered (Sec. 4). Using an experimental design with a two-stage randomization procedure (the first at the group level, the second at the individual level within groups), unbiased estimators of the proposed estimands are presented. Estimating the variance of the estimators is also considered. The methodology is illustrated in two different settings where interference is likely: assessing causal effects of housing vouchers and of vaccines (Sec. 5). Proofs are given in the Appendix. We begin with an example to motivate the development of the rest of the article.

1.2 Motivating Example

In this section, we consider data from an individually randomized, placebo-controlled trial of killed oral cholera vaccines to illustrate the direct, indirect, total, and overall effects as defined by Halloran and Struchiner (1991). Table 1 presents data from a reanalysis of this trial where the interest was in determining whether the level of vaccine coverage in a residential area, called a bari, was related to the incidence of cholera in individual vaccine recipients or placebo recipients residing in the bari (Ali et al. 2005). The target population was divided into groups by level of vaccine coverage. For illustration, we consider the groups with more than 50% and less than 28% coverage, which we denote as groups A and B.

Table 1.

Risk of cholera in recipients of killed oral cholera vaccines or placebo, by level of coverage of the bari during one year of follow-up, based on data from Ali et al. (2005)

Level of vaccine coverage	Target population	Vaccine recipients: total	Vaccine recipients: cases	Vaccine recipients: risk per 1,000 population	Placebo recipients: total	Placebo recipients: cases	Placebo recipients: risk per 1,000 population
>50%	22,394	12,541	16	1.27	6,082	9	1.47
41-50%	24,159	11,513	26	2.26	5,801	27	4.65
36-40%	24,583	10,772	17	1.58	5,503	26	4.72
28-35%	25,059	8,883	22	2.48	4,429	26	5.87
<28%	24,954	5,627	15	2.66	2,852	20	7.01

The effects of vaccination can be estimated based on differences in the incidence of cholera during the first year of follow-up of the trial. The direct effects are estimated by comparing the incidence (risk per 1,000 population) between vaccinated individuals and unvaccinated individuals within each group. For example, the estimated direct effect in group B is 7.01-2.66 = 4.35, suggesting vaccination results in 4.35 fewer cases of cholera per 1,000 individuals per year. The estimated direct effect in group A is 1.47-1.27 = .20, considerably lower than in group B. The difference in the two estimates illustrates one of the challenges in making comparisons directly within groups when interference is present. If an analysis were limited to group A only, the evidence would suggest that the vaccine has little effect.

The indirect effects of vaccination are those effects due to the level of coverage. They can be estimated by comparing the outcomes in the unvaccinated in the two groups or the outcomes in the vaccinated in the two groups. For instance, the estimated indirect effect in the unvaccinated is 7.01 - 1.47 = 5.54. Note this estimate is greater than the estimated direct effect in either of the groups, highlighting the importance of looking beyond direct effects in the presence of interference. Based on similar analyses, Ali et al. concluded that the vaccines provide significant indirect protection to nonvaccinated individuals.

Total and overall effects provide summary measures that combine direct and indirect effects. The total effect of vaccination is the effect of being vaccinated in the group with higher coverage (A) compared to not being vaccinated in the group with lower coverage (B). The estimated total effect (B - A) is 7.01-1.27 = 5.74. Note the total effect (B - A) estimate equals the direct effect estimate in group A plus the indirect effect estimate in the unvaccinated (B - A). The overall effect is the average effect of being in the group with higher coverage compared to being in the group with lower coverage. The overall effect can be estimated by the difference in incidence between the two groups, that is, 35/8,479 - 25/18,623 = 2.79/1,000.
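The arithmetic in this example can be retraced with a short script. The sketch below is only an illustration: it uses the rounded risks printed in Table 1 for the direct, indirect, and total effects (as the text does) and the raw counts for the overall effect. The names `TABLE_1`, `reported_risk`, and `pooled_risk` are hypothetical helpers, not part of the original analysis.

```python
# Figures copied from Table 1 for the two groups used in the example.
# Group A: >50% coverage; group B: <28% coverage.
# Each arm: (total individuals, cholera cases, reported risk per 1,000).
TABLE_1 = {
    "A": {"vaccine": (12541, 16, 1.27), "placebo": (6082, 9, 1.47)},
    "B": {"vaccine": (5627, 15, 2.66), "placebo": (2852, 20, 7.01)},
}


def reported_risk(group, arm):
    """Risk per 1,000 as printed in Table 1 (already rounded)."""
    return TABLE_1[group][arm][2]


def pooled_risk(group):
    """Risk per 1,000 over both arms of a group, computed from raw counts."""
    total = sum(arm[0] for arm in TABLE_1[group].values())
    cases = sum(arm[1] for arm in TABLE_1[group].values())
    return 1000.0 * cases / total


# Direct effects: unvaccinated minus vaccinated within a group.
direct_A = round(reported_risk("A", "placebo") - reported_risk("A", "vaccine"), 2)
direct_B = round(reported_risk("B", "placebo") - reported_risk("B", "vaccine"), 2)
# Indirect effect in the unvaccinated: low coverage (B) minus high coverage (A).
indirect_unvaccinated = round(
    reported_risk("B", "placebo") - reported_risk("A", "placebo"), 2
)
# Total effect: unvaccinated in B minus vaccinated in A.
total_effect = round(reported_risk("B", "placebo") - reported_risk("A", "vaccine"), 2)
# Overall effect: pooled risk in B minus pooled risk in A.
overall_effect = round(pooled_risk("B") - pooled_risk("A"), 2)

print(direct_A, direct_B, indirect_unvaccinated, total_effect, overall_effect)
```

With these inputs the script gives .20, 4.35, 5.54, 5.74, and 2.79 per 1,000, matching the estimates quoted above, and the total effect equals the group A direct effect plus the indirect effect in the unvaccinated.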

2. PRELIMINARIES

2.1 Potential Outcomes

Suppose there are N > 1 groups of individuals [or blocks of units using Rosenbaum’s (2007) terminology]. For i = 1, . . . , N, let n_i denote the number of individuals in group i and let Z_i ≡ (Z_i1, . . . , Z_{in_i}) denote the treatments those n_i individuals receive. We assume throughout that assignment of an individual to a particular treatment is equivalent to receipt of that treatment; that is, there is perfect compliance. Assume Z_ij is a dichotomous random variable having values 0 or 1 such that Z_i can take on 2^{n_i} possible values. Let Z_i(j) denote the n_i - 1 subvector of Z_i with the jth entry deleted. The vector Z_i will be referred to as an intervention or treatment program, to distinguish it from the individual treatment Z_ij. Let z_i and z_ij denote possible values of Z_i and Z_ij. Define R^j to be the set of vectors of possible treatment programs of length j for j = 1, 2, . . . ; for example, R² ≡ {(0, 0), (0, 1), (1, 0), (1, 1)}. Possible values z_i of Z_i are elements of R^{n_i}. For positive integer n and k ∈ {0, . . . , n}, define $R_{k}^{n}$ to be the subset of Rⁿ wherein exactly k individuals receive treatment 1; for example, $\sum_{j = 1}^{n_{i}} z_{i j} = k$ for all $z_{i} \in R_{k}^{n_{i}}$.
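To make the sets of treatment programs concrete, the following minimal sketch enumerates them for small n. The function names `programs` and `programs_with_k_treated` are illustrative choices, not notation from the article.

```python
from itertools import product
from math import comb


def programs(n):
    """All treatment programs of length n, i.e., the set R^n."""
    return list(product((0, 1), repeat=n))


def programs_with_k_treated(n, k):
    """The subset R^n_k in which exactly k individuals receive treatment 1."""
    return [z for z in programs(n) if sum(z) == k]


print(programs(2))                    # [(0, 0), (0, 1), (1, 0), (1, 1)]
print(programs_with_k_treated(3, 2))  # [(0, 1, 1), (1, 0, 1), (1, 1, 0)]
print(len(programs_with_k_treated(5, 2)) == comb(5, 2))  # True
```

The first call reproduces the example R² given in the text; the size of R^n_k is the binomial coefficient n choose k.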

Denote the potential outcome of individual j in group i under treatment z_i as Y_ij (z_i). Following the usual approach to causal inference (see, e.g., Rosenbaum 2007), we assume the Y_ij (z_i) potential responses are fixed because they do not depend on the realized random assignment of treatments Z_i, whereas the observed responses Y_ij (Z_i) do depend on Z_i and, thus, are random variables. The notation Y_ij (z_i) allows for the possibility that the potential outcome for individual j may depend on another individual’s treatment assignment in group i; that is, there may be interference between individuals within a group. Implicit in this notation is the assumption that the potential outcomes for individuals in group i do not depend on treatment assignments of individuals in group i’ for i’ ≠ i. In other words, we assume no interference between individuals in different groups but allow for interference between individuals within the same group (Halloran and Struchiner 1991, 1995). This will be a reasonable assumption provided the groups are sufficiently separate (e.g., in space or time). Sobel (2006) called this a partial interference assumption. In the literature of group-randomized studies, violation of no interference across groups is called contamination.

2.2 Treatment Assignment Mechanisms

Let ψ and ϕ denote parameterizations that govern the distribution of Z_i for i = 1, . . . , N. For example, ψ might correspond to randomly assigning half of individuals in a group to treatment 1 and the other half to treatment 0, while ϕ might correspond to assigning all individuals in a group to treatment 0. We refer to ψ and ϕ as individual treatment assignment strategies. Our goal is to assess the causal effects of assigning groups to ψ compared to ϕ.

As is typical of causal inference articles, we use randomization inference whereby the randomization distribution induced by the experimental design forms the basis for statistical inference. For the experimental design, we consider a two-stage randomization procedure. In the first stage, each of the N groups is randomly assigned to either ϕ or ψ. In the second stage, individuals are randomly assigned treatment conditional on their group’s assignment in the first stage. For example, in the first stage, half of the N groups might be assigned to an allocation strategy ϕ and the other half ψ; in the second stage, two-thirds of the individuals within a group are randomly assigned treatment 1 for groups assigned ϕ, while one-third of the individuals within a group are randomly assigned treatment 1 for groups assigned ψ. Such a design has been referred to as split-plot (Hayes, Alexander, Bennett, and Cousens 2000) or pseudo-cluster (Borm, Melis, Teerenstra, and Peer 2005) randomization and has been proposed for evaluation of intervention programs in the elderly (Melis et al. 2005) and vaccine efficacy (see Sec. 5.2). This design can be employed to answer questions such as: How many infections will be averted by vaccinating two-thirds of the population compared to only vaccinating one-third of the population? What proportion of households will move if two-thirds receive vouchers compared to only one-third receiving vouchers?

Corresponding to the first stage of randomization, let S ≡ (S₁, . . . , S_N) denote the group assignments with S_i = 1 if the ith group is assigned to ψ and 0 otherwise. Let ν denote the parameterization that governs the distribution of S and let C ≡ Σ_i S_i denote the number of groups assigned ψ. Define ν to be a mixed (Sobel 2006) or permutation (Friedman, Furberg, and DeMets 1998) group assignment strategy if 0 < C < N and Pr_ν (S = s) = C!(N - C)!/N! if $s \in R_{C}^{N}$, 0 otherwise. In other words, under a mixed group assignment strategy, a fixed number C of N groups are assigned ψ, with each of the $\binom{N}{C}$ possible group assignments receiving equal probability. Similarly, corresponding to the second stage of randomization, let K_i ≡ Σ_j Z_ij and define ϕ and ψ to be mixed individual group assignment strategies if K_i is fixed given S_i, with 0 < K_i < n_i and each of the $\binom{n_{i}}{K_{i}}$ possible individual treatment assignments receiving equal probability.
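The two-stage design with mixed strategies at both stages can be sketched as follows. This is a minimal illustration under stated assumptions: the function names, the equal group sizes, and the rules `k_psi` and `k_phi` (which map a group size to the number treated) are hypothetical and chosen to mirror the one-third versus two-thirds example above.

```python
import random


def mixed_assignment(n, k, rng):
    """Assign 1 to exactly k of n units, every such subset being equally likely."""
    chosen = set(rng.sample(range(n), k))
    return [1 if j in chosen else 0 for j in range(n)]


def two_stage_randomization(group_sizes, C, k_psi, k_phi, rng):
    """Stage 1: C of the N groups get psi (S_i = 1), the rest phi (S_i = 0).
    Stage 2: within group i, exactly K_i individuals get treatment 1,
    where K_i is fixed given S_i."""
    S = mixed_assignment(len(group_sizes), C, rng)
    Z = []
    for s_i, n_i in zip(S, group_sizes):
        K_i = k_psi(n_i) if s_i == 1 else k_phi(n_i)
        Z.append(mixed_assignment(n_i, K_i, rng))
    return S, Z


rng = random.Random(0)  # fixed seed only so the sketch is reproducible
S, Z = two_stage_randomization(
    group_sizes=[6, 6, 6, 6],
    C=2,
    k_psi=lambda n: n // 3,      # one-third treated under psi
    k_phi=lambda n: 2 * n // 3,  # two-thirds treated under phi
    rng=rng,
)
print(S)
print(Z)
```

Because `random.Random.sample` draws without replacement, each stage treats a fixed number of units and gives every admissible assignment the same probability, which is the defining property of a mixed strategy.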

3. CAUSAL ESTIMANDS

3.1 Average Potential Outcomes

A fundamental problem in causal inference is that, in general, it is not possible to observe more than one potential outcome for an individual. Faced with this problem, causal estimands are typically defined in terms of averages of potential outcomes that are identifiable from observable random variables. Following this approach, we begin by writing the potential outcomes for individual j in group i under z_ij = z as

Y_{i j} (z_{i (j)}, z_{i j} = z)

(1)

for z = 0, 1. Because (1) depends on z_i(j), define the individual average potential outcome under treatment assignment z by

{\overset{‒}{Y}}_{i j} (z; ψ) \equiv \sum_{ω \in R^{n_{i} - 1}} Y_{i j} (z_{i (j)} = ω, z_{i j} = z) \times \Pr_{ψ} (Z_{i (j)} = ω ∣ Z_{i j} = z) .

In other words, the individual average potential outcome is the conditional expectation of Y_ij (Z_i) given Z_ij = z under assignment strategy ψ. Averaging over individuals, define the group average potential outcome under treatment assignment z as ${\overset{‒}{Y}}_{i} (z; ψ) \equiv \sum_{j = 1}^{n_{i}} {\overset{‒}{Y}}_{i j} (z; ψ) ∕ n_{i}$ . Finally, averaging over groups, define the population average potential outcome under treatment assignment z as $\overset{‒}{Y} (z; ψ) \equiv \sum_{i = 1}^{N} {\overset{‒}{Y}}_{i} (z; ψ) ∕ N$ .

The average potential outcomes discussed previously are defined as functions of both the group assignment ψ (or ϕ) and the individual treatment assignment z. We can also define average potential outcomes solely as a function of ψ. For example, define the marginal individual average potential outcome by ${\overset{‒}{Y}}_{i j} (ψ) \equiv \sum_{z \in R^{n_{i}}} Y_{i j} (z) \Pr_{ψ} (Z_{i} = z)$ , that is, the average potential outcome for individual j in group i when group i is assigned ψ. Similarly, define the marginal group and population average potential outcomes by ${\overset{‒}{Y}}_{i} (ψ) \equiv \sum_{j = 1}^{n_{i}} {\overset{‒}{Y}}_{i j} (ψ) ∕ n_{i}$ and $\overset{‒}{Y} (ψ) \equiv \sum_{i = 1}^{N} {\overset{‒}{Y}}_{i} (ψ) ∕ N$ .
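For a small group, these averages can be computed by brute-force enumeration. The sketch below assumes ψ is a mixed strategy treating exactly k of n individuals, so that the conditional distribution of Z_i(j) given Z_ij = z is uniform over the programs in R^n_k with z_ij = z. The potential outcome function `toy_outcome` is entirely hypothetical and is used only to have numbers to average.

```python
from fractions import Fraction
from itertools import combinations


def programs_with_k_treated(n, k):
    """The set R^n_k as a list of 0/1 tuples."""
    return [
        tuple(1 if j in treated else 0 for j in range(n))
        for treated in combinations(range(n), k)
    ]


def toy_outcome(j, z):
    """Hypothetical Y_ij(z): depends on own treatment and on one neighbor's."""
    return 10 - 5 * z[j] - 2 * z[(j + 1) % len(z)]


def individual_average(j, z_val, n, k, outcome):
    """Individual average potential outcome under z_ij = z_val."""
    consistent = [z for z in programs_with_k_treated(n, k) if z[j] == z_val]
    return Fraction(sum(outcome(j, z) for z in consistent), len(consistent))


def group_average(z_val, n, k, outcome):
    """Group average potential outcome under z_val."""
    return sum(individual_average(j, z_val, n, k, outcome) for j in range(n)) / n


def marginal_group_average(n, k, outcome):
    """Marginal group average: average over all programs in R^n_k and all j."""
    all_z = programs_with_k_treated(n, k)
    total = sum(outcome(j, z) for z in all_z for j in range(n))
    return Fraction(total, len(all_z) * n)


print(group_average(0, 4, 2, toy_outcome))         # 26/3
print(group_average(1, 4, 2, toy_outcome))         # 13/3
print(marginal_group_average(4, 2, toy_outcome))   # 13/2
```

Exact fractions are used so the averages can be compared without rounding error; with n = 4 and k = 2, each individual is treated with probability 1/2, so the marginal average 13/2 is the midpoint of 26/3 and 13/3.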

In the following sections, causal estimands are defined in terms of these various average potential outcomes.

3.2 Direct Causal Effects

Halloran and Struchiner (1991) defined the direct effect of a treatment on an individual as the difference between the potential outcome for that individual given treatment compared to the potential outcome for that individual without treatment, all other things being equal. Formally, following Halloran and Struchiner (1995), we define the individual direct causal effect of treatment 0 compared to treatment 1 for individual j in group i by

C E_{i j}^{D} (z_{i (j)}) \equiv Y_{i j} (z_{i (j)}, z_{i j} = 0) - Y_{i j} (z_{i (j)}, z_{i j} = 1) .

(2)

Next, define the individual average direct causal effect for individual j in group i by

{\overset{‒}{C E}}_{i j}^{D} (ψ) \equiv {\overset{‒}{Y}}_{i j} (0; ψ) - {\overset{‒}{Y}}_{i j} (1; ψ),

(3)

that is, the difference in individual average potential outcomes when z_ij = 0 and when z_ij = 1 under ψ. Using Rubin’s (2005) terminology, (3) is a marginal causal effect in that a comparison is being made between expected values of the marginal distributions of Y_ij (Z_i(j), Z_ij = 0) and of Y_ij (Z_i(j), Z_ij = 1). Finally, define the group average direct causal effect by ${\overset{‒}{C E}}_{i}^{D} (ψ) \equiv {\overset{‒}{Y}}_{i} (0; ψ) - {\overset{‒}{Y}}_{i} (1; ψ) = \sum_{j = 1}^{n_{i}} {\overset{‒}{C E}}_{i j}^{D} (ψ) ∕ n_{i}$ and the population average direct causal effect by ${\overset{‒}{C E}}^{D} (ψ) \equiv \overset{‒}{Y} (0; ψ) - \overset{‒}{Y} (1; ψ) = \sum_{i = 1}^{N} {\overset{‒}{C E}}_{i}^{D} (ψ) ∕ N$ .

3.3 Indirect Causal Effects

In contrast to direct effects, an indirect effect describes the effect on an individual of the treatment received by others in the group. In particular, Halloran and Struchiner (1991) defined the indirect effect of a treatment on an individual as the difference between the potential outcomes for that individual without treatment when the group (i) receives an intervention program and (ii) receives the benchmark program of no intervention. Similar to Halloran and Struchiner (1995), we define the individual indirect causal effect of treatment program z_i compared with $z_{i}^{'}$ on individual j in group i by

C E_{i j}^{I} (z_{i (j)}, z_{i (j)}^{'}) \equiv Y_{i j} (z_{i (j)}, z_{i j} = 0) - Y_{i j} (z_{i (j)}^{'}, z_{i j}^{'} = 0),

(4)

where $z_{i}^{'}$ is another n_i-dimensional vector of individual treatment assignments. (Note $z_{i}^{'}$ does not denote the transpose of z_i.)

Remark

Definition (4) does not restrict either z_i or $z_{i}^{'}$ to be the benchmark program of no intervention; that is, individual indirect causal effects may exist between two different intervention programs. The same is true for the definitions of individual total and overall causal effects.

Remark

The individual indirect causal effect could be defined analogously for individuals with $z_{i j} = z_{i j}^{'} = 1$; that is, individuals under either treatment may experience indirect effects. This yields two individual indirect causal effects, which need not be equal. For simplicity, only indirect effects based on (4) are considered in the rest of this article.

Similar to direct effects, define the individual average indirect causal effect by ${\overset{‒}{C E}}_{i j}^{I} (ϕ, ψ) \equiv {\overset{‒}{Y}}_{i j} (0; ϕ) - {\overset{‒}{Y}}_{i j} (0; ψ)$ . Clearly, if ψ = ϕ, then ${\overset{‒}{C E}}_{i j}^{I} (ϕ, ψ) = 0$ ; that is, there are no individual average indirect causal effects. Finally, define the group average indirect causal effect as ${\overset{‒}{C E}}_{i}^{I} (ϕ, ψ) \equiv {\overset{‒}{Y}}_{i} (0; ϕ) - {\overset{‒}{Y}}_{i} (0; ψ) = \sum_{j = 1}^{n_{i}} {\overset{‒}{C E}}_{i j}^{I} (ϕ, ψ) ∕ n_{i}$ and the population average indirect causal effect as ${\overset{‒}{C E}}^{I} (ϕ, ψ) \equiv \overset{‒}{Y} (0; ϕ) - \overset{‒}{Y} (0; ψ) = \sum_{i = 1}^{N} {\overset{‒}{C E}}_{i}^{I} (ϕ, ψ) ∕ N$ .

3.4 Total Causal Effects

Total effects describe both the direct and the indirect effects of a particular treatment assignment on an individual. Halloran and Struchiner (1991) defined the total effect of a treatment on an individual as the difference between the potential outcomes for that individual (i) with treatment when the group receives an intervention program and (ii) without treatment when the group receives no intervention. Following Halloran and Struchiner (1995), we define the individual total causal effects for individual j in group i as

C E_{i j}^{T} (z_{i (j)}, z_{i (j)}^{'}) \equiv Y_{i j} (z_{i (j)}, z_{i j} = 0) - Y_{i j} (z_{i (j)}^{'}, z_{i j}^{'} = 1) .

(5)

Define the individual average total causal effect by ${\overset{‒}{C E}}_{i j}^{T} (ϕ, ψ) \equiv {\overset{‒}{Y}}_{i j} (0; ϕ) - {\overset{‒}{Y}}_{i j} (1; ψ)$ , the group average total causal effect by ${\overset{‒}{C E}}_{i}^{T} (ϕ, ψ) \equiv {\overset{‒}{Y}}_{i} (0; ϕ) - {\overset{‒}{Y}}_{i} (1; ψ) = \sum_{j = 1}^{n_{i}} {\overset{‒}{C E}}_{i j}^{T} (ϕ, ψ) ∕ n_{i}$ , and the population average total causal effect by ${\overset{‒}{C E}}^{T} (ϕ, ψ) \equiv \overset{‒}{Y} (0; ϕ) - \overset{‒}{Y} (1; ψ) = \sum_{i = 1}^{N} {\overset{‒}{C E}}_{i}^{T} (ϕ, ψ) ∕ N$ .

Remark

It follows from (2), (4), and (5) that the individual total causal effect is the sum of individual direct and indirect causal effects, that is, $C E_{i j}^{T} (z_{i (j)}, z_{i (j)}^{'}) = C E_{i j}^{D} (z_{i (j)}^{'}) + C E_{i j}^{I} (z_{i (j)}, z_{i (j)}^{'})$ . Likewise, the total causal effects can be decomposed as the sum of direct and indirect causal effects at the individual average, group average, and population average levels, for example, ${\overset{‒}{C E}}^{T} (ϕ, ψ) = {\overset{‒}{C E}}^{D} (ψ) + {\overset{‒}{C E}}^{I} (ϕ, ψ)$ . This result formalizes, using a causal framework, models from the vaccine and plant variety evaluation literature, which assume the total effect is the sum of direct and indirect effects (Halloran and Struchiner 1991, 1995; Kempton 1997; Moulton et al. 2006).
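The population-level decomposition follows by adding and subtracting the same average potential outcome. A short derivation, using only the definitions of Sections 3.2 through 3.4 (with \overline{\,\cdot\,} standing for the bar notation of the text):

```latex
\begin{aligned}
\overline{CE}^{T}(\phi,\psi)
  &\equiv \overline{Y}(0;\phi) - \overline{Y}(1;\psi) \\
  &= \{\overline{Y}(0;\psi) - \overline{Y}(1;\psi)\}
     + \{\overline{Y}(0;\phi) - \overline{Y}(0;\psi)\} \\
  &= \overline{CE}^{D}(\psi) + \overline{CE}^{I}(\phi,\psi).
\end{aligned}
```

The same add-and-subtract step applies at the individual average and group average levels, with the corresponding averages in place of the population averages.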

Remark

A few other characteristics of the algebra of causal effects bear mentioning. First, total causal effects are not commutative; for example, ${\overset{‒}{C E}}^{T} (ϕ, ψ)$ will not necessarily equal ${\overset{‒}{C E}}^{T} (ψ, ϕ)$ for ϕ ≠ ψ. However, indirect effects have the property ${\overset{‒}{C E}}^{I} (ψ, ϕ) = - {\overset{‒}{C E}}^{I} (ϕ, ψ)$ , implying ${\overset{‒}{C E}}^{D} (ψ) + {\overset{‒}{C E}}^{D} (ϕ) = {\overset{‒}{C E}}^{T} (ϕ, ψ) + {\overset{‒}{C E}}^{T} (ψ, ϕ)$ . Thus, the total causal effects, while not necessarily equal, are constrained in sum to equal the sum of the direct effects. Also note that if ${\overset{‒}{C E}}^{I} (ψ, ϕ) = {\overset{‒}{C E}}^{I} (ϕ, ψ) = 0$ , then ${\overset{‒}{C E}}^{T} (ϕ, ψ) = {\overset{‒}{C E}}^{T} (ψ, ϕ)$ if and only if ${\overset{‒}{C E}}^{D} (ϕ) = {\overset{‒}{C E}}^{D} (ψ)$ ; that is, in the absence of indirect effects, the total effects are commutative if and only if the direct effects are equal.

3.5 Overall Causal Effect

Halloran and Struchiner (1991) defined the overall causal effect to be the average effect of an intervention program relative to no intervention. We define the individual overall causal effect of treatment z_i compared to treatment $z_{i}^{'}$ for individual j in group i by $C E_{i j}^{O} (z_{i}, z_{i}^{'}) \equiv Y_{i j} (z_{i}) - Y_{i j} (z_{i}^{'})$ . Similarly, for the comparison of ϕ to ψ, define the individual average overall causal effect by ${\overset{‒}{C E}}_{i j}^{O} (ϕ, ψ) \equiv {\overset{‒}{Y}}_{i j} (ϕ) - {\overset{‒}{Y}}_{i j} (ψ)$ , the group average overall causal effect by ${\overset{‒}{C E}}_{i}^{O} (ϕ, ψ) \equiv {\overset{‒}{Y}}_{i} (ϕ) - {\overset{‒}{Y}}_{i} (ψ)$ , and the population average overall causal effect by ${\overset{‒}{C E}}^{O} (ϕ, ψ) \equiv \overset{‒}{Y} (ϕ) - \overset{‒}{Y} (ψ)$ .

3.6 No Interference

The estimands defined previously simplify under the assumption of no interference between individuals within a group, that is, under the assumption $Y_{i j} (z_{i}) = Y_{i j} (z_{i}^{'})$ for any two treatment programs z_i = (z_i1, . . . , z_{in_i}) and $z_{i}^{'} = (z_{i 1}^{'}, \dots, z_{i n_{i}}^{'})$ such that $z_{i j} = z_{i j}^{'}$ (Rubin 1980; Angrist, Imbens, and Rubin 1996). Assuming no interference, the potential outcomes for individual j in group i can be written simply as Y_ij (0) and Y_ij (1). In turn, the individual direct causal effect equals Y_ij (0) - Y_ij (1). The corresponding group average direct causal effect becomes $\sum_{j = 1}^{n_{i}} \{Y_{i j} (0) - Y_{i j} (1)\} ∕ n_{i}$, that is, the usual average causal effect (ACE) estimand. By (4), the individual indirect causal effect equals 0 for all individuals assuming no interference. Similarly, by (5), the individual total causal effect equals the individual direct causal effect. Likewise, at the group and population average levels, under the no-interference assumption the indirect causal effect is 0 and the direct causal effect equals the total causal effect. Assuming no interference also implies the direct, indirect, and total effects do not depend on the treatment assignment strategies ϕ and ψ, whereas in the presence of interference within a group, they do in general.

4. INFERENCE

In this section, we consider drawing inference about the estimands defined previously. Throughout this section, we assume:

Assumption 1. ν, ϕ, and ψ are mixed assignment strategies.

In Section 4.1, we present estimators for the estimands defined previously and show they are unbiased under Assumption 1. In Section 4.2, we consider the variances of these estimators.

4.1 Estimators

Theorem 1

Suppose S_i = 1 and let

{\hat{Y}}_{i} (z; ψ) \equiv \frac{\sum_{j = 1}^{n_{i}} Y_{i j} (Z_{i}) I [Z_{i j} = z]}{\sum_{j = 1}^{n_{i}} I [Z_{i j} = z]} for z = 0, 1;

(6)

that is, ${\hat{Y}}_{i} (z; ψ)$ is the average of observed outcomes for individuals in group i receiving treatment z under treatment program Z_i. Under Assumption 1, $E \{{\hat{Y}}_{i} (z; ψ) ∣ S_{i} = 1\} = {\overset{‒}{Y}}_{i} (z; ψ)$ for z = 0, 1.

Corollary

Under Assumption 1, ${\hat{C E}}_{i}^{D} (ψ) \equiv {\hat{Y}}_{i} (0; ψ) - {\hat{Y}}_{i} (1; ψ)$ is a conditionally unbiased estimator of ${\overset{‒}{C E}}_{i}^{D} (ψ)$ given S_i = 1.

Remark

Unbiased estimators of the group average indirect, total, and overall causal effects do not exist without further assumptions because the same group is not observed under ϕ and ψ.

Theorem 2

For z = 0, 1, let $\hat{Y} (z; ψ) \equiv \sum_{i = 1}^{N} {\hat{Y}}_{i} (z; ψ) \times I [S_{i} = 1] ∕ \sum_{i = 1}^{N} I [S_{i} = 1]$. Under Assumption 1, $E \{\hat{Y} (z; ψ)\} = \overset{‒}{Y} (z; ψ)$ for z = 0, 1.

Corollary

Under Assumption 1, unbiased estimators for the population average direct, indirect, and total causal effects are given by ${\hat{C E}}^{D} (ψ) \equiv \hat{Y} (0; ψ) - \hat{Y} (1; ψ)$ , ${\hat{C E}}^{I} (ϕ, ψ) \equiv \hat{Y} (0; ϕ) - \hat{Y} (0; ψ)$ , and ${\hat{C E}}^{T} (ϕ, ψ) \equiv \hat{Y} (0; ϕ) - \hat{Y} (1; ψ)$ , where $\hat{Y} (z; ϕ)$ is defined analogously to $\hat{Y} (z; ψ)$ for z = 0, 1.

Theorem 3

Let ${\hat{Y}}_{i} (ψ) \equiv \sum_{j = 1}^{n_{i}} Y_{i j} (Z_{i}) ∕ n_{i}$ and $\hat{Y} (ψ) \equiv \sum_{i = 1}^{N} {\hat{Y}}_{i} (ψ) I [S_{i} = 1] ∕ \sum_{i = 1}^{N} I [S_{i} = 1]$. Under Assumption 1, $E \{{\hat{Y}}_{i} (ψ) ∣ S_{i} = 1\} = {\overset{‒}{Y}}_{i} (ψ)$ and $E \{\hat{Y} (ψ)\} = \overset{‒}{Y} (ψ)$.

Corollary

Under Assumption 1, an unbiased estimator of ${\overset{‒}{C E}}^{O} (ϕ, ψ)$ is given by ${\hat{C E}}^{O} (ϕ, ψ) \equiv \hat{Y} (ϕ) - \hat{Y} (ψ)$ , where $\hat{Y} (ϕ)$ is defined analogously to $\hat{Y} (ψ)$ .
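The estimators of Theorems 1 through 3 and their corollaries are simple averages of observed outcomes, which the following sketch computes from data. The data layout (a list of dictionaries with keys `"S"`, `"Z"`, `"Y"`) and the toy numbers are assumptions made for illustration only; Assumption 1 guarantees that each group has at least one individual under each treatment, so no division by zero occurs.

```python
def group_mean(y, z, z_val=None):
    """Mean observed outcome in one group, restricted to Z_ij == z_val if given.
    With z_val set this is Eq. (6); with z_val=None it is the marginal group mean."""
    selected = [y_j for y_j, z_j in zip(y, z) if z_val is None or z_j == z_val]
    return sum(selected) / len(selected)


def population_mean(groups, s_val, z_val=None):
    """Average of group-level means over groups with first-stage assignment s_val."""
    means = [group_mean(g["Y"], g["Z"], z_val) for g in groups if g["S"] == s_val]
    return sum(means) / len(means)


def effect_estimates(groups):
    """Population average effect estimates; S = 1 denotes psi and S = 0 denotes phi."""
    y0_psi = population_mean(groups, 1, 0)
    y1_psi = population_mean(groups, 1, 1)
    y0_phi = population_mean(groups, 0, 0)
    return {
        "direct": y0_psi - y1_psi,
        "indirect": y0_phi - y0_psi,
        "total": y0_phi - y1_psi,
        "overall": population_mean(groups, 0) - population_mean(groups, 1),
    }


# Hypothetical observed data: two groups assigned psi, two assigned phi.
toy_groups = [
    {"S": 1, "Z": [1, 1, 0], "Y": [1, 3, 5]},
    {"S": 1, "Z": [1, 0, 1], "Y": [2, 6, 4]},
    {"S": 0, "Z": [0, 0, 1], "Y": [8, 10, 6]},
    {"S": 0, "Z": [0, 1, 0], "Y": [7, 5, 9]},
]
print(effect_estimates(toy_groups))
# {'direct': 3.0, 'indirect': 3.0, 'total': 6.0, 'overall': 4.0}
```

Note that the estimated total effect equals the sum of the estimated direct and indirect effects by construction, mirroring the decomposition of the estimands.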

4.2 Variance Estimators

In general, unbiased estimators of the variances of the estimators discussed previously do not exist without making further assumptions. For example, consider estimating $Var ({\hat{Y}}_{i} (z; ψ) ∣ S_{i} = 1)$ under Assumption 1. The estimator ${\hat{Y}}_{i} (1; ψ)$ is based on sampling from the set of potential outcomes $\{Y_{i j} (z_{i}) : z_{i} \in R_{K_{i}}^{n_{i}}, z_{i j} = 1\}$ for some fixed value of K_i. This set can be partitioned into $\binom{n_{i}}{K_{i}}$ clusters of size K_i, where each cluster corresponds to a particular $z_{i} \in R_{K_{i}}^{n_{i}}$. Moreover, given S_i = 1 from the first stage of randomization, the second randomization stage entails selecting exactly one of these clusters according to Z_i. Thus, ${\hat{Y}}_{i} (1; ψ)$ can be viewed as the sample mean from a single systematic sample. It is known that, in general, unbiased estimators of the variance of the sample mean from a single systematic sample do not exist without making further assumptions about the underlying population (Som 1973, sec. 4.4; Thompson 1992, chap. 12.4).

Therefore, to make progress in deriving variance estimators, in Section 4.2.1 an additional assumption is introduced about the structure of interference (stratified interference), which may be plausible in a broad range of settings. In Section 4.2.2, variance estimators of the direct, indirect, total, and overall causal effect estimators are proposed. Under the additional assumption of stratified interference, these variance estimators are shown to be unbiased if the causal effects are additive and positively biased otherwise.

4.2.1 Stratified Interference

Suppose that R^{n_i}, that is, the set of possible treatment programs for group i, can be partitioned into strata such that within strata there is no interference. In particular, we assume:

Assumption 2 (Stratified interference). For k = 1, . . . , n_i - 1, $Y_{i j} (z_{i}) = Y_{i j} (z_{i}^{'})$ for all z_i, $z_{i}^{'} \in R_{k}^{n_{i}}$ such that $z_{i j} = z_{i j}^{'}$ .

To illustrate the meaning of Assumption 2, consider a study of the effects of an intervention on children in a school. The stratified interference assumption states that the outcome for a child receiving the intervention will be the same when k - 1 schoolmates also receive the intervention, regardless of which particular k - 1 schoolmates receive the intervention. This assumption can be viewed as an intermediate assumption between (i) assuming no interference within a group and (ii) making no assumptions about the nature of interference within a group. Moreover, because there are n_i possible values of K_i given z_ij = z, it follows that Y_ij (z_i(j), z_ij = z) can take on n_i values. Thus, for a given z_ij = z, an individual has n_i potential outcomes under Assumption 2 compared to only one potential outcome under (i) and 2^{n_i - 1} potential outcomes under (ii).
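Assumption 2 can be checked by enumeration for any fully specified set of potential outcomes in a small group. The sketch below is illustrative: `satisfies_stratified_interference` and the two toy outcome functions are hypothetical, chosen so that one depends on others' treatments only through how many are treated and the other depends on which individual is treated.

```python
from itertools import product


def satisfies_stratified_interference(outcome, n):
    """Check Assumption 2 by enumeration: for each k in 1..n-1, Y_j(z) must be
    the same for all programs z with k treated that share the same z_j."""
    for j in range(n):
        seen = {}
        for z in product((0, 1), repeat=n):
            k = sum(z)
            if not 1 <= k <= n - 1:
                continue
            y = outcome(j, z)
            # First value seen for this (own treatment, number treated) stratum.
            if seen.setdefault((z[j], k), y) != y:
                return False
    return True


def count_based(j, z):
    """Hypothetical outcome depending on others only through how many are treated."""
    return 10 - 5 * z[j] - 2 * (sum(z) - z[j])


def neighbor_based(j, z):
    """Hypothetical outcome depending on which specific individual is treated."""
    return 10 - 5 * z[j] - 2 * z[(j + 1) % len(z)]


print(satisfies_stratified_interference(count_based, 4))     # True
print(satisfies_stratified_interference(neighbor_based, 4))  # False

# Number of potential outcomes for a fixed own treatment, per the text:
n_i = 4
print(1, n_i, 2 ** (n_i - 1))  # no interference, stratified, unrestricted
```

In the school example, `count_based` corresponds to a child whose outcome depends only on how many schoolmates receive the intervention, while `neighbor_based` corresponds to a child affected by one particular schoolmate.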

To illustrate the utility of Assumption 2, again consider estimating $Var ({\hat{Y}}_{i} (1; ψ) ∣ S_{i} = 1)$ . Suppose, by way of contradiction, there exists an unbiased estimator of $Var ({\hat{Y}}_{i} (1; ψ) ∣ S_{i} = 1)$ in general, that is, under Assumption 1 only. Denote this estimator by g(O_i1 (Z_i)), where O_i1 (Z_i) ≡ {Y_ij (Z_i) : Z_ij = 1} is the set of observed outcomes for individuals in group i assigned treatment Z_ij = 1 and g is some real-valued function of O_i1 (Z_i). By our supposition, $E {g (O_{i 1} (Z_{i})) ∣ S_{i} = 1} = Var ({\hat{Y}}_{i} (1; ψ) ∣ S_{i} = 1)$ under Assumption 1. Consider the following two different scenarios where n_i = 3 and K_i = 2 given S_i = 1.

First, suppose Y_ij (z_i) = κ for j = 1, 2, 3, and $z_{i} \in R_{2}^{3} = \{(011), (110), (101)\}$ , where κ is some constant. In other words, the potential outcomes in group i are constant for all individuals and all treatment programs given S_i = 1. Then $Var ({\hat{Y}}_{i} (1; ψ) ∣ S_{i} = 1) = 0$ , implying g({κ, κ}) = 0 for any constant κ.

Second, suppose Y_ij (z_i) = f (z_i) for j = 1, 2, 3 and $z_{i} \in R_{2}^{3}$ , where f is some real-valued function of z_i. In other words, for any treatment program $z_{i} \in R_{2}^{3}$ , all individuals in group i have the same response. Suppose also that f (011) ≠ f (110) ≠ f (101). Now ${\hat{Y}}_{i} (1; ψ) = f (Z_{i})$ , implying $Var ({\hat{Y}}_{i} (1; ψ) ∣ S_{i} = 1)$ equals the sample variance of the set $\{f (z_{i}) : z_{i} \in R_{2}^{3}\}$ . Thus, $Var ({\hat{Y}}_{i} (1; ψ) ∣ S_{i} = 1) > 0$ . However, because O_i1 (Z_i) = {f (Z_i), f (Z_i)} in this scenario, it follows that E{g(O_i1 (Z_i))| S_i = 1} = 0. Thus, g is a biased estimator of $Var ({\hat{Y}}_{i} (1; ψ) ∣ S_{i} = 1)$ , a contradiction.

Intuitively, an unbiased estimator of $Var ({\hat{Y}}_{i} (1; ψ) ∣ S_{i} = 1)$ does not exist in general because the observed data provide no way to distinguish between these two scenarios: under either scenario, the observed outcomes are all equal, that is, Y_i1 (Z_i) = Y_i2 (Z_i) = Y_i3 (Z_i). However, with the addition of Assumption 2, one can rule out the possibility of the second scenario. Namely, under Assumption 2, Y_i1 (110) = Y_i1 (101) and Y_i2 (110) = Y_i2 (011), implying f (011) = f (110) = f (101).
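
The two scenarios can be enumerated directly. The sketch below is illustrative only: the constant `kappa`, the values assigned to `f`, and the helper names are hypothetical, and the variance is taken over the three equally likely programs with n_i = 3 and K_i = 2.

```python
from itertools import combinations

def programs(n, k):
    """All treatment programs with exactly k of n individuals treated."""
    return [tuple(1 if j in chosen else 0 for j in range(n))
            for chosen in combinations(range(n), k)]

def summarize(outcome, n=3, k=2):
    """Return (variance of the treated-group mean over programs,
    largest spread seen within any observed treated set)."""
    means, spreads = [], []
    for z in programs(n, k):
        treated = [outcome(j, z) for j in range(n) if z[j] == 1]
        means.append(sum(treated) / k)
        spreads.append(max(treated) - min(treated))
    centre = sum(means) / len(means)
    variance = sum((m - centre) ** 2 for m in means) / len(means)
    return variance, max(spreads)

kappa = 2.0                                              # hypothetical constant
f = {(0, 1, 1): 1.0, (1, 1, 0): 2.0, (1, 0, 1): 4.0}     # hypothetical f(z_i)

constant = summarize(lambda j, z: kappa)       # first scenario
program_only = summarize(lambda j, z: f[z])    # second scenario
print(constant, program_only)
```

In both scenarios every observed treated set has zero spread, yet the variance of the estimator is zero in the first and positive in the second, which is why no function of the observed treated outcomes alone can be unbiased for it.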

More generally, unbiased variance estimators do not exist without further assumptions (such as stratified interference) because observing Y_ij (z_i) provides no information about Y_ij (ω_i) for ω_i ≠ z_i. Under Assumption 2, each individual now has only two potential outcomes, one for z_ij = 0 and one for z_ij = 1, within a particular stratum $R_{k}^{n_{i}}$ . Therefore, given S_i, the observed data under one treatment program will provide information about the potential outcomes under other treatment programs. For example, suppose ψ is a mixed strategy such that K_i is fixed. Then, under Assumption 2, the outcomes for individual j are constant for all $z_{i} \in R_{K_{i}}^{n_{i}}$ such that z_ij = 1. Denote this value by Y_ij (1; ψ), that is, Y_ij (1; ψ) ≡ Y_ij (ω, z_ij = 1) for any $ω \in R_{K_{i} - 1}^{n_{i} - 1}$ . Define Y_ij (0; ψ) similarly.

4.2.2 Variance Estimators Assuming Stratified Interference

Theorem 4

Let

\hat{Var} ({\hat{Y}}_{i} (1; ψ) ∣ S_{i} = 1) \equiv (1 - \frac{K_{i}}{n_{i}}) \frac{{\hat{σ}}_{i 1}^{2} (ψ)}{K_{i}},

where ${\hat{σ}}_{i 1}^{2} (ψ) \equiv \sum_{j = 1}^{n_{i}} {Y_{i j} (1; ψ) - {\hat{Y}}_{i} (1; ψ)}^{2} Z_{i j} ∕ (K_{i} - 1)$ is the within-group sample variance, and

\hat{Var} (\hat{Y} (1; ψ)) \equiv (1 - \frac{C}{N}) \frac{{\hat{σ}}_{g 1}^{2} (ψ)}{C} + \frac{1}{C N} \sum_{i = 1}^{N} (1 - \frac{K_{i}}{n_{i}}) \frac{{\hat{σ}}_{i 1}^{2} (ψ)}{K_{i}} S_{i},

where ${\hat{σ}}_{g 1}^{2} (ψ) \equiv \sum_{i = 1}^{N} {{\hat{Y}}_{i} (1; ψ) - \hat{Y} (1; ψ)}^{2} S_{i} ∕ (C - 1)$ . Define $\hat{Var} ({\hat{Y}}_{i} (0; ψ) ∣ S_{i} = 1)$ , ${\hat{σ}}_{i 0}^{2} (ψ)$ , $\hat{Var} (\hat{Y} (0; ψ))$ , and ${\hat{σ}}_{g 0}^{2} (ψ)$ analogously. Under Assumptions 1 and 2,

E {\hat{Var} ({\hat{Y}}_{i} (z; ψ) ∣ S_{i} = 1) ∣ S_{i} = 1} = Var ({\hat{Y}}_{i} (z; ψ) ∣ S_{i} = 1)

(7)

and $E {\hat{Var} (\hat{Y} (z; ψ))} = Var (\hat{Y} (z; ψ))$ for z = 0, 1.
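
The within-group part of Theorem 4, equation (7), can be verified exactly for a small group by averaging over every program with K_i treated individuals. This is a sketch under Assumption 2, so each individual has a single fixed value Y_ij (1; ψ); the function name and the outcome values are hypothetical.

```python
from itertools import combinations
from fractions import Fraction

def within_group_check(y1, K):
    """Exact E[Var-hat] and Var of the treated-sample mean over all
    equally likely programs with K treated, using rational arithmetic."""
    n = len(y1)
    means, var_hats = [], []
    for treated in combinations(range(n), K):
        ys = [y1[j] for j in treated]
        mean = sum(ys) / K
        s2 = sum((y - mean) ** 2 for y in ys) / (K - 1)    # sigma-hat^2_{i1}
        means.append(mean)
        var_hats.append((1 - Fraction(K, n)) * s2 / K)     # estimator in Theorem 4
    grand = sum(means) / len(means)
    true_var = sum((m - grand) ** 2 for m in means) / len(means)
    return sum(var_hats) / len(var_hats), true_var

y1 = [Fraction(v) for v in (0, 1, 1, 3, 7)]   # hypothetical Y_ij(1; psi)
expected_var_hat, true_var = within_group_check(y1, K=3)
print(expected_var_hat, true_var)
```

Because the enumeration uses exact fractions, the two returned quantities agree exactly rather than approximately.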

Theorem 5

Let

\hat{Var} ({\hat{C E}}_{i}^{D} (ψ) ∣ S_{i} = 1) \equiv \frac{{\hat{σ}}_{i 1}^{2} (ψ)}{K_{i}} + \frac{{\hat{σ}}_{i 0}^{2} (ψ)}{n_{i} - K_{i}} .

(8)

Under Assumptions 1 and 2,

E {\hat{Var} ({\hat{C E}}_{i}^{D} (ψ) ∣ S_{i} = 1) ∣ S_{i} = 1} = Var ({\hat{C E}}_{i}^{D} (ψ) ∣ S_{i} = 1) + σ_{i (0 - 1)}^{2} (ψ) ∕ n_{i},

where $σ_{i (0 - 1)}^{2} (ψ) \equiv \sum_{j = 1}^{n_{i}} {[{Y_{i j} (0; ψ) - Y_{i j} (1; ψ)} - {{\overset{‒}{Y}}_{i} (0; ψ) - {\overset{‒}{Y}}_{i} (1; ψ)}]}^{2} ∕ (n_{i} - 1)$ is the variance of the n_i differences Y_ij (0; ψ) - Y_ij (1; ψ).

Corollary

Under Assumptions 1 and 2,

E {\hat{Var} ({\hat{C E}}_{i}^{D} (ψ) ∣ S_{i} = 1) ∣ S_{i} = 1} \geq Var ({\hat{C E}}_{i}^{D} (ψ) ∣ S_{i} = 1),

with equality holding if and only if

Y_{i j} (0; ψ) = Y_{i j} (1; ψ) + η_{D i}

(9)

for fixed constant η_Di and j = 1, . . . , n_i.

Remark

The corollary to Theorem 5 says (8) is a conditionally unbiased estimator of $Var ({\hat{C E}}_{i}^{D} (ψ) ∣ S_{i} = 1)$ if and only if the individual direct effect is additive. If (9) does not hold, (8) will be a positively biased estimator of $Var ({\hat{C E}}_{i}^{D} (ψ) ∣ S_{i} = 1)$ . This could occur, for instance, if the potential outcomes are binary, taking on values 0 and 1 only. In this case, (9) will only be true if either (i) η_{D_i} = 0 or (ii) |η_{D_i}| = 1, with (ii) corresponding to the scenario that either (Y_ij (0; ψ), Y_ij (1; ψ)) = (0, 1) for all j or (Y_ij (0; ψ), Y_ij (1; ψ)) = (1, 0) for all j.
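
The bias term in Theorem 5 can be checked the same way. The sketch below enumerates all programs for a small group with binary outcomes; the helper names and outcome vectors are hypothetical, and K_i and n_i - K_i must both be at least 2 so the sample variances are defined.

```python
from itertools import combinations
from fractions import Fraction

def s2(values):
    """Variance with divisor len(values) - 1."""
    m = sum(values) / len(values)
    return sum((v - m) ** 2 for v in values) / (len(values) - 1)

def direct_effect_check(y0, y1, K):
    """Return (E[Var-hat], Var, sigma^2_{i(0-1)} / n_i) by exact enumeration."""
    n = len(y0)
    estimates, var_hats = [], []
    for treated in combinations(range(n), K):
        t = [y1[j] for j in treated]
        c = [y0[j] for j in range(n) if j not in treated]
        estimates.append(sum(c) / len(c) - sum(t) / len(t))   # direct effect estimate
        var_hats.append(s2(t) / K + s2(c) / (n - K))          # estimator (8)
    m = sum(estimates) / len(estimates)
    true_var = sum((e - m) ** 2 for e in estimates) / len(estimates)
    bias_term = s2([a - b for a, b in zip(y0, y1)]) / n
    return sum(var_hats) / len(var_hats), true_var, bias_term

y0 = [Fraction(v) for v in (1, 0, 1, 1, 0)]   # hypothetical Y_ij(0; psi)
y1 = [Fraction(v) for v in (0, 0, 1, 0, 0)]   # hypothetical Y_ij(1; psi)
expected_hat, true_var, bias = direct_effect_check(y0, y1, K=2)
# Additive case (9): shifting every Y_ij(1; psi) by the same constant removes the bias.
additive = direct_effect_check([v + 2 for v in y1], y1, K=2)
print(expected_hat, true_var, bias, additive)
```

With the non-additive binary outcomes the expected value of (8) exceeds the true variance by exactly the stated term, and in the additive case the two coincide.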

Theorem 6

Let

\hat{Var} ({\hat{C E}}^{D} (ψ)) \equiv (1 - \frac{C}{N}) \frac{{\hat{σ}}_{D}^{2} (ψ)}{C} + \frac{1}{C N} \sum_{i = 1}^{N} \hat{Var} ({\hat{C E}}_{i}^{D} (ψ) ∣ S_{i} = 1) S_{i},

(10)

where ${\hat{σ}}_{D}^{2} (ψ) \equiv \sum_{i = 1}^{N} {{\hat{CE}}_{i}^{D} (ψ) - {\hat{C E}}^{D} (ψ)}^{2} S_{i} ∕ (C - 1)$ . Under Assumptions 1 and 2,

E {\hat{Var} ({\hat{C E}}^{D} (ψ))} = Var ({\hat{C E}}^{D} (ψ)) + \frac{1}{N^{2}} \sum_{i = 1}^{N} σ_{i (0 - 1)}^{2} (ψ) ∕ n_{i} .

Corollary

Under Assumptions 1 and 2, $E {\hat{Var} ({\hat{C E}}^{D} (ψ))} \geq Var ({\hat{C E}}^{D} (ψ))$ with equality holding if and only if (9) holds for all i = 1, . . . , N.

Remark

The corollary to Theorem 6 is similar to the corollary to Theorem 5 in that (10) is an unbiased estimator of $Var ({\hat{C E}}^{D} (ψ))$ if and only if the individual direct effects are additive. If direct additivity does not hold for all individuals, (10) will be positively biased. Analogous results for the group average indirect, total, and overall effects follow from Theorems 7-9.

Theorem 7

Let $\hat{Var} ({\hat{C E}}^{I} (ϕ, ψ)) \equiv {\hat{σ}}_{g 0}^{2} (ϕ) ∕ (N - C) + {\hat{σ}}_{g 0}^{2} (ψ) ∕ C$ . Under Assumptions 1 and 2,

E {\hat{Var} ({\hat{C E}}^{I} (ϕ, ψ))} = Var ({\hat{C E}}^{I} (ϕ, ψ)) + σ_{g (0 - 0)}^{2} (ϕ, ψ) ∕ N,

where $σ_{g (0 - 0)}^{2} (ϕ, ψ) \equiv \sum_{i = 1}^{N} {[{{\overset{‒}{Y}}_{i} (0; ϕ) - {\overset{‒}{Y}}_{i} (0; ψ)} - {\overset{‒}{Y} (0; ϕ) - \overset{‒}{Y} (0; ψ)}]}^{2} ∕ (N - 1)$ is the variance of the N differences ${\overset{‒}{Y}}_{i} (0; ϕ) - {\overset{‒}{Y}}_{i} (0; ψ)$ .

Corollary

Under Assumptions 1 and 2, $E {\hat{Var} ({\hat{C E}}^{I} (ϕ, ψ))} \geq Var ({\hat{C E}}^{I} (ϕ, ψ))$ with equality holding if and only if ${\overset{‒}{Y}}_{i} (0; ϕ) = {\overset{‒}{Y}}_{i} (0; ψ) + η_{I}$ for fixed constant η_I and i = 1, . . . , N.

Theorem 8

Let $\hat{Var} ({\hat{C E}}^{T} (ϕ, ψ)) \equiv {\hat{σ}}_{g 0}^{2} (ϕ) ∕ (N - C) + {\hat{σ}}_{g 1}^{2} (ψ) ∕ C$ . Under Assumptions 1 and 2,

E {\hat{Var} ({\hat{C E}}^{T} (ϕ, ψ))} = Var ({\hat{C E}}^{T} (ϕ, ψ)) + σ_{g (0 - 1)}^{2} (ϕ, ψ) ∕ N,

where $σ_{g (0 - 1)}^{2} (ϕ, ψ) \equiv \sum_{i = 1}^{N} {[{{\overset{‒}{Y}}_{i} (0; ϕ) - {\overset{‒}{Y}}_{i} (1; ψ)} - {\overset{‒}{Y} (0; ϕ) - \overset{‒}{Y} (1; ψ)}]}^{2} ∕ (N - 1)$ .

Corollary

Under Assumptions 1 and 2, $E {\hat{Var} ({\hat{C E}}^{T} (ϕ, ψ))} \geq Var ({\hat{C E}}^{T} (ϕ, ψ))$ with equality holding if and only if ${\overset{‒}{Y}}_{i} (0; ϕ) = {\overset{‒}{Y}}_{i} (1; ψ) + η_{T}$ for fixed constant η_T and i = 1, . . . , N.

Theorem 9

Let $\hat{Var} ({\hat{C E}}^{O} (ϕ, ψ)) \equiv {\hat{σ}}_{M}^{2} (ϕ) ∕ (N - C) + {\hat{σ}}_{M}^{2} (ψ) ∕ C$ , where ${\hat{σ}}_{M}^{2} (ψ) \equiv \sum_{i = 1}^{N} {{\hat{Y}}_{i} (ψ) - \hat{Y} (ψ)}^{2} S_{i} ∕ (C - 1)$ and ${\hat{σ}}_{M}^{2} (ϕ)$ is defined analogously. Under Assumptions 1 and 2,

E {\hat{Var} ({\hat{C E}}^{O} (ϕ, ψ))} = Var ({\hat{C E}}^{O} (ϕ, ψ)) + σ_{M}^{2} (ϕ, ψ) ∕ N,

where $σ_{M}^{2} (ϕ, ψ) \equiv \sum_{i = 1}^{N} {[{{\overset{‒}{Y}}_{i} (ϕ) - {\overset{‒}{Y}}_{i} (ψ)} - {\overset{‒}{Y} (ϕ) - \overset{‒}{Y} (ψ)}]}^{2} ∕ (N - 1)$ .

Corollary

Under Assumptions 1 and 2, $E {\hat{Var} ({\hat{C E}}^{O} (ϕ, ψ))} \geq Var ({\hat{C E}}^{O} (ϕ, ψ))$ with equality holding if and only if ${\overset{‒}{Y}}_{i} (ϕ) = {\overset{‒}{Y}}_{i} (ψ) + η_{O}$ for fixed constant η_O and i = 1, . . . , N.

5. EXAMPLES

5.1 Housing Vouchers

Motivated by randomized studies designed to assess the effect of vouchers on housing mobility, Sobel (2006) proposed causal estimands and estimators when interference between units is present. At the first level, Sobel considered the effect of housing vouchers on the lease-up rate, that is, whether a household moves. At the second level, he considered voucher effects on other outcomes such as parents’ perceptions of safety, welfare receipt, and child health. In this section, some of these estimands and estimators are shown to be special cases of those defined in Sections 3 and 4.1. To begin, we demonstrate that Sobel’s causal estimand and estimator of the voucher effect on the lease-up rate are examples of the group average total causal effect estimand and estimator. Because Sobel considered just one group, we drop the subscript i for group in the rest of this section.

Consider a study where n households within a neighborhood are randomized to receive a housing voucher. In our terminology, households correspond to individuals and the neighborhood corresponds to a single group. Let Z_j = 1 if the j th household receives a voucher, Z_j = 0 otherwise for j = 1, . . . , n. Let Y_j (z) = 1 if the j th household moves using a voucher, Y_j (z) = 0 otherwise. Because moving using a voucher is clearly not possible without a voucher, it follows immediately from the definition of Y_j (z) that

Y_{j} (z_{(j)}, z_{j} = 0) = 0 for j = 1, \dots, n .

(11)

Suppose Pr_ϕ(Z = 0) = 1; that is, ϕ corresponds to the benchmark allocation strategy where no household receives a voucher. Then (11) implies

{\overset{‒}{C E}}^{T} (ϕ, ψ) = - \frac{1}{n} \sum_{j = 1}^{n} \sum_{z \in R^{n - 1}} Y_{j} (Z_{(j)} = z, Z_{j} = 1) \times \Pr_{ψ} (Z_{(j)} = z ∣ Z_{j} = 1)

for any other household assignment strategy ψ. In particular, suppose ψ corresponds to the mixed assignment strategy, where exactly k of n households receive a voucher. Then

\begin{matrix} {\overset{‒}{C E}}^{T} (ϕ, ψ) & = - \frac{(k - 1)! (n - k)!}{n!} \sum_{z \in R_{k - 1}^{n - 1}} \sum_{j = 1}^{n} Y_{j} (Z_{(j)} = z, Z_{j} = 1) \\ & = - \frac{k! (n - k)!}{n!} \sum_{z \in R_{k - 1}^{n - 1}} \sum_{ζ = 0}^{1} \frac{1}{k} \sum_{j = 1}^{n} Y_{j} (Z_{(j)} = z, Z_{j} = ζ) ζ \\ & = - \frac{k! (n - k)!}{n!} \sum_{z \in R_{k}^{n}} \frac{1}{k} \sum_{j = 1}^{n} Y_{j} (Z = z) z_{j}, \end{matrix}

which is equivalent (up to a minus sign) to equation (2) of Sobel in the setting where there are two levels of treatment. Sobel actually considered the more general situation of three treatment levels, which is not considered here. Sobel’s corresponding estimator, the observed lease-up rate among voucher recipients, is equivalent to $\hat{Y} (1; ψ)$ as given by (6). Under exclusion restriction (11), ${\hat{C E}}^{T} (ϕ, ψ) = - \hat{Y} (1; ψ)$ .
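
As a worked check of the leading factor in the display above, assume (as for a mixed strategy) that each of the $\binom{n}{k}$ programs with exactly k vouchers is equally likely under ψ. Conditional on Z_j = 1, the remaining k - 1 vouchers are then spread uniformly over the other n - 1 households, so for $z \in R_{k - 1}^{n - 1}$

\frac{1}{n} \Pr_{ψ} (Z_{(j)} = z ∣ Z_{j} = 1) = \frac{1}{n} \binom{n - 1}{k - 1}^{- 1} = \frac{1}{n} \frac{(k - 1)! (n - k)!}{(n - 1)!} = \frac{(k - 1)! (n - k)!}{n!},

and the conditional probability is 0 for any other z. Multiplying and dividing by k gives the factor k! (n - k)! ∕ n! paired with 1 ∕ k in the subsequent lines.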

Although interference is possible among those receiving housing vouchers, the exclusion restriction (11) precludes interference when a household does not receive a voucher. Thus, the indirect effect of housing vouchers on mobility is 0, ${\overset{‒}{C E}}^{I} (ϕ, ψ) = 0$ , and the total effects equal the direct effects. If households could move without the aid of a voucher, an analysis based on (11) could potentially overestimate the magnitude of the total effect of vouchers. By instead defining Y_j (z) = 1 if the j th household moves and 0 otherwise, a voucher allocation strategy may have indirect effects in those not receiving vouchers as well as direct and total causal effects. For example, this would allow for the possibility that if several neighbors move because they receive vouchers, household j might also move even though they did not obtain a voucher. Estimating such indirect causal effects of voucher distribution within a neighborhood on those households that do not receive a voucher would likely be of interest to policy makers. Of course, in this case, observation of potential outcomes under the benchmark allocation program Z = 0 would also be necessary to estimate the indirect and total effects.

Sobel’s estimand of the effects of vouchers on outcomes such as welfare receipt can be viewed as an example of the group average overall effect estimand. For instance, assume all households that receive a voucher subsequently move and ϕ is the benchmark allocation strategy. Then taking the expected value of Sobel’s “average effect” [his eq. (3)] over the distribution of possible intervention programs under ψ yields the group average overall effect ${\overset{‒}{C E}}^{O} (ϕ, ψ)$ . Sobel noted that the average overall effect is a weighted average of the indirect effects on those not receiving a voucher, which he called spillover effects, and the effects in those receiving a voucher. More precisely, for ϕ the benchmark allocation strategy, the overall effect equals the following weighted sum of the indirect and total effects: ${\overset{‒}{C E}}^{O} (ϕ, ψ) = \Pr_{ψ} (Z_{j} = 0) {\overset{‒}{C E}}^{I} (ϕ, ψ) + \Pr_{ψ} (Z_{j} = 1) {\overset{‒}{C E}}^{T} (ϕ, ψ)$ .
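
The weighted-sum identity can be derived in two steps. This sketch assumes, as under a mixed strategy, that $\Pr_{ψ} (Z_{j} = ζ)$ is the same for every household j, and uses the definitions of the indirect, total, and overall effects as contrasts of $\overset{‒}{Y} (0; ϕ)$ with $\overset{‒}{Y} (0; ψ)$, of $\overset{‒}{Y} (0; ϕ)$ with $\overset{‒}{Y} (1; ψ)$, and of $\overset{‒}{Y} (ϕ)$ with $\overset{‒}{Y} (ψ)$. First, the marginal average under ψ decomposes as

\overset{‒}{Y} (ψ) = \Pr_{ψ} (Z_{j} = 0) \overset{‒}{Y} (0; ψ) + \Pr_{ψ} (Z_{j} = 1) \overset{‒}{Y} (1; ψ),

while under the benchmark strategy $\overset{‒}{Y} (ϕ) = \overset{‒}{Y} (0; ϕ)$ . Second, because the two probabilities sum to 1,

\begin{matrix} {\overset{‒}{C E}}^{O} (ϕ, ψ) & = \overset{‒}{Y} (0; ϕ) - \overset{‒}{Y} (ψ) \\ & = \Pr_{ψ} (Z_{j} = 0) \{\overset{‒}{Y} (0; ϕ) - \overset{‒}{Y} (0; ψ)\} + \Pr_{ψ} (Z_{j} = 1) \{\overset{‒}{Y} (0; ϕ) - \overset{‒}{Y} (1; ψ)\} . \end{matrix}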

Because the design of the housing voucher study does not include randomizing some neighborhoods to the benchmark allocation, outcomes such as welfare receipt and parents’ perception of safety are not observed under this allocation. Thus, without further assumptions akin to (11), voucher effects on these outcomes are not identifiable from the data. Alternatively, these effects are identifiable by considering a population of neighborhoods and a two-stage randomization design. For example, neighborhoods within a city or set of cities could be identified that were sufficiently separated geographically to ensure that the assumption of no interference between neighborhoods is plausible. Then, in the first stage of randomization, some neighborhoods could be randomly assigned the benchmark allocation and other neighborhoods to an allocation strategy where in the second stage, a specified proportion of randomly selected households would receive a voucher. Such a design would permit estimation of the direct, indirect, total, and overall effects of housing vouchers on the outcomes described previously without making exclusion restrictions such as (11). Estimation of the variances of the causal effect estimators would also be possible with this design under the additional assumption of stratified interference.

5.2 Vaccines

Direct application of the proposed methods to the data given in Table 1 is not appropriate because baris were not randomly assigned to particular levels of vaccine coverage in the actual trial. Therefore, for illustrative purposes, we consider a hypothetical two-stage randomized placebo-controlled trial of cholera vaccines in a setting similar to that of Ali et al. (2005). Suppose in the first stage of this hypothetical trial that five geographically separate groups were randomized using a mixed allocation strategy ν such that three groups were assigned ϕ and the remaining two were assigned ψ. Then, in the second stage, suppose 30% of individuals were randomly chosen to receive vaccine within groups assigned ϕ and 50% of individuals were randomly chosen to receive vaccine within groups assigned ψ. Individuals were then followed for one year for detection of cholera. Results from this hypothetical trial are given in Table 2.

Table 2.

Illustrative example of a two-stage randomized placebo-controlled vaccine trial based on data from Ali et al. (2005)

Group i	Group assignment S_i	Vaccine recipients (Z_ij = 1): total Σ_j Z_ij	Vaccine recipients (Z_ij = 1): cases Σ_j Z_ij Y_ij (Z_i)	Placebo recipients (Z_ij = 0): total Σ_j (1 - Z_ij)	Placebo recipients (Z_ij = 0): cases Σ_j (1 - Z_ij) Y_ij (Z_i)
1	1	12,541	16	12,541	18
2	1	11,513	26	11,513	54
3	0	10,772	17	25,134	119
4	0	8,883	22	20,727	122
5	0	5,627	15	13,130	92


NOTE: Group assignment S_i = 1 (0) corresponds to 50% (30%) vaccine coverage.

Estimates of the population average direct, indirect, total, and overall effects are given in Table 3. The estimated variances are also presented. Note the direct effect estimate is nearly three times greater under ϕ (30% coverage) compared to ψ (50% coverage). Ali et al. (2005) noted a similar phenomenon and cautioned that high levels of vaccine coverage can bias estimates of vaccine efficacy (i.e., the direct effect of vaccination). In fact, the issue here is not one of bias, but rather that the parameter being estimated can depend on the level of coverage due to interference between individuals. Moreover, the totality of effects of a vaccination strategy must be viewed by considering indirect, total, and overall effects in addition to direct effects. Estimates of these other effects can easily be interpreted by investigators. For example, the indirect effect estimate in Table 3 suggests 50% vaccine coverage results in 2.8 fewer cholera cases per 1,000 unvaccinated individuals per year compared to only 30% vaccine coverage. Note the estimated total effect of being vaccinated under ψ is over three times the corresponding estimated direct effect, demonstrating the importance of randomizing groups to different levels of vaccine coverage. Had all groups received 50% coverage such that only the direct effect could be estimated, the utility of vaccination would have been substantially underestimated. The estimated overall effect provides a simple summary comparison of the two strategies, indicating that, on average, 50% vaccine coverage results in 2.4 fewer cases of cholera per 1,000 individuals per year compared to 30% vaccine coverage.

Table 3.

Estimates of population average direct, indirect, total, and overall effects per 1,000 individuals per year for data in Table 2

Effect	Parameter	Estimate	Estimated variance
Direct	${\overset{‒}{C E}}^{D} (ψ)$	1.30	.856
Direct	${\overset{‒}{C E}}^{D} (ϕ)$	3.64	.178
Indirect	${\overset{‒}{C E}}^{I} (ϕ, ψ)$	2.81	3.079
Total	${\overset{‒}{C E}}^{T} (ϕ, ψ)$	4.11	.672
Overall	${\overset{‒}{C E}}^{O} (ϕ, ψ)$	2.37	1.430
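
The entries of Table 3 can be recomputed from the counts in Table 2 using the estimators of Section 4.1 and the variance estimators of Theorems 6-9. The following is a sketch rather than the authors' code: all names are hypothetical, outcomes are scaled to cases per 1,000, and C is read as the number of groups assigned the strategy in question (2 for ψ, 3 for ϕ).

```python
# Table 2 rows: (S_i, vaccinated total, vaccinated cases, placebo total, placebo cases)
TABLE2 = [
    (1, 12541, 16, 12541, 18),
    (1, 11513, 26, 11513, 54),
    (0, 10772, 17, 25134, 119),
    (0, 8883, 22, 20727, 122),
    (0, 5627, 15, 13130, 92),
]
SCALE = 1000.0          # report effects per 1,000 individuals
N = len(TABLE2)         # number of groups

def mean(xs):
    return sum(xs) / len(xs)

def s2(xs):
    """Sample variance with divisor len(xs) - 1."""
    m = mean(xs)
    return sum((x - m) ** 2 for x in xs) / (len(xs) - 1)

def group_summary(row):
    _, n1, c1, n0, c0 = row
    p1, p0 = c1 / n1, c0 / n0
    return {
        "y1": SCALE * p1,                        # incidence among vaccine recipients
        "y0": SCALE * p0,                        # incidence among placebo recipients
        "y": SCALE * (c1 + c0) / (n1 + n0),      # marginal incidence in the group
        # Estimator (8); for 0/1 outcomes sigma-hat^2 / K equals p(1 - p) / (K - 1).
        "var_direct": SCALE ** 2 * (p1 * (1 - p1) / (n1 - 1) + p0 * (1 - p0) / (n0 - 1)),
    }

def strategy(s):
    return [group_summary(r) for r in TABLE2 if r[0] == s]

def direct(groups):
    """Point estimate and variance estimator (10) for the direct effect."""
    C = len(groups)
    ce = [g["y0"] - g["y1"] for g in groups]
    var = (1 - C / N) * s2(ce) / C + sum(g["var_direct"] for g in groups) / (C * N)
    return mean(ce), var

def contrast(phi, psi, key_phi, key_psi):
    """Difference of group-level means with the variance estimators of Theorems 7-9."""
    a = [g[key_phi] for g in phi]
    b = [g[key_psi] for g in psi]
    return mean(a) - mean(b), s2(a) / len(a) + s2(b) / len(b)

phi, psi = strategy(0), strategy(1)   # 30% and 50% coverage
table3 = {
    "direct_psi": direct(psi),
    "direct_phi": direct(phi),
    "indirect": contrast(phi, psi, "y0", "y0"),
    "total": contrast(phi, psi, "y0", "y1"),
    "overall": contrast(phi, psi, "y", "y"),
}
for name, (est, var) in table3.items():
    print(f"{name:11s} estimate={est:.2f} variance={var:.3f}")
```

Under these readings, the computed estimates and variances agree with Table 3 up to rounding.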


6. DISCUSSION

In this article, estimands for direct, indirect, total, and overall causal effects of different treatment strategies are proposed in the setting where interference between individuals is possible. Relations between the estimands are established, and estimators of the proposed estimands are presented. These estimators are shown to be unbiased assuming a two-stage randomization procedure with a mixed assignment mechanism at each stage (Assumption 1). Under an additional assumption of stratified interference (Assumption 2), variance estimators of the causal effect estimators are derived that are unbiased under additivity and positively biased otherwise.

This article builds on previous work in several significant ways. First, causal inference in the presence of interference is considered in a general framework not specific to any one subject area, unifying previous work on housing mobility studies (Sobel 2006) and infectious diseases (Halloran and Struchiner 1991, 1995). The definitions of individual direct, indirect, total, and overall causal effects in the presence of interference (Halloran and Struchiner 1991, 1995) are formally extended to groups and populations of groups by averaging over all possible treatment assignments for particular allocation strategies (Sobel 2006). By considering a population of groups, rather than just one group as in Sobel (2006), unbiased estimators of the causal estimands of interest are derived without requiring exclusion restriction assumptions. The variance of causal effect estimators in the presence of interference is also considered, which had not been done previously. The utility of the proposed variance estimators will depend on whether Assumptions 1 and 2 are reasonable. Assumption 1 is determined by the experimental design and, thus, should be under control of the investigator. Assumption 2 may be reasonable in many settings, such as in the evaluation of the effects of vaccines, educational interventions, or housing vouchers. However, in other contexts, such as in cross-over trials or plant variety studies, assuming different forms of interference may be more appropriate.

The methods developed here could be extended to settings with more than two treatment levels or noncompliance as in Sobel (2006). Consideration of population subgroups may be of interest in the presence of interference. As in Halloran, Longini, Cowart, and Nizam (2002), one could define the indirect, total, and overall effects for different subgroups of the population. For example, a strategy of vaccinating 70% of children against influenza and another strategy of not vaccinating children could be compared by the indirect effects on the incidence of influenza in adults from the same population. Similarly, Moulton et al. (2001) and Sobel (2006) considered populations consisting of participants and nonparticipants; that is, a subset of the population does not receive either treatment.

In addition to these extensions, other areas of research remain to be explored in the setting where interference between individuals is present. Different randomization strategies and interference structures might be considered. An anonymous referee suggested more efficient variance estimators might be derived if one is willing to make certain additional additivity assumptions. For example, under (9), ${\hat{σ}}_{i 1}^{2} (ψ)$ and ${\hat{σ}}_{i 0}^{2} (ψ)$ are estimating the same parameter, and, thus, more efficient variance estimators might be obtained by combining data from individuals assigned Z_ij = 0 and individuals assigned Z_ij = 1. Stochastic causal models could also be considered, wherein the potential outcomes Y_ij (z_i) are treated as random rather than fixed (e.g., as in Robins and Greenland 1989). Methods are needed to construct confidence intervals for the different causal effects, perhaps by building on the work of Rosenbaum (2007).

Rubin (2005) recently noted that causal inference can be conducted by making only two assumptions: a probabilistic model about the treatment assignment mechanism and SUTVA. By providing a framework where the latter assumption is not completely necessary, our work here will hopefully contribute to a foundation for causal inference in the presence of interference upon which others can build.

Acknowledgments

MGH and MEH were supported in part by NIH grant 2 R01 AI32042. The authors thank Peter Gilbert and Thomas Richardson for helpful comments.

APPENDIX: PROOFS OF THEOREMS 1-9

A.1 Proof of Theorem 1

Without loss of generality, let z = 1. Under Assumption 1, K_i is fixed, so that

E {{\hat{Y}}_{i} (1; ψ) ∣ S_{i} = 1} = \frac{1}{K_{i}} \sum_{j = 1}^{n_{i}} \sum_{z \in R_{K_{i}}^{n_{i}}} \Pr_{ψ} (Z_{i} = z) Y_{i j} (z) I [z_{i j} = 1] .

Now any z such that z_ij = 0 does not contribute to the summation, so that we can equivalently write

\begin{matrix} E {{\hat{Y}}_{i} (1; ψ) ∣ S_{i} = 1} & = \frac{1}{K_{i}} \sum_{j = 1}^{n_{i}} \sum_{ω \in R_{K_{i} - 1}^{n_{i} - 1}} \Pr_{ψ} (Z_{i (j)} = ω, Z_{i j} = 1) \times Y_{i j} (z_{i (j)} = ω, z_{i j} = 1) \\ = \frac{1}{K_{i}} \sum_{j = 1}^{n_{i}} \sum_{ω \in R_{K_{i} - 1}^{n_{i} - 1}} \Pr_{ψ} (Z_{i (j)} = ω ∣ Z_{i j} = 1) \times \Pr_{ψ} (Z_{i j} = 1) Y_{i j} (z_{i (j)} = ω, z_{i j} = 1) . \end{matrix}

Under Assumption 1, Pr_ψ (Z_ij = 1) = K_i/n_i, implying

\begin{matrix} E {{\hat{Y}}_{i} (1; ψ) ∣ S_{i} = 1} & = \frac{1}{n_{i}} \sum_{j = 1}^{n_{i}} \sum_{ω \in R_{K_{i} - 1}^{n_{i} - 1}} \Pr_{ψ} (Z_{i (j)} = ω ∣ Z_{i j} = 1) \times Y_{i j} (z_{i (j)} = ω, z_{i j} = 1) \\ = {\overset{‒}{Y}}_{i} (1; ψ) . \end{matrix}

A.2 Proof of Theorem 2

Without loss of generality, let z = 1. Using the fact that $E {\hat{Y} (1; ψ)} = E [E {\hat{Y} (1; ψ) ∣ S}]$ , from Theorem 1 it follows that $E {\hat{Y} (1; ψ)} = E {\sum_{i = 1}^{N} {\overset{‒}{Y}}_{i} (1; ψ) S_{i} ∕ C} = \overset{‒}{Y} (1; ψ)$ .

A.3 Proof of Theorem 3

The conditional expectation result follows from

\begin{matrix} E {{\hat{Y}}_{i} (ψ) ∣ S_{i} = 1} & = \sum_{j = 1}^{n_{i}} \sum_{z \in R^{n_{i}}} Y_{i j} (z) \Pr_{ψ} (Z_{i} = z) ∕ n_{i} \\ = \sum_{j = 1}^{n_{i}} {\overset{‒}{Y}}_{i j} (ψ) ∕ n_{i} \\ = {\overset{‒}{Y}}_{i} (ψ) . \end{matrix}

The remainder of the proof parallels that of Theorem 2.

A.4 Proof of Theorem 4

The proof follows directly from known properties of estimators of population means using simple random sampling (SRS) and two-stage cluster sampling (see, e.g., Kish 1965, chap. 2; Splawa-Neyman 1990; Thompson 1992, chaps. 2 and 3). For example, given S_i = 1 and Assumptions 1 and 2, ${\hat{Y}}_{i} (1; ψ)$ can be viewed as the sample mean from a simple random sample drawn without replacement from {Y_i1(1; ψ), . . . , Y_{in_i} (1; ψ)}. Thus,

Var ({\hat{Y}}_{i} (1; ψ) ∣ S_{i} = 1) = (1 - \frac{K_{i}}{n_{i}}) \frac{σ_{i 1}^{2} (ψ)}{K_{i}},

(A.1)

where $σ_{i 1}^{2} (ψ) \equiv \sum_{j = 1}^{n_{i}} {Y_{i j} (1; ψ) - {\overset{‒}{Y}}_{i} (1; ψ)}^{2} ∕ (n_{i} - 1)$ is the within-group variance. It is also well known that $E ({\hat{σ}}_{i 1}^{2} (ψ) ∣ S_{i} = 1) = σ_{i 1}^{2} (ψ)$ , implying (7) holds.

Similarly, that $E {\hat{Var} (\hat{Y} (z; ψ))} = Var (\hat{Y} (z; ψ))$ for z = 0, 1 follows from known results on two-stage cluster sampling. A sketch of a proof of this follows. First, one can show

Var {\hat{Y} (1; ψ)} = (1 - \frac{C}{N}) \frac{σ_{g 1}^{2} (ψ)}{C} + \frac{1}{C N} \sum_{i = 1}^{N} (1 - \frac{K_{i}}{n_{i}}) \frac{σ_{i 1}^{2} (ψ)}{K_{i}},

(A.2)

where $σ_{g 1}^{2} (ψ) \equiv \sum_{i = 1}^{N} {{\overset{‒}{Y}}_{i} (1; ψ) - \overset{‒}{Y} (1; ψ)}^{2} ∕ (N - 1)$ . Next, note E(S_i) = C/N such that

E {\hat{Var} (\hat{Y} (1; ψ))} = (1 - \frac{C}{N}) \frac{E {{\hat{σ}}_{g 1}^{2} (ψ)}}{C} + \frac{1}{N^{2}} \sum_{i = 1}^{N} (1 - \frac{K_{i}}{n_{i}}) \frac{σ_{i 1}^{2} (ψ)}{K_{i}} .

(A.3)

So the remaining task at hand becomes finding $E {{\hat{σ}}_{g 1}^{2} (ψ)}$ , which can be shown to equal

\frac{1}{N} \sum_{i = 1}^{N} (1 - \frac{K_{i}}{n_{i}}) \frac{σ_{i 1}^{2} (ψ)}{K_{i}} + σ_{g 1}^{2} (ψ) .

Substituting this into (A.3) implies $E {\hat{Var} (\hat{Y} (1; ψ))}$ equals

(1 - \frac{C}{N}) \frac{σ_{g 1}^{2} (ψ)}{C} + {(1 - \frac{C}{N}) \frac{1}{C} \frac{1}{N} + \frac{1}{N^{2}}} \sum_{i = 1}^{N} (1 - \frac{K_{i}}{n_{i}}) \frac{σ_{i 1}^{2} (ψ)}{K_{i}},

which simplifies to (A.2).
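
The simplification rests on the coefficient of the within-group sum, which can be written out as

(1 - \frac{C}{N}) \frac{1}{C} \frac{1}{N} + \frac{1}{N^{2}} = \frac{N - C}{C N^{2}} + \frac{C}{C N^{2}} = \frac{N}{C N^{2}} = \frac{1}{C N},

which is the coefficient appearing in (A.2).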

A.5 Proof of Theorem 5

The proof follows from Splawa-Neyman (1990) and Rubin (1990); a sketch is given here. First, we derive $Var ({\hat{C E}}_{i}^{D} (ψ) ∣ S_{i} = 1)$ , which, of course, equals $Var {{\hat{Y}}_{i} (0; ψ) ∣ S_{i} = 1} + Var {{\hat{Y}}_{i} (1; ψ) ∣ S_{i} = 1} - 2 Cov {{\hat{Y}}_{i} (0; ψ), {\hat{Y}}_{i} (1; ψ) ∣ S_{i} = 1}$ . We know the form of $Var {{\hat{Y}}_{i} (z; ψ) ∣ S_{i} = 1}$ for z = 0, 1 from the proof of Theorem 4. Additionally, one can show $Cov ({\hat{Y}}_{i} (0; ψ), {\hat{Y}}_{i} (1; ψ) ∣ S_{i} = 1) = {σ_{i (0 - 1)}^{2} (ψ) - σ_{i 0}^{2} (ψ) - σ_{i 1}^{2} (ψ)} ∕ (2 n_{i})$ . Therefore,

Var ({\hat{C E}}_{i}^{D} (ψ) ∣ S_{i} = 1) = \frac{K_{i}}{n_{i}} \frac{σ_{i 0}^{2} (ψ)}{n_{i} - K_{i}} + (1 - \frac{K_{i}}{n_{i}}) \frac{σ_{i 1}^{2} (ψ)}{K_{i}} - \frac{1}{n_{i}} {σ_{i (0 - 1)}^{2} (ψ) - σ_{i 0}^{2} (ψ) - σ_{i 1}^{2} (ψ)},

which simplifies to

Var ({\hat{C E}}_{i}^{D} (ψ) ∣ S_{i} = 1) = \frac{σ_{i 0}^{2} (ψ)}{n_{i} - K_{i}} + \frac{σ_{i 1}^{2} (ψ)}{K_{i}} - \frac{σ_{i (0 - 1)}^{2} (ψ)}{n_{i}} .
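
For completeness, the simplification collects the coefficients of $σ_{i 0}^{2} (ψ)$ and $σ_{i 1}^{2} (ψ)$ as follows:

\frac{K_{i}}{n_{i} (n_{i} - K_{i})} + \frac{1}{n_{i}} = \frac{K_{i} + (n_{i} - K_{i})}{n_{i} (n_{i} - K_{i})} = \frac{1}{n_{i} - K_{i}}, \qquad (1 - \frac{K_{i}}{n_{i}}) \frac{1}{K_{i}} + \frac{1}{n_{i}} = \frac{(n_{i} - K_{i}) + K_{i}}{n_{i} K_{i}} = \frac{1}{K_{i}} .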

The proof is then completed by noting that

E {\hat{Var} ({\hat{C E}}_{i}^{D} (ψ) ∣ S_{i} = 1) ∣ S_{i} = 1} = \frac{σ_{i 0}^{2} (ψ)}{n_{i} - K_{i}} + \frac{σ_{i 1}^{2} (ψ)}{K_{i}} .

A.6 Proof of Theorem 6

The proof follows along similar lines as the derivation of $E {\hat{Var} (\hat{Y} (1; ψ))}$ in the proof of Theorem 4. In particular, one can first show

Var ({\hat{C E}}^{D} (ψ)) = (1 - \frac{C}{N}) \frac{σ_{D}^{2} (ψ)}{C} + \frac{1}{C N} \sum_{i = 1}^{N} Var ({\hat{C E}}_{i}^{D} (ψ) ∣ S_{i} = 1),

(A.4)

where $σ_{D}^{2} (ψ) \equiv \sum_{i = 1}^{N} {{\overset{‒}{C E}}_{i}^{D} (ψ) - {\overset{‒}{C E}}^{D} (ψ)}^{2} ∕ (N - 1)$ . Next, similar to $E {{\hat{σ}}_{g 1}^{2} (ψ)}$ in Theorem 4, one can show $E {{\hat{σ}}_{D}^{2} (ψ)} = \sum_{i = 1}^{N} Var ({\hat{C E}}_{i}^{D} (ψ) ∣ S_{i} = 1) ∕ N + σ_{D}^{2} (ψ)$ . Taking the expected value of (10), we have

E [\hat{Var} {{\hat{C E}}^{D} (ψ)}] = (1 - \frac{C}{N}) \frac{σ_{D}^{2} (ψ)}{C} + (1 - \frac{C}{N}) \frac{1}{C} \frac{1}{N} \sum_{i = 1}^{N} Var ({\hat{C E}}_{i}^{D} (ψ) ∣ S_{i} = 1) + \frac{1}{C N} \sum_{i = 1}^{N} E {\hat{Var} ({\hat{C E}}_{i}^{D} (ψ) ∣ S_{i} = 1) S_{i}} .

Combining this result with Theorem 5 proves the theorem.

A.7 Proof of Theorem 7

The proof follows along the same lines as the proof of Theorem 5. Namely, one can show $Cov (\hat{Y} (0; ϕ), \hat{Y} (0; ψ)) = {σ_{g (0 - 0)}^{2} (ϕ, ψ) - σ_{g 0}^{2} (ϕ) - σ_{g 0}^{2} (ψ)} ∕ (2 N)$ , where $σ_{g 0}^{2} (ϕ)$ and $σ_{g 0}^{2} (ψ)$ are defined analogously to $σ_{g 1}^{2} (ψ)$ in Theorem 4, implying $Var ({\hat{C E}}^{I} (ϕ, ψ))$ equals

Var (\hat{Y} (0; ϕ)) + Var (\hat{Y} (0; ψ)) + \frac{1}{N} {σ_{g 0}^{2} (ϕ) + σ_{g 0}^{2} (ψ) - σ_{g (0 - 0)}^{2} (ϕ, ψ)} .

(A.5)

From the proof of Theorem 4, we have

\begin{matrix} E {\frac{{\hat{σ}}_{g 0}^{2} (ψ)}{C}} & = \frac{σ_{g 0}^{2} (ψ)}{C} + \frac{1}{C N} \sum_{i = 1}^{N} Var ({\hat{Y}}_{i} (0; ψ) ∣ S_{i} = 1) \\ = Var {\hat{Y} (0; ψ)} + \frac{σ_{g 0}^{2} (ψ)}{N}, \end{matrix}

and similarly $E {{\hat{σ}}_{g 0}^{2} (ϕ) ∕ (N - C)} = Var {\hat{Y} (0; ϕ)} + σ_{g 0}^{2} (ϕ) ∕ N$ , which together with (A.5) prove the theorem.
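
Spelled out, the final step adds the two expectations and substitutes (A.5):

\begin{matrix} E {\hat{Var} ({\hat{C E}}^{I} (ϕ, ψ))} & = Var {\hat{Y} (0; ϕ)} + Var {\hat{Y} (0; ψ)} + \frac{1}{N} \{σ_{g 0}^{2} (ϕ) + σ_{g 0}^{2} (ψ)\} \\ & = Var ({\hat{C E}}^{I} (ϕ, ψ)) + \frac{1}{N} σ_{g (0 - 0)}^{2} (ϕ, ψ), \end{matrix}

where the second line uses (A.5) to replace the sum of the two variances by $Var ({\hat{C E}}^{I} (ϕ, ψ)) - \{σ_{g 0}^{2} (ϕ) + σ_{g 0}^{2} (ψ) - σ_{g (0 - 0)}^{2} (ϕ, ψ)\} ∕ N$ .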

A.8 Proof of Theorem 8

The proof is analogous to the proof of Theorem 7.

A.9 Proof of Theorem 9

As in the proof of Theorem 4, one can show

Var (\hat{Y} (ψ)) = (1 - \frac{C}{N}) \frac{σ_{M}^{2} (ψ)}{C} + \frac{1}{C N} \sum_{i = 1}^{N} Var ({\hat{Y}}_{i} (ψ) ∣ S_{i} = 1),

where $σ_{M}^{2} (ψ) \equiv \sum_{i = 1}^{N} {{\overset{‒}{Y}}_{i} (ψ) - \overset{‒}{Y} (ψ)}^{2} ∕ (N - 1)$ . Following the same lines as the proofs of Theorems 5 and 7, one can also show $Cov {\hat{Y} (ϕ), \hat{Y} (ψ)} = {σ_{M}^{2} (ϕ, ψ) - σ_{M}^{2} (ϕ) - σ_{M}^{2} (ψ)} ∕ (2 N)$ , where $σ_{M}^{2} (ϕ)$ is defined analogously to $σ_{M}^{2} (ψ)$ , implying

Var {{\hat{C E}}^{O} (ϕ, ψ)} = Var {\hat{Y} (ϕ)} + Var {\hat{Y} (ψ)} + \frac{1}{N} {σ_{M}^{2} (ϕ) + σ_{M}^{2} (ψ) - σ_{M}^{2} (ϕ, ψ)} .

Additionally, one can show $E {{\hat{σ}}_{M}^{2} (ψ)} = σ_{M}^{2} (ψ) + \sum_{i = 1}^{N} Var {{\hat{Y}}_{i} (ψ) ∣ S_{i} = 1} ∕ N$ , which implies $E {{\hat{σ}}_{M}^{2} (ψ) ∕ C} = Var {\hat{Y} (ψ)} + σ_{M}^{2} (ψ) ∕ N$ . Using an analogous result for ${\hat{σ}}_{M}^{2} (ϕ)$ , the theorem follows.

REFERENCES

Ali M, Emch M, von Seidlein L, Yunus M, Sack DA, Rao M, Holmgren J, Clemens JD. Herd Immunity Conferred by Killed Oral Cholera Vaccines in Bangladesh: A Reanalysis. Lancet. 2005;366:44–49. doi: 10.1016/S0140-6736(05)66550-6.
Angrist JD, Imbens GW, Rubin DB. Identification of Causal Effects Using Instrumental Variables. Journal of the American Statistical Association. 1996;91:444–455.
Bailey RA, Kunert J. On Optimal Crossover Designs When Carryover Effects Are Proportional to Direct Effects. Biometrika. 2006;93:613–625.
Borm GF, Melis RJF, Teerenstra S, Peer PG. Pseudo Cluster Randomization: A Treatment Allocation Method to Minimize Contamination and Selection Bias. Statistics in Medicine. 2005;24:3535–3547. doi: 10.1002/sim.2200.
Cox DR. Planning of Experiments. Wiley; New York: 1958.
Friedman LM, Furberg C, DeMets DL. Fundamentals of Clinical Trials. Springer-Verlag; New York: 1998.
Halloran ME, Struchiner CJ. Study Designs for Dependent Happenings. Epidemiology. 1991;2:331–338. doi: 10.1097/00001648-199109000-00004.
Halloran ME, Struchiner CJ. Causal Inference in Infectious Diseases. Epidemiology. 1995;6:142–151. doi: 10.1097/00001648-199503000-00010.
Halloran ME, Longini IM, Cowart DM, Nizam A. Community Interventions and the Epidemic Prevention Potential. Vaccine. 2002;20:3254–3262. doi: 10.1016/s0264-410x(02)00316-x.
Hayes RJ, Alexander NDE, Bennett S, Cousens SN. Design and Analysis Issues in Cluster-Randomized Trials of Interventions Against Infectious Diseases. Statistical Methods in Medical Research. 2000;9:95–116. doi: 10.1177/096228020000900203.
Kempton RA. Interference Between Plots. In: Kempton RA, Fox PN, editors. Statistical Methods for Plant Variety Evaluation. Chapman & Hall; London: 1997. pp. 101–116.
King JC, Stoddard JJ, Gaglani MB, Moore KA, Magder L, McClure E, Rubin JD, Englund JA, Neuzil K. Effectiveness of School-Based Influenza Vaccination. New England Journal of Medicine. 2006;355:2523–2532. doi: 10.1056/NEJMoa055414.
Kish L. Survey Sampling. Wiley; New York: 1965.
Longini IM, Halloran ME, Nizam A. Model-Based Estimation of Vaccine Effects From Community Vaccine Trials. Statistics in Medicine. 2002;21:481–495. doi: 10.1002/sim.994.
Melis RJF, van Eijken MIJ, Borm GF, Wensing M, Adang E, van de Lisdonk EH, van Achterberg T, Olde Rikkert MGM. The Design of the Dutch EASYcare Study: A Randomised Controlled Trial on the Effectiveness of a Problem-Based Community Intervention Model for Frail Elderly People. BMC Health Services Research. 2005;5:65. doi: 10.1186/1472-6963-5-65.
Moulton LH, O’Brien KL, Kohberger R, Chang I, Reid R, Weatherholtz R, Hackell JG, Siber GR, Santosham M. Design of a Group-Randomized Streptococcus pneumoniae Vaccine Trial. Controlled Clinical Trials. 2001;22:438–452. doi: 10.1016/s0197-2456(01)00132-5.
Moulton LH, O’Brien KL, Reid R, Weatherholtz R, Santosham M, Siber GR. Evaluation of the Indirect Effects of a Pneumococcal Vaccine in a Community-Randomized Study. Journal of Biopharmaceutical Statistics. 2006;16:453–462. doi: 10.1080/10543400600719343.
Murray DM. Design and Analysis of Group-Randomized Trials. Oxford University Press; New York: 1998.
Robins J, Greenland S. The Probability of Causation Under a Stochastic Model for Individual Risk. Biometrics. 1989;45:1125–1138.
Rosenbaum PR. Interference Between Units in Randomized Experiments. Journal of the American Statistical Association. 2007;102:191–200. doi: 10.1080/01621459.2012.655954.
Ross R. An Application of the Theory of Probabilities to the Study of a priori Pathometry. I. Proceedings of the Royal Society, Ser. A. 1916;92:204–230.
Rubin DB. Bayesian Inference for Causal Effects: The Role of Randomization. The Annals of Statistics. 1978;6:34–58.
Rubin DB. Discussion of “Randomization Analysis of Experimental Data in the Fisher Randomization Test,” by D. Basu. Journal of the American Statistical Association. 1980;75:591–593.
Rubin DB. Comment: Neyman (1923) and Causal Inference in Experiments and Observational Studies. Statistical Science. 1990;5:472–480.
Rubin DB. Causal Inference Using Potential Outcomes: Design, Modeling, Decisions. Journal of the American Statistical Association. 2005;100:322–331.
Senn S. Cross-Over Trials in Clinical Research. Wiley; New York: 1993.
Sobel M. What Do Randomized Studies of Housing Mobility Demonstrate? Causal Inference in the Face of Interference. Journal of the American Statistical Association. 2006;101:1398–1407.
Som RK. A Manual of Sampling Techniques. Heinemann; London: 1973.
Splawa-Neyman J, Dabrowska DM, Speed TP, editors. On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9. Statistical Science. 1990;5:465–472.
Struchiner CJ, Halloran ME, Robins JM, Spielman A. The Behavior of Common Measures of Association Used to Assess a Vaccination Program Under Complex Disease Transmission Patterns—A Computer Simulation Study of Malaria Vaccines. International Journal of Epidemiology. 1990;19:187–196. doi: 10.1093/ije/19.1.187.
Thompson SK. Sampling. Wiley; New York: 1992.

