Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2008 Dec 10.
Published in final edited form as: J Am Stat Assoc. 2008 Jun;103(482):832–842. doi: 10.1198/016214508000000292

Toward Causal Inference With Interference

Michael G Hudgens 1, M Elizabeth Halloran 1
PMCID: PMC2600548  NIHMSID: NIHMS73860  PMID: 19081744

Abstract

A fundamental assumption usually made in causal inference is that of no interference between individuals (or units); that is, the potential outcomes of one individual are assumed to be unaffected by the treatment assignment of other individuals. However, in many settings, this assumption obviously does not hold. For example, in the dependent happenings of infectious diseases, whether one person becomes infected depends on who else in the population is vaccinated. In this article, we consider a population of groups of individuals where interference is possible between individuals within the same group. We propose estimands for direct, indirect, total, and overall causal effects of treatment strategies in this setting. Relations among the estimands are established; for example, the total causal effect is shown to equal the sum of direct and indirect causal effects. Using an experimental design with a two-stage randomization procedure (first at the group level, then at the individual level within groups), unbiased estimators of the proposed estimands are presented. Variances of the estimators are also developed. The methodology is illustrated in two different settings where interference is likely: assessing causal effects of housing vouchers and of vaccines.

Keywords: Group-randomized trials, Potential outcomes, Stable unit treatment value assumption, SUTVA, Vaccine

1. INTRODUCTION

1.1 Background and Outline

A fundamental assumption usually made in the potential outcomes approach to causal inference is that of no interference between individuals (Cox 1958), a critical component of the stable unit treatment value assumption (SUTVA) (Rubin 1980). Under the no-interference assumption, the potential outcomes of any individual are assumed to be unaffected by the treatment assignment of every other individual. However, in many settings, this assumption obviously does not hold. A classical example is given by the dependent happenings of infectious diseases (Ross 1916, p. 211), where whether one person becomes infected depends on who else in the population is vaccinated. In econometrics, a household’s decision whether to move may be affected by whether their neighbors receive a housing voucher to move (Sobel 2006). In education, interventions given to certain students may affect other students in the same class (Rubin 1990; Rosenbaum 2007). Sobel (2006) and Rosenbaum (2007) gave several other examples where interference is likely. In some settings, interference is a nuisance while in other settings it creates effects of interest. An example of the former includes agricultural experiments, where fallow rows between treatment plots can sometimes eliminate interference between plots. An example of the latter includes vaccinating against infectious diseases, where interference is an inherent result of the biology of transmission and is intrinsically of interest.

The assumption of no interference between individuals is often made without critical examination. Models not requiring this assumption have been considered in the context of plant variety evaluation (Kempton 1997) and cross-over trials (Senn 1993; Bailey and Kunert 2006). However, these methods typically assume a specific interference structure that is local in either space or time. Without making any such assumptions about the nature of interference, Struchiner, Halloran, Robins, and Spielman (1990) and Halloran and Struchiner (1991) conceptually defined several different types of causal effects of interventions that are possible in the presence of interference, namely, direct, indirect, total, and overall effects. To estimate the latter three effects, they noted one needs a population of groups as in group-randomized studies (Murray 1998). Several vaccination studies have been conducted or analyzed with the intent to estimate certain of these effects (Moulton et al. 2001; Longini, Halloran, and Nizam 2002; Ali et al. 2005; King et al. 2006).

Halloran and Struchiner (1995) delineated many of the complications of using potential outcomes to define causal estimands for the different types of effects possible in the presence of interference. They used Rubin’s (1978, 1990) suggestion for a general notation in the presence of interference to define individual direct, indirect, total, and overall effects by letting the potential outcomes for any individual depend on the vector of treatment assignments to the other individuals in the population. However, they found this approach impracticable because the number of possible potential outcomes becomes unwieldy for any reasonably sized population. More recently, Sobel (2006) proposed causal estimands for assessing housing voucher effects defined by averaging causal effects over all possible treatment assignments for a particular voucher allocation strategy compared to a benchmark allocation wherein all households receive no voucher. Rosenbaum (2007) developed nonparametric tests and confidence intervals for assessing treatment effect in the presence of interference.

In this article, we consider a population of groups of individuals where interference is possible between individuals within the same group. We propose causal estimands for direct, indirect, total, and overall causal effects of treatment assignment strategies based on Sobel’s approach of averaging over all possible treatment assignments (Sec. 3). Relations among the estimands are established and inference concerning the estimands is considered (Sec. 4). Using an experimental design with a two-stage randomization procedure (the first at the group level, the second at the individual level within groups), unbiased estimators of the proposed estimands are presented. Estimating the variance of the estimators is also considered. The methodology is illustrated in two different settings where interference is likely: assessing causal effects of housing vouchers and of vaccines (Sec. 5). Proofs are given in the Appendix. We begin with an example to motivate the development of the rest of the article.

1.2 Motivating Example

In this section, we consider data from an individually randomized, placebo-controlled trial of killed oral cholera vaccines to illustrate the direct, indirect, total, and overall effects as defined by Halloran and Struchiner (1991). Table 1 presents data from a reanalysis of this trial where the interest was in determining whether the level of vaccine coverage in a residential area, called a bari, was related to the incidence of cholera in individual vaccine recipients or placebo recipients residing in the bari (Ali et al. 2005). The target population was divided into groups by level of vaccine coverage. For illustration, we consider the groups with more than 50% and less than 28% coverage, which we denote as groups A and B.

Table 1.

Risk of cholera in recipients of killed oral cholera vaccines or placebo, by level of coverage of the bari during one year of follow-up, based on data from Ali et al. (2005)

Level of vaccine coverage Vaccine recipients
Placebo recipients
Target population Total Cases Risk per 1,000 population Total Cases Risk per 1,000 population
>50% 22,394 12,541 16 1.27 6,082 9 1.47
41-50% 24,159 11,513 26 2.26 5,801 27 4.65
36-40% 24,583 10,772 17 1.58 5,503 26 4.72
28-35% 25,059 8,883 22 2.48 4,429 26 5.87
<28% 24,954 5,627 15 2.66 2,852 20 7.01

The effects of vaccination can be estimated based on differences in the incidence of cholera during the first year of follow-up of the trial. The direct effects are estimated by comparing the incidence (risk per 1,000 population) between vaccinated individuals and unvaccinated individuals within each group. For example, the estimated direct effect in group B is 7.01-2.66 = 4.35, suggesting vaccination results in 4.35 fewer cases of cholera per 1,000 individuals per year. The estimated direct effect in group A is 1.47-1.27 = .20, considerably lower than in group B. The difference in the two estimates illustrates one of the challenges in making comparisons directly within groups when interference is present. If an analysis were limited to group A only, the evidence would suggest that the vaccine has little effect.

The indirect effects of vaccination are those effects due to the level of coverage. They can be estimated by comparing the outcomes in the unvaccinated in the two groups or the outcomes in the vaccinated in the two groups. For instance, the estimated indirect effect in the unvaccinated is 7.01 - 1.47 = 5.54. Note this estimate is greater than the estimated direct effect in either of the groups, highlighting the importance of looking beyond direct effects in the presence of interference. Based on similar analyses, Ali et al. concluded that the vaccines provide significant indirect protection to nonvaccinated individuals.

Total and overall effects provide summary measures that combine direct and indirect effects. The total effect of vaccination is the effect of being vaccinated in the group with higher coverage (A) compared to not being vaccinated in the group with lower coverage (B). The estimated total effect (B - A) is 7.01-1.27 = 5.74. Note the total effect (B - A) estimate equals the direct effect estimate in group A plus the indirect effect estimate in the unvaccinated (B - A). The overall effect is the average effect of being in the group with higher coverage compared to being in the group with lower coverage. The overall effect can be estimated by the difference in incidence between the two groups, that is, 35/8,479 - 25/18,623 = 2.79/1,000.

2. PRELIMINARIES

2.1 Potential Outcomes

Suppose there are N > 1 groups of individuals [or blocks of units using Rosenbaum’s (2007) terminology]. For i = 1, . . . , N, let ni denote the number of individuals in group i and let Zi ≡ (Zi1, . . . , Zini) denote the treatments those ni individuals receive. We assume throughout that assignment of an individual to a particular treatment is equivalent to receipt of that treatment that is, there is perfect compliance. Assume Zij is a dichotomous random variable having values 0 or 1 such that Zi can take on 2ni possible values. Let Zi(j) denote the ni - 1 subvector of Zi with the j th entry deleted. The vector Zi will be referred to as an intervention or treatment program, to distinguish it from the individual treatment Zij. Let zi and zij denote possible values of Zi and Zi and Zij. Define Rj to be the set of vectors of possible treatment programs of length j for j = 1, 2, . . . ; for example, R2 ≡ {(0, 0), (0, 1), (1, 0), (1, 1)}. Possible values zi of Zi are elements of Rni. For positive integer n and k ∈ {0, . . . , n}, define Rkn to be the subset of Rn wherein exactly k individuals receive treatment 1; for example, j=1nizij=k for all ziRkni.

Denote the potential outcome of individual j in group i under treatment zi as Yij (zi). Following the usual approach to causal inference (see, e.g., Rosenbaum 2007), we assume the Yij (zi) potential responses are fixed because they do not depend on the realized random assignment of treatments Zi, whereas the observed responses Yij (Zi) do depend on Zi and, thus, are random variables. The notation Yij (zi) allows for the possibility that the potential outcome for individual j may depend on another individual’s treatment assignment in group i; that is, there may be interference between individuals within a group. Implicit in this notation is the assumption that the potential outcomes for individuals in group i do not depend on treatment assignments of individuals in group i’ for i’ ≠ i. In other words, we assume no interference between individuals in different groups but allow for interference between individuals within the same group (Halloran and Struchiner 1991, 1995). This will be a reasonable assumption provided the groups are sufficiently separate (e.g., in space or time). Sobel (2006) called this a partial interference assumption. In the literature of group-randomized studies, violation of no interference across groups is called contamination.

2.2 Treatment Assignment Mechanisms

Let ψ and ϕ denote parameterizations that govern the distribution of Zi for i = 1, . . . , N. For example, ψ might correspond to randomly assigning half of individuals in a group to treatment 1 and the other half to treatment 0, while ϕ might correspond to assigning all individuals in a group to treatment 0. We refer to ψ and ϕ as individual treatment assignment strategies. Our goal is to assess the causal effects of assigning groups to ψ compared to ϕ.

As is typical of causal inference articles, we use randomization inference whereby the randomization distribution induced by the experimental design forms the basis for statistical inference. For the experimental design, we consider a two-stage randomization procedure. In the first stage, each of the N groups is randomly assigned to either ϕ or ψ. In the second stage, individuals are randomly assigned treatment conditional on their group’s assignment in the first stage. For example, in the first stage, half of the N groups might be assigned to an allocation strategy ϕ and the other half ψ; in the second stage, two-thirds of the individuals within a group are randomly assigned treatment 1 for groups assigned ϕ, while one-third of the individuals within a group are randomly assigned treatment 1 for groups assigned ψ. Such a design has been referred to as splitplot (Hayes, Alexander, Bennett, and Cousens 2000) or pseudo-cluster (Borm, Melis, Teerenstra, and Peer 2005) randomization and has been proposed for evaluation of intervention programs in the elderly (Melis et al. 2005) and vaccine efficacy (see Sec. 5.2). This design can be employed to answer questions such as: How many infections will be averted by vaccinating two-thirds of the population compared to only vaccinating one-third of the population? What proportion of households will move if two-thirds receive vouchers compared to only one-third receiving vouchers?

Corresponding to the first stage of randomization, let S ≡ (S1, . . . , SN) denote the group assignments with Si = 1 if the ith group is assigned to ψ and 0 otherwise. Let ν denote the parameterization that governs the distribution of S and let C ≡ Σi Si denote the number of groups assigned ψ. Define ν to be a mixed (Sobel 2006) or permutation (Friedman, Furberg, and DeMets 1998) group assignment strategy if 0 < C < N and Prν (S = s) = C!(N - C)!/N! if sRCN, 0 otherwise. In other words, under a mixed group assignment strategy, a fixed number C of N groups are assigned ψ, with each of the (NC) possible group assignments receiving equal probability. Similarly, corresponding to the second stage of randomization, let Ki ≡ Σj Zij and define ϕ and ψ to be mixed individual group assignment strategies if Ki is fixed given Si, with 0 < Ki < ni and each of the (niKi) possible individual treatment assignments receiving equal probability.

3. CAUSAL ESTIMANDS

3.1 Average Potential Outcomes

A fundamental problem in causal inference is that, in general, it is not possible to observe more than one potential outcome for an individual. Faced with this problem, causal estimands are typically defined in terms of averages of potential outcomes that are identifiable from observable random variables. Following this approach, we begin by writing the potential outcomes for individual j in group i under zij = z as

Yij(zi(j),zij=z) (1)

for z = 0, 1. Because (1) depends on zi(j), define the individual average potential outcome under treatment assignment z by

Yij(z;ψ)ωRni1Yij(zi(j)=ω,zij=z)×Prψ(Zi(j)=ωZij=z).

In other words, the individual average potential outcome is the conditional expectation of Yij (Zi) given Zij = z under assignment strategy ψ. Averaging over individuals, define the group average potential outcome under treatment assignment z as Yi(z;ψ)j=1niYij(z;ψ)ni. Finally, averaging over groups, define the population average potential outcome under treatment assignment z as Y(z;ψ)i=1NYi(z;ψ)N.

The average potential outcomes discussed previously are defined as functions of both the group assignment ψ (or ϕ) and the individual treatment assignment z. We can also define average potential outcomes solely as a function of ψ. For example, define the marginal individual average potential outcome by Yij(ψ)zRniYij(z)Prψ(Zi=z), that is, the average potential outcome for individual j in group i when group i is assigned ψ. Similarly, define the marginal group and population average potential outcomes by Yi(ψ)j=1niYij(ψ)ni and Y(ψ)i=1NYi(ψ)N.

In the following sections, causal estimands are defined in terms of these various average potential outcomes.

3.2 Direct Causal Effects

Halloran and Struchiner (1991) defined the direct effect of a treatment on an individual as the difference between the potential outcome for that individual given treatment compared to the potential outcome for that individual without treatment, all other things being equal. Formally, following Halloran and Struchiner (1995), we define the individual direct causal effect of treatment 0 compared to treatment 1 for individual j in group i by

CEijD(zi(j))Yij(zi(j),zij=0)Yij(zi(j),zij=1). (2)

Next, define the individual average direct causal effect for individual j in group i by

CEijD(ψ)Yij(0;ψ)Yij(1;ψ), (3)

that is, the difference in individual average potential outcomes when zij = 0 and when zij = 1 under ψ. Using Rubin’s (2005) terminology, (3) is a marginal causal effect in that a comparison is being made between expected values of the marginal distributions of Yij (Zi(j), Zij = 0) and of Yij (Zi(j), Zij = 1). Finally, define the group average direct causal effect by CEiD(ψ)Yi(0;ψ)Yi(1;ψ)=j=1niCEijD(ψ)ni and the population average direct causal effect by CED(ψ)Y(0;ψ)Y(1;ψ)=i=1NCEiD(ψ)N.

3.3 Indirect Causal Effects

In contrast to direct effects, an indirect effect describes the effect on an individual of the treatment received by others in the group. In particular, Halloran and Struchiner (1991) defined the indirect effect of a treatment on an individual as the difference between the potential outcomes for that individual without treatment when the group (i) receives an intervention program and (ii) receives the benchmark program of no intervention. Similar to Halloran and Struchiner (1995), we define the individual indirect causal effect of treatment program zi compared with zi on individual j in group i by

CEijI(zi(j),zi(j))Yi(zi(j),zij=0)Yi(zi(j),zij=0), (4)

where zi is another ni-dimensional vector of individual treatment assignments. (Note zi does not denote the transpose of zi.)

Remark

Definition (4) does not restrict either zi or zi to be the benchmark program of no intervention; that is, individual indirect causal effects may exist between two different intervention programs. The same is true for the definitions of individual total and overall causal effects.

Remark

The individual indirect causal effect could be defined analogously for individuals with zij=zij=1 ; that is, individuals under either treatment may experience indirect effects. This yields two individual indirect causal effects, which need not be equal. For simplicity, only indirect effects based on (4) are considered in the rest of this article.

Similar to direct effects, define the individual average indirect causal effect by CEijI(ϕ,ψ)Yij(0;ϕ)Yij(0;ψ). Clearly, if ψ = ϕ, then CEijI(ϕ,ψ)=0; that is, there are no individual average indirect causal effects. Finally, define the group average indirect causal effect as CEiI(ϕ,ψ)Yi(0;ϕ)Yi(0;ψ)=j=1niCEijI(ϕ,ψ)ni and the population average indirect causal effect as CEI(ϕ,ψ)Y(0;ϕ)Y(0;ψ)=i=1NCEiI(ϕ,ψ)N.

3.4 Total Causal Effects

Total effects describe both the direct and the indirect effects of a particular treatment assignment on an individual. Halloran and Struchiner (1991) defined the total effect of a treatment on an individual as the difference between the potential outcomes for that individual (i) with treatment when the group receives an intervention program and (ii) without treatment when the group receives no intervention. Following Halloran and Struchiner (1995), we define the individual total causal effects for individual j in group i as

CEijT(zi(j),zi(j))Yij(zi(j),zij=0)Yij(zi(j),zi(j)=1). (5)

Define the individual average total causal effect by CEijT(ϕ,ψ)Yij(0;ϕ)Yij(1;ψ), the group average total causal effect by CEiT(ϕ,ψ)Yi(0;ϕ)Yi(1;ψ)=j=1niCEijT(ϕ,ψ)ni, and the population average total causal effect by CET(ϕ,ψ)Y(0;ϕ)Y(1;ψ)=i=1NCEiT(ϕ,ψ)N.

Remark

It follows from (2), (4), and (5) that the individual total causal effect is the sum of individual direct and indirect causal effects, that is, CEijT(zi(j),zi(j))=CEijD(zi(j))+CEijI(zi(j),zi(j)). Likewise, the total causal effects can be decomposed as the sum of direct and indirect causal effects at the individual average, group average, and population average levels, for example, CET(ϕ,ψ)=CED(ψ)+CEI(ϕ,ψ). This result formalizes, using a causal framework, models from the vaccine and plant variety evaluation literature, which assume the total effect is the sum of direct and indirect effects (Halloran and Struchiner 1991, 1995; Kempton 1997; Moulton et al. 2006).

Remark

A few other characteristics of the algebra of causal effects bear mentioning. First, total causal effects are not commutative; for example, CET(ϕ,ψ) will not necessarily equal CET(ψ,ϕ) for ϕψ. However, indirect effects have the property CEI(ψ,ϕ)=CEI(ϕ,ψ), implying CED(ψ)+CED(ϕ)=CET(ϕ,ψ)+CET(ψ,ϕ). Thus, the total causal effects, while not necessarily equal, are constrained in sum to equal the sum of the direct effects. Also note that if CEI(ψ,ϕ)=CEI(ϕ,ψ)=0, then CET(ϕ,ψ)=CET(ψ,ϕ) if and only if CED(ϕ)=CED(ψ); that is, in the absence of indirect effects, the total effects are commutative if and only if the direct effects are equal.

3.5 Overall Causal Effect

Halloran and Struchiner (1991) defined the overall causal effect to be the average effect of an intervention program relative to no intervention. We define the individual overall causal effect of treatment zi compared to treatment zi for individual j in group i by CEijO(zi,zi)Yij(zi)Yij(zi). Similarly, for the comparison of ϕ to ψ, define the individual average overall causal effect by CEijO(ϕ,ψ)Yij(ϕ)Yij(ψ), the group average overall causal effect by CEiO(ϕ,ψ)Yi(ϕ)Yi(ψ), and the population average overall causal effect by CEO(ϕ,ψ)Y(ϕ)Y(ψ).

3.6 No Interference

The estimands defined previously simplify under the assumption of no interference between individuals within a group, that is, under the assumption Yij(zi)=Yij(zi) for any two treatment programs zi = (zi1, . . . , zini) and zi=(zi1,,zini) such that zij=zij (Rubin 1980; Angrist, Imbens, and Rubin 1996). Assuming no interference, the potential outcomes for individual j in group i can be written simply as Yij (0) and Yij (1). In turn, the individual direct causal effect equals Yij (0) - Yij (1). The corresponding group average direct causal effect becomes j=1ni{Yij(0)Yij(1)}ni, that is, the usual average causal effect (ACE) estimand. By (4), the individual indirect causal effect equals 0 for all individuals assuming no-interference. Similarly, by (5), the individual total causal effect equals the individual direct causal effect. Likewise, at the group and population average levels, under the no-interference assumption the indirect causal effect is 0 and the direct causal effect equals the total causal effect. Assuming no interference also implies the direct, indirect, and total effects do not depend on the treatment assignment strategies ϕ and ψ, whereas in the presence of interference within a group, they do in general.

4. INFERENCE

In this section, we consider drawing inference about the estimands defined previously. Throughout this section, we assume:

Assumption 1. ν, ϕ, and ψ are mixed assignment strategies.

In Section 4.1, we present estimators for the estimands defined previously and show they are unbiased under Assumption 1. In Section 4.2, we consider the variances of these estimators.

4.1 Estimators

Theorem 1

Suppose Si = 1 and let

Y^i(z;ψ)j=1niYij(Zi)I[Zij=z]j=1niI[Zij=z]forz=0,1; (6)

that is, Y^i(z;ψ) is the average of observed outcomes for individuals in group i receiving treatment z under treatment program Zi. Under Assumption 1, E{Y^i(z;ψ)Si=1}=Yi(z;ψ) for z = 0, 1.

Corollary

Under Assumption 1, CE^iD(ψ)Y^i(0;ψ)Y^i(1;ψ) is a conditionally unbiased estimator of CEiD(ψ) given Si = 1.

Remark

Unbiased estimators of the group average indirect, total, and overall causal effects do not exist without further assumptions because the same group is not observed under ϕ and ψ.

Theorem 2

For z = 0, 1, let Y^(z;ψ)i=1NY^i(z;ψ)×I[Si=1]i=1NI[Si=1] Under Assumption 1, E{Y^(z;ψ)}=Y(z;ψ) for z = 0.1.

Corollary

Under Assumption 1, unbiased estimators for the population average direct, indirect, and total causal effects are given by CE^D(ψ)Y^(0;ψ)Y^(1;ψ), CE^I(ϕ,ψ)Y^(0;ϕ)Y^(0;ψ), and CE^T(ϕ,ψ)Y^(0;ϕ)Y^(1;ψ), where Y^(z;ϕ) is defined analogously to Y^(z;ψ) for z = 0, 1.

Theorem 3

Let Y^i(ψ)j=1niYij(Zi)ni and Y^(ψ)i=1NY^i(ψ)I[Si=1]i=1NI[Si=1]Under Assumption 1, E{Y^i(ψ)Si=1}=Yi(ψ) and E{Y^(ψ)}=Y(ψ).

Corollary

Under Assumption 1, an unbiased estimator of CEO(ϕ,ψ) is given by CE^O(ϕ,ψ)Y^(ϕ)Y^(ψ), where Y^(ϕ) is defined analogously to Y^(ψ).

4.2 Variance Estimators

In general, unbiased estimators of the variances of the estimators discussed previously do not exist without making further assumptions. For example, consider estimating Var(Y^i(z;ψ)Si=1) under Assumption 1. The estimator Y^i(1;ψ) is based on sampling from the set of potential outcomes {Yij(zi):ziRKini,zij=1} for some fixed value of Ki. This set can be partitioned into (niKi) clusters of size Ki, where each cluster corresponds to a particular ziRKini. Moreover, given Si = 1 from the first stage of randomization, the second randomization stage entails selecting exactly one of these clusters according to Zi. Thus, Y^i(1;ψ) can be viewed as the sample mean from a single systematic sample. It is known that, in general, unbiased estimators of the variance of the sample mean from a single systematic sample do not exist without making further assumptions about the underlying population (Som 1973, sec. 4.4; Thompson 1992, chap. 12.4).

Therefore, to make progress in deriving variance estimators, in Section 4.2.1 an additional assumption is introduced about the structure of interference (stratified interference), which may be plausible in a broad range of settings. In Section 4.2.2, variance estimators of the direct, indirect, total, and overall causal effect estimators are proposed. Under the additional assumption of stratified interference, these variance estimators are shown to be unbiased if the causal effects are additive and positively biased otherwise.

4.2.1 Stratified Interference

Suppose that Rni, that is, the set of possible treatment programs for group i, can be partitioned into strata such that within strata there is no interference. In particular, we assume:

Assumption 2 (Stratified interference). For k = 1, . . . , ni - 1, Yij(zi)=Yij(zi) for all zi, ziRkni such that zij=zij.

To illustrate the meaning of Assumption 2, consider a study of the effects of an intervention on children in a school. The stratified interference assumption states that the outcome for a child receiving the intervention will be the same when k - 1 schoolmates also receive the intervention, regardless of which particular k - 1 schoolmates receive the intervention. This assumption can be viewed as an intermediate assumption between (i) assuming no interference within a group and (ii) making no assumptions about the nature of interference within a group. Moreover, because there are ni possible values of Ki given zij = z, it follows that Yij (zi(j), zij = z) can take on ni values. Thus, for a given zij = z, an individual has ni potential outcomes under Assumption 2 compared to only one potential outcome under (i) and 2ni-1 potential outcomes under (ii).

To illustrate the utility of Assumption 2, again consider estimating Var(Y^i(1;ψ)Si=1). Suppose, by way of contradiction, there exists an unbiased estimator Var(Y^i(1;ψ)Si=1) in general, that is, under Assumption 1 only. Denote this estimator by g(Oi1 (Zi)), where Oi1 (Zi) ≡ {Yij (Zi) : Zij = 1} is the set of observed outcomes for individuals in group i assigned treatment Zij = 1 and g is some real-valued function of Oi (Zi). By our supposition, E{g(Oi1(Zi))Si=1}=Var(Y^i(1;ψ)Si=1) under Assumption 1. Consider the following two different scenarios where ni = 3 and Ki = 2 given Si = 1.

First, suppose Yij(zi) = k for j = 1, 2, 3, and ziR23={(011),(110),(101)}, where κ is some constant. In other words, the potential outcomes in group i are constant for all individuals and all treatment programs given Si = 1. Then Var(Y^i(1;ψ)Si=1)=0, implying g({κ, κ}) = 0 for any constant κ.

Second, suppose Yij (zi) = f (zi) for j = 1, 2, 3 and ziR23, where f is some real-valued function of zi. In other words, for any treatment program ziR23, all individuals in group i have the same response. Suppose also that f (011) ≠ f (110) ≠ f (101). Now Y^i(1;ψ)=f(Zi), implying Var(Y^i(1;ψ)Si=1) equals the sample variance of the set {f(zi):ziR23} Thus, Var(Y^i(1;ψ)Si=1)>0. However, because Oi1 (Zi) = {f (Zi), f (Zi)} in this scenario, it follows that E{g(Oi1 (Zi))| Si = 1} = 0. Thus, g is a biased estimator of Var(Y^i(1;ψ)Si=1), a contradiction.

Intuitively, an unbiased estimator of Var(Y^i(1;ψ)Si=1) does not exist in general because the observed data no way to distinguish between these two scenarios under either scenario, the observed outcomes are all equal, that is, Yi1 (Zi) = Yi2(Zi) = Yi3(Zi). However, with the addition of Assumption 2, one can rule out the possibility of the second scenario. Namely, under Assumption 2, Yi1 (110) = Yi1 (101) and Yi2(110) = Yi2(011), implying f (011) = f (110) f (101).

More generally, unbiased variance estimators do not exist without further assumptions (such as stratified interference) because observing Yij (zi) provides no information about Yij (ωi) for ωizi. Under Assumption 2, each individual now has only two potential outcomes, one for zij = 0 and one for zij = 1,, within a particular stratum Rkni. Therefore, given Si, the observed data under one treatment program will provide information about the potential outcomes under other treatment programs. For example, suppose ψ is a mixed strategy such that Ki is fixed. Then, under Assumption 2, the outcomes for individual j are constant for all ziRKini such that zij = 1. Denote this value by Yij (1; ψ), that is, Yij (1; ψ) ≡ Yij (ω, zij = 1) for any ωRKi1ni1. Define Yij (0; ψ) similarly.

4.2.2 Variance Estimators Assuming Stratified Interference

Theorem 4

Let

Var^(Y^i(1;ψ)Si=1)(1Kini)σ^i12(ψ)Ki,

where σ^i12(ψ)j=1ni{Yij(1;ψ)Y^i(1;ψ)}2Zij(Ki1) is the within-group sample variance, and

Var^(Y^i(1;ψ))(1CN)σ^g12(ψ)C+1CNi=1N(1Kini)σ^i12(ψ)KiSi,

where σ^g12(ψ)i=1N{Y^i(1;ψ)Y^(1;ψ)}2Si(C1). Define Var^(Y^i(0;ψ)Si=1), σ^i02(ψ), Var^(Y^(0;ψ)), and σ^g02(ψ) analogously. Under Assumptions 1 and 2,

E{Var^(Y^i(z;ψ)Si=1)Si=1}=Var(Y^i(z;ψ)Si=1) (7)

and E{Var^(Y^(z;ψ))}=Var(Y^(z;ψ))forz=0,1 for z = 0, 1.

Theorem 5

Let

Var^(CE^iD(ψ)Si=1)σ^i12(ψ)Ki+σ^i02(ψ)niKi. (8)

Under Assumptions 1 and 2,

E{Var^(CE^iD(ψ)Si=1)Si=1}=Var(CE^iD(ψ)Si=1)+σi(01)2(ψ)ni,

where σi(01)2(ψ)j=1ni[{Yij(0;ψ)Yij(1;ψ)}{Yi(0;ψ)Yi(1;ψ)}]2(ni1) is the variance of the ni differences Yij (0; ψ) - Yij (1; ψ).

Corollary

Under Assumptions 1 and 2,

E{Var^(CE^iD(ψ)Si=1)Si=1}Var(CE^iD(ψ)Si=1),

with equality holding if and only if

Yij(0;ψ)=Yij(1;ψ)+ηDi (9)

for fixed constant ηDi and j = 1, . . . , ni.

Remark

The corollary to Theorem 5 says (8) is a conditionally unbiased estimator of Var(CE^iD(ψ)Si=1) if and only if the individual direct effect is additive. If (9) does not hold, (8) will be a positively biased estimator of Var(CE^iD(ψ)Si=1). This could occur, for instance, if the potential outcomes are binary, taking on values 0 and 1 only. In this case, (9) will only be true if either (i) ηDi = 0 or (ii) |ηDi| = 1, with (ii) corresponding to the scenario that either (Yij (0; ψ), Yij (1; ψ)) = (0, 1) for all j or (Yij (0; ψ), Yij (1; ψ)) = (1, 0) for all j.

Theorem 6

Let

Var^(CE^D(ψ))(1CN)σ^D2(ψ)C+1CNi=1NVar^(CE^iD(ψ)Si=1)Si, (10)

where σ^D2(ψ)i=1N{CE^iD(ψ)CE^D(ψ)}2Si(C1). Under Assumptions 1 and 2,

E{Var^(CE^D(ψ))}=Var(CE^D(ψ))+1N2i=1Nσi(01)2(ψ)ni.
Corollary

Under Assumptions 1 and 2, E{Var^(CE^D(ψ))}Var(CE^D(ψ)) with equality holding if and only if (9) holds for all i = 1, . . . , N.

Remark

The corollary to Theorem 6 is similar to the corollary to Theorem 5 in that (10) is an unbiased estimator of Var(CE^D(ψ)) if and only if the individual direct effects are additive. If direct additivity does not hold for all individuals, (10) will be positively biased. Analogous results for the group average indirect, total, and overall effects follow from Theorems 7-9.

Theorem 7

Let Var^(CE^I(ϕ,ψ))σ^g02(ϕ)(NC)+σ^g02(ψ)C. Under Assumptions 1 and 2,

E{Var^(CE^I(ϕ,ψ))}=Var(CE^I(ϕ,ψ))+σg(00)2(ϕ,ψ)N,

where σg(00)2(ϕ,ψ)i=1N[{Yi(0;ϕ)Yi(0;ψ)}{Y(0;ϕ)Y(0;ψ)}]2(N1) is the variance of the N differences Yi(0;ϕ)Yi(0;ψ).

Corollary

Under Assumptions 1 and 2, E{Var^(CE^I(ϕ,ψ))}Var(CE^I(ϕ,ψ)) with equality holding if and only if Yi(0;ϕ)=Yi(0;ψ)+ηI for fixed constant ηI and i = 1, . . . , N.

Theorem 8

Let Var^(CE^T(ϕ,ψ))σ^g02(ϕ)(NC)+σ^gI2(ψ)C. Under Assumptions 1 and 2,

E{Var^(CE^T(ϕ,ψ))}=Var(CE^T(ϕ,ψ))+σg(01)2(ϕ,ψ)N,

where σg(01)2(ϕ,ψ)i=1N[{Yi(0;ϕ)Yi(1;ψ)}{Y(0;ϕ)Y(1;ψ)}]2(N1).

Corollary

Under Assumptions 1 and 2, E{Var^(CE^T(ϕ,ψ))}Var(CE^T(ϕ,ψ)) with equality holding if and only if Yi(0;ϕ)=Yi(1;ψ)+ηT for fixed constant ηT and i = 1, . . . , N.

Theorem 9

Let Var^(CE^O(ϕ,ψ))σ^M2(ϕ)(NC)+σ^M2(ψ)C, where σ^M2(ψ)i=1N{Y^i(ψ)Y^(ψ)}2Si(C1) and σ^M2(ϕ) is defined analogously. Under Assumptions 1 and 2,

E{Var^(CE^O(ϕ,ψ))}=Var(CE^O(ϕ,ψ))+σM2(ϕ,ψ)N,

where σM2(ϕ,ψ)i=1N[{Yi(ϕ)Yi(ψ)}{Y(ϕ)Y(ψ)}]2(N1).

Corollary

Under Assumptions 1 and 2, E{Var^(CE^O(ϕ,ψ))}Var(CE^O(ϕ,ψ)) with equality holding if and only if Yi(ϕ)=Yi(ψ)+ηO for fixed constant ηO and i = 1, . . . , N.

5. EXAMPLES

5.1 Housing Vouchers

Motivated by randomized studies designed to assess the effect of vouchers on housing mobility, Sobel (2006) proposed causal estimands and estimators when interference between units is present. At the first level, Sobel considered the effect of housing vouchers on the lease-up rate, that is, whether a household moves. At the second level, he considered voucher effects on other outcomes such as parents’ perceptions of safety, welfare receipt, and child health. In this section, some of these estimands and estimators are shown to be special cases of those defined in Sections 3 and 4.1. To begin, we demonstrate that Sobel’s causal estimand and estimator of the voucher effect on the lease-up rate are examples of the group average total causal effect estimand and estimator. Because Sobel considered just one group, we drop the subscript i for group in the rest of this section.

Consider a study where n households within a neighborhood are randomized to receive a housing voucher. In our terminology, households correspond to individuals and the neighborhood corresponds to a single group. Let Zj = 1 if the j th household receives a voucher, Zj = 0 otherwise for j = 1, . . . , n. Let Yj (z) = 1 if the j th household moves using a voucher, Yj (z) = 0 otherwise. Because moving using a voucher is clearly not possible without a voucher, it follows immediately from the definition of Yj (z) that

Yj(z(j),zj=0)=0forj=1,,n. (11)

Suppose Prϕ(Z = 0) = 1; that is, ϕ corresponds to the benchmark allocation strategy where no household receives a voucher. Then (11) implies

CET(ϕ,ψ)=1nj=1nzRn1Yj(Z(j)=z,Zj=1)×Prϕ(Z(j)=zZj=1).

for any other household assignment strategy ψ. In particular, suppose ψ corresponds to the mixed assignment strategy, where exactly k of n households receive a voucher. Then

CET(ϕ,ψ)=(k1)!(nk)!n!zRk1n1j=1nYj(Z(j)=z,Zj=1)=k!(nk)!n!zRk1n1ζ=011kj=1nYj(Z(j)=z,Zj=ζ)ζ=k!(nk)!n!zRkn1kj=1nYj(Z=z)zj,

which is equivalent (up to a minus sign) to equation (2) of Sobel in the setting where there are two levels of treatment. Sobel actually considered the more general situation of three treatment levels, which is not considered here. Sobel’s corresponding estimator, the observed lease-up rate among voucher recipients, is equivalent to Y^(1;ψ) as given by (6). Under exclusion restriction (11), CE^T(ϕ,ψ)=Y^(1;ψ).

Although interference is possible among those receiving housing vouchers, the exclusion restriction (11) precludes interference when a household does not receive a voucher. Thus, the indirect effect of housing vouchers on mobility is 0, CEI(ϕ,ψ)=0, and the total effects equal the direct effects. If households could move without the aid of a voucher, an analysis based on (11) could potentially overestimate the magnitude of the total effect of vouchers. By instead defining Yj (z) = 1 if the j th household moves and 0 otherwise, a voucher allocation strategy may have indirect effects in those not receiving vouchers as well as direct and total causal effects. For example, this would allow for the possibility that if several neighbors move because they receive vouchers, household j might also move even though they did not obtain a voucher. Estimating such indirect causal effects of voucher distribution within a neighborhood on those households that do not receive a voucher would likely be of interest to policy makers. Of course, in this case, observation of potential outcomes under the benchmark allocation program Z = 0 would also be necessary to estimate the indirect and total effects.

Sobel’s estimand of the effects of vouchers on outcomes such as welfare receipt can be viewed as an example of the group average overall effect estimand. For instance, assume all households that receive a voucher subsequently move and ϕ is the benchmark allocation strategy. Then taking the expected value of Sobel’s “average effect” [his eq. (3)] over the distribution of possible intervention programs under ψ yields the group average overall effect CEO(ϕ,ψ). Sobel noted that the average overall effect is a weighted average of the indirect effects on those not receiving a voucher, which he called spillover effects, and the effects in those receiving a voucher. More precisely, for ϕ the benchmark allocation strategy, the overall effect equals the following weighted sum of the indirect and total effects: CEO(ϕ,ψ)=Prψ(Zj=0)CEI(ϕ,ψ)+Prψ(Zj=1)CET(ϕ,ψ).

Because the design of the housing voucher study does not include randomizing some neighborhoods to the benchmark allocation, outcomes such as welfare receipt and parents’ perception of safety are not observed under this allocation. Thus, without further assumptions akin to (11), voucher effects on these outcomes are not identifiable from the data. Alternatively, these effects are identifiable by considering a population of neighborhoods and a two-stage randomization design. For example, neighborhoods within a city or set of cities could be identified that were sufficiently separated geographically to ensure that the assumption of no interference between neighborhoods is plausible. Then, in the first stage of randomization, some neighborhoods could be randomly assigned the benchmark allocation and other neighborhoods to an allocation strategy where in the second stage, a specified proportion of randomly selected households would receive a voucher. Such a design would permit estimation of the direct, indirect, total, and overall effects of housing vouchers on the outcomes described previously without making exclusion restrictions such as (11). Estimation of the variances of the causal effect estimators would also be possible with this design under the additional assumption of stratified interference.

5.2 Vaccines

Direct application of the proposed methods to the data given in Table 1 is not appropriate because baris were not randomly assigned to particular levels of vaccine coverage in the actual trial. Therefore, for illustrative purposes, we consider a hypothetical two-stage randomized placebo-controlled trial of cholera vaccines in a setting similar to that of Ali et al. (2005). Suppose in the first stage of this hypothetical trial that five geographically separate groups were randomized using a mixed allocation strategy ν such that three groups were assigned ϕ and the remaining two were assigned ψ. Then, in the second stage, suppose 30% of individuals were randomly chosen to receive vaccine within groups assigned ϕ and 50% of individuals were randomly chosen to receive vaccine within groups assigned ψ. Individuals were then followed for one year for detection of cholera. Results from this hypothetical trial are given in Table 2.

Table 2.

Illustrative example of a two-stage randomized placebo-controlled vaccine trial based on data from Ali et al. (2005)

Group assignment Si Vaccine recipients (Zij = 1)
Placebo recipients (Zij = 0)
Group i Total ΣjZij Cases ΣjZijYij (Zi) Total Σj (1 - Zij) Cases Σj (1 - Zij)Yij (Zi)
1 1 12,541 16 12,541 18
2 1 11,513 26 11,513 54
3 0 10,772 17 25,134 119
4 0 8,883 22 20,727 122
5 0 5,627 15 13,130 92

NOTE: Group assignment Si = 1 (0) corresponds to 50% (30%) vaccine coverage.

Estimates of the population average direct, indirect, total, and overall effects are given in Table 3. The estimated variances are also presented. Note the direct effect estimate is nearly three times greater under ϕ (30% coverage) compared to ψ (50% coverage). Ali et al. (2005) noted a similar phenomenon and cautioned that high levels of vaccine coverage can bias estimates of vaccine efficacy (i.e., the direct effect of vaccination). In fact, the issue here is not one of bias, but rather that the parameter being estimated can depend on the level of coverage due to interference between individuals. Moreover, the totality of effects of a vaccination strategy must be viewed by considering indirect, total, and overall effects in addition to direct effects. Estimates of these other effects can easily be interpreted by investigators. For example, the indirect effect estimate in Table 3 suggests 50% vaccine coverage results in 2.8 fewer cholera cases per 1,000 unvaccinated individuals per year compared to only 30% vaccine coverage. Note the estimated total effect of being vaccinated under ψ is over three times the corresponding estimated direct effect, demonstrating the importance of randomizing groups to different levels of vaccine coverage. Had all groups received 50% coverage such that only the direct effect could be estimated, the utility of vaccination would have been substantially underestimated. The estimated overall effect provides a simple summary comparison of the two strategies, indicating that, on average, 50% vaccine coverage results in 2.4 fewer cases of cholera per 1,000 individuals per year compared to 30% vaccine coverage.

Table 3.

Estimates of population average direct, indirect, total, and overall effects per 1,000 individuals per year for data in Table 2

Effect Parameter Estimate Estimated variance
Direct CED(ψ) 1.30 .856
Direct CED(ϕ) 3.64 .178
Indirect CEI(ϕ,ψ) 2.81 3.079
Total CET(ϕ,ψ) 4.11 .672
Overall CEO(ϕ,ψ) 2.37 1.430

6. DISCUSSION

In this article, estimands for direct, indirect, total, and overall causal effects of different treatment strategies are proposed in the setting where interference between individuals is possible. Relations between the estimands are established, and estimators of the proposed estimands are presented. These estimators are shown to be unbiased assuming a two-stage randomization procedure with a mixed assignment mechanism at each stage (Assumption 1). Under an additional assumption of stratified interference (Assumption 2), variance estimators of the causal effect estimators are derived that are unbiased under additivity and positively biased otherwise.

This article builds on previous work in several significant ways. First, causal inference in the presence of interference is considered in a general framework not specific to any one subject area, unifying previous work on housing mobility studies (Sobel 2006) and infectious diseases (Halloran and Struchiner 1991, 1995). The definitions of individual direct, indirect, total, and overall causal effects in the presence of interference (Halloran and Struchiner 1991, 1995) are formally extended to groups and populations of groups by averaging over all possible treatment assignments for particular allocation strategies (Sobel 2006). By considering a population of groups, rather than just one group as in Sobel (2006), unbiased estimators of the causal estimands of interest are derived without requiring exclusion restriction assumptions. The variance of causal effect estimators in the presence of interference is also considered, which had not been done previously. The utility of the proposed variance estimators will depend on whether Assumptions 1 and 2 are reasonable. Assumption 1 is determined by the experimental design and, thus, should be under control of the investigator. Assumption 2 may be reasonable in many settings, such as in the evaluation of the effects of vaccines, educational interventions, or housing vouchers. However, in other contexts, such as in cross-over trials or plant variety studies, assuming different forms of interference may be more appropriate.

The methods developed here could be extended to settings with more than two treatment levels or noncompliance as in Sobel (2006). Consideration of population subgroups may be of interest in the presence of interference. As in Halloran, Longini, Cowart, and Nizam (2002), one could define the indirect, total, and overall effects for different subgroups of the population. For example, a strategy of vaccinating 70% of children against influenza and another strategy of not vaccinating children could be compared by the indirect effects on the incidence of influenza in adults from the same population. Similarly, Moulton et al. (2001) and Sobel (2006) considered populations consisting of participants and nonparticipants; that is, a subset of the population does not receive either treatment.

In addition to these extensions, other areas of research remain to be explored in the setting where interference between individuals is present. Different randomization strategies and interference structures might be considered. An anonymous referee suggested more efficient variance estimators might be derived if one is willing to make certain additional additivity assumptions. For example, under (9), σ^i12(ψ) and σ^i02(ψ) are estimating the same parameter, and, thus, more efficient variance estimators might be obtained by combining data from individuals assigned Zij = 0 and individuals assigned Zij = 1. Stochastic causal models could also be considered, wherein the potential outcomes Yij (zi) are treated as random rather than fixed (e.g., as in Robins and Greenland 1989). Methods are needed to construct confidence intervals for the different causal effects, perhaps by building on the work of Rosenbaum (2007).

Rubin (2005) recently noted that causal inference can be conducted by making only two assumptions: a probabilistic model about the treatment assignment mechanism and SUTVA. By providing a framework where the latter assumption is not completely necessary, our work here will hopefully contribute to a foundation for causal inference in the presence of interference upon which others can build.

Acknowledgments

MGH and MEH were supported in part by NIH grant 2 R01 AI32042. The authors thank Peter Gilbert and Thomas Richardson for helpful comments.

APPENDIX: PROOFS OF THEOREMS 1-9

A.1 Proof of Theorem 1

Without loss of generality, let z = 1. Under Assumption 1, Ki is fixed, so that

E{Y^i(1;ψ)Si=1}=1Kij=1nizRKiniPrψ(Zi=z)Yij(z)I[zij=1].

Now any z such that zij = 0 does not contribute to the summation, so that we can equivalently write

E{Y^i(1;ψ)Si=1}=1Kij=1niωRKi1ni1Prψ(Zi(j)=ω,Zij=1)×Yij(zi(j)=ω,zij=1)=1Kij=1niωRKi1ni1Prψ(Zi(j)=ωZij=1)×Prψ(Zij=1)Yij(zi(j)=ω,zij=1).

Under Assumption 1, Prψ (Zij = 1) = Ki/ni, implying

E{Y^i(1;ψ)Si=1}=1nij=1niωRKi1ni1Prψ(Zi(j)=ωZij=1)×Yij(zi(j)=ω,zij=1)=Yi(1;ψ).

A.2 Proof of Theorem 2

Without loss of generality, let z = 1. Using the fact that E{Y^(1;ψ)}=E[E{Y^(1;ψ)S}], from Theorem 1 it follows that E{Y^(1;ψ)}=E{i=1NYi(1;ψ)SiC}=Y(1;ψ).

A.3 Proof of Theorem 3

The conditional expectation result follows from

E{Y^i(ψ)Si=1}=j=1nizRniYij(z)Prψ(Zi=z)ni=j=1niYij(ψ)ni=Yi(ψ).

The remainder of the proof parallels that of Theorem 2.

A.4 Proof of Theorem 4

The proof follows directly from known properties of estimators of population means using simple random sampling (SRS) and two-stage cluster sampling (see, e.g., Kish 1965, chap. 2; Splawa-Neyman 1990; Thompson 1992, chaps. 2 and 3). For example, given Si = 1 and Assumptions 1 and 2, Y^i(1;ψ) can be viewed as the sample mean from a simple random sample drawn without replacement from {Yi1(1; ψ), . . . , Yini (1; ψ)}. Thus,

Var(Y^i(1;ψ)Si=1)=(1Kini)σi12(ψ)Ki, (A.1)

where σi12(ψ)j=1ni{Yij(1;ψ)Yi(1;ψ)}2(ni1) is the within-group variance. It is also well known that E(σ^i12(ψ)Si=1)=σi12(ψ), implying (7) holds.

Similarly, that E{Var^(Y^(z;ψ))}=Var(Y^(z;ψ)) for z = 0, 1 follows from known results on two-stage cluster sampling. A sketch of a proof of this follows. First, one can show

Var{Y^(1;ψ)}=(1CN)σg12(ψ)C+1CNi=1N(1Kini)σi12(ψ)Ki, (A.2)

where σg12(ψ)i=1N{Yi(1;ψ)Y(1;ψ)}2(N1). Next, note E(Si) = C/N such that

E{Var^(Y^(1;ψ))}=(1CN)E{σ^g12(ψ)}C+1N2i=1N(1Kini)σi12(ψ)Ki. (A.3)

So the remaining task at hand becomes finding E{σ^g12(ψ)}, which can be shown to equal

1Ni=1N(1Kini)σi12(ψ)Ki+σg12(ψ).

Substituting this into (A.3) implies E{Var^(Y^(1;ψ))} equals

(1CN)σg12(ψ)C+{(1CN)1C1N+1N2}i=1N(1Kini)σi12(ψ)Ki,

which simplifies to (A.2).

A.5 Proof of Theorem 5

The proof follows from Splawa-Neyman (1990) and Rubin (1990); a sketch is given here. First, we derive Var(CE^iD(ψ)Si=1), which, of course, equals Var{Y^i(0;ψ)Si=1}+Var{Y^i(1;ψ)Si=1}2Cov{Y^i(0;ψ),Y^i(1;ψ)Si=1}. We know the form of Var{Y^i(z;ψ)Si=1} for z = 0, 1 from the proof of Theorem 4. Additionally, one can show Cov(Y^i(0;ψ),Y^i(1;ψ)Si=1)={σi(01)2(ψ)σi02(ψ)σi12(ψ)}(2ni). Therefore,

Var(CE^iD(ψ)Si=1)=Kiniσi02(ψ)niKi+(1Kini)σi12(ψ)Ki1ni{σi(01)2(ψ)σi02(ψ)σi12(ψ)},

which simplifies to

Var(CE^iD(ψ)Si=1)=σi02(ψ)niKi+σi12(ψ)Kiσi(01)2(ψ)ni.

The proof is then completed by noting that

E{Var^(CE^iD(ψ)Si=1)Si=1}=σi02(ψ)niKi+σi12(ψ)Ki.

A.6 Proof of Theorem 6

The proof follows along similar lines as the derivation of E{Var^(Y^(1;ψ))} in the proof of Theorem 4. In particular, one can first show

Var(CE^D(ψ))=(1CN)σD2(ψ)C+1CNi=1NVar(CE^iD(ψ)Si=1), (A.4)

where σD2(ψ)i=1N{CEiD(ψ)CED(ψ)}2(N1). Next, similar to E{σ^g12(ψ)} in Theorem 4, one can show E{σ^D2(ψ)}=i=1NVar(CE^iD(ψ)Si=1)N+σD2(ψ). Taking the expected value of (10), we have

E[Var^{CE^D(ψ)}]=(1CN)σD2(ψ)C+(1CN)1C1Ni=1NVar(CE^iD(ψ)Si=1)+1CNi=1NE{Var^(CE^iD(ψ)Si=1)Si}.

Combining this result with Theorem 5 proves the theorem.

A.7 Proof of Theorem 7

The proof follows along the same lines as the proof of Theorem 5. Namely, one can show Cov(Y^(0;ϕ),Y^(0;ψ))={σg(00)2(ϕ,ψ)σg02(ϕ)σg02(ψ)}(2N), where σg02(ϕ) and σg02(ψ) are defined analogously to σg12(ψ) in Theorem 4, implying Var(CE^I(ϕ,ψ)) equals

Var(Y^(0;ϕ))+Var(Y^(0;ψ))+1N{σg02(ϕ)+σg02(ψ)σg(00)2(ϕ,ψ)}. (A.5)

From the proof of Theorem 4, we have

E{σ^g02(ψ)C}=σg02(ψ)C+1CNi=1NVar(Y^i(0;ψ)Si=1)=Var{Y^(0;ψ)}+σg02(ψ)N,

and similarly E{σ^g02(ϕ)(NC)}=Var{Y^(0;ϕ)}+σg02(ϕ)N, which together with (A.5) prove the theorem.

A.8 Proof of Theorem 8

The proof is analogous to the proof of Theorem 7.

A.9 Proof of Theorem 9

As in the proof of Theorem 4, one can show

Var(Y^(ψ))=(1CN)σM2(ψ)C+1CNi=1NVar(Y^i(ψ)Si=1),

where σM2(ψ)i=1N{Yi(ψ)Y(ψ)}2(N1). Following the same lines as the proofs of Theorems 5 and 7, one can also show Cov{Y^(ϕ),Y^(ψ)}={σM2(ϕ,ψ)σM2(ϕ)σM2(ψ)}(2N), where σM2(ϕ) is defined analogously to σM2(ψ), implying

Var{CE0(ϕ,ψ)}=Var{Y^(ϕ)}+Var{Y^(ψ)}+1N{σM2(ϕ)+σM2(ψ)σM2(ϕ,ψ)}.

Additionally, one can show E{σ^M2(ψ)}=σM2(ψ)+i=1NVar{Y^i(ψ)Si=1}N, which implies E{σ^M2(ψ)C}=Var{Y^(ψ)}+σM2(ψ)N. Using an analogous result for σ^M2(ϕ), the theorem follows.

REFERENCES

  1. Ali M, Emch M, von Seidlein L, Yunus M, Sack DA, Rao M, Holmgren J, Clemens JD. Herd Immunity Conferred by Killed Oral Cholera Vaccines in Bangladesh: A Reanalysis. Lancet. 2005;366:44–49. doi: 10.1016/S0140-6736(05)66550-6. [DOI] [PubMed] [Google Scholar]
  2. Angrist JD, Imbens GW, Rubin DB. Identification of Causal Effects Using Instrumental Variables. Journal of the American Statistical Association. 1996;91:444–455. [Google Scholar]
  3. Bailey RA, Kunert J. On Optimal Crossover Designs When Carryover Effects Are Proportional to Direct Effects. Biometrika. 2006;93:613–625. [Google Scholar]
  4. Borm GF, Melis RJF, Teerenstra S, Peer PG. Pseudo Cluster Randomization: A Treatment Allocation Method to Minimize Contamination and Selection Bias. Statistics in Medicine. 2005;24:3535–3547. doi: 10.1002/sim.2200. [DOI] [PubMed] [Google Scholar]
  5. Cox DR. Planning of Experiments. Wiley; New York: 1958. [Google Scholar]
  6. Friedman LM, Furberg C, DeMets DL. Fundamentals of Clinical Trials. Springer-Verlag; New York: 1998. [Google Scholar]
  7. Halloran ME, Struchiner CJ. Study Designs for Dependent Happenings. Epidemiology. 1991;2:331–338. doi: 10.1097/00001648-199109000-00004. [DOI] [PubMed] [Google Scholar]
  8. Halloran ME, Struchiner CJ. Causal Inference in Infectious Diseases. Epidemiology. 1995;6:142–151. doi: 10.1097/00001648-199503000-00010. [DOI] [PubMed] [Google Scholar]
  9. Halloran ME, Longini IM, Cowart DM, Nizam A. Community interventions and the Epidemic Prevention Potential. Vaccine. 2002;20:3254–3262. doi: 10.1016/s0264-410x(02)00316-x. [DOI] [PubMed] [Google Scholar]
  10. Hayes RJ, Alexander NDE, Bennett S, Cousens SN. Design and Analysis Issues in Cluster-Randomized Trials of Interventions Against Infectious Diseases. Statistical Methods in Medical Research. 2000;9:95–116. doi: 10.1177/096228020000900203. [DOI] [PubMed] [Google Scholar]
  11. Kempton RA. Interference Between Plots. In: Kempton RA, Fox PN, editors. Statistical Methods for Plant Variety Evaluation. Chapman & Hall; London: 1997. pp. 101–116. [Google Scholar]
  12. King JC, Stoddard JJ, Gaglani MB, Moore KA, Magder L, McClure E, Rubin JD, Englund JA, Neuzil K. Effectiveness of School-Based Influenza Vaccination. New England Journal of Medicine. 2006;355:2523–2532. doi: 10.1056/NEJMoa055414. [DOI] [PubMed] [Google Scholar]
  13. Kish L. Survey Sampling. Wiley; New York: 1965. [Google Scholar]
  14. Longini IM, Halloran ME, Nizam A. Model-Based Estimation of Vaccine Effects From Community Vaccine Trials. Statistics in Medicine. 2002;21:481–495. doi: 10.1002/sim.994. [DOI] [PubMed] [Google Scholar]
  15. Melis RJF, van Eijken MIJ, Borm GF, Wensing M, Adang E, van de Lisdonk EH, van Achterberg T, Olde Rikkert MGM. The Design of the Dutch EASYcare Study: A Randomised Controlled Trial on the Effectiveness of a Problem-Based Community Intervention Model for Frail Elderly People. BMC Health Services Research. 2005;5:65. doi: 10.1186/1472-6963-5-65. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Moulton LH, O’Brien KL, Kohberger R, Chang I, Reid R, Weatherholtz R, Hackell JG, Siber GR, Santosham M. Design of a Group-Randomized Streptococcus pneumoniae Vaccine Trial. Controlled Clinical Trials. 2001;22:438–452. doi: 10.1016/s0197-2456(01)00132-5. [DOI] [PubMed] [Google Scholar]
  17. Moulton LH, O’Brien KL, Reid R, Weatherholtz R, Santosham M, Siber GR. Evaluation of the Indirect Effects of a Pneumococcal Vaccine in a Community-Randomized Study. Journal of Biopharmaceutical Statistics. 2006;16:453–462. doi: 10.1080/10543400600719343. [DOI] [PubMed] [Google Scholar]
  18. Murray DM. Design and Analysis of Group-Randomized Trials. Oxford University Press; New York: 1998. [Google Scholar]
  19. Robins J, Greenland S. The Probability of Causation Under a Stochastic Model for Individual Risk. Biometrics. 1989;45:1125–1138. [PubMed] [Google Scholar]
  20. Rosenbaum PR. Interference Between Units in Randomized Experiments. Journal of the American Statistical Association. 2007;102:191–200. doi: 10.1080/01621459.2012.655954. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Ross R. An Application of the Theory of Probabilities to the Study of a priori Pathometry. I. (Ser. A).Proceedings of the Royal Society. 1916;92:204–230. [Google Scholar]
  22. Rubin DB. Bayesian Inference for Causal Effects: The Role of Randomization. The Annals of Statistics. 1978;6:34–58. [Google Scholar]
  23. Rubin DB. Discussion of “Randomization Analysis of Experimental Data in the Fisher Randomization Test. Journal of the American Statistical Association. 1980;75:591–593. by D. Basu. [Google Scholar]
  24. Rubin DB. Comment: Neyman (1923) and Causal Inference in Experiments and Observations Studies. Statistical Science. 1990;5:472–480. [Google Scholar]
  25. Rubin DB. Causal Inference Using Potential Outcomes: Design, Modeling, Decisions. Journal of the American Statistical Association. 2005;100:322–331. [Google Scholar]
  26. Senn S. Cross-Over Trials in Clinical Research. Wiley; New York: 1993. [Google Scholar]
  27. Sobel M. What Do Randomized Studies of Housing Mobility Demonstrate? Causal Inference in the Face of Interference. Journal of the American Statistical Association. 2006;101:1398–1407. [Google Scholar]
  28. Som RK. A Manual of Sampling Techniques. Heinemann; London: 1973. [Google Scholar]
  29. Splawa-Neyman J, Dabrowska DM, Speed TP, editors. On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9. Statistical Science. 1990;5:465–472. [Google Scholar]
  30. Struchiner CJ, Halloran ME, Robins JM, Spielman A. The Behavior of Common Measures of Association Used to Assess a Vaccination Program Under Complex Disease Transmission Patterns—A Computer Simulation Study of Malaria Vaccines. International Journal of Epidemiology. 1990;19:187–196. doi: 10.1093/ije/19.1.187. [DOI] [PubMed] [Google Scholar]
  31. Thompson SK. Sampling. Wiley; New York: 1992. [Google Scholar]

RESOURCES