Abstract
For two-stage randomized experiments assuming partial interference, exact confidence intervals are proposed for treatment effects on a binary outcome. Empirical studies demonstrate the new intervals have narrower width than previously proposed exact intervals based on the Hoeffding inequality.
Keywords: Causal Inference, Exact Confidence Interval, Interference, Randomization Inference
1. Introduction
In a randomized experiment, it is commonly assumed that an individual only has two potential outcomes: an outcome on control, and an outcome on treatment. That an individual has only two potential outcomes assumes no interference (Cox, 1958) between individuals, i.e., an individual’s potential outcomes are unaffected by the treatment assignment of any other individual in the study. There are many settings where this assumption of no interference is clearly violated (Hong and Raudenbush, 2006; Sobel, 2006; Rosenbaum, 2007).
Partial interference holds when individuals can be partitioned into groups such that there is no interference between individuals in different groups. In settings where partial interference holds, two-stage randomized experiments have been suggested as a study design for drawing inference about treatment (i.e., causal) effects. Two-stage randomized experiments proceed by (i) randomizing groups to treatment strategies and (ii) randomizing individuals within groups to different treatments based on the treatment strategy assigned to their group in stage (i). Two-stage randomized experiments are found in many fields of study, e.g., infectious diseases (Baird et al., 2012), medicine (Borm et al., 2005), economics (Duflo and Saez, 2003), and political science (Ichino and Schündeln, 2012; Sinclair et al., 2012). Building upon ideas in Halloran et al. (1991), Hudgens and Halloran (2008) defined and derived unbiased estimators for the direct, indirect, total, and overall effects of treatment in a two-stage randomized experiment assuming partial interference. Liu and Hudgens (2014) showed that Wald-type confidence intervals based on these estimators perform well when the number of groups is large; however, often the number of groups may not be large enough. For example, Moulton et al. (2001) describe a group-randomized vaccine trial involving approximately 9,000 individuals but only 38 groups. Tchetgen Tchetgen and VanderWeele (2012), henceforth TV, proposed exact confidence intervals using the Hoeffding inequality for these four effects in a two-stage randomized experiment with partial interference. Unfortunately, as will be shown below, the TV intervals can be very wide and conservative.
In this paper, we propose different exact confidence intervals, constructed by inverting exact hypothesis tests, that tend to be less conservative than the TV intervals. The remainder of the paper is organized as follows. In §2, treatment effects in the presence of interference are defined and existing inferential results are reviewed. In §3, the assumption of stratified interference is presented and bounds are derived for the causal effects under this assumption. In §4, the proposed new exact confidence intervals are described, obtained by inverting certain permutation tests. In §5, a simulation study is conducted comparing the TV, asymptotic, and new exact confidence intervals. §6 concludes with a discussion. An R package implementing the proposed confidence intervals is available.
2. Preliminaries
2.1. Estimands
Consider a finite population of N individuals partitioned into k groups with ni individuals in group i for i = 1, …, k. Assume partial interference, i.e., there is no interference between individuals in different groups. Consider a two-stage randomized experiment wherein h of the k groups are assigned to strategy α1 and k − h are assigned to α0 in the first stage, where strategy αs specifies how many of the ni individuals in group i will receive treatment. For example, strategy α0 might entail assigning (approximately) 1/3 of individuals within a group to treatment whereas strategy α1 might entail assigning (approximately) 2/3 of individuals within a group to treatment (see TV for further discussion about different types of treatment allocation strategies). Let Si = 1 if group i is randomized to α1 and 0 otherwise, so that Pr[Si = 1] = h/k. In the second stage, individuals are randomized to treatment conditional on the group assignment in the first stage. Let Zij = 1 if individual j in group i is assigned treatment and 0 otherwise. Let Zi = (Zi1, …, Zini) be the random vector of treatment assignments for group i taking on values zi ∈ 𝒵i, the set of all vectors of length ni whose numbers of elements equal to 1 and equal to 0 are determined by the strategy assigned to group i. Additionally, let Zi(j) denote the random vector of treatment assignments in group i excluding individual j, taking on values zi(j) ∈ 𝒵i(j).
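To make the design concrete, the following minimal R sketch (not taken from the paper) simulates one two-stage randomization; the values of k, n, and h and the 1/3 and 2/3 allocations are illustrative, borrowed from the example above.

```r
# Minimal sketch of a two-stage randomization (illustrative values only):
# stage 1 assigns h of k groups to strategy alpha_1; stage 2 assigns a
# strategy-specific number of individuals within each group to treatment.
set.seed(1)
k <- 10; n <- 20; h <- 5                            # k groups of size n; h get alpha_1
S <- sample(rep(c(1, 0), times = c(h, k - h)))      # stage 1: S_i = 1 if group i gets alpha_1

# number treated per group: roughly 2/3 under alpha_1, 1/3 under alpha_0
m <- ifelse(S == 1, round(2 * n / 3), round(n / 3))

# stage 2: within-group treatment vectors Z_i
Z <- lapply(seq_len(k), function(i) sample(rep(c(1, 0), times = c(m[i], n - m[i]))))

table(S)            # groups per strategy
sapply(Z, mean)     # realized proportion treated in each group
```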
Let yij(zi) be the binary potential outcome for individual j in group i when group i receives treatment vector zi. A randomization inference framework is adopted wherein potential outcomes are fixed features of the finite population of N individuals and only treatment assignments S and Z are random (as in Sobel (2006); Rosenbaum (2007); Hudgens and Halloran (2008)). Define the average potential outcome for individual j in group i on treatment z = 0, 1 under strategy αs as
ȳij(z; αs) ≡ Σω yij(zij = z, zi(j) = ω) Pr[Zi(j) = ω | Zij = z, Si = s]   (1)
where the sum is over ω ∈ 𝒵i(j) and yij(zij = z, zi(j) = ω) denotes the potential outcome of individual j in group i when individual j receives treatment z and the remaining individuals in group i receive zi(j) = ω. Henceforth, Σi denotes summation over i = 1, …, k and Σj denotes summation over j = 1, …, ni. For treatment z under strategy αs define the group average potential outcome as ȳi(z; αs) ≡ ni−1 Σj ȳij(z; αs), and the population average potential outcome as ȳ(z; αs) ≡ k−1 Σi ȳi(z; αs). Define the average potential outcome for individual j in group i under strategy αs as
ȳij(αs) ≡ Σz=0,1 ȳij(z; αs) Pr[Zij = z | Si = s]   (2)
the group average potential outcome as ȳi(αs) ≡ ni−1 Σj ȳij(αs), and the population average potential outcome as ȳ(αs) ≡ k−1 Σi ȳi(αs). Define the direct effect of treatment for strategy αs as DE(αs) = ȳ(0; αs) − ȳ(1; αs), the indirect effect of α0 versus α1 as IE(α0, α1) = ȳ(0; α0) − ȳ(0; α1), the total effect as TE(α0, α1) = ȳ(0; α0) − ȳ(1; α1), and the overall effect of α0 versus α1 as OE(α0, α1) = ȳ(α0) − ȳ(α1); see Hudgens and Halloran (2008) and TV for additional discussion regarding these effects.
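As a small worked illustration, the sketch below evaluates the four effects from hypothetical values of the population average potential outcomes; the numbers are invented solely to make the contrasts concrete.

```r
# Hypothetical population average potential outcomes (illustrative numbers only)
ybar_0_a0 <- 0.60   # ybar(0; alpha_0)
ybar_1_a0 <- 0.40   # ybar(1; alpha_0)
ybar_0_a1 <- 0.35   # ybar(0; alpha_1)
ybar_1_a1 <- 0.25   # ybar(1; alpha_1)
ybar_a0   <- 0.55   # ybar(alpha_0)
ybar_a1   <- 0.30   # ybar(alpha_1)

DE_a0 <- ybar_0_a0 - ybar_1_a0   # direct effect under alpha_0
DE_a1 <- ybar_0_a1 - ybar_1_a1   # direct effect under alpha_1
IE    <- ybar_0_a0 - ybar_0_a1   # indirect effect of alpha_0 versus alpha_1
TE    <- ybar_0_a0 - ybar_1_a1   # total effect; note TE = DE_a1 + IE
OE    <- ybar_a0 - ybar_a1       # overall effect
```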
2.2. Existing Inferential Results
Hudgens and Halloran (2008) derived unbiased estimators for all population average potential outcomes, and thus for the four causal effects. Noting that Pr[Si = s] and Pr[Zij = z|Si = s] are known by design, the estimator
ŷ(z; αs) ≡ k−1 Σi 1{Si = s} ŷi(z; αs) / Pr[Si = s]   (3)
where ŷi(z; αs) ≡ ni−1 Σj 1{Zij = z} yij(Zi) / Pr[Zij = z | Si = s], is unbiased for ȳ(z; αs). Additionally, the estimator
ŷ(αs) ≡ k−1 Σi 1{Si = s} {ni−1 Σj yij(Zi)} / Pr[Si = s]   (4)
is unbiased for ȳ(αs). Unbiased estimators for the effects of interest follow immediately: D̂E(αs) = ŷ(0; αs) − ŷ(1; αs), ÎE(α0, α1) = ŷ(0; α0) − ŷ(0; α1), T̂E(α0, α1) = ŷ(0; α0) − ŷ(1; α1), and ÔE(α0, α1) = ŷ(α0) − ŷ(α1).
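The sketch below illustrates estimators consistent with displays (3) and (4); the list structure `dat` (one element per group holding S, Z, and the observed outcomes) and the function and argument names are hypothetical conveniences, not part of the paper.

```r
# Sketch of the unbiased estimators (3) and (4). dat is a hypothetical list
# with one element per group: dat[[i]]$S (0/1 strategy), dat[[i]]$Z (0/1
# treatment vector), dat[[i]]$Y (observed outcomes). pS1 = Pr[S_i = 1] = h/k;
# pZ1 = c(Pr[Z_ij = 1 | S_i = 0], Pr[Z_ij = 1 | S_i = 1]), known by design.
ybar_hat_zs <- function(dat, z, s, pS1, pZ1) {
  pS <- if (s == 1) pS1 else 1 - pS1
  pZ <- if (z == 1) pZ1[s + 1] else 1 - pZ1[s + 1]
  per_group <- vapply(dat, function(g) {
    if (g$S != s) return(0)
    mean((g$Z == z) * g$Y) / pZ / pS      # yhat_i(z; alpha_s) / Pr[S_i = s]
  }, numeric(1))
  mean(per_group)                          # k^{-1} sum over groups, equation (3)
}

ybar_hat_s <- function(dat, s, pS1) {      # equation (4)
  pS <- if (s == 1) pS1 else 1 - pS1
  mean(vapply(dat, function(g) if (g$S == s) mean(g$Y) / pS else 0, numeric(1)))
}

# Effect estimates follow as contrasts, e.g. the direct effect under alpha_0:
# ybar_hat_zs(dat, 0, 0, pS1, pZ1) - ybar_hat_zs(dat, 1, 0, pS1, pZ1)
```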
TV proposed exact confidence intervals based on the Hoeffding inequality for the effects of interest in a two-stage randomized experiment where partial interference is assumed. In particular, for any γ ∈ (0, 1), D̂E(αs) ± ε is a 1 − γ exact confidence interval for DE(αs), where ε is given in equation (17) of TV for s = 0, 1. Additionally, ÎE(α0, α1) ± ε*, T̂E(α0, α1) ± ε*, and ÔE(α0, α1) ± ε* are all 1 − γ exact confidence intervals for their target parameters, where ε* = ε*(γ, α0, q0, α1, q1, k) is given in Theorem 3 of TV.
Liu and Hudgens (2014) examined conditions under which Wald-type intervals D̂E(αs) ± z(1−γ/2) √V̂s and Chebyshev-type intervals D̂E(αs) ± √(V̂s/γ) are valid large-sample confidence intervals for DE(αs), where z(1−γ/2) is the 1 − γ/2 quantile of the standard normal distribution and V̂s is an estimator of the variance of D̂E(αs) for s = 0, 1. They also considered Wald- and Chebyshev-type confidence intervals for the indirect, total, and overall effects.
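For reference, a minimal sketch of the two interval forms, assuming the point estimate and a variance estimate are supplied by the analyst (names are ours):

```r
# Wald- and Chebyshev-type 1 - gamma intervals for a generic effect estimate
# de_hat with variance estimate v_hat (both assumed supplied).
wald_ci <- function(de_hat, v_hat, gamma = 0.05) {
  de_hat + c(-1, 1) * qnorm(1 - gamma / 2) * sqrt(v_hat)
}
cheby_ci <- function(de_hat, v_hat, gamma = 0.05) {
  de_hat + c(-1, 1) * sqrt(v_hat / gamma)   # Chebyshev multiplier gamma^{-1/2}
}
```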
3. Bounds Under Stratified Interference
Exact randomization-based inference about the four effects is challenging without further assumptions, as the experiment reveals only N of the total potential outcomes (under partial interference each individual in group i has 2^{ni} potential outcomes, only one of which is observed). One such additional assumption is stratified interference (Hudgens and Halloran, 2008), which assumes that the potential outcome of individual j in group i under a given treatment assignment (control or treatment) depends on the treatment assignments of the other individuals in group i only through the number of them assigned treatment, i.e.,
yij(zi) = yij(zi′) for all treatment vectors zi, zi′ such that zij = zij′ and Σl≠j zil = Σl≠j zil′.   (5)
Under (5), individual j in group i has only four potential outcomes, which we denote by yij(z; αs) for z, s = 0, 1, so that the experiment reveals the observed outcome Yij = Σz,s=0,1 1{Zij = z, Si = s} yij(z; αs) for each individual and thus N of the 4N total potential outcomes. Furthermore, (5) implies that ȳij(z; αs) = yij(z; αs) and that ȳij(αs) = Σz=0,1 yij(z; αs) Pr[Zij = z | Si = s], where Pr[Zij = z | Si = s] is known by design.
Under (5), the observed data yield bounded sets for all four effects, each a subset of the interval [−1, 1] with width less than two, where here and in the sequel the width of a set is defined as the difference between its maximum and minimum values. Consider DE(α0) for illustration. For the Σi Σj (1 − Si)(1 − Zij) individuals with Si = Zij = 0, yij(0; α0) is revealed; however, for the N − Σi Σj (1 − Si)(1 − Zij) individuals with Si = 1 or Zij = 1, yij(0; α0) is missing and known only to be 0 or 1. Let y⃗(z; αs) be the N-dimensional vector of potential outcomes for treatment z under strategy αs. Under (5), a lower bound for DE(α0) is found by filling in all missing potential outcomes in y⃗(0; α0) as 0 and all missing potential outcomes in y⃗(1; α0) as 1; an upper bound is found by filling in all missing potential outcomes in y⃗(0; α0) as 1 and all missing potential outcomes in y⃗(1; α0) as 0. Simple algebra shows that the width of the bounded set for DE(α0) equals 2 − (k − h)/k, which approaches 1 as (k − h)/k → 1, i.e., as more groups are randomized to α0.
Similar logic leads to bounds for the other effects. The width of the bounded set for DE(α1) equals 2 − h/k, which approaches 1 as h/k → 1. The width of the bounded set for IE(α0, α1) equals 2 − k−1 Σi ni−1 Σj (1 − Zij), which approaches 1 as the proportion of individuals assigned Zij = 0 approaches 1. The width of the bounded set for TE(α0, α1) equals 2 − k−1 Σi ni−1 Σj {(1 − Si)(1 − Zij) + Si Zij}, which approaches 1 as the proportion of individuals with Si = Zij = 0 or Si = Zij = 1 approaches 1. Lower and upper bounds for OE(α0, α1) can be derived similarly, but the corresponding width does not have a simple closed form.
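The bound construction for DE(α0) translates into a few lines of R; S, Z, Y, and the group index grp below are hypothetical individual-level observed data, with S repeated within each group.

```r
# Sketch: bounds for DE(alpha_0) under stratified interference, obtained by
# imputing the missing potential outcomes with 0s or 1s. S, Z, Y, grp are
# individual-level vectors of length N (hypothetical observed data).
de_a0_bounds <- function(S, Z, Y, grp) {
  y0 <- ifelse(S == 0 & Z == 0, Y, NA)   # y(0; alpha_0) revealed only if S = 0, Z = 0
  y1 <- ifelse(S == 0 & Z == 1, Y, NA)   # y(1; alpha_0) revealed only if S = 0, Z = 1
  pop_mean <- function(y, fill) {
    y[is.na(y)] <- fill                  # fill in the missing potential outcomes
    mean(tapply(y, grp, mean))           # k^{-1} sum_i n_i^{-1} sum_j
  }
  c(lower = pop_mean(y0, 0) - pop_mean(y1, 1),
    upper = pop_mean(y0, 1) - pop_mean(y1, 0))
}
```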
4. EIT Confidence Intervals
In addition to leading to unbiased estimators and bounds, the observed data can be used to form 1 − γ confidence sets for the four effects. The confidence sets are formed by inverting hypothesis tests about the potential outcomes that define the effect of interest. This section is divided into two parts: §4.1 outlines how the confidence sets are formed and §4.2 presents a computationally feasible algorithm for constructing an interval that contains the exact confidence set. Henceforth this interval is referred to as the exact inverted test (EIT) interval.
4.1. An Exact Confidence Set
The methods to follow can be generalized to any effect, so consider DE(α0). Inference about DE(α0) concerns the vectors y⃗(0; α0) and y⃗(1; α0), which are partially revealed by the experiment. A hypothesis about these vectors is considered sharp if it completely fills in the potential outcomes not revealed by the experiment. A sharp null H0 : y⃗(0; α0) = y⃗0(0; α0), y⃗(1; α0) = y⃗0(1; α0) maps to a value of DE(α0), which we denote DE0(α0). Only sharp null hypotheses that are compatible with the observed data need to be tested as other sharp nulls can be rejected with zero probability of making a type I error. Thus for each sharp null to be tested, the implied null value DE0(α0) will be a member of the bounded set derived in §3. There are B1 = 2^{Σi(1−Si)ni} · 4^{Σi Si ni} sharp null hypotheses to test, as individuals with Si = 0 have only one missing potential outcome with two possible values {0, 1}, and individuals with Si = 1 have two missing potential outcomes with four possible values {0, 1} × {0, 1}.
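The count B1 is easy to compute from the group sizes and first-stage assignments; a one-line sketch (function name is ours):

```r
# Number of compatible sharp nulls for DE(alpha_0): one missing potential
# outcome per individual in groups with S_i = 0, two per individual in
# groups with S_i = 1.
B1 <- function(S, n) 2^sum((1 - S) * n) * 4^sum(S * n)
B1(S = 0, n = 20)   # a single group of 20 assigned alpha_0: 2^20 = 1,048,576
```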
After filling in the missing potential outcomes under H0, the null distribution of the test statistic D̂E(α0) can be found by computing the statistic, denoted D̂E(α0)(c), for each of the c = 1, …, C1 possible experiments under H0, where C1 is the total number of possible treatment assignments under the two-stage design, i.e., all vectors S = (S1, …, Sk) with Σi Si = h combined with all compatible within-group assignment vectors zi ∈ 𝒵i. A two-sided p-value to test H0 is given by p0 = C1−1 Σc 1{|D̂E(α0)(c) − DE0(α0)| ≥ |D̂E(α0)(obs) − DE0(α0)|}, where D̂E(α0)(obs) denotes the statistic computed from the observed treatment assignment. If p0 < γ, H0 is rejected. Note p0 is a function of the null hypothesis vectors y⃗0(0; α0) and y⃗0(1; α0). Let p(DE0(α0)) denote the set of all p0 which are functions of compatible vectors y⃗0(0; α0) and y⃗0(1; α0) that map to DE0(α0). A 1 − γ confidence set for DE(α0) is {DE0(α0) : max{p(DE0(α0))} ≥ γ}. P-values, and thus confidence sets, can be found in an analogous manner for the other effects.
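To make the test concrete, the sketch below computes the two-sided p-value just described for one sharp null in the single-group case, approximating the full enumeration over the C1 experiments with a Monte Carlo sample of randomizations; the sampling shortcut and the function names are ours, not a verbatim transcription of the paper's computation.

```r
# Sketch: two-sided randomization p-value for a sharp null in a single group
# (k = 1) of size n1 with m treated. y0_null, y1_null are the hypothesized
# complete vectors y(0; alpha_0) and y(1; alpha_0); Z_obs is the observed
# treatment vector. A Monte Carlo sample of randomizations replaces the
# full set of C_1 enumerated experiments.
pval_sharp_null <- function(y0_null, y1_null, Z_obs, n_sim = 10000) {
  n1 <- length(Z_obs); m <- sum(Z_obs)
  de_hat  <- function(Z) mean(y0_null[Z == 0]) - mean(y1_null[Z == 1])
  de_null <- mean(y0_null) - mean(y1_null)            # DE_0(alpha_0) under H_0
  t_obs   <- abs(de_hat(Z_obs) - de_null)
  t_sim   <- replicate(n_sim, {
    Z <- sample(rep(c(1, 0), times = c(m, n1 - m)))   # re-randomize within the group
    abs(de_hat(Z) - de_null)
  })
  mean(t_sim >= t_obs)                                # two-sided p-value
}
```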
4.2. A Computationally Feasible Algorithm
Finding the exact confidence set for DE(α0) described above entails testing B1 hypotheses, where each hypothesis test involves C1 randomizations. As N becomes large, the computational time necessary to perform B1 × C1 operations grows exponentially. To illustrate the problem, consider two examples in which a single group (k = 1) is assigned α0 and 10 of n1 = 20 individuals are randomized to treatment, such that B1 = 2^{20} = 1,048,576 and C1 = 184,756 (the number of ways to choose the 10 treated individuals from the 20). Suppose there are two cases of observed data: (a) 5 of 10 untreated individuals experienced an event and 5 of 10 treated individuals experienced an event, and (b) 8 of 10 untreated individuals experienced an event and 2 of 10 treated individuals experienced an event. Figure 1 displays a plot of DE0(α0) versus p(DE0(α0)) for both examples. The bounded set and 95% exact confidence set for DE(α0) are, respectively, {−0.5, −0.45, …, 0.45, 0.5} and {−0.35, −0.3, …, 0.3, 0.35} in (a), and {−0.2, −0.15, …, 0.75, 0.8} and {0.15, 0.2, …, 0.75, 0.8} in (b).
A computationally feasible algorithm is given below for approximating the confidence sets. The algorithm entails testing a targeted random sample of B2 of the B1 total sharp null hypotheses, and computing p-values for each sampled sharp null based on a random sample of C2 of the C1 possible randomizations. The set of computed p-values is then used to approximate the confidence set endpoints using local linear interpolation. For intuition underlying the interpolation step, consider the piecewise linear function that connects the maximum p-values for each compatible value of DE(α0) in Figure 1. Finding the x-coordinates for the intersection points of this function and a horizontal line at γ will conservatively approximate the lower and upper 1 − γ confidence limits for DE(α0). This suggests the following targeted, local linear interpolation algorithm for estimating the lower limit of the confidence set for DE(α0). An analogous algorithm can be used to target the upper limit of the confidence set for DE(α0).
Let DE(α0)l denote the lower bound for DE(α0) derived in §3, and let ŷ(z; α0)l and ŷ(z; α0)u denote the lower and upper bounds, respectively, for ȳ(z; α0).
Step 1. Test the unique sharp null about y⃗(0; α0) and y⃗(1; α0) that maps to DE(α0)l, i.e., the null that fills in all missing potential outcomes in y⃗(0; α0) with 0 and all missing potential outcomes in y⃗(1; α0) with 1. If the corresponding p-value p0 ≥ γ, let DE(α0)l be the lower limit of the confidence set and do not proceed. Otherwise, let l = DE(α0)l and let pl = 1 − 1/B2. Let 𝒟 = {DE(α0)l} denote the set of tested null values and 𝒫 = {p0} the set of corresponding p-values.
Step 2. Fill in the missingness in y⃗(0; α0) with samples from a Bernoulli distribution with mean f(qpl(ŷ(0; α0)u, ŷ(0; α0)l)) and fill in the missingness in y⃗(1; α0) with samples from a Bernoulli distribution with mean f(qpl(ŷ(1; α0)l, ŷ(1; α0)u)), where qp(a, b) = (1 − p)a + pb, and f(x) = x if 0 ≤ x ≤ 1, f(x) = 0 if x < 0, and f(x) = 1 if x > 1. With pl near 1, the sampled sharp nulls concentrate near DE(α0)l.
Step 3. If the sampled sharp null maps to a value DE0(α0) not already in 𝒟, compute its p-value p0, add DE0(α0) to the set 𝒟, add the corresponding p0 to 𝒫, and if p0 ≥ γ then update l to equal DE0(α0). Otherwise, do not compute a p-value corresponding to the sampled sharp null and let pl = pl − 1/B2.
Step 4. Repeat Steps 2 and 3 B2/2 − 1 times.
Step 5. Let t be the function from 𝒫 to 𝒟 that maps each p-value p0 in 𝒫 to the null value DE0(α0) in 𝒟 corresponding to the sharp null hypothesis which generated p0. Let ℛ = {max{p ∈ 𝒫 : t(p) = d} : d ∈ 𝒟}. Let r1 = min{r ∈ ℛ : r ≥ γ} and let r2 = max{r ∈ ℛ : r < γ}. Let li = t(ri) for i = 1, 2. The lower limit l* of the confidence set is found by local linear interpolation, i.e., by finding the x-coordinate of the point at which the line drawn from (l2, r2) to (l1, r1) intersects a horizontal line at γ: l* = l2 + (γ − r2)(l2 − l1)/(r2 − r1). The upper limit u* is found analogously. As B2 → B1 and C2 → C1, the interval [l*, u*] will contain the exact confidence set described in §4.1 with probability approaching 1.
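The purely arithmetic pieces of the algorithm translate directly into R; the sketch below gives the weight qp(a, b), the truncation f, and the Step 5 interpolation of the lower limit (function names are ours).

```r
# Helper functions used by the targeted sampling algorithm.
q_p <- function(p, a, b) (1 - p) * a + p * b    # q_p(a, b) = (1 - p)a + pb
f   <- function(x) pmin(pmax(x, 0), 1)          # truncate to [0, 1]

# Step 5: local linear interpolation of the lower limit from the two
# bracketing points (l2, r2) and (l1, r1), where r2 < gamma <= r1.
lower_limit <- function(l1, r1, l2, r2, gamma = 0.05) {
  l2 + (gamma - r2) * (l2 - l1) / (r2 - r1)
}
```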
The algorithms for approximating confidence sets for IE(α0, α1) and TE(α0, α1) are analogous. For OE(α0, α1) the algorithm is modified slightly as it involves all four vectors y⃗(z; αs), z, s = 0, 1. Let ŷ(αs)l and ŷ(αs)u be the lower and upper limits, respectively, for ȳ(αs) under (5). If p0 ≥ γ for the sharp null mapping to OE(α0, α1)l, then OE(α0, α1)l is the lower limit of the confidence set. If p0 < γ, set l = OE(α0, α1)l and fill in the missingness in y⃗(0; α0) and y⃗(1; α0) with samples from a Bernoulli distribution with mean f(qpl(ŷ(α0)u, ŷ(α0)l)), where pl = 1 − 1/B2, with the missingness in y⃗(0; α1) and y⃗(1; α1) filled in analogously. A p-value is computed if the sampled sharp null maps to a value OE0(α0, α1) not already tested, and if not pl = pl − 1/B2; if p0 ≥ γ for a sampled null, l is updated to equal OE0(α0, α1). The upper endpoint can be approximated using an analogous approach.
The R package interferenceCI is available on CRAN (Rigdon, 2015) for computing EIT confidence intervals via this algorithm for the four effects assuming stratified interference when the outcome is binary. The Wald, Chebyshev, and TV intervals are also computed in the package.
5. Comparisons Via Simulation
A simulation study was carried out to compare the asymptotic, TV, and EIT confidence intervals. The simulation proceeded as follows for fixed values of α0, α1, DE(α0), DE(α1), IE(α0, α1), k, and ni = n for i = 1, …, k such that N = kn (an R sketch of steps 0 and 1 appears after the list):
Step 0. Potential outcomes were generated by first fixing the vectors y⃗(z; αs) for z, s = 0, 1 to be length-N vectors of all 0s. Group membership was assigned by letting elements n(i − 1) + 1, …, ni of each vector (that is, the ith block of n consecutive elements) belong to group i, for i = 1, …, k. Then, N(0.5 + DE(α0)/2) elements in y⃗(0; α0) were randomly set to equal 1 and N(0.5 − DE(α0)/2) elements in y⃗(1; α0) were randomly set to equal 1. Then, N(0.5 + DE(α0)/2 − IE(α0, α1)) elements in y⃗(0; α1) were randomly set to equal 1. Finally, N(0.5 + DE(α0)/2 − IE(α0, α1) − DE(α1)) elements in y⃗(1; α1) were randomly set to equal 1.

Step 1. Observed data were generated by (i) randomly assigning h of the k groups to strategy α1 and (ii) randomly assigning the number of individuals per group to treatment specified by the group's strategy αs, s = 0, 1. Observed outcomes then followed from these treatment assignments and the potential outcomes from step 0.

Step 2. For each effect, 95% confidence intervals were computed using the observed data generated in step 1.

Step 3. Steps 1–2 were repeated 1000 times.
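The following R sketch implements steps 0 and 1 for a single replicate; the within-group allocation proportions p0 and p1 are placeholders rather than the values used in the paper, and counts are rounded when the stated expressions are not integers.

```r
# Sketch of simulation steps 0-1 for one replicate (placeholders noted below).
set.seed(2)
k <- 10; n <- 10; N <- k * n; h <- k / 2
DE_a0 <- 0.95; DE_a1 <- 0.3; IE <- 0.5
grp <- rep(seq_len(k), each = n)                  # group membership

# Step 0: fix the potential outcome vectors, then randomly place the 1s
place_ones <- function(n_ones) sample(c(rep(1, round(n_ones)), rep(0, N - round(n_ones))))
y0_a0 <- place_ones(N * (0.5 + DE_a0 / 2))        # y(0; alpha_0)
y1_a0 <- place_ones(N * (0.5 - DE_a0 / 2))        # y(1; alpha_0)
y0_a1 <- place_ones(N * (0.5 + DE_a0 / 2 - IE))   # y(0; alpha_1)
y1_a1 <- place_ones(N * (0.5 + DE_a0 / 2 - IE - DE_a1))  # y(1; alpha_1)

# Step 1: two-stage randomization and observed data
p0 <- 1/3; p1 <- 2/3                              # placeholder allocation proportions
S_grp <- sample(rep(c(1, 0), times = c(h, k - h)))
S <- S_grp[grp]
m <- ifelse(S_grp == 1, round(p1 * n), round(p0 * n))
Z <- unlist(lapply(seq_len(k), function(i) sample(rep(c(1, 0), times = c(m[i], n - m[i])))))
Y <- ifelse(S == 0, ifelse(Z == 0, y0_a0, y1_a0), ifelse(Z == 0, y0_a1, y1_a1))
```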
In the simulation we let k = n = 10 or k = n = 20 with h = k/2, fixed treatment allocations under α0 and α1, DE(α0) = 0.95, DE(α1) = 0.3, and IE(α0, α1) = 0.5 (such that TE(α0, α1) = 0.8 and OE(α0, α1) = 0.395). In the targeted sampling algorithm, B2 = C2 = 100, such that B2/B1 and C2/C1 were less than 10^{−20} for all effects. Table 1 displays average widths and coverages for Wald, EIT, Chebyshev, and TV. Wald and Chebyshev fail to achieve nominal coverage for DE(α0) when k = n = 10, and Wald additionally fails to cover for DE(α0) when k = n = 20 and for IE(α0, α1) and TE(α0, α1) when k = n = 10. As guaranteed by their respective constructions, EIT and TV achieve nominal coverage for all setups; however, EIT has narrower width than TV in all setups. In fact, EIT is an order of magnitude narrower than TV in three instances: DE(α0), TE(α0, α1), and OE(α0, α1) when k = n = 20.
Table 1. Average width [empirical coverage] of 95% confidence intervals for each effect. Methods: W, Wald; EIT, exact inverted test; C, Chebyshev; TV, Tchetgen Tchetgen and VanderWeele.
| Method | n | k | DE(α0) | DE(α1) | IE(α0, α1) | TE(α0, α1) | OE(α0, α1) |
|---|---|---|---|---|---|---|---|
| W | 10 | 10 | 0.13 [0.84] | 0.51 [0.96] | 0.39 [0.93] | 0.30 [0.93] | 0.24 [0.94] |
| W | 20 | 20 | 0.09 [0.89] | 0.26 [0.96] | 0.21 [0.95] | 0.14 [0.98] | 0.11 [0.97] |
| EIT | 10 | 10 | 0.28 [0.98] | 0.52 [0.98] | 0.47 [0.99] | 0.31 [0.98] | 0.36 [1.00] |
| EIT | 20 | 20 | 0.12 [0.98] | 0.27 [0.98] | 0.24 [0.98] | 0.14 [0.98] | 0.18 [1.00] |
| C | 10 | 10 | 0.22 [0.84] | 1.15 [1.00] | 0.84 [1.00] | 0.54 [1.00] | 0.54 [1.00] |
| C | 20 | 20 | 0.15 [0.99] | 0.59 [1.00] | 0.49 [1.00] | 0.32 [1.00] | 0.26 [1.00] |
| TV | 10 | 10 | 1.95 [1.00] | 2.00 [1.00] | 2.00 [1.00] | 2.00 [1.00] | 2.00 [1.00] |
| TV | 20 | 20 | 1.41 [1.00] | 2.00 [1.00] | 1.86 [1.00] | 1.56 [1.00] | 1.96 [1.00] |
6. Discussion
In this paper new exact confidence intervals have been proposed for causal effects in the presence of partial interference. The new intervals are constructed by inverting permutation-based hypothesis tests. These intervals do not rely on any parametric assumptions and require no assumptions about random sampling from a larger population. The confidence intervals are exact in the sense that the probability of containing the true treatment effects is at least the nominal level. As there may be many vectors of potential outcomes that map to one value of the causal estimand, a computationally feasible algorithm was proposed in §4.2 to approximate the exact confidence intervals. Empirical studies demonstrate the new exact intervals have narrower width than previously proposed exact intervals based on the Hoeffding inequality. Nonetheless, the empirical coverage of the proposed intervals still tends to exceed the nominal level, suggesting one possible future avenue of research would be to develop alternative intervals which are less conservative and narrower but maintain nominal coverage.
Acknowledgments
This research was supported by the National Institutes of Health. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH. The authors thank Mark Weaver for helpful comments.
Contributor Information
Joseph Rigdon, Email: jrigdon@stanford.edu, Quantitative Sciences Unit, Stanford University, Palo Alto, California 94304, U.S.A.
Michael G. Hudgens, Email: mhudgens@bios.unc.edu, Department of Biostatistics, CB 7420, University of North Carolina, Chapel Hill, North Carolina 27516, U.S.A
References
- Baird S, Garfein R, McIntosh C, Özler B. Effect of a cash transfer programme for schooling on prevalence of HIV and herpes simplex type 2 in Malawi: a cluster randomised trial. The Lancet. 2012;379:1320–1329. doi: 10.1016/S0140-6736(11)61709-1.
- Borm G, Melis R, Teerenstra S, Peer P. Pseudo cluster randomization: a treatment allocation method to minimize contamination and selection bias. Statistics in Medicine. 2005;24:3535–3547. doi: 10.1002/sim.2200.
- Cox D. Planning of Experiments. Wiley; New York, NY: 1958.
- Duflo E, Saez E. The role of information and social interactions in retirement plan decisions: Evidence from a randomized experiment. The Quarterly Journal of Economics. 2003;118:815–842.
- Halloran M, Haber M, Longini I, Struchiner C. Direct and indirect effects in vaccine efficacy and effectiveness. American Journal of Epidemiology. 1991;133:323–331. doi: 10.1093/oxfordjournals.aje.a115884.
- Hong G, Raudenbush S. Evaluating kindergarten retention policy: A case study of causal inference for multi-level observational data. Journal of the American Statistical Association. 2006;101:901–910.
- Hudgens M, Halloran M. Toward causal inference with interference. Journal of the American Statistical Association. 2008;103:832–842. doi: 10.1198/016214508000000292.
- Ichino N, Schündeln M. Deterring or displacing electoral irregularities? Spillover effects of observers in a randomized field experiment in Ghana. The Journal of Politics. 2012;74:292–307.
- Liu L, Hudgens M. Large sample randomization inference of causal effects in the presence of interference. Journal of the American Statistical Association. 2014;109:288–301. doi: 10.1080/01621459.2013.844698.
- Moulton L, O’Brien K, Kohberger R, Chang I, Reid R, Weatherholtz R, Hackell J, Siber G, Santosham M. Design of a group-randomized Streptococcus pneumoniae vaccine trial. Controlled Clinical Trials. 2001;22:438–452. doi: 10.1016/s0197-2456(01)00132-5.
- Rigdon J. interferenceCI: Exact Confidence Intervals in the Presence of Interference. R package version 1.1. 2015. doi: 10.1016/j.spl.2015.06.011. http://CRAN.R-project.org/package=interferenceCI.
- Rosenbaum P. Interference between units in randomized experiments. Journal of the American Statistical Association. 2007;102:191–200.
- Sinclair B, McConnell M, Green D. Detecting spillover effects: Design and analysis of multilevel experiments. American Journal of Political Science. 2012;56:1055–1069.
- Sobel M. What do randomized studies of housing mobility demonstrate? Causal inference in the face of interference. Journal of the American Statistical Association. 2006;101:1398–1407.
- Tchetgen Tchetgen E, VanderWeele T. On causal inference in the presence of interference. Statistical Methods in Medical Research. 2012;21:55–75. doi: 10.1177/0962280210386779.