Skip to main content
Oxford University Press logoLink to Oxford University Press
. 2016 Dec 8;103(4):829–842. doi: 10.1093/biomet/asw047

On inverse probability-weighted estimators in the presence of interference

L Liu 1,, M G Hudgens 2, S Becker-Dreps 3
PMCID: PMC5793685  PMID: 29422692

Abstract

We consider inference about the causal effect of a treatment or exposure in the presence of interference, i.e., when one individual’s treatment affects the outcome of another individual. In the observational setting where the treatment assignment mechanism is not known, inverse probability-weighted estimators have been proposed when individuals can be partitioned into groups such that there is no interference between individuals in different groups. Unfortunately this assumption, which is sometimes referred to as partial interference, may not hold, and moreover existing weighted estimators may have large variances. In this paper we consider weighted estimators that could be employed when interference is present. We first propose a generalized inverse probability-weighted estimator and two Hájek-type stabilized weighted estimators that allow any form of interference. We derive their asymptotic distributions and propose consistent variance estimators assuming partial interference. Empirical results show that one of the Hájek estimators can have substantially smaller finite-sample variance than the other estimators. The different estimators are illustrated using data on the effects of rotavirus vaccination in Nicaragua.

Keywords: Causal inference, Interference, Inverse probability-weighted estimator, Observational study

1. Introduction

In causal inference it is often assumed that there is no interference between individuals, i.e., that the treatment of one individual does not affect the outcome of another. However, this assumption may not hold. For instance, in infectious disease studies, the vaccination status of one individual may affect whether another individual becomes infected (Halloran & Struchiner, 1995). Similarly, encouraging one individual to vote may increase the likelihood that another individual in the same household will vote (Nickerson, 2008). Interference may also occur between students in the same classroom (Hong & Raudenbush, 2006) or between households in the same neighbourhood (Sobel, 2006), and in myriad other contexts (Rosenbaum, 2007; Luo et al., 2012; Manski, 2013).

Inference in the presence of interference is interesting, because a treatment may have multiple types of effects, but difficult, because individuals may have many potential outcomes. Recently, methods have been developed for the setting where individuals can be partitioned into groups such that there may be interference between individuals in the same group but not between individuals in different groups; this is sometimes called partial interference (Sobel, 2006). Assuming partial interference, Hudgens & Halloran (2008) defined the direct, indirect, total and overall causal effects of a treatment in randomized studies. Inference about these types of causal effects has subsequently been considered by VanderWeele & Tchetgen Tchetgen (2011), VanderWeele et al. (2012), Halloran & Hudgens (2012), Liu & Hudgens (2014) and P. M. Aronow and C. Samii in an unpublished 2013 paper (arXiv:1305.6156), among others. For observational settings where the treatment assignment mechanism is not known, Tchetgen Tchetgen & VanderWeele (2012) proposed inverse probability-weighted estimators of these causal effects based on group-level propensity scores. These weighted estimators can be viewed as a generalization of the usual inverse probability-weighted estimator of the causal effect of a treatment in the absence of interference. However, in general, weighted estimators are known to have relatively large variance. Additionally, in some settings the partial interference assumption may be dubious. In this article we consider alternative weighted-type estimators that allow for general forms of interference and tend to be less variable.

2. Preliminaries

Consider a finite population of Inline graphic individuals, and suppose that each individual may receive some treatment or exposure. Let Inline graphic (Inline graphic) be the random variable such that Inline graphic if individual Inline graphic received treatment and Inline graphic otherwise. Suppose that interference may be present between the Inline graphic individuals, and define the interference set Inline graphic for individual Inline graphic to be an ordered set of all other individuals whose treatment received might affect the outcome of individual Inline graphic. Assume that there is no interference between individual Inline graphic and individuals not in Inline graphic. There may or may not be interference between individual Inline graphic and individuals in Inline graphic. A central goal of the inferential methods described below is to quantify the extent to which such interference is present. Let Inline graphic denote the vector of treatment indicators for individuals that possibly interfere with individual Inline graphic; that is, the outcome of individual Inline graphic is allowed to depend not only on Inline graphic but also on Inline graphic. For example, if the outcome of individual 1 possibly depends on their own treatment status as well as that of individuals 2 and 3 but not on that of individuals Inline graphic, then Inline graphic and Inline graphic. The interference sets Inline graphic are assumed to be known a priori. Denote possible values of Inline graphic and Inline graphic by Inline graphic and Inline graphic. Let Inline graphic denote the potential outcome of individual Inline graphic if they receive treatment Inline graphic and their interference set receives Inline graphic. This potential outcome notation is general enough to encompass any possible interference structure, of which partial interference is a special case. Let Inline graphic denote the observed outcome. The potential outcomes Inline graphic are assumed to be deterministic functions of Inline graphic and Inline graphic, and the observed outcome Inline graphic is considered to be random because it depends on the random variables Inline graphic and Inline graphic. Let Inline graphic be the sum over all the components of Inline graphic, and let Inline graphic denote the dimension of the vector Inline graphic. For example, if Inline graphic, then Inline graphic and Inline graphic.

In the absence of interference, a common causal estimand is the average treatment effect, which contrasts the average outcome for the counterfactual scenario where every individual in the population is treated with that of the counterfactual scenario where every individual in the population is not treated. Similarly, in the presence of interference, causal estimands can be defined in terms of counterfactual scenarios corresponding to different treatment allocation strategies (e.g., Hong & Raudenbush, 2006; Sobel, 2006; Hudgens & Halloran, 2008; Tchetgen Tchetgen & VanderWeele, 2012). For example, the indirect effect, defined formally below, contrasts average outcomes of untreated individuals for the counterfactual scenario where one allocation strategy is adopted in the population with those for the counterfactual scenario where some other allocation strategy is adopted in the population. Such estimands quantify interference, if present, at the population level and can be used to inform policy decisions regarding a treatment or exposure. The allocation strategy of interest will in general depend on the setting.

Here we consider Bernoulli allocation strategies proposed by Tchetgen Tchetgen & Vander-Weele (2012), where strategy Inline graphic corresponds to the counterfactual scenario in which individuals independently receive treatment with probability Inline graphic. It is not assumed that the observed treatment indicators Inline graphic are independent Bernoulli random variables; rather, the distribution of treatment under Bernoulli allocation is used below to define the counterfactual estimands of interest. By analogy, direct standardization of mortality rates could entail using the 2010 United States census age distribution, which may differ from the age distribution giving rise to the observed data. Corresponding to Bernoulli allocation, let Inline graphic denote the probability of the interference set for individual Inline graphic receiving treatment Inline graphic under allocation strategy Inline graphic. Let Inline graphic and Inline graphic denote, respectively, the probability of individual Inline graphic receiving treatment Inline graphic and the probability of individual Inline graphic together with their interference set receiving joint treatment Inline graphic under allocation strategy Inline graphic. Define Inline graphic to be the average potential outcome of individual Inline graphic under allocation strategy Inline graphic, where the summation is over all Inline graphic possible values of Inline graphic. Returning to the example where Inline graphic, the average potential outcome of individual 1 is a weighted average of potential outcomes under different combinations of treatment Inline graphic and Inline graphic, with the weights being the corresponding probabilities under Bernoulli allocation. Averaging over all individuals, define the population average potential outcome as Inline graphic. Similarly, define the marginal average potential outcome for individual Inline graphic under allocation strategy Inline graphic by Inline graphic and define the population marginal average potential outcome as Inline graphic.

Various causal effects can be defined by contrasts in the population average potential outcomes. In particular, define the direct effect of treatment under allocation strategy Inline graphic to be Inline graphic, where Inline graphic is some continuous contrast function. A commonly used contrast function is Inline graphic; in vaccine trials with a binary outcome it is typical to use Inline graphic. The direct effect compares the average potential outcomes when an individual receives treatment versus not under allocation strategy Inline graphic. For two allocation strategies Inline graphic and Inline graphic, let Inline graphic be the indirect or spillover effect, which contrasts average potential outcomes when individuals do not receive treatment under different allocation strategies. In the context of vaccines, the indirect effect is sometimes referred to as herd immunity and describes the effect of the proportion of individuals vaccinated, e.g., 30% versus 50%, on the average outcome among unvaccinated individuals. An indirect effect can also be defined for when individuals receive treatment, Inline graphic, but for simplicity we do not consider such indirect effects here. The total effect Inline graphic incorporates both direct and indirect effects, and reflects the difference between the average potential outcomes when individuals receive treatment under one allocation strategy versus when they go without treatment under another allocation strategy. Finally, define Inline graphic to be the overall effect, which describes the contrast in average outcomes under one allocation strategy relative to another.

3. Inverse probability-weighted and hájek-type estimators

In this section we propose inverse probability-weighted and Hájek-type estimators which allow for general interference; that is, no assumption is made regarding the structure or form of interference that might be present. When there is partial interference and the groups are of the same size, the inverse probability-weighted estimators defined below reduce to those proposed by Tchetgen Tchetgen & VanderWeele (2012). Aronow and Samii (arXiv:1305.6156) considered similar estimators in the setting where interference may be present, but where treatment is assigned randomly according to a known experimental design.

Let Inline graphic denote a vector of pretreatment covariates of individual Inline graphic, and let Inline graphic. Assume that conditional on covariates Inline graphic, the treatment allocation for individual Inline graphic is independent of all potential outcomes and other covariates; that is, Inline graphic. Likewise, assume Inline graphic. Define Inline graphic and Inline graphic to be the propensity scores of individual Inline graphic and of individual Inline graphic and their interference set, respectively. Assume that Inline graphic and Inline graphic for all Inline graphic, Inline graphic, Inline graphic and Inline graphic. Define the inverse probability-weighted estimator for treatment Inline graphic under allocation strategy Inline graphic to be

Y^ipw(z,α)=n1iyi(Zi,Si)1(Zi=z)π(Si;α)f(Zi,Sili,lχi)(z=0,1), (1)

and define the inverse probability-weighted marginal estimator under strategy Inline graphic to be

Y^ipw(α)=n1iyi(Zi,Si)π(Zi,Si;α)f(Zi,Sili,lχi), (2)

where Inline graphic means Inline graphic. If the propensity scores are known, then (1) and (2) are unbiased as stated in the following proposition.

Proposition 1.

If Inline graphic is known for all Inline graphic, thenInline graphicandInline graphic.

In the absence of interference, the Hájek (1971) estimator of the mean of a finite population replaces the denominator Inline graphic of the Horvitz & Thompson (1952) inverse probability-weighted estimator with the sum of the inverse of the sampling probabilities, which tends to reduce the variance relative to the Horvitz–Thompson estimator. Returning to the current context, let Inline graphic and note that Inline graphic even if interference is present. This suggests replacing Inline graphic in (1) with Inline graphic to obtain a stabilized Hájek-type estimator. Alternatively, notice that the weighted estimator (1) involves Inline graphic, which suggests replacing Inline graphic with the unbiased estimator Inline graphic instead. Therefore, we will consider two different Hájek-type estimators of the population average outcome for treatment Inline graphic and allocation strategy Inline graphic, defined by

Y^hhaj(z,α)=n^hz1iyi(Zi,Si)1(Zi=z)π(Si;α)f(Zi,Sili,lχi)(h=1,2).

Here and below we assume that there exists at least one Inline graphic such that Inline graphic for Inline graphic. Similarly, let Inline graphic and Inline graphic, and note that Inline graphicInline graphic, which suggests the following estimators of the population average marginal outcome for allocation strategy Inline graphic:

Y^hhaj(α)=n^h1iyi(Zi,Si)π(Zi,Si;α)f(Zi,Sili,lχi)(h=1,2).

Note that Inline graphic, Inline graphic and Inline graphic depend on Inline graphic, but we suppress this dependence for notational convenience. In what follows, Inline graphic and Inline graphic will be referred to as the Hájek 1 and Hájek 2 estimators.

An appealing property of Inline graphic and Inline graphic is the preservation of the bounds of the potential outcome Inline graphic. Specifically, suppose there exist constants Inline graphic and Inline graphic such that Inline graphicInline graphic; then Inline graphic and Inline graphic. For example, if Inline graphic is binary, then Inline graphic. In contrast, preservation of the bounds is not guaranteed for Inline graphic or Inline graphic.

Another attractive property of the Hájek 2 estimators is preservation of linear transformations of the outcome. In particular, suppose that the observed outcomes Inline graphic are transformed by the function Inline graphicInline graphic. Then Hájek 2 estimators computed using the transformed responses will equal Inline graphic and Inline graphic, where Inline graphic and Inline graphic are computed on the original, untransformed observed outcomes. In contrast, the inverse probability-weighted and Hájek 1 estimators have this property only when Inline graphic.

Define Inline graphic to be the inverse probability-weighted estimator of the direct effect. Define Inline graphic, Inline graphic and Inline graphic to be the weighted estimators of the indirect, total and overall effects. Hájek-type causal effect estimators are defined similarly. For example, define Hájek-type estimators of the direct effect by Inline graphicInline graphic. If the contrast function is Inline graphic, then by the property described in the preceding paragraph, the values of Hájek 2 causal effect estimators are invariant under location shift. This is not the case for the inverse probability-weighted and Hájek 1 causal effect estimators.

4. Asymptotic distributions

In this section the large-sample properties of the inverse probability-weighted and Hájek-type estimators are derived assuming partial interference. In particular, assume that individuals can be partitioned into groups such that there is no interference between individuals in different groups. Within groups no additional structure is assumed regarding interference, so there may be interference between any two individuals within a group. That is, we assume the following.

Assumption 1.

There exists a partition Inline graphic of Inline graphic such that Inline graphic (Inline graphic; Inline graphic).

Let Inline graphic denote the number of individuals in group Inline graphic. Let Inline graphic denote the observed outcome for individual Inline graphic in group Inline graphic, and write Inline graphic. Let Inline graphic and Inline graphic denote the observed covariates and treatment for individual Inline graphic in group Inline graphic, and define Inline graphic and Inline graphic analogously to Inline graphic. Assume that Inline graphic is one of the baseline covariates included in Inline graphic.

To derive the large-sample properties of the inverse probability-weighted and Hájek-type estimators, assume that the Inline graphic groups are a random sample from an infinite superpopulation of groups such that the observable random variables Inline graphicInline graphic are independent and identically distributed. Let Inline graphic denote the distribution function of Inline graphic.

Let Inline graphic denote the potential outcome for individual Inline graphic in group Inline graphic, where Inline graphic denotes treatment received by individual Inline graphic and Inline graphic denotes the vector of treatment indicators for all other individuals in group Inline graphic. Unlike in Inline graphic 2 and 3, here the potential outcomes are considered random variables because of the assumed random sampling of the Inline graphic groups from a superpopulation. Denote the observed outcome for individual Inline graphic by Inline graphic, where Inline graphic is the subvector of Inline graphic with Inline graphic removed. Note that Inline graphic is a function of Inline graphic, which for notational simplicity is left implicit. Assume conditional exchangeability, i.e., Inline graphic, where Inline graphic means that Inline graphic and Inline graphic are independent conditional on Inline graphic.

Under Assumption 1, the inverse probability-weighted estimator for treatment Inline graphic and strategy Inline graphic equals

Y^ipw(z,α)=n1v=1mi=1NvYvi1(Zvi=z)π(Svi;α)f(Z~vL~v),

which can be expressed as a solution for Inline graphic to the estimating equation Inline graphic, where

Gzα0(Y~v,Z~v,L~v;μ)=i=1Nv{Yvi1(Zvi=z)π(Svi;α)f(Z~vL~v)μ}.

Let Inline graphic be the solution to Inline graphic. It is straightforward to show that Inline graphic, where Inline graphic is the mean group size in the superpopulation and Inline graphic, with the summation being taken over all vectors Inline graphic. If Inline graphic where Inline graphic, i.e., if the average potential outcome within a group is independent of the number of individuals within the group, then Inline graphic. In other words, Inline graphic is the mean group average potential outcome in the superpopulation, analogous to Inline graphic defined in Inline graphic 2. Define the direct effect in the superpopulation by Inline graphic; the indirect, total and overall effects in the superpopulation can be defined analogously.

The Hájek-type estimators can also be expressed as solutions to estimation equations. Specifically, under Assumption 1,

Y^hhaj(z,α)=n^hz1v=1mi=1NvYvi1(Zvi=z)π(Svi;α)f(Z~vL~v)(h=1,2),

where now

n^1z=v=1mi=1Nv1(Zvi=z)f(ZviLvi),n^2z=v=1mi=1Nv1(Zvi=z)π(Svi;α)f(Z~vL~v).

It follows that Inline graphic solves Inline graphic, where

Gzα1(Y~v,Z~v,L~v;μ)=i=1Nv{Yvi1(Zvi=z)π(Svi;α)f(Z~vL~v)μ1(Zvi=z)f(ZviLvi)},Gzα2(Y~v,Z~v,L~v;μ)=i=1Nv{Yvi1(Zvi=z)π(Svi;α)f(Z~vL~v)μ1(Zvi=z)π(Svi;α)f(Z~vL~v)}.

It is straightforward to show that Inline graphic also satisfies Inline graphicInline graphic.

The asymptotic distributions of the inverse probability-weighted and Hájek-type estimators can be derived from standard estimating equation theory (Stefanski & Boos, 2002; Perez-Heydrich et al., 2014). For example, the proposition below establishes that the three direct effect estimators are asymptotically normal and gives closed-form expressions for the asymptotic variances when the propensity scores are known. The proposition entails the vector estimating equation Inline graphic, where Inline graphic.

Proposition 2.

Suppose that Assumption 1 holds, the propensity scores are known, and the regularity assumptions in the Appendix hold. ThenInline graphicconverges in distribution toInline graphicandInline graphicconverges in distribution toInline graphicInline graphicasInline graphic, where

ΣhD=τUh1Vh(Uh1)TτT

withInline graphic, Inline graphicandInline graphicInline graphic; hereInline graphic.

A comparison between Inline graphicInline graphic and Inline graphic explains why the Hájek-type estimators can vary less than the inverse probability-weighted estimator. For example, suppose that the contrast Inline graphic is the difference function. Denote Inline graphic by Inline graphic and note that Inline graphic and Inline graphicInline graphic, where Inline graphic with Inline graphic and Inline graphic. Thus, the Hájek estimators will have smaller asymptotic variance if and only if Inline graphic, and so are expected to be less variable when Inline graphic and Inline graphic are strongly correlated. In the extreme scenario of Inline graphicInline graphic, we have Inline graphic and Inline graphic but Inline graphic in general.

In observational studies, the mechanism by which individuals select treatment is in general not known, so that Inline graphic and Inline graphic must be estimated in order to construct inverse probability-weighted estimators. In practice, due to the curse of dimensionality, one might assume a parametric model for the propensity scores (Tchetgen Tchetgen & VanderWeele, 2012). Let Inline graphic denote the score function for the likelihood under the assumed propensity score model indexed by a finite-dimensional parameter vector Inline graphic, and let Inline graphic denote the true parameter value, which is the solution to Inline graphic. Now consider the vector estimating equation Inline graphicInline graphic where Inline graphic.

Proposition 3.

Suppose that Assumption 1 holds, the parametric propensity score model is correctly specified, and the regularity assumptions in the Appendix hold. ThenInline graphicconverges in distribution toInline graphicandInline graphicconverges in distribution toInline graphicInline graphicasInline graphic, where

ΣhD=τUh1Vh(Uh1)TτT

withInline graphic, Inline graphicandInline graphicInline graphic; hereInline graphic, Inline graphicdenotes theInline graphiczero vector, andInline graphicis the dimension ofInline graphic.

Proposition 3 establishes the asymptotic normality of Inline graphic, Inline graphic and Inline graphic when the propensity score is correctly modelled. The asymptotic variance can be estimated consistently using empirical sandwich estimators, i.e., by replacing Inline graphic and Inline graphic with their empirical counterparts (Stefanski & Boos, 2002). In the Appendix the asymptotic variance of Inline graphic when the propensity score is estimated is shown to be no greater than when the propensity score is known. This is analogous to the well-known result about weighted estimators in the absence of interference; that is, even if the propensity scores are known, it is more efficient to use estimates of the propensity scores when computing inverse probability-weighted estimators. This relationship between the asymptotic variances when the propensity scores are known and when they are unknown but correctly modelled also holds for the Hájek-type estimators. Asymptotic normality of the indirect, total and overall effect estimators can be derived similarly.

5. SIMULATION STUDY

A simulation study was conducted to investigate the bias, empirical standard error and average estimated standard error of the different estimators discussed in Inline graphic 4. In the simulations the inverse probability-weighted and Hájek-type effect estimators were computed using the true propensity score, an estimated propensity score based on a correct model, and an estimated propensity score based on a misspecified model. Simulations were conducted under partial interference, i.e., Assumption 1, for both continuous and binary outcomes. The simulation study for a continuous outcome was carried out in the steps described below.

Step 1.

A random sample of Inline graphic groups was created as follows. First, the group size Inline graphic was randomly sampled from Inline graphic with corresponding probabilities Inline graphic. For each individual in each group, Inline graphic was randomly sampled from Inline graphicInline graphic. Then the potential outcomes for individual Inline graphic in group Inline graphic were set to Inline graphic.

Step 2.

The covariate vectors Inline graphic were randomly sampled from Inline graphicInline graphic, where Inline graphic denotes the Inline graphic identity matrix.

Step 3.

Treatment variables Inline graphic were simulated from a Bernoulli distribution with mean Inline graphic, where the random effects Inline graphic were randomly sampled from Inline graphicInline graphic and Inline graphic.

Step 4.

A correctly specified logistic regression model Inline graphic and a misspecified logistic regression model Inline graphic, where Inline graphic, Inline graphic, Inline graphic and Inline graphic, were fitted to the simulated data.

Step 5.

The causal effect estimators and their corresponding variance estimators were calculated for Inline graphic and Inline graphic using the known propensity score, the estimated propensity score from the correctly specified mixed-effects model and the estimated propensity score from the misspecified mixed-effects model.

Step 6.

Steps 1–5 were repeated Inline graphic times, and the empirical bias, empirical standard error and average estimated standard error were calculated for the estimators in Step 5.

From the potential outcome model specified in Step 1 it follows that Inline graphic and Inline graphic where Inline graphic. Hence Inline graphic for any Inline graphic, Inline graphic, Inline graphic and Inline graphic. Simulation results for the direct effect estimators are given in Table 1. All three estimators are approximately unbiased when the propensity scores are known or correctly modelled, but are biased if the propensity scores are incorrectly modelled. For all three estimators the average estimated standard error is also relatively close to the empirical standard error when the propensity scores are known or correctly modelled. Note that Inline graphic has substantially smaller empirical standard error than Inline graphic and Inline graphic. For example, when Inline graphic and the propensity scores are known, the empirical standard errors of Inline graphic and Inline graphic are 1Inline graphic4 and 1Inline graphic5, whereas the empirical standard error of Inline graphic is only 0Inline graphic3. Similar results hold when the propensity scores are treated as unknown and either correctly or incorrectly modelled. The results in Table 1 demonstrate that, as well as having smaller empirical standard error, Inline graphic may be more robust than Inline graphic and Inline graphic with respect to misspecification of the propensity score model.

Table 1.

Empirical bias Inline graphic, empirical standard error, and average estimated standard error of the estimators of Inline graphic with a continuous outcome

  Inline graphic   Inline graphic   Inline graphic
Known Inline graphic Bias ESE ASE   Bias ESE ASE   Bias ESE ASE
Inline graphic 0Inline graphic4 1Inline graphic4 1Inline graphic4   0Inline graphic1 0Inline graphic7 0Inline graphic7   0Inline graphic2 1Inline graphic7 1Inline graphic7
Inline graphic 0Inline graphic6 1Inline graphic5 1Inline graphic5   0Inline graphic1 0Inline graphic6 0Inline graphic6   0Inline graphic5 1Inline graphic6 1Inline graphic6
Inline graphic 0Inline graphic1 0Inline graphic3 0Inline graphic3   0Inline graphic0 0Inline graphic2 0Inline graphic2   0Inline graphic0 0Inline graphic3 0Inline graphic3
Correct Inline graphic Bias ESE ASE   Bias ESE ASE   Bias ESE ASE
Inline graphic 4Inline graphic1 1Inline graphic5 1Inline graphic4   0Inline graphic7 0Inline graphic6 0Inline graphic6   7Inline graphic3 1Inline graphic3 1Inline graphic3
Inline graphic 3Inline graphic8 1Inline graphic5 1Inline graphic5   0Inline graphic5 0Inline graphic6 0Inline graphic8   7Inline graphic6 1Inline graphic3 1Inline graphic5
Inline graphic 0Inline graphic2 0Inline graphic3 0Inline graphic3   0Inline graphic8 0Inline graphic2 0Inline graphic2   0Inline graphic5 0Inline graphic3 0Inline graphic3
Mis Inline graphic Bias ESE ASE   Bias ESE ASE   Bias ESE ASE
Inline graphic 0Inline graphic4 1e1 1e1   20 1e3 1e3   10 2e2 1e2
Inline graphic 5Inline graphic2 2e0 1e1   10 1e3 1e3   30 3e2 2e2
Inline graphic 0Inline graphic3 0Inline graphic3 0Inline graphic3   0Inline graphic8 0Inline graphic3 0Inline graphic3   0Inline graphic3 0Inline graphic5 0Inline graphic5

ESE, empirical standard error; ASE, average estimated standard error; Known Inline graphic, true propensity score known; Correct Inline graphic, propensity score unknown but correctly modelled; Mis Inline graphic, propensity score incorrectly modelled.

The simulation study described above was repeated for a binary outcome. Specifically, Step 1 was replaced with the following, while all other steps remained the same.

Step 1.

A random sample of Inline graphic groups was created as follows. First, the group size Inline graphic was randomly sampled from Inline graphic with corresponding probabilities Inline graphic. Then the potential outcomes Inline graphic were set to 0 with probability 0Inline graphic2, 1 with probability 0Inline graphic2, and Inline graphicInline graphic with probability 0Inline graphic6.

For this potential outcome model, Inline graphic, Inline graphic and Inline graphic with Inline graphic. Simulation results for this scenario are given in Table 2. Similar to the continuous outcome simulations, the empirical standard error for Inline graphic is smaller than that for Inline graphic and Inline graphic in all three scenarios, and Inline graphic also tends to be more robust with respect to misspecification of the propensity score model than the other two estimators. Similar results, not shown here, were observed for the other causal effect estimators.

Table 2.

Empirical bias, empirical standard error, and average estimated standard error of the estimators of Inline graphic with a binary outcome; all values have been} multiplied by Inline graphic

  Inline graphic   Inline graphic   Inline graphic
Known Inline graphic Bias ESE ASE   Bias ESE ASE   Bias ESE ASE
Inline graphic 0Inline graphic2 9Inline graphic7 9Inline graphic7   0 4Inline graphic7 4Inline graphic8   0Inline graphic1 9Inline graphic3 9Inline graphic2
Inline graphic 0Inline graphic3 9Inline graphic6 9Inline graphic7   0 4Inline graphic6 4Inline graphic5   0Inline graphic1 8Inline graphic4 8Inline graphic4
Inline graphic 0Inline graphic1 7Inline graphic0 6Inline graphic9   0 3Inline graphic9 3Inline graphic9   0 5Inline graphic4 5Inline graphic3
Correct Inline graphic Bias ESE ASE   Bias ESE ASE   Bias ESE ASE
Inline graphic 1Inline graphic4 10Inline graphic2 9Inline graphic8   0Inline graphic1 4Inline graphic5 4Inline graphic3   3Inline graphic5 7Inline graphic4 7Inline graphic2
Inline graphic 1Inline graphic3 10Inline graphic1 9Inline graphic8   0Inline graphic2 4Inline graphic5 4Inline graphic5   3Inline graphic5 7Inline graphic3 7Inline graphic5
Inline graphic 0Inline graphic1 7Inline graphic3 7Inline graphic1   0Inline graphic5 3Inline graphic8 3Inline graphic6   0Inline graphic9 5Inline graphic2 5Inline graphic0
Mis Inline graphic Bias ESE ASE   Bias ESE ASE   Bias ESE ASE
Inline graphic 1 1e2 1e2   1 5e2 3e2   1e1 3e4 2e4
Inline graphic 1 4e1 1e2   10 5e2 3e2   2e1 3e4 2e4
Inline graphic 0Inline graphic1 7Inline graphic4 7Inline graphic3   1Inline graphic2 6Inline graphic8 6Inline graphic4   1 9Inline graphic7 9Inline graphic5

6. Rotavirus vaccine study in nicaragua

Rotavirus diarrhoea is a major health problem in Nicaragua (Espinoza et al., 1997). The pentavalent rotavirus vaccine was introduced in 2006. Nicaraguan infants are offered the vaccine at two, four and six months of age as part of the country’s Expanded Program on Immunization. In 2010, a study to assess the impact of the immunization programme was carried out in León, Nicaragua’s second largest city, with an estimated population in 2010 of close to 200 000. The Health and Demographic Surveillance Site-León was employed to obtain a simple random sample of households from 50 out of 208 randomly selected geographical clusters of equal size in León (Becker-Dreps et al., 2013). For simplicity, in the following analysis the cluster sampling used to obtain these data is ignored. There were 530 households in the study, and any child in a selected household under the age of five was eligible to participate. Information was collected about each household, including water source, sanitation system, maternal education level, and the dates of birth of study participants. Each individual in the study was visited fortnightly by a fieldworker for approximately one year. At each visit information about diarrhoea episodes in the past 14 days was recorded. The primary outcome Inline graphic was whether a child had at least one diarrhoea episode during the study.

For each child we assumed their interference set to be other children in the same household. A mixed-effects logistic regression model of the probability of having received all three scheduled doses was fitted conditional on the following baseline covariates: child’s age, categorized as 0–11 months, 12–23 months, or 24–59 months; mother’s education level, categorized as primary education only or at least some secondary education; dirt household floor or not; dry or wet season; household indoor toilet, latrine, or none; indoor municipal water supply or not; and breastfeeding or not. Likelihood ratio tests from the fitted logistic model indicated that the odds of having all three doses of vaccine was higher among children whose mothers were more educated, with Inline graphic.

Effect estimates and estimated standard errors are reported in Table 3 for the inverse probability-weighted and the two Hájek estimators for contrast function Inline graphic. The Hájek 2 estimates are closer to the null value of zero and, as expected, have 15–20% smaller estimated standard errors than the inverse probability-weighted and Hájek 1 estimates. The direct effect estimates indicate the expected difference in the proportions of children who will acquire rotavirus diarrhoea among vaccinated versus unvaccinated children for a fixed level of vaccine coverage Inline graphic. The estimated direct effects become closer to the null as Inline graphic increases, suggesting that the direct protective effect of vaccination decreases as additional children in the household are vaccinated. The indirect effect estimates approximate the expected difference in the proportions of unvaccinated children who will acquire diarrhoea when vaccine coverage is Inline graphic% versus 10%. The total effect estimates indicate the expected difference in the proportions of vaccinated children who will acquire diarrhoea when vaccine coverage is Inline graphic% compared with unvaccinated children when vaccine coverage is 10%. The overall effect estimates provide simple summary comparisons between any two allocation strategies; for example, according to the Hájek 2 estimates, 5Inline graphic1 fewer cases of diarrhoea per 100 individuals per year would be expected if on average 80% of children in a household were vaccinated than if on average only 10% of children were vaccinated.

Table 3.

Effect estimates for the rotavirus vaccine study; all values have been multiplied by Inline graphic

  Inline graphic Inline graphic Inline graphic Inline graphic
  Est SE   Est SE   Est SE   Est SE
Inline graphic Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic
Inline graphic Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic
Inline graphic Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic
Inline graphic Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic
Inline graphic Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic
Inline graphic Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic
Inline graphic Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic
Inline graphic Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic
Inline graphic Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic
Inline graphic Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic
Inline graphic Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic
Inline graphic Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic   Inline graphic Inline graphic

Est, point estimate; SE, estimated standard error.

7. Discussion

The inverse probability-weighted estimator and two Hájek-type estimators in Inline graphic 3 allow for any form of interference between individuals, with the former being unbiased in a finite-population model with known propensity scores. Assuming partial interference and random sampling of groups from a superpopulation, all three estimators are consistent and asymptotically normal when the propensity scores are known or correctly modelled. Empirical results demonstrate that the second Hájek estimator can have substantially smaller finite-sample variance than the other two estimators. One avenue of future research entails deriving the estimators’ large-sample properties without assuming partial interference. Another future direction might involve developing estimators which are robust with respect to misspecification of the propensity score model. Throughout this work conditional exchangeability is assumed, i.e., treatment is assumed to be independent of potential outcomes conditional on an observable set of covariates. In future work one could investigate relaxing this assumption, perhaps via sensitivity analysis or instrumental variable methods. Finally, the target parameters in this paper utilize the Bernoulli allocation strategy proposed by Tchetgen Tchetgen & VanderWeele (2012). These estimands consider the counterfactual scenario where individuals independently select treatment with equal probability. In scenarios where interference is present, it is unlikely that individual treatment selections would be independent. Therefore further interference-related research might target alternative parameters.

Acknowledgments

Acknowledgement

The authors were partially supported by the U.S. National Institutes of Health. The fieldwork was supported by the Thrasher Research Fund. The content of this paper is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. The authors thank M. Elizabeth Halloran, Joseph Rigdon, an associate editor, and a reviewer for helpful comments.

Appendix

Proof of Proposition 1

To show that Inline graphic is unbiased, observe that

E{Y^ipw(z,α)}=n1izi,siyi(zi,si)1(zi=z)π(si;α)f(zi,sili,lχi)f(zi,sili,lχi)=n1isiyi(z,si)π(si;α)=y¯(z,α).

That Inline graphic is unbiased can be proved similarly.

Proof of Propositions 2 and 3

To prove Proposition 2, assume that there exist constants Inline graphic and Inline graphic such that Inline graphic, Inline graphic, Inline graphic and Inline graphic with probability 1. Let Inline graphic and Inline graphic (Inline graphic). Let Inline graphic denote the vector estimating equation Inline graphic. Let Inline graphic and write Inline graphic for any vector Inline graphic of length Inline graphic.

First we show that the following four conditions hold for Inline graphic: (i) Inline graphic exists and is nonsingular; (ii) Inline graphic is twice continuously differentiable with respect to Inline graphic for every Inline graphic; (iii) Inline graphic for some integrable measurable function Inline graphic; and (iv) Inline graphic. It is straightforward to show that Inline graphic, where Inline graphic is the Inline graphic identity matrix, implying (i). Note that Inline graphic, so (ii) holds and (iii) is satisfied for the function Inline graphic. To show (iv), observe that

EG0(θ0)2=z=01E{i=1NvYvi1(Zvi=z)π(Svi;α)f(Z~vL~v)Nvμzα}2.

From the boundedness assumptions on Inline graphic, Inline graphic and Inline graphic, it follows that Inline graphic. Similar results can be established for Inline graphic.

Next, note that Inline graphic is a linear function of Inline graphic with slope Inline graphic. For Inline graphic, Inline graphic is also a linear function of Inline graphic with finite, nonzero slope, because by assumption Inline graphic and there exists at least one Inline graphic such that Inline graphic. Hence, the solution for Inline graphic to Inline graphic is unique for Inline graphic. Therefore, because (i)–(iv) hold, by Theorem 5.4.2 of van der Vaart (1998), Inline graphic converges in probability to Inline graphic. Proposition 2 then follows from Theorem 5.4.1 of van der Vaart (1998) and the delta method.

Similar reasoning can be used to prove Proposition 3 under the following additional assumptions about the parametric propensity score model: Inline graphic is in an open subset of Euclidean space; Inline graphic exists and is nonsingular, where Inline graphic; Inline graphic is twice continuously differentiable with respect to Inline graphic and Inline graphic for some integrable measurable function Inline graphic for every Inline graphic; and Inline graphic.

Proof of reduction in variance with a correctly specified propensity score model

Using block matrix notation, write

U0=(U0Uμγ0p×2Uγ),V0=(V0VμγVμγTVγ),

where Inline graphic is the Inline graphic matrix of zeros. It is straightforward to show that Inline graphic and Inline graphic. It follows that

U01=(U01U01VμγVγ10p×2Uγ1)

and therefore

U01V0(U01T)={U01V0(U01T)U01VμγVγ1VμγT(U01T)},

where Inline graphic denotes quantities not expressed explicitly. Hence

Σ0D=Σ0DτU01VμγVγ1VμγT(U01T)τT.

Since Inline graphic is positive semidefinite, so is Inline graphic. Therefore Inline graphic. The same approach can be used to show that Inline graphic for Inline graphic.

References

  1. Becker-Dreps S., MelÉndez M., Liu L., Zambrana L. E., Paniagua M., Weber D. J., Hudgens M. G., CÁceres M., KÄllestÅll C., Morgan D. R. et al. (2013). Community diarrhea incidence before and after rotavirus vaccine introduction in Nicaragua. Am. J. Trop. Med. Hygiene 89, 246–50. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Espinoza F. Paniagua M. Hallander H. Svensson L. & StrannegÅrd Ö.. (1997). Rotavirus infections in young Nicaraguan children. Pediatric Inf. Dis. J. 16, 564–71. [DOI] [PubMed] [Google Scholar]
  3. HÁjek J. (1971). Comment on a paper by D. Basu. In Foundations of Statistical Inference, Godambe V. & Sprott D. eds. Toronto:Holt, Rinehart and Winston, p. 236. [Google Scholar]
  4. Halloran M. E. & Hudgens M. G.. (2012). Causal inference for vaccine effects on infectiousness. Int. J. Biostatist. 8, 1–40. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Halloran M. E. & Struchiner C. J.. (1995). Causal inference in infectious diseases. Epidemiology 6, 142–51. [DOI] [PubMed] [Google Scholar]
  6. Hong G. & Raudenbush S. W.. (2006). Evaluating kindergarten retention policy: A case study of causal inference for multilevel observational data. J. Am. Statist. Assoc. 101, 901–10. [Google Scholar]
  7. Horvitz D. G. & Thompson D. J.. (1952). A generalization of sampling without replacement from a finite universe. J. Am. Statist. Assoc. 47, 663–85. [Google Scholar]
  8. Hudgens M. G. & Halloran M. E.. (2008). Toward causal inference with interference. J. Am. Statist. Assoc. 103, 832–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Liu L. & Hudgens M. G.. (2014). Large sample randomization inference of causal effects in the presence of interference. J. Am. Statist. Assoc. 109, 288–301. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Luo X. Small D. S. Li C. S. R. & Rosenbaum P. R.. (2012). Inference with interference between units in an fMRI experiment of motor inhibition. J. Am. Statist. Assoc. 107, 530–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Manski C. F. (2013). Identification of treatment response with social interactions. Economet. J. 16, S1–23. [Google Scholar]
  12. Nickerson D. W. (2008). Is voting contagious? Evidence from two field experiments. Am. Polit. Sci. Rev. 102, 49–57. [Google Scholar]
  13. Perez-Heydrich C. Hudgens M. G. Halloran M. E. Clemens J. D. Ali M. & Emch M. E.. (2014). Assessing effects of cholera vaccination in the presence of interference. Biometrics 70, 731–44. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Rosenbaum P. R. (2007). Interference between units in randomized experiments. J. Am. Statist. Assoc. 102, 191–200. [Google Scholar]
  15. Sobel M. E. (2006). What do randomized studies of housing mobility demonstrate? Causal inference in the face of interference. J. Am. Statist. Assoc. 101, 1398–407. [Google Scholar]
  16. Stefanski L. A. & Boos D. D.. (2002). The calculus of M-estimation. Am. Statistician 56, 29–38. [Google Scholar]
  17. Tchetgen Tchetgen E. J. & VanderWeele T. J.. (2012). On causal inference in the presence of interference. Statist. Meth. Med. Res. 21, 55–75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. van der Vaart A. (1998). Asymptotic Statistics. Cambridge:Cambridge University Press. [Google Scholar]
  19. VanderWeele T. J. & Tchetgen Tchetgen E. J.. (2011). Effect partitioning under interference in two-stage randomized vaccine trials. Statist. Prob. Lett. 81, 861–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. VanderWeele T. J. Tchetgen Tchetgen E. J. & Halloran M. E.. (2012). Components of the indirect effect in vaccine trials: Identification of contagion and infectiousness effects. Epidemiology 23, 751–61. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Biometrika are provided here courtesy of Oxford University Press

RESOURCES