Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2008 Dec 17.
Published in final edited form as: J Am Stat Assoc. 2006 Mar;101(473):51–64. doi: 10.1198/016214505000000970

Causal Vaccine Effects on Binary Postinfection Outcomes

Michael G Hudgens 1, M Elizabeth Halloran 2
PMCID: PMC2603579  NIHMSID: NIHMS73841  PMID: 19096723

Abstract

The effects of vaccine on postinfection outcomes, such as disease, death, and secondary transmission to others, are important scientific and public health aspects of prophylactic vaccination. As a result, evaluation of many vaccine effects condition on being infected. Conditioning on an event that occurs posttreatment (in our case, infection subsequent to assignment to vaccine or control) can result in selection bias. Moreover, because the set of individuals who would become infected if vaccinated is likely not identical to the set of those who would become infected if given control, comparisons that condition on infection do not have a causal interpretation. In this article we consider identifiability and estimation of causal vaccine effects on binary postinfection outcomes. Using the principal stratification framework, we define a postinfection causal vaccine efficacy estimand in individuals who would be infected regardless of treatment assignment. The estimand is shown to be not identifiable under the standard assumptions of the stable unit treatment value, monotonicity, and independence of treatment assignment. Thus selection models are proposed that identify the causal estimand. Closed-form maximum likelihood estimators (MLEs) are then derived under these models, including those assuming maximum possible levels of positive and negative selection bias. These results show the relations between the MLE of the causal estimand and two commonly used estimators for vaccine effects on postinfection outcomes. For example, the usual intent-to-treat estimator is shown to be an upper bound on the postinfection causal vaccine effect provided that the magnitude of protection against infection is not too large. The methods are used to evaluate postinfection vaccine effects in a clinical trial of a rotavirus vaccine candidate and in a field study of a pertussis vaccine. Our results show that pertussis vaccination has a significant causal effect in reducing disease severity.

Keywords: Causal inference, Infectious disease, Maximum likelihood, Principal stratification, Sensitivity analysis

1. INTRODUCTION

The success of vaccines in reducing the burden of infectious diseases is one of the great achievements in public health. The goal of vaccination is often not to prevent infection, but rather to prevent or to ameliorate disease (Clements-Mann 1998). Therefore, many questions of interest in understanding the effects of prophylactic vaccines concern postinfection outcomes, such as severe disease, death, or transmission to others. As early as 1939, Kendrick and Eldering described less severe disease in children inoculated with pertussis vaccine. More recently, Préziosi and Halloran (2003a) estimated the beneficial effects of pertussis vaccination on reducing transmission to others (see also Halloran, Préziosi, and Chu 2003) and preventing severe disease (Préziosi and Halloran 2003b) in breakthrough cases. Individuals with varicella (chickenpox) vaccination also develop less severe disease (Vazquez et al. 2001). Assessing candidate human immunodeficiency virus (HIV) vaccine effects on progression to AIDS through both clinical and surrogate postinfection endpoints (Gilbert, Bosch, and Hudgens 2003a) as well as on transmission (Longini, Datta, and Halloran 1996) is crucial for evaluating their potential public health benefit.

Two main approaches are currently used for estimating postinfection effects of vaccines. The first approach includes all individuals under study, whether they become infected or not; the second includes only the individuals who become infected. The first approach enjoys the statistical validity associated with an intent-to-treat (ITT) analysis and provides an assessment of the overall benefit of vaccination. However, such an approach does not distinguish vaccine effects on susceptibility to infection from effects on the postinfection endpoints of interest. Also, it may be less sensitive in detecting postinfection vaccine effects than an analysis that conditions on infection. The second approach conditions on infection to estimate a net effect of the vaccine on the postinfection endpoint in just those people who become infected. However, conditioning on an event, such as infection, that occurs subsequent to receipt of vaccine or control may result in selection bias (Halloran and Struchiner 1995). Because the set of individuals who would become infected if vaccinated is likely not identical to the set of those who would become infected if given control, comparisons that condition on infection do not have a causal interpretation (Frangakis and Rubin 2002). For example, a vaccine might protect only people with strong immune systems, so that infected vaccinees tend to have weaker immune systems on average compared with infected controls. As a result, infected vaccinees could have worse postinfection outcomes on average than the infected controls due solely to selection bias. The resulting false conclusion that vaccination is harmful with respect to postinfection outcomes could result in the discarding of a potentially useful vaccine candidate (Hudgens, Hoering, and Self 2003; Gilbert et al. 2003a).

As a first motivating example, Préziosi and Halloran (2003b) analyzed data from a pertussis vaccine field study in Niakhar, Senegal from January 1 to December 31, 1993, in children age 6 months through 8 years. During that one calendar year, there were 3,845 and 1,020 person-years at risk in the vaccinated and unvaccinated children (M.-P. Préziosi, personal communication). Of 548 cases in the vaccinated group, 176 were severe, and of 206 cases in the unvaccinated group, 129 were severe. Préziosi and Halloran (2003b) defined an estimator of vaccine efficacy against severe disease conditional on being infected and having disease as

VE^P=1no. severe vaccinated casesno. vaccinated casesno. severe unvaccinated casesno. unvaccinated cases. (1)

The analysis yields VE^P=.49, 95% confidence interval (CI) = (.40, .56).

Vesikari, Rautanen, Varis, Beards, and Kapikian (1990) provided a second motivating example in their analysis of a randomized, double-blinded, placebo-controlled trial of a Rhesus rotavirus candidate vaccine. The trial was conducted in children age 2–5 months in 1985–1987 in Finland, with 100 children randomized to each arm. The effect of the vaccine on the clinical course of infection was considered by comparing severity (mild, moderate, or severe) between vaccinees and placebotreated individuals with confirmed Rotavirus diarrhea using Fisher’s exact test. If severe and moderately severe cases were combined, then the foregoing approach of Préziosi and Halloran could also be used. In particular, of 10 cases in the vaccinated group, 5 were severe or moderately severe, and of 16 cases in the placebo group, 13 were severe or moderately severe, such that, using (1), we have VE^P=.38, 95% CI=(-.11,.74).

At first glance, both vaccines seem to be protective against severe disease. However, due to the possibility of selection bias, we cannot be sure of this without further investigation. Different methods are used to adjust analyses for posttreatment variables, such as infection (Robins and Greenland 1992, 1994; Rosenbaum 1984). Our approach is based on an extension of the potential outcomes framework for estimating causal effects (Rubin 1978; Holland 1986). In particular, Frangakis and Rubin (2002) and Rubin (2000) proposed a method to adjust for posttreatment variables, called principal stratification, that stratifies on the joint potential posttreatment variables (e.g., infection) under each treatment being considered. The causal effects of one treatment compared with the other on a main outcome of interest (e.g., disease severity) are defined within these principal strata and are called principal effects. In studying HIV vaccines, Hudgens et al. (2003) and Gilbert et al. (2003b) adopted the principal stratification approach to assess HIV vaccine effects on the continuous postinfection outcome viral load in the principal stratum of individuals who would be infected under vaccine and placebo. However, they focused on testing, and did not consider binary postinfection endpoints.

In this article we use a similar approach, but in contrast to the previous work, here we use principal stratification to define causal vaccine effects on binary postinfection endpoints and focus on estimation and identifiability. We also analyze data from two actual vaccine studies. In Section 2 we formally define the estimands, particularly the causal estimand of interest and two currently used estimands for vaccine effects on postinfection outcomes, and state our three main assumptions of stable unit treatment value (SUTVA), independence, and monotonicity. In Section 3 we show the causal estimand to be not identifiable without further assumptions regarding the selective mechanism of the vaccine. In Section 4 we developed a variety of selection models that identify the causal estimand. We then derive closedform maximum likelihood estimators (MLEs) under several of these models, including the models that assume maximum possible levels of positive and negative selection bias. Using these approaches, we analyze the rotavirus vaccine candidate and the pertussis vaccine studies.

2. DEFINING VACCINE EFFECTS

Consider a group of individuals, i = 1,…,n, each of whom potentially receives either vaccine or placebo. Following the notation of Rubin (1990a) and similar to the development of Gilbert et al. (2003b), let the vector Z = (Z1,…,Zn) denote the vector of treatment assignments for n individuals with Zi = v if the ith individual is assigned vaccine and Zi = p if the ith individual is assigned placebo or control. In this article assignment is assumed to be equivalent to receipt. Denote the potential infection outcome of the ith individual if the study group were assigned Z as Si(Z), where Si(Z) = 0 if uninfected and Si(Z) 1 if infected. Then S(Z) is the n-vector of potential infection outcomes if the assignment vector were Z. For simplicity, we suppress the dependence on Z by letting S = S(Z). The focus of this work is on evaluating the causal effect of vaccine on a binary outcome Y that occurs after an individual becomes infected, such as transmission to another individual, severe disease, or death. Formally, if Si(Z) = 1, then we define the potential postinfection outcome Yi(Z, S) 1 if the ith individual would have the worse (i.e., more severe) postinfection outcome of interest given (Z, S), and Yi(Z, S) = 0 otherwise. If an individual’s potential infection outcome for an assignment is uninfected [i.e., Si(Z) = 0], then Yi(Z, S) is undefined and denoted by *.

We assume that the potential outcomes for each individual i are independent of the treatment assignment of other individuals. This assumption, known also as the assumption of no interference between units (Cox 1958), is obviously violated in many infectious disease settings (Rubin 1978, 1990a; Halloran and Struchiner 1995); however, we make it to clarify the problem in its simplest form. We also assume there are only two possible infection outcomes and, at most, two possible postinfection outcomes. These two assumptions are expressed formally as follows.

Assumption 1

Stable unit treatment value assumption (SUTVA) (Rubin 1978; Angrist, Imbens, and Rubin 1996). For any two treatment assignments, Z = (Z1,…,Zn) and Z=(Z1,,Zn), the following hold:

  1. If Zi=Zi, then Si(Z) = Si(Z′).

  2. If Zi=Zi and Si=Si, then Yi(Z,S) = Yi(Z′, S′).

The SUTVA assumption allows us to write Si(Zi) and Yi(Zi, Si(Zi)) rather than Si(Z) and Yi(Z, S). Under SUTVA, Si(Zi)=Si(Zi) whenever Zi=Zi, so we can write Yi(Zi) instead of Yi(Zi, Si(Zi)). Finally, we let Siobs denote the observed infection outcome Si(v) or Si(p), depending on treatment assignment, and analogously, Yiobs for the observed postinfection outcome.

We define a basic principal stratification, P0, according to the joint potential infection outcomes SP0 = (S(v), S(p)) (Frangakis and Rubin 2002). Table 1 summarizes the four basic principal strata defined by the joint potential infection outcomes, (S(v), S(p)), and the strata defined by the joint potential postinfection outcomes, (Y(v), Y(p)), within each principal stratum. The four basic principal strata are composed of immune (not infected under both vaccine and placebo), harmed (infected under vaccine but not placebo), protected (infected under placebo but not vaccine), and doomed individuals (infected under both vaccine and placebo). Because membership in a basic principal stratum is not affected by whether an individual is actually assigned to vaccine or placebo, the strata can be used in the same way as pretreatment covariates, with causal postinfection vaccine effects defined within a basic principal stratum SP0.

Table 1.

Basic Principal Stratification P0 Based on the Potential Infection Outcomes (S(v), S(p)) With Potential Postinfection Strata Based on (Y(v), Y(p))

Potential infection strata
Potential postinfection strata
Basic
principal
stratum, SP0
Potential
infection
outcomes
(S(v), S(p))
Potential
postinfection
outcomes
(Y(v), Y(p))
Postinfection interpretation
Immune (0, 0) (*, *) Always undefined
Harmed (1, 0) (0, *) Not severe vaccine, undefined placebo
(1, *) Severe vaccine, undefined placebo
Protected (0, 1) (*, 0) Undefined vaccine, not severe placebo
(*, 1) Undefined vaccine, severe placebo
Doomed (1, 1) (0, 0) Never severe
(1, 0) Harmed by vaccine
(0, 1) Helped by vaccine
(1, 1) Always severe

In general, causal effects are defined in terms of potential outcomes. Table 1 shows that the doomed basic principal stratum, SP0 = (1, 1), is the only stratum in which both potential postinfection endpoints, and thus their joint distribution, are defined. For this reason, in our setting, defining individual postinfection causal vaccine effects makes sense only in the doomed basic principal stratum, SP0 = (1, 1). In other words, we can speak of a vaccine causing an improvement or worsening of a postinfection outcome only for an individual who would become infected whether vaccinated or not. Thus, following Rubin (2000), two population-level causal estimands can be validly defined: (1) the effect of vaccine on infection (S) for all participants, and (2) the effect of vaccine on the binary postinfection outcome (Y) for those participants who would be infected under both treatment assignments.

To define the first causal estimand, the individual causal effect of vaccination on infection in individual i might be expressed as a difference or ratio (Rubin 1990b),

Si(p)Si(v)orSi(v)Si(p). (2)

Vaccine efficacy is typically expressed in terms of a relative risk difference, so that, similar to Struchiner and Halloran (1996), we define the individual causal effect of vaccination on infection as

VESi=Si(p)Si(v)Si(p)=1Si(v)Si(p), (3)

with the convention VESi = 0 when Si(v) = Si(p) = 0. Recall that we assume there is perfect compliance; that is, assignment to vaccine (control) is equivalent to receipt of vaccine (control). Without this assumption, (2) and (3) would be causal effects of assignment to vaccine. The ramifications of noncompliance on the assessment of causal vaccine effects are not considered here.

Because for any individual at least one potential outcome is always unobserved, the individual causal effect of vaccine on infection cannot be observed. Therefore, one must rely on contrasts between different individuals to estimate average or population-level causal effects (Holland 1986). Given the individual causal effect (3), similar to Struchiner and Halloran (1996) and Gilbert et al. (2003b), we define the corresponding population causal vaccine effect by

VES=E{S(p)S(v)}E{S(p)}=E{S(p)}E{S(v)}E{S(p)}=1E{S(v)}E{S(p)}.

We call this the relative average causal effect (RACE) of vaccination on infection. Because S(v) and S(p) are binary, the expectations can be replaced by probabilities, and VES can be interpreted as the relative reduction in the probability of infection given vaccine compared with control.

Estimation of population average causal effects requires specification of the assignment mechanism (Rubin 1990b), that is, how the individuals were assigned to vaccine or placebo. We assume that treatment assignment is independent of the potential outcomes.

Assumption 2

Z is independent of {Y(v), Y(p), S(v), S(p)}.

Here and in the sequel, Y(z) = * if S(z) = 0 for z = v, p. Randomization is one assignment mechanism where the treatment assignment is independent of the potential outcomes. It now follows

VES=1E{S(v)Z=v}E{S(p)Z=p}=1E{SobsZ=v}E{SobsZ=p}.

Next we define three estimands regarding the effect of vaccination on the postinfection outcome Y. Usually, the notation VEI is used specifically for vaccine efficacy for infectiousness, that is, the effect of vaccination on transmission to others (Halloran, Struchiner, and Longini 1997), and VEP is used for vaccine efficacy for progression or severity as in (1). Here we use VEI throughout the development to refer to vaccine efficacy for any binary postinfection outcome. This enables us to use standard notation for the secondary attack rate (SAR), the probability of transmission from an infected to a susceptible individual. In the development that follows, SAR is defined more generally as the proportion of infected individuals with the worse postinfection outcome.

One current approach to assessing vaccine effects on postinfection endpoints defines a net vaccine effect estimand that conditions on infection, that is,

VEInet=1E{YobsSobs=1,Z=v}E{YobsSobs=1,Z=p}=1E{Y(v)S(v)=1}E{Y(p)S(p)=1},

with the second equality following from Assumption 2. However, in general VEInet does not have a causal interpretation, because the set of individuals with S(v) = 1 is not necessarily identical to the set of individuals with S(p) = 1 (Rosenbaum 1984; Frangakis and Rubin 2002).

Another current approach defines an estimand for the effect of vaccination on different levels of severity of disease or mortality without conditioning on being infected. Such an estimand might be considered ITT, because it does not condition on the posttreatment variable Sobs; that is, it incorporates all individuals according to their treatment assignment. An example of an ITT estimand is

VEIITT=1E{Y(v)×S(v)}E{Y(p)×S(p)},

where we adopt the convention Y(z) × S(z) = 0 if S(z) = 0, z = v, p. This is a general form for what Préziosi and Halloran (2003b) called “VES for severity.” Although VEIITT does have a causal interpretation, it combines vaccine effects on susceptibility and postinfection outcome as shown by the following relation:

VEIITT=1(1VES)(1VEInet).

We propose a third estimand for the effect of vaccination on a binary postinfection outcome that has a causal interpretation and is separate from vaccine effects on susceptibility. To define the causal VEI, we use the basic principal stratification shown in Table 1. In particular, the individual causal vaccine effect on the postinfection outcomes is defined as

VEIi=1Yi(v)Yi(p),

for individuals within the doomed principal stratum only. Following the development of VES earlier, we define the population postinfection causal vaccine effect VEI within the doomed principal stratum as

VEI=1E{Y(v)SP0=(1,1)}E{Y(p)SP0=(1,1)}. (4)

Like VES, (4) could equivalently be given in terms of probabilities, because the postinfection random variables Y(v) and Y(p) are assumed to be binary, such that VEI can be interpreted as the causal estimand measuring the relative reduction in the probability of the worse postinfection outcome given vaccine compared with placebo in those individuals who would be infected under either treatment assignment. The challenge in estimating principal effects such as (4) lies in identifying those individuals in the doomed basic principal stratum, because at most one potential infection outcome is ever observed for a particular individual. To help overcome this obstacle, we make the following monotonicity assumption.

Assumption 3

Monotonicity. The individual causal effect of vaccination on infection is nonnegative; that is, VESi ≥ 0 for all i.

Under this assumption, an individual who would become infected under vaccine also would become infected under placebo, so the harmed principal stratum SP0 = (1, 0) is empty. In the context of possible noncompliance with treatment assignment, Angrist et al. (1996) and Imbens and Rubin (1997) made a monotonicity assumption akin to Assumption 3, which immediately eliminates the basic principal stratum of defiers (persons who would always take the opposite of their treatment assignment). By Assumption 3, we are not assuming that the individual postinfection causal effect of vaccination is nonnegative; rather, we allow that vaccination could have a negative (i.e., harmful) causal effect on the postinfection outcome. Indeed, one of our goals is to identify the conditions under which it could be determined the postinfection causal effect is negative.

Under Assumption 3, for each possible observed combination of (Zi,Siobs), we can deduce to which basic principal stratum an individual must belong:

(Zi=p,Siobs=0)SP0=(0,0)(becauseVESi0),(Zi=p,Siobs=1)SP0=(0,1)orSP0=(1,1),(Zi=v,Siobs=0)SP0=(0,1)orSP0=(0,0),(Zi=v,Siobs=1)SP0=(1,1)(becauseVESi0).

Thus the infected vaccine recipients will immediately provide information about the distribution of the postinfection outcome Y(v) in the doomed principal stratum. However, getting a handle on Y(p) in SP0 = (1, 1) is still problematic, because infected placebo recipients may be members of either the protected or the doomed principal stratum. In Section 4 we derive models that allow us to identify the distribution of Y(p) in the doomed principal stratum and, in turn, the postinfection causal vaccine effect VEI given in (4).

3. IDENTIFIABILITY AND ESTIMATION OF VACCINE EFFECTS

In this section we investigate identifiability and estimation of the estimands defined in the previous section under Assumptions 1–3 only.

3.1 Parameterization

Let the parameters θ govern the probabilities associated with the basic principal stratum. In our case the stratum SP0 is empty, so let θ = (θ00, θ01, θ11), where

Pr{SP0=(i,j,);θ}=θijfori,j=0,1;ij. (5)

Next, let the parameters ϕ = (ϕ00, ϕ01, ϕ10, ϕ11) govern the probabilities associated with the joint potential postinfection outcomes in the doomed basic principal stratum SP0 = (1, 1), where

Pr{(Y(v),Y(p))=(k,m)SP0=(1,1);ϕ}=ϕkmfork,m=0,1. (6)

Let the parameters γ = (γ0, γ1) govern the probabilities associated with the two possible potential postinfection outcomes under placebo in the protected basic principal stratum, SP0 = (0, 1), where

Pr{Y(p)=iSP0=(0,1);γ}=γlforl=0,1. (7)

Finally, let the law of Z be given by Pr{Z=z; ψ}=ψz for z = v, p.

Under this parameterization, the causal estimand of vaccine efficacy for susceptibility is

VES=1θ11θ01+θ11.

Based on the definition of the causal estimand VEI given in (4), we are interested not in the joint probabilities ϕkm(k,m=0,1), but rather in just two of the marginal probabilities. In particular, under the foregoing parameterization,

VEI=1ϕ1.ϕ.1, (8)

where

Pr{Y(v)=1SP0=(1,1)}=ϕ10+ϕ11=ϕ1.

and

Pr{Y(p)=1SP0=(1,1)}=ϕ01+ϕ11=ϕ.1.

We also have

VEInet=1ϕ1.γ1VES+ϕ.1(1VES)

and

VEIITT=1ϕ1.(1VES)γ1VES+ϕ.1(1VES).

3.2 Identifiability

Without further assumptions, VEI is not identifiable from the observable random variables (Yobs, Sobs, Z). To see this, first write the probability function for the observable random variables Pr[Z=z,Sobs=s,Yobs=y; θ, γ, ϕ ψ] as

θ00ψpI(z=p,s=0,y=)+{θ01+θ11(θ01γ1+θ11ϕ.1)}ψpI(z=p,s=1,y=0)+(θ01γ1+θ11ϕ.1)ψpI(z=p,s=1,y=1)+(θ00+θ01)ψvI(z=v,s=0,y=)+θ11(1ϕ1.)ψvI(z=v,s=1,y=0)+θ11ϕ1.ψvI(z=v,s=1,y=1), (9)

where I(·) is the usual indicator function. Next, suppose that we have θ01 = θ11, ϕ.1=, and γ1=1- for ε ∈ (0,1). Then the probability function given in (9) is constant for different values of ∊. Moreover, VEI =1-ϕ1./ depends on the choice of ∊, indicating that VEI is not identifiable. In contrast, the estimands VES, VEIITT, and VEInet are identifiable without further assumptions.

3.3 Estimation

Suppose that we observe n independent and identically distributed realizations of (Z, Sobs, Yobs), where Yobs is undefined or does not exist if Sobs = 0. There are six possible observed combinations of (Z, Sobs, Yobs). Let nsy(z) be the number of each combination observed in the study population, where s = 0, 1 is the observed infection outcome Sobs; y = 0, 1, * is the observed postinfection outcome Yobs; and z = v, p. That is,

n0(p)=iI(Zi=p,Siobs=0,Yiobsdoes not exist),n10(p)=iI(Zi=p,Siobs=1,Yiobs=0),n11(p)=iI(Zi=p,Siobs=1,Yiobs=1)

and

n0(v)=iI(Zi=v,Siobs=0,Yiobsdoes not exist),n10(v)=iI(Zi=v,Siobs=1,Yiobs=0),n11(v)=iI(Zi=v,Siobs=1,Yiobs=1),

where the summations are over i = 1,…,, n. (Note that the double subscripts do not have the same meaning for the n’s as for the ϕ’s and θ’s.) We assume each of the six combinations is observed at least once, that is, nsy(z) > 0 for (s, y) = (0, *), (1, 0), (1, 1) and z = v, p. Let n(p) = n0*(p) + n10(p) + n11(p) and n(v) = n0*(v) + n10(v) + n11(v) denote the number of individuals assigned to placebo and to vaccine. Let n1.(p) = n10(p) + n11(p) and n1.(v) = n10(v) + n11(v) denote the number of infected individuals assigned to placebo and vaccine. Let

ARz=n1.(z)n(z)forz=v,p;

that is, ARz is the observed attack rate in the group assigned treatment z. Finally, let

SARz=n11(z)n1.(z)forz=v,p;

that is, SARz is the observed secondary attack rate in the group infected given treatment z.

MLEs of the identifiable vaccine efficacy estimands can be found by maximizing the likelihood

L(θ,γ,ϕ)i=1nPr[Yiobs=yi,Siobs=siZi=zi;θ,γ,ϕ],

subject to constraints on θ, γ, and ϕ that ensure that (5)–(7) are probability functions. Using the results from the Appendix, it follows that the MLE of VES is given by

VE^S={1ARvARpifARvARp0otherwise.} (10)

Further, the MLEs of VEIITT and VEInet are

VE^Inet=1SARvSARp (11)

and

VE^IITT=1(1VE^S)SARvSARp (12)

or, equivalently,

VE^IITT={VE^InetifVE^S=01n11(v)n(v)n11(p)n(p)ifVE^S>0.} (13)

The causal estimand VEI is not identifiable because ϕ.1, the denominator of the right side of (8), is not identifiable. But ϕ1., the numerator of the right side of (8), can be identified by the observable random variables. The corresponding MLE is given by

ϕ^1.=SARv, (14)

that is, the observed secondary attack rate in the vaccine arm. Finally, although ϕ.1 is not identifiable, we can identify

Pr[Y(p)=1S(p)=1;θ,γ,ϕ]=γ1VES+ϕ.1(1VES). (15)

The MLE of (15) is SARp such that any feasible pair (γ^1,ϕ^.1)satisfying

SARp=γ^1VE^S+ϕ^.1(1VE^S) (16)

is an MLE of (γ1, ϕ.1). In the following section we introduce selection models that identify (γ1, ϕ.1). These models add an additional constraint to the parameters such that there is only one feasible pair (γ^1,ϕ^.1) that satisfies (16). Because the value of the likelihood remains the same, we have the unique MLE of ϕ.1 and, therefore, VEI. Before proceeding, we revisit our examples.

3.4 Applications

3.4.1 Rotavirus Candidate Vaccine

In the rotavirus candidate vaccine study (Vesikari et al. 1990), the observed data were

n0(p)=84,n0(v)=90,n10(p)=3,n10(v)=5,n11(p)=13,n11(v)=5.

From (10), VE^S=1 – (10/100)/(16/100) = .375. It then follows from (13) that VE^IITT=1 – (5/100)/(13/100) = .62. The secondary attack rates are SARv=ϕ^1.=510=.50 and SARp = 13/16 = .81, such that VE^Inet=1 – (5/10)/(13/16) = .385.

To consider estimation of VEI, Table 2 shows the relation of the observed data to the basic principal strata and the strata of joint potential postinfection outcomes within each basic principal stratum. By Assumptions 1–3, we know the following:

  • All n10(v) + n11(v) = 10 belong to the doomed stratum SP0 = (1, 1).

  • All n0*(p) = 84 belong to the immune stratum SP0 = (0, 0).

  • The n0*(v) = 90 could belong to SP0 = (0, 0) or SP0 = (0, 1).

  • The n10(p) + n11(p) = 16 could belong to SP0 = (0, 1) or SP0 = (1, 1).

Ignoring statistical variability, by Assumption 2 (independence), because there are 10 vaccine recipients in the doomed stratum, there are 10 placebo recipients in the doomed stratum as well. Because there are 84 placebo recipients in the immune stratum, there are 84 vaccine recipients in the immune stratum as well. Thus there must be 6 recipients from each of the vaccinated and unvaccinated groups in the protected stratum. Thus we can estimate the size of the unobserved principal stratum SP0 = (0, 1). However, we do not know which 6 of the 16 infected placebo recipients are in the protected stratum SP0 = (0, 1) or which 10 of the 16 are in the doomed stratum SP0 = (1, 1). This illustrates the need for further assumptions to identify VEI.

Table 2.

Rotavirus Vaccine Example (Vesikari et al. 1990): Relation of Observed Data to the Basic Principal Strata of the Joint Potential Infection Outcomes and the Strata of Joint Potential Postinfection Outcomes

graphic file with name nihms-73841-t0004.jpg

NOTE: The shaded regions correspond to membership within the basic principal strata with observations outside the shaded regions having unknown principal stratum. Numbers in [ ] are not observed, but follow from Assumptions 1-3 and the observed data.

3.4.2 Pertussis Vaccine

The pertussis vaccine analysis of Préziosi and Halloran (2003b) included exactly 1 year of follow-up (the calendar year 1993), so that the person-years at risk are a close approximation to the number of persons at risk. Thus we use the person-years at risk for n(v) and n(p). Although vaccine status was not randomized, there was no evidence of systematic differences between the vaccinated and unvaccinated groups, indicating that Assumption 2 (independence) might be reasonable. Active surveillance to find cases makes it unlikely that ascertainment bias plays a role. (See Préziosi and Halloran 2003b for further discussion.) The observed data are

n0(p)=814,n0(v)=3,297,n10(p)=77,n10(v)=372,n11(p)=129,n11(v)=176.

From (10), VE^S=1 – (548/3,845)/(206/1,020) =.29. The secondary attack rates are SARv=ϕ^1.=176548=.32 and SARp = 129/206 = .63, such that VE^Inet=1 – (176/548)/(129/206) = .49, which is the same as VE^P given in (1). Finally VE^IITT=1 – (176/3,845)/(129/1,020) = .64, which Préziosi and Halloran (2003b) called VE^S for severity.

4. SELECTION BIAS MODELS

In the previous section we showed that VEI is not identifiable under Assumptions 1–3 (SUTVA, independence, and monotonicity). In particular, the parameterization (5)–(7) defines a family of joint distribution functions of (Z, S(v), S(p), Y(v), Y(p)) for which the resulting conditional distributions of Y(p)|SP0 = (1, 1) are not identified from the observable random variables (Z, Sobs, Yobs). In this section we consider different additional assumptions about the selective effect of the vaccine on susceptibility to infection that restrict inference to a subset of this family such that the distribution of Y(p)|SP0 = (1, 1) and, in turn, VEI are identifiable. The lack of identifiability of VEI derives from ϕ.1 and γ1 occurring in (9) only through the term

θ01γ1+θ11ϕ.1=Pr[Yobs=1,Sobs=1Zobs=p]. (17)

Thus for any fixed values of θ01, θ11, and Pr[Yobs = 1, Sobs = 1| Zobs = p], all pairs of parameters

{(γ1,ϕ.1):0γ11,0ϕ.11,and(17)holds}, (18)

will yield the same distribution of (Z, Sobs, Yobs). The selection models presented in this section place additional constraints on the parameter space such that only one pair of parameters satisfies (18). These selection models are nonparametric in the sense that no additional restrictions are placed on the distribution of (Z, Sobs, Yobs) (Scharfstein, Rotnitzky, and Robins 1999). Although the observed data do not allow us to distinguish among selection models, through sensitivity analyses we can estimate VEI under each selection model in a collection of models that represent a range of plausible selective effects of vaccine. Moreover, if we can show that VEI is significantly greater (less) than 0 over all values of possible selective effects, the observed data provide sufficient evidence to conclude the vaccine has a positive (negative) causal effect on the postinfection endpoint under Assumptions 1–3 only.

We begin in Section 4.1 by assuming there is no selective vaccine effect. In Section 4.2 we present two extreme models that assume maximum possible levels of positive and negative selection bias and, in turn, yield bounds on the MLE of the causal estimand in a fashion similar to the approaches of Manski (1990), Balke and Pearl (1997), and Hudgens et al. (2003). In Section 4.3 we present three approaches to sensitivity analysis that allow a range of selective effects. We discuss making inferences on VEI accounting for statistical variability in Section 4.4, and then apply the different methods to the two vaccine examples in Section 4.5.

4.1 No Selection Bias

The simplest assumption is that there is no selection; that is, the probability of the postinfection outcome conditional on infection under placebo is independent of infection status under vaccine,

Pr{Y(p)=ySP0=(1,1);ϕ}=Pr{Y(p)=ySP0=(0,1);γ}fory=0,1, (19)

which implies ϕ.1 = γ1. It follows immediately that assumption (19) identifies VEI and, from (16), that the resulting MLE is

ϕ^.1=SARp. (20)

From (14), (20), and the definition of VEI given by (8), it follows that the MLE of VEI equals VE^Inet as given in (11). In other words, under the additional assumption of no selection bias as specified by (19), the MLE of the causal vaccine effect is the usual secondary attack rate ratio estimator one obtained when conditioning on infection as in (1).

4.2 Upper and Lower Bounds

In this section we present selection models that provide the largest and smallest estimators of VEI. In particular, the upper bound selection model yields the parameter pair satisfying (18) that has the largest ϕ.1 (and thus the largest VEI). Because (18) is simply the intersection of the unit square and a line with negative slope, it follows that the pair with maximal ϕ.1 must be on the edge of the square, that is, either when

Pr[Y(p)=1SP0=(1,1)]=ϕ.1=1 (21)

or

Pr[Y(p)=1SP0=(0,1)]=γ1=0. (22)

In words, the upper bound selection bias model assumes either that (a) all placebo recipients in the doomed principal stratum have the worse postinfection outcome or (b) all placebo recipients in the protected principal stratum have the better postinfection outcome. Under the model assuming either (21) or (22), ϕ.1 is identifiable; the proof is not shown but is available from the authors. From (16), it follows that the unique MLE of VEI assuming either (21) or (22) is given by

VE^Iupper={1SARvifVE^S>1SARpVE^IITTif0<VE^S1SARpVE^InetifVE^S=0.} (23)

All MLEs obtained under Assumptions 1–3 must be less than or equal to VE^Iupper.

Similarly, the lower bound selection bias model assumes that under assignment to placebo, the worse postinfection outcome occurs either with probability 0 in the doomed principal stratum,

Pr[Y(p)=1SPo=(1,1)]=ϕ.1=0, (24)

or with probability 1 in the protected principal stratum,

Pr[Y(p)=1SP0=(0,1)]=γ1=1. (25)

This model renders ϕ.1 identifiable, and from (16) the resulting unique MLE of VEI is

VE^Ilower={ifVE^S>SARp1SARv{SARpVE^S1VE^S}if0<VE^SSARpVE^InetifVE^S=0.} (26)

In summary, even though VEI is not identifiable from the data at hand, these extreme selection bias models provide bounds on the MLE. Moreover, we have derived the circumstances when the upper bound will be negative (suggesting harm) and when the lower bound will be positive (suggesting benefit). For example, VE^Iupper will be negative if and only if VE^S1SARp and VE^IITT0. Similarly, VE^Ilower will be positive if and only if VE^SSARp and SARv(SARpVE^S)(1VE^S). In contrast, for VE^Smax{SARp,1SARp}, we know that VE^Iupper will be always positive and VE^Ilower will be always negative. In other words, for large enough VE^S we cannot determine the sign of VEI unless further assumptions are made beyond SUTVA, independence, and monotonicity.

4.3 Sensitivity Analyses

Estimating the causal vaccine effect under an extreme degree of selection bias is useful in bounding the estimate of the postinfection effect above and beyond any possible selective effects. However, the true degree of selection bias is likely less than the foregoing extreme models, such that using VE^Iupper or VE^Ilower may be too conservative. Therefore, we consider classes of selection models that include the foregoing no selection and extreme models. Through sensitivity analysis over a particular class, we can explore the relationship between the assumed degree of selection bias and our inference about VEI. This approach allows for potentially more realistic models and less conservative inference.

4.3.1 Log Odds Ratio of Infection

Our first approach is similar to that of Scharfstein et al. (1999) and Robins, Rotnitzky, and Scharfstein (2000) and adapted by Gilbert et al. (2003b). We define a sensitivity model in terms of the log odds ratio of having the severe postinfection endpoint under placebo in the doomed versus protected principal strata,

exp(β)=Pr[Y(p)=1SP0=(1,1)]Pr[Y(p)=0SP0=(1,1)]Pr[Y(p)=1SP0=(0,1)]Pr[Y(p)=0SP0=(0,1)]. (27)

For example, if exp(β) = 2, then doomed individuals have twice the odds of having the worse postinfection outcome under placebo compared with protected individuals. In terms of our parameterization, this implies that

ϕ.1=γ1exp(β)γ0+γ1exp(β). (28)

For fixed β, we can solve (16) and (28) for ϕ.1 and, in turn, VEI. A sensitivity analysis can then be performed by repeating this process over a range of different β’s.

The selection models given in Sections 4.1 and 4.2 are special cases of the class of logistic selection models given by (27) when β = 0 (no selection), β → ∞ (upper bound), or β → —∞ (lower bound). For example, letting β → ∞

Pr[Y(p)=1SP0=(1,1)]1 (29)

or

Pr[Y(p)=1SP0=(0,1)]0. (30)

Because (29) implies (21) and (30) implies (22), we recover the extreme upper selection bias model. The relation among the log odds ratio, upper bound, and no selection models are depicted graphically in Figure 1.

Figure 1.

Figure 1

Graphical Depiction of Selection Bias Models Described in Section 4. The four dotted curves depict the log odds ratio selection model for β = 0, 1, 2, 4. The β = 0 model is equivalent to the no selection model. The thick black lines on the edge of the unit square correspond to the upper-bound selection model. The solid thin line with negative slope represents an example of a family of joint distribution functions of (Z, S(v), S(p), Y(v), Y(p)) for which all combinations of parameters (ϕ.1, γ1) on this line give rise to the same distribution of the observable random variables (Z, Sobs,Yobs). Each selection model identifies exactly one pair of parameters (ϕ.1, γ1) from this family, rendering VEI = 1 − ϕ1..1 identifiable.

4.3.2 Conditioning on γ1 as the Sensitivity Analysis Parameter

Section 4.3.1 posits a relation between the distributions of the postinfection endpoint under placebo in the doomed and protected strata. Alternatively, we might simply condition on the nuisance parameter γ1, which governs the postinfection endpoint distribution in the protected stratum. Assuming that γ1 is known renders ϕ.1 and, hence, VEI identifiable. From (16), the resulting MLE of VEI is

VE^I=1SARv{SARpγ1VE^S1VE^S}, (31)

where γ1 varies between

max{0,SARp(1VE^S)VE^S}γ1min{1,SARpVE^S}, (32)

with the left side of (32) giving rise to VE^Iupper and the right side of (32) giving rise to VE^Ilower.

4.3.3 Complete-Data Model

A third approach to sensitivity analysis regards the unknown basic principal stratum membership of the infected placebo recipients as missing data and formulates the sensitivity analysis in terms of the complete-data likelihood. The observed data are n10(p) and n11(p). If we could know the basic principal stratum membership, then the complete data would be n10d(p) and n10d(p), the number of infected placebo recipients in the doomed stratum with Y(p) 0 and Y(p) = 1, and n10p(p) and n11p(p), the corresponding number in the protected stratum. Given the complete data, ϕ.1 becomes identifiable. The complete-data log-likelihood for (θ, ϕ, γ) is given by

n0(p)log(θ00)+n10p(p)log(θ01γ0)+n11p(p)log(θ01γ1)+n10d(p)logθ11(1ϕ.1))+n11d(p)log(θ11ϕ.1)+n0(v)log(θ00+θ01)+n10(v)log(θ11(1ϕ.1))+n11(v)log(θ11ϕ1.), (33)

which factors in terms of θ, ϕ, and γ, say l(θ), l(ϕ), and l(γ). Maximizing l(ϕ) yields the same MLE ϕ^1=SARv as before. It also yields the MLE

ϕ^.1=n11d(p)n10d(p)+n11d(p), (34)

the secondary attack rate under placebo in the doomed stratum. The sensitivity analysis involves estimating VEI using (34) for all possible complete-data configurations consistent with the assumptions and the constraints implied by the observed data. Molenberghs, Kenward, and Goetghebeur (2001) called this set of point estimates the region of ignorance, and called the collection of CIs or other measures of precision together with the region of ignorance the region of uncertainty.

4.4 Statistical Variability

Once we assume a particular selection model, we have a finite-dimensional parametric inference problem with a unique MLE. Therefore, conditional on a selection model, we use standard methods to obtain CI estimates for VEI. For example, CIs can be computed assuming the usual chi-squared limiting distribution of the profile likelihood ratio (Barndorff-Nielsen and Cox 1994). Alternatively, using the observed information and the delta method, Wald-type CIs for VEI can be determined. The resulting CIs can then be used to determine a region of uncertainty for any of the sensitivity analyses described earlier. A region of uncertainty that excludes 0 will then imply a statistically significant postinfection causal effect of the vaccine.

4.5 Applications, Continued

4.5.1 Rotavirus Candidate Vaccine

For these data, VE^S1SARp, so from (23), VE^Iupper=1SARv=.50. In contrast, 0VE^SSARp, so, from (26),

VE^Ilower=1510{1316(1(10100)(16100))}(10100)(16100)=.29.

Figure 2(a) shows the sensitivity analysis of VE^I as a function of the odds ratio eβ as described in Section 4.3.1. Figure 3(a) shows the sensitivity analysis of VE^I as a function of γ1 as described in Section 4.3.2. From (32), the range for γ1 is [.5, 1.0 in this example. For these figures, profile likelihood–based CIs are presented; Wald-type CIs gave qualitatively similar results. The vertical dotted line in Figures 2 and 3 corresponds to the assumption of no selection bias. In particular, in Figure 3, γ^1 corresponds to the MLE of γ1 under the assumption of no selection bias.

Figure 2.

Figure 2

Sensitivity Analysis Using the Odds Ratio of Having the Severe Postinfection Endpoint Under Placebo in the Doomed versus Protected Principal Strata as Described in Section 4.3.1. (a) Rotavirus; (b) pertussis. The vertical dotted line corresponds to the assumption of no selection bias.

Figure 3.

Figure 3

Sensitivity Analysis Assuming γ1 = Pr{Y(p) = 1|SP0 = (0, 1); γ} Is Known as Described in Section 4.3.2. (a) Rotavirus; (b) pertussis. The vertical dotted line gives the MLE of γ1 under the assumption of no selection bias.

Table 3 gives all possible configurations of the complete data for the placebo arm consistent with the observed data and Assumptions 1–3 as described in Section 4.3.3. On the one hand, the complete data are constrained by the observed data to satisfy n11(p)=n11d(p)+n11p(p)=13 and n10(p)=n10d(p)+n10p(p)=3. On the other hand, Assumptions 1–3 imply that of the total of 16 infected placebo recipients, 6 must be in the protected stratum and 10 must be in the doomed stratum. There are only four complete-data configurations that satisfy these constraints. These four configurations can be envisaged by assigning the three infected placebo recipients with mild infection to either the doomed stratum or the protected stratum. At one extreme, the three are all in the doomed stratum; at the other extreme, they are all in the protected stratum. The resulting four MLEs of VEI given in the bottom row of Table 3 compare the region of ignorance. For this dataset, the assumption of no selection bias does not correspond to any possible complete-data configuration. But the extreme lower and upper bound estimates, VE^Ilower and VE^Iupper, are included in the region of ignorance.

Table 3.

Rotavirus Vaccine Example (Vesikari et al. 1990): All Possible Configurations of the Underlying Complete Data for the Placebo Arm Consistent With the Observed Data and Assumptions 1–3

graphic file with name nihms-73841-t0005.jpg

NOTE: The shaded regions correspond to membership within the basic principal strata with observations outside the shaded regions having unknown principal stratum. Numbers in [ ] are not observed, but follow from the observed data and Assumptions 1-3. Maximum likelihood estimates of causal VEI are given for each possible complete data configuration. The no selection bias model is also included, although it does not correspond to any of the possible complete data configurations.

This example demonstrates the utility of the sensitivity analyses. For these data, the 95% CI for VEI covers 0 for the lower bound selection model, indicating that one cannot conclude that the vaccine has a protective causal effect against severe disease at the α = .05 level. In contrast, the upper bound selection model CI excludes 0, indicating a significant protective effect. In this situation, sensitivity analyses can be used to assess at what point significance is attained. For example, Figures 2(a) and 3(a) indicate that even under the assumption of no selection bias, the effect remains nonsignificant.

Finally, the lack of statistical significance in this example may be attributed simply to small sample size. For instance, had the study been such that there were 1,000/arm with the same observed marginal distributions (i.e., 3% mild and 13% severe in the placebo arm, 5% mild and 5% severe in the vaccine arm), then the 95% CI for VEI under the lower bound model would have been [.09,.46], indicating a significant causal vaccine effect on rotavirus disease severity in individuals who would be infected regardless of vaccination status.

4.5.2 Pertussis Vaccine

For these data, 0VE^S1SARp, so from (23), VE^Iupper=VE^IITT=.64. In contrast, 0VE^SSARp, so from (26), VE^Ilower=.32. Figures 2(b) and 3(b) show the sensitivity analyses of VE^I as described in Sections 4.3.1 and 4.3.2. In both figures, the lower limits of the 95% CIs are well above 0 over the range of both selection models, suggesting that pertussis vaccination causes significant protection against severe disease in children who would develop pertussis regardless of vaccination. The sensitivity analysis using the complete-data likelihood would proceed similarly as for the rotavirus vaccine candidate, with many more possible data configurations and taking the unequal sizes of the vaccinated and unvaccinated arms into account.

5. DISCUSSION

For vaccine studies in which the effect of vaccination on binary postinfection outcomes is of interest, our results will enable researchers to quantify the soundness and limitations of their findings. Combining principal stratification with a maximum likelihood approach, we have developed methods for estimating the causal effect of vaccination on postinfection outcomes in those individuals who would be infected under vaccine and placebo. Closed forms for the MLE of the causal estimand have been derived assuming extreme upper, extreme lower, and no selection bias models. These results indicate specific assumptions under which currently used efficacy estimators have a causal interpretation; for example, the usual net postinfection efficacy estimator is the MLE of our causal estimand under the assumption of no selection bias. In contrast, the usual intent-to-treat estimator provides an upper bound on the postinfection causal vaccine effect when the protective effect of the vaccine against infection is not too large (i.e., VE^s1SARp). We have also presented three approaches to sensitivity analysis that allow selection models to range from no selection to the extreme maximum possible levels. Coupled with confidence intervals, these sensitivity analyses provide a comprehensive description of the uncertainty regarding VEI. For example, in the analysis of the pertussis vaccine study, the estimated postinfection vaccine effect was significantly greater than 0 for all selection bias models, including the extreme lower bound model, indicating a significant causal effect in reducing disease severity. In other words, under only the standard assumptions of SUTVA, monotonicity, and independence of treatment assignment, we have shown that there is a definitive causal vaccine effect on severity of pertussis in individuals who would be infected whether vaccinated or not.

Several extensions of this work and avenues for further research merit mention. For instance, further research is needed on the consequences of relaxing several key assumptions. Regarding Assumption 1, SUTVA is clearly violated in many vaccine field studies (Rubin 1990a; Halloran and Struchiner 1995). When SUTVA is violated, an individual can have different potential infection outcomes under the same treatment assignment. Therefore, defining the basic principal stratum for an individual needs to account for the possible treatment assignments of other individuals. Assumption 2 (independence) clearly holds in the rotavirus example because the trial was randomized, and likely also holds in the pertussis field study, as discussed in Section 3.4.2. In general, the effect of nonrandom allocation of the vaccine can be explored through additional sensitivity analyses and other approaches for observational studies (Rosenbaum 1995). In addition, exclusion restrictions not considered here may be necessary if the independence assumption is not made. Assumption 3 (monotonicity) also likely holds for the rotavirus and pertussis examples, because VE^S0 in both studies. Relaxing Assumption 3 to allow for a harmful vaccine effect on infection would make identifiability and estimation of the postinfection causal estimand VEI more difficult. However, a vaccine that increases susceptibility to infection would be unacceptable from a public health standpoint, so that concerns about postinfection selection bias become less relevant. Finally, we have assumed that assignment to a treatment is equivalent to receipt of that treatment. The increased complexity of defining, identifying, and estimating causal effects in the presence of noncompliance (Angrist et al. 1996) would likely necessitate further exclusion restriction assumptions (e.g., given the actual treatment received, assignment to treatment carries no additional information).

Additional research is also needed to incorporate baseline covariates and allow for missing or mismeasured infection or postinfection outcomes. The operating characteristics of conditional and unconditional (i.e., ITT) tests for binary postinfection vaccine effects remain unknown. Our preliminary comparisons (results not shown) suggest a conditional approach [similar to that of Hudgens et al. (2003)] may be more powerful. These results are consistent with the findings of Mehrotra, Li, and Gilbert (2005). Finally, as pointed out by a reviewer, the multivariate literature on characterizing the set of possible joint distributions given fixed marginal or conditional distributions (see, e.g., Joe 1997) may have direct application in the principal stratification setting.

Correct assessment of vaccine effects on postinfection endpoints has important public health consequences. For example, there is precedent and concern for vaccine-induced immune responses causing disease enhancement (Kliks, Nisalak, Brandt, Wahl, and Burke 1989; Burke 1992; Mascola et al. 1993; Nabel 2001). If the estimated net postinfection vaccine effect suggested a possible harmful effect, then a vaccine could be wrongfully discounted if, in fact, the apparent negative effect were due to selection bias. Our methods could be used to determine whether such an observed effect was consistent with selection bias. In contrast, if the estimated net vaccine effect were positive, then one might be less compelled to distinguish (a) a selective effect of the vaccine on infection from (b) the causal effect of the vaccine in the doomed principal stratum, because either effect is beneficial from a public health perspective. Disentangling (a) from (b) could be quite valuable from a scientific standpoint, however. For example, our analysis of the pertussis vaccine study provides evidence that vaccination has an effect on disease severity above and beyond that attributable to selection bias, suggesting the vaccine produces an immune response that directly hinders severe symptoms, such as paroxysms with whoops. This could suggest directions for further biological research. Another example arises in HIV, where researchers are considering both antibody and T-cell–based vaccines (McMichael, Mwau, and Hanke 2002; Nabel 2001) under the hypothesis that the former will protect from infection and the latter will protect from disease or death. If a combination vaccine proves effective in reducing the risk of morbidity or mortality, then researchers will be interested in understanding the mechanism of protection, that is, to which vaccine component is such an effect attributable. Analysis revealing a signifi-cant post-infection causal vaccine effect in the doomed stratum would provide evidence of T-cell–based protection. Separating (a) and (b) also might be helpful in predicting the success of particular vaccination programs (Hayes, Alexander, Bennett, and Cousens 2000).

Although in this article we have handled the postinfection outcomes of transmission and disease severity or death similarly, there are important differences. In particular, vaccine efficacy for infectiousness (VEI) involves transmission to another susceptible individual, whereas vaccine efficacy for severity or progression (VEP) involves only the infected individual. Thus, unlike VEP, VEI can depend on the characteristics of the individuals susceptible to secondary infection, as well as on the type of contacts between individuals. Also, an infected individual could expose more than one individual, so that there could be several potential postinfection outcomes, one for each exposed individual; for example, the study be in Préziosi and Halloran (2003a), more than 20 individuals were exposed to one infected individual in some cases. Alternatively, an infected individual may expose no other susceptibles, and then the potential postinfection outcome would be undefined. Our method needs to be extended to such multiple outcomes and possible dependencies.

In most vaccine studies in humans, whether or not an individual becomes infected is not under the control of the investigator. The potential postinfection outcomes for individuals who would not become infected under either vaccine or placebo are undefined, or are a priori counterfactual (Frangakis and Rubin 2002), resulting in our defining the postinfection causal estimand in the doomed stratum only. However, there are a few infectious agents that can be used in infection-challenge studies in either human or animal. For these types of studies, a more appropriate approach might be that of Robins and Greenland (1992, 1994), which assumes that the posttreatment variable is controllable; that is, the investigator can force the individuals to take on any possible value of the posttreatment variable. Comparison of the different approaches merits further research.

Acknowledgments

This work was supported in part by National Institutes of Health grants R37-AI29168, R01-AI054165, and R01-AI32042. Halloran was also a visiting faculty member in the Statistics Division, Los Alamos National Laboratories. The authors thanks Peter Gilbert, Andrew Hill, Marie-Pierre Préziosi, and Bryan Shepherd for helpful discussions.

APPENDIX: MAXIMUM LIKELIHOOD ESTIMATION

Let ξ = θ01γ1 + θ11ϕ.1 and consider maximizing the log-likelihood,

no(p)log(θ00)+n10(p)log(θ01+θ11ξ)+n11(p)log(ξ)+n0(v)log(θ00+θ01)+n10(v)log(θ11(1ϕ.1))+n11(v)log(θ11ϕ1.),

subject to the constraints

θ00+θ01+θ11=1,
0θij,i,j=0,1;ij,
0ξθ01+θ11,
0ϕ1.1.

The maximum of ϕ1. is given by ϕ^1.=n11(v)n1.(v), such that we are left with the simpler problem of maximizing

l(θ,ξ)=n0(p)log(θ00)+n10(p)log(θ01+θ11ξ)+n11(p)log(ξ)+n0(v)log(θ00+θ01)+n1.(v)log(θ11),

subject to (A.1)–(A.3).

The function 1(θ,ξ) is strictly concave over the closed, convex parameter space defined by (A.1)–(A.3) such that a unique MLE exists and the (first-order) Kuhn–Tucker conditions are necessary and sufficient for optimality (Fletcher 1987). In particular, define Lagrange multipliers λ0,λ1,…,λ5 and Lagrangian function

L(θ,ξ)=l(θ,ξ)+λ0(1θij)+λ1θ00+λ2θ01+λ3θ11+λ4ξ+λ5(θ01+θ11ξ),

where the summation is over i, j = 0, 1 such that ij. Then (θ^,ξ^) is an MLE iff it is a feasible point according to (A.1)–(A.3) and satisfies

L(θ,ξ)θij=0,i,j=0,1;ij,
L(θ,ξ)ξ=0,
λi0,i=1,...,5,

and

λ0(1θij)=λ1θ00=λ2θ01=λ3θ11=λ4ξ=λ5(θ01+θ11ξ)=0.

Under the assumption nsy(z) > 0 for all possible combinations of s, y, and z, we must have θ^00,θ^11, and ξ^ strictly greater than 0 to maximize l(θ, ξ), such that λ1 = λ3 = λ4 = 0. Similarly, we must also have ξ^θ^01+θ^11, such that λ5 = 0. Multiplying (A.5) and (A.6) by θij and ξ and summing,

θijL(θ,ξ)θij+ξL(θ,ξ)ξ=0

can be shown to yield λ0 = n. It follows that (θ^,ξ^) is an MLE iff it satisfies (A.1)–(A.3) and

l(θ,ξ)θ00=n0(p)θ00+no(v)θ00+θ01=n,
l(θ,ξ)θ01=n10(p)θ01+θ11ξ+n0(v)θ00+θ01n,
l(θ,ξ)θ11=n10(p)θ01+θ11ξ+n10(v)+n11(v)θ11=n,

and

l(θ,ξ)ξ=n11(p)ξn10(p)θ01+θ11ξ=0.

We can show the MLE of θ has the following closed form that depends on the data. In particular, if

n0(v)n(v)n0(p)n(p),

then the MLEs are given by

θ^00=n0(p)n(p),
θ^01=n0(v)n(v)n0(p)n(p),

and

θ^11=1n0(v)n(v).

Otherwise,

θ^00=n0(p)+n0(v)n,
θ^01=0,

and

θ^11=1n0(p)+n0(v)n.

In either case, the MLE of ξ is given by

ξ^=n11(p)n1.(p)(θ^01+θ^11).

Contributor Information

Michael G. Hudgens, Research Assistant Professor, Department of Biostatistics, University of North Carolina, Chapel Hill, NC 27599 (E-mail: mhudgens@bios.unc.edu).

M. Elizabeth Halloran, Professor, Department of Biostatistics, Emory University, Atlanta, GA 30322 (E-mail: mehallo@sph. emory.edu)..

REFERENCES

  1. Angrist JD, Imbens GW, Rubin DB. Identification of Causal Effects Using Instrumental Variables. Journal of the American Statistical Association. 1996;91:444–455. [Google Scholar]
  2. Balke A, Pearl J. Bounds on Treatment Effects From Studies With Imperfect Compliance. Journal of the American Statistical Association. 1997;92:1171–1176. [Google Scholar]
  3. Barndorff-Nielsen OE, Cox DR. Inference and Asymptotics. Chapman & Hall; New York: 1994. [Google Scholar]
  4. Burke DS. Human HIV Vaccine Trials: Does Antibody-Dependent Enhancement Pose a Genuine Risk? Perspectives in Biology and Medicine. 1992;35:511–530. doi: 10.1353/pbm.1992.0048. [DOI] [PubMed] [Google Scholar]
  5. Clements-Mann ML. Lessons for AIDS Vaccine Development From Non-AIDS Vaccines. AIDS Research and Human Retroviruses. 1998;14(Suppl 3):S197–203. [PubMed] [Google Scholar]
  6. Cox DR. Planning of Experiments. Wiley; New York: 1958. [Google Scholar]
  7. Fletcher R. Practical Methods of Optimization. Wiley; New York: 1987. [Google Scholar]
  8. Frangakis CE, Rubin DB. Principal Stratification in Causal Inference. Biometrics. 2002;58:21–29. doi: 10.1111/j.0006-341x.2002.00021.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Gilbert PB, Bosch R, Hudgens MG. Sensitivity Analysis for the Assessment of Causal Vaccine Effects on Viral Load in HIV Vaccine Trials. Biometrics. 2003a;59:531–541. doi: 10.1111/1541-0420.00063. [DOI] [PubMed] [Google Scholar]
  10. Gilbert PB, DeGruttola V, Hudgens MG, Self SG, Hammer SM, Corey L. What Constitutes Efficacy for a Human Immunodeficiency Virus Vaccine That Ameliorates Viremia: Issues Involving Surrogate End Points in Phase 3 Trials. Journal of Infectious Diseases. 2003b;188:179–193. doi: 10.1086/376449. [DOI] [PubMed] [Google Scholar]
  11. Halloran ME, Préziosi M-P, Chu H. Estimating Vaccine Efficacy From Secondary Attack Rates. Journal of the American Statistical Association. 2003;98:38–46. [Google Scholar]
  12. Halloran ME, Struchiner CJ. Causal Inference in Infectious Diseases. Epidemiology. 1995;6:142–151. doi: 10.1097/00001648-199503000-00010. [DOI] [PubMed] [Google Scholar]
  13. Halloran ME, Struchiner CJ, Longini IM. Study Designs for Evaluating Different Efficacy and Effectiveness Aspects of Vaccines. American Journal of Epidemiology. 1997;146:789–803. doi: 10.1093/oxfordjournals.aje.a009196. [DOI] [PubMed] [Google Scholar]
  14. Hayes RJ, Alexander NDE, Bennett S, Cousens SN. Design and Analysis Issues in Cluster-Randomized Trials of Interventions Against Infectious Diseases. Statistical Methods in Medical Research. 2000;9:95–116. doi: 10.1177/096228020000900203. [DOI] [PubMed] [Google Scholar]
  15. Holland PW. Statistics and Causal Inference. Journal of the American Statistical Association. 1986;81:945–960. [Google Scholar]
  16. Hudgens MG, Hoering A, Self SG. On the Analysis of Viral Load Endpoints in HIV Vaccine Trials. Statistics in Medicine. 2003;22:2281–2298. doi: 10.1002/sim.1394. [DOI] [PubMed] [Google Scholar]
  17. Imbens GW, Rubin DB. Bayesian Inference for Causal Effects in Randomized Experiments With Noncompliance. The Annals of Statistics. 1997;25:305–327. [Google Scholar]
  18. Joe H. Multivariate Models and Dependence Concepts. Chapman & Hall; London: 1997. [Google Scholar]
  19. Kendrick P, Eldering G.A Study in Active Immunization Against Pertussis American Journal of Hygiene, Sect. B, 193938133 [Google Scholar]
  20. Kliks SC, Nisalak A, Brandt WE, Wahl L, Burke DS. Antibody-Dependent Enhancement of Dengue Virus Growth in Human Monocytes as a Risk Factor for Dengue Hemorrhagic Fever. American Journal of Tropical Medicine and Hygiene. 1989;40:444–451. doi: 10.4269/ajtmh.1989.40.444. [DOI] [PubMed] [Google Scholar]
  21. Longini IM, Datta S, Halloran ME. Measuring Vaccine Efficacy for Both Susceptibility to Infection and Reduction in Infectiousness for Prophylactic HIV-1 Vaccines. Journal of Acquired Immune Deficiency Syndromes and Human Retrovirology. 1996;13:440–447. doi: 10.1097/00042560-199612150-00007. [DOI] [PubMed] [Google Scholar]
  22. Manski CF. Nonparametric Bounds on Treatment Effects. American Economic Review, Papers and Proceedings. 1990;80:319–323. [Google Scholar]
  23. Mascola JR, Mathieson BJ, Zack PM, Walker MC, Halstead SB, Burke DS. Summary Report: Workshop on the Potential Risks of Antibody-Dependent Enhancement in Human HIV Vaccine Trials. AIDS Research and Human Retroviruses. 1993;9:1175–1184. doi: 10.1089/aid.1993.9.1175. [DOI] [PubMed] [Google Scholar]
  24. McMichael A, Mwau M, Hanke T. Design and Tests of an HIV Vaccine. British Medical Bulletin. 2002;62:87–98. doi: 10.1093/bmb/62.1.87. [DOI] [PubMed] [Google Scholar]
  25. Mehrotra DV, Li X, Gilbert PB.Dual-Endpoint Evaluation of Vaccine Efficacy: Application to a Proof-of-Concept Clinical Trial of a Cell-Mediated Immunity-Based HIV Vaccine BARDS Technical Report 111, 2005Merck Research Laboratories [Google Scholar]
  26. Molenberghs G, Kenward MG, Goetghebeur E. Sensitivity Analysis for Incomplete Contingency Tables: The Slovenian Plebiscite Case. Applied Statistics. 2001;50:15–29. [Google Scholar]
  27. Nabel GJ. Challenges and Opportunities for Development of an AIDS Vaccine. Nature. 2001;410:1002–1007. doi: 10.1038/35073500. [DOI] [PubMed] [Google Scholar]
  28. Préziosi M-P, Halloran ME. Effects of Pertussis Vaccination on Transmission: Vaccine Efficacy for Infectiousness. Vaccine. 2003a;21:1853–1861. doi: 10.1016/s0264-410x(03)00007-0. [DOI] [PubMed] [Google Scholar]
  29. Préziosi M-P, Halloran ME. Effects of Pertussis Vaccination on Severity: Vaccine Efficacy for Clinical Severity. Clinical Infectious Diseases. 2003b;37:772–779. doi: 10.1086/377270. [DOI] [PubMed] [Google Scholar]
  30. Robins JM, Greenland S. Identifiability and Exchangeability for Direct and Indirect Effects. Epidemiology. 1992;3:143–155. doi: 10.1097/00001648-199203000-00013. [DOI] [PubMed] [Google Scholar]
  31. Robins JM, Greenland S. Adjusting for Differential Rates of Prophylaxis Therapy for PCP in High- versus Low-Dose AZT Treatment Arms in an AIDS Randomized Trial. Journal of the American Statistical Association. 1994;89:737–749. [Google Scholar]
  32. Robins JM, Rotnitzky A, Scharfstein DO. Sensitivity Analysis for Selection Bias and Unmeasured Confounding in Missing Data and Causal Inference Models. In: Halloran ME, Berry DA, editors. Statistics in Epidemiology, Environment and Clinical Trials. Springer-Verlag; New York: 2000. pp. 1–94. [Google Scholar]
  33. Rosenbaum PR. The Consequences of Adjustment for a Concomitant Variable That Has Been Affected by the Treatment. Journal of the Royal Statistical Society. 1984;147:656–666. [Google Scholar]
  34. Rosenbaum PR. Observational Studies. Springer-Verlag; Berlin: 1995. [Google Scholar]
  35. Rubin DB. Bayesian Inference for Causal Effects: The Role of Randomization. The Annals of Statistics. 1978;6:34–58. [Google Scholar]
  36. Rubin DB. Comment: Neyman (1923) and Causal Inference in Experiments and Observations Studies. Statistical Science. 1990a;5:472–480. [Google Scholar]
  37. Rubin DB. Formal Modes of Statistical Inference for Causal Effects. Journal of Statistical Planning and Inference. 1990b;25:279–292. [Google Scholar]
  38. Rubin DB, Dawid AP, Comment onCausal Inference Without Counterfactuals Journal of the American Statistical Association 200095435–437. [Google Scholar]
  39. Scharfstein DO, Rotnitzky A, Robins JM. Adjusting for Nonignorable Drop-Out Using Semiparametric Nonresponse Models. Journal of the American Statistical Association. 1999;94:1096–1146. [Google Scholar]
  40. Struchiner CJ, Halloran ME.Randomization and Baseline Transmission in Vaccine Field Trials technical report, 1996Emory University, Dept. of Biostatistics; [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Vazquez M, LaRussa PS, Gershon AA, Steinberg SP, Freudigman K, Shapiro ED. The Effectiveness of the Varicella Vaccine in Clincial Practice. New England Journal of Medicine. 2001;344:955–960. doi: 10.1056/NEJM200103293441302. [DOI] [PubMed] [Google Scholar]
  42. Vesikari T, Rautanen T, Varis T, Beards GM, Kapikian AZ. Rhesus Rotavirus Candidate Vaccine: Clinical Trial in Children Vaccinated Between 2 and 5 Months of Age. American Journal of Diseases of Children. 1990;144:285–289. [PubMed] [Google Scholar]

RESOURCES