Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2014 May 29;33(21):3601–3628. doi: 10.1002/sim.6181

Identification and estimation of survivor average causal effects

Eric J Tchetgen Tchetgen 1,*,
PMCID: PMC4131726  NIHMSID: NIHMS585592  PMID: 24889022

Abstract

In longitudinal studies, outcomes ascertained at follow-up are typically undefined for individuals who die prior to the follow-up visit. In such settings, outcomes are said to be truncated by death and inference about the effects of a point treatment or exposure, restricted to individuals alive at the follow-up visit, could be biased even if as in experimental studies, treatment assignment were randomized. To account for truncation by death, the survivor average causal effect (SACE) defines the effect of treatment on the outcome for the subset of individuals who would have survived regardless of exposure status. In this paper, the author nonparametrically identifies SACE by leveraging post-exposure longitudinal correlates of survival and outcome that may also mediate the exposure effects on survival and outcome. Nonparametric identification is achieved by supposing that the longitudinal data arise from a certain nonparametric structural equations model and by making the monotonicity assumption that the effect of exposure on survival agrees in its direction across individuals. A novel weighted analysis involving a consistent estimate of the survival process is shown to produce consistent estimates of SACE. A data illustration is given, and the methods are extended to the context of time-varying exposures. We discuss a sensitivity analysis framework that relaxes assumptions about independent errors in the nonparametric structural equations model and may be used to assess the extent to which inference may be altered by a violation of key identifying assumptions. © 2014 The Authors. Statistics in Medicine published by John Wiley & Sons, Ltd.

Keywords: truncation by death, principal stratification, double robust, sensitivity analysis

1. Introduction

In longitudinal studies, it sometimes happens that individuals die between follow-up visits, in which case, unobserved outcomes that would have been ascertained during follow-up are said to be truncated by death. It is well known that inference about the effects of a point treatment or exposure, restricted to individuals alive at a follow-up visit, could be biased even if as in experimental studies, treatment assignment were randomized. Similarly, it may be that a vaccine studied in a randomized trial has a protective effect against a viral infection for some but not all individuals in the study. Then, viral load associated with the infection would not be observed unless a person became infected, which is a post-randomization event. As for truncation by death, an evaluation of the effects of the vaccine on viral load among infected individuals in the study likewise could be biased. Such bias may be present, if as we expect is likely the case in the aforementioned settings, there are downstream effects of the exposure or treatment, which affect survival or post-randomization infection, and the outcome of interest. A more fundamental issue is that the outcome may not be well defined for individuals who die or remain uninfected by the virus, under either exposure status, and therefore, it is not clear that a causal effect of the exposure can be defined for such individuals. In order to appropriately account for truncation by death, one can define the survivor average causal effect (SACE), which is the effect of exposure on the outcome for the subset of individuals that would have survived regardless of exposure status 1,2. SACE is an instance of what is sometimes referred to as a principal strata causal effect 24. An analogous principal strata causal effect is likewise defined for the effect of vaccine on viral load, among infected individuals for whom the vaccine has no effect on HIV infection 58. Throughout the paper, we refer to these two types of effects as SACE without further distinguishing between their respective contexts. In this paper, SACE is shown to be nonparametrically identified by leveraging post-exposure longitudinal correlates of survival and outcome that may also mediate exposure effects on survival and outcome. Nonparametric identification is achieved by supposing that the longitudinal data arise from a certain nonparametric structural equations model (NPSEM) 9 and by making the monotonicity assumption that the effect of exposure on survival agrees in its direction across individuals. Under these assumptions, a novel yet simple weighted analysis with weights involving the survival process is shown to produce consistent estimates of SACE, provided that the survival process is estimated consistently. A number of alternative estimators are also described, some with interesting theoretical properties. However, it is argued that these other estimators may be more difficult to implement in practice, particularly in studies with prolonged follow-up, where the simple weighted analysis extends with little additional computational difficulty. An illustration of the simple weighted analysis is given in an application concerning the effects of smoking history on decline of cognitive function in an aging population subject to truncation by death. A general sensitivity analysis technique is described, to assess the extent to which inference might be affected by a violation of an assumption that all common causes of survival and the outcome are fully observed. Finally, in the context of time-updated exposures, the survivor marginal structural model (SMSM) is introduced, which amounts to a standard marginal structural model (MSM) for the subpopulation that would survive irrespective of treatment history. A weighted approach is described for estimating the parameters of an SMSM.

2. A simple three-occasion study

2.1 Causal diagram interpretation of biased analyses

By way of introduction, first consider a simplified version of the study of the effects of smoking on cognitive function decline, here restricted to only three longitudinal occasions, as depicted in the causal diagram in Figure 1.

Figure 1.

Figure 1

Causal diagram associated with the nonparametric structural equations model Equations 14.

The simplified design consists of a baseline j = 0 at which binary smoking status A is observed, and two follow-up contacts, with cognitive function and other covariates assessed at each j = 1,2. Until otherwise stated, assume that all respondents participate at the first follow-up, and thus data C1 are collected for all subjects at j = 1, but some individuals die before the second follow-up, with S = 1 indicating survival. Suppose that in addition to cognitive function, C1 includes all causes of survival (S) that are also correlated with the outcome Y, with the latter defined as cognitive decline between j = 1 and j = 2. The assumption that we have measured important correlates of S and Y will be key to our developments. As indicated in the diagram of Figure 1, any variable in C1 may itself contain effects of smoking that simultaneously affect survival and the outcome Y among survivors. Potential examples of exposure-induced correlates of S and Y to consider for C1 include measures obtained at time j = 1 of alcohol consumption, disability score, cognitive score, self-rated health, and smoking status. In addition, as indicated in Figure 1, we will allow survival to be directly affected by A not through C1, and we will also assume that death is the only source of attrition in this study. Throughout, we assume no measurement error. The double-headed arrow between C1 and Y encodes possible unmeasured common causes of say cognitive function at follow-up and change in cognitive function Y, the presence of which cannot be ruled out with certainty. For instance, there is evidence for genetic determinants of Alzheimer’s disease that suggests a genetic basis for an individual’s cognitive function over time 10. Thus, we would expect a genetic-induced association between cognitive function measured at time j = 1 and cognitive decline between j = 1 and = 2, because such genetic information was not available for adjustment in this study. Note that while the aforementioned genetic determinant of cognitive decline cannot be directly affected by smoking behavior, other unmeasured common causes of cognitive function and cognitive decline might also include unknown epigenetic effects of smoking behavior and, therefore, could also be directly affected and thus mediate the effects of smoking. For simplicity, we further assume until later sections that all analyses are stratified by pre-exposure confounders of A, which is suppressed in the notation, such that the effects of A on (C1,Y,S) are unconfounded, and therefore, A behaves as if it were randomized.

The causal diagram in Figure 1 can be used to formalize the bias associated with an analysis of the effects of baseline smoking status A on decline of cognitive function Y, conditional on being alive at the end of follow-up. To understand how this bias arises, it suffices to note that by d-separation 9, such an analysis would unblock the noncausal pathways ASC1Y and AC1Y, thus indicating an effect of smoking on cognitive decline even if there were none. One should also note that further conditioning on C1 does not resolve the difficulty, because doing so does not block the pathway AC1Y . Both of these strategies essentially fail because conditioning on S implies conditioning on a collider on the pathway ASC1Y as in the first strategy, but also implies conditioning on a direct descendant of a collider on the pathway AC1Y, which induces noncausal associations between A and Y 9. Further conditioning on C1 as in the second strategy earlier does not really help resolve this issue because it also implies conditioning on a collider on the pathway AC1Y. Collider bias is invariably the pitfall, and a primary source of bias, for most analyses involving conditioning on a post-exposure event.

2.2 Nonparametric structural equations model

The following exposition is framed around a structural equation theory of causal inference, described by Pearl 9. Structural equations provide a nonparametric algebraic interpretation of the diagram of Figure 1 corresponding to four functions, one for each variable on the causal graph:

graphic file with name sim0033-3601-m1.jpg (1)
graphic file with name sim0033-3601-m2.jpg (2)
graphic file with name sim0033-3601-m3.jpg (3)
graphic file with name sim0033-3601-m4.jpg (4)

Each of the nonparametric functions {gA, …, gY} represents a causal mechanism that determines the value of the left variable, known as the output, from variables on the right, known as the inputs 9. The errors Inline graphic stand for all factors not included on the graph that could possibly affect their corresponding outputs when all other inputs are held constant. For instance, ɛS includes all causes of death unrelated to cognitive function decline. To be consistent with the causal graph presented in Figure 1, we require that the errors (ɛA,ɛS) be mutually independent, and we require that they be jointly independent of Inline graphic. However, as indicated by the double arrow edge in Figure 1, Inline graphic may not be independent of ɛY . We allow all error distributions to otherwise remain arbitrary. Lack of a causal effect of a given variable on an output is encoded by an absence of the variable from the right-hand side. For example, the absence of a direct effect of smoking on cognitive function at the first follow-up would imply removing A from the arguments of Inline graphic encoding the assumption that variations in A leave C1 unchanged, as long as Inline graphic remains constant, which is consistent with the assumption that there is no unmeasured common cause of smoking and cognitive function.

The last equation makes explicit the fact that Y is observed only among survivors with (S = 1), with corresponding structural equation gY (A,C1,ɛY ). As stated by Pearl 9, the invariance of structural equations permits their use as a basis for modeling causal effects and counterfactuals. In fact, to emulate the intervention in which one sets {A = a} for all individuals simply amounts to replacing the equation for A with A = a, producing the following set of modified equations:

graphic file with name sim0033-3601-m10.jpg (5)
graphic file with name sim0033-3601-m11.jpg (6)
graphic file with name sim0033-3601-m12.jpg (7)
graphic file with name sim0033-3601-m13.jpg (8)

with (C1(a),S(a),Y (a)) denoting the counterfactual outcomes had smoking status been set to a (possibly contrary to fact). We emphasize that while the model specifies a structural equation for survival, survival is not manipulable and, together with Y, should be understood as part of the outcome produced by the system of equations. As previously observed 11, structural equations are particularly helpful to clarify the difficulty with interpreting the effect of smoking when truncation by death is present. Specifically, we note that the individual effect of smoking is recovered by taking the contrast Y (a = 1) − Y (a = 0), which clearly is defined only for individuals in the principal stratum {S(0) = S(1) = 1} and is equal to

graphic file with name sim0033-3601-m14.jpg

and the associated population average gives the SACE estimand denoted β:

graphic file with name sim0033-3601-m15.jpg

The SACE is generally not identified without additional assumptions, even under an NPSEM. At one end of the spectrum of possible identifying assumptions, one might assume that the sharp null hypothesis holds that for all individuals in the population, A has no individual causal effect on survival, that is, S(a = 1) = S(a = 0) = 1 almost surely. The assumption implies that individuals who survive under an exposure status constitute a random sample of individuals who would survive irrespective of exposure value. Then, it is straightforward to verify that

graphic file with name sim0033-3601-m16.jpg

In light of existing scientific evidence on harmful effects of smoking on human health, the above identifying assumption of no individual causal effect of smoking on survival is clearly inappropriate, and therefore, the aforementioned equation is unlikely to be correct.

At the opposite end of the spectrum of possible identifying assumptions, a strategy that is sometimes adopted entails performing a sensitivity analysis, using data on (A,SY,S) 7,1214, possibly also incorporating pre-exposure covariates 9. A sensitivity analysis then typically involves recovering an estimate of SACE upon making a monotonicity assumption about the effects of exposure on survival and by fixing certain nonidendified parameters involving the joint distribution of potential outcomes to some hypothetical value, which is then varied over a certain range to assess the degree to which the estimate of SACE changes as a function of these parameters. The monotonicity assumption may sometimes be relaxed by introducing additional sensitivity parameters 15,16. Worst-case scenarios of a sensitivity analysis give rise to bounds for SACE, but such bounds typically only apply in rather simple settings 3,17. For instance, it is not clear whether such bounds can easily be constructed in the presence of numerous baseline and/or time-dependent covariates and with a continuous possibly unbounded outcome.

An alternative identifying assumption that is sometimes made in the principal strata literature and that falls somewhere in between the aforementioned extremes involves assuming that certain potential outcome independencies about the outcome of interest and survival can be obtained upon conditioning on enough pre-exposure covariates, such that SACE becomes identified within levels of such covariates Hayden et al. 18. Zhang et al. 19 proposed to identify SACE under a parametric model using the maximum likelihood estimation. They discussed identifiability for a mixture normal model; however, as pointed out by Ding et al. 20, the mixture normal model may not be identifiable under the extreme case that the probability of each latent component is the same. For instance, when the outcome of interest is binary, even with a parametric binary mixture model, the causal parameter of interest is still not identifiable without some further assumptions 20. An alternative approach is given by Ding et al. 20 who exploit a fairly strong form of exclusion restriction for an observed pre-exposure covariate to obtain nonparametric identification of SACE. They essentially require that an association between the covariate in question and survival exists solely because the former is an effect of principal strata and thus cannot be a causal risk factor for death nor can it be affected by a common unmeasured risk factor for death. A variable satisfying these rather stringent conditions may be hard to find in most health-related applications, thus limiting the extent to which their proposed framework may be of practical use. Nolen and Hudgens 21 proposed an elegant randomization-based approach about causal effects within principal strata—possibly within pre-exposure covariate levels—that is particularly useful for testing the null hypothesis of no principal strata causal effect but relies for identification and estimation away from the null, on the assumption of a constant individual principal strata causal effect.

The aforementioned methods all share a notable limitation, in that none appears to appropriately incorporate risk factors of survival available in post-exposure follow-up. In our hypothetical example, it is unclear whether these methods, including sensitivity analysis techniques, can make use of follow-up data collected in C1 such as post-exposure cognitive function, a correlate of survival that is likely affected by smoking status. A somewhat related context is considered by Dai et al. 22, who develop a partially hidden Markov model for time-varying principal stratification in HIV prevential trials. Their proposed approach however relies on a categorization of the intermediate variables into discrete event types and therefore does not easily generalize for continuous or high-dimensional intermediate variables. Tchetgen Tchetgen et al. 11 provide an alternative interpretation of standard inverse probability weighting for dependent censoring, incorporating time-updated covariates. They show that under the NPSEM defined by Equations 14, the causal effect identified by inverse probability weighting survivors, in fact, naturally incorporates principal strata causal effects therefore formally establishing a previously unknown relation between inverse probability weighting and principal stratification, two seemingly unrelated analytic frameworks. However, their proposed model does not exactly identify SACE without further assumptions. In the next section, an alternative approach is proposed that does not suffer from these possible limitations. But first, we refine the usual definition of SACE to rule out certain pathological situations. As discussed earlier, SACE is typically defined to be the average exposure effect for individuals that would survive irrespective of exposure. One might further refine the definition of SACE by considering a person’s survival status S(a = 1,C1(a = 0)) in the hypothetical situation in which the person smoked, but C1 behaved as if the individual did not smoke; likewise, one could consider a person’s survival status S(a = 0,C1(a = 1)) in the hypothetical scenario where the person did not smoke, but C1 behaved as if the person smoked. Such ‘cross-world’ potential outcomes feature prominently in recent literature on causal mediation analysis, where they are used for formal causal definitions of direct and indirect effects of an exposure 23,24. They are formally defined by composition of functions defining the NPSEM each possibly evaluated under different exposure values. Consider the following set of an individual’s potential outcomes

graphic file with name sim0033-3601-m17.jpg

which allows further distinction between individuals who would survive irrespective of exposure, that is, S(a = 0) = S(a = 1) = 1. For instance, it may be that a person that would survive whether exposed or not would not survive in certain cross-world situations S(a = 1,C1(a = 0)) = S(a = 0,C1(a = 1)) = 0. Such an individual would be considered rather unusual, and hereafter, the causal effect of exposure for such a person is not further considered, and SACE is redefined to be the causal effect of exposure for individuals that would survive regardless of exposure, including under cross-world situations:

graphic file with name sim0033-3601-m18.jpg

If one were to a priori rule out the possibility that an individual that would survive irrespective of exposure status could die under cross-world conditions, then it would also be that ψ = β, and therefore, the more stringent definition of SACE would match the more common definition. An alternative and perhaps more intuitive condition for this equality is that for every survivor, absence of an individual total effect of exposure on survival implies absence of both an individual direct effect not mediated by C1 and an individual indirect effect of exposure mediated by C1, thus ruling out the possibility of countervailing direct and indirect effects resulting in a null individual total effect.

2.3. Identification of survivor average causal effect

Identification of SACE requires, in addition to the NPSEM assumptions, that we make the following assumptions:

  1. Monotonicity assumption:
    graphic file with name sim0033-3601-m19.jpg

The contrast S(a = 1,C1(a*)) − S(a = 0,C1(a*)) is known in the causal mediation literature as the pure or natural direct effect of exposure, in a hypothetical situation where C1 is set to what it would be under smoking status a*, and captures the direct effects of exposure not mediated by C1. Thus, the assumption states that there is no individual in the population, for whom smoking provides a protective individual direct effect on survival.

In order to state the second assumption, consider the following subsets of individuals. Let Inline graphicdenote the subset of individuals that would survive regardless of smoking, in a hypothetical situation where C1 would behave as if they did not smoke, that is, S(a = 0,C1(a = 0)) = S(a = 1,C1(a = 0)) = 1, and let Inline graphic denote the subset of individuals that would survive irrespective of smoking, in a hypothetical situation where C1 would behave as if they smoked, that is, S(a = 0,C1(a = 1)) = S(a = 1,C1(a = 1)) = 1.

  1. Concordant survivorship assumption: Inline graphic almost surely.

This second assumption states that individuals that would survive irrespective of smoking status, in a hypothetical situation where C1 behaved as if they smoked, would also survive irrespective of smoking status, in the hypothetical situation where C1 behaved as if they did not smoke and that the converse also holds. In terminology used in causal mediation analysis, the assumption states that there is no survivor for whom the natural direct effect of A on S is null when C1 is held at the value it would have been under no exposure, and yet the natural direct effect when C1 is held at the value it would have been under active exposure is non-null. Under the assumption, the converse is also ruled out. A sufficient condition for the assumption is that A and C1 do not interact at the individual level (on the additive scale) in causing S, however the assumption may still hold even if this were not the case. Another interesting special case is if C1 intercepts all causal pathways between A and S, then the assumption (and monotonicity) holds trivially because S(a = 0,C1(a*)) − S(a = 1,C1(a*)) = 0 for all a*. This latter situation highlights the importance of including in C1 mediating factors of the effects A on S, which may potentially interact with exposure.

We are now ready to state our identification result:

Theorem 1

Under the NPSEM given by Equations 14, and under the monotonicity assumption, and the concordant survivorship assumption, we have that SACE:

graphic file with name sim0033-3601-m23.jpg

is nonparametrically identified and is given by μ1μ0, where

graphic file with name sim0033-3601-m24.jpg

and

graphic file with name sim0033-3601-m25.jpg 9

where F(u1 | u2) stands for the CDF of U1 given U2, evaluated at U1 = u1, U2 = u2, and it is assumed that Pr(S = 1,C1 = c | A = 1) / Pr(S = 1,C1 = c | A = 0) < ∞ .

Proofs of all theorems are given in the Appendix. According to the theorem, estimation of the average survivor outcome for smoking presents no particular difficulty and can be obtained by the simple average outcome for the exposed individuals who survived. The situation is quite different for the average survivor outcome for the nonsmoking exposure status. The theorem states that this average can be obtained by using the expression given in Equation 17. Intuition is gained by comparing this expression with that of the average outcome for unexposed individuals who actually survived:

graphic file with name sim0033-3601-m26.jpg

Then, one may note that the primary distinction between these two expressions is that the conditional survival probability for the unexposed used in both numerator and denominator of the second expression is replaced by that for the exposed in the first expression. This substitution essentially amounts to a form of standardization of unexposed individuals who survived, by the survival probability of exposed individuals with similar covariate history. One should note as well that in the special instance where S is independent of A given C1, as one might expect, the two aforementioned expressions coincide. However, in general, the two functionals do not coincide, and the subtle difference between them has nontrivial implications for inference. Specifically, whereas estimation of Inline graphic is straightforward and does not require estimating nuisance parameters, estimation of μ0 is somewhat more involved.

2.4. Estimation of survivor average causal effect

As explained in the previous section, we only need to consider estimation of μ0. Ideally, we may wish to estimate the latter nonparametrically, so as to avoid potential modeling bias; however, this may not be possible in practice. This is because, as shown below, estimation of μ0 is generally not possible without involving an estimate of a subset of the following quantities Inline graphic. In practice, one would probably seek to enrich to the extent possible the set of covariates in C1 in order to ensure that all variables are included, which potentially mediate the effects of exposure on survival and outcome simultaneously. As a result, our primary interest concerns settings in which C1 potentially includes a large number of covariates, a subset of which is possibly continuous, such that nonparametric methods for estimating the aforementioned density and regression models, such as smoothing techniques, may be of limited value. Consequently, next, we present three simple estimation strategies based on low-dimensional models. Let Inline graphic denote estimates obtained using parsimonious parametric working models for the unknown conditional mean and the two unknown conditional densities. Our first strategy entails direct substitution of unknown quantities in 17 by their corresponding estimate, which gives

graphic file with name sim0033-3601-m30.jpg

This estimator depends heavily on correct specification of all three models. An alternative estimator that makes fewer assumptions is based on the following equivalent representation of μ0

graphic file with name sim0033-3601-m31.jpg

which gives the estimator

graphic file with name sim0033-3601-m32.jpg

where Inline graphic. This second approach improves over the first in terms of robustness, because it does not directly involve an estimate of the survival process. Finally, consider yet another representation of μ0:

graphic file with name sim0033-3601-m34.jpg 10

which motivates the estimator

graphic file with name sim0033-3601-m35.jpg

This last estimator has the advantage that it only requires an estimate of the survival process, and both the outcome regression and the covariates distribution are left unrestricted. From the three approaches presented above earlier, the last is most appealing, because it involves fitting a single model, whereas the other two approaches involve multiple models. In the next section, we show that the last strategy readily extends to a general longitudinal design with arbitrary follow-up. We do not further consider the first two strategies on above grounds, although in Section 5, a doubly robust approach is given, which combines all three mentioned strategies such that consistent estimation of μ0 remains possible even under partial model misspecification, that is, when only some but not all required models are correctly specified.

3. Longitudinal studies of arbitrary length

3.1 Longitudinal nonparametric structural equations model and identification of survivor average causal effect

We turn to the more general context of a longitudinal study with arbitrary follow-up j = 0, … J, with Inline graphic is fixed, and at each occasion j, one observes (Sj,SjCj), where Sj indicates survival status at time j, and Cj includes covariates measured at time j. We suppose that S0 = S1 = 1, and therefore, a vector of pre-exposure covariates C0 is measured on all individuals in the target population, and exposure A is measured concurrently with covariates C1 on all individuals in the target population. The variable CJ = Y encodes the outcome measured at the end of follow-up. We consider the general NPSEM:

graphic file with name sim0033-3601-m37.jpg (11)
graphic file with name sim0033-3601-m38.jpg (12)
graphic file with name sim0033-3601-m39.jpg (13)
graphic file with name sim0033-3601-m40.jpg (14)

We assume that

graphic file with name sim0033-3601-m41.jpg

and we also assume that

graphic file with name sim0033-3601-m42.jpg

However, Inline graphic and Inline graphic may be dependent, and Inline graphic and Inline graphic may be dependent, jj ′ . The causal diagram of Figure 2 depicts the observed data, generated under such an NPSEM for an individual alive at the end of follow-up, in the special case where J = 4. We allow all error distributions to otherwise remain arbitrary.

Figure 2.

Figure 2

Causal diagram associated with the nonparametric structural equations model Equations 2427 at J = 4.

This more general NPSEM extends the previous model, to accommodate, both confounding by pre-exposure covariates C0, and longitudinal data Inline graphic, where Inline graphic denotes the history (C0, …, Cj). Note that because C0 and C1 are respectively prior to and concurrent with exposure A (now defined to occur at time j = 1 to ensure proper temporal ordering with C0), they cannot be affected by exposure, and thus C0(a) = C0 and C1(a) = C1. Technically, C0 confounds the effects of A, but C1 is not considered a confounder even though it may be correlated with A and may be used to account for survival bias. Crucially, independence of Inline graphic and Inline graphic implies that for individuals alive at time j − 1, Inline graphic intercepts or blocks all noncausal pathways between Cj − 1 and Sj; in the language of causal graphs, Inline graphic is said to block all back-door paths from Cj − 1 to Sj.

The SACE is defined to be the causal effect of exposure on an outcome measured at the end of follow-up, among individuals that would survive whether exposed or not, and with the covariate history, they would have under possibly conflicting exposure status:

graphic file with name sim0033-3601-m53.jpg

As was the case in the previous section with three time points, we likewise have that if Inline graphic implies Inline graphic almost surely, then

graphic file with name sim0033-3601-m56.jpg

which corresponds to the traditional definition of SACE.

Identification requires an extension of the monotonicity and concordant survivorship assumptions:

  1. Sequential monotonicity assumption for point exposure:
    graphic file with name sim0033-3601-m57.jpg

Let Inline graphic denote the subset of individuals that would survive until the end of follow-up regardless of smoking, in a hypothetical situation where Inline graphic would behave as if they did not smoke, and let Inline graphic denote the subset of individuals that would survive irrespective of smoking, in a hypothetical situation where Inline graphic would behave as if they smoked.

  1. Concordant survivorship assumption for point exposure:
    graphic file with name sim0033-3601-m62.jpg

The sequential monotonicity assumption states that for a person that would survive up to time j − 1 irrespective of smoking status, in the hypothetical situation in which his covariate history behaves as if smoking status were fixed to a, if the person were to survive at time j when exposed and with covariate history as if smoking status were equal to a, the person would also survive at time j, with similar covariate history, if he were not to smoke. The concordant survivorship assumption essentially states that

graphic file with name sim0033-3601-m63.jpg

which is the natural extension of the previous concordant survivorship assumption. Let

graphic file with name sim0033-3601-m64.jpg

Throughout, we further assume that

graphic file with name sim0033-3601-m65.jpg

almost surely, where πk is defined in Theorem 2, and 0 < p(C0) < 1 almost surely.

Then, we have the following result.

Theorem 2

Under the NPSEM given by Equations 2427, the sequential monotonicity assumption, and the concordant survivorship assumption for point exposure, we have that ψJ is nonparametrically identified and is given by μ1,Jμ0,J, where

graphic file with name sim0033-3601-m66.jpg

and

graphic file with name sim0033-3601-m67.jpg (15)

According to the theorem, as was the case in the three-occasion example, estimation of the survivor average outcome for smoking likewise presents no particular difficulty and can be achieved by a simple average of the outcome of exposed individuals who survived until the end of follow-up. The theorem states that the survivor average outcome for nonsmoking is given by the functional in Equation 38. This formula is a generalization of Equation 17 accounting for arbitrary follow-up and pre-exposure confounding by C0. Instead of a marginal SACE parameter, a conditional SACE parameter might be of interest, say

graphic file with name sim0033-3601-m68.jpg

where VC0 is a vector of pre-exposure confounders, such that C0 = (V,W) with W all components of C0 not in V. Then, it is straightforward to verify that a corollary of Theorem 2 gives under the same set of assumptions:

graphic file with name sim0033-3601-m69.jpg

where

graphic file with name sim0033-3601-m70.jpg

and

graphic file with name sim0033-3601-m71.jpg

3.2. Weighted estimation of models of survivor average causal effect

A simple weighted approach is given, which is consistent in the absence of model misspecification. Let Inline graphic denote a fitted model for the survival process, using a standard parametric approach; likewise, let p(C0) = Pr(A = 1 | C0) denote the propensity score for exposure, and Inline graphic its corresponding estimator. Define the following estimated weights for individuals who survive until the end of follow-up:

graphic file with name sim0033-3601-m74.jpg

Then, let Inline graphic denote the weighted ordinary least-squares estimator of the marginal effect of A on Y using only data on individuals who survived until the end of follow-up, with individual weight equal to Inline graphic. Then, we have the following result:

Theorem 3

Suppose that the assumptions of Theorem 2 hold and that Inline graphic and Inline graphic are consistent, then Inline graphic is consistent for ψJ.

The theorem states that ψJ may be estimated consistently, by weighted least-squares estimation of a regression of Y on A among survivors at time J, with weight Inline graphic. Furthermore, the estimator Inline graphic is asymptotically normal under standard regularity conditions. The weights component given by Inline graphic corresponds to standard inverse probability-of-treatment weighting a well-known propensity score technique to control for confounding 25,26. Intuitively, treatment weights create possibly fractional copies of each individual with complete follow-up, such that in the weighted sample, C0 no longer predicts A and therefore is not a confounder. The other component of the weights Inline graphic corrects for selective survival of unexposed individuals. Intuitively, monotonicity of the effects of exposure on survival implies that unexposed survivors may be over-represented relative to the exposed, and thus if C0 were empty so that Inline graphic could be set to 1, then because Inline graphic, we would have that the survival weight essentially adjusts the contribution of unexposed survivors downwards and does so continuously as a function of Inline graphic

Although the theorem identifies SACE on the additive scale, estimation using Inline graphic as weight is a universal strategy for estimating SACE on a variety of scales. For instance, the approach could be used to estimate SACE on the multiplicative scale, or for other choice of link function, such as logit or probit link functions. This could be achieved by simply replacing the normal equations with the corresponding set of estimating equations one would have used in the absence of selective survivorship and by multiplying each survivor’s contribution by the weight. The approach could also be used for quantile regression or for weighting other standard likelihood or quasi-likelihood methods. Finally, suppose that instead of marginal causal effects, a conditional causal effect, say ψJ(C0), was in view. Then, suppose that one were to use a parametric or semiparametric model Inline graphic to describe the survivor average causal effect of A within levels of Co. Then, assuming the model was correct, θ could then be estimated via a weighted approach, using only the survival component of the weight Inline graphic, because the regression model would already account for baseline confounding by conditioning on C0. This strategy is in fact adopted in the data illustration of the next section.

For inference about ψJ or ψJ(C0), in general, one could use the nonparametric bootstrap such that the extra variation due to first-stage estimation of the weights is appropriately accounted for. Alternatively, one could use a consistent estimate of the large sample variance of the weighted estimator of SACE to construct Wald-type CIs; such an estimator of the large sample variance can be computed in a manner similar to the variance estimator given in Section 5.

4. A data application

We illustrate the new methodology in an evaluation of the effects of smoking on cognitive decline in an aging population subject to substantial attrition due to death and dropout for other reasons 27. In their paper, Weuve et al. 27 noted that selective attrition in this population may introduce bias into analyses of the effects of smoking status measured at the start of follow-up on cognitive decline, for the following reasons.

  1. An individual’s evolving health status is likely to be a common cause for attrition and cognitive decline among survivors who do not drop out.

  2. An individual’s evolving health status is likely to mediate the causal effect of smoking on cognitive decline.

To account for (1) and (2), Weuve et al. used inverse probability-of-attrition weights and examined the influence of selective attrition on the estimated association of current smoking (versus never smoking) with cognitive decline in participants of the Chicago Health and Aging Project (n = 3713), aged 65–109 years, who were current smokers or never smokers, and underwent cognitive assessments up to five times at 3-year intervals. Only 20% of the original sample remained at the fourth follow-up, and mortality accounted for most ( ∼ 70%) of the attrition. Weuve et al. used separate pooled logistic regressions to fit predictive models of attrition due to death or study dropout across follow-up visits using both baseline and time-updated data to construct inverse probability-of-attrition weights. We refer the reader to Weuve et al. for additional details on their design and analysis of the study, also see 11,24 for additional discussion. Similar to Weuve et al. 27, we estimated a linear mean regression model contrasting rates of change in cognitive scores in current versus never smokers, adjusting for the following pre-exposure confounders in the regression: age, sex, race, education, and alcohol consumption. As recommended by Tchetgen Tchetgen et al. 28, we assumed an independence correlation structure for the five serial measurements of cognitive function (coded as z-scores). The two main sources of attrition, death and dropout for other reasons (denoted Dj), were modeled separately as discrete-time PH models via pooled logistic regression. Each model included main effects for the following baseline and time-updated variables: age, race (African American versus White), sex (male versus female), education (0–8, 9–12 referent, 13–16, and 17–30 years), alcohol consumption at the previous visit (none referent, up to 1 drink/day, 1 drink/day), social network score at the previous visit, cognitive activity at the previous visit, disability score at the previous visit, self-rated health at the previous visit (per unit worsening in rating), chronic cardiovascular conditions, diabetes, global cognitive score at the previous visit, and smoking status (current versus never). A logistic model for nondeath-related censoring was also estimated using only baseline variables. These predictive models were combined as in the study of Weuve et al. to account for selective censoring other than death via stabilized weights 27:

graphic file with name sim0033-3601-m90.jpg

An additional set of weights was estimated to account for truncation by death using the approach developed in Section 3.2

graphic file with name sim0033-3601-m91.jpg

and the final weight Inline graphic was applied at the level of observations within individuals, such that for each person’s contribution to our analysis at time j, the weight was the product of censoring weights and survival weights.

The CIs were obtained via the bootstrap. In unweighted analyses, current smokers’ cognitive scores declined 0.11 standard units per decade more rapidly than never smokers’ cognitive scores (95% CI = − 0.20 to − 0.02). Weighting for attrition due to dropout or death using Inline graphic for weight gave an estimate that was considerably larger, with smoking’s estimated 10-year rate of decline compared with nonsmoking 55% larger than in the unweighted analysis (95% CI = − 0.27 to − 0.07). Under the assumptions of Theorem 2, this latter estimate may be interpreted as the survivor average causal effect of smoking on cognitive decline conditional on pre-exposure covariates. Monotonicity in the current setting essentially states that smoking does not offer any direct individual survival benefit not mediated by the time-varying factors included in our analysis. We emphasize that the assumption does not rule out that there may be certain pathways through which smoking could offer survival benefits (e.g., by alleviating depressive symptoms in an individual with suicidal tendencies); however, according to the assumption, such pathways are assumed to be outweighed for each individual by the harmful health effects associated with smoking through other pathways. We find this assumption to be credible given the overwhelming amount of evidence of the harmful effects of smoking. The concordant survivorship assumption here essentially states that survivors who experienced no natural direct effect of smoking (not mediated by time-updated factors Inline graphic) would do so irrespective of the reference exposure value used in defining the direct effect. This assumption may be reasonable, if one can safely assume that Inline graphic captures the most important pathways by which smoking affects survival and that the remaining pathways not captured by Inline graphic do not have significant additive interactions with the latter. Our interpretation of our estimated effect as identifying SACE is thus contingent on the extent to which this last assumption may be reasonable. Finally, we should note that similar results were obtained for SACE when dropout for other reasons was simply ignored by redefining the weight as Inline graphic, suggesting that most of the selection bias due to attrition was related to death.

5. Results on double robustness and sensitivity analysis

5.1. Double robustness

Consider the simple three-occasion setting described in Section 2. The following result gives a doubly robust estimator of μ0 in the three-occasion setting, which essentially combines Inline graphic and Inline graphic of Section 2.4, such that consistency is obtained under a union model where either Inline graphic and Inline graphicare both consistent or Inline graphic is consistent, but all models are not necessarily consistent. To state the result, consider the following estimating function:

graphic file with name sim0033-3601-m103.jpg

and define Inline graphic similarly, evaluated under Inline graphic

Theorem 4

Under the assumptions of Theorem 1, Inline graphic is doubly robust and therefore converges to μ0 and is asymptotically normal if one but not necessarily both of the following conditions hold:

  1. Inline graphic and Inline graphic are both consistent.

  2. Inline graphic is consistent.

where Inline graphic satisfies the estimating equation Inline graphic Furthermore, at the intersection submodel where all estimators are consistent, Inline graphic is semiparametric efficient in the nonparametric model where no model assumption is made, at the intersection submodel where both of the aforementioned conditions (1) and (2) hold.

The theorem gives an estimator of μ0 that is doubly robust and semiparametric efficient in the nonparametric model where no modeling assumption is made, at the intersection submodel where all working models are correct. This last property is sometimes called semiparametric local efficiency. At the intersection submodel, the asymptotic variance of Inline graphic can be estimated by the simple expression Inline graphic. Interestingly, one may note that this expression is invariant to the choice of working models and corresponding estimators. This property does not apply outside of the intersection submodel; nonetheless, it remains possible to estimate the asymptotic variance of Inline graphic outside the intersection submodel. To do so, let Inline graphic, and Inline graphic denote the estimates of Inline graphic, and γS, the parameters indexing models for Inline graphic F(C1 | A = 0), and Pr(S = 1 | A,C1), respectively. Suppose that such estimates are obtained by solving a set of score equations with respective scores Inline graphic, and MS(γS). Let Inline graphic and define Inline graphic to equal U(μ0) under the parametric model, such that Inline graphic Inline graphic. Then, a standard Taylor series expansion can be used to show that the large sample variance of Inline graphic is consistently estimated by Inline graphic, where

graphic file with name sim0033-3601-m127.jpg

5.2 Sensitivity analysis

A key assumption we have made in order to identify SACE is that there is no unmeasured common cause of S and Y, which is clearly encoded in the causal diagram of Figure 1 and its associated NPSEM 14. In the current section, a sensitivity analysis technique is developed to assess the extent to which a violation of the assumption might affect results. Unlike previous sensitivity analysis techniques for truncation by death and related contexts 7,8,12,13, the proposed sensitivity analysis technique makes explicit use of post-exposure covariates and therefore extends previous methods to the current more general longitudinal context. We begin by describing the approach in the simple three-occasion study. Define the selection bias function as follows:

graphic file with name sim0033-3601-m128.jpg

We have that t( ⋅ ) = 0 under the independence assumptions encoded in the NPSEM 14; however, if there was an unmeasured common cause of S and Y, such that ɛY and ɛS were no longer independent, then we would expect that t(c1) ≠ 0 for some value of c1, even if all other independencies encoded in the NPSEM continued to hold. In the smoking example, if the selection bias function is not null, we would expect that persons with worse health status and therefore worse cognitive decline would be more likely not to survive if they smoked, and therefore, we expect Inline graphic. We propose to recover inferences about SACE by assuming that the selection bias function t(c1) is known, which encodes the magnitude and direction of the unmeasured common cause of S and Y. Specifically, the sensitivity analysis considers relaxation of the assumption that C1 includes all common causes of Y and S. To motivate the proposed approach, suppose for the moment for the sake of exposition that π(a,C1) = Pr(S | a,C1) were known, then we show in the Appendix (as a special case of the more general expression 52 given in the following text) that

graphic file with name sim0033-3601-m130.jpg 16

therefore, knowing t(c1) allows one to recover the average potential outcome when unexposed in the principal strata of survivors {S(a = 0) = S(a = 1,C1(a = 0) = c1) = 1}, by adjusting the average observed outcome in the unexposed who survived, using the aforementioned expression. The expression in the above display can then be combined with the representation of Inline graphic given by Equation 22, to produce the following modified estimator:

graphic file with name sim0033-3601-m132.jpg

A formal sensitivity analysis can be obtained by repeating this process and reporting inferences about μ1μ0 using Inline graphic and a corresponding CI for each choice of t( ⋅ ), say in a finite set of user-specified functions Inline graphic indexed by a finite dimensional parameter λ with Inline graphic corresponding to the assumption of no unmeasured common cause of Y and S, that is, t0( ⋅ ) ≡ 0.

As is shown next, this sensitivity analysis technique readily extends to a longitudinal study with J > 3 occasions. Define the selection bias function:

graphic file with name sim0033-3601-m136.jpg

Similar to the three-occasion setting, tJ( ⋅ ) = 0 under the independence assumptions encoded in the NPSEM 2427; however, if there were an unmeasured common cause of the survival process and Y among individuals alive at the end of follow-up, such that ɛY and Inline graphic were no longer independent, then we would expect that Inline graphic for some value of Inline graphic, even if all other independencies of the NPSEM were to continue to hold. Then, similar to the three-occasion derivation, one obtains the following relation, which is derived in the Appendix:

graphic file with name sim0033-3601-m140.jpg (17)

This result combines with the representation of μ0,J from Section 3, to produce

graphic file with name sim0033-3601-m141.jpg

where as we recall

graphic file with name sim0033-3601-m142.jpg

which in turn can be used to obtain a consistent estimator. A sensitivity analysis then simply proceeds similar to the three-occasion setting described previously.

6. Survivor marginal structural models

It is now well known that in the context of a time-varying exposure with time-varying confounding, standard confounding adjustment techniques, such as stratification or standard regression analysis in general cannot appropriately account for time-varying confounding and therefore can be biased for the joint causal effects of the exposure, even under the causal null hypothesis of no exposure effect over time. In fact, the standard use of regression models to estimate the causal effect of a time-varying exposure can be biased even in the absence of unmeasured confounders whether or not one adjusts further for the past history of measured covariates in the analysis, when (i) there exists a time-dependent risk factor for the outcome, which also predicts subsequent exposure; and (ii) past exposure history predicts subsequent risk factor level. The reason is, when both conditions (i) and (ii) hold, an analysis that does not adjust for past covariates is biased because of the uncontrolled confounding, yet an analysis that includes current covariates is also biased as it adjusts for a variable affected by past exposure.

The MSMs were introduced by Robins 29 to estimate the joint causal effect of a time-dependent exposure in the presence of time-dependent confounders that are themselves intermediate variables, affected by previous exposure. MSMs were proposed as an alternative approach to the semiparametric g-computation algorithm estimator 1 and to g-estimation of structural nested models 29. Robins 29 and subsequently Hernan et al. 30 described inverse probability-of-treatment-weighted estimation of MSMs, a method which in contrast to standard methods provides consistent estimates of causal effects when unmeasured confounding, model misspecification, and truncation by death are absent. We extend the results of previous sections to the context of MSMs.

First, we redefine the NPSEM to allow for time-updated exposure, and we assume no other form of loss to follow-up is present. Let A0 = 0, such that individuals are assumed to be unexposed at start of follow-up, and let C − 1 = 0, then for j = 0, …, J:

graphic file with name sim0033-3601-m143.jpg (18)
graphic file with name sim0033-3601-m144.jpg (19)
graphic file with name sim0033-3601-m145.jpg (20)

Let Y = CJ denote the outcome. We assume that

graphic file with name sim0033-3601-m146.jpg

and we also assume that

graphic file with name sim0033-3601-m147.jpg

However, as before, Inline graphic and Inline graphic may be dependent, and likewise, for Inline graphic and Inline graphic, jj ′ . The NPSEM 5557 with the associated error independencies essentially states that Inline graphic is sufficiently rich such that Inline graphic account for any association between Sj and Cj, and likewise, Inline graphic accounts for confounding of the effects of Aj.

To account for truncation by death, consider the average potential outcome of survivors:

graphic file with name sim0033-3601-m155.jpg

The conditioning event of the above expectation would be satisfied if an individual would survive irrespective of exposure history, including under certain cross-world situations where the covariate history behaves as if under an exposure history that possibly conflicts with that influencing the outcome. Thus, SMSMs give a natural generalization of standard MSMs to account for truncation by death. In the special case where Inline graphic almost surely, then Inline graphic simplifies and may be written

graphic file with name sim0033-3601-m158.jpg

which extends the standard definition of SACE to time-updated exposure settings.

Identification of Inline graphic requires a modification of the monotonicity and concordant survivorship assumptions:

  1. Sequential monotonicity assumption for time-dependent exposure: For any treatment history Inline graphic, if
    graphic file with name sim0033-3601-m161.jpg

then

graphic file with name sim0033-3601-m162.jpg

where S0( ⋅, ⋅ ) = 1 almost surely; and if

graphic file with name sim0033-3601-m163.jpg

then

graphic file with name sim0033-3601-m164.jpg
  1. Concordant survivorship assumption for time-dependent exposure:
    graphic file with name sim0033-3601-m165.jpg

almost surely, for some exposure history Inline graphic, then

graphic file with name sim0033-3601-m167.jpg

We should note that the sequential monotonicity assumption essentially states that receiving a dose of exposure can never be harmful for survival relative to remaining unexposed over time, and therefore in contrast with previous sections where exposure was smoking history and therefore was assumed to have a harmful effect on survival, exposure is now assumed to have a protective survival effect, for example, highly active antiretroviral therapy taken by HIV patients 23. Specifically, the condition states that a person who would survive up to time j − 1 if either untreated or with treatment history Inline graphic, in the hypothetical situation in which his covariate history behaves as if his treatment history was set to Inline graphic, and the person were to survive at time j when untreated and with covariate history under treatment regime Inline graphic, then the person would also survive at time j, with similar covariate history, if his treatment history was set to Inline graphic.

The concordant survivorship assumption essentially states that a person who would survive under a given treatment history Inline graphic, as well as if he were never exposed, in the hypothetical situation in which his covariate history behaves as if his exposure history was set to Inline graphic, then he would also survive under any other treatment history Inline graphic, and he would likewise survive if he were never exposed, in the hypothetical situation in which his covariate history behaves as if his exposure history was set to Inline graphic. A similar assumption is made for an individual for whom Inline graphic for some Inline graphic Let

graphic file with name sim0033-3601-m178.jpg

Throughout, we make the following positivity assumptions

graphic file with name sim0033-3601-m179.jpg

and if

graphic file with name sim0033-3601-m180.jpg

Then, we have the following result.

Theorem 5

Under the NPSEM given by Equations 5557, the sequential monotonicity assumption for time-dependent exposure, the concordant survivorship assumption for time-dependent exposure, and the positivity assumptions, we have that Inline graphic is nonparametrically identified and is given by

graphic file with name sim0033-3601-m182.jpg

In practice, to estimate Inline graphic, one may proceed as in Hernan et al. 23 and specify a model Inline graphic indexed by an unknown parameter λ. We refer to such a model as a SMSM, because it is an MSM for individuals that would survive under any treatment history, including under certain cross-world situations where the covariate history behaves as if under an exposure history, which conflicts with that influencing the outcome. For instance, consider the simple linear SMSM:

graphic file with name sim0033-3601-m185.jpg

where Inline graphic. Then, similar to the weighted least-squares approach developed in Section 3.2, one can likewise show that a consistent weighted least-squares estimator of the SMSM can be obtained by using the following modified weight Inline graphic, which accounts for time-dependent exposure and confounding and death-related attrition:

graphic file with name sim0033-3601-m188.jpg

where Inline graphic is a consistent estimator of πj and Inline graphic is a consistent estimator of Inline graphic. For inference, it is possible to derive a sandwich variance estimator similar to the one provided in previous sections; alternatively, one may use the nonparametric bootstrap.

7. Discussion

In this paper, we have developed a general framework for identification and estimation of causal effects when the outcome in view is subject to truncation by death. The proposed approach is shown to equally apply in the context of a point exposure but also if joint effects of a time-updated exposure are in view. A simple weighted approach is described for estimation, which readily scales with follow-up of increasing length and applies irrespective of whether the exposure is time-updated or occurs at a single point in time. Doubly robust estimation is shown to be possible a simple three-occasion study. However, it is unclear that a similar doubly robust estimator is available beyond this simple setup, say if one has more than three follow-up visits in a longitudinal study. Although such generalizations are of definite interest and deserve further investigation. A sensitivity analysis technique is developed for a general longitudinal study of arbitrary length, which may be used to evaluate the extent to which a violation of the assumption that one has observed sufficient post-exposure covariates to account for an association between the outcome and survival, may bias the results. In the future, we plan to further develop the sensitivity analysis approach and to implement these techniques to the longitudinal smoking data application illustrated in the paper.

Acknowledgments

This work was supported by Award Number U54GM088558 from the National Institute Of General Medical Sciences and Award Numbers R01AI104459-01A1 from the National Institute of Allergy and Infectious Diseases and R01ES020337 from the National Institute of Environmental Health Sciences. The author would like to thank Denis Evans, Jennifer Weuve, and George Dombrowski for providing access to data from the Chicago Health and Aging Project (CHAP). The author is particularly grateful to Todd L. Beck for his invaluable assistance with the data analysis of CHAP.

Appendix

Proof of Theorem 1

We can write

graphic file with name sim0033-3601-m192.jpg

Then, note that by the concordant survivorship assumption

graphic file with name sim0033-3601-m193.jpg

and by the monotonicity assumption

graphic file with name sim0033-3601-m194.jpg

by the independence assumptions associated with the NPSEM, and similarly,

graphic file with name sim0033-3601-m195.jpg

and therefore,

graphic file with name sim0033-3601-m196.jpg

Next consider

graphic file with name sim0033-3601-m197.jpg

Note that

graphic file with name sim0033-3601-m198.jpg

Likewise,

graphic file with name sim0033-3601-m199.jpg

2. Proof of Theorem

Similarly to the proof of Theorem 1, note that

graphic file with name sim0033-3601-m200.jpg

by the concordant survivorship assumption, furthermore, by the sequential monotonicity assumption:

graphic file with name sim0033-3601-m201.jpg

Then, note that

graphic file with name sim0033-3601-m202.jpg
graphic file with name sim0033-3601-m203.jpg

A similar argument shows that

graphic file with name sim0033-3601-m204.jpg

and therefore,

graphic file with name sim0033-3601-m205.jpg

Next, note that

graphic file with name sim0033-3601-m206.jpg

by the concordant survivorship assumption. We then have that

graphic file with name sim0033-3601-m207.jpg
graphic file with name sim0033-3601-m208.jpg

One can show using similar arguments that

graphic file with name sim0033-3601-m209.jpg

proving the result.□

Proof of Theorem 3

Inline graphic converges in probability to the solution of the following population weighted normal equations:

graphic file with name sim0033-3601-m211.jpg

where WA = [p(C0)A{1 − p(C0)}1 − A]− 1. It is straightforward to verify that under the assumptions of Theorem 3, the left hand-side of the equation in the aforementioned display is equal to

graphic file with name sim0033-3601-m212.jpg

where Inline graphic is the expectation with respect to the law

graphic file with name sim0033-3601-m214.jpg

giving the result.□

Proof of Theorem 4

Let Inline graphic Ga(C1) = F(C1 | A = a). To prove the theorem, it suffice to show that Inline graphic if either Inline graphic, or Inline graphic and Inline graphic, but not necessarily both hold, where

graphic file with name sim0033-3601-m220.jpg

First assume that Inline graphic, then

graphic file with name sim0033-3601-m222.jpg

Next, suppose that Inline graphic and Inline graphic, then

graphic file with name sim0033-3601-m225.jpg

proving the result.□

Proof of Theorem 5

The proof is similar to that of Theorem 2; suppose that Inline graphic, then

graphic file with name sim0033-3601-m227.jpg

and we have

graphic file with name sim0033-3601-m228.jpg
graphic file with name sim0033-3601-m229.jpg

giving the result for

graphic file with name sim0033-3601-m230.jpg

The expression for

graphic file with name sim0033-3601-m231.jpg

is similarly obtained. The result for Inline graphic is obtained by noting that

graphic file with name sim0033-3601-m233.jpg

and it is straightforward to verify that

graphic file with name sim0033-3601-m234.jpg

and the expression for Inline graphic is similarly derived.□

Derivation of Equation (17)

graphic file with name sim0033-3601-m236.jpg

Then, note that

graphic file with name sim0033-3601-m237.jpg

thus by iterating, one obtains

graphic file with name sim0033-3601-m238.jpg

which gives

graphic file with name sim0033-3601-m239.jpg

proving the result. Equation 49 is obtained by setting J = 2.

References

  1. Robins JM. A new approach to causal inference in mortality studies with sustained exposure period—application to control of the healthy worker survivor effect. Mathematical Modelling. 1986;7:1393–1512. [Google Scholar]
  2. Frangakis CE, Rubin DB. Principal stratification in causal inference. Biometrics. 2002;58(1):21–29. doi: 10.1111/j.0006-341x.2002.00021.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Zhang JL, Rubin DB. Estimation of causal effects via principal stratification when some outcomes are truncated by ‘death’. Journal of Educational and Behavioral Statistics. 2003;28(4):353–368. [Google Scholar]
  4. Rubin DB. Causal inference through potential outcomes and principal stratification: application to studies with ‘censoring due to death’ (with discussion) Statistical Science. 2006;21(3):299–321. [Google Scholar]
  5. Hudgens MG, Hoering A, Self SG. On the analysis of viral load endpoints in HIV vaccine trials. Statistics in Medicine. 2003;22(14):2281–2298. doi: 10.1002/sim.1394. [DOI] [PubMed] [Google Scholar]
  6. Hudgens MG, Halloran ME. Causal vaccine effects on binary postinfection outcomes. Journal of the American Statistical Association. 2006;101(473):51–64. doi: 10.1198/016214505000000970. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Shepherd BE, Gilbert PB, Jemiai Y, Rotnitzky A. Sensitivity analyses comparing outcomes only existing in a subset selected postrandomization, conditional on covariates, with application to HIV vaccine trials. Biometrics. 2006;62(2):332–342. doi: 10.1111/j.1541-0420.2005.00495.x. [DOI] [PubMed] [Google Scholar]
  8. Jemiai Y, Rotnitzky A, Shepherd BE, Gilbert PB. Semiparametric estimation of treatment effects given base-line covariates on an outcome measured after a post-randomization event occurs. Journal of the Royal Statistical Society Series B-Statistical Methodology 1. 2007;69(5):879–901. doi: 10.1111/j.1467-9868.2007.00615.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Pearl J. Causality: Models, Reasoning, and Inference. 2nd ed. New York: Cambridge University Press; 2009. [Google Scholar]
  10. Bertram L, McQueen MB, Mullin K, Blacker D, Tanzi RE. Systematic meta-analyses of Alzheimer disease genetic association studies: the AlzGene database. Nature Genetics. 2007;39:17–23. doi: 10.1038/ng1934. [DOI] [PubMed] [Google Scholar]
  11. Tchetgen Tchetgen EJ, Glymour M, Weuve J, Shpitser I. To weight or not to weight? On the relation between inverse-probability weighting and principal stratification for truncation by death. Epidemiology. 2012a;23(4):644–646. doi: 10.1097/EDE.0b013e31825727b5. [DOI] [PubMed] [Google Scholar]
  12. Gilbert PB, Bosch RJ, Hudgens MG. Sensitivity analysis for the assessment of causal vaccine effects on viral load in HIV vaccine trials. Biometrics. 2003;59(3):531–540. doi: 10.1111/1541-0420.00063. [DOI] [PubMed] [Google Scholar]
  13. Shepherd BE, Gilbert PB, Lumley T. Sensitivity analyses comparing time-to-event outcomes existing only in a subset selected postrandomization. Journal of the American Statistical Association. 2007;102(478):573–582. doi: 10.1198/016214507000000130. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Shepherd B, Gilbert P, Dupont C. Sensitivity analyses for comparing time-to-event outcomes only existing in a subset selected postrandomization and relaxing monotonicity. Biometrics. 2011;67:1100–1110. doi: 10.1111/j.1541-0420.2010.01508.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Gilbert P, Jin Y. Semiparametric estimation of the average causal effect of treatment on an outcome measured after a post-randomization event, with missing outcome data. Biostatistics. 2010;11:34–47. doi: 10.1093/biostatistics/kxp034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Chiba Y, VanderWeele TJ. A simple methd for principal strata effects when the outcome has been truncated due to death. American Journal of Epidemiology. 2011;173:745–751. doi: 10.1093/aje/kwq418. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Imai K. Sharp bounds on the causal effects in randomized experiments with ‘truncation-by-death’. Statistics and Probability Letters. 2008;78:144–149. [Google Scholar]
  18. Hayden D, Pauler DK, Schoenfeld D. An estimator for treatment comparisons among survivors in randomized trials. Biometrics. 2005;61(1):305–310. doi: 10.1111/j.0006-341X.2005.030227.x. [DOI] [PubMed] [Google Scholar]
  19. Zhang JL, Rubin DB, Mealli F. ‘Likelihood-based analysis of causal effects via principal stratification: new approach to evaluating job training programs’. Journal of the American Statistical Association. 2009;104:166–176. [Google Scholar]
  20. Ding P, Geng Z, Yan W, Zhou X. Identifiability and estimation of causal effects by principal stratification with outcomes truncated by death. Journal of the American Statistical Association. 2011;106(496):1578–1591. [Google Scholar]
  21. Nolen T, Hudgens M. Randomization-based inference within principal strata. Journal of the American Statistical Association. 2011;106(494):581–593. doi: 10.1198/jasa.2011.tm10356. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Dai JY, Gilbert PB, Mâsse BR. Partially hidden Markov model for time-varying principal stratification in HIV prevention trials. Journal of the American Statistical Association. 2012;107(497):52–65. doi: 10.1080/01621459.2011.643743. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Tchetgen Tchetgen EJ. On causal mediation analysis with a survival outcome. International Journal of Biostatistics. 2011;7(1):1–38. doi: 10.2202/1557-4679.1351. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Tchetgen Tchetgen EJ, Shpitser I. Semiparametric theory for causal mediation analysis: efficiency bounds, multiple robustness, and sensitivity analysis. The Annals of Statistics. 2012;40(3):1816–1845. doi: 10.1214/12-AOS990. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Rosenbaum P, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70:41–55. [Google Scholar]
  26. Robins JM. Proceedings of the American Statistical Association. 1997. pp. 1–10. Marginal structural models, Section on Bayesian Statistical Science,
  27. Weuve J, Tchetgen Tchetgen EJ, Glymour MM, Beck TL, Aggarwal NT, Wilson RS, Evans DE, Mendes de Leon CF. Accounting for bias due to selective attrition in analyses of cognitive decline: the example of smoking and cognitive decline in older adults.(with discussion and rejoinder) Epidemiology. 2012;23(1):119–128. doi: 10.1097/EDE.0b013e318230e861. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Tchetgen Tchetgen EJ, Glymour MM, Weuve J, Robins J. A cautionary note on specification of the correlation structure in inverse-probability-weighted estimation for repeated measures. Epidemiology. 2012b;23(4):644–646. doi: 10.1097/EDE.0b013e31825727b5. [DOI] [PubMed] [Google Scholar]
  29. Robins JM. Marginal structural models versus structural nested models as tools for causal inference. In: Halloran M, Berry D, editors. Statistical Models in Epidemiology: The Environment and Clinical Trials IMA. Vol. 116. NY: Springer-Verlag; 1999. pp. 95–134. [Google Scholar]
  30. Hernán M, Brumback B, Robins JM. Marginal structural models to estimate the causal effect of zidovudine on the survival of HIV-positive men. Epidemiology. 2000;11(5):561–570. doi: 10.1097/00001648-200009000-00012. [DOI] [PubMed] [Google Scholar]
  31. Hernán M, Brumback B, Robins JM. Marginal structural models to estimate the joint causal effect of nonrandomized treatments. Journal of the American Statistical Association. 2001;96(454):440–448. [Google Scholar]

Articles from Statistics in Medicine are provided here courtesy of Wiley

RESOURCES