Abstract
A number of results concerning attributable fractions for sufficient cause interactions are given. Results are given both for etiologic fractions (i.e. the proportion of the disease due to a particular sufficient cause) and for excess fractions (i.e. the proportion of disease that could be eliminated by removing a particular sufficient cause). Results are given both with and without assumptions of monotonicity. Under monotonicity assumptions, exact formulas can be given for the excess fraction. When etiologic fractions are of interest or when monotonicity assumptions do not hold for excess fractions then only lower bounds can be given. The interpretation of the results in this paper and in a proposal by Hoffmann et al. (2006) are discussed and compared. A method is described to estimate the lower bounds on attributable fractions using marginal structural models. Identification is discussed in settings in which time-dependent confounding may be present.
Keywords: attributable fraction, interaction, marginal structural models, sufficient cause, synergism
Introduction
A number of recent papers consider tests for sufficient cause interactions (VanderWeele and Robins, 2007a, 2007b, 2008, 2009; Vansteelandt et al., 2008; VanderWeele, 2009; VanderWeele et al., 2010a, 2010b; VanderWeele and Richardson, 2010). In the case of two binary exposures, these sufficient cause interactions indicate the presence of individuals for whom the outcome would occur if both of two exposures were present but for whom the outcome would not occur if only one of the two exposures were present. Within the context of the sufficient cause framework, such individuals signal the presence of synergism between two exposures as conceived by Rothman (1976). In considering the public health impact of exposures, it is often of interest to know what proportion of the disease or outcome is in some sense due to the exposures under study or what proportion could be eliminated if the exposure under study were removed (Miettinen, 1974; Greenland and Robins, 1988). These questions also arise in a natural way within the sufficient cause framework. Rothman conceived of causation as a series of distinct mechanisms or “sufficient causes” each of which would be sufficient to bring about the outcome. We might then be interested in the proportion of disease that is due to, or could be eliminated by removing, a particular mechanism or sufficient cause (Hoffmann et al., 2006).
In this paper we consider several issues concerning inference for attributable fractions for sufficient cause interactions. We discuss a recent proposal by Hoffmann et al. (2006) to calculate the attributable fraction for a particular sufficient cause. We argue that the approach proposed by Hoffmann et al. implicitly requires monotonicity assumptions that are not acknowledged in the paper. We furthermore show that even within the context of monotonicity, the approach described by Hoffmann et al. gives excess attributable fractions rather than etiologic attributable fractions (Greenland and Robins, 1988; Robins and Greenland, 1989). We give a number of results that extend the approach of Hoffmann et al. First, we discuss how one can obtain lower bounds for the etiologic fraction under the assumption of monotonicity. Second, we consider inference for the excess and etiologic fraction when the assumption of monotonicity cannot be made; we also consider the invariance of these results to the sufficient cause representation. Third, we discuss how a marginal structural model approach to inference for sufficient cause interactions (VanderWeele et al., 2010a) can also be used to draw inferences concerning attributable fractions for sufficient cause interactions. Finally, we consider issues concerning time-dependent confounding in the context of inference for attributable fractions. We draw on recent identification results concerning the effect of treatment on the treated (Shpitser and Pearl, 2009) to show that in general the expressions for the lower bounds for etiologic fractions will not be identified when time-dependent confounding is present. However, in the case when the exposures of interest are binary we give an identification result which shows that the expressions for these lower bounds are identified in some settings.
Sufficient Cause Framework
We will let D denote a binary outcome of interest and let X1, X2, . . . , Xk denote binary causes of interest. For some cause Xi we will let X̅i denote the complement of Xi i.e. that the cause Xi is absent. When a particular outcome is in view, Rothman (1976) conceived of the relationship between cause and effect as a collection of causal mechanisms for D. Each causal mechanism would itself be sufficient for the outcome and thus Rothman referred to these causal mechanisms as “sufficient causes.” Each mechanism might require some combination of the causes of interest, X1, X2, . . . , Xk, either their presence or absence to operate. Each mechanism might also require some additional background factors, other than X1, X2, . . ., Xk, in order to operate. For the ith mechanism we will denote the presence of these additional background factors as Ai = 1 with Ai = 0 otherwise. Within Rothman’s description, each mechanism or “sufficient cause” would consists of a minimal set of conditions or “component causes” such that when all the component causes for a particular mechanism were present the mechanism would operate and the outcome would inevitably occur. Within every sufficient cause, every component of that sufficient cause would be necessary for the corresponding mechanism to operate. Synergism would be said to be present between Xi and Xj if there were a sufficient cause which required both Xi and Xj to operate. Further introductory discussion of the sufficient cause model with only two exposures can be found elsewhere (Greenland and Brumback, 2002; VanderWeele and Robins, 2007).
More formally, consider the case of two causes of interest, X1 and X2. We will let Dx1x2 (ω) denote the counterfactual value of D for individual ω if, possibly contrary to fact, we had set X1 = x1 and X2 = x2. Let ∨ denote the disjunctive “or” operator defined by X ∨ Y = X + Y – XY so that X ∨ Y = 1 if X = 1 or Y = 1 or both but X ∨ Y = 0 if X = Y = 0. VanderWeele and Robins (2008) defined a sufficient cause representation to be any set of variables {Ai(ω)}i=0, . . ., 8 that are functions of {Dx1x2 (ω)}x1,x2∈{0,1} such that
(1) |
A sufficient cause interaction was said to be present between X1 and X2 if for every sufficient cause representation there exists an ω such that A5(ω) ≠ 0. A sufficient cause interaction implies synergism in the sense of Rothman (1976). Sufficient cause interactions and synergism between X1 and X̅2, or between X̅1 and X2, or X̅1 and X̅2 can be defined similarly. The results below are stated in terms of sufficient cause interactions between X1 and X2 as the other cases are easily covered by recoding the exposures.
VanderWeele and Robins (2007a, 2008) derived an empirical condition for testing for synergism. Testing for synergism requires that control be made for confounding. Let denote that A is conditionally independent of B given C. We will say that the effects of X1 and X2 on D are unconfounded given C if . VanderWeele and Robins (2007a, 2008) showed that if the effects of X1 and X2 on D are unconfounded given a set of variables C and if we let px1x2c = P(D = 1|x1, x2, c) then if for some c,
(2) |
then a sufficient cause interaction between X1 and X2 must be present. VanderWeele and Robins (2008) furthermore showed that a sufficient cause interaction was equivalent to an individual for whom
(3) |
which itself is equivalent to an individual for whom D11(ω) = 1 and D10(ω) = D01(ω) = 0. Condition (3) is more general than condition (2) in that if (2) is satisfied then (3) must be satisfied for some individual ω but (3) may be satisfied for some individual ω without (2) being satisfied. Essentially condition (2) implies condition (3) and condition (3) implies a sufficient cause interaction and thus the presence of synergism between X1 and X2. However, condition (2), unlike condition (3), can be tested using data.
In some cases the effects of the cause Xi may be in the same direction for all individuals. We will say that X1 and X2 have positive monotonic effects on D if Dx1x2 (ω) is non-decreasing in x1 and x2 for all individuals ω. When X1 and X2 have positive monotonic effects on D then a condition weaker than (2) can be used to test for sufficient cause interactions. In particular, if the effects of X1 and X2 on D are unconfounded given C then a sufficient cause interaction must be present if the following condition holds (VanderWeele and Robins, 2007a, 2008; Rothman et al., 2008):
It can furthermore be shown that, under the monotonicity assumption, a sufficient cause interaction is present if there is an individual ω for whom
(VanderWeele and Robins, 2008). Empirical and counterfactual conditions are also available for testing for 3-way or k-way interactions between binary exposures (VanderWeele and Robins, 2008; VanderWeele and Richardson, 2010).
VanderWeele et al. (2010a) also noted that the expressions [D11 – D10 – D01] and [D11 – D10 – D01+D00] constitute lower bounds on the prevalence of sufficient cause interactions (i.e. the proportion of individuals for whom D11(ω) = 1 and D10(ω) = D01(ω) = 0) without and with the monotonicity assumption respectively. More generally, for some set of variables Q, [D11 – D10 – D01|Q = q] and [D11 – D10 – D01 + D00|Q = q] constitute lower bounds on the prevalence of sufficient cause interactions within strata Q = q and thus
and
will constitute lower bounds on the population prevalence of sufficient cause interactions without and with the monotonicity assumption respectively. In general these lower bounds will be larger (i.e. further from 0) than the crude bounds given by [D11 – D10 – D01] or [D11 – D10 – D01 + D00].
Attributable Fractions for Sufficient Cause Interactions
In many settings it is of interest to consider the proportion of the disease or outcome that is in some sense due to the exposure under study or could be eliminated if the exposure under study were removed (Miettinen, 1974; Robins and Greenland, 1988). For a single binary exposure the “attributable fraction” or “population attributable fraction” is sometimes defined as
(4) |
Greenland and Robins (1988) note that there is ambiguity in the expression “attributable fraction” and draw a distinction between an excess attributable fraction and an etiologic attributable fraction. Excess fractions are the proportion of the disease that could be eliminated by eliminating the exposure; etiologic fractions are the proportion of the disease due to the exposure. Because of the possibility of competing risks, the two quantities (the excess fraction and etiologic fraction) need not coincide. For example, it may be the case that for some individual the occurrence of the outcome is in fact due to the presence of the exposure under study but, for that individual, if the exposure were eliminated the outcome would still occur through some other mechanism. For such an individual the occurrence of the outcome is in fact due to the exposure but the outcome would not be eliminated by eliminating the exposure. Such an individual would be included in the numerator of the etiologic fraction but not for the excess fraction. The quantity given in (4) is the excess fraction. In the absence of further biological knowledge, the etiologic fraction is not identified (Greenland and Robins, 1988). Miettinen (1974) noted that if the effect of X on D was unconfounded given C then PAF(X) =
(5) |
In a recent paper, Hoffmann et al. (2006) proposed a method to estimate a quantity they define as the PDC which they conceived of as “the proportion of disease due to a class of sufficient causes”; however, as pointed out below, the method they describe in fact gives the excess fraction, not the etiologic fraction (i.e. the fraction that could be eliminated not the fraction due to the exposure). Hoffmann et al. consider exposures X1, . . . , Xk and implicitly assume that none of the sufficient causes involve any of the complements of X1, . . . , Xk so that X1, . . . , Xk have positive monotonic effects on D. For example, for two exposures X1 and X2 they assume that of the sufficient causes A0, A1X1, A2X̅1, A3X2, A4X̅2, A5X1X2, A6X̅1X2, A7X1X̅2, A8X̄1X̄2, only A0, A1X1, A3X2, A5X1X2 are present (i.e. A2 = A4 = A6 = A7 = A8 = 0). This assumption of monotonicity is not explicitly stated by Hoffmann et al. but is required in their derivations.
Hoffmann et al. then propose using the formula in (5) from Miettinen (1974) to estimate population attributable fraction (PAF) for a specific exposure or a group of exposures. In particular let X(1), . . . , X(m) be some subset of X1, . . . , Xk and define PAF(X(1), . . . , X(m)) as the proportion of diseased subjects who would not develop the disease if the exposures X(1), . . . , X(m) were eliminated. For I ⊆ {1, . . . , k} let SI denote the sufficient cause which requires Xi for all i ∈ I; for example, with two exposures, S0 would be the sufficient cause A0, S1 would be the sufficient cause A1X1, S2 would be the sufficient cause A3X2, and S12 would be the sufficient cause A5X1X2. Let PDC(SI) denote the proportion of the diseased subjects who would not develop the disease if the effects of the sufficient cause SI could be eliminated. Hoffmann et al. argue in their first Appendix that:
and PDC(S(1), . . . , (m)) is given recursively by
Thus for two exposures, X1 and X2, one would obtain PDC(S12)
This final expression could be estimated using (5). By definition of PAF, this final quantity is also equal to
Hoffmann et al. propose that the quantity PDC(SI) be interpreted as “the proportion of disease due to sufficient cause SI.” From the discussion above, it can be seen that the quantity should be interpreted as “the proportion of disease that would be eliminated by preventing the sufficient cause SI from operating” i.e. as an excess fraction not as an etiologic fraction. Essentially because the formulas they use for the PAF(X(1), . . . , X(m)) correspond to the proportion of diseased subjects who would not develop the disease if the exposures X(1), . . . , X(m) were eliminated, the quantity PDC(SI), calculated by using PAF (X(1), . . . , X(m)), corresponds to the proportion of diseased subjects who would not develop the disease if the sufficient cause SI were eliminated. Note that Hoffmann and Flanders (2006) make a somewhat different clarification concerning the method described by Hoffmann et al. (2006) in that in the application of Hoffmann et al., it was not PDC(S(1), . . . , (m)) that was estimated but rather a different quantity PDC(SE), where E = (X1 = 1, . . . , Xm = 1, Xm+1 = 0, . . . , Xk = 0), for which PDC(SE) is interpreted as the proportion of disease that would be eliminated if all individuals with E had their all exposures Xi set to 0. At the end of their paper Hoffmann et al. note the distinction between excess fraction and etiologic fractions but continue to refer to the quantity PDC as the “the proportion of disease due to a particular sufficient cause [or class of sufficient causes].” The language and interpretation should be modified. The quantities discussed by Hoffmann et al. (2006) correspond to what Greenland and Robins define as the “excess fraction.”
One further point concerning attributable fractions for sufficient causes, not considered by Hoffmann et al. (2006), merits attention. It has been noted (Greenland and Brumback, 2002; VanderWeele and Robins, 2007a) that there will in general be multiple ways to represent the outcome in terms of sufficient causes. For example, as noted above, any set of variables {Ai(ω)}i=0, . . . ,8 that are functions of {Dx1x2 (ω)}x1, x2∈{0,1} that satisfy (1) constitutes a sufficient cause representation of the potential outcomes, and more than one representation is in general possible. It is important to know then whether the approach of Hoffmann et al. described above is invariant to the sufficient cause representation. Hoffmann et al. (2006) obtain the formula for PDC(S(1), . . . , (m)) by arguing that the difference between PAF(X1, . . . , Xk) and PAF (X(m+1), . . . , X(k)) must constitute occurrences of the outcome that arise from not having eliminated sufficient causes containing one or more of X(1), . . . , X(m) but none of X(m+1), . . . , X(k) and from this it follows that PAF(X1, . . . , Xk) – PAF(X(m+1), . . . , X(k)) = ∑I⊆{(1), . . . , (m)} PDC(SI). Removing PDC(S(1), . . . , (m)) from the sum ∑I⊆{(1), . . . , (m)} PDC(SI) and rearranging gives the formula for PDC(S(1), . . . , (m)). The logic applies irrespective of the sufficient cause representation and thus the formula given by Hoffmann et al. (2006) is invariant to the sufficient cause representation. The results of Hoffmann et al. (2006) along with the discussion above establishes the following Theorem.
Theorem 1. If the effects of X1, . . . , Xk on D are monotonic then the formulas for the attributable fractions for sufficient causes given by Hoffmann et al. (2006), namely,
and PDC(S(1), . . . , (m)) given recursively by
give the excess fraction for a sufficient cause S(1), . . . , (m) irrespective of the sufficient cause representation.
As noted above, the quantity PDC(S12) given by Hoffmann et al. for the sufficient cause, A5X1X2, for example, corresponds to the “excess fraction” for the A5X1X2 sufficient cause. Under the monotonicity assumption, the formula for the excess fraction gives a lower bound on the etiologic fraction (Greenland and Robins, 1988). This is because “the proportion of disease that would be eliminated by blocking some sufficient cause S from operating” provides a lower bound on the “the proportion of disease due to the sufficient cause S.” Using theory for sufficient cause interactions we can draw further inferences concerning the proportion of disease that are in fact “due to” particular sufficient causes such as A5X1X2 i.e. concerning etiologic fractions. As is the case with Hoffmann et al., we will assume, at least initially, that the effects of X1 and X2 on D are monotonic. The presence of a sufficient cause interaction indicates, in the terminology of Greenland and Robins, that there is a non-zero “etiologic fraction.” Clearly the outcome will be due to the sufficient cause A5X1X2 only if X1 = 1, X2 = 1. Let Q be some subset of C. From the above discussion a lower bound on the prevalence of sufficient cause interactions amongst the group with X1 = 1, X2 = 1 with Q = q is given by
(6) |
For the group with X1 = 1, X2 = 1 and Q = q, the quantity in (6) will be a lower bound on the prevalence of sufficient cause interactions; for individuals ω with X1(ω) = 1, X2(ω) = 1, Q = q and with a sufficient cause interaction present, we will have that D11(ω) = 1, D10(ω) = D01(ω) = 0 and D = 1. Thus for these individuals the outcome must be due to the sufficient cause A5X1X2 since neither of the sufficient causes A0 or A1X1 can be the cause of the outcome since D10(ω) = 0, the sufficient cause A3X2 cannot be the cause of the outcome because D01(ω) = 0, and none of the sufficient causes A2X̅1, A4X̅2, A6X̅1X2, A7X1X̅2, A8X̅1X̅2 can be the cause of the outcome because X1 = 1, X2 = 1. Note that under an assumption of “sufficient cause monotonicity” the sufficient causes cannot be the cause of the outcome because A2X̅1, A4X̅2, A6X̅1X2, A7X1X̅2, A8X̅1X̅2 are eliminated by definition but even under the weaker counterfactual monotonicity that Dx1x2 (ω) is non-decreasing in x1 and x2 for all ω, clearly none of A2X̅1, A4X̅2, A6X̅1X2, A7X1X̅2, A8X̅1X̅2 can be the cause of D = 1 when X1(ω) = X2(ω) = 1. The above argument holds irrespective of the sufficient cause representation. Now, for stratum, Q = q, the set of individuals for whom X1 = 1, X2 = 1 and for whom the outcome is due to sufficient cause A5X1X2 must be a subset of the set of individuals for whom the outcome is due to sufficient cause A5X1X2. Thus a lower bound for the proportion of individuals in stratum Q = q for whom the outcome is due to sufficient cause A5X1X2 is given by:
From this it follows that a lower bound on the proportion of disease due to sufficient cause A5X1X2 in the population is given by
If the effects of X1 and X2 on D are unconfounded given C then the quantity [D11 – D10 – D01 + D00|X1 = 1, X2 = 1, q] can be estimated by
Because sufficient cause interactions concern statements about all possible sufficient cause representations, the argument above holds irrespective of the sufficient cause representation. We have thus established the following theorem.
Theorem 2. If the effects of X1 and X2 on D are monotonic then a lower bound on the etiologic fraction for the sufficient cause A5X1X2 is given by
(7) |
irrespective of the sufficient cause representation. If the effects of X1 and X2 on D are unconfounded given C then the quantity [D11 – D10 – D01 + D00|X1 = 1, X2 = 1, q] can be estimated by
Theorem 2 is stated for two binary exposures. However, as discussed in Appendix 1, the result generalizes to lower bounds for etiologic fraction for sufficient causes with k factors.
The results of Hoffmann et al. (2006) and the theorems given above required that the effects of all of X1, . . . , Xk on D be monotonic. Using theory for sufficient cause interactions we can, however, derive lower bounds for the etiologic fraction without the monotonicity assumption. Without monotonicity, by arguments similar to those above,
will be a lower bound on the prevalence of sufficient cause interactions amongst the group with X1 = 1, X2 = 1 and Q = q. For individuals ω with X1(ω) = 1, X2(ω) = 1, with a sufficient cause interaction present, we will have that D11(ω) = 1, D10(ω) = D01(ω) = 0 and D = 1. Thus for these individuals the outcome must be due to the sufficient cause A5X1X2 since, irrespective of the sufficient cause representation, neither of the sufficient causes A0 or A1X1 can be the cause of the outcome because D10(ω) = 0, and the sufficient cause A3X2 cannot be the cause of the outcome because D01(ω) = 0, and none of the sufficient causes A2X̅1, A4X̅2, A6X̅1X2, A7X1X̅2, A8X̅1X̅2 can be the cause of the outcome because X1 = 1, X2 = 1. Thus,
will be a lower bound on the proportion of individuals in stratum Q = q for whom the outcome is due to sufficient cause A5X1X2 and with X1 = 1, X2 = 1 and thus also a lower bound on the proportion of individuals in stratum Q = q for whom the outcome is due to the sufficient cause A5X1X2. From this it follows that a lower bound on the proportion of disease due to sufficient cause A5X1X2 in the population is given by
This establishes the following result.
Theorem 3. Without the assumption of monotonicity, a lower bound on the etiologic fraction for the sufficient cause A5X1X2 is given by
(8) |
irrespective of the sufficient cause representation. If the effects of X1 and X2 on D are unconfounded given C then the quantity [D11 – D10 – D01|X1 = 1, X2 = 1, q] can be estimated by
Theorem 3 is stated for two binary exposures but generalizes to lower bounds for etiologic fractions for sufficient causes with k factors. See Appendix 1 for further discussion.
We have seen above that Theorem 1 gives a method for calculating excess fractions for sufficient causes under the assumption of monotonicity. Theorem 2 gives a method for obtaining a lower bound on etiologic fraction for sufficient causes under the assumption of monotonicity. Theorem 3 gives a method for obtaining a lower bound on etiologic fraction for sufficient causes without the assumption of monotonicity. A question thus still remains about how to obtain a lower bound for excess fractions (rather than etiologic fractions) for sufficient causes when the monotonicity assumption does not hold. When monotonicity cannot be assumed, excess fractions for the A5X1X2 sufficient cause are not identified, but a lower bound for the excess fraction can be derived. Theorem 4 give a lower bound for the excess fraction in cases in which neither X1 nor X2 can be assumed to have a monotonic effect on D and also in cases in which just one of X1 or X2 have a monotonic effect on D. The proof of Theorem 4 is given in Appendix 2.
Theorem 4. Without the assumption of monotonicity, a lower bound on the excess fraction for the sufficient cause A5X1X2 is given by
(9) |
irrespective of the sufficient cause representation. If one of X1 or X2 have a monotonic effect on D then a lower bound on the excess fraction for the sufficient cause A5X1X2 is given by expression (8) above. If the effects of X1 and X2 on D are unconfounded given C then the quantity [D11 – D10 – D01 – D00|X1 = 1, X2 = 1, q] can be estimated by
The expression D11 – D10 – D01 – D00 is related to what might be referred a “singular” interaction, defined as the presence of an individual ω for whom D11(ω) = 1 but D10(ω) = D01(ω) = D00(ω) = 0 (VanderWeele and Richardson, 2010); in the context of two genetic factors such interactions are referred to as instances of “compositional epistasis” (Cordell, 2009; VanderWeele, 2010a, 2010b). If either X1 or X2 have a monotonic effect on D then a singular interaction and a sufficient cause interaction are equivalent. When neither X1 nor X2 have a monotonic effect on D, the condition for a singular interaction is stronger than that for a sufficient cause interaction.
Marginal Structural Models for Bounds on Attributable Fractions for Sufficient Cause Interactions
The expressions for attributable fractions given in Theorems 1–3 require making adjustment for a set of confounding factors C. When the set C consists of a small number of categorical variables the quantities in Theorems 1–3 could be estimated by stratifying on C. However when C contains many variables or some continuous variables such stratification will not in general be possible. Logistic regression could be employed to estimate quantities such as [D|X1 = 1, X2 = 1, c]. However, as discussed in VanderWeele et al. (2010a), within the sufficient cause framework, such an approach may be undesirable because regression models for [D|X1 = 1, X2 = 1, c] impose multiplicative relationships between the confounding variables C and the unknown background causes A0, A1, . . . , A8. Imposing such restriction is in general undesirable because in most cases the background causes A0, A1, . . . , A8 will be unknown and it will thus not be entirely clear what substantively is being assumed. To overcome this issue, VanderWeele et al. (2010a) proposed the use of marginal structural models (Robins, 1999; Robins et al., 2000) to draw inferences concerning sufficient cause interactions. In this section we discuss how this approach can be extended to draw inference concerning attributable fractions for sufficient cause interactions.
To apply Theorems 2 and 3, instead of specifying regression models for [D|X1 = 1, X2 = 1, c ] we could alternatively specify a marginal structural model for [Dx1x2|X1 = 1, X2 = 1, q]. For instance, if Q were binary, one could specify a saturated marginal structural model of the form
(10) |
Under the assumption of unconfoundedness conditional on C (i.e. ) marginal structural models can be fit using inverse probability of treatment weighting (IPTW, Robins, 1999; Robins et al., 2000). This IPTW technique has become quite routine in fitting marginal structural model not conditional on the exposures such as
(11) |
For example, if it were the case that then consistent estimators for (β0, β1, β2, β3, β4, β5, β6, β7) in model (11) can be obtained by fitting a Bernoulli regression with identity link of D on 1, X1, X2, X1X2, Q, QX1, QX2, QX1X2 with each subject ω weighted by the inverse probability of treatment weights of
where x1ω, x2ω and cω denote the values of X1, X2 and C respectively for individual ω (Robins, 1999). With two exposures, the weights wω might in practice be obtained by where
and where models for P(X1 = x1|C = c) and P (X2 = x2|C = c, X1 = x1) may, for example, be fit using logistic regression. When the marginal structural model is not saturated (for example, if q were continuous) then so called stabilized weights, where
and where qω denotes the value of Q for individual ω, may give smaller variance for the estimates of (β0, β1, β2, β3, β4, β5, β6, β7) (Robins, 1999). When a marginal structural model conditional on the exposures such as (10) rather than (11) is under consideration, a modified set of weights is needed (Sato and Matsuyama, 2003). In such cases, consistent estimators for (α0, α1, α2, α3, α4, α5, α6, α7) in model (10) can be obtained by fitting a Bernoulli regression with identity link of D on 1, X1, X2, X1X2, Q, QX1, QX2, QX1X2 with each subject ω weighted by weights of
where the weights might in practice be obtained by where
and where once again models for P(X1 = x1|C = c) and P (X2 = x2|C = c, X1 = x1) might be fit using logistic regression. Once estimates for (α0, α1, α2, α3, α4, α5, α6, α7) are obtained, the quantities (7)–(9) for lower bounds for etiologic and excess fraction fractions in Theorems 2–4, could be estimated by:
and
and
respectively. A similar approach could be employed when sufficient causes with k factors are considered.
Time-Dependent Confounding in Inference for Etiologic Fractions for Sufficient Cause Interactions
The preceding discussion assumed that the effects of X1 and X2 on D were unconfounded given C in the sense of . In some settings there may be an effect, L, of the first exposure, X1, that in turn causes both the second exposure, X2, and the outcome, D. Such an example is given in Figure 1.
Figure 1.
Example of time-dependent confounding.
In such cases, the unconfoundedness assumption, , will not in general hold. It may, however, still be the case that a sequential ignorability or unconfoundedness assumptions holds that
(12) |
Assumption (12) might be intuitively interpreted as that the effect of X1 on D is unconfounded given C and the effect of X2 on D is unconfounded given {C, X1, L}. Robins (1986, 1987) showed that under assumption (12), expected counterfactual outcomes of the form [Dx1x2 |q] with Q ⊆ C are identified. Robins (1986, 1987) furthermore conjectured that under assumption (12), expected counterfactual outcomes of the form [Dx1x2 |X1 = 1, X2 = 1, q] would not be identified if x1 = 0 i.e. the expected counterfactual outcomes [D00|X1 = 1, X2 = 1, q] and [D01|X1 = 1, X2 = 1, q] would not be identified under (12). In the discussion in the previous two sections it was the expected counterfactual outcomes of the form, [Dx1x2 |X1 = 1, X2 = 1, q], i.e. of the form not identified under (12), that were used to give lower bounds on etiologic fractions for sufficient causes.
Shpitser and Pearl (2009) recently confirmed the conjecture of Robins (1986, 1987) by providing a set of identification rules on causal directed acyclic graphs (Pearl, 1995) for general expressions involving “the effect of treatment on the treated” including rules for expected counterfactual outcomes of the form [Dx1x2 |X1 = 1, X2 = 1, q]. It follows from Theorem 3 of Shpitser and Pearl (2009) that if Figure 1 constitutes a causal directed acyclic graph then without further restrictions on X1, X2, D and without further assumptions, the expected counterfactual outcome [Dx1x2 |X1 = 1, X2 = 1, q] is unidentified for some x1, x2. In this case, [Dx1x2 |X1 = 1, X2 = 1, q] is not identified for x1 = 0. A proof is given in Appendix 2. The assumption that Figure 1 is a causal directed acyclic graph is a stronger assumption than that (12) holds (Robins, 2003) and thus [Dx1x2 |X1 = 1, X2 = 1, q] is not in general identified for x1 = 0 under (12).
Although [D00|X1 = 1, X2 = 1, q] and [D01|X1 = 1, X2 = 1, q] are not in general identified, we will show that when X1, X2, D are all binary some additional progress can be made in identifying the quantity in (7) used to obtain a lower bound for the etiologic fraction under monotonicity. The following theorem states the result formally. The proof is given in Appendix 2.
Theorem 5. Suppose that X1, X2 and D are binary, that and and furthermore that
(13) |
then expression (7) for the lower bound for the etiologic fraction under monotonicity, namely,
is identified because
An intuitive interpretation of assumption (13) can be given as follows. Assumption (13) will hold on the causal directed acyclic graph in Figure 1 if there is no interaction on the additive scale between the effects of L and X2 on D. A formal statement and proof of this assertion is given in Appendix 2. Assumptions such as (13) are useful also in the identification of natural direct and indirect effects. See Hafeman and VanderWeele (2010) and Robins et al. (2010) for further discussion.
Discussion
In this paper we have considered a number of results for attributable fractions for sufficient cause interactions. We have extended the distinction between “excess fractions” and “etiologic fraction” of Greenland and Robins (1988) from the setting of a single exposure to that of sufficient causes. Under assumptions of unconfoundedness we have discussed how to calculate excess fractions and how to obtain lower bounds for etiologic fractions, both with and without monotonicity and also how to obtain a lower bound for excess fractions without the assumption of monotonicity. The results here extend those of Hoffmann et al. in considering etiologic fractions in addition to excess fractions and in considering settings in which monotonicity assumptions do not hold. We have discussed a procedure using marginal structural models to obtain lower bounds for the etiologic fraction both with and without monotonicity and for the excess fraction without monotonicity; within the sufficient cause framework, this approach based on marginal structural models is preferable to a regression approach because it does not impose restrictions on the relationship between the confounding variables and the potentially unknown background causes. Finally we have discussed issues of identifiability of attributable fractions in settings with time-dependent confounding.
The present paper has been concerned principally with attributable fractions related to sufficient cause interactions. In particular, in the paper and the appendix we have considered excess and etiologic fractions for the sufficient cause with k factors in settings in which there are a total of k factors of interest. Future work could consider excess and etiologic fractions, with and without monotonicity, for sufficient sufficient causes with r < k factors in settings in which there are k factors of interest.
As discussed above, under monotonicity, a lower bound for the excess fraction also constitutes a lower bound for the etiologic fraction; thus the maximum of the excess fraction given in Theorem 1 and the lower bound on the etiologic fraction given in Theorem 2 still constitutes a lower bound on the etiologic fraction. Future work could thus consider whether it is in some sense possible to obtain sharp lower bounds on etiologic fractions and such work could also consider optimal choice of Q in the bounds for the etiologic fraction given in Theorems 2–4. As noted in Appendix 1, additional subtleties arise under the monotonicity assumption when etiologic fraction for sufficient causes with more than two factors are considered.
Appendix 1. Attributable fractions for n-way sufficient cause interactions
For k-way interactions, VanderWeele and Richardson (2010) showed that if there were an individual for whom
then there was an k-way sufficient cause interaction between X1, . . . , Xk. For any set of covariates Q,
(A1) |
constitutes a lower bound on the prevalence of sufficient cause interactions within stratum Q = q. If (i.e. if the effects of {X1, . . . , Xk} on D are unconfounded given C) and if Q ⊆ C, then the quantity in (A1) can be estimated by using
The approach to obtaining lower bounds on etiologic fractions for sufficient causes described in the main text can still be applied to sufficient causes with k factors. Using the same logic as for Theorems 2 and 3, we have that a lower bound for the etiologic fraction for the sufficient cause with X1, . . . , Xk will be given by
where, provided , the expression [Dx1. . . xk|X1 = 1, . . . , Xk = 1, q] can be estimated by
VanderWeele and Robins (2008) and VanderWeele and Richardson (2010) discussed conditions for 3-way and n-way sufficient cause interaction respectively under assumptions of monotonicity. For example, for a 3-way sufficient cause interaction, VanderWeele and Robins (2008) noted that if Dx1x2x3 were non-decreasing in x1, x2 and x3 then if any of the following three expressions are positive this suffices to conclude the presence of a 3-way sufficient cause interaction:
Each of these three quantities also constitutes a lower bound on the prevalence of 3-way sufficient cause interactions within stratum Q = q. Once again, using the logic of Theorems 2 and 3, all three of the following quantities constitute lower bounds on the etiologic fraction for the sufficient cause with X1, X2 and X3:
As before, if then [Dx1x2x3 |X1 = 1, X2 = 1, X3 = 1, q] can be estimated by
Since each of the three quantities above constitutes a lower bound on the etiologic fraction for the sufficient cause with X1, X2 and X3 it also follows that each maximum of these three expressions constitutes a lower bound on the etiologic fraction for the sufficient cause with X1, X2 and X3. Thus it is in fact also the case that ,
also constitutes a lower bound for the etiologic fraction for the sufficient cause with X1, X2 and X3. Whether bounds sharper than this can be obtained is an open question.
Appendix 2. Proofs
Proof of Theorem 4
For any sufficient cause representation satisfying 1, we have that P(D) =
(A2) |
If one were able to eliminate the A5X1X2 sufficient cause then the probability of the outcome would be
(A3) |
Because the event in (A3) is a subset of the event in (A2) the difference between these two expressions is given by
(A4) |
This final expression in (A4) is an exact formula for the excess fraction for the sufficient cause A5X1X2 but it is not in general identified because A0, A1, A3, A5 are latent. Clearly,
since the probability is always non-negative. For any individual ω such that D11(ω) = 1 but D10(ω) = D01(ω) = D00(ω) = 0 we must have by (1) that A0(ω) = A1(ω) = A3(ω) = 0 since D10(ω) = D01(ω) = D00(ω) = 0 and thus that A5(ω) = 1 since D11(ω) = 1. From this it follows that
From this it follows that the excess fraction for the sufficient cause A5X1X2 is greater than
and this establishes (9). If one of X1 or X2 has a monotonic effect on D then for any individual ω such that D11(ω) = 1 but D10(ω) = D01(ω) = 0 we also have D00(ω) = 0 and thus must have by (1) that A0(ω) = A1(ω) = A3(ω) = 0 and A5(ω) = 1. Thus
from which it follows that (8) would then be a lower bound on the excess fraction for the sufficient cause A5X1X2. This completes the proof.
Lack of Identification of Expected Counterfactual Outcomes Conditional on Exposures Under Time-Dependent Confounding
We use Theorem 3 of Shpitser and Pearl (2009) to show that [Dx1x2 |X1 = 1, X2 =1, Q = q] is not identified for arbitrary x1 and x2 in the causal directed acyclic graph constituted by Figure 1 which we will refer to as graph G. In Theorem 3 of Shpitser and Pearl (2009), we have for Figure 1 that X = {X1, X2}, Y = D, Z = {X1, X2}, W = ∅. Choose F = L and note that F is ancestral to Y ∪ Z = {D, X1, X2} in Gz where Gz is the graph G with the edges proceeding from Z = {X1, X2} removed. Now let W = X1 and note that there are directed paths from W = X1 to both X2 ∈ Z and Y = D in the graph Gz\{W} = GX2 i.e. in the graph G with the edges proceeding from X2 removed. The directed path from W = X1 to X2 in GX2 constituted by X1 → L → X2 has its first node in F = L. The directed path from W = X1 to D in GX2 constituted by X1 → L → D has its first node in F = L. From Theorem 3 of Shpitser and Pearl (2009) it follows that [Dx1x2 |X1 = 1, X2 = 1, Q = q] is not identified for arbitrary x1 and x2.
Proof of Theorem 5
We will show that [D11 – D10 – D01 + D00|X1 = 1, X2 = 1, q] is identified. First note that
and thus
Moreover, by (12), [D10|X1 = 1, q]
Thus we see that [D10|X1 = 1, X2 = 1, q] is identified and is given by
Also, clearly [D11|X1 = 1, X2 = 1, q] is identified since
It remains to show that [–D01 + D00|X1 = 1, X2 = 1, q] is identified. For [D01 |X1 = 1, X2 = 1, q] and [D00|X1 = 1, X2 = 1, q]
(A5) |
and
(A6) |
Solving (A5) for [D01|X1 = 1, X2 = 1, q] and (A6) for [D00|X1 = 1, X2 = 1, q] we obtain
Now, under (12), [D01|X1 = 1, q] – [D00|X1 = 1, q] is identified and given by
Furthermore, by (13), [D01|X1 = 1, X2 = 0, q] − [D00|X1 = 1, X2 = 0, q]
Thus, [D01 – D00|X1 = 1, X2 = 1, q =
Assumption 13 as a No-Interaction Assumption
We show that assumption (13) in the text, namely,
will hold for the causal directed acyclic graph given in Figure 1 if there is no additive-scale interaction between the effects of L and X2 on D in the following sense. If Figure 1 constitutes a causal directed acyclic graph (Pearl, 1995) then let f(x1, x2, c, l, ∈D) denote the non-parametric structural equation for D where ∈D is the random term for D. We will argue that assumption (13) above will hold for the causal directed acyclic graph in Figure 1 if there is no interaction between the effects of L and X2 on D in the sense that f (x1, x2, c, l, ∈D) can be written as
(A7) |
To see this note that under (A7) we have that [D01 – D00|X1 = 1, X2 = 0, c, l =
Define F = f2(X1, X2, C, ∈D) then using the counterfactual graphs of Shpitser and Pearl (2007) it follows that and we thus have
and this completes the proof.
Footnotes
The author thanks the editor and an anonymous referee for helpful comments on this paper. The research was supported by NIH grant R01 ES017876.
References
- Cordell HJ. Detecting gene-gene interaction that underlie human diseases. Nature Reviews Genetics. 2009;10:392–404. doi: 10.1038/nrg2579. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Greenland S, Robins JM. Conceptual problems in the definition and interpretation of attributable fractions. American Journal of Epidemiology. 1988;128:1185–1197. doi: 10.1093/oxfordjournals.aje.a115073. [DOI] [PubMed] [Google Scholar]
- Greenland S, Brumback B. An overview of relations among causal modelling methods. International Journal of Epidemiology. 2002;31:1030–1037. doi: 10.1093/ije/31.5.1030. [DOI] [PubMed] [Google Scholar]
- Hafeman DM, VanderWeele TJ.2010Alternative assumptions for the identification of direct and indirect effects Epidemiologyin press [DOI] [PubMed] [Google Scholar]
- Hoffmann K, Flanders WD. Letter re: Estimating the proportion of disease due to classes of sufficient causes. American Journal of Epidemiology. 2006;164:1253–1255. doi: 10.1093/aje/kwk082. [DOI] [PubMed] [Google Scholar]
- Hoffmann K, Heidemann C, Weikert C, Schulze MB, Boeing H. Estimating the proportion of disease due to classes of sufficient causes. American Journal of Epidemiology. 2006;163:76–83. doi: 10.1093/aje/kwj011. [DOI] [PubMed] [Google Scholar]
- Miettinen OS. Proportion of disease caused or prevented by a given exposure, trait or intervention. American Journal of Epidemiology. 1974;99:325–332. doi: 10.1093/oxfordjournals.aje.a121617. [DOI] [PubMed] [Google Scholar]
- Pearl J. Casual diagrams for empirical research (with discussion) Biometrika. 1995;82:669–710. doi: 10.1093/biomet/82.4.669. [DOI] [Google Scholar]
- Robins JM. A new approach to causal inference in mortality studies with sustained exposure period - application to control of the healthy worker survivor effect. Mathematical Modelling. 1986;7:1393–1512. doi: 10.1016/0270-0255(86)90088-6. [DOI] [Google Scholar]
- Robins JM. Addendum to a new approach to causal inference in mortality studies with sustained exposure period - application to control of the healthy worker survivor effect. Computers and Mathematics with Applications. 1987;14:923–945. doi: 10.1016/0898-1221(87)90238-0. [DOI] [Google Scholar]
- Robins JM. Marginal structural models versus structural nested models as tools for causal inference. In: Halloran ME, Berry D, editors. Statistical Models in Epidemiology: The Environment and Clinical Trials. NY: Springer-Verlag; 1999. pp. 95–134. [Google Scholar]
- Robins JM. Semantics of causal DAG models and the identification of direct and indirect effects. In: Green P, Hjort NL, Richardson S, editors. Highly Structured Stochastic Systems. Oxford University Press; New York: 2003. pp. 70–81. [Google Scholar]
- Robins JM, Greenland S. Estimability and estimation of excess and etiologic fractions. Statistics in Medicine. 1989;8:845–859. doi: 10.1002/sim.4780080709. [DOI] [PubMed] [Google Scholar]
- Robins JM, Hernán MA, Brumback B. Marginal structural models and causal inference in epidemiology. Epidemiology. 2000;11:550–560. doi: 10.1097/00001648-200009000-00011. [DOI] [PubMed] [Google Scholar]
- Robins JM, Richardson TS, Spirtes P.2010On identification and inference for direct effects Epidemiologyin press [Google Scholar]
- Rothman KJ. Causes. American Journal of Epidemiology. 1976;104:587–592. doi: 10.1093/oxfordjournals.aje.a112335. [DOI] [PubMed] [Google Scholar]
- Rothman KJ, Greenland S, Lash TL. Modern Epidemiology. 3rd edition. Philadelphia: Lippincott Williams and Wilkins; 2008. [Google Scholar]
- Sato T, Matsuyama Y. Marginal structural models as a tool for standardization. Epidemiology. 2003;14:680–686. doi: 10.1097/01.EDE.0000081989.82616.7d. [DOI] [PubMed] [Google Scholar]
- Shpitser I, Pearl J. What counterfactuals can be tested. Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence; 2007. pp. 352–359. [Google Scholar]
- Shpitser I, Pearl J.2009Effects of treatment on the treated: identification and generalizationUCLA Cognitive Systems Laboratory, Technical Report (R-349). Full version available at: http://bayes.cs.ucla.edu/csl_papers.html
- VanderWeele TJ. Sufficient cause interactions and statistical interactions. Epidemiology. 2009;20:6–13. doi: 10.1097/EDE.0b013e31818f69e7. [DOI] [PubMed] [Google Scholar]
- VanderWeele TJ.2010aEpistatic interactions Statististical Applications in Genetics and Molecular Biology 9(Article 1)1–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- VanderWeele TJ. Empirical tests for compositional epistasis. Nature Reviews Genetics. 2010b;11:166. doi: 10.1038/nrg2579-c1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- VanderWeele TJ, Richardson TS. General theory for sufficient cause interactions for dichotomous exposures. Under revision for Annals of Statistics. 2010. [DOI] [PMC free article] [PubMed]
- VanderWeele TJ, Robins JM. The identification of synergism in the sufficient-component cause framework. Epidemiology. 2007a;18:329–339. doi: 10.1097/01.ede.0000260218.66432.88. [DOI] [PubMed] [Google Scholar]
- VanderWeele TJ, Robins JM. Directed acyclic graphs, sufficient causes and the properties of conditioning on a common effect. American Journal of Epidemiology. 2007b;166:1096–1104. doi: 10.1093/aje/kwm179. [DOI] [PubMed] [Google Scholar]
- VanderWeele TJ, Robins JM. Empirical and counterfactual conditions for sufficient cause interactions. Biometrika. 2008;95:49–61. doi: 10.1093/biomet/asm090. [DOI] [Google Scholar]
- VanderWeele TJ, Robins JM. Minimal sufficient causation and directed acyclic graphs. Annals of Statistics. 2009;37:1437–1465. doi: 10.1214/08-AOS613. [DOI] [Google Scholar]
- VanderWeele TJ, Vansteelandt S, Robins JM.2010aMarginal structural models for sufficient cause interactions American Journal of Epidemiologyin press 10.1093/aje/kwp396 [DOI] [PMC free article] [PubMed] [Google Scholar]
- VanderWeele TJ, Hernández-Diaz S, Hernán MA.2010bCase-only gene-environment interaction studies: when does association imply mechanistic interaction? Genetic Epidemiologyin press [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vansteelandt S, VanderWeele TJ, Tchetgen EJ, Robins JM. Journal of the American Statistical Association. 2008. Multiply robust inference for statistical interactions; pp. 103pp. 1693–1704. [DOI] [PMC free article] [PubMed] [Google Scholar]