Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Sep 1.
Published in final edited form as: Epidemiology. 2014 Sep;25(5):711–722. doi: 10.1097/EDE.0000000000000096

Attributing effects to interactions

Tyler J VanderWeele 1, Eric J Tchetgen Tchetgen 2
PMCID: PMC4219627  NIHMSID: NIHMS637398  PMID: 25051310

Abstract

A framework is presented that allows an investigator to estimate the portion of the effect of one exposure that is attributable to an interaction with a second exposure. We show that when the two exposures are independent, the total effect of one exposure can be decomposed into a conditional effect of that exposure and a component due to interaction. The decomposition applies on difference or ratio scales. We discuss how the components can be estimated using standard regression models, and how these components can be used to evaluate the proportion of the total effect of the primary exposure attributable to the interaction with the second exposure. In the setting in which one of the exposures affects the other, so that the two are no longer independent, alternative decompositions are discussed. The various decompositions are illustrated with an example in genetic epidemiology. If it is not possible to intervene on the primary exposure of interest, the methods described in this paper can help investigators to identify other variables that, if intervened upon, would eliminate the largest proportion of the effect of the primary exposure.


In some settings, the effect of a particular exposure may be substantially altered in the presence or absence of a second exposure, so that some form of interaction exists between these two exposures.1,2 In such cases, it may be of interest to determine the extent to which the overall effect of the primary exposure of interest is due to the presence of the secondary exposure, and the primary exposure’s interaction with it. We present an analytic framework within which to address such questions. We show that, if the distributions of the two exposures are statistically independent in the population, then the overall effect of the primary exposure can be decomposed into two components - the first being the effect of the primary exposure when the secondary exposure is removed, and the second being a component due to interaction. Such decompositions can be useful in settings in which it is not possible to intervene on the primary exposure of interest and an investigator is interested in trying to identify other variables that, if intervened upon, would eliminate much or most of the effect of the primary exposure of interest. We show how this decomposition applies on an additive scale and on a risk ratio scale, and how regression models can be used to estimate each of the components. We discuss extensions to settings in which the two exposures are not independent but rather when one affects the other, and we also discuss a decomposition of joint effects of both exposures and relate these to Rothman’s measures of the attributable proportion due to interaction.13 The decompositions are illustrated with an example from genetic epidemiology. We begin with introducing notation. We will keep both the notation and the setting relatively simple in the paper but consider more complex settings in the Appendix and eAppendix.

Definitions and Notation

We will let G and E denote two exposures of interest. These may be genetic and environmental exposures, respectively, but they could also both be genetic, or both environmental, or one or both could be behavioral. We will, for simplicity in exposition, refer to the first as a genetic exposure and the second as an environmental exposure. When the ordering of the exposures is relevant we will assume that G precedes E. We will assume for simplicity that both exposures are binary; however we consider more general settings in the appendix.

Let Y be an outcome of interest that may be binary or continuous. When the outcome is binary, for variable(s) X, we will use px = P(Y = 1|X = x) to denote the probability of the outcome conditional on X = x. If the effect of G on Y is unconfounded, then pg=1pg=0 = P(Y = 1|G = 1) − P(Y = 1|G = 0) would equal to the effect of G on Y. If the effect of E on Y is unconfounded, then pe=1pe=0 = P(Y = 1|E = 1) − P(Y = 1|E = 0) would equal to the effect of E on Y. In the exposition in the text, we will assume that there is no confounding for the effects of G and E on Y, but in the appendix we consider analogous results when the effects are unconfounded only conditional on some set of covariates C.

With a binary outcome we will also use pge = P(Y = 1|G = g, E = e) to denote the probability of the outcome when G = g and E = e. The standard interaction contrast on the additive scale would be written as (p11p10p01 + p00) and assesses the extent to which the effect of both exposures together exceeds the effect of each considered separately.

Attributing Total Effects to Interactions Under Independence

Suppose now that the two exposures G and E are statistically independent (and thus uncorrelated) in the population and suppose that the effects of G and E on Y are unconfounded. We show in the Appendix that:

(pe=1pe=0)=(p01p00)+(p11p10p01+p00)P(G=1).

We can decompose the overall effect of E on Y into two pieces. The first piece is the conditional effect of E on Y when G = 0; the second piece is the standard additive interaction, (p11p10p01+p00), multiplied by the prevalence of G = 1. We can then attribute the total effect of E on Y to the part that would be present still if G were 0 (this is p01p00), and to a part that has to do with the interaction between G and E (this is (p11p10p01 + p00)P(G = 1)). If we could set the genetic exposure to 0, we would remove the part that is due to the interaction and would be left with only p01p00.

Since we can do this decomposition, we might de.ne a quantity pAIG=0(E) as the proportion of the overall effect of E that is attributable to interaction, with a reference category for the genetic exposure of G = 0, as

pAIG=0(E)=(p11p10p01+p00)P(G=1)(pe=1pe=0).

The remaining portion (p01p00)/(pe=1pe=0) is the proportion of the effect of E that would remain if G were fixed to 0. The proportion attributable to interaction could then be interpreted as the proportion of the effect of E we would eliminate if we fixed G to 0.

If Y is continuous, again assuming that G and E are independent, we have a similar decomposition, 𝔼[Y|E = 1] − 𝔼[Y|E = 0] =

𝔼[Y|G=0,E=1]𝔼[Y|G=0,E=0]+{𝔼[Y|G=1,E=1]𝔼[Y|G=1,E=0]𝔼[Y|G=0,E=1]+𝔼[Y|G=0,E=0]}P(G=1)

and we could likewise define the proportion attributable to interaction by

pAIG=0(E)={𝔼[Y|G=1,E=1]𝔼[Y|G=1,E=0]𝔼[Y|G=0,E=1]+𝔼[Y|G=0,E=0]}P(G=1)𝔼[Y|E=1]𝔼[Y|E=0]

The two components of the decomposition - the portion due to interaction and the portion due to the effect of E when G is fixed to 0 - also have an intuitive form within a regression framework.

Consider the following regression model in which Y might be binary or continuous:

𝔼[Y|G=g,E=e]=α0+α1g+α2e+α3eg. (1)

We show in the appendix that irrespective of whether the outcome is binary or continuous, if G and E are independent, then the total effect of E on Y is given by α2 + α3P(G = 1), the portion due to interaction is equal to α3P(G = 1), and the portion due to the effect when G is fixed to 0 is equal to α2. Thus the proportion due to interaction is simply

pAIG=0(E)=α3P(G=1)α2+α3P(G=1).

The portion due to the effect when G is fixed to 0 is simply the main effect of E in the regression model, α2. The portion due to interaction is just the product coefficient α3 multiplied by the probability that G = 1.

Note that under the assumption that G and E are independent, the roles of G and E can be interchanged. Thus with a binary outcome we could likewise decompose the total effect of G on Y by: (pg=1pg=0) = (p10p00) + (p11p10p01 + p00)P(E = 1). We could de.ne the proportion of the effect of G that is attributable to interaction (with a reference category for E of E = 0) as pAIE=0(G)=(p11p10p01+p00)P(E=1)(pg=1pg=0). Expressed in terms of the coefficients of the regression model in (1) we have pAIE=0(G)=α3P(E=1)α1+α3P(E=1).

To the best of our knowledge, this approach has not been previously described. The approach we have been considering thus far has assumed that the two exposures G and E are independent. As we will see later in the paper, the decomposition becomes somewhat more complicated when G and E are no longer independent in the population. Even under independence, the implications of the approach are also sometimes more subtle than they first appear. From the formulae above, the proportion attributable to interaction depends on the main effect coefficient for the primary exposure of interest, the interaction coefficient, and the prevalence of the secondary exposure. Because of this, it would, for example, be possible for the main effect of G, namely α1, to be larger than the main effect of E, α2, while it still also being the case that the proportion of the effect of G attributable to the interaction is larger than the proportion of the effect o E attributable to interaction. This could occur, for instance, if the prevalence of G was relatively small with the prevalence of E being comparatively larger.

Attributing Total Effects to Interactions on the Ratio Scale

Often, when an outcome is binary, a ratio scale is used to measure effects. We would define the relative risk for G as RRg=1=pg=1pg=0=P(Y=1|G=1)P(Y=1|G=0). Likewise we would de.ne the relative risk for E by RRe=1=pe=1pe=0=P(Y=1|E=1)P(Y=1|E=0). We can also define relative risks when G and E are considered together; we would de.ne the relative risk for the outcome Y, comparing G = g, E = e to the reference category G = 0, E = 0, as RRge=pgep00=P(Y=1|G=g,E=e)P(Y=1|G=0,E=0).

It is shown in the Appendix that if G and E are independent then we have the decomposition of the excess relative risk for E as:

(RRe=11)=κ(RR011)+κ(RR11RR10RR01+1)P(G=1).

where κ is a scaling factor given by κ=p00pe=0. As on the difference scale, so also on the ratio scale, we can decompose the excess relative risk for E, into two components: the first component is the excess relative risk for E if G were fixed to 0, (RR01 − 1), and the second component is a portion of the effect due to interaction, (RR11RR10RR01 + 1)P(G = 1). The contrast, RR11RR10RR01 + 1, is sometimes referred to as the "relative excess risk due to interaction" (RERI)3 or the "interaction contrast ratio".2 We can thus re-express the decomposition above as: (RRe=1 − 1) = κ(RR01 − 1) + κ(RERI)P(G = 1). Because of the scaling factor κ, it does not necessarily make sense to estimate the specific portions, κ(RR01 − 1), and κ(RERI)P(G = 1), of the total effect, but if we consider the proportion of the effect of E attributable to interaction, then the scaling factor κ drops out and we obtain:

pAIG=0(E)=(RERI)P(G=1)(RR011)+(RERI)P(G=1).

By symmetry a similar decomposition holds for the overall effect of G on Y on the risk ratio scale and we have the proportion of the effect of G attributable to interaction as

pAIE=0(G)=(RERI)P(E=1)(RR101)+(RERI)P(E=1).

Often a logistic regression model is used in analyzing data with a binary outcome on the ratio scale. Consider the logistic regression model

logit{P(Y=1|G=g,E=e)}=γ0+γ1g+γ2e+γ3eg. (2)

If the outcome is rare, then odds ratios approximate risk ratios and RERI is given approximately by RERIeγ123eγ1eγ2 + 1, and RR10 and RR01 can be estimated approximately by RR10eγ1 and RR01eγ2. We can thus still estimate all of the components of the proportions attributable to interaction using the estimates from the logistic regression in (2) and could compute these proportions by:

pAIG=0(E)(eγ1+γ2+γ3eγ1eγ2+1)P(G=1)(eγ21)+(eγ1+γ2+γ3eγ1eγ2+1)P(G=1)pAIE=0(G)(eγ1+γ2+γ3eγ1eγ2+1)P(E=1)(eγ11)+(eγ1+γ2+γ3eγ1eγ2+1)P(E=1).

As discussed in the Appendix, these same expressions can be used even when control is made for covariates in the logistic regression. This approach also works when using logistic regression in a case-control study. If the outcome is rare or incidence density sampling is used then we can estimate the various components in the decomposition by RR10eγ1, RR01eγ2, and RERIeγ123eγ1eγ2 + 1 and, in addition, P(G = 1) and P(E = 1) can be estimated approximately in a case-control study using the probability of G and E respectively among the controls. Thus we can proceed with estimating the components of the decomposition, even in a case-control study.

Standard errors for these various expressions, using the delta method, along with SAS and Stata code to estimate proportions attributable to interaction and their standard errors, using logistic regression, are given in the eAppendix. If the sample size is relatively small it may be preferable to use bootstrapping to obtain standard errors. A similar approach can also be employed if control is made for some set of covariates C or if one or both of the exposures are continuous rather than binary; see eAppendix for details.

One of the motivations often given for studying interaction, specifically on the additive scale, is to identify which subgroups would benefit most from intervention when resources are limited.13 In settings in which it is not possible to intervene directly on the primary exposure of interest, one might instead be interested in which other covariates could be intervened upon to eliminate much or most of the effect of the primary exposure of interest. The methods here for attributing effects to interactions can be useful in assessing this and identifying the most relevant covariates for intervention.

Relaxing the Independence Assumption

Our discussion up until now has assumed that the distributions of the two exposures are statistically independent in the population. This assumption may not always be plausible. If G and E represent genetic and environmental exposures, then the assumption of independence in the population is often not unreasonable, though there are documented cases4, 5 in which genetic variants do affect environmental exposures and so the assumption has to be assessed on a case-by-case basis. When the exposures are two environmental factors, or two behavioral factors, the two exposures may often be correlated with each other. In this section we will consider what can be concluded when the two exposures are instead correlated.

We will assume here that the ordering of the two exposures is known (e.g. that G precedes E). In this setting, even if G affects E, the decompositions we have considered in the previous sections will still apply for the second exposure, i.e. for E, provided the effect of E on Y is unconfounded conditional on G (and conditional on, if applicable, measured covariates C). Under this assumption of no confounding for E, we will still have that the total effect of E decomposes into the sum (p01p00) + (p11p10p01 + p00)P(G = 1) on the absolute risk scale, and we can use the sum of these two components as our estimate of the total effect. Likewise, the regression method in the previous section will still be applicable and (p11p10p01+p00)P(G=1)(p01p00)+(p11p10p01+p00)P(G=1) would constitute the proportion of the effect attributable to interaction. And similarly on the ratio scale, (RERI)P(G=1)(RR011)+(RERI)P(G=1) would still constitute the proportion of the effect attributable to interaction. The methods in the previous two sections still apply even if G affects E, or if G and E are otherwise correlated.

However, the decomposition of a total effect into a conditional effect and an interaction considered in previous sections do not apply directly for the first exposure G, when G affects E. Intuitively, this is because the effect of G on Y does not depend only on the presence or absence of E, but it is also the case that whether E is itself present (and thus whether the interaction operates) depends on G. Said another way, if G affects E, E is not simply an effect modifier for G, but it is also potentially a mediator for G. Our decompositions above are no longer applicable. An alternative decomposition does, however, hold. Specifically it can be shown (see Appendix) that, when G affects E, we have the following decomposition for the total effect of G:

(pg=1pg=0)=(p10p00)+(p11p10p01+p00)P(E=1|G=1)+(p01p00){P(E=1|G=1)P(E=1|G=0)}.

The decomposition of the total effect of G, (pg=1pg=0), now consists of three components. We will consider each component in turn. The first component (p10p00) is simply the effect of G in the absence of E i.e. the portion of the effect of G that would remain if E were fixed to 0. This is analogous to the first component in the two-way decompositions above. The second component, (p11p10p01 + p00)P(E = 1|G = 1), is the effect attributable to interaction, but now the interaction term, (p11p10p01 + p00), is multiplied by P(E = 1|G = 1) when G affects E rather than being multiplied by P(E = 1), as when G and E were independent; note when G and E are independent, P(E = 1|G = 1) reduces to P(E = 1). The third component, (p01p00){P(E = 1|G = 1) − P(E = 1|G = 0)}, was absent from the two-way decomposition; it is essentially the main effect of E in the absence of G, (p01p00), multiplied by the effect of G on E, {P(E = 1|G = 1) − P(E = 1|G = 0)}; it could be interpreted as a mediated main effect; note again when G and E are independent P(E = 1|G = 1) − P(E = 1|G = 0) = 0 and thus this third component vanishes. In the Appendix we further discuss the relationship between this decomposition and the decompositions in the mediation analysis literature.

Thus when G affects E, and we are decomposing the total effect of G two things happen to the decomposition we had under independence. First, because G affects E, we need to take into account the fact that the presence of E (and thus the possibility that the interaction between the two operates) is itself affected by G and thus the interaction term in the second component is multiplied by P(E = 1|G = 1), rather than P(E = 1). Second, when G affects E, a change in G from 0 to 1 will also change E and thus the main effect of E is more likely to operate and we thus introduce a third component, (p01p00){P(E = 1|G = 1) − P(E = 1|G = 0)} to the decomposition.

Under this setting of G affecting E, the proportion of the effect attributable to interaction becomes:

pAIE=0(G)=(p11p10p01+p00)P(E=1|G=1)(pg=1pg=0).

In this context, we might also wonder what the consequences are of ignoring dependence between G and E and proceeding with estimating the proportion attributable to interaction measure when independence of G and E is (incorrectly) assumed, i.e. of using the measure

pAIE=0(G)=(p11p10p01+p00)P(E=1)(pg=1pg=0).

It is shown in the Appendix that if the latter measure is used for the proportion attributable to interaction, incorrectly assuming independence, then although the latter measure does not actually capture the proportion of the effect attributable to interaction, it does nonetheless constitute a lower bound on the proportion of the effect of G that would be eliminated by fixing E to 0, provided G has a non-negative effect on E, and provided E has a non-negative effect on Y (at least in the absence of G). Thus even if one proceeds with the more naive estimate of the proportion attributable to interaction, ignoring (incorrectly) the dependence between G and E, one still, under fairly reasonable assumptions, obtains a lower bound on the proportion of the effect of G eliminated by fixing E to 0.

Further extensions to this approach of relaxing the assumption of independence are discussed in the Appendix. There this is generalized to non-binary exposures and outcomes, to the ratio scale, and to settings in which covariates are needed to control for confounding.

When G affects E, two other alternative approaches are worth noting. First instead of decomposing the total effect into a component due to interaction and the various main effects, one might alternatively use methods for mediation. If G affects E and E affects Y, then E will in general be a mediator for the effect of G on Y, and one can assess how much of the effect of G on Y is mediated by E. Methods for mediation and easy-to-use software packages6,7 are now available to carry out such mediation analysis. These methods now also allow for interactions between the two exposures G and E.7,8 Since these methods are described elsewhere we will not consider them in detail here. It should be noted, however, that these methods address different questions than the ones we have been considering in this paper. However, when G affects the second exposure E, the questions concerning mediation may be the more relevant questions of interest. One can use these methods to assess the proportion of the effect of G on Y mediated through E.

This proportion-mediated measure is related to, but not identical with, the proportion eliminated discussed above.9,10 The proportion eliminated is not always identical to the proportion mediated because it considers what would happen if we fixed the second exposure (the mediator E) to a particular level (rather than allowing G to affect it). See VanderWeele10 for further discussion. The decomposition above also gives an interpretation to the portion eliminated measure: it states that the difference between the total effect and the portion of the effect that would remain if E were fixed to zero is equal to the sum of the interaction term and the mediated main effect (i.e. the second and third terms in the decomposition above). Second, yet another approach to assess the importance of interaction with regard to G when G itself affects E is to decompose not a total effect of G on Y, but rather to focus on the joint effects of G and E together and to decompose this joint effect. This is the approach we consider in the following section.

Decomposition of Joint Effects into Main Effects and an Interactive Component

Another decomposition would be to decompose the joint effects of the two exposures, G and E, into three components, the effect due to G alone, the effect due to E alone and their interaction. On the risk difference scale this is

p11p00=(p10p00)+(p01p00)+(p11p10p01+p00).

We could then also compute the proportion of the effect due to G alone, (p10p00)(p11p00), due to E alone, (p01p00)(p11p00), and due to their interaction, (p11p10p01+p00)(p11p00). We can carry out a decomposition like this even if G affects E.

On the risk ratio scale, we can decompose the excess relative risk for both exposures RR11 − 1 into the excess relative risk for G alone, for E alone, and the excess relative risk due to interaction, RERI. Specifically we have

RR111=(RR101)+(RR011)+RERI.

We could then likewise compute the proportion of the effect due to G alone, RR101RR111, due to E alone, RR011RR111, and due to their interaction RERIRR111.

Under the logistic regression model in (2) for an outcome that is rare, the joint effect attributable to G alone, E alone, and to their interaction are given approximately by:

RR101RR111eγ11eγ1+γ2+γ31RR011RR111eγ21eγ1+γ2+γ31RERIRR111(eγ1+γ2+γ3eγ1eγ2+1)eγ1+γ2+γ31.

As discussed in the Appendix, these same expressions can be used even when control is made for covariates in the logistic regression. In the eAppendix we give standard errors for these proportion measures as well as SAS and Stata code to estimate the proportions and their standard errors and 95% confidence intervals.

Rothman3 considered a measure of interaction that he called the attributable proportion, defined as RERIRR11; the denominator Rothman used was RR11. The measure was meant to capture the proportion of the disease in the doubly exposed group that is due to the interaction. Rothman3 also considered an alternative measure, RERIRR111, which captured the proportion of the effect of both exposures on the additive scale that is due to interaction. Most of the subsequent literature has focused on the former measure and likewise most of the other literature on attributable fractions focuses on the proportion of disease attributable to an exposure,2 or to an interaction. However using the latter measure, i.e. using RR11 − 1, as the denominator, and focusing on the proportion of the effect due to interaction in fact has a number of advantages: both the numerator and the denominator are then on the additive excess relative risk scale; when the entirety of the effect is due to the interaction, the latter measure is then 100% and not some number less than 100%; and the latter measure is moreover invariant to recoding of the outcome.11 Furthermore, as we have shown here, the latter measure is what is involved in the decomposition above. With Rothman’s primary measure, RERIRR11, even if all of the joint effect were due to interaction so that the effect of G alone and E alone were both risk ratios of 1, i.e. RR10 = 1 and RR01 = 1, we would nevertheless have that Rothman’s primary attributable proportion measure would be RERIRR11=R11R10R01+1RR11=R1111+1RR11=R111RR11<1; i.e. even if the entirety of the joint effect of both exposures were due to interaction, the attributable proportion measure is still less than 100%. The measure RERIRR111 does not have this issue. It is 100% when the main effects of G alone and E alone were both risk ratios of 1; i.e. when the entirety of the joint effect is due to interaction. The measure RERIRR111 captures the proportion of the joint effect attributable to interaction.

The attributable proportion of joint effects measure, RERIRR111, is also attractive from another standpoint. Skrondal12 criticized Rothman’s original attributable proportion measure because, in the presence of covariates, if the risks follow a linear risk model that is additive in the covariates, P(Y = 1|G = g, E = e, C = c) = α0 + α1g + α2e + α3ge + α4c, then, although the additive interaction, p11p10p01 + p00 = α3, does not vary across strata of the covariates, Rothman’s primary attributable proportion measure, RERIRR11=α3α0+α1+α2+α3+α4c, does vary across strata of the covariates. One may or may not think that this is an important criticism of the attributable proportion measure; however, attributable proportion measure for effect, RERIRR111=α3α1+α2+α3, does not vary with the covariates and thus circumvents this criticism entirely.

Empirical Illustration

We illustrate the various decompositions with an example from genetic epidemiology. We use data from a case-control study of lung cancer at Massachusetts General Hospital (Miller et al.13) of 1836 cases and 1452 controls. Eligible cases included any person over the age of 18 years; the controls were recruited from among the friends or spouses of cancer patients or the friends or spouses of other surgery patients in the same hospital. The study included information on smoking and genotype information on locus 15q25.1. For simplicity in this illustration, we will code the exposure as binary so that smoking is ever vs. never and the genetic variant is a comparison of 0 vs. 1 or 2 T alleles at rs8034191. See the Appendix and eAppendix for approaches handling ordinal and categorical exposures. Covariate data include age (continuous), sex and educational history (college degree or more, yes / no). Analyses were restricted to white persons. Genetic variants on 15q25.1 have been found to be associated with both smoking and lung cancer5,14,15 and thus we are in a setting in which the first exposure G is correlated with the second exposure E. The prevalences of the exposures among the controls (which approximates that of the underlying population since the outcome is rare) is P(G = 1) = 56.7% and P(G = 1) = 64.94%. When we fit the logistic regression model in (2), adjusting also for covariates, we obtain estimates: γ1 = 0.04 (95% CI= −0.33 to 0.41), γ2 = 1.33 (1.01 to 1.64), γ3 = 0.49 (0.08 to 0.89). The main effect of G is small, the main effect of E is large, and the interaction is of moderate size. If we use the regression coefficients to calculate the proportion attributable to interaction for E we obtain a proportion of 34.5% (7.1% to 61.9%). Even if we eliminated the genetic exposure, 65.5% of the effect would remain (34.5% would be eliminated). The results are summarized in the first two lines of the Table.

Table.

Summary of Results for the Empirical Illustration* on Proportion of the Total Effect of E due to Interaction (36.6%), the Total Effect of G due to Interaction (98.1%) and the Joint Effects of G and E due to Interaction (47.1%).

Decomposition Components Empirical Estimates
The Effect of E
(RR11RR10RR01+1)P(G=1)(RR011)+(RR11RR10RR01+1)P(G=1)+(RR011)(RR011)+(RR11RR10RR01+1)P(G=1)
34.5% + 65.5% = 100%
The Effect of G
(RR11RR10RR01+1)P(E=1)(RR101)+(RR11RR10RR01+1)P(G=1)+(RR101)(RR101)+(RR11RR10RR01+1)P(E=1)
97.6% + 2.4% = 100%
The Joint Effects
RR101RR111+RR011RR111+RERIRR111
0.8% + 51.4% + 47.8% = 100%
*

The estimates and components are all also conditional on measured covarates (age, sex and educational history), omitted from the notation in the second column to conserve space

We could proceed with a similar analysis with G but because G affects E here we need to be somewhat more careful in interpretation. Here, however, the correlation between G and E, although present, is quite weak, and so the decomposition assuming independence might not be a bad approximation. If we proceed with the decomposition we obtain that the proportion of the effect of G due to interaction is 98% (58% to 137%). Almost all of the effect of G is due to the presence of E and its interaction with E. As discussed above if we can assume that the variants increase smoking, and that smoking increases lung cancer (both reasonable assumptions here), then 98% (95% CI= 58% to 137%) would be a lower bound on the proportion of the effect of G that would be eliminated if we were to eliminated smoking. And, indeed, there is now strong evidence elsewhere that the genetic variants do not have an effect on lung cancer for non-smokers.16,17 Almost the entirety of the effect of G appears due to the interaction.

If we proceed with the decomposition of the joint effect, then the proportions attributable to G alone, E alone, and to their interaction are:

RR101RR1110.8%(95%CI=6.2%to7.7%)RR011RR11151.4%(33.4%to69.4%)RERIRR11147.8%(33.3%to62.3%).

The results are summarized in the third line of the Table. Almost none of the joint effect (comparing both G and E present to both absent) is due to the effect of G in the absence of E, about 51% is due to E is the absence of G, and about 48% is due to the interaction between G and E. Note that the decompositions for total effects and for joint effects differ in their denominators and so are not directly comparable to each other: the decomposition for joint effect considers the proportion of the effect due to interaction when comparing both exposures present versus neither present; the decompositions for total effects considers the proportion of the effect due to interaction when one exposure is present (and the other is fixed at its actual level) versus when that same single exposure is absent.

Discussion

In this paper we have considered the decomposition of a total effect into a conditional effect when the other exposure is fixed to 0 and a component due to interaction. This decomposition can be done with both exposures if the two exposures are independent, but can be done only with the second exposure in settings in which the first exposure affects the second. Other decompositions for the first exposure are then possible, but the interpretation becomes somewhat more complicated. Even in this case, the joint effects of both exposures can still be decomposed into the component due to the first exposure alone, that due to the second exposure alone, and that due to their interaction. In the Appendix fairly general methods are given using linear regression for carrying out these decompositions with binary, ordinal or continuous exposures. In the eAppendix methods and software are provided for these decompositions using logistic regression and linear regression when the outcome is binary or outcomes and the exposures are binary or continuous. These various decompositions can shed light on the proportion of various effects that are attributable to interaction.

Several motivations are commonly given for assessing interaction: first, to identify subgroups for which an intervention on the exposure might be most effective in settings in which resources are limited13,18; second, to assess evidence for mechanistic forms of interaction2,1921; third, to leverage interaction to increase power to detect genetic effect2224; and fourth to allow for additional flexibility in statistical models.2,25 The methods described in this paper suggest yet another motivation for assessing interaction. The methods here for attributing effects to interactions may help determine the extent to which an intervention on a potential effect modifier would successfully alter the effect of the exposure of interest. As noted above, one of the motivations often given for studying interaction, specifically on the additive scale, is to identify which subgroups would benefit most from intervention when resources are limited. However in some settings it may not be possible to intervene directly on the primary exposure of interest, and one might then instead be interested in which other covariates could be intervened upon to eliminate much or most of the effect of the primary exposure of interest. The methods here for attributing effects to interactions can be useful in assessing this and identifying the most relevant covariates for intervention.

When used for this purpose it is important that it is the effect modifier itself that affects the outcome and that the effect modifier is not simply serving as a proxy for some other variable that does.26,27 In other words, we need to make sure we have controlled for confounding for the effects of the effect modifier itself. These issues of confounding control are discussed in greater detail in the Appendix. We have assumed throughout, for simplicity, that the effects of both factors are unconfounded, but these assumptions need to be thought about more carefully if these measures are to be used in making policy decisions. However, provided such control for confounding for both factors has been made, the measures considered in this paper can be useful in determining how much of an effect could be eliminated by intervening on an effect modifier.

Supplementary Material

Supplemental Digital Content

Acknowledgements

Tyler J. VanderWeele was supported by National Institutes of Health grant ES017876.

Appendix

Decomposition of a Total Effect into a Conditional Effect and a Portion due to Interaction

We will let G and E denote two exposures of interest which may be binary, continuous or categorical and let Y be an outcome of interest that may be binary or continuous. Let Yg denote the counterfactual outcome for an individual if G were set to g, let Ye denote the counterfactual outcome for an individual if E were set to e, and let Yge denote the counterfactual outcome for an individual if G were set to g and E were set to e. We will say that the effect of G on Y is unconfounded conditional on C if YgG|C. We will say that the effect of E on Y is unconfounded conditional on C if YeE|C. We will say the joint effects of G and E on Y are unconfounded conditional on C if Yge ⫫ (G,E)|C.

Proposition 1. For any two levels e1 and e0 of E and any level g0 of G we have the decomposition:

𝔼[Ye1Ye0|c]=𝔼[Ye1Ye0|g0,c]+{𝔼[Ye1Ye0|g,c]𝔼[Ye1Ye0|g0,c]}dP(g|c).

Proof. We have

𝔼[Ye1Ye0|c]=𝔼[Ye1Ye0|g0,c]+𝔼[Ye1Ye0|c]𝔼[Ye1Ye0|g0,c]=𝔼[Ye1Ye0|g0,c]+{𝔼[Ye1Ye0|g0,c]𝔼[Ye1Ye0|g0,c]}dP(g|c).

In Proposition 1, we can decompose a total effect, 𝔼[Ye1Ye0 |c], into an effect conditional on G = g0, namely, 𝔼[Ye1Ye0 |g0, c], and a component which is a summary measure of effect modification, ∫ {𝔼[Ye1Ye0 |g, c] − 𝔼[Ye1Ye0 |g0, c]}dP(g|c). The proportion attributable to interaction is then defined by pAIG=g0(E)={𝔼[Ye1Ye0|g,c]𝔼[Ye1Ye0|g0,c]}dP(g|c)𝔼[Ye1Ye0|c]. The decomposition here is given at the counterfactual level and, as noted above, it is a decomposition of a total effect into an effect conditional on G and a measure of effect modification. Note that this decomposition and the proportion due to interaction will vary for different values of G = g0 and thus the reference value g0 must be specified. This reference value was taken as G = 0 in the text; it is the value at which the conditional effect, 𝔼[Ye1Ye0 |g0, c], is estimated. The decomposition is given for a particular level of the covariates C = c but we can also marginalize over C to obtain 𝔼[Ye1Ye0] = ∫ 𝔼[Ye1Ye0 |g0, c]dP(c)+ ∫ {𝔼[Ye1Ye0|g, c] − 𝔼[Ye1Ye0|g0, c]}dP(g, c). Note then, however, that the first term in the decomposition, ∫ 𝔼[Ye1Ye0|g0, c]dP(c), is the effect of E on Y conditional on G = g0, and marginalized over the distribution P(C). It will not in general equal 𝔼[Ye1Ye0 |g0] since 𝔼[Ye1Ye0|g0, c] is marginalized over P(C) rather than P(C|g0).

Under assumptions about confounding we can identify each component of the decomposition.

Proposition 2. Suppose that the effect of E on Y is unconfounded conditional on (C,G) then:

𝔼[Ye1Ye0|g,c]=𝔼[Y|g,e1,c]𝔼[Y|g,e0,c]

and we can thus identify the components in Proposition 1 and the right hand-side of the decomposition in Proposition 1 can be written in terms of observed data as: 𝔼[Ye1Ye0 |c]

=𝔼[Y|g0,e1,c]𝔼[Y|g0,e0,c]+{𝔼[Y|g,e1,c]𝔼[Y|g,e0,c]𝔼[Y|g0,e1,c]+𝔼[Y|g0,e0,c]}dP(g|c).

If, moreover, the joint effects of G and E are unconfounded conditional on C then we can write the decomposition as:

𝔼[Ye1Ye0|c]=𝔼[Yg0e1|c]𝔼[Yg0e0|c]+{𝔼[Yge1|c]𝔼[Yge0|c]𝔼[Yg0e1|c]+𝔼[Yg0e0|c]}dP(g|c).

Proof. If the effect of E on Y is unconfounded conditional on (C,G), then we have 𝔼[Ye1Ye0 |g, c] = 𝔼[Y |g, e1, c] − 𝔼[Y |g, e0, c]. If the joint effects of G and E are unconfounded conditional on C then we have 𝔼[Y|g, e, c] = 𝔼[Yge|c] and thus:

𝔼[Ye1Ye0|c]=𝔼[Yg0e1|c]𝔼[Yg0e0|c]+{𝔼[Yge1|c]𝔼[Yge0|c]𝔼[Yg0e1|c]+𝔼[Yg0e0|c]}dP(g|c).

If the effect of E on Y is unconfounded conditional on C alone as would be the case under Proposition 2 if G and E were independent conditional C then we would also have 𝔼[Ye1Ye0|c] = 𝔼[Y|e1, c] − 𝔼[Y|e0, c]. Otherwise, we will not have 𝔼[Ye1Ye0|c] = 𝔼[Y|e1, c] − 𝔼[Y|e0, c], but we could still obtain 𝔼[Ye1Ye0|c] under Proposition 2 using the sum of the two components, 𝔼[Y|g0, e1, c] − 𝔼[Y|g0, e0, c] and ∫ {𝔼[Y|g, e1, c] − 𝔼[Y|g, e0, c] − 𝔼[Y|g0, e1, c]+𝔼[Y|g0, e0, c]}dP(g).

Note that in the second part of Proposition 2, to obtain the decomposition, 𝔼[Ye1Ye0|c] = 𝔼[Yg0e1|c] − 𝔼[Yg0e0|c] + ∫ {𝔼[Yge1|c] − 𝔼[Yge0|c] − 𝔼[Yg0e1|c] + 𝔼[Yg0e0|c]}dP(g|c), we required that joint effects of both G and E on Y were unconfounded given C. Under this assumption, what we estimate as the portion attributable to interaction is equal to the difference, 𝔼[Ye1Ye0|c] − {𝔼[Yg0e1|c] − 𝔼[Yg0e0|c]} i.e. to the portion of the effect of E on Y that could be eliminated if we fixed G to g0. This measure may be of relevance from a policy perspective insofar as we can determine the extent to which intervening to fix G to some level g0 would eliminate the effect of E on the outcome. We might thus decide whether to intervene on G in order to eliminate the effect of E. Importantly, however, to interpret the measure in this manner it is important that control is made for confounding for both exposures, G and E. Viewed intuitively, this ensures that it is the effect modifier itself that affects the outcome and that the effect modifier is not simply serving as a proxy for some other variable that does.26,27 When this is the case the proportion attributable to interaction is equal to the proportion eliminated by fixing G to g0.

If no covariates are necessary for confounding control and we let pge = P(Y = 1|G = g,E = e), pg = P(Y = 1|G = g), and pe = P(Y = 1|E = e) then the first decomposition in Proposition 2 written in terms of the observed data simplifies to:

(pe=1pe=0)=(p01p00)+(p11p10p01+p00)P(G=1).

and the second decomposition written in terms of counterfactuals simplifies to

𝔼[Ye=1Ye=0]=𝔼[Y01Y00|c]+𝔼[Y11Y10Y01+Y00]P(G=1).

For the linear model

𝔼[Y|G=g,E=e,C=c]=α0+α1g+α2e+α3eg+α4c,

we have

𝔼[Y|g,e1,c]𝔼[Y|g,e0,c]=(α0+α1g+α2e1+α3e1g+α4c)(α0+α1g+α2e0+α3e0g+α4c)=(α2+gα3)(e1e0)

and thus the first component in the empirical decomposition in Proposition 2 is equal to:

𝔼[Y|g0,e1,c]𝔼[Y|g0,e0,c]=(α2+g0α3)(e1e0)

and the second is equal to:

{𝔼[Y|g,e1,c]𝔼[Y|g,e0,c]𝔼[Y|g0,e1,c]+𝔼[Y|g0,e0,c]}dP(g|c)=(α2+gα3)(e1e0)(α2+g0α3)(e1e0)dP(g|c)=α3{𝔼[G|c]g0}(e1e0).

The proportion due to interaction is then α3{𝔼[G|c]g0}(α2+α3𝔼[G|c]). When G and E are binary and g0 = 0 and there are no covariates, the two components reduce to α2 and α3P(G = 1) and the proportion due to interaction is α3P(G=1)α2+α3P(G=1), as in the text. Note, however, that when the exposures are not binary the measures themselves (and thus the proportion attributable to interaction) may vary depending on the values, e1 and e0, of E that are being compared, also and also again on the reference value, g0 of G.

On the risk ratio scale, we let RRg=1=pg=1pg=0=P(Y=1|G=1)P(Y=1|G=0) and RRe=1=pe=1pe=0=P(Y=1|E=1)P(Y=1|E=0) and RRge=pgep00=P(Y=1|G=g,E=e)P(Y=1|G=0,E=0). The decomposition (pe=1pe=0) = (p01p00) + (p11p10p01 + p00)P(G = 1) when divided by pe=0 is

(RRe=11)=κ(RR011)+κ(RR11RR10RR01+1)P(G=1).

where κ is a scaling factor given by κ=p00pe=0. The proportion of the effect of E attributable to interaction is given by:

pAIG=0(E)=(RR11RR10RR01+1)P(G=1)(RR011)+(RR11RR10RR01+1)P(G=1).

As noted in the text, if we use the logistic regression model

logit{P(Y=1|G=g,E=e,C=c)}=γ0+γ1g+γ2e+γ3eg+γ4c.

then proportion attributed to interaction if the exposures are binary can be approximated by pAIG=0(E)(eγ1+γ2+γ3eγ1eγ2+1)P(G=1)(eγ21)+(eγ1+γ2+γ3eγ1eγ2+1)P(G=1). In the eAppendix we discuss estimating standard errors for this proportion attributed to interaction.

Analogous Results for G

Note that, by symmetry, from Proposition 1, we have the decomposition

𝔼[Yg1Yg0|c]=𝔼[Yg1Yg0|e0,c]+{𝔼[Yg1Yg0|e,c]𝔼[Yg1Yg0|e0,c]}dP(e|c).

This decomposition applies even if G affects E. If G and E were independent so that G did not affect E, then we would have an analogue of Proposition 2 which would be that if the effect of G on Y is unconfounded conditional on (C, E) then we have 𝔼[Yg1Yg0|e, c] = 𝔼[Y|g1, e, c]− 𝔼[Y|g0, e, c], and under independence also, 𝔼[Yg1Yg0|c] = 𝔼[Y|g1, c] − 𝔼 [Y|g0, c], and we can thus write the decomposition of the total effect of G in terms of observed data as: 𝔼[Y|g1, c] − 𝔼[Y|g0, c]

=𝔼[Y|g1,e0,c]𝔼[Y|g0,e0,c]+{𝔼[Y|g1,e,c]𝔼[Y|g0,e,c]𝔼[Y|g1,e0,c]+𝔼[Y|g0,e0,c]}dP(e|c).

If, moreover, the joint effects of G and E are unconfounded conditional on C then we can write the decompositions as:

𝔼[Yg1Yg0|c]=𝔼[Yg1e0Yg0e0|c]+{𝔼[Yg1eYg0e|c]𝔼[Yg1e0Yg0e0|c]}dP(e|c).

Settings in which G Affects E

If G affects E, then the conditions in Proposition 2 still apply. We can still thus empirically decompose the total effect of E on Y into a conditional effect and the portion due to interaction. If G affects E we no longer have the simple relation [Ye1Ye0|c] = 𝔼[Y|e1, c] − 𝔼[Y|e0, c] because control for G will in general be needed to control for confounding for E. But we can still obtain 𝔼[Ye1Ye0|c], even if G affects E under Proposition 2, using the sum of the two components, 𝔼[Y|g0, e1, c] − 𝔼[Y|g0, e0, c] and ∫{𝔼[Y|g, e1, c] − 𝔼[Y|g, e0, c] − 𝔼[Y|g0, e1, c]+𝔼[Y|g0, e0, c]}dP(g).

However, if G affects E then the analogue of Proposition 2 for G will not apply. We still have the analogous decomposition to that in Proposition 1:

𝔼[Yg1Yg0|c]=𝔼[Yg1Yg0|e0,c]+{𝔼[Yg1Yg0|e,c]𝔼[Yg1Yg0|e0,c]}dP(e|c).

However, the counterfactuals of the form 𝔼[Yg1Yg0|e0, c] will not be identified and so we cannot empirically estimate the various parts of the decomposition. This is because when G affects E, the analogue Proposition 2 for G would require that the effect of G on Y is unconfounded on (C, E) and this fails because G itself affects E.

However, when G affects E we still have the decomposition in the Proposition below.

Proposition 3. If the effect of G on Y is unconfounded conditional on C, and the effects of G and E are unconfounded conditional on C then we have

𝔼[Yg1Yg0|c]=𝔼[Yg1e0Yg0e0|c]+{𝔼[Yg1eYg0e|c]𝔼[Yg1e0Yg0e0|c]}dP(e|g1,c)+{𝔼[Yg0eYg0e0|c]}{dP(e|g1,c)dP(e|g0,c)}.

Moreover, each component of the decomposition above identified and the corresponding decomposition expressed in terms of the observed data is 𝔼[Yg1Yg0|c]

={𝔼[Y|g1,e0,c]𝔼[Y|g0,e0,c]}+{𝔼[Y|g1,e,c]𝔼[Y|g0,e,c]}{𝔼[Y|g1,e0,c]𝔼[Y|g0,e0,c]}dP(e|g1,c)+{𝔼[Y|g0,e,c]𝔼[Y|g0,e0,c]}{dP(e|g1,c)dP(e|g0,c)}.

Proof. We have that 𝔼[Yg1Yg0|c]

=𝔼[Y|g1,c]𝔼[Y|g0,c]=𝔼[Y|g1,e0,c]𝔼[Y|g0,e0,c]+{𝔼[Y|g1,c]𝔼[Y|g1,e0,c]}{E[Y|g0,c]𝔼[Y|g0,e0,c]}=𝔼[Y|g1,e0,c]𝔼[Y|g0,e0,c]+{𝔼[Y|g1,e,c]𝔼[Y|g1,e0,c]}dP(e|g1,c){𝔼[Y|g0,e,c]𝔼[Y|g0,e0,c]}dp(e|g0,c)={𝔼[Y|g1,e0,c]𝔼[Y|g0,e0,c]}+{𝔼[Y|g1,e,c]𝔼[Y|g0,e,c]}{𝔼[Y|g1,e0,c]𝔼[Y|g0,e0,c]}dP(e|g1,c)+{𝔼[Y|g0,e,c]𝔼[Y|g0,e0,c]}{dP(e|g1,c)dP(e|g0,c)}=𝔼[Yg1e0Yg0e0|c]+{𝔼[Yg1eYg0e|c]𝔼[Yg1e0Yg0e0|c]}dP(e|g1,c)+{𝔼[Yg0eYg0e0|c]}{dP(e|g1,c)dP(e|g0,c)}.

In the decomposition above, the first term, 𝔼[Yg1e0Yg0e0|c] = {𝔼[Y|g1, e0, c] − 𝔼[Y|g0, e0, c]} is the controlled direct effect9,10 of G, comparing levels, g1 and g0, when E is fixed to e0. The second term, ∫ {𝔼[Yg1eYg0e|c] − 𝔼[Yg1e0Yg0e0|c]}dP(e|g1, c), is the portion attributable to interaction; it is an interaction, 𝔼[Yg1eYg0e|c] − 𝔼[Yg1e0Yg0e0|c], standardized by the distribution, P(e|g1, c). The third and final term, ∫ {𝔼[Yg0eYg0e0|c]}{dP(e|g1, c) − dP(e|g0, c)}, is the main effect of E when G = g0, standardized by P(e|g1, c) versus P(e|g0, c), which, provided the effect of G on E is unconfounded conditional on C, is essentially the effect of G on E and thus the third term is in some sense a mediated main effect.

When G, E and Y are binary and g0 = 0 is selected as the reference level, and no covariates are required for confounding, the decomposition reduces to: 𝔼[Y1Y0]

=𝔼[Y10Y00]+𝔼[Y11Y01Y10Y00]P(E=1|G=1)+𝔼[Y01Y00]{P(E=1|G=1)P(E=1|G=0)}.

Or, expressed in terms of the observed data, as (pg=1pg=0)

=(p10p00)+(p11p10p01+p00)P(E=1|G=1)+(p01p00){P(E=1|G=1)P(E=1|G=0)}

as in the text. The proportion attributable to interaction is then:

pAIE=0(G)=(p11p10p01+p00)P(E=1|G=1)(pg=1pg=0).

Note that when G has a non-negative effect on E, and E has a non-negative effect on Y (in the absence of G) so that P(E = 1|G = 1) − P(E = 1|G = 0) ≥ 0 and thus P(E = 1) = P(E = 1|G = 1)P(G = 1) + P(E = 1|G = 0)P(G = 0) ≤ P(E = 1|G = 1) and (p01p00){P(E = 1|G = 1) − P(E = 1|G = 0)} ≥ 0 we then have that (p11p10p01 + p00)P(E = 1) ≤ (p11p10p01 + p00)P(E = 1|G = 1) = (pg=1pg=0) − (p10p00) − (p01p00){P(E = 1|G = 1) − P(E = 1|G = 0)} ≤ (pg=1pg=0) − (p10p00) and from this it follows that if the dependence between G and E is incorrectly ignored and

(p11p10p01+p00)P(E=1)(pg=1pg=0).

is used for the proportion attributable to interaction, then although the latter measure does not actually capture the proportion of the effect attribution to interaction, it does nonetheless constitute a lower bound on the proportion of the effect of G that would be eliminated by fixing E to 0, as indicated in the text. Thus even if one proceeds with the more naive estimate of the proportion attributable to interaction, ignoring (incorrectly) the dependence between G and E one still, under fairly reasonable assumptions, obtains a lower bound on the proportion of the effect of G eliminated by fixing E to 0.

The decomposition in Proposition 3 with binary exposures when the effect of G on Y and on E are unconfounded, and the effects of (G, E) on Y are unconfounded can be rewritten as: 𝔼[Y1Y0] = 𝔼[Y10Y00]+𝔼[Y11Y01Y10Y00]P(Eg=1 = 1) + 𝔼[Y01Y00]{P(Eg=1 = 1)−P(Eg=0 = 1)} where Eg is the counterfactual outcome for E fixing G to g. The analogue on the individual counterfactual level is Y1Y0 = (Y10Y00) + (Y11Y01Y10Y00)Eg=1 + (Y01Y00)(Eg=1Eg=0). These are 3-way decompositions of a total effect. These differ somewhat from the decompositions in the mediation analysis literature.69 In the mediation analysis literature, the total effect is decomposed into either two components, what is called a natural indirect effect and a natural direct effect given respectively by 𝔼[Y1E1Y1E0] and 𝔼[Y1E0Y0E0] respectively; or into three components8, a so-called pure indirect effect,9 a pure direct effect (equivalent to the natural direct effect just given), and a mediated interaction, which, when identified can be written as 𝔼[Y0E1Y0E0], 𝔼[Y1E0Y0E0], and 𝔼[Y11Y01Y10Y00], respectively. In contrast, for the decomposition given in Proposition 3, the "direct effect" given in this decomposition is a controlled direct effect, 𝔼[Y10Y00], not a natural direct effect; and the interaction term, 𝔼[Y11Y01Y10Y00]P(Eg=1 = 1), is the proportion of the effect attributable to interaction. Note also that if G does not affect E (i.e. if there is no mediation) then the mediation decomposition into three components8 reduces to a single component (the pure direct effect). However if G does not affect E, then, with the decompositions considered in this paper, there are still two components: the controlled direct effect for G and also the interaction term.

For the ratio scale, the decomposition, (pg=1pg=0) = (p10p00) + (p11p10p01+p00)P(E = 1|G = 1) + (p01p00){P(E = 1|G = 1) − P(E = 1|G = 0)}{P(Eg=1 = 1) − P(Eg=0 = 1)}, when divided by pg=0 is

(RRg=11)=κ(RR101)+κ(RR11RR10RR01+1)P(E=1|G=1)+κ(RR011){P(E=1|G=1)P(E=1|G=0)}.

where κ is a scaling factor given by κ=p00pe=0. The proportion of the effect of G attributable to interaction is:

pAIE=0(G)=(RR11RR10RR01+1)P(E=1|G=1)(RR101)+(RR11RR10RR01+1)P(E=1|G=1)+(RR011){P(E=1|G=1)P(E=1|G=0)}.

Decomposition of Joint Effects

At the counterfactual level, we can decompose the joint effects of the two exposures, G and E, into the effect due to G alone, the effect due to E alone and their interaction. We have:

𝔼[Yg1e1Yg0e0|c]=𝔼[Yg1e0Yg0e0|c]+𝔼[Yg0e1Yg0e0|c]+𝔼[Yg1e1Yg1e0Yg0e1+Yg0e0|c]

If the joint effects of G and E are unconfounded conditional on C each of these components is identified from the observed data and the decomposition can be rewritten as:

𝔼[Y|g1,e1,c]𝔼[Y|g0,e0,c]={𝔼[Y|g1,e0,c]𝔼[Y|g0,e0,c]}+{𝔼[Y|g0,e1,c]𝔼[Y|g0,e0,c]}+{𝔼[Y|g1,e1,c]𝔼[Y|g1,e0,c]𝔼[Y|g0,e1,c]+𝔼[Y|g0,e0,c]}.

We can then also compute the proportion of the joint effect due G alone as 𝔼[Y|g1,e0,c]𝔼[Y|g0,e0,c]𝔼[Y|g1,e1,c]𝔼[Y|g0,e0,c], due to E alone as 𝔼[Y|g0,e1,c]𝔼[Y|g0,e0,c]𝔼[Y|g1,e1,c]𝔼[Y|g0,e0,c], and due to their interaction as 𝔼[Y|g1,e1,c]𝔼[Y|g1,e0,c]𝔼[Y|g0,e1,c]+𝔼[Y|g0,e0,c]𝔼[Y|g1,e1,c]𝔼[Y|g0,e0,c]. Dividing the first decomposition above by 𝔼[Yg0e0 |c], or the second by 𝔼[Y |g0, e0, c], or both the numerator and the denominator of the proportions by 𝔼[Y |g0, e0, c] yields decompositions and proportions on the ratio scale. All of these decompositions are applicable even if G affects E.

On a difference scale, under the linear model

𝔼[Y|G=g,E=e,C=c]=α0+α1g+α2e+α3eg+α4c,

we have that the three components are given by:

{𝔼[Y|g1,e0,c]𝔼[Y|g0,e0,c]}=(α1+α3e0)(g1g0){𝔼[Y|g0,e1,c]𝔼[Y|g0,e0,c]}=(α2+α3g0)(e1e0){𝔼[Y|g1,e1,c]𝔼[Y|g1,e0,c]𝔼[Y|g0,e1,c]+𝔼[Y|g0,e0,c]}=α3(g1e1g1e0g0e1+g0e0).

When G and E are binary, these three components reduce to α1, α2, and α3, respectively. Note, however, that when the exposures are not binary the measures themselves (and thus the proportion attributable to each component) may vary depending on the values, e1 and e0, of E and the values, g1 and g0, of G that are being compared.

On a ratio scale, under the logistic regression model with a rare outcome,

logit{P(Y=1|G=g,E=e,C=c)}=γ0+γ1g+γ2e+γ3eg+γ4c

if G and E then the proportions discussed in the text of the joint effect attributable to G alone, E alone, and to their interaction are given approximately by:

RR101RR111eγ11eγ1+γ2+γ31RR011RR111eγ21eγ1+γ2+γ31RERIRR111(eγ1+γ2+γ3eγ1eγ2+1)eγ1+γ2+γ31,

respectively. See the eAppendix for standard errors.

Footnotes

SDC Supplemental digital content is available through direct URL citations in the HTML and PDF versions of this article (www.epidem.com). This content is not peer-reviewed or copy-edited; it is the sole responsibility of the author.

Contributor Information

Tyler J. VanderWeele, Department of Epidemiology and Biostatistics, Harvard School of Public Health

Eric J. Tchetgen Tchetgen, Department of Epidemiology and Biostatistics, Harvard School of Public Health

References

  • 1.Rothman KJ, Greenland S, Walker AM. Concepts of interaction. Am. J. Epidemiol. 1980;112(4):467–470. doi: 10.1093/oxfordjournals.aje.a113015. [DOI] [PubMed] [Google Scholar]
  • 2.Rothman KJ, Greenland S, Lash TL. Modern Epidemiology. 3rd ed. Philadelphia: Lippincott Williams & Wilkins; 2008. pp. 71–83. [Google Scholar]
  • 3.Rothman KJ. Modern Epidemiology. 1st ed. Boston, MA: Little, Brown and Company; 1986. pp. 311–326. [Google Scholar]
  • 4.Khoury MJ, Flanders WD. Nontraditional epidemiologic approaches in the analysis of gene-environment interaction: case-control studies with no controls! Epidemiology. 1996;144(3):207–213. doi: 10.1093/oxfordjournals.aje.a008915. [DOI] [PubMed] [Google Scholar]
  • 5.VanderWeele TJ, Asomaning K, Tchetgen Tchetgen EJ, Han Y, Spitz MR, Shete S, Wu X, Gaborieau V, Wang Y, McLaughlin J, Hung RJ, Brennan P, Amos CI, Christiani DC, Lin X. Genetic variants on 15q25.1, smoking and lung cancer: an assessment of mediation and interaction. American Journal of Epidemiology. 2012;175:1013–1020. doi: 10.1093/aje/kwr467. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Imai K, Keele L, Tingley D. A general approach to causal mediation analysis. Psychological Methods. 2010;15(4):309–334. doi: 10.1037/a0020761. [DOI] [PubMed] [Google Scholar]
  • 7.Valeri L, VanderWeele TJ. Mediation analysis allowing for exposure-mediator interactions and causal interpretation: theoretical assumptions and implementation with SAS and SPSS macros. Psychological Methods. 18:137–150. doi: 10.1037/a0031034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.VanderWeele TJ. A three-way decomposition of a total effect into direct, indirect, and interactive effects. Epidemiology. 2013;24:224–232. doi: 10.1097/EDE.0b013e318281a64e. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Robins JM, Greenland S. Identifiability and exchangeability for direct and indirect effects. Epidemiology. 1992;3(2):143–155. doi: 10.1097/00001648-199203000-00013. [DOI] [PubMed] [Google Scholar]
  • 10.VanderWeele TJ. Policy-relevant proportions for direct effects. Epidemiology. 2013;24:175–176. doi: 10.1097/EDE.0b013e3182781410. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.VanderWeele TJ. Reconsidering the denominator of the attributable proportion for interaction. European Journal of Epidemiology. doi: 10.1007/s10654-013-9843-6. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Skrondal A. Interaction as departure from additivity in case-control studies: a cautionary note. American Journal of Epidemiology. 2003;158(3):251–258. doi: 10.1093/aje/kwg113. [DOI] [PubMed] [Google Scholar]
  • 13.Miller DP, Liu G, De Vivo I, et al. Combinations of the variant genotypes of GSTP1, GSTM1, and p53 are associated with an increased lung cancer risk. Cancer Res. 2002;62(10):2819–2823. [PubMed] [Google Scholar]
  • 14.Hung RJ, McKay JD, Gaborieau V, et al. A susceptibility locus for lung cancer maps to nicotinic acetylcholine receptor subunit genes on 15q25. Nature. 2008;452(7187):633–637. doi: 10.1038/nature06885. [DOI] [PubMed] [Google Scholar]
  • 15.Amos CI, Wu X, Broderick P, et al. Genome-wide association scan of tag SNPs identi.es a susceptibility locus for lung cancer at 15q25.1. Nat Genet. 2008;40(5):616–622. doi: 10.1038/ng.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Li Y, Sheu CC, Ye Y, et al. Genetic variants and risk of lung cancer in never smokers: a genome-wide association study. Lancet Oncol. 2010;11(4):321–330. doi: 10.1016/S1470-2045(10)70042-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Truong T, Hung RJ, Amos CI, et al. Replication of lung cancer susceptibility loci at chromo- somes 15q25, 5p15, and 6p21: a pooled analysis from the International Lung Cancer Consortium. J Natl Cancer Inst. 2010;102(13):959–971. doi: 10.1093/jnci/djq178. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.VanderWeele TJ, Knol MJ. A tutorial on interaction. Epidemiologic Methods [Google Scholar]
  • 19.VanderWeele TJ, Robins JM. The identification of synergism in the sufficient-component cause framework. Epidemiology. 2007;18:329–339. doi: 10.1097/01.ede.0000260218.66432.88. [DOI] [PubMed] [Google Scholar]
  • 20.VanderWeele TJ. Sufficient cause interactions and statistical interactions. Epidemiology. 2009;20:6–13. doi: 10.1097/EDE.0b013e31818f69e7. [DOI] [PubMed] [Google Scholar]
  • 21.VanderWeele TJ. Epistatic interactions. Statistical Applications in Genetics and Molecular Biology. 9:1–22. doi: 10.2202/1544-6115.1517. Article 1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Kraft P, Yen YC, Stram DO, Morrison J, Gauderman WJ. Exploiting gene-environment interaction to detect disease susceptibility loci. Human Heredity. 2007;63:111–119. doi: 10.1159/000099183. [DOI] [PubMed] [Google Scholar]
  • 23.Chatterjee N, Kalaylioglu Z, Moslehi R, et al. Powerful multilocus tests of genetic association in the presence of gene-gene and gene-environment interactions. Am J Hum Genet. 2006;79(6):1002–1016. doi: 10.1086/509704. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Dai J, Logsdon B, Huang Y, et al. Simultaneous testing for marginal genetic association and gene-environment interaction in genome-wide association studies. Am J Epidemiol. 2012;176(2):164–173. doi: 10.1093/aje/kwr521. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Greenland S. Interactions in epidemiology: relevance, identification, and estimation. Epidemiol. 2009;20(1):14–17. doi: 10.1097/EDE.0b013e318193e7b5. [DOI] [PubMed] [Google Scholar]
  • 26.VanderWeele TJ. On the distinction between interaction and effect modification. Epidemiology. 2009;20:863–871. doi: 10.1097/EDE.0b013e3181ba333c. [DOI] [PubMed] [Google Scholar]
  • 27.VanderWeele TJ, Knol MJ. The interpretation of subgroup analyses in randomized trials: heterogeneity versus secondary interventions. Annals of Internal Medicine. 2011;154:680–683. doi: 10.7326/0003-4819-154-10-201105170-00008. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Digital Content

RESOURCES