Skip to main content
American Journal of Epidemiology logoLink to American Journal of Epidemiology
. 2017 Nov 13;187(6):1311–1318. doi: 10.1093/aje/kwx355

Decomposition of the Total Effect in the Presence of Multiple Mediators and Interactions

Andrea Bellavia 1,2, Linda Valeri 3,4,
PMCID: PMC6692646  PMID: 29140421

Abstract

Mediation analysis allows decomposing a total effect into a direct effect of the exposure on the outcome and an indirect effect operating through a number of possible hypothesized pathways. Recent studies have provided formal definitions of direct and indirect effects when multiple mediators are of interest and have described parametric and semiparametric methods for their estimation. Investigating direct and indirect effects with multiple mediators, however, can be challenging in the presence of multiple exposure-mediator and mediator-mediator interactions. In this paper we derive a decomposition of the total effect that unifies mediation and interaction when multiple mediators are present. We illustrate the properties of the proposed framework in a secondary analysis of a pragmatic trial for the treatment of schizophrenia. The decomposition is employed to investigate the interplay of side effects and psychiatric symptoms in explaining the effect of antipsychotic medication on quality of life in schizophrenia patients. Our result offers a valuable tool to identify the proportions of total effect due to mediation and interaction when more than one mediator is present, providing the finest decomposition of the total effect that unifies multiple mediators and interactions.

Keywords: causal inference, effect decomposition, interaction, mediation


Mediation analysis allows decomposing a given exposure-outcome association (total effect (TE)) into the effect that operates through one or more intermediate variables of interest (indirect effect) and the effect that is due to other independent mechanisms (direct effect) (1). Defining direct and indirect effects in counterfactual terms has been crucial for overcoming major limitations of the classical approaches to mediation, and the field of causal mediation analysis has rapidly expanded over the last decades (2). Most of the current literature has focused on the situation where one single mediator is of interest. It may often be the case, however, that multiple mediators are simultaneously contributing to the exposure-outcome effect. Daniel et al. (3) have presented the counterfactual definition of all direct and indirect effects that can be theoretically defined when multiple mediators are of interest, and the identification of path-specific effects has been discussed (4). Two alternative estimation procedures, one based on weighting and one based on regression, have been also presented (5).

Investigating direct and indirect effects with multiple mediators can be particularly challenging when multiple exposure-mediator and mediator-mediator interactions are also present. A framework that incorporates multiple mediators together with multiple and potentially high-dimensional interactions has not been fully investigated, and a decomposition that separates interaction and mediation effects in this context is not available. While several papers have investigated path-specific effects and multiple mediators (35), none of them have formally included terms of causal interactions between the different components. In the context of one single mediator, VanderWeele (6) showed that the TE can be decomposed into a direct effect, an indirect effect, a proportion due to the exposure-mediator interaction alone, and a proportion due to both interaction and mediation. This finest decomposition of the TE provides the maximum insight to identify and separate the contribution of interaction and mediation when these are simultaneously present in explaining an exposure-outcome effect. We aimed to derive a decomposition of the TE that unifies mediation and interaction when multiple mediators and interactions are present.

We first revise the counterfactual definitions of total, direct, indirect, and interaction effects, in the context of multiple mediators. Next, we introduce a decomposition of the TE that unifies mediation and interaction with 2 mediators. The decomposition is provided for both binary and continuous mediators and exposures. We proceed by discussing the assumptions required for the identification of these effects and presenting the nonparametric empirical analogues for each of the components. We illustrate the properties of the proposed framework for multiple mediators and interactions in a secondary analysis of a pragmatic trial for the treatment of schizophrenia. Finally, we provide an extension of these results to the general situation with more than 2 mediators.

COUNTERFACTUAL DEFINITIONS OF EFFECTS AND INTERACTIONS

Potential outcomes

Let A denote the exposure of interest, Y the outcome, and M1 and M2 2 mediators that may be on the pathway from A to Y and are conditionally independent given a baseline factor C, a potential confounder of the mediators-outcome relationships. Figure 1 depicts the possible causal pathways through which A has an effect on Y.

Figure 1.

Figure 1.

An illustration of mediation analysis with 2 mediators conditionally independent given a baseline factor C.

The generic counterfactuals of interest for the outcome and the mediators can be written as M1(a), M2(a), and Y(aM1(a)M2(a)), representing, respectively, the value M1 would take were A set to a, the value M2 would take were A set to a, and the value Y would take were A set to a, M1 to M1(a), and M2 to M2(a). In the simplified case of binary exposure and mediators, there is a total of 8 composite outcomes [Y(1M1(1)M2(1)), Y(1M1(1)M2(0)), Y(1M1(0)M2(1)), Y(1M1(0)M2(0)), Y(0M1(1)M2(1)), Y(0M1 (1)M2(0)), Y(0M1(0)M2(1)), Y(0M1(0)M2(0))] and 8 potential outcomes [Y(111), Y(110), Y(101), Y(001), Y(100), Y(010), Y(001), Y(000)] that could be defined.

Effects definitions

We review here the definitions of mediation contrasts and interaction terms in the case of 2 binary mediators and 1 binary exposure. Extension of these definitions to the general case of continuous exposure and mediators is straightforward, and it only requires setting specific reference values a*, m1*, m2*. By following the definition and notation introduced in the context of one single mediator (68), we can define the TE of A on Y as:

TE=Y(1)Y(0)=Y(1M1(1)M2(1))Y(0M1(0)M2(0)).

The controlled direct effect (CDE), regardless of the number of mediators involved, retains its interpretation as the effect of A on Y if both the mediators are fixed to a specific value. In the binary case, CDE can be defined across 4 different strata. Throughout the paper we will refer to the strata where both binary mediators are set to the referent value 0:

CDE(0,0)=Y(100)Y(000).

In terms of differences in composite potential outcomes, the pure natural direct effect (PNDE) can be defined as the effect of A on Y if both the mediators are set on the value they would naturally take at the referent value of the exposure (i.e., 0):

PNDE=Y(1M1(0)M2(0))Y(0M1(0)M2(0)).

The pure natural indirect effect (PNIE) is intuitively defined as the effect of the mediator in the absence of exposure. When 2 independent mediators are of interest, it can be further divided into 3 components. The effect of M1 in the absence of both A and M2:

PNIEM1=Y(0M1(1)M2(0))Y(0M1(0)M2(0)).

The effect of M2 in the absence of both A and M1:

PNIEM2=Y(0M1(0)M2(1))Y(0M1(0)M2(0)).

The combined effect of M2 and M1 in the absence of A:

PNIEM1M2=Y(0M1(1)M2(1))Y(0M1(0)M2(0)).

PNDE, PNIEM1, PNIEM2 are defined by changing one of the 3 arguments (A, M1(a), M2(a), respectively) from the reference value, while leaving the other 2 arguments fixed. PNIEM1M2 is defined by changing 2 of the arguments (M1 and M2) and leaving the other (A) fixed. Fixed arguments could be set either to the referent or to the alternative value, if binary, or to any potential value if continuous. Here we are showing, and in the next sections using, the definitions of PNDE and PNIE where the fixed arguments are left to the reference value. We refer to previous publications for the description of all other possible definitions in the context of multiple mediators (3).

Interaction definitions

Evaluating a mediation model with 2 potential mediators increases the number of interactions that need to be assessed. The exposure can interact with the first mediator or with the second mediator, and the 2 mediators can interact within each other, thus requiring the specification of 3 2-way interactions (i.e., A·M1, A·M2, and M1·M2). In addition, all components can interact within each other, so that a measure of 3-way interaction (i.e., A·M1·M2) must be specified. Following the classical notation, measures of 2-way interactions, on the additive scale, can be defined by taking the differences between the combined effect and the sum of the 2 main effects of interest (9). Each of these measures can be further specified in both the presence and the absence of the third component (Table 1).

Table 1.

Illustration of the Possible Definitions of 2-Way Interactions With 1 Binary Exposure and 2 Binary Mediators

Varying Argument Interaction Definition
A·M1 varying, M2 = 1 p(111) − p(101) − p(011) + p(001)
A·M1 varying, M2 = 0 p(110) − p(100) − p(010) + p(000)
A·M2 varying, M1 = 1 p(111) − p(110) − p(011) + p(010)
A·M2 varying, M1 = 0 p(101) − p(100) − p(001) + p(000)
M1·M2 varying, A = 1 p(111) − p(110) − p(101) + p(100)
M1·M2 varying, A = 0 p(011) − p(010) − p(001) + p(000)

Abbreviations: A, exposure of interest; M, potential mediator.

A measure of additive 3-way interaction can be defined in 3 different ways:

  • the change in the product A·M1 when M2 goes from absent to present: (p(111) – p(101) – p(011) + p(001) > p(110) – p(100) – p(010) + p(000));

  • the change in the product A·M2 when M1 goes from absent to present: (p(111) – p(110) – p(011) + p(010) > p(101) – p(100) – p(001) + p(000)); or

  • the change in the product M1·M2 when A goes from absent to present: (p(111) – p(110) – p(101) + p(100) > p(011) – p(010) – p(001) + p(000)).

All these definitions yield the same measure of 3-way interaction on the additive scale:

p(111)p(110)p(101)+p(100)+p(011)p(010)p(001)+p(000).

As for the classical 2-way interaction on the additive scale, depending on whether this measure is smaller than, equal to, or larger than 0, we can define a situation of 3-way subadditivity, additivity, or superadditivity, respectively.

DECOMPOSITION OF THE TOTAL EFFECT

When one single mediator M is evaluated, it has been shown that the TE can be decomposed into 4 components that detect mediation mechanisms through M and interactive effects between M and the exposure (6). Specifically, the TE can be decomposed into a component that is due neither to interaction nor to mediation (corresponding to the CDE), a component due only to mediation (corresponding to the PNIE), a component due only to the additive interaction between the exposure and the mediator (defined reference interaction, or INTref), and a component due to both mediation and interaction, defined as an additive interaction that operates only if the mediator has an effect on the outcome (defined mediated interaction, or INTmed). When moving to the situation of 2 mediators simultaneously contributing to the A-Y association (Figure 1), a 4-way decomposition of the TE is still valid. In such context, however, the PNIE, the INTref, and the INTmed, can be additionally decomposed into 3 components each, capturing effects that operate through specific pathways and interactions. Specifically:

TE=CDE+PNIEM1+PNIEM2+PNIEM1M2+INTrefAM1+INTrefAM2+INTrefAM1M2+INTmedAM1+INTmedAM2+INTmedAM1M2. (1)

The definition of each component is presented in Table 2. A complete proof of the derivation of equation 1 is provided in Web Appendix 1 (available at https://academic.oup.com/aje). In brief, the derivation uses PNDE and PNIEM1M2, as defined earlier in the text in terms of differences of composite potential outcomes, and calculates the operationalized version of all effects in terms of differences of potential outcomes, operating only under specific scenarios.

Table 2.

Illustration of the Decomposition of the Total Effect of a Binary Exposure on an Outcome, in the Presence of 2 Binary Mediators, Exposure-Mediators, and Mediator-Mediator Interactions

Component Definition
CDE (Y(100)) − Y(000))
PNIEM1 (Y(010)) − Y(000)) [M1(1)) − M1(0)]a
PNIEM2 (Y(001)) − Y(000)) [M2(1)) − M2(0)]b
PNIEM1M2 (Y(011)) − Y(010)) − Y(001) + Y(000)) [M1(1)M2(1)) − M1(0)M2(0)]
INTrefAM1 (Y(110)) − Y(100)) − Y(010) + Y(000)) [M1(0)]
INTrefAM2 (Y(101)) − Y(100)) − Y(001) + Y(000)) [M2(0)]
INTrefAM1M2 (Y(111)) − Y(110)) − Y(101)) − Y(011) + Y(001) + Y(010) + Y(100)) − Y(000)) [M1(0)M2(0)]
INTmedAM1 (Y(110)) − Y(100)) − Y(010) + Y(000)) [M1(1)) − M1(0)]
INTmedAM2 (Y(101)) − Y(100)) − Y(001) + Y(000)) [M2(1)) − M2(0)]
INTmedAM1M2 (Y(111)) − Y(110)) − Y(101)) − Y(011) + Y(001) + Y(010) + Y(100)) − Y(000)) [M1(1)M2(1)) − M1(0)M2(0)]

Abbreviations: A, exposure of interest; CDE, controlled direct effect; INTref, defined reference interaction; INTmed, defined mediated interaction; M, possible mediator; PNIE, pure natural indirect effect; Y, outcome.

aM1(1) = I[M1(1) = m1], M1(0) = I[M1(0) = m1].

bM2(1) = I[M2(1) = m2], M2(0) = I[M2(0) = m2].

The first component of the decomposition is the CDE. The component of the effect due only to mediation (PNIE) is divided into: 1) the effect of M1 when both A and M2 are absent (PNIEM1) and A has an effect on M1 so that change in M1 captures an exposure-induced change; 2) the effect of M2 when both A and M1 are absent (PNIEM2) and A has an effect on M2 so that change in M2 captures an exposure-induced change; and 3) an additional component of the indirect effect (taking the form of an additive interaction between M1 and M2) that is active only when both M1 and M2 have an effect in the absence of the exposure. The component due to interaction alone (INTref) is divided into: 1) the 2-way interaction between A and M1 when M2 is absent (INTrefAM1); 2) the 2-way interaction between A and M2 when M1 is absent (INTrefAM2); and 3) the 3-way interaction between A, M1, and M2(INTrefAM1M2). These components, respectively, operate only when the first mediator is present in the absence of exposure (i.e., M1(0) = 1), when the second mediator is present in the absence of exposure (i.e., M2(0) = 1), and when both mediators are present in the absence of exposure (i.e., M1(0) = 1 and M2(0) = 1). The component due to both mediation and interaction (INTmed) is divided into the same 3 components of INTref. However, these, respectively, are active only when M1 affects the outcome when A is absent and A itself affects M1, when M2 affects the outcome when A is absent and A itself affects M2, and when M1 and M2 have a combined effect on the outcome when A is absent and A itself affects both M1 and M2.

Definition of the components in the case of continuous exposure and mediators is also provided in Web Appendix 2 and Web Table 1.

It is straightforward to observe that the decomposition presented here is a natural extension of the 4-way decomposition introduced by VanderWeele (6) in the single mediator setting. With one single mediator, say M1, all components including M2 would be null, and the decomposition would reduce to TE=CDE+PNIEM1+INTrefAM1+INTmedAM1. Additional decompositions presented in the context of a single mediator can also be extended to our setting. For example, we could calculate the proportion of effect attributable to interaction (PAI = INTref + INTmed), and the total indirect effect (TIE = INTmed + PNIE) (10, 11). Both of these measures could be further decomposed if we were interested in the proportion attributable to the interaction between the exposure and one specific mediator (e.g., PAIAM1=INTrefAM1+INTmedAM1). It is also of interest to note that, while previous studies obtained multiple decompositions of the TEs (3), by accounting for the possible interactions between exposure and mediators, we obtained a single unique decomposition.

IDENTIFICATION OF THE EFFECTS

Individual values for each component, as they are defined within the counterfactual framework, cannot be estimated. However, all components of equation (1) can be correctly identified and estimated at the population level under specific assumptions (12).

To identify the CDE, control must be made for a covariate set C that includes all confounders of not only the exposure-outcome relationship but also the mediator-outcome relationships. We formally require that, conditional on C, there be no unmeasured confounding for the exposure-outcome relationship (assumption 1), and no unmeasured confounding for the mediator-outcome relationship conditional on (A, C) (assumption 2). Identification of natural direct and indirect effects requires 2 additional assumptions to hold. First, conditional on C, there must be no unmeasured confounding of the exposure-mediator relationships (assumption 3). Second, there must be no effect of exposure A that itself affects both M and Y (i.e., no mediator-outcome confounder that is itself affected by the exposure) (assumption 4) (13, 14). Assumptions 2, 3, and 4 are required to hold for all mediators included in the analysis. Formally we can write the assumptions as: 1) Y(am1m2)⊥A|C; 2) Y(am1m2)⊥M1|[A,C], Y(am1m2)⊥M2|[A,C]; 3) M1A|C, M2A|C; 4) Y(am1m2)⊥M1(a)*|C, Y(am1m2)⊥ M2(a)*|C.

When these assumptions hold, the average value of each component of the decomposition is given by the empirical expressions presented in Table 3. In Web Appendix 3, we present a simulation study empirically proving the results of our decomposition and the calculation of these empirical analogues. The Web Material also includes the empirical analogues for the nonparametric estimation of the components with continuous exposure and mediators (Web Table 2).

Table 3.

Illustration of the Empirical Analogues for the Components of the Total Effect Decomposition With 1 Binary Exposure and 2 Binary Mediators

Component Empirical Analogues
CDE (p(100) − p(000))a
PNIEM1 (p(010) − p(000)) [P(M1 = 1|A = 1) − P(M1 = 1|A = 0)]
PNIEM2 (p(001) − p(000)) [P(M2 = 1|A = 1) − P(M2 = 1|A = 0)]
PNIEM1M2 (p(011) − p(010) − p(001) + p(000)) [P(M2 = 1, M2 = 1|A = 1) − P(M1 = 1, M2 = 1|A = 0)]
INTrefAM1 (p(110) − p(100) − p(010) + p(000))P(M2 = 1|A = 0)
INTrefAM2 (p(101) − p(100) − p(001) + p(000))P(M2 = 1|A = 0)
INTrefAM1M2 (p(111) − p(110) − p(101) + p(100) + p(011) − p(010) − p(001) + p(000))P(M1 = 1, M2 = 1|A = 0)
INTmedAM1 (p(110) − p(100) − p(010) + p(000)) [P(M1 = 1|A = 1) − P(M1 = 1|A = 0)]
INTmedAM2 (p(101) − p(100) − p(001) + p(000)) [P(M2 = 1|A = 1) − P(M2 = 1|A = 0)]
INTmedAM1M2 (p(111) − p(110) − p(101) + p(100) + p(011) − p(010) − p(001) + p(000))[P(M1 = 1, M2 = 1|A = 1) − P(M1 = 1, M2 = 1|A = 0)]

Abbreviations: A, exposure of interest; CDE, controlled direct effect; INTref, defined reference interaction; INTmed, defined mediated interaction; M, possible mediator; PNIE, pure natural indirect effect; Y, outcome.

apam1m2 = E(Y|A = a; M1 = m1; M2 = m2).

ILLUSTRATION

To illustrate the concepts and methods presented above we used an example from psychiatric epidemiology. Antipsychotic treatments are generally divided into first-generation (typical) and second-generation (atypical) medications. Several observational and clinical studies have attempted to elucidate the efficacy and effectiveness of atypical antipsychotics in comparison both with placebo and with first-generation antipsychotics (15). Moderate improvement in schizophrenia symptoms, such as those assessed by the Positive and Negative Syndrome Scale (PANSS) (16), have been observed (17), but new-generation medications also showed consistently higher rates of side effects such as excessive weight gain (18). Among patients affected by psychotic disorders, schizophrenia patients display the highest deficit in social functioning. The ability of typical and atypical agents to improve social functioning has not been fully explored. To take into account the complex effects of antipsychotics and clarify the relative effect on patients’ social functioning, it is critical to investigate the interplay of treatment, symptoms, and side effects over the course of treatment. Here we focused on a specific atypical medication (olanzapine), which has been claimed to be the most effective antipsychotic medication but is also associated with the most severe metabolic side effects, especially in terms of weight gain (19, 20). We used data from the Clinical Antipsychotic Trials of Intervention Effectiveness (CATIE) (21) to evaluate the effect of olanzapine as compared to a typical agent (perphenazine), on individual quality-of-life scores that capture level of social activity, involvement in social network, and social initiatives. We employed the effect decomposition to assess the mediating and interactive role of percent weight gain, PANSS symptoms, and extrapyramidal symptoms (EPS, the most frequent neurological adverse events induced by first-generation antipsychotics).

For this example, we included data from 497 patients in the CATIE trial assigned to either olanzapine (n = 336) or perphenazine (n = 161). The continuous outcome (quality-of-life score, ranging from 0 to 18 with higher score representing better condition) was assessed 9 months after the beginning of the study. The 3 continuous mediators (weight gain (in pounds), EPS symptoms (ranging from 0 to 1.67), and PANSS total score (ranging from 30 to 116)) were assessed after 6 months from the beginning of the study. Analyses were adjusted for sex, age, race/ethnicity, education, employment, marital status, systolic and diastolic blood pressure, prior treatment, hospitalization, waist-hip ratio, body mass index, PANSS total score, and EPS symptoms measured at baseline. We used the parametric procedure for multiple mediators as presented by VanderWeele and Vansteelandt (5), using linear regression models for both outcome and mediators, and used R (R Foundation for Statistical Computing, Vienna, Austria) to implement our decomposition.

Overall, in the exposure-outcome model not adjusting for the mediators (TE), a nonsignificant lower quality-of-life score at 9 months was observed (TE = −0.45, 95% CI: −1.45, 0.51) between patients treated with olanzapine and those with perphenazine. Results from the 4-way decomposition are presented in Table 4. We found a significant interaction between weight gain and treatment, whereby increase in weight appeared to increase quality of life among patients treated with perphenazine while it reduced quality of life among patients treated with olanzapine. Interactive and mediated effects summed up to the TE previously estimated and are suggestive of a complex interplay among the factors under study. The CDE showed that had the patient experienced no weight gain and no PANSS or EPS symptoms, the atypical treatment’s effect on quality of life score would still not be significantly different from the typical antipsychotic (CDE = −0.63, 95% CI: −1.68, 0.41). However, if we still assumed that the patient did not experience any positive PANSS or EPS symptoms, but that the patient experienced moderate to severe weight gain (>4% increase), olanzapine would yield a significantly lower quality of life (Figure 2). The relationship between weight gain and treatment also appears to be complex. While the indirect effect is positive and significant (PNIEWG = 0.81, 95% CI: 0.20, 1.58), the mediated interaction is significant and negative (INTmedWG = −0.85, 95% CI: −1.67, −0.20). This indicates that mediating and interactive mechanisms operate in opposite directions. Perhaps this result is due to the fact that weight gain is a proxy of treatment efficacy through neurological pathways related to glucose metabolism.

Table 4.

Decomposition of the Treatment Effect on Quality of Life Scores for Social Functioning Into Direct Effect and Mediated and Interactive Effects Due to Weight Gain (M1), Positive and Negative Syndrome Scale Total Symptoms (M2), and Extrapyramidal Symptoms (M3), Using Data From the Clinical Antipsychotic Trials of Intervention Effectiveness, United States, 2000–2004

Component Estimate 95% CI
CDE −0.63 −1.68, 0.41
PNIEWG 0.81 0.20, 1.58
PNIEPANSS+ 0.00 0.16, 0.17
PNIEEPS 0.35 0.05, 1.11
INTrefA·WGb −0.18 −0.90, 0.33
INTmedA·WGb −0.85 −1.67, −0.20
NDE −0.81 −2.10, 0.34
NIE 0.35 −0.26, 1.20
TE −0.45 −1.45, 0.51

Abbreviations: CDE, controlled direct effect; CI: confidence interval; EPS, extrapyramidal symptoms; INTref, defined reference interaction; INTmed, defined mediated interaction; NDE, natural direct effect; NIE, natural indirect effect; PANSS, Positive and Negative Syndrome Scale; PNIE, pure natural indirect effect; TE, total effect.

a Bootstrap 95th percentile CI.

b PNIEWG·PANSS+, PNIEWG·EPS, PNIEPANSS+·EPS, PNIEWG·PANSS+·EPS null because no mediator-mediator interaction. INTrefA·PANSS+, INTrefA·EPS, INTrefA·WG·PANSS+, INTrefA·WG·EPS, INTrefA·PANSS+·EPS, INTrefA·WG·PANSS+·M3, INTmedA·PANSS+, INTmedA·EPS, INTmedA·WG·PANSS+, INTmedA·WG·EPS, INTmedA·PANSS+·EPS, INTmedA·WG·PANSS+·EPS null because no interaction between treatment and positive PANSS and EPS symptoms and no 3- or 4-way interactions.

c NDE = CDE + INTref term.

d NIE = PNIE terms + INTmed term.

e TE = NDE + NIE.

Figure 2.

Figure 2.

An illustration of the controlled direct effect of olanzapine relative to perphenazine on quality-of-life score, fixing Positive and Negative Syndrome Scale total score and extrapyramidal symptoms score at zero, while fixing percent weight gain from −7% to 11%, using data from the Clinical Antipsychotic Trials of Intervention Effectiveness, United States, 2000–2004. Dashed lines represent 95% confidence intervals.

Symptoms do not appear to play a major role, given that no differences between first- and second-generation antipsychotics were observed (PNIEPositive symptoms = 0.00, 95% CI: −0.16, 0.17). EPS symptoms significantly mediate the association between treatment and quality of life, whereby reduction in EPS symptoms in patients treated with olanzapine leads to an improvement in quality of life (PNIEEPS = 0.35, 95% CI: 0.05, 1.11). By combining pure indirect effect and mediated interaction, we obtain a positive indirect effect, which is driven mostly by the pathway through reduction in EPS symptoms.

EXTENSION TO THE GENERAL CASE OF n MEDIATORS

Extending the 4-way decomposition of the TE to the general case of more than 2 mediators is theoretically straightforward but requires defining an increasingly higher number of components. In the Web Material, we provide the illustration of such decomposition in the case of 3 mediators (Web Table 3). A decomposition of the TEs into the 4 components can always be identified. The CDE(0, . . . ,0) always retains its interpretation as the effect of the exposure by setting all n mediators at 0 (or at their reference values if continuous). The 3 other components are further divided into 2n − 1 parts detecting effects operating through all the specific pathways. The scenario previously presented, with n = 2 mediators, may be seen as a special case of this general decomposition (with PNIE, INTref, and INTmed, divided into 22 − 1 = 3 components each). The main caveat in extending the decomposition to the general case of n mediators is that a measure of n-way interaction on the additive scale should be defined. This can be done by fixing n − 1 components and evaluating the change when the nth component moves from present to absent. For instance, with 3 mediators we can evaluate the change in A·M1·M2 when M3 goes from present to absent as p(1111) − p(1101) − p(1011) p(0111) + p(1001) + p(0101) + p(0011) − p(0001) > p(1110) − p(1100) − p(1010) − p(0110) + p(1000) + p(0100) + p(0010) − p(0000). Regardless of which of the n components is let to vary, these comparisons will always yield the same measure of n-way interaction. In the case of 3 mediators, this is equal to p(1111) − p(1101) − p(1011) − p(0111) + p(1001) + p(0101) + p(0011) − p(0001) + p(1110) − p(1100) − p(1010) − p(0110) + p(1000) + p(0100) + p(0010) − p(0000).

The decomposition that we described in the context of 2 mediators assumes that M1 and M2 are conditionally independent given a baseline factor C. As the number of mediators increases, finding a set of confounders that satisfies this assumption may not be feasible, and assuming dependency (i.e., sequentially) between sets of mediators may be required. Evaluating sequential mediators, however, poses critical challenges for the identifiability of the components of the decomposition (with the exception of the CDE, which is always identifiable), because mediator-outcome confounding induced by the exposure is introduced, thus violating assumption 4. Methods for the identification of path-specific effects in the context of sequential mediators, including the use and definition of randomized interventional analogues (22), have been recently described (4). Extending these methods to take high-dimension interactions into account is beyond the scope of this work and is a primary goal of future work.

FINAL REMARKS

In this work, we derived a decomposition of the TE that unifies mediation and interaction when multiple mediators are present. We showed that a 4-way decomposition of the TE into 1) a direct effect, 2) an indirect effect, 3) a component due to interaction, and 4) a component due to mediation and interaction can always be derived. Components 2–4 are further divided into subcomponents that identify indirect effects and interactions that operate through specific pathways. This decomposition provides useful advantages for practical application because it allows the investigator, given a TE, to distinguish the proportion due to mediation effects and the proportion due to interaction components. As shown in our illustration, this decomposition may be a valuable tool for situations in which several interactions, both synergistic and antagonistic, may be hypothesized on the evaluated causal pathway.

This decomposition is a natural extension of the 4-way decomposition presented in the context of a single mediator, which can be viewed as a particular case of that presented here. The classical assumptions for the identification of direct and indirect effects are also required for identification of the components of the 4-way decomposition, and they should be assessed for every newly included mediator. We showed that currently available parametric regression approaches can easily be extended to estimate the decomposition (see code in the Web Material). As the number of mediators increases, the definition, identification, and estimation of the components become challenging. Future studies focusing on high-dimension mediation and interaction are required and would represent a major contribution to the field. Also, the high number of mediators may hamper the efficiency of currently available estimation methods for multiple mediators. Alternative procedures, especially semiparametric methods, should be developed and integrated into statistical software. Additional research is also needed to evaluate the robustness of the components to residual confounding, extending current sensitivity-analysis techniques (23, 24) to the specific components of the decomposition.

In conclusion, we have presented a decomposition that identifies the proportions of TE due to mediation and interaction when more than 1 mediator is present. The decomposition can be used to identify the extent to which the TE would be changed by intervening on 1 or more mediators, and to exploit the mechanism through which the TE is generated, thus providing a valuable tool to understand the interplay of multiple factors in explaining a given exposure-outcome effect.

Supplementary Material

Web Material

ACKNOWLEDGMENTS

Author affiliations: Department of Environmental Health, Harvard T.H. Chan School of Public Health, Boston, Massachusetts (Andrea Bellavia); Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts (Andrea Bellavia); Department of Psychiatry, Harvard Medical School, Boston, Massachusetts (Linda Valeri); and Psychiatric Biostatistics Laboratory, McLean Hospital, Belmont, Massachusetts (Linda Valeri).

The authors were supported by a Harvard Catalyst OPTICS Pilot Grant. This work was conducted with support from Harvard Catalyst | The Harvard Clinical and Translational Science Center (National Center for Advancing Translational Sciences, National Institutes of Health, award UL1 TR001102) and financial contributions from Harvard University and its affiliated academic health-care centers.

Results from this study were presented at the Eastern North American Region International Biometric Society Spring Meeting, in March 12–15, 2017, Washington, DC.

The content is solely the responsibility of the authors and does not necessarily represent the official views of Harvard Catalyst, Harvard University, and its affiliated academic healthcare centers, or the National Institutes of Health. Conflict of interest: none declared.

Abbreviations

CDE

controlled direct effect

EPS

extrapyramidal symptoms

PANSS

Positive and Negative Syndrome Scale

PNDE

pure natural direct effect

PNIE

pure natural indirect effect

TE

total effect

REFERENCES

  • 1. Baron RM, Kenny DA. The moderator-mediator variable distinction in social psychological research: conceptual, strategic, and statistical considerations. J Pers Soc Psychol. 1986;51(6):1173–1182. [DOI] [PubMed] [Google Scholar]
  • 2. VanderWeele TJ. Explanation in Causal Inference: Methods for Mediation and Interaction. New York, NY: Oxford University Press; 2015:729. [Google Scholar]
  • 3. Daniel RM, De Stavola BL, Cousens SN, et al. . Causal mediation analysis with multiple mediators. Biometrics. 2015;71(1):1–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Vansteelandt S, Daniel RM. Interventional effects for mediation analysis with multiple mediators. Epidemiology. 2017;28(2):258–265. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. VanderWeele TJ, Vansteelandt S. Mediation analysis with multiple mediators. Epidemiol Methods. 2014;2(1):95–115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. VanderWeele TJ. A unification of mediation and interaction: a 4-way decomposition. Epidemiology. 2014;25(5):749–761. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Robins JM, Greenland S. Identifiability and exchangeability for direct and indirect effects. Epidemiology. 1992;3(2):143–155. [DOI] [PubMed] [Google Scholar]
  • 8. Valeri L, VanderWeele TJ. Mediation analysis allowing for exposure-mediator interactions and causal interpretation: theoretical assumptions and implementation with SAS and SPSS macros. Psychol Methods. 2013;18(2):137–150. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. VanderWeele TJ, Knol MJ. A tutorial on interaction. Epidemiol Methods. 2014;3(1):33–72. [Google Scholar]
  • 10. VanderWeele TJ, Tchetgen Tchetgen EJ. Attributing effects to interactions. Epidemiology. 2014;25(5):711–722. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. VanderWeele TJ. A three-way decomposition of a total effect into direct, indirect, and interactive effects. Epidemiology. 2013;24(2):224–232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Pearl J. Causality: models, reasoning and inference. Econom Theory. 2003;19:675–685. [Google Scholar]
  • 13. VanderWeele TJ, Vansteelandt S. Conceptual issues concerning mediation, interventions and composition. Stat Interface. 2009;2(4):457–468. [Google Scholar]
  • 14. Pearl J. Direct and indirect effects. In Proceedings of the seventeenth conference on uncertainty in artificial intelligence. San Francisco, CA: Morgan Kaufmann Publishers Inc; 2001:411–420.
  • 15. Leucht S, Cipriani A, Spineli L, et al. . Comparative efficacy and tolerability of 15 antipsychotic drugs in schizophrenia: a multiple-treatments meta-analysis. Lancet. 2013;382(9896):951–962. [DOI] [PubMed] [Google Scholar]
  • 16. Kay SR, Fiszbein A, Opfer LA. The Positive and Negative Syndrome Scale (PANSS) for schizophrenia. Schizophr Bull. 1987;13(2):261–276. [DOI] [PubMed] [Google Scholar]
  • 17. Lieberman JA, Stroup TS, McEvoy JP, et al. . Effectiveness of antipsychotic drugs in patients with chronic schizophrenia. N Engl J Med. 2005;353(12):1209–1223. [DOI] [PubMed] [Google Scholar]
  • 18. Meyer JM, Davis VG, Goff DC, et al. . Change in metabolic syndrome parameters with antipsychotic treatment in the CATIE schizophrenia trial: prospective data from phase 1. Schizophr Res. 2008;101(1–3):273–286. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Davis JM, Leucht S, Glick ID. CATIE findings revisited. Psychiatr Serv. 2009;60(1):125–126. [DOI] [PubMed] [Google Scholar]
  • 20. Allison DB, Mentore JL, Heo M, et al. . Antipsychotic-induced weight gain: a comprehensive research synthesis. Am J Psychiatry. 1999;156(11):1686–1696. [DOI] [PubMed] [Google Scholar]
  • 21. Stroup TS, McEvoy JP, Swartz MS, et al. . The National Institute of Mental Health Clinical Antipsychotic Trials of Intervention Effectiveness (CATIE) project: schizophrenia trial design and protocol development. Schizophr Bull. 2003;29(1):15–31. [DOI] [PubMed] [Google Scholar]
  • 22. VanderWeele TJ, Tchetgen Tchetgen EJ. Mediation analysis with time varying exposures and mediators. J R Stat Soc Series B Stat Methodol. 2017;79(3):917–938. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. VanderWeele TJ. Bias formulas for sensitivity analysis for direct and indirect effects. Epidemiology. 2010;21(4):540–551. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. VanderWeele TJ, Mukherjee B, Chen J. Sensitivity analysis for interactions under unmeasured confounding. Stat Med. 2012;31(22):2552–2564. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Web Material

Articles from American Journal of Epidemiology are provided here courtesy of Oxford University Press

RESOURCES