Mediation Analysis With Intermediate Confounding: Structural Equation Modeling Viewed Through the Causal Inference Lens

Bianca L De Stavola; Rhian M Daniel; George B Ploubidis; Nadia Micali

doi:10.1093/aje/kwu239

2014 Dec 11;181(1):64–80.

PMCID: PMC4383385 PMID: 25504026

Abstract

The study of mediation has a long tradition in the social sciences and a relatively more recent one in epidemiology. The first school is linked to path analysis and structural equation models (SEMs), while the second is related mostly to methods developed within the potential outcomes approach to causal inference. By giving model-free definitions of direct and indirect effects and clear assumptions for their identification, the latter school has formalized notions intuitively developed in the former and has greatly increased the flexibility of the models involved. However, through its predominant focus on nonparametric identification, the causal inference approach to effect decomposition via natural effects is limited to settings that exclude intermediate confounders. Such confounders are naturally dealt with (albeit with the caveats of informality and modeling inflexibility) in the SEM framework. Therefore, it seems pertinent to revisit SEMs with intermediate confounders, armed with the formal definitions and (parametric) identification assumptions from causal inference. Here we investigate: 1) how identification assumptions affect the specification of SEMs, 2) whether the more restrictive SEM assumptions can be relaxed, and 3) whether existing sensitivity analyses can be extended to this setting. Data from the Avon Longitudinal Study of Parents and Children (1990–2005) are used for illustration.

Keywords: eating disorders, estimation by combination, G-computation, parametric identification, path analysis, sensitivity analysis

The epidemiologic literature on causal inference is alight with contributions dedicated to the study of mediation. (A PubMed search for articles on mediation analysis in epidemiology produced 118 “hits” for articles published in 2012 and 110 “hits” for articles published in 2013.) The topic owes its origins, however, to an older body of literature that is well known in the social sciences. This school is often referred to as the “Baron and Kenny approach” (1, 2) but is linked to Sewall Wright's path analysis (3) and its extension, structural equation models (SEMs) (4). It includes several important publications that are less well known in the epidemiologic literature (5–10).

Contributions from the causal inference school have formalized and generalized notions intuitively developed in the SEM school, first by defining (using potential outcomes) precisely what is meant by direct and indirect effects, then by giving clear assumptions under which they can be identified, and lastly by generalizing the statistical methods available for carrying out such analyses to allow for nonlinearities, interactions, discrete outcomes, and semiparametric estimation (11–26).

With a few notable exceptions (11, 27–29), the literature on natural direct and indirect effects focuses predominantly on nonparametric identification, which leads to the strong assumption of “no intermediate confounders”—that is, that no confounders (measured or unmeasured) of the mediator and outcome may be affected by the exposure. By relying on parametric models, however, such confounders are naturally dealt with in the SEM framework. Therefore, it is pertinent and timely to revisit SEMs with intermediate confounders, armed with the formal definitions and (parametric) identification assumptions from causal inference to reconcile the 2 approaches in this particular context.

In this article, we review how paths are traced in order to derive direct and indirect effects in simple linear SEMs which include intermediate confounders but exclude nonlinearities, and show their equivalence to the definitions based on potential outcomes. We then investigate how different parametric assumptions for identification of the natural effects in the presence of intermediate confounders affect the specification of an extended SEM that includes nonlinearities. We further investigate whether the usual SEM assumption of “no omitted influences” of any pair of variables in the system can be relaxed when estimation of the natural effects is the goal. Finally, we widen existing sensitivity analyses to the setting with intermediate confounding, exploiting the SEM framework.

THE 2 FRAMEWORKS

Settings and aims

We will discuss settings involving an exposure X, an outcome Y, a mediator M, background confounders C of 1 or more of the relationships X-Y, M-Y, and X-M, and intermediate confounders L of the M-Y relationship (Figure 1). The aim is to separate the causal effect of X acting along pathways that include M from the causal effect of X acting along other pathways that do not involve M (the indirect and direct effects, respectively).

Figure 1. — Causal diagram for exposure X, mediator M, outcome Y, background confounder C, and intermediate confounder L.

For simplicity, we let X be a binary variable and assume that observations are not affected by missingness or measurement error.

The causal inference framework

The causal inference framework (11, 12) invokes potential outcomes (30). For mediation analysis, these are: M(x), the potential value of M if X had been set, possibly counter to fact, to the value x; Y(x, m), the potential value of Y if X had been set to x and M to m; and Y(x, M(x′)), the composite potential value of Y if X had been set to x and M to M(x′).

Several definitions of direct and indirect effects have been proposed, with the choice depending on the causal question being addressed. We focus here on those most widely used and define them as linear contrasts, although definitions on other scales have been given (31–33).

Definitions

The controlled direct effect (CDE) of X on Y when M is controlled at m, CDE(m), and the pure natural direct effect (PNDE) of X on Y (11, 12) are

\begin{aligned} CDE(m) & = E\{Y(1, m)\} - E\{Y(0, m)\}, \\ PNDE & = E\{Y(1, M(0))\} - E\{Y(0, M(0))\}. \end{aligned}

CDE(m) is a comparison of 2 hypothetical worlds where, in the first, X is set to 1 and, in the second, X is set to 0, while in both worlds M is set to m. The PNDE is also a comparison of 2 hypothetical worlds where X is set to 0 or 1 but M is set to take its natural value M(0). Because in each of these comparisons M is set at the same value in both worlds (at least within the individual), they are measures of effects of X unmediated by M, that is, “direct.”

The complement of the PNDE is the total natural indirect effect (TNIE) of X on Y (11, 34):

TNIE = TCE - PNDE = E\{Y(1, M(1))\} - E\{Y(1, M(0))\},

where TCE = E{Y(1)} − E{Y(0)} represents the total causal effect. The TNIE is a comparison of 2 hypothetical worlds in which X is set to 1 in both, while M changes from its natural value when X is 1 to its natural value when X is 0. Intuitively, this is an indirect effect, since it captures the part of the effect of X on Y that is transmitted by M. There is no equivalent complement of CDE(m) (35).
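The definitions above can be made concrete with a toy calculation. The sketch below is illustrative only: the four-unit population, the structural functions `M` and `Y`, and every coefficient are hypothetical choices, not quantities from the article. It averages unit-level potential outcomes to obtain CDE(m), the PNDE, the TNIE, and the TCE.

```python
from statistics import mean

# Hypothetical toy population: each unit has its own mediator noise u_m and
# outcome noise u_y. All numbers are made up for illustration.
UNITS = [(-1.0, 0.5), (0.0, -0.5), (1.0, 0.0), (2.0, 1.0)]

def M(x, u_m):
    """Potential mediator M(x) under an assumed structural equation."""
    return 1.0 + 0.5 * x + u_m

def Y(x, m, u_y):
    """Potential outcome Y(x, m); the 0.2 * x * m term is an X-M interaction."""
    return 2.0 + 1.0 * x + 0.8 * m + 0.2 * x * m + u_y

def expect(contrast):
    """Average a unit-level contrast over the toy population."""
    return mean(contrast(u_m, u_y) for u_m, u_y in UNITS)

def cde(m):
    """Controlled direct effect with the mediator fixed at m for everyone."""
    return expect(lambda u_m, u_y: Y(1, m, u_y) - Y(0, m, u_y))

# Natural effects use each unit's own potential mediator value.
pnde = expect(lambda u_m, u_y: Y(1, M(0, u_m), u_y) - Y(0, M(0, u_m), u_y))
tnie = expect(lambda u_m, u_y: Y(1, M(1, u_m), u_y) - Y(1, M(0, u_m), u_y))
tce = expect(lambda u_m, u_y: Y(1, M(1, u_m), u_y) - Y(0, M(0, u_m), u_y))
```

In this toy setting the PNDE and TNIE add up to the TCE by construction, whereas `cde(m)` changes with the chosen `m` because of the assumed interaction, which is one way to see why CDE(m) has no natural complement.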

Assumptions

In the absence of intermediate confounders

Identification of these estimands is possible if certain assumptions hold. Those most commonly invoked are specific versions of no interference, consistency, and conditional exchangeability.

Briefly, in the setting with no intermediate confounders and for CDE(m), the assumption of no interference states that an individual's outcome is not influenced by the exposure status of another person (36–39) and also that the mediator value for one individual has no effect on the outcome in another. The assumption of consistency states that Y(x, m) equals Y among subjects with observed exposure level X = x and mediator level M = m (40–43). The assumption of conditional exchangeability states that once individuals are stratified according to confounders C, their allocation to X is essentially “random” within these strata, and once they are stratified according to X and C, their allocation to M is essentially random within those strata. More formally, conditional exchangeability states that $Y(x, m) \perp\!\!\!\perp X \mid C$ and $Y(x, m) \perp\!\!\!\perp M \mid C, X$, implying no X-Y confounding conditionally on C and no M-Y confounding conditionally on C and X (30, 44). Under these assumptions, CDE(m) is nonparametrically identified by regression standardization. For discrete C (45, 46),

\begin{aligned} CDE(m) = \sum_{c} \{ & E(Y | X = 1, M = m, C = c) \\ & - E(Y | X = 0, M = m, C = c) \} \Pr(C = c). \end{aligned}

(1)

If C is continuous, the sum is replaced by an integral and Pr(C = c) by the corresponding density.

In order to identify the PNDE, the assumption of no interference is expanded also to mean that the exposure of one individual has no effect on the mediator of another; the assumption of consistency is expanded also to mean that M(x) = M when X = x and that Y(x, M(x)) = Y when X = x (denoted generalized consistency or composition (46)); and the assumption of conditional exchangeability is expanded to mean that there is also no X-M confounding conditional on C (formally, $M(x) \perp\!\!\!\perp X \mid C$).

Under these extended assumptions, and when M and C are discrete, the PNDE is nonparametrically identified (12, 45, 46) by

\begin{aligned} \sum_{c} \sum_{m} \{ & E(Y | X = 1, M = m, C = c) - E(Y | X = 0, M = m, C = c) \} \\ & \times \Pr(M = m | X = 0, C = c) \Pr(C = c). \end{aligned}

(2)

The same assumptions are invoked to nonparametrically identify the TNIE, leading to (46)

\begin{aligned} TNIE = \sum_{c} \sum_{m} & E(Y | X = 1, M = m, C = c) \\ & \times \{ \Pr(M = m | X = 1, C = c) - \Pr(M = m | X = 0, C = c) \} \Pr(C = c). \end{aligned}

(3)

For continuous C/M, summations are replaced by integrals and probabilities by density functions (see part A of the Web Appendix, available at http://aje.oxfordjournals.org/). Equations 2 and 3 are known as the mediation formula (45).
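To illustrate how equations 1 to 3 are evaluated for a binary mediator and a binary confounder, here is a minimal sketch. The probabilities in `pr_c` and `pr_m1` and the outcome regression in `ey` are hypothetical stand-ins for fitted quantities; nothing here is taken from the article's data.

```python
# Hypothetical inputs: all numbers below are made up for illustration.
pr_c = {0: 0.6, 1: 0.4}                      # Pr(C = c)
pr_m1 = {                                    # Pr(M = 1 | X = x, C = c), keyed by (x, c)
    (0, 0): 0.2, (1, 0): 0.5,
    (0, 1): 0.4, (1, 1): 0.7,
}

def ey(x, m, c):
    """E(Y | X = x, M = m, C = c) under an assumed outcome regression."""
    return 1.0 + 0.5 * x + 2.0 * m + 0.3 * c + 0.4 * x * m

def pr_m(m, x, c):
    """Pr(M = m | X = x, C = c) for a binary mediator."""
    p = pr_m1[(x, c)]
    return p if m == 1 else 1.0 - p

def cde(m):
    """Equation 1: standardize the X contrast at fixed m over C."""
    return sum((ey(1, m, c) - ey(0, m, c)) * pr_c[c] for c in pr_c)

def pnde():
    """Equation 2: weight the X contrast by the mediator distribution under X = 0."""
    return sum((ey(1, m, c) - ey(0, m, c)) * pr_m(m, 0, c) * pr_c[c]
               for c in pr_c for m in (0, 1))

def tnie():
    """Equation 3: shift the mediator distribution from X = 0 to X = 1, holding X = 1 in Y."""
    return sum(ey(1, m, c) * (pr_m(m, 1, c) - pr_m(m, 0, c)) * pr_c[c]
               for c in pr_c for m in (0, 1))

def tce():
    """Total effect by standardization, used only to check PNDE + TNIE."""
    def mean_y(x):
        return sum(ey(x, m, c) * pr_m(m, x, c) * pr_c[c]
                   for c in pr_c for m in (0, 1))
    return mean_y(1) - mean_y(0)
```

With these made-up inputs, `pnde() + tnie()` reproduces `tce()`, which is a convenient sanity check when coding the mediation formula by hand.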

In the presence of intermediate confounders

Identifying CDE(m) in the presence of intermediate confounders L can be achieved by adapting the assumption of no unaccounted M-Y confounding to include conditioning on L ($Y(x, m) \perp\!\!\!\perp M \mid C, X, L$) and updating identification formula 1 (equation 1) to include the contribution via L. This is commonly referred to as the G-computation formula (46, 47) (Web Appendix, part B).
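As a sketch only (the authoritative expression is in the Web Appendix, which is not reproduced here), the updated version of equation 1 for discrete L and C can be written as the discrete analogue of the integral expression for CDE(m) given in the section on equivalence in estimands:

\begin{aligned} CDE(m) = \sum_{c} \Big\{ & \sum_{l} E(Y | X = 1, M = m, L = l, C = c) \Pr(L = l | X = 1, C = c) \\ & - \sum_{l} E(Y | X = 0, M = m, L = l, C = c) \Pr(L = l | X = 0, C = c) \Big\} \Pr(C = c). \end{aligned}

Unlike C, the intermediate confounder L is averaged over its distribution under the exposure level being set, because L is itself affected by X.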

In contrast, identification of the natural effects, PNDE and TNIE, additionally involves some parametric restrictions on the relationships among X, M, L, and Y. Originally the restriction was stated by Robins and Greenland (11) as no X-M interaction at an individual level. Alternatively, Petersen et al. (27) suggested assuming that, conditional on C, the CDE does not vary with M(0). Under either of these additional parametric assumptions, PNDE and TNIE are identified by formulae that are extensions of equations 2 and 3. (Identification can also be obtained under certain “no-3-way-interaction” assumptions when the exposure is randomly assigned (48) or under no average L-M interaction in a nonparametric SEM with mutually independent errors (29).)

Estimation

Several approaches have been proposed for the estimation of these estimands, with standard errors typically obtained by sandwich estimation or bootstrapping (for a review, see Vansteelandt (46)). Among them, an extension of Robins' (47) G-computation that incorporates the mediation formula posits regression models for each of the (conditional) expectations/probabilities/densities in the identifying equations, estimates their parameters (e.g., using maximum likelihood), and then plugs these estimates into the sums/integrals above (47, 49). When the G-computation formula is too cumbersome to be evaluated analytically, the integration can be approximated through Monte Carlo simulation (47, 50) (see Appendix 2). The advantage of this approach is efficiency when all models are correctly specified, as well as flexibility. Essentially any combination of types (binary/categorical/continuous) of outcomes, mediators, and intermediate confounders can be modeled with little restriction on the assumed models, although the resulting complexities are a drawback (26).
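The Monte Carlo variant of G-computation can be sketched as follows for a linear specification with one intermediate confounder. The coefficient dictionaries `G`, `A`, and `B`, the unit error variances, and the Bernoulli(0.5) distribution for C are all hypothetical stand-ins for fitted models; this is not the gformula implementation. In line with the identification formulae for the natural effects, L is drawn once under the exposure level that feeds the outcome model and once, separately, under the level that feeds the mediator model.

```python
import random

# Hypothetical parameter values standing in for fitted estimates of linear
# regressions for L, M and Y; none of them come from the article.
G = {"0": 0.0, "x": 0.5, "c": 0.3}                       # L model
A = {"0": 0.0, "x": 0.4, "l": 0.6, "c": 0.2}             # M model
B = {"0": 0.0, "x": 0.3, "m": 0.5, "l": 0.2, "c": 0.1}   # Y model

def draw_l(x, c, rng):
    return G["0"] + G["x"] * x + G["c"] * c + rng.gauss(0.0, 1.0)

def draw_m(x, l, c, rng):
    return A["0"] + A["x"] * x + A["l"] * l + A["c"] * c + rng.gauss(0.0, 1.0)

def mean_y(x, m, l, c):
    return B["0"] + B["x"] * x + B["m"] * m + B["l"] * l + B["c"] * c

def simulate(x, x_star, n, rng):
    """Monte Carlo approximation of E{Y(x, M(x_star))}."""
    total = 0.0
    for _ in range(n):
        c = 1 if rng.random() < 0.5 else 0   # assumed distribution of C
        l = draw_l(x, c, rng)                # L under exposure x (enters Y)
        l_star = draw_l(x_star, c, rng)      # separate draw of L under x_star (enters M)
        m = draw_m(x_star, l_star, c, rng)
        total += mean_y(x, m, l, c)
    return total / n

rng = random.Random(2014)
N = 50_000
q11, q10, q00 = (simulate(a, b, N, rng) for a, b in [(1, 1), (1, 0), (0, 0)])
pnde_mc, tnie_mc = q10 - q00, q11 - q10

# Closed-form values for this linear model, for comparison with the simulation.
pnde_exact = B["x"] + B["l"] * G["x"]
tnie_exact = B["m"] * (A["x"] + A["l"] * G["x"])
```

In an applied analysis C would typically be resampled from its empirical distribution rather than from an assumed one, and the gap between the simulated and closed-form values gives a rough feel for the Monte Carlo error at a given `N`.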

To lessen the reliance on parametric modeling assumptions, many alternative semiparametric estimation approaches have been suggested, in particular G-estimation of structural nested models (21), inverse probability weighting of marginal structural models (20), doubly and multiply robust methods that combine 1 or more of these approaches (24, 25), and multiply robust methods based on targeted maximum likelihood (51).

The SEM framework

Unlike the above, the definitions of direct and indirect effects given in the SEM literature depend on the specification of a particular statistical model (49). In the setting of Figure 2 (with single C and L), the following model for continuous Y, M, and L could be specified:

\left\{ \begin{aligned} L & = γ_{0} + γ_{x} X + γ_{c} C + ϵ_{l} \\ M & = α_{0} + α_{x} X + α_{l} L + α_{c} C + ϵ_{m} \\ Y & = β_{0} + β_{x} X + β_{m} M + β_{l} L + β_{c} C + ϵ_{y}, \end{aligned} \right.

(4)

where X and C are exogenous variables (no equations are specified for them), Y, M, and L are endogenous variables, and $ϵ_{l}$ , $ϵ_{m}$ , and $ϵ_{y}$ are mean-zero error terms, uncorrelated with each other and with the exogenous variables. This is a linear path model for the joint distribution of Y, M, and L (4, 52).

Figure 2. — Structural equation model for exposure X, mediator M, outcome Y, background confounder C, and intermediate confounder L (error terms omitted for simplicity).

Sequentially replacing the expression for L into the equation for M and that for M into the equation for Y, we obtain the reduced form of model 4 (equation 4):

\begin{aligned} Y = {} & (β_{0} + α_{0} β_{m} + α_{l} β_{m} γ_{0} + β_{l} γ_{0}) + (β_{x} + α_{x} β_{m} + α_{l} β_{m} γ_{x} + β_{l} γ_{x}) X \\ & + (β_{c} + α_{c} β_{m} + β_{l} γ_{c} + α_{l} β_{m} γ_{c}) C + (β_{m} ϵ_{m} + α_{l} β_{m} ϵ_{l} + β_{l} ϵ_{l} + ϵ_{y}). \end{aligned}

Here (β_x + α_xβ_m + α_lβ_mγ_x + β_lγ_x) is taken to represent the total causal effect of X on Y. It can be partitioned into the direct (not mediated by M) and indirect (mediated) effects of X by tracing the paths in Figure 2 that make up the total effect (52). The indirect effect is found by multiplying the parameters along each of the (directed) paths from X to Y that include M and summing them; here, this is (α_xβ_m + γ_xα_lβ_m). The direct effect is the sum of the remainder, (β_x + γ_xβ_l). This is a more general version of the product of coefficients method (2, 13, 53).
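The path-tracing rule just described can be sketched in a few lines. The edge coefficients in `EDGES` are hypothetical values for the paths of Figure 2 (background confounder omitted), and the helper names `directed_paths` and `decompose` are illustrative, not part of any SEM package.

```python
from math import prod

# Hypothetical path coefficients for the diagram in Figure 2 (C omitted).
EDGES = {
    ("X", "L"): 0.5,   # gamma_x
    ("X", "M"): 0.4,   # alpha_x
    ("L", "M"): 0.6,   # alpha_l
    ("X", "Y"): 0.3,   # beta_x
    ("M", "Y"): 0.5,   # beta_m
    ("L", "Y"): 0.2,   # beta_l
}

def directed_paths(edges, source, target):
    """Yield every directed path (list of nodes) from source to target in a DAG."""
    if source == target:
        yield [target]
        return
    for parent, child in edges:
        if parent == source:
            for rest in directed_paths(edges, child, target):
                yield [source] + rest

def decompose(edges, exposure="X", outcome="Y", mediator="M"):
    """Sum products of coefficients along paths that do / do not pass through the mediator."""
    direct = indirect = 0.0
    for path in directed_paths(edges, exposure, outcome):
        effect = prod(edges[(a, b)] for a, b in zip(path, path[1:]))
        if mediator in path:
            indirect += effect
        else:
            direct += effect
    return direct, indirect

direct, indirect = decompose(EDGES)
```

With these made-up coefficients the indirect effect collects the paths X→M→Y and X→L→M→Y, while the direct effect collects X→Y and X→L→Y, matching the two sums given in the text.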

Tracing the paths is possible only when the models for the endogenous variables are linear and do not include any interactions or other nonlinearities, although generalizations to settings with binary outcomes (via logit or probit regression) have been suggested, with standardization of the estimated parameters used to deal with their differences in scale across models (54). Other approaches within the SEM framework (i.e., without relying on counterfactuals) have also been proposed for general link functions and for models with interactions and other nonlinearities (9, 10, 49, 55), but these are only approximate and do not explicitly deal with settings with intermediate confounding.

Assumptions and estimation

Depending on the author, the identifying assumptions given in the SEM literature vary in detail, but essentially they are (5, 7, 8, 52, 56):

1. Correct temporal order between X, L, M, and Y.
2. “No omitted influences” (8), or “no lack of self-containment” (7), or “no other hidden relevant causes” (52).
3. Correct functional forms of each equation in the model.
4. Accurate measurements of all of the observed variables.
5. Error terms that are uncorrelated with each other and with the exogenous variables.

The first 2 assumptions are structural, that is, causal, meaning that the regression equations fully reflect the underlying data-generating process and that they justify the apportioning of the mediation effects described above (7, 8, 52). For settings with intermediate confounders, “no omitted influences” is a stronger assumption than the conditional exchangeability assumption invoked in the causal inference literature, since it also involves no L-Y confounding.

The last 3 assumptions are statistical. The first refers to the linearity and additivity of the relationships among the variables, the second to the reliability of the available data, and the third to the behavior of the error terms. Requiring the error terms to be uncorrelated with each other and with the exogenous variables guarantees unbiased estimation of the model's parameters via least squares. These estimated parameters can then be combined to obtain estimates of the direct and indirect effects, with measures of their precision obtained via the delta method (6) or bootstrapping (57). Importantly, departures from the statistical assumptions have repercussions for the structural ones. Correlated error terms—or correlated error terms and exogenous variables—would indicate departures from the structural assumption of no omitted relevant variables (52). Departures from the assumption of accurate measurements of the observed variables would lead to biased estimates of the model parameters and consequently of the mediation parameters (58).

Interestingly, the SEM literature does not mention the assumptions of no interference and consistency invoked by the causal inference literature, even though both are required for the estimated parameters to be interpreted as causal (59).

INSIGHTS

The causal inference estimands are defined in generality, although identification is achieved only parametrically when intermediate confounding is present. The SEM estimands are derived from specific parametric structural models that naturally include intermediate confounders. The 2 approaches are therefore very different, but they converge under certain scenarios. We believe that understanding their overlap when intermediate confounding is present can offer useful analytical insights.

Equivalence in estimands

The SEM approach to mediation applied to model 4 identifies the mediated effect of X on Y via M as (α_xβ_m + γ_xα_lβ_m) and the nonmediated one as (β_x + γ_xβ_l).

Under the same structural and parametric assumptions, the causal inference estimands can be written in closed form (see Web Appendix, part B):

\begin{aligned} PNDE = {} & \int_{c} \Big\{ \int_{l'} \int_{m} \int_{l} \big\{ E(Y | X = 1, M = m, L = l, C = c) f_{L}(l | X = 1, C = c) \\ & \quad - E(Y | X = 0, M = m, L = l, C = c) f_{L}(l | X = 0, C = c) \big\} \, dl \\ & \quad \times f_{M}(m | L = l', X = 0, C = c) f_{L}(l' | X = 0, C = c) \, dm \, dl' \Big\} f_{C}(c) \, dc \\ = {} & \int_{c} \Big\{ \int_{l'} \int_{m} (β_{x} + β_{l} γ_{x}) f_{M}(m | L = l', X = 0, C = c) f_{L}(l' | X = 0, C = c) \, dm \, dl' \Big\} f_{C}(c) \, dc \\ = {} & β_{x} + β_{l} γ_{x}. \end{aligned}

\begin{aligned} CDE(m) = {} & \int_{c} \Big\{ \int_{l} E(Y | X = 1, M = m, C = c, L = l) f_{L}(l | X = 1, C = c) \, dl \\ & \quad - \int_{l} E(Y | X = 0, M = m, C = c, L = l) f_{L}(l | X = 0, C = c) \, dl \Big\} f_{C}(c) \, dc \\ = {} & \int_{c} (β_{x} + β_{l} γ_{x}) f_{C}(c) \, dc \\ = {} & β_{x} + β_{l} γ_{x}. \end{aligned}

\begin{aligned} TNIE = {} & \int_{c} \Big\{ \int_{l'} \int_{m} \int_{l} E(Y | X = 1, M = m, L = l, C = c) f_{L}(l | X = 1, C = c) \\ & \quad \times \big\{ f_{M}(m | X = 1, L = l', C = c) f_{L}(l' | X = 1, C = c) \\ & \quad - f_{M}(m | X = 0, L = l', C = c) f_{L}(l' | X = 0, C = c) \big\} \, dl \, dm \, dl' \Big\} f_{C}(c) \, dc \\ = {} & \int_{c} \{ β_{m} (α_{x} + α_{l} γ_{x}) \} f_{C}(c) \, dc \\ = {} & β_{m} (α_{x} + α_{l} γ_{x}). \end{aligned}

Hence the estimands from the 2 approaches coincide when the same parametric assumptions are made; likewise in the simple setting without intermediate confounders (10, 13, 45, 49). Although these equivalences apply only to linear SEMs that have no interactions or other nonlinear terms involving X, M, and L, closed-form solutions for the causal estimands above are not restricted to these simple models. Appendix 1 shows the closed-form solutions obtained for a more general linear SEM:

\left\{ \begin{aligned} L & = γ_{0} + γ_{x} X + γ_{c} C + ϵ_{l} \\ M & = α_{0} + α_{x} X + α_{l} L + α_{c} C + α_{x l} X L + ϵ_{m} \\ Y & = β_{0} + β_{x} X + β_{l} L + β_{l l} L^{2} + β_{m} M + β_{m m} M^{2} \\ & \quad + β_{c} C + β_{x l} X L + β_{x m} X M + ϵ_{y}, \end{aligned} \right.

(5)

where the residual terms are uncorrelated with each other and the explanatory variables in their equations and have constant variances $σ_{l}^{2}$ , $σ_{m}^{2}$ , and $σ_{y}^{2}$ , respectively.

Parametric G-computation of the causal estimands above can then be achieved by combining the relevant estimated parameters of the assumed SEM, leading to what we refer to as estimation by combination (see Appendix 2 for its implementation in Mplus (Muthén and Muthén, Los Angeles, California); this implementation is more general than those in the papers by Valeri and VanderWeele (15) and Emsley et al. (60), which deal only with settings without L). Comparing the results obtained from analytical (i.e., by-combination) and Monte Carlo G-computation allows evaluation of the extent of the Monte Carlo error, as illustrated in the example.

Understanding the assumptions required for parametric identification

Identifiability of the natural direct and indirect effects in the presence of intermediate confounding involves some parametric restrictions on the relationships among X, M, L, and Y. Specifically, Robins and Greenland (11) proposed the assumption of no individual X-M interaction—formally, that $Y (1, m) - Y (0, m)$ is the same for all m. For settings in which parametric models for Y, M, and L are specified via linear regression, this can be formally examined.

For example, consider model 5 (equation 5). Assuming it is correctly specified, we see that

\begin{aligned} Y(1, m) - Y(0, m) = {} & β_{x} + β_{l} (L(1) - L(0)) + β_{l l} (L(1)^{2} - L(0)^{2}) + β_{x l} L(1) + β_{x m} m \\ = {} & β_{x} + β_{l} γ_{x} + β_{l l} \{ γ_{x}^{2} + 2 γ_{x} (γ_{0} + γ_{c} C + ϵ_{l}) \} \\ & + β_{x l} (γ_{0} + γ_{x} + γ_{c} C + ϵ_{l}) + β_{x m} m, \end{aligned}

and thus the Robins and Greenland assumption holds if and only if $β_{x m} = 0.$ Note that, had our model for Y included a term in LM, the Robins and Greenland assumption would also have constrained its coefficient (β_lm) to be zero (in line with the constraint proposed by Tchetgen Tchetgen and VanderWeele (29)).

Petersen et al. (27) propose the alternative identifying assumption that, within levels of C, the CDE does not vary with M(0). Formally,

\begin{aligned} & E\{Y(1, m) - Y(0, m) | M(0) = m, C = c\} \\ & \quad = E\{Y(1, m) - Y(0, m) | C = c\}. \end{aligned}

Again, assuming that model 5 is correct, we see that

\begin{aligned} M(0) & = α_{0} + α_{l} L(0) + α_{c} C + ϵ_{m} \\ & = α_{0} + α_{l} (γ_{0} + γ_{c} C + ϵ_{l}) + α_{c} C + ϵ_{m}. \end{aligned}

Conditional on C, therefore, we see that both Y(1, m) − Y(0, m) and M(0) are functions of $ϵ_{l}$ , except when $β_{l l} = β_{x l} = 0.$ Note that, given our model, assuming that γ_x = 0 (in place of β_ll) or that α_l = 0 would be equivalent to assuming no intermediate confounding, which is why we do not consider them.

Thus, given this particular model, we have 2 options in the presence of intermediate confounders: Either we identify the PNDE and TNIE under the assumption that β_xm = 0 or we identify them under the assumption that β_ll = β_xl = 0. Hence, examining the significance of these parameters in an associational model for Y that contains all of these terms should aid in the selection of identification assumptions.
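The two parametric conditions can be checked numerically on the individual-level contrast Y(1, m) − Y(0, m) implied by model 5. In the sketch below the coefficient values in `BASE` are hypothetical, and the two helper checks are simple sufficient-condition probes (does the contrast move with m, or with the L error term?) rather than formal tests of either assumption.

```python
# Hypothetical coefficient values for a model-5-type specification (illustration only).
BASE = {"g0": 0.1, "gx": 0.5, "gc": 0.3,
        "bx": 0.3, "bl": 0.2, "bll": 0.05, "bxl": 0.1,
        "bm": 0.5, "bmm": 0.04, "bxm": 0.0}

def unit_contrast(m, c, eps_l, p):
    """Y(1, m) - Y(0, m) for one individual; terms shared by both worlds cancel."""
    l0 = p["g0"] + p["gc"] * c + eps_l       # L(0)
    l1 = l0 + p["gx"]                        # L(1)
    def y(x, l):
        return (p["bx"] * x + p["bl"] * l + p["bll"] * l ** 2 + p["bm"] * m
                + p["bmm"] * m ** 2 + p["bxl"] * x * l + p["bxm"] * x * m)
    return y(1, l1) - y(0, l0)

def varies_with_m(p, c=1, eps_l=0.7):
    """True if the contrast changes when m changes (no-X-M-interaction condition fails)."""
    return abs(unit_contrast(2.0, c, eps_l, p) - unit_contrast(0.0, c, eps_l, p)) > 1e-9

def varies_with_eps_l(p, m=1.0, c=1):
    """True if the contrast depends on the L error term, which also drives M(0)."""
    return abs(unit_contrast(m, c, 1.5, p) - unit_contrast(m, c, -1.5, p)) > 1e-9

rg_holds = not varies_with_m(BASE)                        # beta_xm = 0
rg_fails = varies_with_m({**BASE, "bxm": 0.1})            # beta_xm != 0
petersen_fails = varies_with_eps_l(BASE)                  # beta_ll, beta_xl nonzero
petersen_holds = not varies_with_eps_l(
    {**BASE, "bll": 0.0, "bxl": 0.0, "bxm": 0.1})         # beta_ll = beta_xl = 0
```

The last line also shows that, in this specification, setting β_ll = β_xl = 0 removes the dependence on the L error term even when β_xm is nonzero, which is why the two routes to identification are genuine alternatives.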

Equivalence in assumptions

As we stated above, there is an interesting difference with regard to the identifying assumptions invoked by the 2 approaches when the model involves intermediate confounders. Under the SEM, all of the error terms are assumed to be uncorrelated with each other, a scenario which would not be satisfied were the L-Y relationship affected by unmeasured confounding, given C and X (represented by U in Figure 3). This is not a restriction invoked by the causal inference framework (as it concerns only confounding of X-Y, X-M, and M-Y).

Figure 3. — Causal diagram for exposure X, mediator M, outcome Y, intermediate confounder L, and unmeasured intermediate L-Y confounder U. The circle around U indicates that it is unmeasured.

However, when the focus is identification of mediation effects within the SEM framework, the assumption of no L-Y confounding is actually not required once the parametric assumptions discussed above are made (for a justification based on the theory described by Wermuth and Cox (61), see part C of the Web Appendix and—for a simpler setting—Moerkerke et al. (62); also see Pearl (63)). Thus, there is no contradiction in fitting a SEM without assuming no L-Y confounding.

Sensitivity analyses

It is possible to perform simple sensitivity analyses of the assumption of no unmeasured M-Y confounding by fitting SEMs that allow for $ϵ_{y}$ and $ϵ_{m}$ to be correlated (10, 49, 64). We extend the sensitivity analysis of Imai et al. (49) to a setting with intermediate confounders—for example,

\left\{ \begin{aligned} L & = γ_{0} + γ_{x} X + ϵ_{l} \\ M & = α_{0} + α_{x} X + α_{l} L + ϵ_{m} \\ Y & = β_{0} + β_{x} X + β_{m} M + β_{l} L + ϵ_{y}, \end{aligned} \right.

(6)

where, for simplicity, there are no confounders or interaction terms and the residuals are uncorrelated with the explanatory variables in their equations and have constant variance ($Var(ϵ_{l}) = Var(ϵ_{l} | X) = σ_{l}^{2}$, $Var(ϵ_{m}) = Var(ϵ_{m} | X, L) = σ_{m}^{2}$, and $Var(ϵ_{y}) = Var(ϵ_{y} | X, L, M) = σ_{y}^{2}$) but $ϵ_{m}$ and $ϵ_{y}$ are correlated with $Corr(ϵ_{m}, ϵ_{y}) = Corr(ϵ_{m}, ϵ_{y} | X, L, M) = ρ$. This would occur in the presence of uncontrolled M-Y confounding.

Now consider the alternative specification:

\left\{ \begin{aligned} L & = γ_{0} + γ_{x} X + ϵ_{l} \\ M & = α_{0} + α_{x} X + α_{l} L + ϵ_{m} \\ Y & = β'_{0} + β'_{x} X + β'_{l} L + ϵ'_{y}, \end{aligned} \right.

(7)

where the model for Y does not include M and $Var (ϵ_{y}^{'}) = Var (ϵ_{y}^{'} | X, L) = σ_{y}^{' 2}$ , and $Corr (ϵ_{m}, ϵ_{y}^{'}) = Corr (ϵ_{m}, ϵ_{y}^{'} | X, L) = ρ^{'}$ . The parameters of model 6 (equation 6) are not identified because β_m and ρ are collinear, whereas the parameters of model 7 (equation 7) are.

Similarly to Imai et al. (49), we focus on ρ′ and interpret it as a measure of the strength of any unmeasured M-Y confounding that would imply an indirect effect of zero. Estimating ρ′ is straightforward: Model 7 is fitted and the residuals are calculated, with their sample correlation being ${\hat{ρ}}^{'}$. A confidence interval for ρ′ is then obtained by bootstrapping (Stata code (StataCorp LP, College Station, Texas) given in Appendix 3).
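The estimation steps for ρ′ can be sketched as follows. This is not the Stata code of Appendix 3: the data are simulated from hypothetical coefficients, ordinary least squares is coded by hand to keep the example self-contained, and a simple percentile bootstrap is assumed for the interval.

```python
import random

def solve(a, b):
    """Solve the linear system a z = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [v - factor * w for v, w in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]

def ols_residuals(y, *predictors):
    """Residuals from a least-squares fit of y on an intercept plus the predictors."""
    rows = [(1.0,) + vals for vals in zip(*predictors)]
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(k)]
    coef = solve(xtx, xty)
    return [yi - sum(cf * v for cf, v in zip(coef, r)) for r, yi in zip(rows, y)]

def pearson(a, b):
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    sab = sum((u - ma) * (v - mb) for u, v in zip(a, b))
    saa = sum((u - ma) ** 2 for u in a)
    sbb = sum((v - mb) ** 2 for v in b)
    return sab / (saa * sbb) ** 0.5

def rho_prime(data):
    """Correlation between the M-residuals and Y-residuals of model 7 (Y regressed without M)."""
    x, l, m, y = (list(col) for col in zip(*data))
    return pearson(ols_residuals(m, x, l), ols_residuals(y, x, l))

def bootstrap_ci(data, reps=200, seed=1):
    """Percentile bootstrap interval for rho-prime (resampling individuals)."""
    rng = random.Random(seed)
    stats = sorted(rho_prime(rng.choices(data, k=len(data))) for _ in range(reps))
    return stats[int(0.025 * reps)], stats[int(0.975 * reps) - 1]

# Simulated data with hypothetical coefficients and independent unit-variance errors.
rng = random.Random(7)
data = []
for _ in range(800):
    x = 1.0 if rng.random() < 0.3 else 0.0
    l = 0.5 * x + rng.gauss(0.0, 1.0)
    m = 0.4 * x + 0.6 * l + rng.gauss(0.0, 1.0)
    y = 0.3 * x + 0.5 * m + 0.2 * l + rng.gauss(0.0, 1.0)
    data.append((x, l, m, y))

rho_hat = rho_prime(data)
ci_low, ci_high = bootstrap_ci(data)
```

In this simulation there is no unmeasured M-Y confounding, so the estimated ρ′ simply reflects the true effect of M on Y; a larger value indicates that stronger unmeasured confounding would be needed to explain away the indirect effect.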

RESULTS

To illustrate the advantages of fitting SEMs when studying mediation, we analyze data on eating-disorder behaviors in adolescent girls. An adolescent eating-disorder study was carried out as part of the Avon Longitudinal Study of Parents and Children (ALSPAC), a birth cohort study of babies born between 1990 and 1992 in the South West of the United Kingdom (65). It involved data on eating-disorder behaviors collected by parental questionnaire on nearly 3,000 girls when they were around age 13.5 years. This information was used to identify 3 (standardized) latent scores for disordered eating patterns via factor analysis (66). For illustration, we use one of these latent dimensions, “bingeing or overeating,” as the outcome of interest and study whether the influence of high maternal prepregnancy body mass index (BMI; weight (kg)/height (m)²; coded ≥25 for high and <25 for low) is mediated by the daughter's BMI in childhood (prospectively calculated from measurements taken at about age 7 years). It is of interest to separate the effects that maternal BMI may have through and not through potentially modifiable childhood factors.

The assumed causal diagram is shown in Figure 4, with maternal prepregnancy mental illness and education as background confounders (C₁ and C₂) and birth weight as an intermediate confounder (L). The appropriate extension (i.e., incorporating the mediation formula) of the G-computation formula by Monte Carlo simulation was performed via the gformula command in Stata 13 (50) (details given in Appendix 2, part A); estimation by combination was performed after fitting models by maximum likelihood in Mplus 7.11 (67) and combining the relevant estimated parameters as appropriate (details given in Appendix 2, part B). Standard errors were obtained via the bootstrap and delta methods, respectively.

Analyses are restricted to the 2,749 girls with complete data on all variables. Table 1 characterizes the data and shows marginal and partial correlations. “Bingeing or overeating” is both marginally and conditionally correlated with all other variables except maternal education, while maternal BMI (but not childhood BMI) is correlated with birth weight.

Table 1.

Mean Values/Percentages and Marginal (Above Main Diagonal) and Partial (Below Main Diagonal) Correlations for Variables Used in an Analysis of Eating-Disorder Behaviors Among Adolescent Girls (n = 2,749), Avon Longitudinal Study of Parents and Children, United Kingdom, 1990–2005^a

Variable	Symbol	Mean (SD)	%	Correlation With Bingeing or Overeating	Correlation With Childhood BMI^b	Correlation With Birth Weight	Correlation With High Maternal BMI	Correlation With Low Maternal Education	Correlation With Poor Maternal Mental Health
Bingeing or overeating	Y	0.00 (1.00)		1	0.33^c	0.05^c	0.06^c	−0.01	0.11^c
Childhood BMI^d	M	−0.02 (0.99)		0.34^c	1	−0.02	0.26^c	0.10^c	−0.02
Birth weight^e	L	0.10 (0.92)		0.05^c	0.01	1	0.12^c	−0.04	−0.04
High maternal BMI^f,g	X		19	0.17^c	0.31^c	0.13^c	1	0.17^c	−0.03
Low maternal education^g,h	C₁		55	0.04	0.13^c	−0.03^c	0.20^c	1	0.04
Poor maternal mental health^g	C₂		13	0.11^c	0.01	−0.03	−0.01	0.04	1

Abbreviations: BMI, body mass index; SD, standard deviation.

^a Information on maternal education, prepregnancy BMI, and history of mental illness was obtained from postal questionnaires administered during pregnancy. Birth weight was measured at the time of birth. Childhood BMI was prospectively calculated from measurements taken at about age 7 years.

^b Weight (kg)/height (m)².

^c P < 0.05.

^d Childhood BMI was age-standardized (leading to a standardized score). Because of missing values on other variables, its mean and SD were not exactly 0 and 1.

^e Birth weight was internally standardized using the complete sample (leading to a standardized score). Because of missing values on other variables, its mean and SD were not exactly 0 and 1.

^f Maternal prepregnancy BMI was dichotomized (<25, low; ≥25, high).

^g Polychoric (or tetrachoric) correlations are reported when calculations involved this variable.

^h Maternal education was dichotomized: “no high school” versus “at least high school.”

Table 2 shows the estimated coefficients for the conditional expectation of Y expressed without any of the parametric constraints needed for identification in the presence of intermediate confounders. In particular, we allowed interactions between X and M, X and L, and nonlinearities in L and M. It appears that there is little evidence to reject β_xm = 0 (P = 0.76), while the evidence for β_xl and β_ll being nonzero is greater (P = 0.08 and P = 0.01, respectively), suggesting that the Robins and Greenland assumption may be more plausible in this example. We nevertheless report the estimates of the mediation effects obtained under both assumptions in Table 3 (see also Web Table 1). The results suggest a strong mediated effect of high maternal BMI on “bingeing or overeating” via childhood BMI, with a smaller direct effect capturing all other pathways. It appears therefore that more than 60% of the total effect of maternal overweight is transmitted via the daughter's own size in childhood and not via other pathways, including birth weight, implicating a contribution of childhood environmental factors. Table 3 also highlights the closeness of the results obtained using Monte Carlo G-computation and G-computation via estimation by combination; however, this required the size of the Monte Carlo sample to be increased to 100,000.
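The “more than 60%” figure can be reproduced from the point estimates reported in Table 3 by dividing the TNIE by the TCE. The short sketch below does this arithmetic; the dictionary layout and variable names are illustrative, and only the numbers come from the table.

```python
# Point estimates copied from Table 3 (standard errors omitted).
TABLE3 = {
    ("Model 1", "Monte Carlo"): {"TCE": 0.287, "PNDE": 0.102, "TNIE": 0.185},
    ("Model 1", "Combination"): {"TCE": 0.287, "PNDE": 0.103, "TNIE": 0.184},
    ("Model 2", "Monte Carlo"): {"TCE": 0.297, "PNDE": 0.102, "TNIE": 0.195},
    ("Model 2", "Combination"): {"TCE": 0.297, "PNDE": 0.103, "TNIE": 0.194},
}

# Share of the total effect transmitted through childhood BMI.
proportion_mediated = {
    key: est["TNIE"] / est["TCE"] for key, est in TABLE3.items()
}

# The natural effects should add up to the total effect, up to rounding.
additivity_gap = {
    key: est["TCE"] - est["PNDE"] - est["TNIE"] for key, est in TABLE3.items()
}

for key, share in proportion_mediated.items():
    print(key, f"{100 * share:.1f}% mediated")
```

Each of the four proportions lies between 60% and 70%, and the PNDE and TNIE sum to the TCE to three decimal places in every row.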

Table 2.

Estimated Coefficients From a Regression Model for “Bingeing or Overeating” Among Adolescent Girls (n = 2,749), Avon Longitudinal Study of Parents and Children, United Kingdom, 1990–2005

Variable	Symbol	Parameter	Estimate (SE)	P Value
High maternal BMI^a	X	β_x	0.068 (0.050)	0.18
Childhood BMI score	M	β_m	0.312 (0.021)	<0.001
Childhood BMI score squared	M²	β_mm	0.043 (0.012)	<0.001
Birth weight score	L	β_l	0.034 (0.022)	0.13
Birth weight score squared	L²	β_ll	0.032 (0.012)	0.01
High maternal BMI × birth weight	XL	β_xl	0.078 (0.045)	0.08
High maternal BMI × child BMI	XM	β_xm	0.014 (0.045)	0.76
Low maternal education	C₁	β_c₁	−0.011 (0.036)	0.76
Poor maternal mental health	C₂	β_c₂	0.207 (0.054)	<0.001


Abbreviations: BMI, body mass index; SE, standard error.

^a Weight (kg)/height (m)².

Table 3.

Estimation of the Total Effect of High Maternal BMI on “Bingeing or Overeating” Among Adolescent Girls (n = 2,749) and of the Effects Mediated and Not Mediated by Childhood BMI (Estimation by Monte Carlo Simulation vs. Estimation by Combination), Avon Longitudinal Study of Parents and Children, United Kingdom, 1990–2005

Model and Estimand	Monte Carlo G-Computation^a, Estimate (SE)	Estimation by Combination^b, Estimate (SE)
Model 1^c
TCE	0.287 (0.052)	0.287 (0.049)
PNDE	0.102 (0.050)	0.103 (0.047)
TNIE	0.185 (0.021)	0.184 (0.019)
CDE(0)	0.104 (0.050)	0.103 (0.047)
Model 2^d
TCE	0.297 (0.052)	0.297 (0.049)
PNDE	0.102 (0.051)	0.103 (0.051)
TNIE	0.195 (0.031)	0.194 (0.028)
CDE(0)	0.105 (0.049)	0.105 (0.049)


Abbreviations: CDE, controlled direct effect; PNDE, pure natural direct effect; SE, standard error; TCE, total causal effect; TNIE, total natural indirect effect.

^a Estimation by G-computation via Monte Carlo simulation was carried out using the gformula command (50) in Stata 13, with an enlarged Monte Carlo sample of 100,000 to increase agreement with closed-form results (see Appendix 2, part A); SEs were estimated via bootstrap.

^b Estimation by combination was carried out by combining the maximum likelihood estimates of the relevant structural equation model parameters obtained in Mplus, version 7.11 (see Appendix 2, part B); SEs were estimated via the delta method.

^c Model 1 follows the Robins and Greenland assumption (11) that there is no interaction between X and M at the individual level in their effects on Y. The model was specified as follows. The equation for “bingeing or overeating” (Y) included childhood BMI (M; linear and quadratic terms), high maternal BMI (X; binary), birth weight (L; linear and quadratic terms), the interaction between high maternal BMI and birth weight, maternal education (C₁; binary), and prepregnancy mental health (C₂; binary). The equation for childhood BMI included high maternal BMI (binary), birth weight (linear term), the interaction between high maternal BMI and birth weight, maternal education (binary), and prepregnancy mental health (binary). The equation for birth weight included high maternal BMI (binary), maternal education (binary), and prepregnancy mental health (binary).

^d Model 2 follows the Petersen et al. assumption (27) that (conditional on C) the CDE does not vary with M(0). The model was specified as follows. The equation for “bingeing or overeating” included childhood BMI (linear and quadratic terms), high maternal BMI (binary), birth weight (linear term), the interaction between high maternal BMI and childhood BMI, maternal education (binary), and prepregnancy mental health (binary). The equation for childhood BMI included high maternal BMI (binary), birth weight (linear term), the interaction between high maternal BMI and birth weight, maternal education (binary), and prepregnancy mental health (binary). The equation for birth weight included high maternal BMI (binary), maternal education (binary), and prepregnancy mental health (binary).
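The share of the total effect carried by the mediated path can be checked directly from the point estimates in Table 3. The short sketch below does that arithmetic; the dictionary layout and the function name are illustrative choices and not part of the original analysis.

```python
# Point estimates (TCE, TNIE) copied from Table 3; keys are (model, method).
table3 = {
    ("Model 1", "Monte Carlo"): (0.287, 0.185),
    ("Model 1", "Combination"): (0.287, 0.184),
    ("Model 2", "Monte Carlo"): (0.297, 0.195),
    ("Model 2", "Combination"): (0.297, 0.194),
}


def proportion_mediated(tce, tnie):
    """Share of the total causal effect carried by the indirect path."""
    return tnie / tce


shares = {key: proportion_mediated(*vals) for key, vals in table3.items()}

for (model, method), share in shares.items():
    print(f"{model}, {method}: {share:.1%}")
```

Each of the four ratios exceeds 0.60, which is the basis for the "more than 60%" statement in the text. The ratios ignore sampling uncertainty, so they are descriptive summaries only.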

Sensitivity analyses show that a noncausal residual correlation between childhood BMI and “bingeing or overeating” would have to be very large, at least equal to 0.324 (95% confidence interval: 0.287, 0.361), to remove the path mediated by childhood BMI.

DISCUSSION

We have reviewed 2 alternative approaches to the study of mediation in settings with intermediate confounding. The one emerging from the SEM framework has a long tradition in the social sciences and uses definitions of direct and indirect effects that are intuitive but are embedded within simple linear models. In contrast, the approach proposed within the causal inference literature is general, as it compares expected potential outcomes without reference to any particular model.

We have extended work done by others (10, 13, 45, 49, 64) in deriving closed-form solutions to the identification equations for the causal inference estimands for general linear SEMs that include intermediate confounders. This has helped in three ways: clarifying the parametric assumptions needed for identification (and the consequent advantages of examining certain regression parameters), justifying the relaxation of the assumption of no L-Y unmeasured confounders made by the causal inference school, and extending sensitivity analyses of unmeasured M-Y confounding. These results are novel and should help analysts investigating mediation in the presence of intermediate confounding. Although these results are restricted to settings that can be modeled with systems of linear equations, the insights gained here should also apply more generally, given the approximate closed-form expressions recently derived for binary outcomes and mediators (31, 68) and the recent nonparametric identifying constraints involving L-M interactions (29, 64).

Supplementary Material

Web Material


ACKNOWLEDGMENTS

Author affiliations: Centre for Statistical Methodology, London School of Hygiene and Tropical Medicine, University of London, London, United Kingdom (Bianca L. De Stavola, Rhian M. Daniel, George B. Ploubidis); and Institute of Child Health, Faculty of Population Health Sciences, University College London, London, United Kingdom (Nadia Micali).

This work was partly funded by the Economic and Social Research Council (grants ES/I025561/1, ES/I025561/2, and ES/I025561/3), the Medical Research Council (postdoctoral fellowships G1002283 and 74882), the Wellcome Trust (grant 076467), and the University of Bristol (Bristol, United Kingdom), which provides core support for the Avon Longitudinal Study of Parents and Children (ALSPAC). N.M. was supported by a National Institute of Health Research clinician scientist award.

We are grateful to the midwives who helped to recruit the ALSPAC families and to the entire ALSPAC study team.

The views expressed in this publication are those of the authors and not necessarily those of the United Kingdom National Health Service, the National Institute for Health Research, or the United Kingdom Department of Health.

Conflict of interest: none declared.

APPENDIX 1

Estimation by Combination for a More General Linear SEM

Consider the following linear structural equation model (SEM):

\begin{aligned}
L & = γ_{0} + γ_{x} X + γ_{c} C + ϵ_{l} \\
M & = α_{0} + α_{x} X + α_{l} L + α_{c} C + α_{x l} X L + ϵ_{m} \\
Y & = β_{0} + β_{x} X + β_{l} L + β_{l l} L^{2} + β_{m} M + β_{m m} M^{2} + β_{c} C + β_{x l} X L + β_{x m} X M + ϵ_{y},
\end{aligned}

(8)

where the residual terms are uncorrelated with each other and the endogenous variables and have variances $σ_{l}^{2}$, $σ_{m}^{2}$, and $σ_{y}^{2}$, respectively. (Note that these variances are assumed to be constant, so that $Var (ϵ_{l} | X, C) = σ_{l}^{2}$, $Var (ϵ_{m} | X, L, C) = σ_{m}^{2}$, and $Var (ϵ_{y} | X, L, M, C) = σ_{y}^{2}$.)

For this model, the expression for the pure natural direct effect (PNDE) (see Web Appendix, part B),

\begin{aligned}
\int_{c} \Bigl\{ \int_{l'} \int_{m} \int_{l} \bigl\{ & E (Y | X = 1, M = m, L = l, C = c) f_{L} (l | X = 1, C = c) \\
& - E (Y | X = 0, M = m, L = l, C = c) f_{L} (l | X = 0, C = c) \bigr\} d l \\
& \times f_{M} (m | L = l', X = 0, C = c) f_{L} (l' | X = 0, C = c) d m d l' \Bigr\} f_{C} (c) d c,
\end{aligned}

(9)

can be written in closed form. Consider first its inner component:

\begin{aligned}
\int_{l} \bigl\{ & E (Y | X = 1, M = m, L = l, C = c) f_{L} (l | X = 1, C = c) \\
& - E (Y | X = 0, M = m, L = l, C = c) f_{L} (l | X = 0, C = c) \bigr\} d l .
\end{aligned}

This is equal to

\begin{aligned}
& \{ β_{0} + β_{x} + (β_{l} + β_{x l}) \bar{L_{1}} (c) + β_{l l} \bar{L_{1}^{2}} (c) + β_{m} m + β_{m m} m^{2} + β_{c} c + β_{x m} m \} \\
& \quad - \{ β_{0} + β_{l} \bar{L_{0}} (c) + β_{l l} \bar{L_{0}^{2}} (c) + β_{m} m + β_{m m} m^{2} + β_{c} c \} \\
& = β_{x} + β_{l} (\bar{L_{1}} (c) - \bar{L_{0}} (c)) + β_{l l} (\bar{L_{1}^{2}} (c) - \bar{L_{0}^{2}} (c)) + β_{x l} \bar{L_{1}} (c) + β_{x m} m,
\end{aligned}

(10)

where

\begin{aligned}
\bar{L_{x}} (c) & = E (L | X = x, C = c) = γ_{0} + γ_{x} x + γ_{c} c \\
\bar{L_{x}^{2}} (c) & = E (L^{2} | X = x, C = c) = (\bar{L_{x}} (c))^{2} + σ_{l}^{2} \\
\bar{L_{1}} (c) - \bar{L_{0}} (c) & = γ_{x} \\
\bar{L_{1}^{2}} (c) - \bar{L_{0}^{2}} (c) & = (\bar{L_{1}} (c))^{2} - (\bar{L_{0}} (c))^{2} = γ_{x}^{2} + 2 γ_{x} (γ_{0} + γ_{c} c) .
\end{aligned}

Let

A (c) = β_{x} + β_{l} γ_{x} + β_{l l} \{ γ_{x}^{2} + 2 γ_{x} (γ_{0} + γ_{c} c) \} + β_{x l} (γ_{0} + γ_{x} + γ_{c} c) .

Writing equation 10 as A(c) + β_xm m, we can rewrite equation 9 as

\begin{aligned}
& \int_{c} \int_{l'} \int_{m} (A (c) + β_{x m} m) f_{M} (m | L = l', X = 0, C = c) f_{L} (l' | X = 0, C = c) f_{C} (c) d m d l' d c \\
& = \int_{c} \{ A (c) + β_{x m} \bar{M_{0}} (c) \} f_{C} (c) d c \\
& = \bar{A} + β_{x m} \bar{\bar{M_{0}}},
\end{aligned}

where

\begin{aligned}
\bar{M_{x}} (c) & = E (M | X = x, C = c) = α_{0} + α_{x} x + (α_{l} + α_{x l} x) \bar{L_{x}} (c) + α_{c} c \\
\bar{M_{0}} (c) & = α_{0} + α_{l} (γ_{0} + γ_{c} c) + α_{c} c \\
\bar{A} & = \int A (c) f_{C} (c) d c \\
& = β_{x} + β_{l} γ_{x} + β_{l l} \{ γ_{x}^{2} + 2 γ_{x} (γ_{0} + γ_{c} μ_{c}) \} + β_{x l} (γ_{0} + γ_{x} + γ_{c} μ_{c}) \\
\bar{\bar{M_{0}}} & = \int \bar{M_{0}} (c) f_{C} (c) d c \\
& = α_{0} + α_{l} (γ_{0} + γ_{c} μ_{c}) + α_{c} μ_{c} .
\end{aligned}

Thus, with μ_c denoting the mean of C, equation 9 becomes

β_{x} + β_{l} γ_{x} + β_{l l} \{ γ_{x}^{2} + 2 γ_{x} (γ_{0} + γ_{c} μ_{c}) \} + β_{x l} (γ_{0} + γ_{x} + γ_{c} μ_{c}) + β_{x m} \{ α_{0} + α_{l} (γ_{0} + γ_{c} μ_{c}) + α_{c} μ_{c} \} .

If the model is correctly specified and if, additionally, the assumptions of no interference, strong consistency, and conditional exchangeability are met and one of the parametric assumptions described in the text is met, then this expression can be interpreted as the PNDE. However, note that the additional parametric assumptions constrain some of the parameters above to be zero, and thus the expression simplifies.
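As a worked illustration of that simplification, the two lines below set the constrained parameters to zero in the closed-form expression. The pairing of each model with its zero constraints is taken from the Mplus code in Appendix 2 (β_xm = 0 for Model 1; β_ll = β_xl = 0 for Model 2); the layout is only a sketch.

```latex
\begin{aligned}
\text{Model 1 } (β_{x m} = 0): \quad PNDE & = β_{x} + β_{l} γ_{x} + β_{l l} \{ γ_{x}^{2} + 2 γ_{x} (γ_{0} + γ_{c} μ_{c}) \} + β_{x l} (γ_{0} + γ_{x} + γ_{c} μ_{c}) \\
\text{Model 2 } (β_{l l} = β_{x l} = 0): \quad PNDE & = β_{x} + β_{l} γ_{x} + β_{x m} \{ α_{0} + α_{l} (γ_{0} + γ_{c} μ_{c}) + α_{c} μ_{c} \}
\end{aligned}
```

Under Model 1 the PNDE no longer depends on the mediator equation at all, whereas under Model 2 it depends on the mediator only through its mean under X = 0.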

Similar calculations for CDE(m), the controlled direct effect (CDE) of X on Y when M is controlled at m (see Web Appendix, part B), lead to

\begin{aligned}
CDE (m) & = \bar{A} + β_{x m} m \\
& = β_{x} + β_{l} γ_{x} + β_{l l} \{ γ_{x}^{2} + 2 γ_{x} (γ_{0} + γ_{c} μ_{c}) \} + β_{x l} (γ_{0} + γ_{x} + γ_{c} μ_{c}) + β_{x m} m,
\end{aligned}

with the interpretation as CDE(m) being justified if the model is correctly specified and if the appropriate assumptions (no interference, consistency, conditional exchangeability) are met; note that the parametric restrictions described in the text are not required for this estimand.
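The closed-form direct effects derived above can be evaluated with a few lines of code once the SEM coefficients are available. The sketch below assumes a single baseline confounder, as in equation 8; the dictionary keys (g* for γ, a* for α, b* for β) and the numerical values are hypothetical and were chosen only to exercise the formulas.

```python
def a_bar(p):
    """A-bar: the part of the direct effect that does not involve M."""
    l0 = p["g0"] + p["gc"] * p["mu_c"]   # E(L | X = 0), averaged over C
    l1 = l0 + p["gx"]                    # E(L | X = 1), averaged over C
    return (p["bx"] + p["bl"] * p["gx"]
            + p["bll"] * (p["gx"] ** 2 + 2 * p["gx"] * l0)
            + p["bxl"] * l1)


def m_barbar_0(p):
    """Mean of M under X = 0, averaged over C."""
    return p["a0"] + p["al"] * (p["g0"] + p["gc"] * p["mu_c"]) + p["ac"] * p["mu_c"]


def cde(p, m):
    """Controlled direct effect with the mediator fixed at m."""
    return a_bar(p) + p["bxm"] * m


def pnde(p):
    """Pure natural direct effect: CDE evaluated at the mean of M under X = 0."""
    return a_bar(p) + p["bxm"] * m_barbar_0(p)


# Hypothetical coefficient values (not estimates from the paper)
params = {
    "g0": 0.1, "gx": 0.2, "gc": 0.3,              # L equation
    "a0": 0.0, "al": 0.5, "ac": 0.1,              # M equation terms used here
    "bx": 0.1, "bl": 0.03, "bll": 0.03,           # Y equation
    "bxl": 0.08, "bxm": 0.01,
    "mu_c": 0.25,                                 # mean of C
}

print(f"CDE(0) = {cde(params, 0.0):.6f}, PNDE = {pnde(params):.6f}")
```

Because both estimands share A-bar, the PNDE and CDE(m) differ only through the value of M at which the X-M interaction is evaluated, and they coincide whenever β_xm = 0.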

Finally, for the total natural indirect effect (TNIE) (see Web Appendix, part B), we have the expression

\begin{aligned}
\int_{c} \int_{l'} \int_{m} \int_{l} & E (Y | X = 1, M = m, L = l, C = c) f_{L} (l | X = 1, C = c) \\
& \times \bigl\{ f_{M} (m | X = 1, L = l', C = c) f_{L} (l' | X = 1, C = c) \\
& \quad - f_{M} (m | X = 0, L = l', C = c) f_{L} (l' | X = 0, C = c) \bigr\} f_{C} (c) d l d m d l' d c,
\end{aligned}

which can be rewritten as

\int_{c} \{ (β_{m} + β_{x m}) (\bar{M_{1}} (c) - \bar{M_{0}} (c)) + β_{m m} (\bar{M_{1}^{2}} (c) - \bar{M_{0}^{2}} (c)) \} f_{C} (c) d c,

(11)

where

\begin{aligned}
\bar{M_{1}} (c) - \bar{M_{0}} (c) & = α_{x} + α_{x l} (γ_{0} + γ_{x} + γ_{c} c) + α_{l} γ_{x} \\
\bar{M_{1}^{2}} (c) - \bar{M_{0}^{2}} (c) & = E (M^{2} | X = 1, C = c) - E (M^{2} | X = 0, C = c) \\
& = \{ \bar{M_{1}} (c) \}^{2} - \{ \bar{M_{0}} (c) \}^{2} + Var (M | X = 1, C = c) - Var (M | X = 0, C = c) \\
& = \{ α_{0} + α_{x} + (α_{l} + α_{x l}) (γ_{0} + γ_{x} + γ_{c} c) + α_{c} c \}^{2} - \{ α_{0} + α_{l} (γ_{0} + γ_{c} c) + α_{c} c \}^{2} \\
& \quad + (α_{l} + α_{x l})^{2} σ_{l}^{2} + σ_{m}^{2} - (α_{l}^{2} σ_{l}^{2} + σ_{m}^{2}) \\
& = \{ α_{x} + α_{l} γ_{x} + α_{x l} (γ_{0} + γ_{x} + γ_{c} c) \}^{2} \\
& \quad + 2 \{ α_{0} + α_{l} (γ_{0} + γ_{c} c) + α_{c} c \} \{ α_{x} + α_{l} γ_{x} + α_{x l} (γ_{0} + γ_{x} + γ_{c} c) \} \\
& \quad + (2 α_{l} + α_{x l}) α_{x l} σ_{l}^{2} .
\end{aligned}

Thus, equation 11 can be rewritten as

\begin{aligned}
& (β_{m} + β_{x m}) \{ α_{x} + α_{x l} (γ_{0} + γ_{x} + γ_{c} μ_{c}) + α_{l} γ_{x} \} \\
& + β_{m m} \bigl( \{ α_{x} + α_{l} γ_{x} + α_{x l} (γ_{0} + γ_{x}) \}^{2} \\
& \quad + 2 (α_{0} + α_{l} γ_{0}) \{ α_{x} + α_{l} γ_{x} + α_{x l} (γ_{0} + γ_{x}) \} \\
& \quad + 2 [ (α_{0} + α_{l} γ_{0}) α_{x l} γ_{c} + \{ α_{x} + α_{l} γ_{x} + α_{x l} (γ_{0} + γ_{x}) \} (α_{c} + α_{l} γ_{c} + α_{x l} γ_{c}) ] μ_{c} \\
& \quad + \{ 2 (α_{c} + α_{l} γ_{c}) + α_{x l} γ_{c} \} α_{x l} γ_{c} (μ_{c}^{2} + σ_{c}^{2}) + (2 α_{l} + α_{x l}) α_{x l} σ_{l}^{2} \bigr),
\end{aligned}

where $σ_{c}^{2}$ is the variance of C.

Again, this can be interpreted as the TNIE if the model is correctly specified and if, additionally, the assumptions of no interference, strong consistency, and conditional exchangeability are met and one of the parametric assumptions is met. Note again that the additional parametric assumptions constrain some of the parameters above to be zero, simplifying the expression.
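A useful check on the general TNIE expression is the special case of a purely linear SEM without interactions or quadratic terms. Setting β_xm = β_mm = α_xl = 0 (an illustrative special case, not one of the two models fitted in this paper) removes every term except the first:

```latex
\begin{aligned}
TNIE & = (β_{m} + 0) \{ α_{x} + 0 \cdot (γ_{0} + γ_{x} + γ_{c} μ_{c}) + α_{l} γ_{x} \} + 0 \\
& = β_{m} α_{x} + β_{m} α_{l} γ_{x} .
\end{aligned}
```

This is the sum of products of coefficients along the two paths from X to Y that pass through M, namely X → M → Y and X → L → M → Y.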

APPENDIX 2

G-Computation in Stata and Mplus

y: dependent variable
x: exposure
m: mediator
l: intermediate confounder
c_1: first baseline confounder
c_2: second baseline confounder
m2: m²
l2: l²
xl: x × l
xm: x × m

A. G-computation by Monte Carlo simulations using Stata

To implement G-computation by Monte Carlo simulation, we have used the user-written command gformula. The syntax used was as follows (for more details, refer to Daniel et al. (50)):

1. Model 1 (Robins and Greenland's identifying assumptions (11)):

#delimit ;

gformula y x m m2 l l2 c1 c2 xl,

mediation outcome(y) exposure(x) mediator(m)

post_confs(l) base_confs(c1 c2)

obe control( m:0)

commands(y:regress, m:regress, l:regress)

equations(y:x m m2 l l2 c1 c2 xl, m:x l c1 c2 xl, l:x c1 c2)

derived(m2 l2 xl) derrules(m2:m*m,l2:l*l, xl:x*l)

minsim samples(1000) moreMC simulations(100000) replace seed(79);

#delimit cr

2. Model 2 (Petersen et al.'s identifying assumptions (27)):

#delimit ;

gformula y x m m2 l l2 c1 c2 xl xm,

mediation outcome(y) exposure(x) mediator(m)

post_confs(l) base_confs(c1 c2)

obe control( m:0)

commands(y:regress, m:regress, l:regress)

equations(y:x m m2 l c1 c2 xm, m:x l c1 c2 xl, l:x c1 c2)

derived(m2 l2 xl xm) derrules(m2:m*m,l2:l*l, xl:x*l, xm:x*m)

minsim samples(1000) moreMC simulations(100000) replace seed(79);

#delimit cr
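To make the idea of simulation-based G-computation concrete, the sketch below evaluates equation 9 by Monte Carlo for the SEM in equation 8 and compares the result with the closed-form PNDE from Appendix 1. It is not the gformula implementation: the binary confounder, the normal error terms, the parameter values, and all function names are assumptions made for illustration.

```python
import random

# Hypothetical parameter values for the SEM in equation 8, with a binary
# baseline confounder C and normal errors (assumptions, not paper estimates).
params = {
    "prob_c": 0.3,                                   # P(C = 1), so mu_c = 0.3
    "g0": 0.1, "gx": 0.2, "gc": 0.3, "sd_l": 1.0,    # L equation
    "a0": 0.0, "ax": 0.4, "al": 0.5, "ac": 0.1,      # M equation
    "axl": 0.05, "sd_m": 1.0,
    "b0": 0.0, "bx": 0.1, "bl": 0.03, "bll": 0.03,   # Y equation
    "bm": 0.3, "bmm": 0.04, "bc": 0.2, "bxl": 0.08, "bxm": 0.01,
}


def outcome_mean(p, x, m, l, c):
    """E(Y | X = x, M = m, L = l, C = c) from the Y equation."""
    return (p["b0"] + p["bx"] * x + p["bl"] * l + p["bll"] * l * l
            + p["bm"] * m + p["bmm"] * m * m + p["bc"] * c
            + p["bxl"] * x * l + p["bxm"] * x * m)


def mc_pnde(p, n_draws, seed):
    """Monte Carlo evaluation of equation 9."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_draws):
        c = 1.0 if rng.random() < p["prob_c"] else 0.0
        base_l = p["g0"] + p["gc"] * c
        # Mediator under X = 0, built from its own draw l' of L under X = 0
        l_prime = base_l + rng.gauss(0.0, p["sd_l"])
        m = p["a0"] + p["al"] * l_prime + p["ac"] * c + rng.gauss(0.0, p["sd_m"])
        # Separate draws of L under X = 1 and X = 0 for the two outcome terms
        l1 = base_l + p["gx"] + rng.gauss(0.0, p["sd_l"])
        l0 = base_l + rng.gauss(0.0, p["sd_l"])
        total += outcome_mean(p, 1, m, l1, c) - outcome_mean(p, 0, m, l0, c)
    return total / n_draws


def closed_form_pnde(p):
    """A-bar + beta_xm * M-barbar_0, with mu_c = P(C = 1)."""
    l0 = p["g0"] + p["gc"] * p["prob_c"]
    a_bar = (p["bx"] + p["bl"] * p["gx"]
             + p["bll"] * (p["gx"] ** 2 + 2 * p["gx"] * l0)
             + p["bxl"] * (l0 + p["gx"]))
    return a_bar + p["bxm"] * (p["a0"] + p["al"] * l0 + p["ac"] * p["prob_c"])


# The seed is arbitrary; more draws shrink the simulation error.
estimate = mc_pnde(params, n_draws=50_000, seed=79)
print(f"Monte Carlo: {estimate:.4f}  closed form: {closed_form_pnde(params):.4f}")
```

The two values agree only up to simulation error, which is why a large number of draws is needed before Monte Carlo and closed-form results match to the third decimal place.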

B. G-computation via estimation by combination using Mplus

The implementation with 2 confounders requires an extension of the expressions given in Appendix 1.

Let μ_c₁ and μ_c₂ be the mean values of the 2 confounders, $σ_{c 1}^{2}$ and $σ_{c 2}^{2}$ their variances, and $σ_{12}$ their covariance. Also let

\begin{aligned}
L_{0} & = γ_{0} + (γ_{c 1} μ_{c 1} + γ_{c 2} μ_{c 2}) \\
L_{1} & = L_{0} + γ_{x} \\
P_{1} & = α_{0} + α_{l} γ_{0} \\
P_{2} & = α_{x} + α_{l} γ_{x} + α_{x l} (γ_{0} + γ_{x}) \\
\bar{A} & = β_{x} + β_{l} γ_{x} + β_{l l} \{ γ_{x}^{2} + 2 γ_{x} L_{0} \} + β_{x l} L_{1} \\
\bar{\bar{M_{0}}} & = α_{0} + α_{l} L_{0} + α_{c 1} μ_{c 1} + α_{c 2} μ_{c 2} \\
P_{c 1} & = (α_{c 1} + α_{l} γ_{c 1} + α_{x l} γ_{c 1}) μ_{c 1} \\
P_{c 2} & = (α_{c 2} + α_{l} γ_{c 2} + α_{x l} γ_{c 2}) μ_{c 2} .
\end{aligned}

Then,

\begin{aligned}
CDE (m) & = \bar{A} + β_{x m} m \\
PNDE & = \bar{A} + β_{x m} \bar{\bar{M_{0}}} \\
TNIE & = (β_{m} + β_{x m}) \{ α_{x} + α_{x l} L_{1} + α_{l} γ_{x} \} \\
& \quad + β_{m m} \bigl( P_{2}^{2} + 2 P_{1} P_{2} + 2 [ P_{1} α_{x l} γ_{c 1} μ_{c 1} + P_{2} P_{c 1} ] + 2 [ P_{1} α_{x l} γ_{c 2} μ_{c 2} + P_{2} P_{c 2} ] \\
& \quad + [ 2 (α_{c 1} + α_{l} γ_{c 1}) + α_{x l} γ_{c 1} ] α_{x l} γ_{c 1} (μ_{c 1}^{2} + σ_{c 1}^{2}) + [ 2 (α_{c 2} + α_{l} γ_{c 2}) + α_{x l} γ_{c 2} ] α_{x l} γ_{c 2} (μ_{c 2}^{2} + σ_{c 2}^{2}) \\
& \quad + ( [ 2 (α_{c 1} + α_{l} γ_{c 1}) + α_{x l} γ_{c 1} ] α_{x l} γ_{c 2} + [ 2 (α_{c 2} + α_{l} γ_{c 2}) + α_{x l} γ_{c 2} ] α_{x l} γ_{c 1} ) (μ_{c 1} μ_{c 2} + σ_{12}) \\
& \quad + (2 α_{l} + α_{x l}) α_{x l} σ_{l}^{2} \bigr) .
\end{aligned}
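One way to gain confidence in a long closed-form expression such as the two-confounder TNIE is to compare it with a brute-force average of equation 11 over a small discrete distribution of the confounders. The sketch below codes the TNIE as it appears in the Mplus MODEL CONSTRAINT section that follows and checks it against enumeration over two binary confounders. All parameter values, the joint distribution, and the function names are hypothetical.

```python
def tnie_two_confounders(p):
    """TNIE with two baseline confounders, following the Mplus MODEL CONSTRAINT."""
    l1 = p["g0"] + p["gx"] + p["gc1"] * p["mu1"] + p["gc2"] * p["mu2"]
    p1 = p["a0"] + p["al"] * p["g0"]
    p2 = p["ax"] + p["al"] * p["gx"] + p["axl"] * (p["g0"] + p["gx"])
    q1 = p["ac1"] + p["al"] * p["gc1"]     # alpha_c1 + alpha_l * gamma_c1
    q2 = p["ac2"] + p["al"] * p["gc2"]
    k1 = p["axl"] * p["gc1"]               # alpha_xl * gamma_c1
    k2 = p["axl"] * p["gc2"]
    sq = (p2 ** 2 + 2 * p1 * p2
          + 2 * (p1 * k1 + p2 * (q1 + k1)) * p["mu1"]
          + 2 * (p1 * k2 + p2 * (q2 + k2)) * p["mu2"]
          + (2 * q1 + k1) * k1 * (p["mu1"] ** 2 + p["var1"])
          + (2 * q2 + k2) * k2 * (p["mu2"] ** 2 + p["var2"])
          + ((2 * q1 + k1) * k2 + (2 * q2 + k2) * k1)
          * (p["mu1"] * p["mu2"] + p["cov12"])
          + (2 * p["al"] + p["axl"]) * p["axl"] * p["var_l"])
    mean_diff = p["ax"] + p["axl"] * l1 + p["al"] * p["gx"]
    return (p["bm"] + p["bxm"]) * mean_diff + p["bmm"] * sq


def tnie_enumerated(p, joint):
    """Average equation 11 over a discrete joint distribution of (C1, C2)."""
    total = 0.0
    for (c1, c2), w in joint.items():
        l0 = p["g0"] + p["gc1"] * c1 + p["gc2"] * c2
        base = p["a0"] + p["ac1"] * c1 + p["ac2"] * c2
        m0 = base + p["al"] * l0
        m1 = base + p["ax"] + (p["al"] + p["axl"]) * (l0 + p["gx"])
        var_diff = ((p["al"] + p["axl"]) ** 2 - p["al"] ** 2) * p["var_l"]
        total += w * ((p["bm"] + p["bxm"]) * (m1 - m0)
                      + p["bmm"] * (m1 ** 2 - m0 ** 2 + var_diff))
    return total


# Hypothetical joint distribution of two binary confounders
joint = {(0, 0): 0.4, (0, 1): 0.2, (1, 0): 0.1, (1, 1): 0.3}
mu1 = sum(w * c1 for (c1, _), w in joint.items())
mu2 = sum(w * c2 for (_, c2), w in joint.items())

params = {
    "g0": 0.1, "gx": 0.2, "gc1": 0.3, "gc2": -0.2, "var_l": 1.0,
    "a0": 0.05, "ax": 0.4, "al": 0.5, "ac1": 0.1, "ac2": 0.15, "axl": 0.05,
    "bm": 0.3, "bmm": 0.04, "bxm": 0.01,
    "mu1": mu1, "mu2": mu2,
    "var1": sum(w * c1 ** 2 for (c1, _), w in joint.items()) - mu1 ** 2,
    "var2": sum(w * c2 ** 2 for (_, c2), w in joint.items()) - mu2 ** 2,
    "cov12": sum(w * c1 * c2 for (c1, c2), w in joint.items()) - mu1 * mu2,
}

print(tnie_two_confounders(params), tnie_enumerated(params, joint))
```

The enumeration uses only the conditional means and variances of M, so agreement between the two functions checks the algebra of the expansion, not the causal assumptions behind it.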

The code below is for Mplus, version 7.11 (67), where we use the labeling options to identify the relevant parameters.

1. Model 1 (Robins and Greenland's identifying assumptions (11)):

TITLE: Model 1

DATA: FILE IS "……";

Format is free;

LISTWISE=ON;

VARIABLE: NAMES ARE id y x m m2 l l2 c_1 c_2 xl xm;

USEV ARE y x m m2 l l2 c_1 c_2 xl;

MISSING ARE .;

IDVARIABLE= id;

MODEL:

[y] (beta0);

y ON x (betax);

y ON m (betam);

y ON m2 (betamm);

y ON l (betal);

y ON l2 (betall);

y ON c_1 (betac1);

y ON c_2 (betac2);

y ON xl (betaxl);

[m] (alpha0);

m ON x (alphax);

m ON l (alphal);

m ON c_1 (alphac1);

m ON c_2 (alphac2);

m ON xl (alphaxl);

m (sigma2m);

[l] (gamma0);

l ON x (gammax);

l ON c_1 (gammac1);

l ON c_2 (gammac2);

l (sigma2l);

[c_1] (muc1);

[c_2] (muc2);

c_1 (sigma2c1);

c_2 (sigma2c2);

c_1 WITH c_2 (covc1c2);

MODEL CONSTRAINT:

!this command lists all the terms used for the calculations

!and gives them starting values:

NEW (betaxm*0 L0*.1 L1*.1 P1*.1 P2*.1 P_c1*.1 P_c2*.1

A_bar*.1 M_barbar_0*.1 cde0*.1 pnde*.1 tnie*.1 tce*0.1 );

!this is to remind ourselves of the Robins and Greenland assumption

! while using the general expressions

betaxm=0;

!for CDE(0)

L0 = gamma0+(gammac1*muc1+gammac2*muc2);

L1 = gamma0+gammax+(gammac1*muc1+gammac2*muc2);

A_bar=betax+betal*gammax+betall*(gammax*gammax+2*gammax*L0)+betaxl*L1;

cde0=A_bar+betaxm*0;

!for PNDE

M_barbar_0=alpha0+alphal*L0+(alphac1*muc1+alphac2*muc2);

pnde=A_bar+betaxm*M_barbar_0;

!for TNIE

P1=alpha0+alphal*gamma0;

P2= (alphax +alphal*gammax+alphaxl*(gamma0+gammax));

P_c1=(alphac1+alphal*gammac1+alphaxl*gammac1)*muc1;

P_c2=(alphac2+alphal*gammac2+alphaxl*gammac2)*muc2;

tnie=(betam+betaxm)*(alphax+alphaxl*L1+gammax*alphal)

+ betamm*(P2*P2+2*P1*P2

+2*(P1*alphaxl*gammac1*muc1+P2*P_c1)

+2*(P1*alphaxl*gammac2*muc2+P2*P_c2)

+(2*(alphac1+alphal*gammac1)+alphaxl*gammac1)*alphaxl*gammac1*

(muc1*muc1+sigma2c1)

+(2*(alphac2+alphal*gammac2)+alphaxl*gammac2)*alphaxl*gammac2*

(muc2*muc2+sigma2c2)

+( (2*(alphac1+alphal*gammac1)+alphaxl*gammac1)*alphaxl*gammac2

+(2*(alphac2+alphal*gammac2)+alphaxl*gammac2)*alphaxl*gammac1

)*(muc1*muc2+covc1c2)

+(2*alphal+alphaxl)*alphaxl*sigma2l

);

tce=tnie+pnde;

OUTPUT: SAMPSTAT ;

2. Model 2 (Petersen et al.'s identifying assumptions (27)):

TITLE: Model 2

DATA: FILE IS " ……..dat";

Format is free;

LISTWISE=ON;

VARIABLE: NAMES ARE id y x m m2 l l2 c_1 c_2 xl xm;

USEV ARE y x m m2 l c_1 c_2 xm xl;

MISSING ARE .;

IDVARIABLE= id;

MODEL:

[y] (beta0);

y ON x (betax);

y ON m (betam);

y ON m2 (betamm);

y ON l (betal);

y ON c_1 (betac1);

y ON c_2 (betac2);

y ON xm (betaxm);

[m] (alpha0);

m ON x (alphax);

m ON l (alphal);

m ON c_1 (alphac1);

m ON c_2 (alphac2);

m ON xl (alphaxl);

m (sigma2m);

[l] (gamma0);

l ON x (gammax);

l ON c_1 (gammac1);

l ON c_2 (gammac2);

l (sigma2l);

[c_1] (muc1);

[c_2] (muc2);

c_1 (sigma2c1);

c_2 (sigma2c2);

c_1 WITH c_2 (covc1c2);

MODEL CONSTRAINT:

NEW (betall*0 betaxl*0 L0*.1 L1*.1

P1*.1 P2*.1 P_c1*.1 P_c2*.1

A_bar*.1 M_barbar_0*.1

cde0*.1 pnde*.1 tnie*.1 tce*0.1 );

!this is to remind us of the Petersen et al assumptions

! while using the general expressions

betall=0;

betaxl=0;

!for CDE(0)

L0 = gamma0+(gammac1*muc1+gammac2*muc2);

L1 = gamma0+gammax+(gammac1*muc1+gammac2*muc2);

A_bar=betax+betal*gammax+betall*(gammax*gammax+2*gammax*L0)+betaxl*L1;

cde0=A_bar+betaxm*0;

!for PNDE

M_barbar_0=alpha0+alphal*L0+(alphac1*muc1+alphac2*muc2);

pnde=A_bar+betaxm*M_barbar_0;

!for TNIE

P1=alpha0+alphal*gamma0;

P2= (alphax +alphal*gammax+alphaxl*(gamma0+gammax));

P_c1=(alphac1+alphal*gammac1+alphaxl*gammac1)*muc1;

P_c2=(alphac2+alphal*gammac2+alphaxl*gammac2)*muc2;

tnie=(betam+betaxm)*(alphax+alphaxl*L1+gammax*alphal)

+ betamm*(P2*P2+2*P1*P2

+2*(P1*alphaxl*gammac1*muc1+P2*P_c1)

+2*(P1*alphaxl*gammac2*muc2+P2*P_c2)

+(2*(alphac1+alphal*gammac1)+alphaxl*gammac1)*alphaxl*gammac1*

(muc1*muc1+sigma2c1)

+(2*(alphac2+alphal*gammac2)+alphaxl*gammac2)*alphaxl*gammac2*

(muc2*muc2+sigma2c2)

+( (2*(alphac1+alphal*gammac1)+alphaxl*gammac1)*alphaxl*gammac2

+(2*(alphac2+alphal*gammac2)+alphaxl*gammac2)*alphaxl*gammac1

)*(muc1*muc2+covc1c2)

+(2*alphal+alphaxl)*alphaxl*sigma2l

);

tce=tnie+pnde;

OUTPUT: SAMPSTAT ;

APPENDIX 3

Stata—Sensitivity Analysis

Sensitivity analyses were carried out using the ado file called sens_rho.ado, outlined below. It fits a posited structural equation model (SEM), with an equation each for Y, M, and L. In this example, it fits a model consonant with Robins and Greenland's assumption (11). Note that the model for Y does not include M or any function of M among its explanatory variables, in order to allow for a correlation between the error terms of the Y and M equations.

program define sens_rho, rclass

version 13

preserve

cap matrix drop Psi

sem (y <- x l xl l2 c_1 c_2) (l<- x c_1 c_2) (m <- x l xl l2 c_1 c_2), ///
nocapslatent cov(e.y*e.m)

qui estat framework, fitted

matrix Psi=r(Psi)

matrix list Psi

scalar rho_dash=(Psi[3,1])/(sqrt(Psi[1,1]*Psi[3,3]))

scalar list rho_dash

return scalar rho=rho_dash

restore

end

It is best to check that sens_rho.ado picks the right elements of the error term's variance-covariance matrix by running the program once:

. sens_rho

Then, one needs to type

. bootstrap rho_dash=r(rho), reps(1000) saving(sens_rho,replace):sens_rho

. estat bootstrap, all

to run the Stata bootstrap command with 1,000 replications and see the results.
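The quantity returned by the program is an ordinary correlation computed from three elements of the error covariance matrix. The sketch below restates that calculation outside Stata; the matrix values are made up, and the ordering of the error terms (e.y, e.l, e.m) is an assumption that should be checked against the fitted model, as noted above.

```python
import math


def residual_correlation(psi, i, j):
    """Correlation between error terms i and j of a covariance matrix psi."""
    return psi[i][j] / math.sqrt(psi[i][i] * psi[j][j])


# Hypothetical error covariance matrix, assumed ordering: e.y, e.l, e.m
psi = [
    [4.0, 0.0, 1.0],   # e.y
    [0.0, 1.0, 0.0],   # e.l
    [1.0, 0.0, 1.0],   # e.m
]

# Python indexes from 0, so Stata's Psi[3,1] corresponds to psi[2][0].
rho = residual_correlation(psi, 2, 0)
print(f"rho = {rho:.3f}")
```

If the error terms were stored in a different order, the row and column indices would need to change accordingly.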

REFERENCES

1. Judd CM, Kenny DA. Process analysis: estimating mediation in treatment evaluation. Eval Rev. 1981;5(5):602–619.
2. Baron RM, Kenny DA. The moderator–mediator variable distinction in social psychological research: conceptual, strategic, and statistical considerations. J Pers Soc Psychol. 1986;51(6):1173–1182. doi: 10.1037//0022-3514.51.6.1173.
3. Wright S. The method of path coefficients. Ann Math Stat. 1934;5(3):161–215.
4. Bollen KA. Structural Equations with Latent Variables. New York, NY: John Wiley & Sons, Inc.; 1989. Causality and causal models; pp. 40–79.
5. Duncan OD. Path analysis: sociological examples. AJS. 1966;72(1):1–16.
6. Sobel ME. Asymptotic confidence intervals for indirect effects in structural equation models. Sociol Methodol. 1982;13:290–312.
7. James LR, Brett JM. Mediators, moderators, and tests for mediation. J Appl Psychol. 1984;69(2):307–321.
8. MacKinnon D. Introduction to Statistical Mediation Analysis. New York, NY: Taylor & Francis; 2008. Single mediator model; pp. 47–78.
9. Hayes AF, Preacher KJ. Quantifying and testing indirect effects in simple mediation models when the constituent paths are nonlinear. Multivariate Behav Res. 2010;45(4):627–660. doi: 10.1080/00273171.2010.498290.
10. Muthén B. Applications of Causally Defined Direct and Indirect Effects in Mediation Analysis Using SEM in Mplus. Los Angeles, CA: Muthén and Muthén; 2011. http://statmodel2.com/download/causalmediation.pdf. Accessed August 8, 2014.
11. Robins JM, Greenland S. Identifiability and exchangeability for direct and indirect effects. Epidemiology. 1992;3(2):143–155. doi: 10.1097/00001648-199203000-00013.
12. Pearl J. Direct and indirect effects; Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence; San Francisco, CA: Morgan Kaufmann; 2001. pp. 411–420.
13. VanderWeele TJ, Vansteelandt S. Conceptual issues concerning mediation, interventions and composition. Stat Interface. 2009;2(4):457–468.
14. VanderWeele TJ. Invited commentary: structural equation modeling and epidemiologic analysis. Am J Epidemiol. 2012;176(7):608–612. doi: 10.1093/aje/kws213.
15. Valeri L, VanderWeele TJ. Mediation analysis allowing for exposure-mediator interactions and causal interpretation: theoretical assumptions and implementation with SAS and SPSS macros. Psychol Methods. 2013;18(2):137–150. doi: 10.1037/a0031034.
16. Emsley R, Dunn G, White IR. Mediation and moderation of treatment effects in randomised controlled trials of complex interventions. Stat Methods Med Res. 2010;19(3):237–270. doi: 10.1177/0962280209105014.
17. Hafeman DM, VanderWeele TJ. Alternative assumptions for the identification of direct and indirect effects. Epidemiology. 2011;22(6):753–764. doi: 10.1097/EDE.0b013e3181c311b2.
18. Ten Have TR, Joffe MM. A review of causal estimation of effects in mediation analyses. Stat Methods Med Res. 2012;21(1):77–107. doi: 10.1177/0962280210391076.
19. Pearl J. Los Angeles, CA: Department of Computer Science, University of California, Los Angeles; 2012. Interpretable conditions for identifying direct and indirect effects. (Technical report R-389)
20. VanderWeele TJ. Marginal structural models for the estimation of direct and indirect effects. Epidemiology. 2009;20(1):18–26. doi: 10.1097/EDE.0b013e31818f69ce.
21. Robins JM. Testing and estimation of direct effects by reparameterizing directed acyclic graphs with structural nested models. In: Glymour C, Cooper G, editors. Computation, Causation, and Discovery. Menlo Park, CA/Cambridge, MA: AAAI Press/The MIT Press; 1999. pp. 349–405.
22. Vansteelandt S. Estimating direct effects in cohort and case-control studies. Epidemiology. 2009;20(6):851–860. doi: 10.1097/EDE.0b013e3181b6f4c9.
23. Joffe MM, Greene T. Related causal frameworks for surrogate outcomes. Biometrics. 2009;65(2):530–538. doi: 10.1111/j.1541-0420.2008.01106.x.
24. Goetgeluk S, Vansteelandt S, Goetghebeur E. Estimation of controlled direct effects. J R Stat Soc Series B Stat Methodol. 2008;70(5):1049–1066.
25. Tchetgen Tchetgen EJ, Shpitser I. Semiparametric theory for causal mediation analysis: efficiency bounds, multiple robustness and sensitivity analysis. Ann Stat. 2012;40(3):1816–1845. doi: 10.1214/12-AOS990.
26. Vansteelandt S, Bekaert M, Lange T. Imputation strategies for the estimation of natural direct and indirect effects. Epidemiol Methods. 2012;1(1):131–158. doi: 10.1093/aje/kwr525.
27. Petersen ML, Sinisi SE, van der Laan MJ. Estimation of direct causal effects. Epidemiology. 2006;17(3):276–284. doi: 10.1097/01.ede.0000208475.99429.2d.
28. VanderWeele TJ, Vansteelandt S, Robins JM. Effect decomposition in the presence of an exposure-induced mediator-outcome confounder. Epidemiology. 2014;25(2):300–306. doi: 10.1097/EDE.0000000000000034.
29. Tchetgen Tchetgen EJ, VanderWeele TJ. Identification of natural direct effects when a confounder of the mediator is directly affected by exposure. Epidemiology. 2014;25(2):282–291. doi: 10.1097/EDE.0000000000000054.
30. Rubin DB. Estimating causal effects of treatments in randomized and nonrandomized studies. J Educ Psychol. 1974;66(5):688–701.
31. VanderWeele TJ, Vansteelandt S. Odds ratios for mediation analysis for a dichotomous outcome. Am J Epidemiol. 2010;172(12):1339–1348. doi: 10.1093/aje/kwq332.
32. Vansteelandt S. Estimation of controlled direct effects on a dichotomous outcome using logistic structural direct effect models. Biometrika. 2010;97(4):921–934.
33. Martinussen T, Vansteelandt S, Gerster M, et al. Estimation of direct effects for survival data by using the Aalen additive hazards model. J R Stat Soc Series B Stat Methodol. 2011;73(5):773–788.
34. Robins JM. Semantics of causal DAG models and the identification of direct and indirect effects. In: Green P, Hjort N, Richardson S, editors. Highly Structured Stochastic Systems. New York, NY: Oxford University Press; 2003. pp. 70–81.
35. VanderWeele TJ. Mediation and mechanism. Eur J Epidemiol. 2009;24(5):217–224. doi: 10.1007/s10654-009-9331-1.
36. Cox DR. Planning of Experiments. New York, NY: John Wiley & Sons, Inc.; 1958.
37. Rubin DB. Comment on: “Randomization analysis of experimental data in the Fisher randomization test” by D. Basu. J Am Stat Assoc. 1980;75(371):591–593.
38. Hudgens MG, Halloran ME. Toward causal inference with interference. J Am Stat Assoc. 2008;103(482):832–842. doi: 10.1198/016214508000000292.
39. Tchetgen Tchetgen EJ, VanderWeele TJ. On causal inference in the presence of interference. Stat Methods Med Res. 2012;21(1):55–75. doi: 10.1177/0962280210386779.
40. Hernán MA, Taubman SL. Does obesity shorten life? The importance of well-defined interventions to answer causal questions. Int J Obes (Lond) 2008;32(suppl 3):S8–S14. doi: 10.1038/ijo.2008.82.
41. Cole SR, Frangakis CE. The consistency statement in causal inference: a definition or an assumption? Epidemiology. 2009;20(1):3–5. doi: 10.1097/EDE.0b013e31818ef366.
42. VanderWeele TJ. Concerning the consistency assumption in causal inference. Epidemiology. 2009;20(6):880–883. doi: 10.1097/EDE.0b013e3181bd5638.
43. Pearl J. On the consistency rule in causal inference: axiom, definition, assumption, or theorem? Epidemiology. 2010;21(6):872–875. doi: 10.1097/EDE.0b013e3181f5d3fd.
44. Rubin DB. Bayesian inference for causal effects: the role of randomization. Ann Stat. 1978;6(1):34–58.
45. Pearl J. The mediation formula: a guide to the assessment of causal pathways in nonlinear models. In: Berzuini C, Dawid AP, Bernardinelli L, editors. Causality: Statistical Perspectives and Applications. Chichester, United Kingdom: John Wiley & Sons Ltd.; 2012. pp. 151–179.
46. Vansteelandt S. Estimation of direct and indirect effects. In: Berzuini C, Dawid AP, Bernardinelli L, editors. Causality: Statistical Perspectives and Applications. Chichester, United Kingdom: John Wiley & Sons Ltd.; 2012. pp. 126–150.
47. Robins J. A new approach to causal inference in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect. Math Model. 1986;7(9-12):1393–1512.
48. Vansteelandt S, VanderWeele TJ. Natural direct and indirect effects on the exposed: effect decomposition under weaker assumptions. Biometrics. 2012;68(4):1019–1027. doi: 10.1111/j.1541-0420.2012.01777.x.
49. Imai K, Keele L, Yamamoto T. Identification, inference and sensitivity analysis for causal mediation effects. Stat Sci. 2010;25(1):51–71.
50. Daniel RM, De Stavola BL, Cousens SN. gformula: estimating causal effects in the presence of time-varying confounding or mediation using the g-computation formula. Stata J. 2011;11(4):479–517.
51. Zheng W, van der Laan MJ. Targeted maximum likelihood estimation of natural direct effects. Int J Biostat. 2012;8(1) doi: 10.2202/1557-4679.1361.
52. Mulaik S. Linear Causal Modeling with Structural Equations. Boca Raton, FL: CRC Press; 2009. Structural equation models; pp. 119–138.
53. MacKinnon DP, Warsi G, Dwyer JH. A simulation study of mediated effect measures. Multivariate Behav Res. 1995;30(1):41–62. doi: 10.1207/s15327906mbr3001_3.
54. MacKinnon DP, Dwyer JH. Estimating mediated effects in prevention studies. Eval Rev. 1993;17(2):144–158.
55. Muthén B, Asparouhov T. Los Angeles, CA: Muthén and Muthén; 2014. Causal effects in mediation modeling: an introduction with applications to latent variables. http://www.statmodel.com/Mediation.shtml. Accessed November 19, 2014.
56. Bentler PM. Multivariate analysis with latent variables: causal modeling. Annu Rev Psychol. 1980;31:419–456.
57. MacKinnon D. Introduction to Statistical Mediation Analysis. New York, NY: Taylor & Francis; 2008. Computer intensive methods for mediation models; pp. 325–346.
58. Hoyle R, Kenny D. Sample Size, Reliability, and Tests of Statistical Mediation. Thousand Oaks, CA: Sage Publications; 1999.
59. Hernán MA. Beyond exchangeability: the other conditions for causal inference in medical research. Stat Methods Med Res. 2012;21(1):3–5. doi: 10.1177/0962280211398037.
60. Emsley R, Liu H, et al. PARAMED: Stata module to perform causal mediation analysis using parametric models. St. Louis, MO: Federal Reserve Bank of St. Louis; 2013. https://ideas.repec.org/c/boc/bocode/s457581.html. Accessed November 19, 2014.
61. Wermuth N, Cox DR. Distortion of effects caused by indirect confounding. Biometrika. 2008;95(1):17–33.
62. Moerkerke B, Loeys T, Vansteelandt S. Structural equation modeling versus marginal structural modeling for assessing mediation in the presence of post-treatment confounding. Psychol Methods. doi: 10.1037/a0036368. In press.
63.Pearl J. Interpretation and Identification of Causal Mediation. Los Angeles, CA: Department of Computer Science, University of California, Los Angeles; 2014. http://ftp.cs.ucla.edu/pub/stat_ser/r389.pdf . Accessed August 8, 2014. [Google Scholar]
64.Imai K, Yamamoto T. Identification and sensitivity analysis for multiple causal mechanisms: revisiting evidence from framing experiments. Polit Anal. 2013;21(2):141–171. [Google Scholar]
65.Boyd A, Golding J, Macleod J, et al. Cohort profile: the ‘children of the 90s’—the index offspring of the Avon Longitudinal Study of Parents and Children. Int J Epidemiol. 2013;42(1):111–127. doi: 10.1093/ije/dys064. [DOI] [PMC free article] [PubMed] [Google Scholar]
66.Micali N, Ploubidis G, De Stavola B, et al. Frequency and patterns of eating disorder symptoms in early adolescence. J Adolesc Health. 2014;54(5):574–581. doi: 10.1016/j.jadohealth.2013.10.200. [DOI] [PubMed] [Google Scholar]
67.Muthén LK, Muthén BO. Mplus User's Guide. 7th ed. Los Angeles, CA: Muthén and Muthén; 1998. [Google Scholar]
68.Tchetgen Tchetgen EJ. Berkeley, CA: Collection of Biostatistics Research Archive, Berkeley Electronic Press; 2012. Formulae for causal mediation analysis in an odds ratio context without a normality assumption for the continuous mediator. (Harvard University Biostatistics Working Paper no. 139) http://biostats.bepress.com/cgi/viewcontent.cgi?article=1147&context=harvardbiostat. Accessed November 19, 2014. [Google Scholar]
