Abstract
Third-variable effects, such as mediation and confounding, are core concepts in prevention science, providing the theoretical basis for investigating how risk factors affect behavior and how interventions change behavior. Another third variable, the collider, is not commonly considered but is also important for prevention science. This paper describes the importance of the collider effect as well as the similarities and differences between these three third-variable effects. The single mediator model in which the third variable (T) is a mediator of the independent variable (X) to dependent variable (Y) effect is used to demonstrate how to estimate each third-variable effect. We provide difference in coefficients and product of coefficients estimators of the effects and demonstrate how to calculate these values with real data. Suppression effects are defined for each type of third-variable effect. Future directions and implications of these results are discussed.
Keywords: Methods, Causal effects, Mediator, Confounder, Collider, Third-variable
A Unification of Mediator, Confounder, and Collider Effects
Over the last 30 years, there have been great strides in the definition and estimation of causal effects, as summarized by statistician Gary King (2015): “More has been learned about causal inference in the last few decades than the sum total of everything that had been learned about it in all prior recorded history.” Similar pronouncements have been made in epidemiology and public health (Glymour & Hamad, 2018; Hernán, 2018), but applications of modern causal inference remain rare in prevention research with some exceptions (MacKinnon et al., 2020; Musci & Stuart, 2020; Stuart et al., 2015). These advancements have established the role a third variable plays in causal inference, as a mediator, confounder, or collider (Pearl, 2009). In particular, colliders are an important new contribution of modern causal inference not often considered in prevention science. Understanding third-variable effects is critical for drawing accurate conclusions from prevention research, such as knowing when it is necessary to adjust1statistically for a variable and when adjustment for a variable creates spurious effects.
Twenty-one years ago in this journal, MacKinnon et al. (2000) described the equivalence of the mediation and confounder models and showed how suppression occurs for each model. That paper did not describe colliders nor how mediators, confounders, and colliders fit within a general third-variable model. Thus, the purpose of this paper is fourfold. First, we describe the three different effects a third variable can have in causal inference (confounder, mediator, and collider). Second, we generalize statistical methods from mediation analysis for estimating confounder and collider effects. Third, we describe methods to investigate statistical suppression and interaction effects for each model. Fourth, we illustrate each effect using real-world examples and discuss the implications of third-variable effects for prevention. The overall goal of the paper is to help readers understand third-variable concepts and therefore be better prepared for causal inference.
Third‑Variable Effects
The case where a third variable acts as a mediator (X → Mediator → Y) has received extensive application and development over the last 30 years in prevention and other fields because it addresses a fundamental question of how a mediator transmits the causal effect of an independent variable (X) to a dependent variable (Y). Mediation analysis has been applied in research exploring how poverty affects behavior, how tobacco prevention programs reduce tobacco use, and how knowledge leads to behavior change, to name a few (MacKinnon, 2008; VanderWeele, 2015). Mediation is particularly important in the development and evaluation of treatment and prevention programs because the investigation of mediating mechanisms can make programs more effective and require fewer resources by designing programs to change the critical mediating mechanism that drives the change in the outcome. For example, the mediation model has provided evidence that interventions that change the social norms mediator prevent drug use (Botvin et al., 1999; Cuijpers, 2002; MacKinnon et al., 1991).
The case where a third variable is a confounder (X ← Confounder → Y) has also been useful in many disciplines, particularly in epidemiology and prevention, because of the ubiquitous confounding variables present in observational studies in those fields (Greenland & Morgenstern, 2001; James, 1980). Confounding variables interfere with the ability to make causal claims if they are not included in statistical analysis. Because a confounding variable causes both X and Y, the X to Y effect must be adjusted for the confounder to obtain an unbiased estimate of the causal effect between X and Y (Elwert & Winship, 2014). For example, age is a confounder of many effects, such as the effect between income and cancer risk. Age is associated with higher income as well as cancer risk, resulting in an apparent positive effect between income and cancer risk if age is not considered. Adjustment for age as a confounding variable yields a more accurate assessment of the effect between income and cancer rates.
The final effect a third variable can have in causal inference, that of a collider (X → Collider ← Y), is not as widely known as mediators and confounders in prevention science. A collider variable is caused by both X and Y, as opposed to a confounder which causes X and Y. Collider variables are challenging to understand because unlike adjustment for a mediator or confounder, which clarifies causal effects, adjustment for a collider obscures causal effects by inducing bias in the estimate of the X–Y relation. Collider effects occur in two main situations (Asendorpf, 2012): (1) explicitly, when a collider variable caused by X and Y is measured in the study and adjusted for in statistical analysis, or (2) implicitly, when there is not a measure of the collider variable in the study, but the sample is selected based on a collider variable that is caused by X and Y. We discuss each situation below.
For the explicit collider situation, X, Y, and the collider are all measured variables; for example, when X and Y are measured at an earlier time and cause a construct measured later. The later variable is a collider of the effect of X and Y, so adjustment for the collider will bias the estimate of the X to Y effect. For a hypothetical example, adjustment for physical fitness at age 40 will bias the effect between stress at age 14 and body mass index (BMI) at age 15. Explicit adjustment for the collider leads to bias because it combines the causal effect of stress (X) to BMI (Y) with the causal effect of stress and BMI to physical fitness (collider) at age 40. Richardson et al., (2019) describe a similar explicit collider of the relation of neuroticism and body mass index (BMI), where a small positive relation between neuroticism and BMI becomes negative when adjusted for self-reported health. Richardson et al. (2019) argued that it does not make sense to adjust for self-reported health because neuroticism and BMI cause self-reported health. Because self-reported health is an outcome of both neuroticism and BMI, adjusting for self-reported health will bias the estimate of the neuroticism to BMI relation, inducing a collider effect. Although measured at the same time, self-reported health is a consequent of neuroticism and BMI in the collider model. For the most part, researchers are aware of the problems introduced by adjusting for an outcome of the two variables they study, especially when the outcome occurs later in time. Researchers are less aware of how sample selection can also introduce implicit collider bias.
Samples are often selected based upon some criteria (Cresswell & Clark, 2017), e.g., selecting patients at a hospital, selecting clients in a clinical population, selecting middle school students, selecting a high-risk group, etc. When the criteria used to select a sample is a collider variable, the restricted scores on the collider make the sample non-representative of the population and cause spurious effects between X and Y (Cole et al., 2010; Elwert & Winship, 2014). Restriction of samples by a collider is an example of implicit collider bias. For example, the population effect of impulsivity on delinquency is inaccurately assessed in a sample restricted to youth at high risk of drug use for a causal model in which impulsivity and delinquency both cause drug use. High-risk status is a potential collider of the effect of impulsivity on delinquency, so adjusting for high-risk status would lead to a spurious relation between impulsivity and delinquency.
Collider bias is also known as Berkson’s bias or Berkson’s fallacy because Berkson (1946) demonstrated the effect in research on risk factors for disease. Berkson demonstrated how two diseases, such as diabetes and gall bladder inflammation, that are unrelated in the population can become spuriously associated when a sample is restricted to individuals in a hospital sample. Having diabetes and gall bladder inflammation both caused people to go to the hospital, thereby restricting the range of the sample to hospital patients. Going to the hospital was a collider that implicitly restricted the sample. A hospital patient who does not have diabetes was much more likely to have another disease like gall bladder inflammation than someone from the general population. Thus, the selection of hospital patients created a spurious positive association between gall bladder inflammation and diabetes.
Other examples of collider effects related to prevention science include the obesity paradox and the low birth weight paradox. Both the obesity paradox and the low birth weight paradox involve more complicated models with more variables than what is present in the typical three-variable collider model, but the presence of a collider forms the basis for these seemingly paradoxical results. In the obesity paradox, obesity is negatively related to mortality in samples of patients with cardiovascular disease (Banack & Kaufman, 2014). By restricting the sample to people with cardiovascular disease, it appears that obesity has the beneficial effect of reducing mortality, when in the general population obesity increases mortality. The pattern emerges in part because of an unmeasured fourth variable, such as genetic factors, that cause cardiovascular disease and mortality. Selection by the collider, cardiovascular disease, distorts the relation between obesity and mortality because genetic factors are related to mortality and cardiovascular disease. The low birth weight paradox refers to the finding that low birth weight infants born to mothers who smoke cigarettes appear to have a lower mortality rate than other low birth weight babies (Hernández-Díaz et al., 2006; Whitcomb et al., 2009). By selecting a sample of only low birth weight babies, a biased effect of maternal smoking on infant mortality emerges because smoking, along with genetic factors, is likely a cause of low birth weight and mortality in newborns. Selection by low birth weight alters the relation between smoking and mortality because genetic factors also cause mortality and low birth weight.
Most collider examples are implicit, featuring restriction of range in which two variables cause an outcome variable, and the sample is restricted based on the values of the outcome variable. Despite the lack of published examples of colliders in prevention science, it is likely that these effects are present in many studies due to common sampling techniques in the field. It is important for prevention scientists to consider possible collider effects whenever samples are restricted in some way, such as samples of clinical populations restricted by their clinical status, samples selected because of their high-risk status, or samples of college students restricted by their academic achievement. If the variable that restricts the sample is a collider of the two variables studied, there will be collider bias.
In summary, mediators, confounders, and colliders are important for planning and interpreting prevention research. Colliders, though relatively new to prevention science, are an important third variable to consider. Verbal description of mediators, confounders, and colliders is an important step in understanding the different effects a third variable can have on causal inference. Diagrams of the mediation (i.e., a chain; X → Mediator → Y), confounding (i.e., a fork; X ← Confounder → Y), and collider (i.e., an inverted fork; X → Collider ← Y) models also clarify the underlying structure of these effects (Morgan & Winship, 2015, p. 81). Translation of the models to regression equations and estimation of effects with real data further clarifies the third variable effects, as done in the next section.
Estimation of the Third‑Variable Effects
In this section, we describe how the same three regression equations are used to estimate each third-variable effect. Equations (1), (2), and (3) below are the three equations necessary for estimation of the mediated effect that can be extended to estimate confounder and collider effects. Intercepts are represented by i1, i2, and i3, and residuals are represented by e1, e2, and e3. For the sake of explanation, we assume that there are no other observed or unobserved variables that affect the three variables. Point and interval estimation of each third-variable effect uses information from three regression equations. The subscripts for the coefficients represent the causal direction of the effects. An effect adjusted for another variable is represented in the subscript, where the independent and dependent variables are followed by a dot and then the variable that is adjusting the primary relationship. For example, bYX is the effect to Y from X, and bYX⋅T is the effect to Y from X adjusted for T. Equation (1) shows the equation where Y is regressed on X and Eq. (2) shows the equation where Y is regressed on X and the third variable (T). The third equation regresses T on X, estimating the value of the coefficient bTX , a coefficient needed for a product of coefficients method to estimate the third-variable effect. These three equations are the same for the estimation of each third-variable effect because each effect involves adjusting a bivariate effect for the third variable.
| (1) |
| (2) |
| (3) |
Estimates of the parameters in each regression equation are used to estimate third-variable effects in sample data (above coefficients such as represent sample estimates). In each model, the difference in coefficients estimator, is the third-variable effect using Eqs. (1) and (2). The difference in coefficients estimator estimates the X to Y effect before and after adjusting for the mediator, the X to Y effect before and after adjusting for the confounder, and the X to Y effect before and after adjusting for the collider. For mediation, including the mediator third variable in Eq. (2) provides a way to estimate mediated (indirect), , and direct effects, of X to Y. For confounding, including the confounder in the model adjusts the X–Y effect for the confounder’s effect, making the adjusted X–Y effect more accurate. The confounder effect, , provides an estimate of how failure to adjust for the confounder would affect the estimate (Moldonado & Greenland, 1993; Selvin, 2004). For the collider model, the correct model should not adjust for the collider, but it is possible to obtain an estimate of the collider effect by comparing the effect of X on Y adjusted and unadjusted for the collider, The collider effect provides a numerical estimate of how much the collider will bias the estimate if it is mistakenly included in the statistical analysis.
In addition to the difference in coefficients estimator, there is an equivalent product of coefficients estimator, , for each third variable effect in the linear model using Eqs. (2) and (3). The equivalence of the difference and product of coefficients estimators was shown for mediation for linear models in MacKinnon et al. (1995), where for notation in the mediation literature c–c′ corresponds to , and ab corresponds to . The supplemental material for this article demonstrates the equivalence of the difference in coefficients and product of coefficients for the confounder and collider effects. This equality between the product and difference estimators is not present for nonlinear models such as logistic regression (MacKinnon & Dwyer, 1993).
Standard errors are available for the product of coefficients and difference of coefficients estimators using methods developed for mediation analysis such as those based on the multivariate delta method (Clogg et al., 1995; MacKinnon & Dwyer, 1993). The equations for the standard error of the product and difference estimators are shown in the supplemental material for this paper. As established in the mediation research literature, the mediated effect estimator does not have a normal distribution, and so the most accurate significance tests and confidence intervals are obtained using methods that do not require normal distributions such as the distribution of the product and resampling methods such as the bootstrap method (MacKinnon et al., 2002, 2004). Results for bootstrap resampling of the third-variable effect are reported in this paper.
Suppression for Each Third‑Variable Effect
Suppression may be present for each third-variable effect. An effect between the independent and dependent variables may be reduced or may be enhanced when adjusted by a third variable, whether that third variable is a mediator, confounder, or collider. MacKinnon et al. (2000) describe suppression for models with mediation and confounding effects. In each of these models, suppression occurs when the effect between X and Y increases in magnitude or changes signs after adjusting for a third variable (Conger, 1974). For example, introducing a mediator usually reduces the strength of the relationship between X and Y, but there are some situations where the magnitude of the X to Y effect is actually larger when adjusting for a mediator. Similarly, adjustment for a confounder may increase or decrease the effect between two variables (Breslow, Day, & Heseltine, 1980). Although we are not aware of empirically demonstrated suppression effects for collider bias in the research literature (though see Richardson et al., 2019, for a possible example), adjustment for a collider may also increase or decrease the effect between the independent and dependent variable in a similar manner as mediation and confounding.
The most popular definition of suppression is when the magnitude of a bivariate relation gets larger when adjusted for a third variable (Conger, 1974). For this definition, suppression occurs when the magnitude of the adjusted effect is greater than the unadjusted effect, , and this can occur for each third variable effect. A more general definition of suppression for mediation compares the sign of the direct and indirect effect from the product of coefficient estimator, e.g., versus (MacKinnon et al., 2000). The same quantity indicates suppression for confounding and collision. The difference in sign between and method is more general than the increase in magnitude method because adjustment for a third variable may change the sign of the adjusted coefficient but may not increase the magnitude of the absolute value of the adjusted coefficient. Using the same logic, suppression occurs if the product of the three coefficients, , has a negative sign for each third-variable effect (Muniz & MacKinnon, 2021). Suppression is present if the product of the three coefficients is negative and is not present if the product is positive.
Interactions for Each Third‑Variable Effect
Mediation, confounding, and collider effects may also have an interaction effect such that an effect differs across the levels of the third variable. Several sources describe interactions in confounder and mediation models (Fairchild & MacKinnon, 2009; Selvin, 2004). Interactions for collider variables are possible (see Richardson et al., 2019, for an example).
Adding an interaction to Eq. (2) is shown below in Eq. (4). The interaction coefficient, hYXT∙X,T , represents whether the effect of T on Y differs at different levels of X and whether the effect of X on Y differs at different levels of T. The intercept is represented by i4 and the residual by e4. For these models, the third-variable effect may differ at separate values of the third variable. With interaction effects, there may be situations where there is not a third-variable effect at certain values but there is a third-variable effect at other values. Similarly, with interactions, it may be possible that there is a suppressor effect at certain values but not a suppressor effect at other values defined by the interaction.
| (4) |
Illustrative Examples of Mediation, Confounding, and Collision
To make mediator, confounder, and collider effects more concrete, we describe three-variable real-data examples of each effect. The examples illustrate the mediation, confounding, and collision models, assuming that the underlying causal models are true. We do not include additional covariates to simplify the presentation. The results are not definitive until a proper causal analysis is performed including specifying model assumptions, identification of estimators, sensitivity analyses, and consideration of additional mediators, confounders, and colliders. Table 1 shows estimates for each model. More details about the regression equations and the calculation of each effect are provided in the supplemental material.
Table 1.
Parameter estimates and confidence intervals for the three data examples
| Third Variable | Effect | Parameter | Estimate | Standard error | Est/SE | LCL | UCL |
|---|---|---|---|---|---|---|---|
| Mediation | Group on Recall | bYX | 2.517 | 1.084 | 2.32 | .329 | 4.705 |
| Group on Recall | bYX⋅T | .332 | 1.292 | .26 | -2.277 | 2.940 | |
| Adjusted for imagery | |||||||
| Imagery on Recall | bYT⋅X | .614 | .226 | 2.72 | .158 | 1.071 | |
| Adjusted for group | |||||||
| Group on imagery | bTX | 3.558 | .689 | 5.16 | 2.167 | 4.950 | |
| Mediated effect | bTXbYT⋅X = bYX – bYX⋅T | 2.185 | .922 | 2.40 | .631 | 3.847 | |
| Confounding | BMI on bench press | bYX | 12.746 | 1.697 | 7.51 | 9.414 | 16.078 |
| BMI on bench press | bYT⋅T | 10.916 | 1.649 | 6.62 | 7.678 | 14.153 | |
| Adjusted for age | |||||||
| Age on bench press | bYT⋅X | 53.233 | 6.706 | 7.94 | 40.069 | 66.397 | |
| Adjusted for BMI | |||||||
| BMI on age | bTX | .034 | .009 | 3.92 | .017 | .052 | |
| Confounder effect | bTXbYT⋅X = bYX – bYX⋅T | 1.830 | .524 | 3.49 | .854 | 2.933 | |
| Collision | Conscientiousness on Income | bYX | .022 | .006 | 3.70 | .010 | .033 |
| Conscientiousness on | bYT⋅T | .017 | .006 | 2.90 | .006 | .029 | |
| Income adjusted for health | |||||||
| Health on income | bYT⋅X | .041 | .007 | 6.00 | .028 | .055 | |
| Adjusted for conscientiousness | |||||||
| Conscientiousness on | bTX | .112 | .012 | 9.59 | .089 | .134 | |
| health | |||||||
| Collider effect | bTXbYT⋅X = bYX – bYX⋅T | .005 | .001 | 5.00 | .003 | .007 |
Bootstrap samples (1000 samples) were used to compute confidence limits for mediated, confounder, and collider effects. Other confidence limits were based on the normal distribution computed with estimates and standard errors. Table entries are rounded estimates
Est estimate, SE standard error, LCL lower confidence limit, UCL upper confidence limit
Imagery as a Mediator of Rehearsal Instructions on Memory for Word Recall
Our first example is for mediation, using experimental data from a study that hypothesized that using images to encode words leads to greater recall of the words than word repetition (MacKinnon et al., 2018). Forty-four participants were randomly assigned to one of two instructions for memorizing twenty words and were asked to recall as many words as they could. Participants also reported the degree of mental imagery they engaged in while memorizing the words. Instructions (X) were hypothesized to increase imagery (T), which would increase word recall (Y).
Any linear regression statistical software package can provide the estimates of Eqs. (1), (2), and (3) for the calculation of the mediated effect. For (1), perform a regression analysis from X (group) to Y (word recall). For (2), perform a regression analysis from X (group) and the mediator imagery (T) to Y (word recall), with X and T as two predic- tor variables of Y. Equation (3) follows the same procedure, except that the analysis is from X (group) to T (imagery). These three steps provide the coefficients needed to calculate the mediated effect of the manipulation to imagery to word recall.
The difference of coefficients estimate is calculated by taking the difference between the unadjusted effect between X and Y and the adjusted effect between X and Y for T. This difference provides an estimate of the mediated effect, . The product of coefficients estimate is calculated by taking the product of the two path coefficients, , (3.558) (0.614) = 2.185, which gives the same estimate as the difference in coefficient estimator. The mediated effect through imagery on word recall equals 2.185 words which corresponds to a d-effect size (mediated effect divided by the standard deviation of Y) of so the mediated effect is about half a standard deviation (Miočević et al., 2018). The bootstrap confidence interval (LCL = 0.631, UCL = 3.847) does not contain zero, leading to the conclusion that the mediated effect is larger than expected by chance. The model is not a suppression model because and have the same sign and, correspondingly, the product of the estimated coefficients is positive. There is no evidence for a statistically significant interaction between X and T on Y, , t (40) = 1.05. Randomization of X in this example clarifies the causal interpretation of the X to the mediator (T) and X to Y effects. However, randomization of X does not ensure causal interpretation of the mediator (T) to Y effect, because there is no randomization of the mediator (T). One way to enhance the causal interpretation of the T to Y effect would be to include additional confounding variables related to T and Y in the statistical analysis.
Age as a Confounder of the Effect of Bench Press Performance and Body Mass Index
The next example illustrates the calculation of a confounding effect using data from a study of a steroid prevention program known as ATLAS (Adolescents Teaching and Learning to Avoid Steroids) with high-school athletes aged 13–19 (Goldberg et al., 1996). The example uses each athlete’s age, body mass index (BMI), and bench press performance (measured as pounds lifted multiplied by the numbers of repetitions). There were 773 athletes with complete information for all three variables used for this analysis.
Age is a likely confounder of the effect between BMI and bench press performance because age affects both variables (e.g., we would expect that a 13-year-old would have a lower BMI than a 19-year-old, which will affect their respective bench press performance). The steps to calculate the confounding effect are the same as for the mediated effect using Eqs. (1), (2), and (3). Here, T (age) is the confounder of the BMI (X) to bench press performance (Y) effect. To calculate the confounding effect, the unadjusted effect is subtracted by the adjusted effect, , 12.746 − 10.916 = 1.83. The product of coefficients method with and provides the same result, (53.233) (0.034) = 1.83. The confounder effect equals 1.83 bench press by weight units which corresponds to a 0.0375 standard deviation change in bench press performance for a one standard deviation change in BMI, so the confounder effect is about 4% of a standard deviation. The bootstrap confidence interval (LCL = 0.854, UCL = 2.933) led to the conclusion that the confounder effect is larger than expected by chance. The model is not a suppression model because have the same sign and the product of is positive. There is not a statistically significant interaction of age and BMI on bench press performance, using Eq. (4).
In summary, the confounder effect was larger than expected by chance, but this result is tentative until all relevant variables are included in a thorough causal analysis. For example, there are likely additional important variables related to BMI, bench press performance, and age not considered in this illustrative example.
General Health as a Collider of the Effect of Conscientiousness and Family Income
We use the National Longitudinal Survey of Youth 1997 cohort data (Bureau of Labor Statistics, 2019) to demonstrate how to calculate a collider effect. We extract information from 5404 participants about their general health, levels of conscientiousness, and family income. For this example, we treat general health as a continuous collider variable and assume that conscientiousness causes family income. Theoretically, higher levels of conscientiousness and greater family income may independently lead to better overall health, making health a potential collider of the conscientiousness-income effect (Murray et al., 2014). In a real study, a researcher would not want to adjust for this collider variable, as it will bias the results. The purpose of this demonstration is to show the extent to which adjusting for the collider variable can bias the effect of interest, and to encourage researchers to think carefully about the causal structure of their models before including certain variables in analysis.
Conscientiousness scores and family income were measured in 2008, and ratings of general health were collected in 2017. Family income was transformed to normalize the distribution of the scores by taking the logarithm of each reported income plus 1000. Adding 1000 prior to taking logarithms allowed for participants that reported having no income to obtain a log-transformed score. The original scores for family income ranged from $0 to $450,000+.
The calculations proceed in the same manner as for mediated and confounding effects but using the collider third variable. The effect between conscientiousness (X) and family income (Y) is adjusted for the collider general health (T). The difference in coefficient estimate, , and product of coefficients estimate, , both equal 0.005. The collider effect is a 0.005 change in family income for a one-unit change in conscientiousness. Another interpretation for this value is that adjusting for the collider variable of general health biases the conscientiousness-income relationship by a value of 0.005. Putting this value in terms of a d-effect size gives a change in the effect of conscientiousness and general health of standard deviations for a one standard deviation change in general health, so the collider effect corresponds to about a one hundredth of a standard deviation. The bootstrap confidence interval (LCL = 0.003, UCL = 0.007) led to conclusion that the collider effect is larger than expected by chance. The model is not a suppression model because and have the same sign, and the product of the three coefficients is positive. There is no evidence for a statistically significant interaction of conscientiousness and general health on income, , t (5400) =−0.07.
As for the other illustrative examples, there are additional important variables related to income, health, and conscientiousness not considered in this example. We emphasize that a collider should not be adjusted in the statistical analysis for an accurate regression of Y on X unless a researcher is interested in testing how a collider variable could influence their results. Including the collider in the analysis illustrates the estimation of the collider effect, which represents the change in the effect if a collider is mistakenly adjusted in the statistical analysis.
Discussion
Third-variable effects are not distinguishable solely by statistical methods. Each third-variable effect can be fit to the same data, and if the relations between the variables are substantial, there will be evidence for each effect. In this sense, the confounder, mediator, and collider models are equivalent, providing an equal representation of the information contained in the data for three variables (Stelzl, 1986). Although mediation, confounding, and collision may equally explain the statistical associations among three variables, they describe different causal relations among those variables. Like much recent research on causal analysis, this paper highlights the centrality of the causal model underlying a research study and the important distinction between the causal model and the statistical model. The appropriate causal model is determined by prior empirical research and theory. The statistical analysis provides estimates for the proposed causal model.
Mediator and confounder models have received considerable attention in modern causal inference (Morgan & Winship, 2015; Pearl, 2009). The causal estimators for mediation have been defined in a counterfactual framework (Imai et al., 2010; Valeri & VanderWeele, 2013) and represent causal effects in a world in which all participants were in the treatment group or all participants were in a control group. Adjustment for a confounder gives the correct causal estimate because the confounder removes spurious effects from the relation of two variables (Greenland & Morgenstern, 2001; Pearl, 2009). That is, failing to adjust for the confounder introduces spurious effects. Although described in modern causal inference, there are fewer examples and applications of colliders than confounders or mediators. For the collider, the correct causal effect does not adjust for the collider because adjusting for the collider introduces bias that forms a spurious effect that masks the true causal effect (Elwert & Winship, 2014).
This paper described third-variable models relevant for prevention science. It is important to consider each third-variable model in planning studies and as a possible explanation of observed statistical analysis results. We described the corresponding equations and methods of estimation for mediation, confounding, and collision and illustrated these three-variable models in three datasets. We emphasized the need for further causal analysis in order to draw causal conclusions from these examples, including the consideration of additional mediators, confounders, and colliders that may be relevant for each example. Adding replications extended to new situations further clarify the veracity of the causal model. We acknowledge that prevention science models may contain many more than three variables, but mediators, confounders, and colliders form the building blocks of these models.
In summary, the linear regression models described in this paper describe a general model that is useful for investigating each third-variable effect, given the correct causal model. We wish to note that the correct causal model is an exacting criterion, requiring a program of research with precise definition of causal effects, specification of assumptions, and sensitivity analysis for how violating assumptions affects results. The statistical procedures for estimating each third-variable effect provide no information about which causal effect is being estimated, highlighting that the statistical model is different from the causal model. The correct causal model is an exacting qualification,requiring a program of research with precise definition of causal effects, specification of assumptions, and sensitivity analysis for how violatingassumptions affects results. Statistical analysis is useful for demonstrating associations between variables that are consistent or inconsistent with a causal model. The primary challenge of research is to identify the status of different variables before the study in order to test a model. Colliders in addition to mediators and confounders are core conceptual variables for specifying and testing these models in prevention science.
Supplementary Material
Acknowledgements
We thank Adam Cohen, Matthew Fritz, Linda Luecken, June Tangney, Jenn Tein, Matthew Valente, members of the Research in Prevention Laboratory, and reviewers for helpful comments.
Funding A grant from the National Institute on Drug Abuse (R37DA09757) supported this research in part. Some of this research was presented at the 2019 conference of the American Psychological Association.
Footnotes
Supplementary Information The online version contains supplementary material available at https://doi.org/10.1007/s11121–021-01268-x.
Declarations
Ethics Approval All analyses in the article were secondary data analyses. All procedures, including the informed consent process, were conducted in accordance with the ethical standards of the responsible committee on human experimentation (institutional and national) and with the Helsinki Declaration of 1975, as revised in 2000 and the Arizona State University Human Subjects Internal Review Board.
Informed Consent Data were collected via Informed Consent in accordance with the ethical standards of the responsible committee on human experimentation (institutional and national) and with the Helsinki Declaration of 1975, as revised in 2000.
Conflict of Interest The authors declare that they have no conflict of interest.
This paper describes adjustment as including an additional predictor in a regression model. Adjustment comes in other forms and names including conditioning on a variable, controlling for a variable, stratifying by a variable, and selection into a study by a variable (see Elwert & Winship, 2014 and Morgan & Winship, 2015 for more on these topics).
References
- Asendorpf JB (2012). Bias due to controlling a collider: A potentially important issue for personality research. European Journal of Personality, 26, 391–392. [Google Scholar]
- Banack HR, & Kaufman JS (2014). The obesity paradox: Understanding the effect of obesity on mortality among individuals with cardiovascular disease. Preventive Medicine, 62, 96–102. 10.1016/j.ypmed.2014.02.003 [DOI] [PubMed] [Google Scholar]
- Berkson J (1946). Limitations of the application of fourfold table analysis to hospital data. Biometrics Bulletin, 2, 47–53. 10.2307/3002000 [DOI] [PubMed] [Google Scholar]
- Botvin GJ, Griffin KW, Diaz T, Miller N, & Ifill-Williams M (1999). Smoking initiation and escalation in early adolescent girls: One-year follow-up of a school-based prevention intervention for minority youth. Journal of the American Medical Women’s Association., 54, 139–143. [PubMed] [Google Scholar]
- Breslow NE, Day NE, & Heseltine E (1980). Statistical methods in cancer research (Vol. 1, pp. 248–279). Lyon: International Agency for Research on Cancer. [Google Scholar]
- Bureau of Labor Statistics (2019). National Longitudinal Survey of Youth 1979 cohort, 1979–2014. Retrieved from https://www.nlsinfo.org/content/cohorts/nlsy79
- Clogg CC, Petkova E, & Haritou A (1995). Statistical methods for comparing regression coefficients between models. American Journal of Sociology, 100, 1261–1293. 10.1086/230638 [DOI] [Google Scholar]
- Cole SR, Platt RW, Schisterman EF, Chu H, Westreich D, Richardson D, & Poole C (2010). Illustrating bias due to conditioning on a collider. International Journal of Epidemiology, 39, 417–420. 10.1093/ije/dyp334 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Conger AJ (1974). A revised definition for suppressor variables: A guide to their identification and interpretation. Educational and Psychological Measurement, 34, 35–46. [Google Scholar]
- Cresswell JW, & Clark VLP (2017). Designing and conducting mixed methods research. Sage. [Google Scholar]
- Cuijpers P (2002). Effective ingredients of school-based drug prevention programs: A systematic review. Addictive Behaviors, 27, 1009–1023. 10.1016/s0306-4603(02)00295-2 [DOI] [PubMed] [Google Scholar]
- Elwert F, & Winship C (2014). Endogenous selection bias: The problem of conditioning on a collider variable. Annual Review of Sociology, 40, 31–53. 10.1146/annurev-soc-071913-043455 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fairchild AJ, & MacKinnon DP (2009). A general model for testing mediation and moderation effects. Prevention Science, 10, 87–99. 10.1007/s11121-008-0109-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Glymour MM, & Hamad R (2018). Causal thinking as a critical tool for eliminating social inequalities in health. American Journal of Public Health, 108, 623. 10.2105/AJPH.2018.304383 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goldberg L, Elliot D, Clarke GN, MacKinnon DP, Moe E, Zoref L, & Lapin A (1996). Effects of a multidimensional anabolic steroid prevention intervention: The Adolescents Training and Learning to Avoid Steroids (ATLAS) program. JAMA, 276, 1555–1562. 10.1001/jama.1996.03540190027025 [DOI] [PubMed] [Google Scholar]
- Greenland S, & Morgenstern H (2001). Confounding in health research. Annual Review of Public Health, 22, 189–212. 10.1146/annurev.publhealth.22.1.189 [DOI] [PubMed] [Google Scholar]
- Hernán M (2018). The C-word: Scientific euphemisms do not improve causal inference from observational data. American Journal of Public Health, 108, 616–619. 10.2105/AJPH.2018.304337 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hernández-Díaz S, Schisterman EF, & Hernán MA (2006). The birth weight “paradox” uncovered? American Journal of Epidemiology, 164, 1115–1120. 10.1093/aje/kwj275 [DOI] [PubMed] [Google Scholar]
- Imai K, Keele L, & Tingley D (2010). A general approach to causal mediation analysis. Psychological Methods, 15, 309–334. 10.1037/a0020761 [DOI] [PubMed] [Google Scholar]
- James LR (1980). The unmeasured variables problem in path analysis. Journal of Applied Psychology, 65, 415–421. 10.1037/0021-9010.65.4.415 [DOI] [Google Scholar]
- King G (2015). Cover endorsement in Morgan, S. L., & Winship C. (2015). Counterfactuals and Causal Inference. University Press. [Google Scholar]
- MacKinnon DP (2008). Introduction to statistical mediation analysis. Lawrence Erlbaum. [Google Scholar]
- MacKinnon DP, & Dwyer JH (1993). Estimating mediated effects in prevention studies. Evaluation Review, 17, 144–158. 10.1177/0193841X9301700202 [DOI] [Google Scholar]
- MacKinnon DP, Johnson CA, Pentz MA, Dwyer JH, Hansen WB, Flay BR, & Wang EYI (1991). Mediating mechanisms in a school-based drug prevention program: First-year effects of the Midwestern Prevention Project. Health Psychology, 10, 164–172. [DOI] [PubMed] [Google Scholar]
- MacKinnon DP, Krull JL, & Lockwood CM (2000). Equivalence of the mediation, confounding, and suppression effect. Prevention Science, 1, 173–181. 10.1023/A:1026595011371 [DOI] [PMC free article] [PubMed] [Google Scholar]
- MacKinnon DP, Lockwood CM, Hoffman JM, West SG, & Sheets V (2002). A comparison of methods to test mediation and other intervening variable effects. Psychological Methods, 7, 83–104. 10.1037/1082-989X.7.1.83 [DOI] [PMC free article] [PubMed] [Google Scholar]
- MacKinnon DP, Lockwood CM, & Williams J (2004). Confidence limits for the indirect effect: Distribution of the product and resampling methods. Multivariate Behavioral Research, 39, 99–128. 10.1207/s15327906mbr3901_4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- MacKinnon DP, Valente MJ, & Gonzalez OJ (2020). The correspondence between causal and traditional mediation analysis: The link is the mediator by treatment interaction. Prevention Science, 21, 147–157. 10.1007/s11121-019-01076-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- MacKinnon DP, Valente MJ, & Wurpts IC (2018). Benchmark validation of statistical mediation analysis: Application to imagery and memory theory. Psychological Methods, 23, 654–671. 10.1037/met0000174 [DOI] [PMC free article] [PubMed] [Google Scholar]
- MacKinnon DP, Warsi G, & Dwyer JH (1995). A simulation study of mediated effect measures. Multivariate Behavioral Research, 30, 41–62. 10.1207/s15327906mbr3001_3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miočević M, O’Rourke HP, MacKinnon DP, & Brown HC (2018). Statistical properties of four effect size measures for mediation models. Behavior Research Methods, 50, 285–301. 10.3758/s13428-017-0870-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moldonado G, & Greenland S (1993) Simulation study of confounder-selection strategies. American Journal of Epidemiology, 138, 923–936. 10.1093/oxfordjournals.aje.a116813 [DOI] [PubMed] [Google Scholar]
- Morgan SL, & Winship C (2015). Counterfactuals and causal inference (2nd Edition). New York: NY: Cambridge, MA. [Google Scholar]
- Muniz FB, & MacKinnon DP (2021). Four tests for statistical suppression. Manuscript submitted for publication.
- Murray AL, Johnson W, McGue M, & Iacono WG (2014). How are conscientiousness and cognitive ability related to one another? A re-examination of the intelligence compensation hypothesis. Personality and Individual Differences, 70, 17–22. 10.1016/j.paid.2014.06.014 [DOI] [Google Scholar]
- Musci RJ, & Stuart E (2020). Ensuring causal, not casual inference. Prevention Science, 3, 452–456. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pearl J (2009). Causal inference in statistics: An overview. Statistics Surveys, 3, 96–146. 10.1214/09-SS057 [DOI] [Google Scholar]
- Richardson TG, Davey Smith G, & Munafo MR (2019). Conditioning on a collider may induce spurious associations: Do the results of Gale et al., (2017) support a health-protective effect of neuroticism in population subgroups? Psychological Science, 1–4. 10.1177/0956797618774532 [DOI] [PMC free article] [PubMed]
- Selvin S (2004). Statistical analysis of epidemiological data. Oxford. [Google Scholar]
- Stelzl I (1986). Changing a causal hypothesis without changing the fit. Some rules for generating equivalent path models. Multivariate Behavioral Research, 21, 309–331. 10.1207/s15327906mbr2103_3 [DOI] [PubMed] [Google Scholar]
- Stuart EA, Bradshaw CP, & Leaf PJ (2015). Assessing the generalizability of randomized trial results to target populations. Prevention Science, 16, 475–485. 10.1007/s11121-014-0513-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- Valeri L, & VanderWeele TJ (2013). Mediation analysis allowing for exposure–mediator interactions and causal interpretation: Theoretical assumptions and implementation with SAS and SPSS macros. Psychological Methods, 18, 137–150. 10.1037/a0031034 [DOI] [PMC free article] [PubMed] [Google Scholar]
- VanderWeele T (2015). Explanation in causal inference: Methods for mediation and interaction. Oxford: University Press. [Google Scholar]
- Whitcomb BW, Schisterman EF, Perkins NJ, & Platt RW (2009). Quantification of collider-stratification bias and the birth- weight paradox. Pediatric and Perinatal Epidemiology, 23, 394–402. 10.1111/j.1365-3016.2009.01053.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
