Skip to main content
American Journal of Public Health logoLink to American Journal of Public Health
. 2013 Mar;103(3):394–396. doi: 10.2105/AJPH.2012.301047

Important Considerations in Conducting Statistical Mediation Analyses

Osvaldo F Morera 1,, Felipe González Castro 1
PMCID: PMC3673508  PMID: 23327275

Abstract

An investigator wishes to examine mediation in a randomized control trial of the effectiveness of an intervention, which consists of a computerized decision aid for promoting colorectal cancer screening. Mediation is a naturally occurring process, and in any given instance, research investigators seek to ascertain whether it has occurred. In the case of a prevention intervention for a specific chain of events, mediation occurs (1) when the prevention intervention effects a change on a targeted intermediate condition: a mediator, for example, a person’s intentions to get a colorectal screening examination; and (2) when, at a later point in time, this condition effects a change on a targeted outcome, for example, the actual behavior of getting a colorectal screening exam. Full mediation is said to occur when the effectiveness of the intervention on the targeted outcome only takes place through the intermediate condition and does not directly affect the targeted outcome. Partial mediation is said to occur when the intervention causes changes in both the intermediate condition and the targeted outcome. The assessment of mediation is important because conclusions about the efficacy of a public health intervention may depend on how these indirect influences are assessed.


graphic file with name AJPH.2012.301047f1.jpg

graphic file with name AJPH.2012.301047f2.jpg

The proposed study regarding the effectiveness of this aforementioned decision aid would include 200 individuals who will be randomized to a decision aid (intervention) condition or to a usual care (control) condition. Thus, the group variable that contains these 2 conditions serves as the independent variable. The researcher hypothesizes that the independent variable will cause changes in a mediator variable (e.g., increased intentions to get screened for colorectal cancer). The mediator variable is then hypothesized to cause changes in the outcome (the dependent variable), which is the actual screening behavior that would occur at a later time point.

STATISTICAL MEDIATION

A popular way to test the described relationship is derived from the work of Baron and Kenny,1 and is known as the causal steps approach to statistical mediation. The causal steps approach involves estimating the following regression equations:

graphic file with name AJPH.2012.301047eq1.jpg
graphic file with name AJPH.2012.301047eq2.jpg
graphic file with name AJPH.2012.301047eq3.jpg

An overlooked assumption2 underlying mediation analysis is that it is a causal process in which the independent variable transmits an effect onto the mediator variable, which in turn transmits an effect onto the outcome variable. This modeling approach is preferred to an analysis that simply controls for the effect of each variable on all others, as a hypothesized causal mechanism is being tested. Moreover, such approaches result in increased type I errors.3

Under the causal steps approach, mediation is said to occur under the following conditions: (1) there must be a statistically significant slope, c, in equation 1; (2) there must be a statistically significant slope, a, in equation 2; (3) there must be a statistically significant slope, b, and the size of c′ in equation 3 must be smaller in magnitude than is c in equation 1. If the c′ slope in equation 3 is no longer statistically significant, then full mediation is said to have occurred. If the c′ slope in equation 3 is smaller in magnitude than the original c but is still statistically significant, then partial mediation is said to have occurred. From the equations above, it is also known that

graphic file with name AJPH.2012.301047eq4.jpg

There are several problems associated with the causal steps approach for assessing mediation. First, this approach requires that the total effect of the decision aid intervention on screening, c, be statistically significant. If equation 1 does not result in a statistically significant slope, the assessment of mediation terminates. Thus, a researcher who hypothesizes that the decision aid intervention will only have an indirect effect on screening cannot test this hypothesis with the causal steps approach.

Second, this approach1 never directly tests the indirect effect formed by the cross product of ab and infers it from a series of tests. Given the increasing attention to report effect sizes in mediation analyses,4 strategies are needed that directly test and quantify this ab effect. Moreover, the statistical significance of ab does not depend on the statistical significance of c.5

Third, the causal steps procedure involves a series of multiple tests in which each test has its own type I and type II error rate and, at present, is the least powerful approach in assessing mediation.6

Fourth, the presence (or absence) of the requisite slopes in equations 2 and 3 does not mean that the ab cross product is (or is not) statistically significant.

Last, the distinction between full and partial mediation using the causal steps approach is problematic. For example, as sample size increases, the SE of the c′ will become smaller, which increases the likelihood that the c′ effect will be statistically significant. Conversely, the likelihood of demonstrating full mediation decreases as sample size increases. Moreover, the presence of full mediation does not rule out the possibility that other mediator variables could have fully mediated the effect or that the inclusion of other mediator variables in the presence of full mediation could be detected.7

To assess mediation, approaches must be used that test the ab cross product directly. In approaches in which the outcome variable is continuous, the Sobel8 test has been used to test the ab cross product, but this test assumes that the sampling distribution of the ab cross product is normally distributed. This assumption is not reasonable in smaller samples. Alternative approaches include the distribution of products approach,9 Monte Carlo simulations,10 and bootstrapping approaches. The distribution of products approach involves taking the estimates of a and b in equations 2 and 3, converting them into z scores, and then calculating their product.

The resulting cross product is compared with a statistical table of cross products,11 which accounts for nonnormality of standardized cross products. The values from this table would replace the customary ±1.96 from the standard normal distribution, and 95% confidence limits around the point estimate of the cross product can be created.12 These confidence intervals (CIs) are readily available in the statistical program, RMediation.9 If the CI does not contain 0, we conclude mediation is present.

Monte Carlo simulations require the assumption that estimates of a and b in equations 2 and 3 are normally distributed. Given the estimates of the a and b parameters and their SEs, a researcher would randomly select values from a normal distribution of a and b and create the ab cross product. This procedure would be repeated 9999 times to create a distribution of 10 000 ab cross products. The middle 95% of this distribution would constitute a CI. The absence of 0 from this interval would indicate that the ab cross product is statistically significant. This procedure works well when raw data are not available and software is readily available to perform such simulations.13

Bootstrapping is a distribution-free resampling method that allows the estimation of a sampling distribution of a statistic. In our example of the randomized control trial involving a total sample of 200 individuals, a researcher samples (with replacement) from the original sample to create a sample of 200 individuals. It should be noted that some individuals from the original sample could appear more than once in this second sample, and other individuals from the original sample may not appear at all. The investigator would then estimate equations 2 and 3 in this new sample to derive estimates of a and b and form an ab cross product.

Thus in bootstrapping, this procedure of sampling with replacement from the original sample to form samples of 200 individuals would be repeated 9999 more times to create a sampling distribution of 10 000 ab cross products. From this distribution of 10 000 cross products, the researcher can create a 95% CI by taking the cross product values that fall at the 97.5th and 2.5th percentiles. No distributional assumptions have been imposed to construct this CI. If 0 does not fall in this interval, then we can declare that the ab cross product is statistically significant.

Bias associated with the bootstrapping procedure can also be addressed by adjusting the upper and lower bounds of this interval. Hayes14 has developed SAS and SPSS macros that perform bootstrapping for his forthcoming text. Unlike the distribution of products and Monte Carlo simulations, bootstrapping can be used for outcome variables that are either continuous or binary.14

CONCLUSIONS AND RECOMMENDATIONS

Given the availability of easy-to-use software, there is no reason to continue using the causal steps approach to assess mediation. Adopting more powerful approaches like those derived from bootstrapping, Monte Carlo simulations, or the distribution of products approach provides a more robust assessment of indirect intervention effects. Finally, researchers are encouraged to abandon the nomenclature of full and partial mediation. Instead, researchers should report effect size measures of the indirect effect, such as the proportion of the maximum indirect effect.4

Acknowledgments

The efforts of both authors were supported by the National Institutes of Health (grant 1 P20 MD002287-05 to Elias Provencio-Vasquez) and the Grant Proposal Preparation Program in the College of Liberal Arts at the University of Texas at El Paso (to O. F. M.).

References

  • 1. Baron RM, Kenny DA. The moderator–mediator variable distinction in social psychological research: conceptual, strategic, and statistical considerations. J Pers Soc Psychol. 1986;51(6):1173–1182. [DOI] [PubMed]
  • 2. Kenny DA. Reflections on mediation. Organ Res Methods. 2007;11(2):353–358.
  • 3. MacKinnon DP, Lockwood CM, Hoffman JM, West SG, Sheets V. A comparison of methods to test mediation and other intervening variable effects. Psychol Methods. 2002;7(1):83–104. [DOI] [PMC free article] [PubMed]
  • 4. Preacher KJ, Kelley K. Effect size measures for mediation models: quantitative strategies for communicating indirect effects. Psychol Methods. 2011;16(2):93–115. [DOI] [PubMed]
  • 5. Preacher KJ, Hayes AF. Asymptotic and resampling strategies for assessing and comparing indirect effects in multiple mediator models. Behav Res Methods. 2008;40(3):879–891. [DOI] [PubMed]
  • 6. Fritz MS, MacKinnon DP. Required sample size to detect the mediated effect. Psychol Sci. 2007;18(3):233–239. [DOI] [PMC free article] [PubMed]
  • 7. Rucker DD, Preacher KJ, Tormala ZL, Petty RE. Mediation analysis in social psychology: current practices and new recommendations. Soc Pers Psych Compass. 2011;5(6):359–371.
  • 8. Sobel ME. Asymptotic confidence intervals for indirect effects in structural equation models. In: Leinhardt S, ed. Sociological Methodology. Washington, DC: American Sociological Association; 1982:290–312.
  • 9. Tofighi D, MacKinnon DP. RMediation: an R package for mediation analysis confidence intervals. Behav Res Methods. 2011;43(3):692–700. [DOI] [PMC free article] [PubMed]
  • 10. Preacher KJ, Selig JP. Advantages of Monte Carlo confidence intervals for indirect effects. Comm Meth Meas. 2012;6:77–98.
  • 11. Meeker WQ Jr, Cornwell LW, Aroian LA. Selected Tables in Mathematical Statistics, Vol. 7: The Product of Two Normally Distributed Random Variables. Providence, RI: American Mathematical Society; 1981.
  • 12. Mackinnon DP, Lockwood CM, Williams J. Confidence limits for the indirect effect: distribution of the product and resampling methods. Multivariate Behav Res. 2004;39(1):99–128. [DOI] [PMC free article] [PubMed]
  • 13. Selig JP, Preacher KJ. Monte Carlo method for assessing mediation: an interactive tool for creating confidence intervals for indirect effects. [computer software]; 2008. Available at: http://quantpsy.org. Accessed August 12, 2012.
  • 14. Hayes AF. An Introduction to Mediation, Moderation, and Conditional Process Analysis: A Regression-Based Approach. New York, NY: Guilford; 2013.

Articles from American Journal of Public Health are provided here courtesy of American Public Health Association

RESOURCES