To the Editor:
Meta-analysts often must convert effect sizes reported on different scales to a common scale for analysis.1 In particular, it is common to convert Pearson’s correlation, r, computed between an exposure X and an outcome Y to Cohen’s d (also called the “standardized mean difference”), which is the difference in expected Y for a fixed contrast in X, standardized by the standard deviation of Y conditional on X . Letting N denote the total sample size, the standard conversion1 from r to d is:
(1.1) |
An important, yet infrequently discussed, point is that this conversion was derived for a Pearson correlation computed between a binary exposure X and a continuous outcome Y , also called a “point-biserial” correlation.2-4 Note that when X represents a dichotomization of a truly continuous underlying exposure, a special approach3 is required to estimate the correlation between the underlying, continuous exposure and Y ; one cannot simply apply the standard Pearson’s correlation formula to the observed, dichotomized X . Stated otherwise, the point-biserial correlation does not consistently estimate the Pearson correlation that would have been obtained using the underlying continuous variable.
Despite the standard conversion’s origins in the binary-exposure setting, meta-analysts in practice often unknowingly apply Equation (1.1) to obtain Cohen’s d from correlations and regression results computed using a continuous X . In fact, a widely referenced textbook on meta-analysis describes Equation (1.1) without stipulating that it is only known to apply for the point-biserial case.1 Even if Equation (1.1) can be used for correlations computed using a continuous X , its interpretation is unclear: that is, the interpretation of Cohen’s d depends on the choice of “groups” in X whose means are compared, but because Equation (1.1) applies for a correlation in which X is already binary, it is not clear which “groups” of X are created when the conversion is instead applied to a correlation using a continuous X .
To allow direct computation of Cohen’s d from Pearson’s r or simple linear regression, we provide a similar conversion and approximate standard error that apply when X is continuous. The resulting effect size represents the average increase in the standardized Y associated with an increase in X of Δ units. To preserve the sign of the effect size, Δ should be set to be positive regardless of the sign of r . Letting sx denote the sample standard deviation of X , the conversion is:
(1.2) |
Derivations of these estimates of d and its standard error are provided in the eAppendix, http://links.lww.com/EDE/B601. The standard error estimate assumes that X is approximately normal and that N is large. If the standard deviation of X is known rather than estimated, then the term should be omitted. As a potential practical limitation, some papers to be meta-analyzed might not report sX , in which case the meta-analyst might need to substitute an estimate from, for example, a comparable second study or a subsample of the study used to estimate r . In this case, the N in the term should be replaced with the size of the second sample used to estimate sX (see Supplement, http://links.lww.com/EDE/B601). The conversion is easy to calculate manually or using the function r_to_d in the R package MetaUtility.
Comparing Equations (1.1) and (1.2) clarifies the meaning of the “Cohen’s d ” that results from unknowingly applying Equation (1.1) to a correlation computed with a continuous X. Specifically, the result coincides with the effect size associated with an increase in X of two standard deviations. (However, even with Δ = 2sx, the standard error estimates in Equations (1.1) and (1.2) will, in general, still not coincide.) In many applications, this may represent a rather extreme contrast: for example, if X is normal, then a two-standard-deviation contrast with the reference level set to the mean would involve comparison to the 97.7th quantile of X . Alternatively, a two-standard deviation contrast from one standard deviation below the mean to one standard deviation above is a comparison of the 15.8th quantile to the 81.1th quantile of X . Additionally, the absolute size of a two-standard-deviation contrast in X may differ substantially across study populations and may therefore be challenging to interpret in practice.5 Thus, it is perhaps preferable, when possible, to instead fix a specific, scientifically meaningful contrast of interest, Δ, which is held constant across all meta-analyzed studies, and then to apply the proposed conversion in Equation (1.2). The meta-analytic pooled estimate would then correspond to a well-defined contrast in X of Δ units, rather than to a contrast whose size may vary arbitrarily across studies.
The standard conversion is alternatively sometimes described in terms of the contrast that arises from dichotomizing X at a given threshold,1 yet in fact, the conversion often substantially overestimates the contrast produced by dichotomization, even at extreme thresholds of X . For example, we simulated bivariate normal data (1×105 observations) where X ~ N(0,1) and Y ~ N(X ,1) , such that r = 0.70 . The standard conversion estimates d = 2.0 (Figure, dashed red line). We also calculated the true two-group Cohen’s d arising from dichotomizing X at various thresholds in [−2,2] (Figure, solid black curve). The Figure shows that the “Cohen’s d “ from the standard conversion is 47% larger than the true two-group d arising from dichotomization at the mean and still overestimates the true two-group d for extreme dichotomization thresholds near −2 or 2 . For example, for dichotomization at X = 2 (i.e., the 97.7th percentile), the standard conversion still overestimates the true two-group Cohen’s d by 14%.
In summary, when approximating Cohen’s d from Pearson’s r or simple linear regression with a continuous X , we caution against using conversions derived for a binary X . We provide a straightforward conversion designed to accommodate the case of a continuous X through specification of a fixed contrast in X ; we believe its use in meta-analysis would enable more precisely interpretable and scientifically meaningful effect sizes.
Supplementary Material
ACKNOWLEDGMENTS
An anonymous contributor on a statistics forum made helpful contributions to the proof of Lemma 2.1 (Supplement).
M.B.M. and T.V.W. were supported by NIH grant R01 CA222147. The funders had no role in the design, conduct, or reporting of this research.
Footnotes
The authors report no conflicts of interest.
Contributor Information
Maya B. Mathur, Department of Epidemiology, Harvard T. H. Chan School of Public Health, Boston, MA; Quantitative Sciences Unit, Stanford University, Palo Alto, CA
Tyler J. VanderWeele, Quantitative Sciences Unit, Stanford University, Palo Alto, CA
REFERENCES
- 1.Borenstein M, Hedges LV, Higgins J, Rothstein HR. Introduction to Meta-Analysis. Wiley Online Library; 2009. [Google Scholar]
- 2.McGrath RE, Meyer GJ. When effect sizes disagree: the case of r and d . Psychol Methods. 2006:11:386. [DOI] [PubMed] [Google Scholar]
- 3.Jacobs P, Viechtbauer W. Estimation of the biserial correlation and its sampling variance for use in meta-analysis. Res Synth Methods. 2017;8:161–180. [DOI] [PubMed] [Google Scholar]
- 4.Hunter JE, Schmidt FL. Methods of Meta-Analysis: Correcting Error and Bias in Research Findings. Sage; 2004. [Google Scholar]
- 5.Greenland S, Schlesselman JJ, Criqui MH. The fallacy of employing standardized regression coefficients and correlations as measures of effect. Am J Epidemiol. 1986;123:203–208. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.