Skip to main content
UKPMC Funders Author Manuscripts logoLink to UKPMC Funders Author Manuscripts
. Author manuscript; available in PMC: 2025 Nov 25.
Published in final edited form as: Trends Neurosci. 2022 Oct 21;46(2):94–96. doi: 10.1016/j.tins.2022.09.009

A misadventure of the correlation coefficient

Dmitri A Rusakov 1,*
PMCID: PMC7618396  EMSID: EMS210872  PMID: 36280457

Abstract

The correlation coefficient gauges linear association between two variables. However, interpreting its value depends on the question at hand. This article argues that relying on the correlation coefficient may be irrelevant for many neuroscience research tasks. When the experimental dataset is contextually suitable for binning-averaging, other indicators of statistical association could prove more suitable.

Interpreting Pearson’s R

The commonly used Pearson’s correlation coefficient, Pearson’s R, evaluates linear association between two experimental variables, say X and Y. It has been widely accepted, at least across biological sciences, including neurobiology, that R > 0.7 indicates a strong linear relationship between X and Y, R < 0.3 weak, and the rest is moderate. This rule of thumb normally reflects the aim to predict the value of Y from measuring X, based on the XY relationship established for the population. The prediction reliability aspect of Pearson’s R follows from the fact that the R2 value, termed the coefficient of determination, stands for the proportion of the variation in Y that could be predicted from X. In essence, the correlation coefficient indicates how closely the data fit a linear pattern [1,2], which may explain practical usage of the foregoing traditional rule of thumb, particularly in clinical studies where prediction reliability is of importance.

However, in many documented cases these cut-off points appear arbitrary and inconsistent [2,3]. The correlation coefficient is of little importance unless it can be properly interpreted, which has been considered a difficult task for all scale values [1]. In fact, it has long been emphasised that the interpretation and usability of R depend on the specific question under study [2,4]. The use of R may be of particular relevance if the question falls into one of the following three categories: (i) How accurate is the prediction? (ii) What is the magnitude of error in a reliability task? (iii) What is the strength of the XY relationship? [4]. The present article will argue that, in many instances, neuroscience research tasks that involve the linear XY relationship fall outside this scope, so that referring to R appears irrelevant to the posed question.

Cumulative effects arising due to a populational trend

In many experimental quests, predicting the Y value from measuring X for individual objects is not among the relevant, or even feasible, objectives. Such cases arise, for instance, when one or both variables show high inherent (biological) variability and/or when there has been a large measurement error. Multiple examples for this can be found in neurophysiology. In cellular neurophysiology, for example, a common objective is to understand whether cell behaviour is affected by some cumulative populational feature arising across thousands of synaptic connections. The XY datasets describing individual synapses normally display significant scatters. Such datasets may include, for instance, dendritic spine density against dendrite diameter [5]; local numbers of synaptic AMPA receptors against distance from the soma [6]; quantal content against a short-term plasticity parameter for glutamate release [7]; and so on. Here, the XY relationship among synapses varies too strongly to reliably predict Y from X for any individual synapse, which is duly reflected in low R values. It can be argued, however, that the focus of these studies is beyond the scope of questions relevant to R: here, given the unlimited numbers of realisations or events (synaptic inputs times synaptic discharges), relatively weak (but highly significant) positive correlations can have a clear cumulative effect, which is not duly captured via the analyses of R. For instance, a weak correlation between synaptic release probability and the distance from the synapse to the neuronal soma can result in a highly significant shift in synaptic signal summation leading to cell spiking [8].

In this context, the question is therefore not about the strength of the XY relationship per se but whether the XY relationship is statistically significant (i.e., whether it transpires over virtually unlimited sampling). Here, the magnitude of the physiological effect will depend not so much on the adherence of the XY scatter to the straight line, but rather on how steep the XY dependence is. A conventional estimator of this steepness is linear regression, where the regression coefficient (algebraic slope of the best-fit straight line) describes the expected magnitude of change in Y for a one-unit change in X. In such cases, referring to the R value as an indicator of the effect strength could be highly ambiguous, as illustrated later.

Data binning-averaging increases R for the same dataset

The XY data scatter shown in Figure 1A is produced by generating a linear dependence between uniformly distributed X values over the 0–1 interval and Y values that follow the Gaussian distribution centred around the slope line (coefficient 0.5). This data scatter is illustrative of the experimental datasets mentioned earlier [57]: it reflects high data variability, low R value, yet highly significant regression (Figure 1A; β, linear regression coefficient). However, if we bin this dataset, by averaging Y values over the regular 0.1-long X intervals, we arrive at the reduced data scatter that shows an excellent linear relationship, with the strongly increased R but virtually unchanged β (Figure 1B). As an experimental example from work by the author and colleagues, a similar increase in R by data averaging can be shown in a dataset reporting release probability values versus distance from the synapse to the soma [8]. In the latter case, averaging (binning) synaptic features over a dendritic segment length reflects a realistic scenario when the bulk of synapses in that segment are activated [9]. Thus, essentially the same data sample could generate different R values, as long as the biological question under study permits data binning-averaging.

Figure 1. Binning raw data in the XY dataset elevates the correlation coefficient.

Figure 1

Hypothetical datasets were computer generated to illustrate the effect of binning on Pearson’s correlation coefficient. (A) A Monte Carlo generated data set (n = 300 data points; OriginPro random number generator), with X values distributed uniformly randomly over [0; 1] interval and Y values following a Gaussian distribution centred at the slope line Y = βX (β = 0.5), with standard deviation σ = 0.7; unbroken line, best-fit linear regression; R, Pearson’s correlation coefficient; P, confidence level to reject the null-hypothesis ‘zero-slope of linear regression’. (B) The dataset as in (A), but binned with Y values averaged over 0.1 X intervals; X values shown at bin centres; other notations as in (A). (C) An example of Pearson’s R based image colocalisation analysis: computer generated brightness signals in a pair of image channels (green and magenta; Gaussian pixel noise added; ImageJ Fiji Process/Noise). The size of image pixels (binning) increases from left to right column as indicated. The Pearson’s R values for the interchannel pixel brightness colocalisation, and the statistical significance of the respective linear regression slope (H0-hypothesis rejection level p; ImageJ Fiji Colocalization/Coloc 2), are shown for the respective image pairs. (D) 'True signal': images as in (C) 'Raw data' panel but with all noise removed; notations as in (C).

Image binning and Pearson’s correlation

Pearson’s R has also been used to evaluate colocalisation or correlation between distributed signals recorded in two separate imaging channels, either static or dynamic (e.g., [10,11]). Digital images consist of pixels or voxels, the ‘elementary’ square or cubic areas over which the intensity of native signal sources (diffraction-limited light in case of light microscopy) is averaged. Thus, setting the pixel or voxel size by the imaging system is equivalent to the averaging-binning of raw data, as discussed earlier. It is no surprise, therefore, that Pearson’s R calculated to assess signal colocalisation between two images or image channels may depend on the pixel size [12]. The simulated example in Figure 1C illustrates a colocalisation test for a pair of image channels, each with two brightness spots representing the signal of interest; Gaussian noise has been added to reflect typical imaging conditions. Calculating Pearson’s R for signal colocalisation shows that its value increases strongly with image binning (increasing pixel size; Figure 1C). This increase does not appear spurious because the ‘true signal’ (noise removed) corresponds to the highest R (Figure 1D). At the same time, the linear correlation slope remains significantly above zero throughout (P < 0.001), suggesting that colocalisation is statistically significant. However, quantifying the strength of signal colocalisation in such cases is a complex, context-dependent task, which is outside the scope of this article. Here we simply argue that relying on Pearson’s R may not be the most robust approach for establishing signal correlation or colocalisation for multiplexed imaging.

Concluding remarks

This article outlined some basic considerations around the use of Pearson’s R and argues against overinterpreting Pearson’s R as an unequivocal indicator of the ‘strength’ or ‘weakness’ of the effect arising from the linear dependence of two variables. In some experimental designs where the task in question permits data binning or averaging, as often encountered in neuroscience, a regression analysis should provide more reliable insights. In this context, it might be particularly ambiguous to rely on R in the case of experimental metrics with inherent binning-averaging, such as variable pixel/voxel values used in cellular or brain imaging.

Acknowledgments

The study was supported by: Wellcome Trust (212251_Z_18_Z), Medical Research Council (UK) (MR/W019752/1), European Research Council (323113), and European Commission NEUROTWIN (857562).

Footnotes

Declaration of interests

The author declares no competing interests in relation to this work.

References

  • 1.Taylor R. Interpretation of the correlation-coefficient - a basic review. J Diagn Med Sonogr. 1990;6:35–39. [Google Scholar]
  • 2.Schober P, et al. Correlation coefficients: appropriate use and interpretation. Anesth Analg. 2018;126:1763–1768. doi: 10.1213/ANE.0000000000002864. [DOI] [PubMed] [Google Scholar]
  • 3.Mukaka MM. Statistics corner: a guide to appropriate use of correlation coefficient in medical research. Malawi Med J. 2012;24:69–71. [PMC free article] [PubMed] [Google Scholar]
  • 4.Monroe WS, Stuit DB. The interpretation of the coefficient of correlation. J Exp Educ. 1933;1:186–203. [Google Scholar]
  • 5.Bannister NJ, Larkman AU. Dendritic morphology of CA1 pyramidal neurones from the rat hippocampus: II. Spine distributions. J Comp Neurol. 1995;360:161–171. doi: 10.1002/cne.903600112. [DOI] [PubMed] [Google Scholar]
  • 6.Andrasfalvy BK, Magee JC. Distance-dependent increase in AMPA receptor number in the dendrites of adult hippocampal CA1 pyramidal neurons. J Neurosci. 2001;21:9151–9159. doi: 10.1523/JNEUROSCI.21-23-09151.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Mendonca PRF, et al. Asynchronous glutamate release is enhanced in low release efficacy synapses and dispersed across the active zone. Nat Commun. 2022;13:3497. doi: 10.1038/s41467-022-31070-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Jensen TP, et al. Release probability increases towards distal dendrites boosting high-frequency signal transfer in the rodent hippocampus. eLife. 2021;10:62588. doi: 10.7554/eLife.62588. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Grillo FW, et al. A distance-dependent distribution of presynaptic boutons tunes frequency-dependent dendritic integration. Neuron. 2018;99:275–282. doi: 10.1016/j.neuron.2018.06.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Dunn KW, et al. A practical guide to evaluating colocalization in biological microscopy. Am J Physiol Cell Physiol. 2011;300:C723–C742. doi: 10.1152/ajpcell.00462.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Mann K, et al. Whole-brain calcium imaging reveals an intrinsic functional network in Drosophila. Curr Biol. 2017;27:2389. doi: 10.1016/j.cub.2017.06.076. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Heddleston JM, et al. A guide to accurate reporting in digital image acquisition - can anyone replicate your microscopy data? J Cell Sci. 2021;134:jcs254144. doi: 10.1242/jcs.254144. [DOI] [PubMed] [Google Scholar]

RESOURCES