Skip to main content
. 2013 Jul 4;14:214. doi: 10.1186/1471-2105-14-214

Figure 2.

Figure 2

Mean differences across a pool ofN groups causes Simpson’s paradox. (a)rxy was obtained from combining N groups of simulated data; simulation parameter ρxy is the true correlation of the pair xy within each group i = 1,N for Inline graphic. All other simulation parameters are as follows: μxy,i = (i,N−(i−1)), −0.9 ≤ ρxy ≤ 0.9, ni = 10, λi = λ, 0.01 ≤ λ ≤ 0.1, for i = 1,N, and 10 ≤ N ≤ 100. (b) Scatterplot of a pair xy obtained with the simulation parameters: μxy,i = (i,(11−i)), ρxy = 0.9 and ni = 50 for i = 1,10 groups. This plot shows clearly that even though there is a positive trend within each of the 10 groups, the trend across the pool of 10 groups is negative (Simpson’s paradox).