Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2014 May 22.
Published in final edited form as: Biometrics. 2012 May 2;68(4):1278–1284. doi: 10.1111/j.1541-0420.2012.01761.x

A Method of Moments Estimator for Random Effect Multivariate Meta-Analysis

Han Chen 1,*, Alisa K Manning 2, Josée Dupuis 1
PMCID: PMC4030295  NIHMSID: NIHMS452482  PMID: 22551393

SUMMARY

Meta-analysis is a powerful approach to combine evidence from multiple studies to make inference about one or more parameters of interest, such as regression coefficients. The validity of the fixed effect model meta-analysis depends on the underlying assumption that all studies in the meta-analysis share the same effect size. In the presence of heterogeneity, the fixed effect model incorrectly ignores the between-study variance and may yield false positive results. The random effect model takes into account both within-study and between-study variances. It is more conservative than the fixed effect model and should be favored in the presence of heterogeneity. In this paper, we develop a noniterative method of moments estimator for the between-study covariance matrix in the random effect model multivariate meta-analysis. To our knowledge, it is the first such method of moments estimator in the matrix form. We show that our estimator is a multivariate extension of DerSimonian and Laird’s univariate method of moments estimator, and it is invariant to linear transformations. In the simulation study, our method performs well when compared to existing random effect model multivariate meta-analysis approaches. We also apply our method in the analysis of a real data example.

Keywords: Between-study covariance matrix, Heterogeneity, Method of moments estimator, Multivariate meta-analysis, Random effect model

1. Introduction

Meta-analysis has been widely used to increase precision and power by combining studies (Cohn and Becker, 2003). Assuming that the effect to be estimated is the same in all studies, the fixed effect meta-analysis is often successfully used to combine the studies and obtain the point estimate for the effect size and its standard error. However, the underlying assumption of equal effect sizes of the fixed effect model may be violated (DerSimonian and Laird, 1986). Different studies may come from different populations, use different protocols and have different levels of confounding or effect modifying variables. Thus, the studies may not share a common effect size of interest. In the presence of heterogeneity, the fixed effect model underestimates the standard error of the point estimate by ignoring the between-study variance. False positive findings may be generated when the fixed effect model is inappropriately used.

The random effect model allows both within-study and between-study variances, and is more conservative than the fixed effect model in declaring significance of the effect size of interest. In the presence of heterogeneity, the random effect model provides more appropriate standard error of the point estimate and better confidence interval than the fixed effect model. When meta-analyzing a single parameter of interest, one can estimate the between-study variance by using the EM algorithm (Dempster et al., 1977) or other iterative methods; a noniterative method of moments estimator has also been proposed (DerSimonian and Laird, 1986).

Meta-analysis has also been applied to two or more correlated effect estimates (Raudenbush et al., 1988), such as regression coefficients (Becker and Wu, 2007). Analogous to the univariate case in which the inverse variance is used as the weight, the inverse of the covariance matrix is the weight in the multivariate fixed effect model. Random effect models have also been proposed to incorporate the between-study covariance matrix (Berkey et al., 1998, van Houwelingen et al., 2002, Riley et al., 2007). However, the iterative procedure is often computer intensive and may not reach convergence. Recently, an extended DerSimonian and Laird’s method of moments estimator was proposed to solve the between-study covariance matrix (Jackson et al., 2010). Though this method is a noniterative approach, it requires calculating each element of the matrix separately, and it is not invariant to reparametrization of effect sizes.

In this paper, we proposed a novel method of moments estimator for the between-study covariance matrix in the multivariate meta-analysis. It is also a multivariate extension of DerSimonian and Laird’s univariate method of moments estimator. To our knowledge, this is the first noniterative estimator for the between-study covariance matrix in the matrix form. It is invariant to linear transformations. We perform a simulation study to compare our method with the restricted maximum likelihood (REML) method (Jennrich and Schluchter, 1986) and Jackson’s multivariate DerSimonian and Laird’s method. We also apply the three random effect methods as well as the fixed effect approach to a real data example.

2. Fixed Effect Multivariate Meta-Analysis

Suppose we are interested in meta-analyzing p correlated effects from k studies. To estimate the true effect sizes

β=[β1β2βp],

we need effect estimates and their covariance matrix from individual studies. For study i (1 ≤ ik), we denote the effect estimates

bi=[bi1bi2bip]

and their covariance matrix Σi.

To meta-analyze vectors b1, b2, …, bk and get a summary estimate, we need generalized least squares (GLS) methods instead of ordinary least squares (OLS) methods, because the variances of effect estimates from different studies are unequal. We first stack the k vectors to get a long vector with length kp

b=[b1b2bk].

Assuming that the k studies are uncorrelated, we make a blockwise diagonal matrix

Σ=[Σ1000Σ2000Σk]kp×kp.

This is the covariance matrix of vector b.

We assume the following model holds:

bkp×1=Wkp×pβp×1+ekp×1,

where W is a stack of k identity matrices of size p × p, and we assume that the error e follows a multivariate normal distribution with means 0 and covariance matrix Σ, which is the covariance matrix of vector b.

The fixed effect model summary estimator (Raudenbush et al., 1988, Becker and Wu, 2007) is

β^F=(WΣ1W)1WΣ1b,

with covariance estimator

Cov(β^F)=(WΣ1W)1.

The null hypothesis of homogeneity

E(b1)=E(b2)==E(bk)=β

can be tested using the homogeneity test statistic

Q=(bWβ^F)Σ1(bWβ^F).

Q is a scalar. Under the null hypothesis of no heterogeneity, it asymptotically follows a chi-square distribution with (k − 1)p degrees of freedom (Becker and Wu, 2007).

3. Random Effect Multivariate Meta-Analysis

Similarly to the fixed effect model, we assume

bkp×1=Wkp×pβp×1+δkp×1+ekp×1.

We assume that the error e follows a multivariate normal distribution with means 0 and covariance matrix Σ. The random effect vector

δ=[δ1δ2δk],

where δi (1 ≤ ik) follows a multivariate normal distribution with means 0 and covariance matrix T. Thus the covariance matrix of vector b is

Ωkp×kp=Σkp×kp+Ik×kTp×p=[Σ1+T000Σ2+T000Σk+T]kp×kp.

Σi (1 ≤ ik) is the within-study covariance matrix for study i defined above, Ik×k is the k × k identity matrix, T is the between-study covariance matrix. The symbol ⊗ denotes the Kronecker product of two matrices.

The random effect model summary estimator is

β^R=(WΩ1W)1WΩ1b,

with covariance estimator

Cov(β^R)=(WΩ1W)1.

The crucial step for the random effect approach is to estimate the between-study covariance matrix T. In this paper we use the following three methods. In all the methods, the values of the covariance matrices Σi (1 ≤ ik) are fixed at the obtained study-specific estimates.

3.1 Restricted maximum likelihood method

The REML estimation can be performed by maximizing

L(T)=12log|Ω|12log|WΩ1W|12rΩ1r,

where

r=bW(WΩ1W)1WΩ1b,

under the constraint that T is positive semi-definite (Jackson et al., 2010, Jennrich and Schluchter, 1986).

3.2 Jackson’s multivariate DerSimonian and Laird’s method

Jackson’s multivariate DerSimonian and Laird’s method calculates each entry of T separately (Jackson et al., 2010). Let

Ti,j=Σu=1k(bu(i)bi[j])(bu(j)bj[i])Σu(i,i)Σu(j,j),

where

bi[j]=Σu=1kbu(i)Σu(i,i)Σu(j,j)Σu=1k1Σu(i,i)Σu(j,j),

and bu(i) is the ith (1 ≤ ip) element in the vector bu (1 ≤ uk), Σu(i,j) is the element on row i (1 ≤ ip) and column j (1 ≤ jp) of Σu (1 ≤ uk). Let

Ei,j=Σu=1kΣu(i,j)Σu(i,i)Σu(j,j)Σu=1kΣu(i,j)Σu(i,i)Σu(j,j)Σu=1k1Σu(i,i)Σu(j,j),Fi,j=Σu=1k1Σu(i,i)Σu(j,j)Σu=1k1Σu(i,i)Σu(j,j)Σu=1k1Σu(i,i)Σu(j,j),

then

T^i,j=Ti,jEi,jFi,j

is the element on row i (1 ≤ ip) and column j (1 ≤ jp) of .

While the calculation is generally faster than REML, Jackson’s method requires calculating p2 weighted means i[j] as intermediates for a p × p matrix , and it is not invariant to reparametrization of effect sizes.

3.3 A novel multivariate DerSimonian and Laird’s method

Let

Ψ=Cov(β^F)=(WΣ1W)1=(Σj=1kΣj1)1,
Φ=Ψ1Σi=1kΣi1ΨΣi1=Σi=1k[Σi1Σi1(Σj=1kΣj1)1Σi1],
A=Σj=1kΣj1(bjβ^F)(bjβ^F)(k1)Ip×p,

where Σj, bj (1 ≤ jk) are defined above, β̂F is the fixed effect estimate defined above. Then

E(A)=ΦT.

A symmetric method of moments estimator for T is

T^=Φ1A+AΦ12.

See Web Appendix A for the derivation. A very nice property of our estimator is that it is invariant to linear transformation (Web Appendix B).

When p = 1, this estimator is scalar and is equal to DerSimonian and Laird’s estimator (Web Appendix C).

In the presence of heterogeneity, the homogeneity test statistic Q no longer follows a chi-square distribution. However, its expectation is a function of the true between-study covariance matrix T (Web Appendix D).

Since T is a covariance matrix, it should be positive semi-definite. While we can do the maximization under the constraint that T is positive semi-definite for the REML method, we generally have no guarantee that Jackson’s or our method of moments estimator would be positive semi-definite, especially when heterogeneity is low. A remedy for this issue is discussed in Web Appendix E, which we adopted in all the analyses for this paper. Jackson et al. used the same strategy in their paper (Jackson et al., 2010).

4. Simulation

4.1 Simulation design

To compare the performance of the REML method, Jackson’s multivariate DerSimonian and Laird’s method (MDLJ) and Chen’s multivariate DerSimonian and Laird’s method (MDLC) developed in this paper, we conducted simulation studies in the context of bivariate meta-analysis. We performed all the calculation and analysis in R-2.9.2 (R Development Core Team, 2009).

We considered 10 studies with different sample sizes. 100 between-study variances were generated from a chi-square distribution with 1 df, and values less than 0.016 or greater than 2.7 were discarded (corresponding approximately to the 10% and 90% quantiles of 1 df chi-square distribution). Then we randomly chose 2 sets of 10 variances out of the remaining values, sorted and paired them. The smallest pair was assigned to the first study as the within-study variances of the two effects and so on until the largest pair was assigned to the last study. The within-study correlation was set to 0.2 or 0.8 for all 10 studies.

We followed the procedure by Higgins and Thompson to calculate the between-study variances (Jackson et al., 2010, Higgins and Thompson, 2002). Since the variances for both outcomes were simulated from the same population, we first calculated the typical within-study variance

σ2=limn(n1)Σi=1n1σi2(Σi=1n1σi2)2Σi=1n1σi4,

where σi2 was generated as discussed above. For this parameter setting we have σ2=0.15. Then we computed the between-study variances using

I12=T11T11+σ2=T11T11+0.15I22=T22T22+σ2=T22T22+0.15

where I12 and I22 are the proportions of marginal variation in the first and second effects due to heterogeneity, respectively. We considered 9 scenarios, in which each of I12 and I22 was set to 0.2, 0.5 or 0.8 to simulate low, moderate and high heterogeneity for each effect. T11 and T22 are the between-study variances for the first and second effects. The between-study correlation was set to 0.2 or 0.8 to calculate the covariance.

For each study i (1 ≤ i ≤ 10), we generated the effect size vector bi from a bivariate normal distribution with mean 0 and covariance matrix Σi + T, where Σi is the within-study covariance matrix and T is the between-study covariance matrix.

4.2 Simulation results

We summarized the simulation results for between-study correlation 0.2 and within-study correlation 0.2 in Table 1. We only presented results for the first effect and the covariance, since the second effect shows similar results. Columns 7 – 9 show the bias, mean squared error for the first summary effect estimator β̂1 and 95% confidence interval coverage for the first effect size. The confidence interval was constructed as β̂1 ± t0.025,9SE (β̂1) (Follmann and Proschan, 1999). Columns 10 – 13 show the bias of the between-study variance estimator for the first effect 11 and that of the between-study covariance estimator 12, and their corresponding mean squared errors. Column 14 shows the proportion of the between-study covariance matrix estimate lying on the boundary of its parameter space: either at least one of the variance estimates is 0, or the absolute value of the correlation coefficient is 1. Since the between-study covariance matrix after the Web Appendix E correction when necessary is always positive semi-definite, this column also indicates the percentage of the between-study covariance matrix not being positive definite, that is, its minimum eigenvalue is equal to 0. To allow for rounding errors, we defined being less than 10-8 as being equal to 0.

Table 1.

Simulation results from 1000 replicates for 10 studies with between-study correlation 0.2 and within-study correlation 0.2.

I12
I22
T11 T12 T12 Method Bias β̂1 MSE β̂1 Coverage β̂1 Bias 11 Bias 12 MSE 11 MSE 12 % boundary
0.2 0.2 0.0375 0.0075 0.0375 REML −0.0065 0.0331 0.962 0.0491 0.0167 0.0235 0.0078 0.888
MDLJ −0.0073 0.0326 0.963 0.0473 0.0086 0.0142 0.0051 0.819
MDLC −0.0072 0.0326 0.963 0.0476 0.0085 0.0142 0.0052 0.824
0.2 0.5 0.0375 0.0150 0.1500 REML −0.0001 0.0311 0.952 0.0452 0.0096 0.0218 0.0118 0.838
MDLJ −0.0017 0.0307 0.957 0.0447 0.0011 0.0154 0.0092 0.748
MDLC −0.0016 0.0307 0.958 0.0447 0.0009 0.0152 0.0094 0.751
0.2 0.8 0.0375 0.0300 0.6000 REML 0.0048 0.0334 0.962 0.0553 0.0082 0.0270 0.0343 0.708
MDLJ 0.0054 0.0328 0.969 0.0499 −0.0044 0.0164 0.0262 0.629
MDLC 0.0058 0.0328 0.967 0.0505 −0.0044 0.0168 0.0262 0.641
0.5 0.2 0.1500 0.0150 0.0375 REML 0.0103 0.0562 0.934 0.0369 0.0185 0.0470 0.0137 0.814
MDLJ 0.0104 0.0548 0.938 0.0265 0.0091 0.0361 0.0104 0.738
MDLC 0.0102 0.0549 0.939 0.0267 0.0089 0.0361 0.0105 0.737
0.5 0.5 0.1500 0.0300 0.1500 REML 0.0108 0.0527 0.924 0.0278 0.0051 0.0490 0.0195 0.731
MDLJ 0.0121 0.0521 0.930 0.0207 −0.0017 0.0317 0.0161 0.621
MDLC 0.0120 0.0521 0.929 0.0213 −0.0018 0.0315 0.0162 0.622
0.5 0.8 0.1500 0.0600 0.6000 REML −0.0001 0.0499 0.925 0.0350 0.0034 0.0530 0.0500 0.552
MDLJ −0.0012 0.0492 0.937 0.0265 −0.0079 0.0383 0.0454 0.454
MDLC −0.0009 0.0492 0.935 0.0268 −0.0071 0.0380 0.0458 0.451
0.8 0.2 0.6000 0.0300 0.0375 REML 0.0099 0.1132 0.929 0.0296 0.0125 0.2405 0.0347 0.739
MDLJ 0.0112 0.1130 0.933 0.0296 −0.0009 0.2763 0.0280 0.619
MDLC 0.0113 0.1131 0.933 0.0299 −0.0006 0.2764 0.0279 0.612
0.8 0.5 0.6000 0.0600 0.1500 REML 0.0055 0.1147 0.925 0.0769 0.0149 0.3099 0.0567 0.545
MDLJ 0.0046 0.1136 0.934 0.0504 0.0007 0.2876 0.0501 0.443
MDLC 0.0045 0.1136 0.932 0.0507 0.0013 0.2879 0.0500 0.439
0.8 0.8 0.6000 0.1200 0.6000 REML 0.0256 0.1003 0.925 0.0065 0.0034 0.2153 0.1108 0.288
MDLJ 0.0264 0.0998 0.928 0.0033 0.0048 0.2551 0.1358 0.210
MDLC 0.0262 0.0998 0.930 0.0042 0.0055 0.2559 0.1350 0.214

The true values of β1 and β2 were both set to 0. I12 and I22 denote the proportions of marginal variation in the first and second effects due to heterogeneity. T11 and T22 denote the between-study variances for the first and second effects, and T12 denotes the between-study covariance. REML, MDLJ and MDLC denote restricted maximum likelihood method, Jackson’s multivariate DerSimonian and Laird’s method and Chen’s multivariate DerSimonian and Laird’s method in this paper. The Web Appendix E correction was applied when the between-study covariance matrix estimate from MDLJ or MDLC was not positive semi-definite. Coverage denotes the coverage of 95% confidence interval. % boundary denotes the proportion of the between-study covariance matrix estimate lying on the boundary of its parameter space.

We can see from Table 1 that all three methods give very similar results, and our new approach is closer to Jackson’s method rather than the REML. This is not surprising because REML is a likelihood-based iterative method, while the other two are method of moments. In all scenarios REML gives more boundary-valued between-study covariance matrix estimates than the other non-iterative methods.

Bias 11 is generally greater than 0, because we pull back negative eigenvalues to 0 when fixing covariance matrix estimates that are not positive semi-definite, we somehow bias the diagonal elements upwards. We can see that as the heterogeneity increases, this bias generally decreases, and the proportion of boundary-valued between-study covariance matrix estimates also decreases. This is consistent with our prior knowledge that fixed effect models are usually preferred when heterogeneity is low and random effect models are more appropriate when heterogeneity is high. Simulation results for other correlation coefficient settings are summarized in Web Tables 1 – 3.

4.3 Additional simulation study

We performed 6 additional simulation studies to investigate the effect of the number of studies on the results. We set I12 and I22 to be 0.5, between-study and within-study correlation coefficients to be 0.2, and let the number of studies change from 10 to 5, 20, 50, 100, 200 and 500. Since all three methods give similar results, only results using our new method are shown in Figure 1 and Web Table 4. As the number of studies increases, mean squared errors of β̂1, the summary effect estimate for the first effect, and 11, the between-study variance estimate for the first effect decrease, which is reasonable because the large-sample properties of estimators in a meta-analysis depend on the number of studies. As the number of studies goes to infinity, mean squared errors converge to 0.

Figure 1.

Figure 1

Simulation results from 1000 replicates for various numbers of studies with I12=I22=0.5, between-study correlation 0.2 and within-study correlation 0.2. (A) Mean squared errors of β̂1, the first summary effect estimate, and 11, the between-study variance estimate for the first effect. Proportion of the between-study covariance matrix estimate lying on the boundary of its parameter space. (B) Coverage of 95% confidence interval for β1, the first effect size.

We can also see that as the number of studies increases, the proportion of the between-study covariance matrix estimate lying on the boundary of its parameter space (matrix with minimum eigenvalue 0) decreases dramatically, even though the heterogeneity is only moderate. As we fix fewer and fewer covariance matrices that are not positive semi-definite, bias of 11 also decreases quickly to near 0. There is no clear relationship between the coverage of 95% confidence interval for β1 and the number of studies, although only the largest sample of 500 studies has the correct coverage.

5. Application

In this application we use data from the base year of the High School Longitudinal Study of 2009 (HSLS:09) (Ingels et al., 2011). HSLS:09 is a nationally representative, longitudinal study of more than 21,000 9th graders in 944 schools who will be followed through their secondary and postsecondary years. We are interested in testing whether sex, socio-economic status and sex by socio-economic status interaction are predictive of the mathematics standardized theta score. We estimate the regression coefficients in each of the 8 race groups and perform multivariate meta-analyses on the regression coefficients to obtain the summary effect estimates.

Within each race group i, our model is

Yij=βi0+βi1Xij1+βi2Xij2+βi3Xij1Xij2+εij

where Yij is the mathematics standardized theta score, Xij1 is the sex, Xij2 is the socio-economic status score for student j. εij is the normally distributed error. The regression results are summarized in Table 2.

Table 2.

Regression results for 8 race groups.

Race i Ni bi1 bi2 bi3 Var(bi1) Var(bi2) Var(bi3) Cov(bi1, bi2) Cov(bi1, bi3) Cov(bi2, bi3)
1 163 0.3161 7.4015 0.4278 2.3568 9.7029 4.4114 −1.2105 0.8524 −6.1753
2 1672 −0.3201 6.9426 −0.9816 0.2529 0.7016 0.2743 0.1498 −0.1019 −0.4167
3 2218 0.6983 4.6680 −0.2415 0.1444 0.6481 0.2608 −0.0652 0.0433 −0.3899
4 204 3.2736 4.3080 0.2052 3.8428 10.3517 4.8268 −4.5587 3.2892 −6.6684
5 3311 −0.1599 5.6398 −0.6782 0.1161 0.4363 0.1733 −0.0992 0.0645 −0.2610
6 1912 −0.6989 6.3158 −0.7918 0.1603 0.7697 0.3180 0.0242 −0.0129 −0.4686
7 110 −3.6094 9.3429 −2.8711 3.2054 17.8889 7.2101 −1.1984 0.8437 −10.7697
8 11854 0.2172 6.4078 −0.6093 0.0278 0.1184 0.0482 0.0136 −0.0091 −0.0716

Ni is the sample size in race group i. bi1 is the regression coefficient for sex, bi2 is the regression coefficient for socio-economic status score, bi3 is the regression coefficient for sex by socio-economic status score interaction in race group i.

We use both the fixed effect model and random effect models to meta-analyze the regression coefficients from the 8 race groups. Table 3 shows the meta-analysis results. For this data, the homogeneity test statistic Q is 54.6, which asymptotically follows a chi-square distribution with 21 degrees of freedom under the null hypothesis of no heterogeneity. The p-value of the homogeneity test is 8.1×10-5. Thus, the assumption of homogeneous effect sizes for the fixed effect model is violated. Since the fixed effect model does not take between-study variance into consideration, it greatly underestimates the covariance matrix for the summary effect estimates, resulting in an inflated Wald test statistic for testing the hypotheses H0: β1 = β2 = β3 = 0 versus H1: at least one of β1, β2, β3 is not 0, where β1, β2, β3 are the summary effect sizes for sex, socio-economic status score and sex by socio-economic status score interaction, respectively.

Table 3.

Meta-analysis results for 8 race groups.

Method β̂1 (SE) β̂2 (SE) β̂3 (SE) 11 22 33 12 13 23 Wald statistic
FEMA 0.0788 (0.1208) 6.2031 (0.2448) −0.6590 (0.1550) 0 0 0 0 0 0 4141
REML −0.0244 (0.2427) 6.1674 (0.4843) −0.6679 (0.1865) 0.2090 1.0028 0.0657 −0.2946 0.1172 −0.2567 431
MDLJ −0.0612 (0.2599) 6.1873 (0.2973) −0.7038 (0.1888) 0.2558 0.1279 0.0501 −0.1221 0.0097 0.0542 567
MDLC −0.0604 (0.2684) 6.1821 (0.2887) −0.7009 (0.1894) 0.2805 0.1024 0.0532 −0.0948 0.0030 0.0602 571

FEMA denotes the fixed effect meta-analysis. REML, MDLJ and MDLC denote restricted maximum likelihood method, Jackson’s multivariate DerSimonian and Laird’s method and Chen’s multivariate DerSimonian and Laird’s method in this paper. β̂1, β̂2 and β̂3 denote the summary effect size estimates for sex, socio-economic status score and sex by socio-economic status score interaction, respectively. 11, 22, 33, 12, 13, 23 are corresponding elements of the matrix , the between-study covariance matrix estimate. Wald statistic is for testing the hypotheses H0: β1 = β2 = β3=0 versus H1: at least one of β1, β2, β3 of is not 0.

Jackson’s multivariate DerSimonian and Laird’s method of moments and our method give close results, while the restricted maximum likelihood method yields a different between-study covariance matrix estimate. However, all three random effect methods give very close summary effect size estimates, and the Wald test statistic reduces dramatically, compared to that from the fixed effect model.

6. Discussion

We propose a new method of moments estimator for the between-study covariance matrix in the random effect multivariate meta-analysis. We have shown in our simulation that our method gives similar results as existing random effect model multivariate meta-analysis methods. Furthermore, our method and Jackson’s multivariate DerSimonian and Laird’s method give very close results in both simulation studies and real data analysis.

Our estimator is the first matrix form method of moments estimator for the between-study covariance matrix in the random effect model multivariate meta-analysis. It is invariant to linear transformations. As a non-iterative estimator, it is very easy to calculate.

Despite its long history in combining published analysis results, meta-analysis is also very useful in multi-center or multi-ethnic studies, when different cohorts can share results from the same analysis but it is often not feasible to share original data. Generally, the fixed effect model is the first choice in a meta-analysis as it is easier to calculate and interpret, and it is more powerful than random effect models. However, in the presence of heterogeneity, results from the fixed effect model are not valid, then random effect models are preferred. When performing the random effect meta-analysis for p effects, Jackson’s method requires calculating p2 weighted means i[j] (1 ≤ ip, 1 ≤ jp) in intermediate steps, which are not related to corresponding fixed effect summary estimates. Specifically, the weighted means i[i] (1 ≤ ip) are the fixed effect summary estimates in univariate meta-analyses, instead of the multivariate meta-analysis we desire. In the contrast, our method directly uses the fixed effect summary estimates vector to calculate the between-study covariance matrix estimate. It does not require performing p2 additional univariate meta-analyses to calculate the intermediates.

Recently, a U-statistic based random effect model multivariate meta-analysis approach was proposed (Ma and Mazumdar, 2011). This approach does not require the normality assumption for the effect sizes as it is nonparametric. However, similar with Jackson’s method, the U-statistic method also requires estimating each element of the between-study covariance matrix separately. Future work involves the development of a matrix form estimator for the U-statistic approach.

Supplementary Material

Supporting Information

Acknowledgments

This research was partially supported by NIH awards R01 DK078616 and U01 DK85526. A portion of this research was conducted using the Linux Clusters for Genetic Analysis (LinGA) computing resource funded by the Robert Dawson Evans Endowment of the Department of Medicine at Boston University School of Medicine and Boston Medical Center and contributions from individual investigators.

Footnotes

Supplementary Materials

Web Appendices and Tables referenced in Sections 3.3, 4.2 and 4.3 are available with this paper at the Biometrics website on Wiley Online Library.

References

  1. Becker BJ, Wu M. The synthesis of regression slopes in meta-analysis. Statistical Science. A Review Journal of the Institute of Mathematical Statistics. 2007;22:414–429. [Google Scholar]
  2. Berkey CS, Hoaglin DC, Antczak-Bouckoms A, Mosteller F, Colditz GA. Meta-analysis of multiple outcomes by regression with random effects. Statistics in medicine. 1998;17:2537–2550. doi: 10.1002/(sici)1097-0258(19981130)17:22<2537::aid-sim953>3.0.co;2-c. [DOI] [PubMed] [Google Scholar]
  3. Cohn LD, Becker BJ. How Meta-Analysis Increases Statistical Power. Psychological methods. 2003;8:243–253. doi: 10.1037/1082-989X.8.3.243. [DOI] [PubMed] [Google Scholar]
  4. Dempster AP, Laird NM, Rubin DB. Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Statistical Society. Series B (Methodological) 1977;39:1–38. [Google Scholar]
  5. DerSimonian R, Laird N. Meta-analysis in clinical trials. Controlled clinical trials. 1986;7:177–188. doi: 10.1016/0197-2456(86)90046-2. [DOI] [PubMed] [Google Scholar]
  6. Follmann DA, Proschan MA. Valid Inference in Random Effects Meta-Analysis. Biometrics. 1999;55:732–737. doi: 10.1111/j.0006-341x.1999.00732.x. [DOI] [PubMed] [Google Scholar]
  7. Higgins JPT, Thompson SG. Quantifying heterogeneity in a meta-analysis. Statistics in medicine. 2002;21:1539–1558. doi: 10.1002/sim.1186. [DOI] [PubMed] [Google Scholar]
  8. Ingels SJ, Pratt DJ, Herget DR, Burns LJ, Dever JA, Ottem R, Rogers JE, Jin Y, Leinwand S. High School Longitudinal Study of 2009 (HSLS:09). Base-Year Data File Documentation. Washington, DC: U.S. Department of Education, National Center for Education Statistics; 2011. (NCES 2011-328). Retrieved Nov. 16, 2011 from http://nces.ed.gov/pubsearch. [Google Scholar]
  9. Jackson D, White IR, Thompson SG. Extending DerSimonian and Laird’s methodology to perform multivariate random effects meta-analyses. Statistics in medicine. 2010;29:1282–1297. doi: 10.1002/sim.3602. [DOI] [PubMed] [Google Scholar]
  10. Jennrich RI, Schluchter MD. Unbalanced Repeated-Measures Models with Structured Covariance Matrices. Biometrics. 1986;42:805–820. [PubMed] [Google Scholar]
  11. Ma Y, Mazumdar M. Multivariate meta-analysis: a robust approach based on the theory of U-statistic. Statistics in medicine. 2011;30:2911–2929. doi: 10.1002/sim.4327. [DOI] [PubMed] [Google Scholar]
  12. R Development Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing; Vienna, Austria: 2009. URL http://www.R-project.org. [Google Scholar]
  13. Raudenbush SW, Becker BJ, Kalaian H. Modeling multivariate effect sizes. Psychological bulletin. 1988;103:111–120. [Google Scholar]
  14. Riley RD, Abrams KR, Lambert PC, Sutton AJ, Thompson JR. An evaluation of bivariate random-effects meta-analysis for the joint synthesis of two correlated outcomes. Statistics in medicine. 2007;26:78–97. doi: 10.1002/sim.2524. [DOI] [PubMed] [Google Scholar]
  15. van Houwelingen HC, Arends LR, Stijnen T. Advanced methods in meta-analysis: multivariate approach and meta-regression. Statistics in medicine. 2002;21:589–624. doi: 10.1002/sim.1040. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information

RESOURCES