Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Dec 1.
Published in final edited form as: Stat Methods Med Res. 2010 Dec 21;21(6):621–633. doi: 10.1177/0962280210393712

Bivariate Random Effects Models for Meta-Analysis of Comparative Studies with Binary Outcomes: Methods for the Absolute Risk Difference and Relative Risk

Haitao Chu 1,*, Lei Nie 2, Yong Chen 3, Yi Huang 4, Wei Sun 5
PMCID: PMC3348438  NIHMSID: NIHMS351256  PMID: 21177306

Abstract

Multivariate meta-analysis is increasingly utilized in biomedical research to combine data of multiple comparative clinical studies for evaluating drug efficacy and safety profile. When the probability of the event of interest is rare or when the individual study sample sizes are small, a substantial proportion of studies may not have any event of interest. Conventional meta-analysis methods either exclude such studies or include them through ad-hoc continuality correction by adding an arbitrary positive value to each cell of the corresponding 2 by 2 tables, which may result in less accurate conclusions. Furthermore, different continuity corrections may result in inconsistent conclusions. In this article, we discuss a bivariate Beta-binomial model derived from Sarmanov family of bivariate distributions and a bivariate generalized linear mixed effects model for binary clustered data to make valid inferences. These bivariate random effects models use all available data without ad hoc continuity corrections, and accounts for the potential correlation between treatment (or exposure) and control groups within studies naturally. We then utilize the bivariate random effects models to reanalyze two recent meta-analysis data sets.

Keywords: clustered binary data, bivariate random effects models, Beta-binomial distribution, meta-analysis, bivariate generalized linear mixed models

1. Introduction

The growth of evidence-based medicine has led to an increase in attention to meta-analysis.1 Meta-analysis, also known as systematic overview, is a statistical process commonly used in biomedical research of combining the information from several independent studies concerned with the same clinical question including the treatment or exposure effect, with the aim of being able to resolve contradictory issues that cannot be concluded from a single study alone.

In meta-analysis of a set of N clinical trials with a binary outcome comparing an experimental treatment with a placebo, data can be represented as a series of 2 by 2 tables. The standard fixed and random meta-analysis methods for providing an overall estimate of the treatment effect across all studies rely on some assumptions.2 Specifically, the fixed effect model assumes homogeneous treatment effects across all studies. Let θi be the value of a chosen measure (e.g., risk difference or log relative risk) of treatment effect in the ith study (i = 1, 2, …, N), the homogeneity requires θi = θ (i = 1, 2, …, N). Let θ̂i be an estimate of θi, and wi denote the weight, which is often taken to be the reciprocal of the estimated variance ν̂i of θ̂i (i.e., ŵi = 1/ν̂i),3 then the overall treatment effect based on the fixed effect model is estimated as a weighted average of the individual study estimated treatment effects, that is, θ^w=iw^iθ^i/iw^i. Under the combined null hypothesis H0: θi = 0 (i = 1, 2, …, N), the test statistic U=(iw^iθ^i)2/iw^i follows a χ2 distribution with 1 degree of freedom. A formal test of homogeneity can be performed using the Cochran’s Q statistic, defined by Q=iw^i(θ^iθ^w)2, which has approximately a χN12 distribution under the null hypothesis H0: θi= θ (i = 1, 2, …, N).

Through a random-effects model, DerSimonian and Laird4 provided a way of incorporating heterogeneity into the overall estimate by including a between-study variance component σb2. It basically assumes that θ̂i ~ N (θi, i) and θiN(θ,σb2).2 The overall treatment effect is once again obtained as a weighted average with the weight being estimated as w^i=1/(v^i+σ^b2), i.e., θ^w=iw^iθ^i/iw^i. Under the combined null hypothesis H0: θi = 0 (i = 1, 2, …, N), the test statistic U=(iw^iθ^i)2/iw^i follows a χ2 distribution with 1 degree of freedom. The method of moments estimate of σb2 is given by σ^b2=max{[iw^i(iw^i2)/iw^i]1[Q(N1)],0}.4

A concern on these conventional two-step meta-analysis methods is that they require estimating study- specific treatment effect θ̂i (commonly expressed by log relative risk, log odds ratio or risk difference) and its variance ν̂i based on the normal approximation. When the probability of the event of interest is rare or if the individual study sample sizes are small, this normality assumption might not hold. Furthermore, a substantial proportion of studies may not have any event of interest. To circumvent the issues of zero cells, the conventional meta-analysis methods either exclude such studies5 or add an arbitrary positive value to each cell of the corresponding 2 by 2 tables in the analysis, which may lead to less accurate conclusions. For example, different continuity corrections may result in different conclusions.6 An interesting yet challenging methodology question is how to use all available data without assigning an arbitrary number to the empty cells in meta-analysis.710 Furthermore, it has been noted that these weighting-according-to-the-variance methods may introduce biases in meta-analyses of binary outcomes because this weighting scheme favors studies with certain frequencies of outcome events.11 The relative weights for the individual studies in a meta-analysis may change considerably among different choices of effect measurements, which may lead to contradictory conclusions. This is particularly true for the sparse data scenario. Specifically, a study with zero event in both treatment and placebo groups, which would be excluded on a relative scale, would be included and even be given a large weight on a risk difference scale.6

Recently, multivariate random effects models for meta-analyses have become increasingly popular in biomedical research. For example, multivariate random effects models have been proposed for meta-analyses of diagnostic test studies1218 and correlated multiple outcomes.19, 20 Given the potential issues of applying conventional meta-analysis methods based on a univariate outcome, we discuss bivariate random effects models to deal with those challenges for meta-analyses of comparative clinical trials with binary outcomes in this article. Although the proposed methods were primarily presented for bivariate meta-analyses, they can be easily generalized to multivariate meta-analyses. Specifically, Section 2 shows the estimation of marginal treatment effects using the maximum likelihood methods under two models, i.e., a generalized linear mixed effects model and a bivariate Beta-binomial model. In Section 3, we reanalyze the data from two case studies: the study of type 2 diabetes mellitus after gestational diabetes 21 and the study of myocardial infarction with Rosiglitazone.5 Section 4 concludes this article with a brief discussion.

2. Bivariate Random Effects Models for Meta-Analysis of Comparative Studies

Let nki be the number of subjects, and pki be the probability of “success” for the ith study (i = 1, 2, …, N) in the kth treatment (or exposure) group with k = 1 denoting the placebo (or unexposed) group and k =2 denoting the treated (or exposed) group. Let Ykij denote a Bernoulli random variable with value 1 denoting a “success” and value 0 denoting a “failure” for the jth subject (j = 1, 2, …, n) of the ith study in the kth treatment group. Let Xki=j=1nkiYkij be the total number of “success” in the kth treatment group in the ith study. In the first stage, conditional on the probability of events (i.e., pki) and the number of subjects (i.e., nki) of the kth treatment group in the ith study, the bivariate random effects model assumes that Xki is independently binomially distributed as Bin (nki, pki) for k = 1, 2 and i = 1, 2, …, N, that is,

P(X1i=x1i,X2i=x2in1i,n2i,p1i,p2i)=k=12(nkixki)(pki)xki(1pki)nkixki. (1)

In the second stage, the joint distribution f(p1i, p2i), which is also denoted as f(p1, p2) for ease of notation, is specified. Specifically, we first review the bivariate generalized linear mixed effects models and then propose the bivariate Beta-binomial models as an alternative, for the evaluation of marginal treatment or exposure effect. Note that bivariate models are commonly used when there are two outcomes (e.g. the response to a treatment and the appearance of a side effect), in this article, we use bivariate models to jointly model the study-specific response rates in the placebo group and the treatment group in a meta-analysis with multiple studies.

2.1 Bivariate Generalized Linear Mixed Effects Models

In the second stage, the bivariate generalized linear mixed effects model (BGLMM) assumes a bivariate normal distribution of (p1i, p2i) in a transformed scale, which implies a linear relationship between p1i and p2i on a transformed scale. It is generally specified as follows,

g(p1i)=ν1+ν1i,g(p2i)=ν2+ν2i,and(ν1i,ν2i)TN(0,v), (2)

where g() is the link function such as the commonly used logit, probit and complementary log-log transformation functions, (ν1,ν2) are the fixed effects, and v=(σ12ρσ1σ2ρσ1σ2σ22) is the variance-covariance matrix. To implement the natural constraint of −1≤ρ≤1, one can use the Fisher’s z transformation as ρ= [exp (2z) − 1]/[exp(2z)+1].

Based on the model in equation (2), the median success probability in the kth treatment group for the population can be estimated as M(pk)= g−1 (νk), k =1, 2. And, its mean can be estimated as

E(pk)=+g1(νk+z)σk1φ(z/σk)dz,k=1,2, (3)

where φ() is the standard Gaussian density function. Based on the bivariate normality assumption of (ν1i, ν2i)T, the expected success probability in group k (k = 1, 2) at a given success probability in group l (l = 1, 2) in the transformed scale is given by

E[g(pk)g(pl)]=νk+ρσk/σl[g(pl)νl]=(νkρνlσk/σl)+ρσk/σlg(pl),kl;k,l=1,2. (4)

Thus, the BGLMM implies a linear relationship between p1 and p2 on a transformed scale.

2.2 Bivariate Beta-Binomial Models

As an alternative, beta-binomial distributions can be used to model the success probabilities of the treatment and control groups to account for the within-study correlation. To allow for the possible correlation between the success probabilities in the treatment and control groups, Lee22 introduced a class of bivariate Beta-binomial distributions using the framework introduced by Sarmanov.23 Such bivariate Beta-binomial distributions can be used to model the success probabilities of (p1i, p2i) jointly as follows,

f(p1i,p2i)=f(p1,p2)=[1+ωk=12(pkμk)]k=12[(pk)αk1(1pk)βk1B(αk,βk)], (5)

where αk, βk > 0, E(pk)=μk=αkαk+βk and B(αk,βk)=01xαk1(1x)βk1dx. The bivariate beta-binomial distribution specified by equation (5) has several attractive features. First, the marginal distribution of pk follows a Beta distribution f(pk)= Beta(αk, βk). Secondly, a correlation between the success probabilities in the treatment and control groups is allowed and modeled by ρ= ωδ1δ2, where δk2=αkβk(αk+βk)2(αk+βk+1) is the variance of pk. When ω = 0, equation (5) collapses to the product of two univariate Beta densities, corresponding to independent Beta distributions for p1 and p2. To ensure a valid joint probability density function, ω must satisfy the condition

[max(α1α2,β1β2)]1k=12(αk+βk)ω[max(α1β2,β1α2)]1k=12(αk+βk). (6)

To ensure a valid joint probability density function for (p1, p2) and avoid computational difficulties, we re-parameterize ω by the unconstrained parameter η as follows,

ω=k=12(αk+βk){exp(η)1+exp(η)[max(α1β2,β1α2)]111+exp(η)[max(α1α2,β1β2)]1}. (7)

The conditional distribution of pk for a chosen pl is f(pkpl)=f(p1,p2)f(pl)=f(pk)[1+ωk=12(pkμk)](kl;k=1,2;l=1,2). Thus, the conditional mean of pk for a chosen pl is given by E(pk | pl) = μk + ρδk/δl(plμl), and the conditional variance of pk for a chosen pl is given by

Var(pkpl)=δk2{1ρ2(plμl)2δl2}+ρplμlδkδl{2βkαk(βkαk)(αk+βk+2)(αk+βk+1)(αk+βk)3}. (8)

Thus, the bivariate Beta-Binomial model implies a linear relationship between p1 and p2 on the original scale. The unconditional joint probability density function for (X1= x1i, X2= x2i) is,

P(X1=x1i,X2=x2in1i,n2i,α1,α2,β1,β2,ω)=0101k=12(nkixki)(pki)xki(1pki)nkixki[1+ωk=12(pkiμki)]k=12[(pki)αk1(1pki)βk1B(αk,βk)]dp1idp2i=k=12(nkixki)B(αk+xki,βk+nkixki)B(αk,βk)×[1+ωk=12xkinkiμkαk+βk+nki], (9)

which leads to the following log-likelihood function for the observed 2 × 2 tables after ignoring some constant,

logL(ω,αk,βk)=i=1Nk=12{logB(αk+xki,βk+nkixki)logB(αk,βk)+log[1+ωk=12xkinkiμkαk+βk+nki]}, (10)

where ω must satisfy the condition in equation (6) to ensure nonnegative probability.

2.3 Marginal Treatment Effects: Risk Difference and Risk Ratio

Although the issue of deciding which effect measure to use in a particular application is non-trivial,1 we focus on the estimation of risk ratio (RR) and risk difference (RD) here for two reasons: 1) the interpretation of odds ratio (OR) as an estimate of RR often leads to exaggerated associations when the binary outcome of interest is common;2426 and 2) the well-known non-collapsibility issue related to OR makes it undesirable in interpretation and estimation.27, 28 For example, in the presence of effect modification, when an exposure increases risk but all risks are less than 0.5, it is possible for the relative risk and the risk difference to change in the same direction, but the odds ratio to change in the opposite direction.29 In this article, we focus on the overall marginal (or population averaged) treatment (or exposure) effect, as suggested by McCullagh,30, 31 which is defined as the risk difference (RD) = E(p1) − E(p2) and the risk ratio (RR) = E(p1)/E(p2), where E(pk)=+g1(νk+z)σk1φ(z/σk)dz for the BGLMM and E(pk) = αk/(αk + βk) for the bivariate Beta-binomial model, k =1,2. Furthermore, although the computation of E(pk)(k = 1,2) from BGLMM involves integration, there is a closed-form formula of E(pk)=Φ(νk/1+σk2) (k= 1,2) for the bivariate probit random effects model, and a well-established approximation formula of E(pk)expit(νi/1+C2σk2) (k= 1,2) for the bivariate logit random effects models,32 where C=163/(15π). For the complementary log-log random effects models, E(pk) can be easily computed by numerical integration, for example, by the trapezoidal rule with 1,000 equal space subintervals as implemented in this article.

2.4 Model Implementation

The bivariate Beta-binomial model and the bivariate generalized linear mixed model can be fitted using commonly used statistical software. We implement it through the SAS NLMIXED procedure (SAS Institute Inc., Cary, NC), which maximizes the likelihood function by dual quasi-Newton optimization techniques for the bivariate Beta-binomial model, and uses an adaptive Gaussian quadrature to approximate the likelihood integrated over the random effects for BGLMM.33 Furthermore, the delta method built in SAS NLMIXED is used to compute the population averaged overall treatment effect estimates and their standard errors based on the normal approximation. To select a model that can give a better goodness of fit, either the finite sample corrected Akaike’s Information Criterion (AICC) or the Bayesian Information Criterion (BIC) can be used as the guideline.34

3. Two Case Studies

To illustrate and compare the performance of the bivariate Beta-binomial model and the bivariate generalized linear mixed effects models discussed in this article, we apply them to two recently published meta-analyses.

3.1 Example 1: Meta-analysis of type 2 diabetes mellitus after gestational diabetes

Recently, Bellamy et al.21 presented an interesting comprehensive systematic review and meta-analysis to assess the strength of association between gestational diabetes and type 2 diabetes mellitus. In summary, 20 cohort studies were included in the meta-analysis with 675,455 women and 10,859 type 2 diabetic events. Table 1 shows the frequencies of the diabetic events for these 20 studies.

Table 1.

Example 1: Data from a Meta-analysis of Studies on Type 2 Diabetes Mellitus after Gestational Diabetes21

Study Type 2 Diabetes Mellitus with Gestational Diabetes Type 2 Diabetes Mellitus without Gestational Diabetes
# events # observations # events # observations
1 2874 21823 6628 637341
2 71 620 22 868
3 21 68 0 39
4 43 166 150 2242
5 53 295 1 111
6 405 5470 16 783
7 6 70 7 108
8 13 35 8 489
9 7 23 0 11
10 23 435 0 435
11 44 696 0 70
12 21 229 1 61
13 10 28 0 52
14 15 45 1 39
15 105 801 7 431
16 10 15 0 35
17 33 241 0 57
18 14 47 3 47
19 224 615 18 328
20 5 145 0 41

We fitted the bivariate Beta-binomial and the bivariate generalized linear mixed effects models as described in Section 2 to study the association between gestational diabetes and type 2 diabetes mellitus. Table 2 presents the parameter estimates and their standard errors, including the population averaged risk of type 2 diabetes mellitus for those with and without gestational diabetes, population averaged risk difference and risk ratio, and the goodness of fit measures including the finite sample corrected Akaike’s Information Criterion (AICC) and the Bayesian Information Criterion (BIC). As shown in Table 2, there is not enough evidence to support that the risks of type 2 diabetes mellitus for those with and without gestational diabetes are correlated within studies from both the bivariate Beta-binomial model and the bivariate generalized linear mixed models with three link functions, i.e., the models with ρ = 0 provide better goodness-of-fit for all four models considered. Note that the results from different models are very similar here. Based on AICC and BIC, the best fitted model is a bivariate logit generalized linear mixed effects model with ρ = 0 and σ12=σ22. Based on this model, the population averaged risk of type 2 diabetes mellitus for those with and without gestational diabetes are estimated to be 0.200 (standard error = 0.031) and 0.025 (standard error = 0.006) respectively. The population averaged risk difference is estimated to be 0.175 (standard error = 0.031), where the population averaged risk ratio is estimated to be 7.948 (standard error = 2.167). It is interesting to note that the population averaged risk ratio estimates from all models are slightly higher than what Bellamy et al.21 reported (i.e., 7.43 with 95% confidence interval of 4.79 to 11.51) based on the random effects model by DerSimonian and Laird.4 The reason might be the fact that an ad hoc continuity correction was implemented for the studies with zero diabetic events in the group without gestational diabetes in Bellamy et al.,21 where our models do not.

Table 2.

Point Estimates (Standard Errors) for Meta-analysis of Studies on Type 2 Diabetes Mellitus after Gestational Diabetes21

Model Description Modification from the standard model Probability 1 Probability 2 Risk Difference RD Relative Risk RR BIC AICC
Bivariate Probit GLMM
No change .2014 (.0306) .0238 (.0061) .1776 (.0292) 8.4556 (2.1305) 306.1 302.8
Random effects v1i = v2i .1934 (.0262) .0254 (.0064) .1680 (.0202) 7.6029 (0.9612) 341.6 339.3
Correlation ρ = 0 .2036 (.0313) .0226 (.0059) .1810 (.0319) 9.0077 (2.7280) 305.1 302.2
Correlation ρ = 0 and σ12=σ22 .2002 (.0291) .0237 (.0067) .1765 (.0295) 8.4589 (2.6317) 302.5 300.1
Bivariate Logit GLMM
No change .1973 (.0307) .0266 (.0068) .1707 (.0293) 7.4154 (1.8853) 306.2 303.0
Random effects v1i = v2i .2012 (.0298) .0233 (.0051) .1779 (.0251) 8.6212 (0.8395) 356.3 354.0
Correlation ρ = 0 .1991(.0313) .0255 (.0066) .1736 (.0320) 7.8038 (2.3676) 305.2 302.4
Correlation ρ = 0 and σ12=σ22 .1999 (.0312) .0252 (.0059) .1748 (.0314) 7.9476 (2.1669) 302.2 299.9
Bivariate Complementary log-log GLMM
No change .2037 (.0353) .0224 (.0063) .1813 (.0353) 9.0950 (2.8922) 306.2 303.0
Random effects v1i = v2i .2113 (.0361) .0197 (.0055) .1916 (.0355) 10.7288 (3.2038) 373.3 371.0
Correlation ρ = 0 .2051 (.0359) .0209 (.0059) .1842 (.0375) 9.9088 (3.5280) 305.4 302.6
Correlation ρ = 0 and σ12=σ22 .2082 (.0364) .0225 (.0057) .1857 (.0376) 9.2608 (3.0130) 302.6 300.3
Bivariate Beta-binomial model
No change .2039 (.0308) .0222 (.0056) .1817 (.0313) 9.1890 (2.6929) 307.7 307.0
Correlation ρ = 0 .2041 (.0308) .0222 (.0054) .1819 (.0313) 9.2114 (2.6571) 304.7 303.4
*

AICC = the finite sample corrected Akaike’s Information Criterion and BIC = the Bayesian Information Criterion. 1= the risk of type 2 Diabetes Mellitus with gestational diabetes, 2= the risk of type 2 Diabetes Mellitus without gestational diabetes. RD = 12; RR = 1/2.

Because one of the studies has almost all the cases (9502 out of 10859 cases), we did a sensitivity meta-analysis by excluding that study. The results are presented in Appendix eTable 1. In summary, it suggests similar conclusions. Specifically, the best fitted model is a bivariate probit generalized linear mixed effects model with ρ = 0 and σ12=σ22, and the population averaged risk of type 2 diabetes mellitus for those with and without gestational diabetes are estimated to be .205 (standard error = 0.031) and 0.025 (standard error = 0.007) respectively. The population averaged risk difference is estimated to be 0.181 (standard error = 0.032), where the population risk ratio is estimated to be 8.371 (standard error = 2.756).

3.2 Example 2: Meta-analysis of the risk of myocardial infarction with Rosiglitazone

To investigate whether rosiglitazone, a drug for treating type 2 diabetes mellitus, significantly increases the risk of myocardial infarction (MI) or cardiovascular disease (CVD)-related death, Nissen and Wolski5 performed a meta-analysis of 48 clinical trials that satisfied the inclusion criteria for their analysis. Among them, 10 studies have no MI events and 25 studies have no CVD-related deaths, which were simply excluded by Nissen and Wolski from their analysis. This meta-analysis data set has been reanalyzed by Shuster et al.,35 Tian et al.8 and others,3638 and updated by Dahabreh.39 For the illustration purpose, we will only focus on the association between rosiglitazone usage and the risk of myocardial infarction. In summary, 86 out of 16,856 in the rosiglitazone group and 72 out of 12,962 in the control group had MI event in the 48 clinical trials.

Similar to Section 3.1, we fitted the bivariate Beta-binomial and the bivariate generalized linear mixed effects models as described in Section 2 on those 48 clinical trials to study the association between rosiglitazone usage and the risk of MI in type 2 diabetes mellitus patients. Table 3 presents the parameter estimates and their standard errors, including the population averaged risk of MI event for those with and without rosiglitazone usage, the population averaged risk difference and risk ratio, and the goodness of fit measurements including the finite sample corrected Akaike’s Information Criterion (AICC) and the Bayesian Information Criterion (BIC). The BGLMM models assuming random effects v1i = v2i with any of the three link functions provide better model fit than the bivariate Beta-binomial model with either ρ ≠ 0 or ρ = 0. Based on AICC and BIC, the bivariate logit and complementary log-log generalized linear mixed effects models with random effects v1i = v2i provide similar best fit.

Table 3.

Point Estimates (Standard Errors) for Meta-analysis of the Risk of Myocardial Infarction with Rosiglitazone5

Model Description Modification from the standard model Probability 1 Probability 2 Risk Difference RD Relative Risk RR BIC AICC
Bivariate Probit GLMM
No change .00493 (.00091) .00359 (.00079) .00133 (.00075) 1.3710 (.2455) 262.4 253.7
Random effects v1i = v2i .00494 (.00093) .00364 (.00077) .00130 (.00074) 1.3577 (.2325) 254.8 249.5
Correlation ρ = 0 .00486 (.00095) .00379 (.00914) .00107 (.00132) 1.2811 (.3973) 272.4 265.3
Correlation ρ = 0 and σ12=σ22 .00491 (.00096) .00377 (.00087) .00115 (.00128) 1.3037 (.3884) 269.2 263.8
Bivariate Logit GLMM
No change .00624 (.00131) .00484 (.00115) .00141 (.00086) 1.2929 (.2023) 261.9 253.2
Random effects v1i = v2i .00627 (.00133) .00480 (.00109) .00146 (.00085) 1.3046 (.1933) 254.2 248.9
Correlation ρ = 0 .00618 (.00140) .00508 (.00141) .00110 (.00199) 1.2164 (.4359) 272.7 265.7
Correlation ρ = 0 and σ12=σ22 .00627 (.00134) .00496 (.00113) .00131 (.00153) 1.2643 (.3441) 268.8 263.5
Bivariate Complementary log-log GLMM
No change .00501 (.00094) .00353 (.00099) .00148 (.00092) 1.4187 (.3389) 261.9 253.2
Random effects v1i = v2i .00499 (.00094) .00373 (.00088) .00126 (.00075) 1.3390 (.2399) 254.2 248.9
Correlation ρ = 0 .00494 (.00097) .00367 (.00106) .00126 (.00133) 1.3423 (.4361) 272.7 265.6
Correlation ρ = 0 and σ12=σ22 .00497 (.00097) .00384 (.00092) .00113 (.00130) 1.2945 (.3885) 268.8 263.5
Bivariate Beta-binomial model
No change .00491 (.00090) .00385 (.00138) .00111 (.00126) 1.2915 (.3823) 277.8 269.9
Correlation ρ = 0 .00491 (.00090) .00380 (.00088) .00111 (.00126) 1.2932 (.3826) 274.0 267.5
*

AICC = the finite sample corrected Akaike’s Information Criterion and BIC = the Bayesian Information Criterion. 1 = the risk of myocardial infarction in the rosiglitazone group, 2 = the risk of myocardial infarction in the control group, RD = 12; RR = 1/2.

It is worthy to mention that for the logit BGLMM model, it seems that the approximation of E(pk)expit(νk/1+C2σk2), where C=163/(15π), slightly overestimate the population averaged probability of MI for each group. For example, for the logit BGLMM model assuming random effects v1i = v2i, the estimated population averaged probabilities of MI in the rosiglitazone treatment and control groups are 0.00627 (standard error = 0.00133) and 0.00480 (standard error = 0.00109) using the approximation of E(pk)expit(νk/1+C2σk2). While, using the numerical integration by E(pk)=+1/[1+exp(νkz)]σk1φ(z/σk)dz, the corresponding estimates are 0.00493 (standard error = 0.00140) and 0.00366 (standard error = 0.00114), which are consistent with the estimates from other models. However, we notice that the overestimation of the population averaged probability of MI using the approximation formula of the logit BGLMM does not seem to have any noticeable effects on the estimation of risk difference or risk ratio.

4. Discussion

In this article, we discussed bivariate Beta-binomial models derived from Sarmanov family of bivariate distributions and bivariate generalized linear mixed effects models using a general link function for meta-analysis of 2 by 2 tables in comparative clinical studies. Specifically, we have discussed logit, probit and complementary log-log link functions as special cases. These bivariate random effects models naturally account for the potential correlation between treatment (or exposure) and control groups within studies. Moreover, they can be used to make valid inferences using all available data without using ad hoc continuity corrections for the sparse data scenario. We illustrated the utilization of the bivariate random effects models in two recent meta-analysis data sets, which emphasizes the importance of model selection. In particular, based on AICC and BIC, in the example one, the best fitted model is a bivariate logit generalized linear mixed effects model with ρ = 0 and σ12=σ22, which suggests that the study-specific risks of type 2 diabetes mellitus (in logit scale) for those with and without gestational diabetes are independent and have similar variations. In the example two, both the bivariate logit and complementary log-log generalized linear mixed effects models with random effects v1i = v2i provide similar best fit, which suggests that one can reasonably assume a fixed effect of rosiglitazone on the risk of myocardial infarction (in logit or complementary log-log scale). Furthermore, we provided methods to estimate the population averaged risk difference and relative risk. It is worth to noting that the bivariate Beta-binomial model and the bivariate generalized linear mixed effects models involves two different distributional assumptions, one would imagine that their performance would heavily depend on whether the distributional assumptions approximate the underline data generating mechanism. In particular, the bivariate generalized linear mixed effects models implies a linear relationship between p1 and p2 on a transformed scale, and after transforming back to the scale of probability, the relation between p1 and p2 is no longer linear. The Beta-binomial model implies a linear relationship between p1 and p2 on the original scale. So which model works better in a particular application depends on whether the relation between p1 and p2 is linear on the original scale or on the transformed scale. We suggest that fitting both models and comparing goodness-of-fit to select the best model to make inference in practice.

Alternative approaches using Bayesian methods can be fitted by free downloadable software such as WinBUGS, for example, by the Bayesian random effect models as in Warn, Thompson and Spiegelhalter.40 However, Warn et al. 40 focused on the conditional treatment effects. Here, we focus on the overall marginal (or population averaged) treatment effects, as suggested by McCullagh.30, 31 Remark that our parameterization of BGLMM is slightly different from the random effects models by Smith et al.41 and Warn et al.40 Specifically, Smith et al. 41 considered a Bayesian logit random effects model assuming logit (p1i)= μiδi/2, logit (p2i) = μi + δi/2, δi ~ N(δi, σ2), and non-informative priors for the average event rates, μi s, which are treated as the nuisance parameters. It implicitly restricts Var[logit(p1i)] = Var[logit(p2i)], i.e., restricting σ12=σ22 as in the BGLMM model. Warn et al.40 assumes that g(p1i) = μi, g(p2i) = μi + δi and δi ~ N(δ, σ2) where g() is a link function, which implicitly restricts Var[g(p1i)] ≤ Var[g(p2i)], i.e., restricting σ12σ22 in our BGLMM parameterization. It is worth pointing out that our purpose here is not to demonstrate the advantage of our approach over a Bayesian approach, because both BGLMM and Bivariate Beta-binomial models can be fitted using a Bayesian approach. For the general model that we considered in equation (2), we do not make any restrictions on the variances of σ12 and σ22. Furthermore, the bivariate Beta-binomial model and the bivariate generalized linear mixed models we proposed in this article do not include any study-level or individual level covariates. It is straightforward to include covariates when using the BGLMM through the SAS NLMIXED procedure.

Supplementary Material

Acknowledgments

Dr. Haitao Chu was supported in part by 1P01CA142538-01 from the U.S. National Cancer Institute. The authors are grateful to the Editor Brian Everitt and 2 anonymous reviewers for their helpful comments on an earlier version of this manuscript.

Reference List

  • 1.Egger M, Smith GD, Altman DG. Systematic reviews in health care: meta-analysis in context. BMJ Publishing Group; 2001. [Google Scholar]
  • 2.Whitehead A, Whitehead J. A General Parametric Approach to the Meta-analysis of Randomized Clinical-Trials. Statistics in Medicine. 1991;10(11):1665–1677. doi: 10.1002/sim.4780101105. [DOI] [PubMed] [Google Scholar]
  • 3.Woolf B. On Estimating the Relation Between Blood Group and Disease. Annals of Human Genetics. 1955;19 (3):251–253. doi: 10.1111/j.1469-1809.1955.tb01348.x. [DOI] [PubMed] [Google Scholar]
  • 4.Dersimonian R, Laird N. Meta-analysis in Clinical Trials. Controlled Clinical Trials. 1986;7(3):177–188. doi: 10.1016/0197-2456(86)90046-2. [DOI] [PubMed] [Google Scholar]
  • 5.Nissen SE, Wolski K. Effect of Rosiglitazone on the Risk of Myocardial Infarction and Death from Cardiovascular Causes. The New England Journal of Medicine. 2007;356(24):2457–2471. doi: 10.1056/NEJMoa072761. [DOI] [PubMed] [Google Scholar]
  • 6.Sweeting MJ, Sutton AJ, Lambert PC. What to add to nothing? Use and avoidance of continuity corrections in meta-analysis of sparse data. Statistics in Medicine. 2004;23:1351–1375. doi: 10.1002/sim.1761. [DOI] [PubMed] [Google Scholar]
  • 7.Bradburn MJ, Deeks JJ, Berlin JA, Localio AR. Much ado about nothing: a comparison of the performance of meta-analytical methods with rare events. Statistics in Medicine. 2007;26(1):53–77. doi: 10.1002/sim.2528. [DOI] [PubMed] [Google Scholar]
  • 8.Tian L, Cai T, Pfeffer MA, Piankov N, Cremieux PY, Wei LJ. Exact and efficient inference procedure for meta-analysis and its application to the analysis of independent 2 x 2 tables with all available data but without artificial continuity correction. Biostatistics. 2009;10(2):275–281. doi: 10.1093/biostatistics/kxn034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Vandermeer B, Bialy L, Hooton N, Hartling L, Klassen TP, Johnston BC, Wiebe N. Meta-analyses of safety data: a comparison of exact versus asymptotic methods. Statistical Methods in Medical Research. 2009;18(4):421–432. doi: 10.1177/0962280208092559. [DOI] [PubMed] [Google Scholar]
  • 10.Rucker G, Schwarzer G, Carpenter J, Olkin I. Why add anything to nothing? The arcsine difference as a measure of treatment effect in meta-analysis with zero cells. Statistics in Medicine. 2009;28 (5):721–738. doi: 10.1002/sim.3511. [DOI] [PubMed] [Google Scholar]
  • 11.Tang JL. Weighting bias in meta-analysis of binary outcomes. Journal of Clinical Epidemiology. 2000;53 (11):1130–1136. doi: 10.1016/s0895-4356(00)00237-7. [DOI] [PubMed] [Google Scholar]
  • 12.Chu H, Cole SR. Bivariate meta-analysis of sensitivity and specificity with sparse data: a generalized linear mixed model approach. Journal of Clinical Epidemiology. 2006;59(12):1331–1332. doi: 10.1016/j.jclinepi.2006.06.011. [DOI] [PubMed] [Google Scholar]
  • 13.Riley R, Abrams K, Sutton A, Lambert P, Thompson J. Bivariate random-effects meta-analysis and the estimation of between-study correlation. BMC Medical Research Methodology. 2007;7(1):3. doi: 10.1186/1471-2288-7-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Harbord RM, Deeks JJ, Egger M, Whiting P, Sterne JAC. A unification of models for meta-analysis of diagnostic accuracy studies. Biostatistics. 2007;8(2):239–251. doi: 10.1093/biostatistics/kxl004. [DOI] [PubMed] [Google Scholar]
  • 15.Riley RD, Thompson JR, Abrams KR. An alternative model for bivariate random-effects meta-analysis when the within-study correlations are unknown. Biostatistics. 2008;9(1):172–186. doi: 10.1093/biostatistics/kxm023. [DOI] [PubMed] [Google Scholar]
  • 16.Chu H, Chen S, Louis TA. Random Effects Models in a Meta-Analysis of the Accuracy of Two Diagnostic Tests without a Gold Standard. Journal of the American Statistical Association. 2009;104:512–523. doi: 10.1198/jasa.2009.0017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Chu H, Guo H, Zhou Y. Bivariate Random Effects Meta-Analysis of Diagnostic Studies Using Generalized Linear Mixed Models. Medical Decision Making. 2010;30(4):499–508. doi: 10.1177/0272989X09353452. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Chu H, Nie L, Cole SR, Poole C. Meta-analysis of diagnostic accuracy studies accounting for disease prevalence: alternative parameterizations and model selection. Stat Med. 2009;28(18):2384–2399. doi: 10.1002/sim.3627. [DOI] [PubMed] [Google Scholar]
  • 19.Riley RD, Abrams KR, Lambert PC, Sutton AJ, Thompson JR. An evaluation of bivariate random-effects meta-analysis for the joint synthesis of two correlated outcomes. Statistics in Medicine. 2007;26 (1):78–97. doi: 10.1002/sim.2524. [DOI] [PubMed] [Google Scholar]
  • 20.Jackson D, White IR, Thompson SG. Extending DerSimonian and Laird’s methodology to perform multivariate random effects meta-analyses. Statistics in Medicine. 2009 doi: 10.1002/sim.3602. in press. [DOI] [PubMed] [Google Scholar]
  • 21.Bellamy L, Casas JP, Hingorani AD, Williams D. Type 2 diabetes mellitus after gestational diabetes: a systematic review and meta-analysis. The Lancet. 2009;373(9677):1773–1779. doi: 10.1016/S0140-6736(09)60731-5. [DOI] [PubMed] [Google Scholar]
  • 22.Lee MLT. Properties and applications of the Sarmanov family of bivariate distributions. Communications in Statistics-Theory and Methods. 1996;25(6):1207–1222. [Google Scholar]
  • 23.Sarmanov OV. Generalized Normal Correlation and Two Dimensional Frechet Classes. Doklady (Soviet Mathematics) 1966;168:596–599. [Google Scholar]
  • 24.Schwartz LM, Woloshin S, Welch HG. Misunderstandings about the Effects of Race and Sex on Physicians’ Referrals for Cardiac Catheterization. The New England Journal of Medicine. 1999;341(4):279–283. doi: 10.1056/NEJM199907223410411. [DOI] [PubMed] [Google Scholar]
  • 25.Cummings P. Early Exposure to Marijuana and Risk of Later Drug Use. JAMA: The Journal of the American Medical Association. 2003;290(3):329–32a. doi: 10.1001/jama.290.3.329-b. [DOI] [PubMed] [Google Scholar]
  • 26.Kalilani L, Atashili J. Measuring additive interaction using odds ratios. Epidemiologic Perspectives & Innovations. 2006;3(1):5. doi: 10.1186/1742-5573-3-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Greenland S. Interpretation and Choice of Effect Measures in Epidemiologic Analyses. American Journal of Epidemiology. 1987;125(5):761–768. doi: 10.1093/oxfordjournals.aje.a114593. [DOI] [PubMed] [Google Scholar]
  • 28.Greenland S, Robins JM, Pearl J. Confounding and collapsibility in causal inference. Statistical Science. 1999;14(1):29–46. [Google Scholar]
  • 29.Brumback B, Berg A. On effect-measure modification: Relationships among changes in the relative risk, odds ratio, and risk difference. Statistics in Medicine. 2008;27(18):3453–3465. doi: 10.1002/sim.3246. [DOI] [PubMed] [Google Scholar]
  • 30.McCullagh P. Sampling bias and logistic models. Journal of the Royal Statistical Society Series B-Statistical Methodology. 2008;70:643–664. [Google Scholar]
  • 31.Longford NT, Diggle PJ, Nelder J, Copas JB, Ross GJS, Moller J, Cox DR, Keogh R, Kuk AYC, Lauritzen SL, Richardson TS, Lee Y, Meng XL, Rathouz P, VanderWeele TJ. Sampling bias and logistic models - Discussion. Journal of the Royal Statistical Society Series B-Statistical Methodology. 2008;70:664–677. [Google Scholar]
  • 32.Zeger SL, Liang KY, Albert PS. Models for Longitudinal Data - A Generalized Estimating Equation Approach. Biometrics. 1988;44(4):1049–1060. [PubMed] [Google Scholar]
  • 33.Pinheiro JC, Bates DM. Approximations to the log-likelihood function in the nonlinear mixed-effects model. Journal of Computational and Graphical Statistics. 1995;4(1):12–35. [Google Scholar]
  • 34.Burnham KP, Anderson DR. Model Selection and Inference: A Practical Information-Theoretic Approach. Springer-Verlag; New York: 1998. [Google Scholar]
  • 35.Shuster JJ, Jones LS, Salmon DA. Fixed vs random effects meta-analysis in rare event studies: The Rosiglitazone link with myocardial infarction and cardiac death. Statistics in Medicine. 2007;26 (24):4375–4385. doi: 10.1002/sim.3060. [DOI] [PubMed] [Google Scholar]
  • 36.Shuster JJ, Jones LS, Salmon DA. Rebuttal to Carpenter et al. Comments on ‘Fixed vs. random effects meta-analysis in rare event studies: The rosiglitazone link with myocardial infarction and cardiac death’ - Reply. Statistics in Medicine. 2008;27(19):3912–3914. doi: 10.1002/sim.3173. [DOI] [PubMed] [Google Scholar]
  • 37.Carpenter J, Rucker G, Schwarzer G. Comments on ‘Fixed vs random effects meta-analysis in rare event studies: the rosiglitazone link with myocardial infarction and cardiac death’. Stat Med. 2008;27 (19):3910–3912. doi: 10.1002/sim.3173. [DOI] [PubMed] [Google Scholar]
  • 38.Waksman J, Kollar C. Comments on ‘Rebuttal to Carpenter et al. ’ Comments on ‘Fixed vs random effects meta-analysis in rare event studies: The rosiglitazone link with myocardial infarction and cardiac death’. Statistics in Medicine. 2009;28(3):534–536. doi: 10.1002/sim.3485. [DOI] [PubMed] [Google Scholar]
  • 39.Dahabreh IJ. Meta-analysis of rare events: an update and sensitivity analysis of cardiovascular events in randomized trials of rosiglitazone. Clinical Trials. 2008;5(2):116–120. doi: 10.1177/1740774508090212. [DOI] [PubMed] [Google Scholar]
  • 40.Warn DE, Thompson SG, Spiegelhalter DJ. Bayesian random effects meta-analysis of trials with binary outcomes: methods for the absolute risk difference and relative risk scales. Statistics in Medicine. 2002;21(11):1601–1623. doi: 10.1002/sim.1189. [DOI] [PubMed] [Google Scholar]
  • 41.Smith TC, Spiegelhalter DJ, Thomas A. Bayesian approaches to random-effects meta-analysis: A comparative study. Statistics in Medicine. 1995;14(24):2685–2699. doi: 10.1002/sim.4780142408. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

RESOURCES