Abstract
Many meta-analysis models contain multiple parameters, for example due to multiple outcomes, multiple treatments or multiple regression coefficients. In particular, meta-regression models may contain multiple study-level covariates, and one-stage individual participant data meta-analysis models may contain multiple patient-level covariates and interactions. Here, we propose how to derive percentage study weights for such situations, in order to reveal the (otherwise hidden) contribution of each study toward the parameter estimates of interest. We assume that studies are independent, and utilise a decomposition of Fisher’s information matrix to decompose the total variance matrix of parameter estimates into study-specific contributions, from which percentage weights are derived. This approach generalises how percentage weights are calculated in a traditional, single parameter meta-analysis model. Application is made to one- and two-stage individual participant data meta-analyses, meta-regression and network (multivariate) meta-analysis of multiple treatments. These reveal percentage study weights toward clinically important estimates, such as summary treatment effects and treatment-covariate interactions, and are especially useful when some studies are potential outliers or at high risk of bias. We also derive percentage study weights toward methodologically interesting measures, such as the magnitude of ecological bias (difference between within-study and across-study associations) and the amount of inconsistency (difference between direct and indirect evidence in a network meta-analysis).
Keywords: Percentage study weights, Fisher’s information, individual patient data meta-analysis, meta-regression, network meta-analysis
1 Introduction
Meta-analysis is the synthesis of quantitative information from related studies to produce summary (pooled) results to help answer clinically relevant questions, such as whether a treatment is effective. Statistical models for meta-analysis often use aggregate data (such as a treatment effect estimate and its variance) from each study, but increasingly they utilise individual participant data (IPD).1,2 Regardless of the approach taken, forest plots are an important way to disseminate results to a clinical audience as they quickly summarise the size and spread of individual study results alongside the summary meta-analysis result. In relation to forest plots, the PRISMA Statement recommends that ‘… it is preferable also to include, for each study, the numerical group-specific summary data, the effect size and confidence interval, and the percentage weight’.3 Percentage study weights aim to break down the summary meta-analysis result into the relative contribution of each individual study, and are easily interpretable by non-statisticians.
In this article, we propose a general approach to deriving percentage study weights in meta-analysis models with multiple parameters, including meta-regression, network (multivariate) meta-analysis and IPD models with interactions. Our approach includes, as a special case, the simpler, traditional situation where just a single parameter (such as a treatment effect) is to be synthesised. However, most meta-analysis models contain multiple parameters, for example due to multiple outcomes, multiple treatments or multiple regression coefficients. In particular, meta-regression models may contain multiple study-level covariates,4 and one-stage IPD meta-analysis models may contain multiple patient-level covariates and interactions.5,6
Our aim is to allow researchers to produce percentage study weights for each parameter of interest within a multi-parameter meta-analysis model. Our motivation is to increase transparency of each study’s contribution, which is especially important when some studies are potential outliers or at high risk of bias, to reveal how much their data contributed to the overall meta-analysis result. Similarly, if clinical decision makers are concerned that a meta-analysis result may be unreliable for translation to practice due to the inclusion of a study from a different clinical population or setting, then knowing the study’s contribution is important. Percentage study weights are also helpful for methodological purposes, for example to: (a) help understand and explain differences between one- and two-stage IPD meta-analysis results, for which there is much interest;7–11 (b) help understand differences between results from IPD meta-analyses and those based on aggregate data obtained from published studies, which may include different studies and/or patients;12–14 (c) to ascertain the contribution of IPD studies in meta-analyses that combine IPD and non-IPD (aggregate data) studies;15,16 (d) understand the contribution of small studies in meta-analyses with funnel plot asymmetry (i.e. with potential publication or availability bias)17 or (e) to understand which studies are contributing most toward potential bias or inconsistency terms, for example within a network meta-analysis.18
The article is structured as follows. Section 2 recaps percentage study weights in a traditional meta-analysis of a single parameter, and reveals how they can be expressed on the variance scale using a decomposition of Fisher’s information. In Section 3, we extend this concept to multiple parameter models expressed within a general or generalised linear mixed model framework. These include one-stage IPD models, meta-regression and multivariate (network) meta-analysis. Section 4 provides three applied examples, covering IPD meta-analysis, meta-regression and network meta-analysis, for the estimation of treatment effects and treatment-covariate interactions. Special consideration of weights toward bias and inconsistency terms is also given. Section 5 concludes with discussion.
2 Derivation of percentage study weights in a traditional, single-parameter meta-analysis
We begin by outlining percentage study weights within a traditional meta-analysis that uses a two-stage approach to summarise a single parameter. We focus on summarising a treatment effect from multiple randomised controlled trials, but the concepts are immediately applicable to a different parameter of interest, such as a treatment-covariate interaction.
2.1 First stage
Let us assume that there are i = 1 to K randomised trials for meta-analysis, each comparing a particular treatment to control, and that the treatment effect is of interest. In the two-stage approach, the first stage involves obtaining (e.g. from study publications or by using IPD) the K treatment effect estimates () and their within-study variances (). Let us assume IPD are available for all trials, so that we can analyse the raw data for each trial separately. The choice of analysis model depends on the outcome data type.5 For example, for continuous outcome responses, such as blood pressure, a linear regression model might be fitted,19 potentially adjusting for baseline values (analysis of covariance, ANCOVA),20
(1) |
where and denote the final (F) and baseline (B) value, respectively, for patient j in trial i, and is 0/1 for participants in the control/treatment group, and is the residual variance, assumed normally distributed with mean zero and variance . Thus, is the intercept, is the effect of a one-unit increase in baseline blood pressure and is the treatment effect. Estimation of model (1) gives = (, ) and also . Taking the latter as known, we can invert Fisher’s observed information matrix for (i.e. invert the variance of the score for ) to estimate the variance matrix for :
2.2 Second stage
In the second stage, traditionally only a single parameter estimate from each study is of interest, such that a univariate meta-analysis is needed. In the above example, the treatment effect estimates () are the focus, and so a meta-analysis model is needed to combine these across trials. A fixed effect model assumes that the underlying treatment effect is the same in all studies, and thus the values are all estimates of a common treatment effect, θ. The model can be written as:
(2) |
In model (2), the var() estimates are usually assumed known,21 and the model can be fitted using maximum likelihood to give the summary estimate, , of the common treatment effect. A random effects meta-analysis model allows for between-study heterogeneity in the true treatment effect, and can be written as,
(3) |
where the var() estimates are again typically assumed known, and ui denotes a random effect that indicates the treatment effect in the ith trial, , is assumed normally distributed about an average treatment effect, θ, with between-study variance, . Model (3) can be estimated using, for example, method of moments22 or restricted maximum likelihood, to give the summary estimate, , of the average treatment effect across trials. Extension of model (3) to a meta-regression within study-level covariates will be considered in Section 3.
2.3 Decomposing Fisher’s information for
The maximum likelihood solution for the summary meta-analysis estimate () is
(4) |
where in the fixed effect model, and in the random effects model. The summary estimate, , is thus a weighted average of the , with the study weights (wi) depending on the ‘known’ sampling error (var()) and, in the random effects setting, also the estimated between-study variance ().
The variance of is = 1/. This is obtained by the inverse of Fisher’s observed information matrix which, as models (2) and (3) contain only one main parameter (θ), is just a scalar (i.e. a 1 by 1 matrix) equal to . The total observed information (, say) toward can be written as,
(5) |
where is used to denote the information attributed to study i. As studies are independent, the total information is simply the sum of the . This decomposition will be utilised again in Section 3, as it generalises to more complex meta-analysis models. The can be computed by re-fitting the meta-analysis model but with just study i included, whilst holding remaining variance estimates at their values from the full analysis (i.e. hold and, in the random effects model, also ).
Interestingly, we can now re-express the variance of into a linear sum of independent study terms by utilising the decomposition of Fisher’s information matrix, as follows:
(6) |
Thus, we have , and these Wi allow us to derive study weights on the variance scale, as now described.
2.4 Derivation of percentage study weights
Given the meta-analysis solution of (4), it is well-known that the corresponding percentage study weights are obtained by:
Thus, those studies with the largest wi values will have the largest weight. Following equation (6), we can equivalently consider percentage weights in terms of Wi by:
(7) |
Therefore, the well-known percentage weights for a single parameter meta-analysis are equivalent to a decomposition of the variance, , into the sum of independent Wi terms. This concept is utilised in Section 3 to extend percentage study weights to more complex, multi-parameter meta-analysis models. Indeed, the traditional use of to define percentage weights actually works on the precision scale, which does not generalise to a multi-parameter meta-analysis model. However, the use of the variance scale does generalise, and so use of Wi will become increasingly important.
3 Derivation of percentage study weights in multi-parameter meta-analysis and meta-regression models
We now use the concept of decomposing the variance of parameter estimates to derive percentage study weights in more complex, multi-parameter models. We begin with one-stage IPD models with continuous outcomes via linear mixed models, before then extending to meta-regression, multivariate meta-analysis and generalised linear mixed models. We focus on random effects models, as fixed effect models are merely a simplified case.
3.1 One-stage meta-analysis using linear mixed models
3.1.1 Model specification and estimation
A one-stage IPD meta-analysis model for continuous outcomes can be expressed within a linear mixed model in matrix form, using a design matrix, X, for the fixed effects, , and a design matrix, Z, for the random effects, u:23,24
(8) |
Here Y is a column vector containing all the continuous response values from all the participants from all studies, with length therefore equal to the total number of participants across studies. β is a column vector containing all the unknown fixed (mean) effects in the model, u contains the study-level random effects (typically assumed to follow a multivariate normal distribution with mean vector 0 and variance matrix G), and e contains the participant-level residuals (typically assumed to follow a multivariate normal distribution with mean vector 0 and variance matrix R). A detailed example is given in the supplementary material 1(a), for a one-stage ANCOVA meta-analysis model with a random treatment effect.20
Let V be the variance of Y conditional on X, which can be expressed as:23
(9) |
The supplementary material 1(a) gives the specification of V for a one-stage ANCOVA model (also see the example in Section 4.1). The generalised least squares procedure minimises with respect to by differentiating with respect to and setting the first derivative to zero; this gives the well-known solution of .23,24 Note that V must also be estimated simultaneously alongside , through an iterative procedure until convergence is achieved. However, here we assume V is considered ‘known’ when used within the solution for , and thus V is usually replaced with its estimate.
The estimated variance matrix of can be obtained as the inverse of Fisher's observed information matrix by , where again V is usually replaced with its estimate and assumed known. Each diagonal element of gives the estimated variance of one of the parameter estimates in the model. For instance, for the ANCOVA example in the supplementary material 1(a), is a 7 by 7 matrix, and the element (7,7) of the matrix gives , the variance of the treatment effect, .
3.1.2 Decomposing Fisher’s information matrix
Let be the estimated variance matrix of after a one-stage IPD meta-analysis including all participants from all trials, with the diagonal elements for this matrix giving the corresponding variance of each parameter estimate (e.g. ). Fisher’s observed information matrix for is .
We now generalise the decomposition outlined in Section 2.3. That is, we decompose Fisher’s information matrix for into the sum of study-specific information matrices, using . This assumes the meta-analysis contains independent studies, and thus, although we give the general form of model (8) above, we are only focusing on meta-analysis with independent studies.
To obtain , we suggest re-computing Fisher’s information for after removing the participants in all but the ith study, whilst keeping the remaining elements of X and V exactly as specified/estimated in the full meta-analysis of all studies. This extends the observation noted under equation (5) that can be computed by re-fitting the meta-analysis model but with just study i included, whilst holding remaining variance estimates at their values from the full analysis. Therefore, the user must specify and derive,
(10a) |
where is the reduced design matrix containing rows just for participants in study i, and is the corresponding reduced variance matrix with entries as estimated in the full analysis.
An alternative approach is to derive using,
(10b) |
where X is as specified in the full analysis and is held to be the same as V in the full analysis, but for those patients not in study i the associated diagonal elements of V are replaced with a very large number (e.g. 1000000000) and all associated covariance terms set to zero (thus, the ‘’ notation denotes the elements of V have changed for all participants except those in study i). This data augmentation approach ensures that participants external to study i have negligible contribution toward . Appendix 1 shows how SAS Proc Mixed can be used for this purpose, capitalising on the ‘parms’ statement;24 it also conveniently invokes the fast SAS estimation procedure to derive , and thus the user avoids needing to specify design and variance matrices themselves.
3.1.3 Derivation of percentage study weights
We now propose how to derive percentage study weights in one-stage, multi-parameter IPD models by generalising the approach outlined in equations (6) and (7) for single parameter meta-analysis models. Recall that in equations (6) and (7), we decomposed the total variance of a parameter estimate into the sum of independent Wi terms (), and used this to derive percentage weights by comparing each Wi with the total variance. For one-stage models, we now have a multi-parameter situation, and so aim to decompose the variance matrix () into the sum of independent weight matrices, say. This can be achieved by generalising equation (6) by utilising the decomposition of Fisher’s information matrix, as follows:
(11) |
Equation (11) uses the variance scale (left-hand side), whilst utilising (within the right-hand side) the property that the total information matrix can be decomposed into the sum of the . This is consistent with the framework for weights in the simpler, single-parameter meta-analysis model (as derived in equation (6)), and follows the same decomposition of the variance as shown for multivariate meta-analysis by Jackson et al.25 It therefore forms the basis of our proposal for deriving percentage study weights in all meta-analysis models containing independent studies. It provides study-specific weight matrices (), and these sum to give the total variance matrix for .
For each parameter estimate within , percentage study weights can now be derived by comparing the corresponding diagonal entries of and . So, if the parameter corresponding to row r of is of interest, we can derive
(12) |
where the ‘r,r’ notation refers to the element (r,r) of the corresponding matrix. For the ANCOVA example in the supplementary material 1(a), there are seven parameter estimates and so each is a 7 by 7 matrix; the treatment effect parameter (θ) is the seventh, and so the values of and are needed to derive the percentage study weight toward the treatment effect estimate, .
It is important to note that the diagonal elements of each can be different. Therefore, the percentage weight of study i may differ for each parameter in the model, and so should be reported separately.
3.2 Meta-regression models
Meta-regression models extend model (3) by including study-level covariates, and could be applied in the second stage of a two-stage approach. Such models are themselves general linear models containing multiple parameters, and so percentage study weights toward a parameter can be derived akin to the approach described above. That is, one needs to:
fit the meta-regression and obtain ,
derive using either equation (10a) or (10b), which corresponds to Fisher’s information matrix from a ‘meta-regression’ of just study i, with within and between-study variances held fixed at their values from the full analysis,
derive ,
calculate percentage study weights for each parameter using equation (12).
An example is given in Section 4.2.
3.3 Multivariate meta-analysis and network meta-analysis models
A multivariate meta-analysis or multivariate meta-regression model jointly synthesises multiple correlated effects simultaneously (e.g. for multiple outcomes), whilst accounting for their correlation.26 Typically, they assume a multivariate normal distribution within and between studies, and thus are a multiple outcome extension of univariate models (2) and (3). As such, they also fall within the linear mixed model framework of equation (8),27 and so percentage study weights in multivariate models can also be derived using our proposal based on equations (11) and (12). Indeed, these will produce percentage weights that correspond exactly to those proposed by Jackson et al. for multivariate meta-analysis and implemented within the ‘mvmeta’ module of Stata.28 Jackson et al. derive their percentage weights by utilising a decomposition of the score statistic, but this reduces to the same proposal as in equation (12).
Multivariate meta-analysis is illustrated further in Section 4.3, in the context of a network meta-analysis of multiple treatment comparisons. Like an IPD meta-analysis, a network meta-analysis can be conducted as either a two- or a one-stage approach. The two-stage approach first derives treatment effect estimates (contrasts) between pairs of treatments in each study (together with their variances and correlations), and then synthesises them in a multivariate meta-analysis model.18 This can be implemented in, for example, the Stata module ‘network’,29 and provides percentage weights according to the approach of Jackson et al.25 An alternative one-stage approach is possible,30,31 and rather fits the framework in Section 3.1 for a continuous outcome or Section 3.4 for a binary outcome.
3.4 One-stage meta-analysis using generalised linear mixed models
We now extend the principles above to generalised linear mixed models, for example to derive percentage study weights in a multi-parameter one-stage IPD meta-analysis of binary outcomes. This is especially important when events are rare, as models utilising a more exact (e.g. binomial) within-study likelihood are then preferred.32 Further, network meta-analysis of binary outcomes typically uses a one-stage approach.33
3.4.1 Model specification
The generalised linear mixed model can be expressed as,23
(13) |
and Y is the vector of patient responses, β is a column vector containing all the unknown fixed parameter effects in the model, X and Z the design matrices, is the expected value of Y (conditional on X, Z and u), u contains the study-level random effects and e contains the participant-level residuals. As in the linear mixed model (8), u are typically assumed to follow a multivariate normal distribution with mean vector 0 and variance matrix G. We can write , where R is the residual variance matrix, var(e). However, unlike the linear mixed model, V is not easily specified; the random effects and residuals are on different scales, and therefore V is no longer a simple linear addition of the residual variance matrix plus random effects variance matrix. Brown and Prescott note that a first-order approximation to V is,23
(14) |
where B is a matrix of variance terms, relating to the underlying distribution of data Y. For example, for independent responses from the Bernoulli and binomial distributions, B would be a diagonal matrix with diagonal entries of , where post-estimation would be replaced by , the best linear unbiased predictor (BLUP) of from the model (otherwise known as the empirical Bayes estimates). These are usually available post-estimation in statistical software. For independent responses from a Poisson distribution, the diagonal terms in B would simply be the number of predicted events (counts). For further explanation about V, we refer to supplementary material 1(b). If residuals are uncorrelated then R = ,23 where A is typically a matrix of numerical constants that again depends on the distribution of the data. For Bernoulli data and Poisson data with no offset term, A is the identity matrix. For binomial data, A is a diagonal matrix with diagonal entries , where is the total number of attempts.
3.4.2 Decomposing Fisher’s information matrix
To acquire a decomposition of Fisher’s observed information matrix for , we propose using the solution for based on maximising the pseudo-likelihood for a linearised pseudo-variable, Y*, as proposed by Wolfinger and O-Connell.34 This transforms the response variable Y to an approximately linear scale, Y*, and allows the generalised least squares solution for to be used. Therefore, our use of the pseudo-likelihood approach is a convenient way to provide a tractable solution for in order to decompose it using , and then proceed to derive percentage study weights as outlined for general linear mixed models. The process is as follows.
First, the user should calculate based on the pseudo-likelihood estimation solution, which is given by Brown and Prescott as,23
(15) |
where
(16) |
and all matrices are as defined previously. The elements of G, B and R are forced to have their estimated values from the full meta-analysis, as obtained from the model estimation of choice (for example, Gauss-Hermite quadrature). For B, this requires the user to specify (functions of) BLUP values (such as for Bernoulli or binomial responses).
As , the can be obtained using,
(17a) |
and ‘i’ again refers to the matrix as specified in the full analysis, except with the removal of participants except those in study i. As previously noted for equation (10b), a potentially simpler way of deriving is to compute
(17b) |
where X is as specified in the full analysis, and is the same as in the full analysis, except that all those participants external to study i have corresponding diagonal elements of V replaced with a very large number (e.g. 1000000000), and all covariance terms set to zero. This data augmentation approach ensures that all participants, except those in study i, have negligible contribution.
3.4.4 Deriving percentage study weights
By obtaining and using equations (15) and (17a/17b), respectively, we can now use equation (11) to derive a weight matrix () for each study, and then derive percentage study weights for each parameter using equation (12). An example is given in Section 4.3 for network meta-analysis.
3.5 Study information is not only based on the number of participants
Kontopantelis and Reeves35 developed a Stata module for generating forest plots from a one-stage, multi-parameter IPD meta-analysis, and state that ‘Patient weights are uniform and therefore each study’s weight is the ratio of its participants over the total number of participants across all studies’. However, this will not usually be correct as the contribution of each study toward Fisher’s total information of a parameter estimate depends on their patients’ elements within , and these are not necessarily the same. For example, in a one-stage ANCOVA meta-analysis model (see Section 4.1 below), a study’s contribution to V will depend on the proportion of patients in the treatment group and its residual variance (). Even if all studies were of the same total sample size, those studies with large residual variances and a small proportion of treated patients are likely to have less information relative to other studies with small residual variances and an equal proportion of treated and control patients. This is illustrated further in Section 4.1.
4 Applied examples
We now illustrate the proposed methods with some examples.
4.1 IPD meta-analysis models to estimate a summary treatment effect for a continuous outcome
Wang et al.36 investigated whether active anti-hypertensive treatments lower systolic blood pressure (SBP) compared to placebo or no treatment. Ten trials were ultimately included, providing IPD for a total of 28,581 patients as detailed elsewhere.15,20 Table 1 provides the treatment effects in each trial for SBP at the end of follow-up, with the treatment effect defined as the mean difference in treated and control groups after adjusting for baseline SBP. Of interest is using meta-analysis to give a summary treatment effect.
Table 1.
Number of patients |
Final SBP (mm Hg) |
Treatment effect estimates in each trial |
Weight values used to derive percentage
study weights |
Percentage study weights |
|||||
---|---|---|---|---|---|---|---|---|---|
Trial ID (i) | Control/ treatment | Control mean (sd) | Treatment mean (sd) | Treatment effect estimatea , for each trial from first stage of two-stage IPD meta-analysis () | Two-stage model Wi | One-stage model (equation (18)) | Two-stage model: Weights derived using equation (12)b | One-stage model (equation (18)): Weights derived using equation (12) | One-stage model (equation (18)): Weights derived using number of patientsc |
1 | 750/780 | 139.75 (17.81) | 132.85 (16.68) | −6.66 (0.72) | 0.095 | 0.096 | 11.05 | 11.06 | 5.35 |
2 | 199/150 | 179.89 (22.15) | 165.06 (20.03) | −14.17 (4.73) | 0.064 | 0.064 | 7.34 | 7.32 | 1.22 |
3 | 82/90 | 170.45 (26.91) | 156.88 (21.26) | −12.88 (10.31) | 0.043 | 0.043 | 4.99 | 4.99 | 0.60 |
4 | 2371/2427 | 138.54 (21.26) | 130.09 (19.25) | −8.71 (0.30) | 0.101 | 0.102 | 11.69 | 11.70 | 16.79 |
5 | 3445/3546 | 144.25 (17.58) | 135.49 (16.32) | −8.70 (0.14) | 0.103 | 0.104 | 11.93 | 11.95 | 24.46 |
6 | 1337/1314 | 164.58 (19.71) | 153.99 (20.13) | −10.60 (0.58) | 0.097 | 0.098 | 11.25 | 11.25 | 9.28 |
7 | 2371/2365 | 156.24 (20.12) | 145.10 (19.05) | −11.36 (0.30) | 0.101 | 0.102 | 11.68 | 11.70 | 16.57 |
8 | 131/137 | 189.11 (21.90) | 171.46 (19.29) | −17.93 (5.82) | 0.058 | 0.058 | 6.72 | 6.68 | 0.94 |
9 | 1139/1252 | 156.55 (16.86) | 150.20 (15.84) | −6.55 (0.41) | 0.099 | 0.100 | 11.50 | 11.52 | 8.37 |
10 | 2297/2398 | 165.24 (16.33) | 154.87 (16.31) | −10.26 (0.20) | 0.102 | 0.103 | 11.84 | 11.86 | 16.43 |
Sum = var() = 0.87 | Sum = var() = 0.87 | 100 | 100 | 100 |
The estimated between-study variance () from REML was 7.13 in the one-stage analysis, and 7.18 in the two-stage analysis.
Treatment effect is the mean difference in final systolic blood pressure, after adjusting for baseline (i.e. using an ANCOVA model).
Equation (12) is equivalent to equation (7) here.
Kontopantelis and Reeves approach; θ is the summary treatment effect.
Figure 1 and Table 1 show the study-specific estimates, the summary meta-analysis results and the percentage study weights from a one- and a two-stage IPD meta-analysis. The two-stage approach used an ANCOVA model (1) in each study separately to obtain treatment effect estimates, which were then pooled using random effects model (3). The alternative one-stage approach also used an ANCOVA model with separate intercepts () and baseline adjustments () per trial and a random treatment effect:
(18) |
Parameters for model (18) are as defined previously; the supplementary material 1(a) also gives the matrix expression for the model. The key parameter of interest is the summary treatment effect, θ, defined as the mean difference in SBP at follow-up for the treatment minus the control group, after adjusting for baseline values.
A step-by-step guide to the analyses is given in Appendix 1, alongside SAS Proc Mixed code. REML was used to estimate both models. After fitting each model, was obtained directly from the software package, and then was derived using equation (10b) with variance components held fixed to those in the full meta-analysis. A weight matrix was then derived for each study using equation (11), and percentage study weights derived using equation (12).
The summary treatment effect estimate and 95% CI are almost identical for the one- and two-stage models (Figure 1). This is unsurprising as all trials are large, and one- and two-stage approaches are considered to be very similar in such situations.7 This is reinforced further by examining the percentage study weights (Figure 1, Table 1). These too are almost identical, reflecting that the studies have the same contribution regardless of whether a one- or two-stage approach is used.
In Table 1, we also present percentage study weights based on the proportion of participants, as suggested by Kontopantelis and Reeves.35 Clearly, these are somewhat different to those percentage values from the one- and two-stage models, as they ignore differences in residual variances and proportions of participants in the treatment group. For example, for study 3, the contribution based on Kontopantelis and Reeves is 0.60%, whilst in the one- and two-stage analyses, it is 4.99%.
4.2 IPD and meta-regression models to evaluate a treatment-covariate interaction
We now extend the hypertension example to compare percentage study weights in three IPD meta-analysis models that are commonly used for estimating treatment-covariate interactions. Interest is in whether age is a treatment effect modifier, and thus whether older patients are more (or less) likely to respond well to hypertension treatment than younger patients.
Let define the age of patient j in study i, and let be the mean age of all patients in study i. To estimate treatment-covariate interactions, an often used two-stage approach (which is also possible without IPD) is to fit model (1) in the first stage and then a meta-regression in the second stage such as:
(19) |
This performs a weighted regression of the overall treatment effect estimates () against the mean age in each study, with the across-study (‘A’) interaction of denoting how a one-unit in mean age increases (or decreases) the overall treatment effect in a study.
A better approach is a two-stage analysis of within-study (‘W’) interactions.6,15 Here, model (1) is extended to include an adjustment term for age and an interaction term between age and treatment effect (). This model is estimated in each study separately to give and , which are pooled, for example in a fixed effect meta-analysis akin to model (2):
(20) |
This leads to the summary within-study interaction, , which estimates the change in treatment effect for a one-unit increase in age.
Some researchers suggest one-stage models that amalgamate within-trial and across-trial interactions.37,38 This extends model (18) by including an adjustment term for age and an interaction term between age and treatment effect (), as follows:
(21) |
This leads to , the amalgamated interaction estimate (a weighted average of both within-trial and across-trial associations). This may be prone to ecological bias: that is, the difference between the summary within-trial interaction and the across-trial interaction may be non-zero. The ecological bias can be estimated by re-parameterising as follows,39
(22) |
where is the ecological bias (-) and other terms are as defined previously.
REML was used to fit each of the four models, and then percentage weights were obtained in the same manner as described in Appendix 1. The summary results and percentage study weights are very different for , and (Table 2). The from meta-regression model (19) is the largest and statistically significant at the 5% level. Model (21) also produces a significant , but the from model (20) is closer to zero and non-significant. The differences arise due to the discrepant use of within-trial and across-trial associations in each analysis, as mentioned. These leads to differences in the percentage study weights for each of , and . For example, study 1 has a 21.3% contribution toward in the meta-regression, a 8.9% contribution toward in model (21), and an even lower 4.5% contribution toward in model (20).
Table 2.
Percentage study weights using equation
(12) |
|||||
---|---|---|---|---|---|
Trial ID (i) | Mean age () | Model (19): Meta-regression to estimate (across-trial interaction) | Model (20): Two-stage approach to estimate (within-trial interaction) | Model (21): One-stage approach to estimate (amalgamated interaction) | Model (22): One-stage approach to estimate (ecological bias) |
1 | 42.27 | 21.26 | 4.52 | 8.88 | 17.33 |
2 | 69.63 | 2.91 | 0.66 | 1.24 | 2.36 |
3 | 73.34 | 3.64 | 0.81 | 1.56 | 3.13 |
4 | 41.56 | 24.90 | 11.64 | 15.13 | 21.72 |
5 | 45.27 | 16.68 | 28.78 | 25.62 | 19.63 |
6 | 70.42 | 6.15 | 1.45 | 2.70 | 4.93 |
7 | 71.59 | 8.16 | 17.13 | 14.81 | 10.21 |
8 | 75.95 | 7.56 | 0.29 | 2.15 | 5.87 |
9 | 66.58 | 2.35 | 8.35 | 6.79 | 3.74 |
10 | 70.23 | 6.40 | 26.36 | 21.11 | 11.09 |
Meta-analysis result (95% CI) | = −0.132 (−0.246, −0.018) | = −0.050 (−0.116, 0.016) | = −0.072 (−0.128, −0.0149) | = −0.086 (−0.220, 0.048) |
Using model (22) to quantify the ecological bias gives = −0.086, highlighting that the across-trial interaction estimate is more negative in magnitude than the within-trial interaction. The percentage study weights (Table 2) show that studies 1, 4 and 5 have the most contribution toward this bias with weights of 17.3%, 21.7% and 19.6%, respectively. These studies have substantially lower mean ages (close to 40 years) compared to other studies (close to 70 years) and were also the most dominant in the meta-regression.
4.3 One- and two-stage network (multivariate) meta-analysis of multiple treatments
Our final example is a network meta-analysis of 28 trials to compare eight thrombolytic treatments after acute myocardial infarction30; for brevity, we refer to these treatments as A to H. Of interest is a comparison of the odds of mortality by 30–35 days for each pair of treatments. Table 3 provides the raw data in terms of rij (the number of deaths) and nij (the total patients) for each treatment group, in each study. It is also provided as a Stata dataset in supplementary material 2. As there are eight treatments, there are 28 comparisons of interest overall; however, only 13 of these comparisons are directly reported in at least one trial. Further, the maximum number of trials providing direct evidence for a particular comparison is only eight (C versus A). Thus, there is potentially large opportunity to borrow strength in this example; in other words, each treatment comparison can utilise correlated and indirect information from other pairs of treatments (contrasts) in the network, alongside any direct evidence.
Table 3.
Study | Treatments evaluated (design)a | rA | nA | rB | nB | rC | nC | rD | nD | rE | nE | rF | nF | rG | nG | rH | nH |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | A B D | 1472 | 20173 | 652 | 10344 | 723 | 10328 | ||||||||||
2 | A C H | 1455 | 13780 | 1418 | 13746 | 1448 | 13773 | ||||||||||
3 | A C | 9 | 130 | 6 | 123 | ||||||||||||
4 | A C | 5 | 63 | 2 | 59 | ||||||||||||
5 | A C | 3 | 65 | 3 | 64 | ||||||||||||
6 | A C | 887 | 10396 | 929 | 10372 | ||||||||||||
7 | A C | 7 | 85 | 4 | 86 | ||||||||||||
8 | A C | 12 | 147 | 7 | 143 | ||||||||||||
9 | A C | 10 | 135 | 5 | 135 | ||||||||||||
10 | A D | 4 | 107 | 6 | 109 | ||||||||||||
11 | A F | 285 | 2992 | 270 | 2994 | ||||||||||||
12 | A G | 10 | 203 | 7 | 198 | ||||||||||||
13 | A H | 3 | 58 | 2 | 52 | ||||||||||||
14 | A H | 3 | 86 | 6 | 89 | ||||||||||||
15 | A H | 3 | 58 | 2 | 58 | ||||||||||||
16 | A H | 13 | 182 | 11 | 188 | ||||||||||||
17 | B E | 522 | 8488 | 523 | 8461 | ||||||||||||
18 | B F | 356 | 4921 | 757 | 10138 | ||||||||||||
19 | B F | 13 | 155 | 7 | 169 | ||||||||||||
20 | B G | 2 | 26 | 7 | 54 | ||||||||||||
21 | B G | 12 | 268 | 16 | 350 | ||||||||||||
22 | B H | 5 | 210 | 17 | 211 | ||||||||||||
23 | B H | 3 | 138 | 13 | 147 | ||||||||||||
24 | C G | 8 | 132 | 4 | 66 | ||||||||||||
25 | C G | 10 | 164 | 6 | 166 | ||||||||||||
26 | C G | 6 | 124 | 5 | 121 | ||||||||||||
27 | C H | 13 | 164 | 10 | 161 | ||||||||||||
28 | C H | 7 | 93 | 5 | 90 |
A = SK; B = AtPA; C = t − PA; D = SK + tPA; E = Ten; F = Ret; G = UK; H = ASPAC as referred to in Lu and Ades.30
There are many possible models to perform a network meta-analysis of this data. In particular, either a two-stage ‘contrast-based’ approach, or a one-stage ‘arm-based’ approach can be used, as defined by Salanti et al.33 For the two-stage approach, first the data (such as in Table 3) are used to calculate log odds ratio estimates, and their within-study variances and correlations, for each pair of treatments in each study. Second, a multivariate meta-analysis model can be written to synthesise all the effect estimates jointly, whilst accounting for their within and between-study correlations, which enables the incorporation of both direct and indirect evidence toward each summary treatment effect. A common reference group is needed and a design matrix used to express all available treatment comparison estimates (contrasts) in relation to this reference group. For example, with treatment A as the reference group, the log odds ratio estimate for C versus B can be expressed in terms of the log odds ratio estimate for C versus A and the log odds ratio for B versus A (as logOR(C,B) = logOR(C,A) − logOR(B,A)). Under an assumption of consistency in the direct and indirect evidence, the general form of this network meta-analysis model can be written as a linear mixed model,18,29
(23) |
where is a vector of treatment effect estimates from study i, which has ‘known’ within-study variance matrix ; is a column vector containing the basic parameters, which are the average treatment effects (for each treatment compared to the chosen reference treatment); is a design matrix linking the treatment effect estimates in study i to the basic parameters; and G contains the variances and covariance of the random effects. Typically, G contains diagonal entries of and off-diagonal entries of 0.5, which ensures there is a common between-study variance of for all treatment contrasts in the network. Missing treatment effect estimates in some studies can be accommodated easily, for example using data augmentation, and for further details on this model specification, we refer to White et al.18,29 Our key focus in this article is that equation (23) is a linear mixed model, and so percentage weights follow according to the description in Section 3.1. For the thrombolytics example, Stata code is given in supplementary material 3 for this model, and is a 7 by 1 column vector and G a 7 by 7 matrix, as there are eight treatments and thus seven basic parameters. The two-stage approach assumes effects follow a multivariate normal distribution both within and between studies. This is an approximation, and a more exact approach is to model the raw data in Table 3 directly in a one-stage, arm-based model that includes a binomial likelihood within studies. These data are essentially the IPD here (as no patient-level covariates are of interest), and so this can be viewed as a one-stage IPD model that falls within the generalised linear mixed model framework outlined in Section 3.4. The general specification of this model can be written as
where is the probability of death for patients in treatment group j of study i; and are the number of participants and events, respectively, in study i for treatment group j; the are separate study intercept terms, which relate to a chosen reference group in that study; are the study-specific true treatment effects (in relation to the reference group); and , and G are as defined above. For the thrombolytics example, this one-stage model can be written as,
(24) |
where are separate study intercept terms that relate to the reference group in that study; the BAij to HAij terms are either 1, 0 or −1 depending on the treatment group that corresponds to ; is a 7 by 1 column vector containing the basic parameters, which are the seven summary treatment effects (for B compared to A, up to H compared to A); and G is a 7 by 7 matrix with diagonal entries of and off-diagonal entries of 0.5. The model is explained in more detail elsewhere,18,31 but supplementary material 3 provides SAS code to fit the model and details the coding of the BAij to HAij variables more explicitly, plus the subsequent derivation of percentage study weights.
The seven summary treatment effect results are given in Table 4 for both the one- (24) and the two-stage (multivariate) approach, with corresponding percentage study weights. The estimate of was held at the same value in both models, to illustrate a comparison of summary estimates and percentage weights for one- and two-stage network meta-analyses when was the same. We chose a of 0.00023, which was the REML estimate from the two-stage approach. The ML estimate from the one-stage approach was zero and thus only slightly smaller. The summary estimates and percentage study weights barely changed when using = 0, but for completeness, these are shown in supplementary material 4.
Table 4.
Percentage weights from the two-stage
network (multivariate) meta-analysis |
Percentage weights from the one-stage
network meta-analysis |
|||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
B vs A | C vs A | D vs A | E vs A | F vs A | G vs A | H vs A | B vs A | C vs A | D vs A | E vs A | F vs A | G vs A | H vs A | |
Study 1 | 81.14 | 0.01 | 97.70 | 26.67 | 18.44 | 0.53 | 0.01 | 80.62 | 0.01 | 97.65 | 26.44 | 18.24 | 0.56 | 0.03 |
Study 2 | 0.02 | 58.05 | 0 | 0.01 | 0 | 0.23 | 90.34 | 0.03 | 57.88 | 0 | 0.01 | 0.01 | 0.22 | 89.78 |
Study 3 | 0 | 0.35 | 0 | 0 | 0 | 0 | 0.06 | 0 | 0.36 | 0 | 0 | 0 | 0 | 0.06 |
Study 4 | 0 | 0.14 | 0 | 0 | 0 | 0 | 0.02 | 0 | 0.17 | 0 | 0 | 0 | 0 | 0.03 |
Study 5 | 0 | 0.15 | 0 | 0 | 0 | 0 | 0.02 | 0 | 0.15 | 0 | 0 | 0 | 0 | 0.02 |
Study 6 | 0 | 39.18 | 0 | 0 | 0 | 0.15 | 6.33 | 0.01 | 39.14 | 0 | 0 | 0 | 0.15 | 6.39 |
Study 7 | 0 | 0.25 | 0 | 0 | 0 | 0 | 0.04 | 0 | 0.27 | 0 | 0 | 0 | 0 | 0.04 |
Study 8 | 0 | 0.43 | 0 | 0 | 0 | 0 | 0.07 | 0 | 0.46 | 0 | 0 | 0 | 0 | 0.07 |
Study 9 | 0 | 0.33 | 0 | 0 | 0 | 0 | 0.05 | 0 | 0.37 | 0 | 0 | 0 | 0 | 0.06 |
Study 10 | 0.05 | 0 | 0.55 | 0.02 | 0.01 | 0 | 0 | 0.05 | 0 | 0.57 | 0.02 | 0.01 | 0 | 0 |
Study 11 | 10.58 | 0 | 0.99 | 3.48 | 46.54 | 0.07 | 0 | 10.52 | 0 | 0.97 | 3.45 | 46.50 | 0.07 | 0 |
Study 12 | 0.13 | 0.07 | 0.01 | 0.04 | 0.03 | 19.36 | 0.01 | 0.13 | 0.07 | 0.01 | 0.04 | 0.03 | 19.28 | 0.01 |
Study 13 | 0 | 0.03 | 0 | 0 | 0 | 0 | 0.18 | 0 | 0.03 | 0 | 0 | 0 | 0 | 0.18 |
Study 14 | 0 | 0.05 | 0 | 0 | 0 | 0 | 0.29 | 0 | 0.05 | 0 | 0 | 0 | 0 | 0.33 |
Study 15 | 0 | 0.03 | 0 | 0 | 0 | 0 | 0.18 | 0 | 0.03 | 0 | 0 | 0 | 0 | 0.18 |
Study 16 | 0 | 0.14 | 0 | 0 | 0 | 0 | 0.86 | 0 | 0.14 | 0 | 0 | 0 | 0 | 0.86 |
Study 17 | 0 | 0 | 0 | 67.13 | 0 | 0 | 0 | 0 | 0 | 0 | 67.21 | 0 | 0 | 0 |
Study 18 | 5.82 | 0 | 0.54 | 1.91 | 33.80 | 0.04 | 0 | 5.78 | 0 | 0.53 | 1.90 | 33.86 | 0.04 | 0 |
Study 19 | 0.12 | 0 | 0.01 | 0.04 | 0.68 | 0 | 0 | 0.13 | 0 | 0.01 | 0.04 | 0.74 | 0 | 0 |
Study 20 | 0.11 | 0.03 | 0.01 | 0.04 | 0.03 | 6.72 | 0 | 0.13 | 0.03 | 0.01 | 0.04 | 0.03 | 8.17 | 0 |
Study 21 | 0.51 | 0.12 | 0.05 | 0.17 | 0.12 | 31.02 | 0.02 | 0.50 | 0.11 | 0.05 | 0.16 | 0.11 | 30.45 | 0.02 |
Study 22 | 0.77 | 0.09 | 0.07 | 0.25 | 0.17 | 0 | 0.56 | 1.05 | 0.12 | 0.10 | 0.35 | 0.24 | 0 | 0.76 |
Study 23 | 0.49 | 0.06 | 0.05 | 0.16 | 0.11 | 0 | 0.35 | 0.76 | 0.09 | 0.07 | 0.25 | 0.17 | 0 | 0.55 |
Study 24 | 0.08 | 0.09 | 0.01 | 0.03 | 0.02 | 12.07 | 0.01 | 0.08 | 0.08 | 0.01 | 0.02 | 0.02 | 11.04 | 0.01 |
Study 25 | 0.11 | 0.12 | 0.01 | 0.04 | 0.03 | 17.24 | 0.02 | 0.12 | 0.13 | 0.01 | 0.04 | 0.03 | 17.77 | 0.02 |
Study 26 | 0.08 | 0.09 | 0.01 | 0.03 | 0.02 | 12.55 | 0.01 | 0.08 | 0.09 | 0.01 | 0.03 | 0.02 | 12.23 | 0.01 |
Study 27 | 0 | 0.14 | 0 | 0 | 0 | 0 | 0.36 | 0 | 0.14 | 0 | 0 | 0 | 0 | 0.37 |
Study 28 | 0 | 0.07 | 0 | 0 | 0 | 0 | 0.19 | 0 | 0.07 | 0 | 0 | 0 | 0 | 0.19 |
Total | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
Summary log odds ratio (s.e.) | −0.161 (0.046) | 0.002 (0.032) | −0.044 (0.049) | −0.156 (0.080) | −0.113 (0.062) | −0.197 (0.222) | 0.014 (0.039) | −0.166 (0.046) | 0.003 (0.032) | −0.045 (0.049) | −0.160 (0.080) | −0.117 (0.062) | −0.197 (0.219) | 0.017 (0.039) |
The between-study variance was estimated at 0.000231 for the two-stage analysis, and kept at this value for the one-stage analysis; this was done to illustrate a comparison of the summary results and percentage weights for one- and two-stage models when the between-study variance was the same. Results for different between-study variance estimates are given in supplementary material 4.
Table 4 shows that the summary effects and percentage study weights are almost identical for both one- and two-stage models. Thus, the multivariate normal approximation to the one-stage binomial approach performs well, and the contribution of each study is barely changed. In other network meta-analysis applications, this may not be the case, especially when there are rare events as then the one-stage approach is preferred.40
The described one- and two-stage models are known as ‘consistency’ models as they assume direct and indirect evidence are consistent (in agreement). One can extend them to add inconsistency terms for each treatment contrast, either by including them as additional random effects with mean zero,41–43 or by including them as fixed effects.18,31 When the latter is done for this dataset, as described by White et al., there is generally strong support for the consistency model, as seven of the eight inconsistency parameters are smaller than their standard errors. The only inconsistency parameter that is statistically significant (p = 0.024) relates to the indirect evidence toward the H versus B treatment comparison. Though this may be a chance finding, it is useful to examine it. Derivation of percentage study weights is helpful here to reveal which studies are contributing most toward the apparent inconsistency. Just four studies contribute: study 1 (17.1%), study 2 (16.8%), study 22 (39.6%) and study 23 (26.5%). Studies 22 and 23 have about two thirds of the weight, and they both compare H and B. They represent the direct evidence, which is discrepant from the indirect evidence coming from studies 1 and 2, which compare A, B and D, and A, C and H, respectively. Therefore, the inconsistency is arising from the indirect comparison through A (i.e. the loop involving A, B and H), and our percentage weights reveal that studies 1 and 2 have a very similar contribution toward this indirect evidence. In situations like this, it is helpful for analysts to compare studies that give indirect information with those that give direct information, to see if there are any obvious differences that might explain the inconsistency. Nothing was identified here and, as mentioned above, there is generally little evidence to support inconsistency in this network meta-analysis overall.
5 Discussion
In this article, we proposed how to calculate percentage study weights in multi-parameter meta-analysis and meta-regression models. Figure 2 summarises the four necessary steps. The approach generalises how percentage weights are calculated in a traditional single parameter meta-analysis, and now allows percentage weights to be derived for more complex models including meta-regression, one-stage IPD analyses and multivariate and network meta-analysis. Though focus will usually be on deriving percentage study weights toward summary (treatment) effects and (treatment-covariate) interactions, our approach is applicable for any parameter that is specified within a meta-analysis model that can be expressed as a general or generalised linear mixed model. This was shown in our second and third examples, where percentage study weights toward bias and inconsistency terms were examined, which may be of more interest to methodologists than clinicians.
Our proposal assumes that studies are independent, and utilises a decomposition of Fisher’s observed information matrix to decompose the total variance matrix of parameter estimates into study-specific contributions, from which percentage weights are derived. The variance scale is a sensible one for quantifying percentage weights, as the variance of a parameter estimate summarises the total information data toward that estimate, which comprises both direct (from the parameter itself) and indirect information (from other, correlated parameters in the model). Derivation of percentage study weights is simple for traditional (single parameter) meta-analysis models, and is straightforward to extend for one-stage (multiple parameter) general linear mixed models as the information matrix is not dependent on the parameter estimates themselves; thus the study-specific information matrix () can easily be derived by using any software that can re-fit the one-stage model including just study i, whilst allowing variance terms to be fixed. However, for generalised linear models, the residual variances are a function of the expected response values, and therefore it is harder to re-fit models whilst fixing variances after taking out studies (or cases44,45). Supplementary material 1 illustrates this with an example. Furthermore, the original scale of the analysis is non-linear and so identifying sums of information matrices is not immediate. For this reason, we proposed utilising the pseudo-likelihood approach of Wolfinger and O-Connell,34 which transforms the response data to a pseudo-linearised variable, so that the maximum likelihood (generalised least squares) solution for Fisher’s information matrix can again be utilised and decomposed.
In many situations, percentage study weights derived from a two-stage approach will be a close approximation to those from a one-stage approach, as two- and one-stage analyses often give similar meta-analysis results, as seen in examples 1 and 3.8 However, in our second example, there were substantial clinically and statistically important differences in the one- and two-stage models, as they were making discrepant use of between-study information. Similarly, when there are rare outcomes, then the percentage study weights for one- and two-stage models may differ, as the former uses a more exact (e.g. binomial) within-study likelihood and so summary results often differ.32,40 Recent evidence suggests one-stage models are increasingly being used.2
For time-to-event data, it has been shown how the Cox regression model can be fitted using a Poisson model due to the shared form of the contribution to the partial log-likelihood, by splitting follow-up time into as many intervals as there are events.46 Crowther et al. show how to perform a one-stage fixed or random effects IPD meta-analysis of survival data in this manner,47 which additionally provides the baseline hazard, and may be useful for prognostic modelling. Therefore, following this approach, meta-analysts can derive percentage study weights as described above for generalised linear mixed models. However, as each patient now has multiple rows (one for each interval), the approach for model estimation and subsequently derivation of percentage study weights may be computationally intensive. To address this, one can rather use larger intervals, such as quarter year or half year, to approximate the likelihood and improve computation time for derivation of percentage study weights. When analysing without the inclusion of continuous individual-level covariates, it may be possible to collapse the IPD to just a few rows per study to make the computation easier. This approach was used in the network meta-analysis example.
Our work complements the PRISMA-IPD reporting guidelines, which state that: ‘… the display of forest plots for key outcomes is advocated, irrespective of the type of approach to statistical analysis’.48 Percentage study weights should be routinely presented on forest plots, which in our experience, are rarely provided for IPD meta-analyses. Most software packages link forest plots with a two-stage meta-analysis. To our knowledge, the only exception is the ‘ipdforest’ module in Stata, which provides a forest plot with a one-stage summary result after either a one-stage linear or logistic (mixed) model is fitted. However, this module computes percentage study weights as the ratio of a study’s total participants over the total number of participants across all studies. Our first example (Table 1) showed that this does not correspond closely to percentage values derived using our approach. We therefore hope the ‘ipdforest’ module may be updated in the future.
Our work should also be applicable to other types of datasets involving independent clusters, where the weight of each cluster is of interest. This may include the analysis of a multi-centre randomised trial, or the analysis of an observational study with clustering by hospital, practice or region.
Finally, our described approach utilises Fisher’s information matrix for , but ignores Fisher’s information matrix for any estimated variance parameters. In other words, our derivation of percentage study weights ignores uncertainty in variance parameter estimates, and thus assumes that Fisher’s information matrix for is independent from Fisher’s information matrix for estimated (residual and between-study) variances. We consider this is sensible, as the most commonly used meta-analysis models either assume variance estimates are ‘known’ or, post-estimation, inflate confidence intervals for to account for uncertainty in variance estimates.49 For the latter situation, our approach should be valid as it focuses on decomposing the observed information toward itself, which does not change when post-estimation correction factors (such as Hartung-Knapp and robust variance estimators50–52) are applied in order to derive confidence intervals. Nevertheless, further consideration of this issue is needed, and in particular, whether our approach should be modified to situations where the uncertainty in variance estimates is propagated toward the estimation of itself.53 A further extension might consider percentage weights toward variance estimates, but this should be considered in a standard (pair-wise) meta-analysis first, before extension to multi-parameter (e.g. network) meta-analysis models.
In summary, we have proposed how to derive percentage study weights for multi-parameter meta-analysis and meta-regression models, which extend those for single parameter models. We hope this encourages researchers to reveal the contribution of each study toward meta-analysis results, and will be useful for methodologists aiming to understand and explain differences or potential biases in meta-analysis models.
Supplementary Material
Acknowledgements
We thank Ian White for helpful comments on an early draft of this work, and a reviewer for helpful feedback that helped us to improve the article upon revision.
Appendix 1
Example SAS Proc Mixed code to obtain percentage study weights toward one-stage IPD meta-analysis model (18)
The method described below is a way of utilising existing software to obtain percentage study weights in a convenient way for an IPD meta-analysis model with a continuous outcome. It avoids the user needing to specify the entire X and V matrices by hand. It can also be applied to a meta-regression model, as this can also be expressed within the general linear mixed model framework.
Step (i): Fit one-stage IPD meta-analysis model (18) to all studies
proc mixed cl method = reml data = hypertension;
class idnr trial;
/* save the estimated variance matrix of the parameter estimates as a dataset ‘varbfull’ */
ods listing;
ods output covb = varbfull;
/* write down the model, where sbpl is final systolic blood pressure, idnr is the unique id number of the patient, trial is the trial identifier, sbpi is the baseline systolic blood pressure, and treat is the treatment group identifier */
model sbpl = trial trial*sbpi treat / noint s cl covb;
/* place a random effect on the treatment effect to allow for between-trial heterogeneity */
random treat / type = un subject = trial;
/* the following ensures a separate residual variance per trial */
repeated / type = un subject = idnr(trial) group = trial;
run;
This provides the summary meta-analysis results, including , the residual variances (one per study), the between-study variance, and , the latter which is saved as ‘varbfull’. There are 21 estimated parameters in the main equation, and parameter 21 is the treatment effect (θ). Therefore, the (21,21) entry of = var() = 0.868.
Step (ii): implement equation 10(b) to obtain . For example, for study 1 do the following. Re-fit model (18) to all studies but for studies other than study 1, give the residual variances a large value (to ensure they have negligible contribution); and for study 1, hold fixed residual and between-study variances at those values from the full analysis.
proc mixed cl method = reml data = hypertension;
class idnr trial;
ods listing;
ods output covb = varb;
model sbpl = trial trial*sbpi treat / noint s cl covb;
random treat / type = un subject = trial;
repeated / type = un subject = trial(idnr) group = trial;
/* use the ‘parms’ statement to specify values for the residual variances and between-study variance */
parms
/* set the between-study variance at the value from the full analysis obtained from step (i) */
7.1295
/* fix the residual variance for study 1 at its value from the full analysis obtained from step (i) */
276.31
/* set the residual variances for study 2 to 10 at a large value */
100000000000
100000000000
100000000000
100000000000
100000000000
100000000000
100000000000
100000000000
100000000000
/* use ‘eqcons’ to ensure that SAS holds fixed the 11 specified variances */
/ eqcons = 1 to 11;
run;
We then simply need to invert the stored (named ‘varb’) to obtain
Step (iii): obtain a weight matrix () for each study, by twice multiplying its own information matrix by the variance matrix from the meta-analysis. Below is example code to do the matrix calculations for study 1:
proc iml;
/* convert the stored dataset ‘varb’ from the analysis of study 1 into a matrix /*
use varb;
read all var {Col1 Col2 Col3 Col4 Col5 Col6 Col7 Col8 Col9 Col10 Col11 Col12 Col13Col14 Col15 Col16 Col17 Col18 Col19 Col20 Col21} into varb;
/* invert this matrix to obtain Fisher’s information matrix for study 1 */
fish = inv(varb);
/* convert the stored dataset ‘varbfull’ into a matrix */
use varfull;
read all var {Col1 Col2 Col3 Col4 Col5 Col6 Col7 Col8 Col9 Col10 Col11 Col12 Col13Col14 Col15 Col16 Col17 Col18 Col19 Col20 Col21} into varfull;
/* obtain the weight matrix (W1) for study 1 */
weight = varfull*fish*varfull;
quit;
The element (21,21) of is 0.096
Step (iv): Apply equation (12) to obtain the percentage weights of interest. For example, the percentage weight of study 1 toward the summary treatment effect estimate is:
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Dan Jackson is supported by the Medical Research Council Unit Programme (number U105260558). Richard D Riley, Joie Ensor and Dan Jackson were supported by funding from a multivariate meta-analysis grant from the MRC Methodology Research Programme (grant reference number: MR/J013595/1). Danielle Burke is funded by a NIHR School for Primary Care Research Post-Doctoral Fellowship. The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR or the Department of Health.
Supplementary material
Supplementary material is available for this article online.
References
- 1.Riley RD, Lambert PC, Abo-Zaid G. Meta-analysis of individual participant data: rationale, conduct, and reporting. BMJ 2010; 340: c221–c221. [DOI] [PubMed] [Google Scholar]
- 2.Simmonds M, Stewart G, Stewart L. A decade of individual participant data meta-analyses: A review of current practice. Contemp Clin Trials 2015; 45: 76–83. [DOI] [PubMed] [Google Scholar]
- 3.Liberati A, Altman DG, Tetzlaff J, et al. The PRISMA statement for reporting systematic reviews and meta-analyses of studies that evaluate healthcare interventions: explanation and elaboration. BMJ 2009; 339: b2700–b2700. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Berkey CS, Hoaglin DC, Mosteller F, et al. A random-effects regression model for meta-analysis. Stat Med 1995; 14: 395–411. [DOI] [PubMed] [Google Scholar]
- 5.Debray TP, Moons KG, van Valkenhoef G, et al. Get real in individual participant data (IPD) meta-analysis: a review of the methodology. Res Synth Methods 2015; 6(4): 293–309. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Fisher DJ, Copas AJ, Tierney JF, et al. A critical review of methods for the assessment of patient-level interactions in individual participant data meta-analysis of randomized trials, and guidance for practitioners. J Clin Epidemiol 2011; 64: 949–967. [DOI] [PubMed] [Google Scholar]
- 7.Olkin I, Sampson A. Comparison of meta-analysis versus analysis of variance of individual patient data. Biometrics 1998; 54: 317–322. [PubMed] [Google Scholar]
- 8.Stewart GB, Altman DG, Askie LM, et al. Statistical analysis of individual participant data meta-analyses: a comparison of methods and recommendations for practice. PLoS One 2012; 7: e46042–e46042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Bowden J, Tierney JF, Simmonds M, et al. Individual patient data meta-analysis of time-to-event outcomes: one-stage versus two-stage approaches for estimating the hazard ratio under a random effects model. Res Synth Methods 2011; 2: 150–162. [DOI] [PubMed] [Google Scholar]
- 10.Debray TP, Moons KG, Abo-Zaid GM, et al. Individual participant data meta-analysis for a binary outcome: one-stage or two-stage? PLoS One 2013; 8: e60650–e60650. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Matthew T, Nordstorm K. Comparison of one-step and two-step meta-analysis models using individual patient data. Biometr J 2010; 52: 271–287. [DOI] [PubMed] [Google Scholar]
- 12.Jeng GT, Scott JR, Burmeister LF. A comparison of meta-analytic results using literature vs individual patient data. Paternal cell immunization for recurrent miscarriage. JAMA 1995; 274: 830–836. [PubMed] [Google Scholar]
- 13.Lambert PC, Sutton AJ, Abrams KR, et al. A comparison of summary patient-level covariates in meta-regression with individual patient data meta-analysis. J Clin Epidemiol 2002; 55: 86–94. [DOI] [PubMed] [Google Scholar]
- 14.Steinberg KK, Smith SJ, Stroup DF, et al. Comparison of effect estimates from a meta-analysis of summary data from published studies and from a meta-analysis using individual patient data for ovarian cancer studies. Am J Epidemiol 1997; 145: 917–925. [DOI] [PubMed] [Google Scholar]
- 15.Riley RD, Lambert PC, Staessen JA, et al. Meta-analysis of continuous outcomes combining individual patient data and aggregate data. Stat Med 2008; 27: 1870–1893. [DOI] [PubMed] [Google Scholar]
- 16.Riley RD, Simmonds MC, Look MP. Evidence synthesis combining individual patient data and aggregate data: a systematic review identified current practice and possible methods. J Clin Epidemiol 2007; 60: 431–439. [DOI] [PubMed] [Google Scholar]
- 17.Ahmed I, Sutton AJ, Riley RD. Assessment of publication bias, selection bias and unavailable data in meta-analyses using individual participant data: a database survey. BMJ 2012; 344: d7762–d7762. [DOI] [PubMed] [Google Scholar]
- 18.White IR, Barrett JK, Jackson D, et al. Consistency and inconsistency in network meta-analysis: model estimation using multivariate meta-regression. Res Synth Method 2012; 3: 111–125. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Higgins JP, Whitehead A, Turner RM, et al. Meta-analysis of continuous outcome data from individual patients. Stat Med 2001; 20: 2219–2241. [DOI] [PubMed] [Google Scholar]
- 20.Riley RD, Kauser I, Bland M, et al. Meta-analysis of randomised trials with a continuous outcome according to baseline imbalance and availability of individual participant data. Stat Med 2013; 32: 2747–2766. [DOI] [PubMed] [Google Scholar]
- 21.Whitehead A. Meta-analysis of controlled clinical trials, West Sussex: Wiley, 2002. [Google Scholar]
- 22.DerSimonian R, Laird N. Meta-analysis in clinical trials. Control Clin Trials 1986; 7: 177–188. [DOI] [PubMed] [Google Scholar]
- 23.Brown H, Prescott R. Applied mixed models in medicine, Chichester: John Wiley, 1999. [Google Scholar]
- 24.Littell RC, Milliken GA, Stroup WW, et al. SAS system for mixed models, Cary, NC, USA: SAS Institute Inc., 1996. [Google Scholar]
- 25.Jackson D, White IR, Price M, et al. Borrowing of strength and study weights in multivariate and network meta-analysis. Stat Methods Med Res 2017; 26: 2853–2868. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Van Houwelingen HC, Arends LR, Stijnen T. Advanced methods in meta-analysis: multivariate approach and meta-regression. Stat Med 2002; 21: 589–624. [DOI] [PubMed] [Google Scholar]
- 27.Jackson D, Riley RD, White IR. Multivariate meta-analysis: potential and promise. Stat Med 2011; 30: 2481–2498. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.White IR. Multivariate random-effects meta-regression: Updates to mvmeta. STATA J 2011; 11: 255–270. [Google Scholar]
- 29.White IR. Network meta-analysis. Stata J 2015; 15: 951–985. [Google Scholar]
- 30.Lu G, Ades AE. Assessing evidence inconsistency in mixed treatment comparisons. J Am Stat Assoc 2006; 101: 447–459. [Google Scholar]
- 31.Higgins JPT, Jackson D, Barrett JK, et al. Consistency and inconsistency in network meta-analysis: concepts and models for multi-arm studies. Res Synth Method 2012; 3: 98–110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Stijnen T, Hamza TH, Özdemir P. Random effects meta-analysis of event outcome in the framework of the generalized linear mixed model with applications in sparse data. Stat Med 2010; 29: 3046–3067. [DOI] [PubMed] [Google Scholar]
- 33.Salanti G, Higgins JP, Ades AE, et al. Evaluation of networks of randomized trials. Stat Methods Med Res 2008; 17: 279–301. [DOI] [PubMed] [Google Scholar]
- 34.Wolfinger R, O’Connell M. Generalized linear mixed models: a pseudo-likelihood approach. J Stat Comput Simul 1993; 48: 233–243. [Google Scholar]
- 35.Kontopantelis E, Reeves D. A short guide and a forest plot command (ipdforest) for one-stage meta-analysis. Stata J 2014; 13: 574–587. [Google Scholar]
- 36.Wang JG, Staessen JA, Franklin SS, et al. Systolic and diastolic blood pressure lowering as determinants of cardiovascular outcome. Hypertension 2005; 45: 907–913. [DOI] [PubMed] [Google Scholar]
- 37.Sutton AJ, Kendrick D, Coupland CA. Meta-analysis of individual- and aggregate-level data. Stat Med 2008; 27: 651–669. [DOI] [PubMed] [Google Scholar]
- 38.Jackson C, Best N, Richardson S. Improving ecological inference using individual-level data. Stat Med 2006; 25: 2136–2159. [DOI] [PubMed] [Google Scholar]
- 39.Riley RD, Steyerberg EW. Meta-analysis of a binary outcome using individual participant data and aggregate data. Res Synth Methods 2010; 1: 2–9. [DOI] [PubMed] [Google Scholar]
- 40.Hamza TH, van Houwelingen HC, Stijnen T. The binomial distribution of meta-analysis was preferred to model within-study variability. J Clin Epidemiol 2008; 61: 41–51. [DOI] [PubMed] [Google Scholar]
- 41.Law M, Jackson D, Turner R, et al. Two new methods to fit models for network meta-analysis with random inconsistency effects. BMC Med Res Methodol 2016; 16: 87–87. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Jackson D, Law M, Barrett JK, et al. Extending DerSimonian and Laird’s methodology to perform network meta-analyses with random inconsistency effects. Stat Med 2016; 35: 819–839. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Jackson D, Barrett JK, Rice S, et al. A design-by-treatment interaction model for network meta-analysis with random inconsistency effects. Stat Med 2014; 33: 3639–3654. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Williams DA. Generalized linear model diagnostics using the deviance and single case deletions. Appl Stat 1987; 36: 181–191. [Google Scholar]
- 45.Pregibon D. Logistic regression diagnostics. Ann Stat 1981; 9: 705–724. [Google Scholar]
- 46.Clayton D, Hills M. Statistical methods in epidemiology, Oxford: Oxford University Press, 1993. [Google Scholar]
- 47.Crowther MJ, Riley RD, Staessen JA, et al. Individual patient data meta-analysis of survival data using Poisson regression models. BMC Med Res Methodol 2012; 12: 34–34. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Stewart LA, Clarke M, Rovers M, et al. Preferred reporting items for systematic review and meta-analyses of individual participant data: the PRISMA-IPD Statement. JAMA 2015; 313: 1657–1665. [DOI] [PubMed] [Google Scholar]
- 49.Cornell JE, Mulrow CD, Localio R, et al. Random-effects meta-analysis of inconsistent effects: a time for change. Ann Intern Med 2014; 160: 267–270. [DOI] [PubMed] [Google Scholar]
- 50.Hartung J, Knapp G. A refined method for the meta-analysis of controlled clinical trials with binary outcome. Stat Med 2001; 20: 3875–3889. [DOI] [PubMed] [Google Scholar]
- 51.Hedges LV, Tipton E, Johnson MC. Robust variance estimation in meta-regression with dependent effect size estimates. Res Synth Methods 2010; 1: 39–65. [DOI] [PubMed] [Google Scholar]
- 52.Sidik K, Jonkman JN. Robust variance estimation for random effects meta-analysis. Comput Stat Data Anal 2006; 50: 3681–3701. [Google Scholar]
- 53.Biggerstaff BJ, Tweedie RL. Incorporating variability in estimates of heterogeneity in the random effects model in meta-analysis. Stat Med 1997; 16: 753–768. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.