Abstract
Individual patient-data meta-analysis of randomized controlled trials is the gold standard for investigating how patient factors modify the effectiveness of treatment. Because participant data from primary studies might not be available, reliable alternatives using published data are needed. In this paper, I show that the maximum likelihood estimates of a participant-level linear random effects meta-analysis with a patient covariate-treatment interaction can be determined exactly from aggregate data when the model's variance components are known. I provide an equivalent aggregate-data EM algorithm and supporting software with the R package ipdmeta for the estimation of the “interaction meta-analysis” when the variance components are unknown. The properties of the methodology are assessed with simulation studies. The usefulness of the methods is illustrated with analyses of the effect modification of cholesterol and age on pravastatin in the multicenter placebo-controlled regression growth evaluation statin study. When a participant-level meta-analysis cannot be performed, aggregate-data interaction meta-analysis is a useful alternative for exploring individual-level sources of treatment effect heterogeneity.
Keywords: Clinical trials, Meta-analysis, Random effects models, Statistical methods in Health service research
1. Introduction
Subgroup analyses are important for identifying possible sources of treatment effect heterogeneity (Wang and others, 2007). A formal way to assess subgroup effects is through the estimate of a treatment-covariate interaction in a regression analysis, the ‘covariate’ being the variable that defines the subgroups of interest (Brookes and others, 2001). The power and generalizability of this estimate can be improved using meta-analysis, where data from multiple compatible trials is combined to obtain a pooled estimate of the subgroup effect (Rothwell, 2005). However, available methods for studying subgroup effects with meta-analysis of aggregate data have important practical and inferential limitations.
Two approaches for estimating a treatment-covariate interaction with an aggregate-data meta-analysis have been discussed in the literature (Fisher and others, 2011). In ‘meta-analysis of interactions’ the meta-analyst obtains study estimates of a given treatment-covariate interaction and pools them across studies using a standard fixed or random effects model (Simmonds and Higgins, 2007). The major drawback of this method is that covariate-treatment interactions are not typically included in primary clinical trial reports. The second approach is meta-regression, which regresses a measure of treatment effect on aggregated covariates, typically the covariate means for the trial sample. Meta-regression is more practical than meta-analysis of interactions because the covariate sample characteristics and estimates of treatment effect are routinely reported in the primary studies, but the ecological association between a covariate and treatment is a potentially biased and inefficient estimate of the individual-level association (Berlin and others, 2002; Lambert and others, 2002).
Compared to meta-regression and meta-analysis of interactions, a treatment-covariate interaction estimated from a regression analysis of complete individual patient data (IPD), where both between- and within-trial information is included, is more powerful and less vulnerable to bias (Schmid and others, 2004; Riley and others, 2007; Koopman and others, 2008). Yet, owing to the difficulties of collecting IPD, a regression analysis of the original trial data will generally not be feasible (Riley and others, 2008). In fact, <5% of current published meta-analyses in the medical sciences are based on patient-level data (Kovalchik, 2012). For this reason, it would be useful to establish whether aggregate statistics are sufficient to obtain the estimates of an IPD meta-analysis with individual-level covariates.
Olkin and Sampson (1998) showed that the treatment contrasts in a fixed effects IPD meta-analysis are equivalent to the aggregate-data treatment effect estimates in a fixed effects meta-analysis. Mathew and Nordström(1999) later showed that this result holds when studies have heterogeneous residual variance. More generally, Lin and Zeng (2010) found that the efficiency loss in estimating the regression parameters for standard fixed effects parametric or semiparametric IPD models using meta-regression is small when maximum likelihood estimation is used and there is minimal residual between-trial heterogeneity. No study has considered to what extent these findings can be extended to models with random effects and individual-level associations.
In this paper, I show that the maximum likelihood estimates for an IPD linear random effects model with a patient–covariate-treatment interaction term for a categorical covariate can be determined exactly from published clinical trial data when the model's variance parameters are known. I then provide variance estimators and an optimization algorithm for estimating all of the model parameters when the variance components are unknown. The statistical properties of the estimation methodology are investigated in simulation studies. The methods are demonstrated with an aggregate-data meta-analysis of real clinical trial data to quantify how age and cholesterol modify the effectiveness of the statin pravastatin for the prevention of cardiovascular disease progression.
2. Model
A general form for a linear mixed effects meta-analytic model with continuous outcome is
![]() |
(2.1) |
Each quantity of(2.1) is a study-specific vector for
studies. The Yj is the nj×1 column vector of continuous outcomes based on
subjects, Xj is an nj×p matrix of covariates, and Zj is a subset of Xj containing q variables with random effects. The study random effects αj follow a shared multivariate normal distribution with unrestricted covariance–variance D, αj∼N(0,D). The within-trial errors are iid normal with study-specific variance,
, where Im is the m×m identity matrix.
The specific case of model(2.1) that will be considered in this paper is a random effects interaction model with a categorical covariate. Given tj treated and cj control subjects in the jth study, define the treatment group indicator,
, and the diagonal matrix with Gj elements on the diagonal as
. Suppose that the categorical covariate of interest has K categories and let the dummy vector indicating membership in the kth subgroup be Sj(k). In terms of these quantities, the interaction model's fixed and random effects design matrices are
![]() |
(2.2) |
The model specified by (2.2) incorporates trial heterogeneity in baseline and treatment effects αj through the bivariate normal random effects having covariance–variance
. The covariate and interaction effects are constant for all trials.
To see how the individual-level and standard aggregate-level approaches differ, consider the meta-regression model to investigate the treatment-covariate interactions of model (2.2),
![]() |
(2.3) |
Here,
is the study estimate of treatment effect. For the linear model with continuous outcome this is
, the mean difference in outcome between the treatment and control group. The parameter θj is a study random treatment effect, θj∼N(θ,τ2). Each ζj is a residual normal error
, whose variance is considered known. Importantly, meta-regression does not include individual-level information about covariates. Instead, covariates enter as the sample mean of their corresponding variable in the IPD model, in this case, the proportion of individuals in the kth subgroup,
. Although one could consider using the ecological parameters γ for inference about individual-level treatment-covariate interactions, these quantities are vulnerable to bias and inefficiency (Berlin and others, 2002; Lambert and others, 2002; Fisher and others, 2011). Consequently, meta-analysis experts and software developers generally recommend that the covariates of meta-regression be limited to study-level variables (Thompson and Higgins, 2002; Viechtbauer, 2007).
3. Estimation
3.1. Estimation with known variances
Maximum likelihood (Hartley and Rao, 1967) is a common method used to estimate the parameters of the linear mixed effects regression model. Given the variance components σj and D, the maximum likelihood estimates (MLEs) for the fixed and random effects of model(2.1) are (Laird and Ware, 1982),
![]() |
(3.1) |
with
. The variance for this estimator is
![]() |
(3.2) |
The empirical Bayes estimator for the random effects is
![]() |
(3.3) |
and has the variance
![]() |
(3.4) |
In the supplementary material (Section A available at Biostatistics online), I show that aggregate data is sufficient to compute the above MLEs and their variances under the “interaction meta-analysis”(2.2) with known variance components. However, in general, the variance components will not be known. Previous work of Kovalchik and Cumberland (2012) derived an aggregate-data estimate for the error of the patient-level interaction term in a general linearized random effects meta-analysis. The motivation in that study was to estimate what the power of the IPD test of interaction would be if the patient-level data of the primary studies were available. In the following subsections, I build on this work by providing an EM algorithm to estimate all parameters of the IPD random effects model defined by(2.2) from aggregate data when the model's variance components are unknown.
3.2. Estimation with unknown variances
Given the parameters αj and β, maximum-likelihood estimates for the variance components of model(2.2) are
and
, with
. We see that
can be reconstructed exactly from the aggregate-data form of the
. Because the maximum likelihood estimates in a random effects model have been recognized to underestimate the target parameters, the denominator J−1 is used in place of J.
The estimator for
can be formed from the sample variances
of the continuous outomce yij within each of the K subgroups. Variance decomposition yields the expression
![]() |
(3.5) |
where
is the study-specific mean estimate for the kth treatment-covariate subgroup. The above expressions are equivalent to the E-step of the EM algorithm for the linear mixed model of(2.2) (Laird and Ware, 1982).
When the subgroup-within-treatment sample variances
are not available, an estimate for the
can be obtained from the overall trial sample variance
![]() |
(3.6) |
using
to denote the global outcome mean for the jth study. Under a homogeneous residual variance model,
,
![]() |
3.3. Optimization algorithm
This section outlines a complete aggregate-data EM algorithm for the interaction meta-analysis based on the M-step described in Section3.1 and the E-step described in Section3.2. The algorithm estimates the effects and variance components of model (2.2) using the following sufficient aggregate data: the sample sizes (tkj and ckj) and outcome means (
and
) in each subgroup and treatment arm; the sample variance (
) for the outcomes of each trial.
Let ζ be some small convergence criterion, e.g. ζ=10−7.
- Initialization. Set initial values for
and
.
and
, with
, and
.- The subscript ‘.’ indicates the index over which the mean is taken.
- Iterative updating. At the rth iteration for
:
- Compute
and
. - Compute
and
.
- Termination rule.
- For r>1,
. - While Δ>ζ repeat(3)–(4).
4. Simulation study
The theoretical results have shown that grouped statistics are sufficient to construct an equivalent EM algorithm for the IPD interaction meta-analysis. Simulation studies were performed to assess the performance of the optimization algorithm for trial number and sample size settings that are typical of current meta-analyses of randomized trials (Kovalchik, 2012; Engels and others, 2000). In these analyses, yij were generated under model(2.2) for fixed β,
, and
. Separate simulations were run with a two-group and three-group categorical covariate. Covariate group assignment was drawn from a multinomial distribution with an equal probability for each group. Meta-analyses of {10,15,20} trials, 200 total subjects in each trial and equal allocation to treatment and control groups were considered. The aggregate-data used for estimation were the outcome means and sample sizes in each treatment arm and covariate subgroup, and the trial sample variances for the yij outcomes. Summaries of the bias, standard error, mean standard error for the aggregate-data estimates of β and D were based on 1000 iterations of each simulation scenario.
The results of the simulation studies for the two-category covariate show that the aggregate-data approach yields unbiased fixed effects for all of the trial number and sample size conditions considered (Table1). Overall, the mean standard errors (MSEs) were in good agreement with the empirical standard errors (ESE). The estimates for the random effects variance were accurate for all conditions. The MSEs for the variance components are not reported because variance estimates for
are not considered in this work.
Table 1.
Bias and standard error of aggregate-data estimates of individual patient data linear mixed effects model with a two-group categorical patient-level covariate. Summaries are based on 1000 iterations.
| Trials | Parameter | Mean | ESE | MSE |
|---|---|---|---|---|
| 10 | β0=0.0 | 0.00 | 0.15 | 0.14 |
| β1=1.0 | 1.00 | 0.11 | 0.10 | |
| β2=0.0 | 0.00 | 0.05 | 0.06 | |
| β3=0.5 | 0.51 | 0.08 | 0.08 | |
| τ0=0.20 | 0.20 | 0.09 | ||
| τ1=0.10 | 0.09 | 0.05 | ||
| 15 | β0=0.0 | 0.00 | 0.12 | 0.11 |
| β1=1.0 | 1.00 | 0.09 | 0.08 | |
| β2=0.0 | 0.00 | 0.05 | 0.05 | |
| β3=0.5 | 0.50 | 0.06 | 0.06 | |
| τ0=0.20 | 0.19 | 0.08 | ||
| τ1=0.10 | 0.09 | 0.04 | ||
| 20 | β0=0.0 | 0.00 | 0.10 | 0.10 |
| β1=1.0 | 1.00 | 0.08 | 0.07 | |
| β2=0.0 | 0.00 | 0.04 | 0.04 | |
| β3=0.5 | 0.50 | 0.05 | 0.06 | |
| τ0=0.20 | 0.20 | 0.06 | ||
| τ1=0.10 | 0.09 | 0.04 |
ESE=empirical standard error; MSE=mean standard error β0=intercept β1=treatment β2=covariate β3=covariate-treatment interaction
The supplementary material (Section B available at Biostatistics online) shows that the statistical properties for the two-category covariate case were unchanged when a three-category covariate was used.
5. Application
The Regression Growth Evaluation Statin Study (REGRESS) (Jukema and others, 1995) was a placebo-controlled, multicenter study to assess the effect of treatment with the HMG-CoA reductase inhibitor pravastatin on progression and regression of coronary atherosclerosis in male patients with a serum cholesterol between 4 and 8 mmol/L (155 and 310 mg/dL). The primary endpoint of the study was the change from baseline in the average mean segment diameter of the coronary artery after 2 years of treatment. Less change is more indicative of treatment benefit. The study found that pravastatin was associated with a statistically significant mean difference of 0.04 mm in the mean change in segment diameter.
To demonstrate the paper's methodology, I investigate how baseline cholesterol and age influenced responsiveness to pravastatin using the grouped data reported in Table2. The data have been permuted within the treatment group and the study in order to be used for purely demonstrative purposes. The optimization algorithm of Section(3.3) was applied to estimate the aggregate-data IPD interaction model with a two-group categorization of baseline cholesterol (≤6, >6 mmol/L) and a three-group categorization of age (<50, 50–59, ≥60 years). To assess the performance of the methods, these estimates were compared with the results of complete IPD analyses, which were fit with the R package lme4 (Pinheiro and Bates, 2000).
Table 2.
Trial and subgroup summary statistics on change in mean segment coronary artery diameter (in mm) used to estimate interaction meta-analysis in the REGRESS study.
Cholesterol (mmol/L), (n) |
Age (years), (n) |
||||||
|---|---|---|---|---|---|---|---|
| Center | Pravastatin | Placebo | Pravastatin | Placebo | Sample variance | ||
| A | ≤6 | −0.2266 (17) | −0.1709 (23) | <50 | −0.1492 (10) | −0.2215 (13) | 0.0361 |
| >6 | −0.2406 (25) | −0.1830 (17) | 50−59 | −0.3164 (16) | −0.1485 (17) | ||
| ≥60 | −0.2071 (16) | −0.1636 (10) | |||||
| B | ≤6 | −0.0369 (6) | 0.3368 (9) | <50 | 0.2258 (1) | 0.3504 (4) | 0.0572 |
| >6 | 0.1077 (1) | 0.3701 (5) | 50−59 | −0.1049 (4) | 0.4273 (6) | ||
| ≥60 | 0.0401 (2) | 0.2291 (4) | |||||
| C | ≤6 | −0.0663 (10) | 0.1688 (13) | <50 | −0.0036 (2) | 0.1095 (6) | 0.0474 |
| >6 | −0.0553 (12) | 0.1278 (9) | 50−59 | −0.0867 (10) | 0.1658 (10) | ||
| ≥60 | −0.0452 (10) | 0.1718 (6) | |||||
| D | ≤6 | −0.0338 (15) | 0.1482 (12) | <50 | −0.0688 (8) | 0.1901 (9) | 0.0486 |
| >6 | −0.0533 (21) | 0.1554 (27) | 50−59 | −0.0730 (14) | 0.1562 (20) | ||
| ≥60 | −0.0038 (14) | 0.1141 (10) | |||||
| E | ≤6 | −0.2407 (30) | −0.1928 (21) | <50 | −0.2317 (13) | −0.1069 (15) | 0.0408 |
| >6 | −0.2975 (19) | −0.0383 (32) | 50−59 | −0.2440 (17) | −0.0962 (20) | ||
| ≥60 | −0.3007 (19) | −0.0969 (18) | |||||
| F | ≤6 | −0.4243 (23) | −0.5048 (20) | <50 | −0.3796 (10) | −0.6132 (10) | 0.0284 |
| >6 | −0.4206 (14) | −0.5313 (20) | 50−59 | −0.4372 (14) | −0.547 (12) | ||
| ≥60 | −0.4408 (13) | −0.4458 (18) | |||||
| G | ≤6 | 0.0517 (12) | −0.0344 (12) | <50 | 0.0606 (4) | −0.0614 (8) | 0.0343 |
| >6 | 0.0568 (11) | −0.0526 (15) | 50−59 | 0.0694 (12) | −0.0828 (8) | ||
| ≥60 | 0.0242 (7) | −0.0045 (11) | |||||
| H | ≤6 | 0.1675 (18) | 0.1311 (18) | <50 | 0.1371 (6) | 0.1972 (3) | 0.0236 |
| >6 | 0.1671 (9) | 0.2009 (9) | 50-59 | 0.1849 (11) | 0.1739 (13) | ||
| ≥60 | 0.1662 (10) | 0.1196 (11) | |||||
| I | ≤6 | 0.3068 (4) | 0.1794 (6) | <50 | 0.1172 (3) | 0.1880 (3) | 0.0564 |
| >6 | 0.0760 (10) | 0.0450 (6) | 50-59 | 0.1043 (7) | 0.0383 (4) | ||
| ≥60 | 0.2263 (4) | 0.1258 (5) | |||||
| J | ≤6 | −0.0468 (9) | −0.3057 (9) | <50 | −0.0854 (8) | −0.1818 (6) | 0.0392 |
| >6 | −0.0186 (16) | −0.1849 (11) | 50-59 | 0.0063 (8) | −0.2739 (13) | ||
| ≥60 | −0.0096 (9) | −0.1343 (1) | |||||
| K | ≤6 | −0.0602 (21) | 0.1178 (19) | <50 | 0.0341 (9) | 0.0841 (11) | 0.0335 |
| >6 | −0.0552 (9) | 0.1179 (10) | 50-59 | −0.1007 (14) | 0.1173 (15) | ||
| ≥60 | −0.0940 (7) | 0.2441 (3) | |||||
The aggregate-data and IPD meta-analyses yielded very similar results (Table3). Each found that a baseline cholesterol level less than or equal to 6 mmol/L is associated with a 0.04 mm greater reduction with pravastatin treatment, suggesting that subjects with higher cholesterol tend to have more treatment benefit from the statin. However, this difference was not statistically significant. The results for age, which are provided as supplementary material (Section C available at Biostatistics online), showed similar consistency between methods.
Table 3.
IPD interaction meta-analysis estimates to quantify effect modification of cholesterol for change in mean segment diameter of the coronary artery (in mm) using complete and aggregate data from the REGRESS study.
| Complete IPD |
Aggregate data (Table2) |
|||
|---|---|---|---|---|
| Term | Est. | 95% CI | Est. | 95% CI |
| Intercept | 0.0096 | (−0.1286, 0.1478) | 0.0097 | (−0.1278, 0.1462) |
| Pravastatin | −0.0865 | (−0.1858, 0.0128) | −0.0865 | (−0.1973, 0.0242) |
| Cholesterol ≤6 mmol/L | −0.0256 | (−0.0661, 0.0149) | −0.0256 | (−0.0662, 0.0149) |
| Pravastatin × Cholesterol ≤6 mmol/L | 0.0393 | (−0.0187, 0.0973) | 0.0401 | (−0.0179, 0.0983) |
| Residual variance, σ2 | 0.0327 | 0.0327 | ||
| Random effects variance, D | ![]() |
![]() |
||
The random effects of the interaction meta-analysis can be useful for evaluating intertrial heterogeneity. For the cholesterol analysis, the left panel of Figure1 shows the study-specific deviations from the population intercept; the right panel shows deviations from the population treatment effect.
Fig. 1.
Study-specific intercept and treatment random effects from interaction meta-analysis of cholesterol and pravastatin in the REGRESS study. Lines denote the 95% confidence interval.
6. Discussion
The methods provided in the present paper broaden the findings for linear models discussed by Olkin and Sampson (1998), Mathew and Nordström (1999), and Lin and Zeng (2010). In comparing the properties of patient-level and aggregate-data analyses, these authors focused on study-level effects and fixed-effect conditions. The present paper has shown that the estimates of an IPD random effects meta-analysis with an individual-level interaction term, for any categorical covariate, can be reliably estimated from published aggregate data. These are important extensions of earlier work because between-trial heterogeneity is common (Higgins and Thompson, 2002) and there is growing interest in exploring to what extent patient-level modifiers can explain trial differences in treatment effect.
There are a number of advantages to gathering participant level data that no current aggregate-based method can substitute. These include the ability to ensure uniformity in variable definitions and eligibility criteria and to diagnose model fit. It is because of these strengths that the presented methodology is not intended to compete with IPD meta-analysis but rather to provide a good aggregate-data approximation to a specific kind of IPD interaction model when participant-level data cannot be obtained. Owing to the heightened concern over individual-data security that has come with the genomic era (Homer and others, 2008), there is likely to be a growing demand for increasingly sophisticiated aggregate-data methods in the future.
The approach introduced here has important advantages over available aggregate-data methods. Unlike meta-regression, it allows the assessment of individual-level associations. And, in contrast to meta-analysis of interactions, there are fewer barriers to obtaining the data necessary to perform the analysis. Meta-analysis of interactions requires that primary investigators perform a regression analysis with an interaction term and make their findings available. The data needed for the method presented here can be obtained by stratifying the standard outcome analysis recommended by the CONsolidated Standards of Reporting Trials (CONSORT) (Schulz and others, 2010) on the covariate of interest.
The IPD random effects model considered in this paper was limited to a continuous outcome and a single categorical covariate with fixed covariate and interaction effects. Without requiring additional data from the primary studies, it is unclear how the paper's methods could be extended to include multiple covariates or to allow random covariate effects. Though it would be a challenge to extend the aggregate-data approach in this direction, the work of Lin and Zeng (2010) suggests that, with some additional assumptions, the single interaction model could be generalized to standard parametric and semiparametric models of binary and time-to-event outcomes.
In clinical trial research, the aims to personalize treatment and avoid spurious findings from subgroup analyses have often seemed irreconcilable (Lagakos, 2006). The approach in this paper offers a practical solution to this dilemma for trials with a continuous endpoint. The methodology and supporting software give clinical researchers a reliable tool to approximate an individual level assessment of effect modification with aggregate data from multiple trials of the same treatment. To facilitate the utility of this tool, clinical trialists should be encouraged to make stratified analyses of the study endpoint available upon request or as supplementary material.
Supplementary material
Supplementary material is available at http://biostatistics.oxfordjournals.org.
Funding
This research was supported by the intramural research program of the NIH/NCI.
Supplementary Material
Acknowledgements
The author is grateful to Dr Erik van Iperen, Prof J. Wouter Jukema, and Prof Koos Zwinderman for providing the REGRESS data. Conflict of Interest: None declared.
References
- Berlin J. A., Santanna J., Schmid C. H., Szczech L. A., Feldman H. I. Anti-Lymphocyte Antibody Induction Therapy Study Group. Individual patient- versus group-level data meta-regressions for the investigation of treatment effect modifiers: ecological bias rears its ugly head. Statistics in Medicine. 2002;21:371–387. doi: 10.1002/sim.1023. [DOI] [PubMed] [Google Scholar]
- Brookes S. T., Whitley E., Peters T. J., Mulheran P. A., Egger M., Davey Smith G. Subgroup analyses in randomised controlled trials: quantifying the risks of false-positives and false-negatives. Health Technology Assessment. 2001;5:1–56. doi: 10.3310/hta5330. [DOI] [PubMed] [Google Scholar]
- Engels E. A., Schmid C. H., Terrin N., Olkin I., Lau J. Heterogeneity and statistical significance in meta-analysis: an empirical study of 125 meta-analyses. Statistics in Medicine. 2000;19:1707–1728. doi: 10.1002/1097-0258(20000715)19:13<1707::aid-sim491>3.0.co;2-p. [DOI] [PubMed] [Google Scholar]
- Fisher D. J., Copas A. J., Tierney J. F., Parmar M. K. A critical review of methods for the assessment of patient-level interactions in individual participant data meta-analysis of randomized trials, and guidance for practitioners. Journal of Clinical Epidemiology. 2011;64:949–967. doi: 10.1016/j.jclinepi.2010.11.016. [DOI] [PubMed] [Google Scholar]
- Hartley H. O., Rao J. N. K. Maximum-likelihood estimation for mixed analysis of variance model. Biometrika. 1967;54:93–108. [PubMed] [Google Scholar]
- Higgins J. P. T., Thompson S. G. Quantifying heterogeneity in a meta-analysis. Statistics in Medicine. 2002;21:1539–1558. doi: 10.1002/sim.1186. [DOI] [PubMed] [Google Scholar]
- Homer N., Szelinger S., Redman M., Duggan D., Tembe W., Muehling J., Pearson J. V., Stephan D. A., Nelson S. F., Craig D. W. Resolving individuals contributing trace amounts of DNA to highly complex mixtures using high-density SNP genotyping microarrays. PloS Genetics. 2008;4:e1000167. doi: 10.1371/journal.pgen.1000167. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jukema J. W., Bruschke A. V., van Boven J. A., Reiber J. H., Bal E. T., Zwinderman A. H., Jansen H., Boerma G. J., van Rappard M. F., Lie K. I. Effects of lipid lowering by pravastatin on progression and regression of coronary artery disease in symptomatic men with normal to moderately elevated serum cholesterol levels. The Regression Growth Evaluation Statin Study (REGRESS) Circulation. 1995;91:2528–2540. doi: 10.1161/01.cir.91.10.2528. [DOI] [PubMed] [Google Scholar]
- Koopman L., van der Heijden G. J., Hoes A. W., Grobbee D. E., Rovers M. M. Empirical comparison of subgroup effects in conventional and individual patient data meta-analyses. International Journal of Technology Assessment in Health Care. 2008;24:358–361. doi: 10.1017/S0266462308080471. [DOI] [PubMed] [Google Scholar]
- Kovalchik S. A. Survey finds that most meta-analysts do not attempt to collect individual patient data. Journal of Clinical Epidemiology (in press) 2012 doi: 10.1016/j.jclinepi.2012.07.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kovalchik S. A., Cumberland W. G. Using aggregate data to estimate the standard error of a treatment-covariate interaction in an individual patient data meta-analysis. Biometrical Journal. 2012;54:370–384. doi: 10.1002/bimj.201100167. [DOI] [PubMed] [Google Scholar]
- Lagakos S. W. The challenge of subgroup analyses–reporting without distorting. New England Journal of Medicine. 2006;354:1667–1669. doi: 10.1056/NEJMp068070. [DOI] [PubMed] [Google Scholar]
- Laird N. M., Ware J. H. Random-effects models for longitudinal data. Biometrics. 1982;38:963–974. [PubMed] [Google Scholar]
- Lambert P. C., Sutton A. J., Abrams K. R., Jones D. R. A comparison of summary patient-level covariates in meta-regression with individual patient data meta-analysis. Journal of Clinical Epidemiology. 2002;55:86–94. doi: 10.1016/s0895-4356(01)00414-0. [DOI] [PubMed] [Google Scholar]
- Lin D. C., Zeng D. On the relative efficiency of using summary statistics versus individual-level data in meta-analysis. Biometrika. 2010;97:321–332. doi: 10.1093/biomet/asq006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mathew T., Nordström K. On the equivalence of meta-analysis using literature and using individual patient data. Biometrics. 1999;55:1221–1223. doi: 10.1111/j.0006-341x.1999.01221.x. [DOI] [PubMed] [Google Scholar]
- Olkin I., Sampson A. Comparison of meta-analysis versus analysis of variance of individual patient data. Biometrics. 1998;54:317–322. [PubMed] [Google Scholar]
- Pinheiro J. C., Bates D. M. Mixed-effects Models in S and S-PLUS. New York: Springer; 2000. Statistics and Computing. [Google Scholar]
- Riley R. D., Lambert P. C., Staessen J. C., Wang J., Gueyffier F., Thijs L., Boutitie F. Meta-analysis of continuous outcomes combining individual patient data and aggregate data. Statistics in Medicine. 2008;27:1870–1893. doi: 10.1002/sim.3165. [DOI] [PubMed] [Google Scholar]
- Riley R. D., Simmonds M. C., Look M. P. Evidence synthesis combining individual patient data and aggregate data: a systematic review identified current practice and possible methods. Journal of Clinical Epidemiology. 2007;60:431–439. doi: 10.1016/j.jclinepi.2006.09.009. [DOI] [PubMed] [Google Scholar]
- Rothwell P. M. Treating individuals 2. Subgroup analysis in randomised controlled trials: importance, indications, and interpretation. Lancet. 2005;365:176–186. doi: 10.1016/S0140-6736(05)17709-5. [DOI] [PubMed] [Google Scholar]
- Schmid C. H., Stark P. C., Berlin J. A., Landais P., Lau J. Meta-regression detected associations between heterogeneous treatment effects and study-level, but not patient-level, factors. Journal of Clinical Epidemiology. 2004;57:683–697. doi: 10.1016/j.jclinepi.2003.12.001. [DOI] [PubMed] [Google Scholar]
- Schulz K. F., Altman D. G., Moher D. the CONSORT Group. CONSORT 2010 Statement: Updated guidelines for reporting parallel group randomized trials. Annals of Internal Medicine. 2010;152:726–32. doi: 10.7326/0003-4819-152-11-201006010-00232. [DOI] [PubMed] [Google Scholar]
- Simmonds M. C., Higgins J. P. Covariate heterogeneity in meta-analysis: criteria for deciding between meta-regression and individual patient data. Statistics in Medicine. 2007;26:2982–2999. doi: 10.1002/sim.2768. [DOI] [PubMed] [Google Scholar]
- Thompson S. G., Higgins J. P. T. How should meta-regression analyses be undertaken and interpreted? Statistics in Medicine. 2002;21:1559–1573. doi: 10.1002/sim.1187. [DOI] [PubMed] [Google Scholar]
- Viechtbauer W. Conducting meta-analyses in R with the metafor package. Journal of Statistical Software. 2010;36:1–48. [Google Scholar]
- Wang R., Lagakos S. W., Ware J. H., Hunter D. J., Drazen J. M. Statistics in medicine–reporting of subgroup analyses in clinical trials. New England Journal of Medicine. 2007;357:2189–2194. doi: 10.1056/NEJMsr077003. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.















