Abstract
Dose–response models express the effect of different dose or exposure levels on a specific outcome. In meta-analysis, where aggregated-level data is available, dose–response evidence is synthesized using either one-stage or two-stage models in a frequentist setting. We propose a hierarchical dose–response model implemented in a Bayesian framework. We develop our model assuming normal or binomial likelihood and accounting for exposures grouped in clusters. To allow maximum flexibility, the dose–response association is modelled using restricted cubic splines. We implement these models in R using JAGS and we compare our approach to the one-stage dose–response meta-analysis model in a simulation study. We found that the Bayesian dose–response model with binomial likelihood has lower bias than the Bayesian model with normal likelihood and the frequentist one-stage model when studies have small sample size. When the true underlying shape is log–log or half-sigmoid, the performance of all models depends on choosing an appropriate location for the knots. In all other examined situations, all models perform very well and give practically identical results. We also re-analyze the data from 60 randomized controlled trials (15,984 participants) examining the efficacy (response) of various doses of serotonin-specific reuptake inhibitor (SSRI) antidepressant drugs. All models suggest that the dose–response curve increases between zero dose and 30–40 mg of fluoxetine-equivalent dose, and thereafter shows small decline. We draw the same conclusion when we take into account the fact that five different antidepressants have been studied in the included trials. We show that implementation of the hierarchical model in Bayesian framework has similar performance to, but overcomes some of the limitations of the frequentist approach and offers maximum flexibility to accommodate features of the data.
Keywords: Clusters, one-stage model, hierarchical model, antidepressants, random effects
1 Introduction
Dose-response associations examine the effect of different levels of exposure (e.g. levels of smoking or drug doses) on a health outcome.1,2 In pairwise meta-analysis,3–5 combining dose–response associations from different studies and settings may lead to more precise and generalizable conclusions.6 When aggregate-level data are available from multiple studies, dose–response associations can be synthesized using either a one-stage or two-stage model. The one-stage model is implemented as a linear mixed model, which estimates a dose–response fixed effect and accounts for the heterogeneity by allowing shapes to vary across studies.7 In a two-stage model, the dose–response model is fitted first within each study, and then the regression coefficients (or shape characteristics) are synthesized across studies.8–10
The one-stage model takes into account heterogeneity but provides relevant information via the estimate of a between-studies variance–covariance matrix. The two-stage model employs standard meta-analytical techniques and provides the usual heterogeneity measures, such as I2, in case this is of interest. However, to fit non-linear shapes, frequentist implementation of the two-stage model requires multiple dose levels to be reported in each study. For example, if the dose–response curve is assumed to be approximated by a -order polynomial, all studies need to report outcomes for at least dose levels. This situation will result in excluding studies that report dose levels or fewer levels.
The one-stage and two-stage models are implemented in a frequentist setting, and their performance has been evaluated in simulations and examples.11 Fitting dose–response meta-analysis in a Bayesian framework, in the form of a hierarchical model, is, in our view, highly desirable. Several papers12,13 have described the advantages of Bayesian evidence synthesis. First, Bayesian models14,15 can be easily extended to incorporate, for example, study-specific covariates, to combine observational and randomized data, or to deal with multiple outcomes and exposure types. Second, one can employ informative priors for the dose–response shape to reflect expert knowledge or evidence from external data sources. Third, one can easily extend the model to explore the variation in dose–response curves within and across groups of similar exposures or drugs. Finally, probabilistic statements follow naturally as the posterior distributions can be interpreted as the true distributions of quantities of interest as uncertainty about all parameters is incorporated in the results.16,17
The paper is structured as follow. In Section 2 we present a Bayesian hierarchical dose–response meta-analysis model with normal or binomial likelihood and the cluster-specific dose–response model. The evaluation of the properties of the models follows in Section 3, alongside comparisons with the frequentist model in a simulations study. In Section 4, we re-analyze a dataset of the dose–response association of various doses of antidepressants. Finally, we discuss the strengths and limitations of the model in Section 5.
2 Methods
We introduce a Bayesian hierarchical model for dose–response meta-analysis. We focus on a dichotomous outcome, although the models could easily accommodate continuous outcomes.
2.1 Notation
Table 1 summarizes the notation. Suppose there are studies ( ) and each study has a number of doses ( ). Each study reported an empirical estimate of the outcome at each dose level. The doses are denoted by where the minimum dose is set as the reference level (control group). The observed outcome is expressed as number of events out of total observed or relative treatment effects. The dose-specific number of events is out of a total sample size . The estimated change in the outcome from the reference dose to dose , summarized for the participants, is indicated by . can be log odds ratio (logOR), log risk ratio (logRR), log hazard ratio (logHR), or any relative treatment effect for continuous outcomes such as mean difference. Relative effects rather than number of events are commonly reported in the context of studying environmental exposures or other exposures examined in observational studies.18 In this case, the relative effects are the estimates from multivariable models adjusted for possible confounding variables. The vector comprises all relative effects, on a natural logarithmic scale, for study .
Table 1.
study id | |
dose levels in study | |
dose level in study | |
reference dose in study | |
number of events in dose within study | |
sample size in dose within study | |
within study , the relative effect (on a log-scale) of dose contrasted to the effect in the reference dose ( ) e.g. log odds ratio | |
vector of all dose-specific (log) relative effects in study | |
number of dose transformations associated with the dose–response shape. For a linear shape and for quadratic and restricted cubic splines | |
exposure clusters |
2.2 Dose–response meta-analysis model
We propose a hierarchical two-level model. In the first level, the dose–response model is fitted within each study assuming either normal (normal dose–response model) or binomial likelihood (binomial dose–response model) for the observed data. In the second level, we synthesize the dose–response regression coefficients across studies. The hierarchical structure allows coefficients to borrow strength across studies, via the exchangeability assumption.
2.2.1 Dose–response model within each study.
Within each study , a multivariate normal distribution is assumed for
where the vector contains the underlying relative effects of dose relative to dose . The ( −1) ( −1) variance–covariance matrix can be estimated assuming a multinomial distribution for the number of events per dose and using the delta-method for large sample sizes.19,20 For logOR, the elements of are
where refers to the number of non-events and the zero index refers to the quantities in the reference dose. The formula above is suitable when the logORs are estimated from 2 × 2 tables. If the logORs originate from adjusted logistic models from observational studies, then a correction in the correlations between dose strata should be applied to , using the Longnecker and Greenland method9,10 or the approach suggested by Hamling.21
If the data are from a randomized trial and the table of counts is available, it is straightforward to assume a binomial distribution of events
where are the underlying probabilities of having an event in dose within study . Then the underlying relative treatment effects are parametrized as
with defined as above. The function is specified based on the effect size we want to estimate; for example, it is the logit function for logOR and the log function for logRR. The parameter is the log-odds of the event in the reference dose level.
Note that continuous outcome data can be accommodated if are mean differences or standardized mean differences. Alternatively, if the outcome is available for each dose level, the normal likelihood is used instead of the binomial, and is parameterized as the mean difference or standardized mean difference.
2.2.2 Dose–response functions.
The underlying relative effect can be modelled as
where is the dose–response function and are the shape parameters that need to be estimated. Note that the function could also be any transformation, including linear, quadratic, cubic or fractional polynomials and resulting in being a vector of length and .22 The simplest case is to assume a linear ( is the identity function) shape where the statistical model needs to estimate only one parameter in study ; and . However, investigating dose–response relations underlying several studies may require non-linear models.23 A flexible choice is using restricted cubic splines.24 With knots, there are regression coefficients in to be estimated. Setting (say ), will result into consisting of dose-transformations; is the identity function and is the restricted cubic spline transformation24 with coefficients .
where
and
with and 0 otherwise.
The total number and location of knots should be identical for all studies, when are to be pooled. To set the location of knots, Harrell suggests to use fixed sample quantiles. With three knots, we set them at the 10%, 50% and 90% percentiles of the observed dose range (see Section 2.4 in Harrell24). It has been shown that the location of knots in restricted cubic spline is not very critical in most situations as Stone found in a series of simulations.25 However, we further examine this issue in our simulations study.
2.2.3 Synthesize dose–response functions across studies.
In dose–response meta-analysis, the study-specific regression coefficients can then be synthesized. Random dose–response coefficients model assumes that the underlying study-specific coefficients are normally distributed with mean and variance–covariance matrix, that is
This model acknowledges the presence of a distribution of true dose–response relationships underlying the studies and is capable of predicting study-specific curves by borrowing strength from their variation across studies. is a variance-covariance matrix with diagonal elements and in the off-diagonal there are the covariances between the coefficients. To improve estimation, we will assume that and we will explore whether the correlations in are different from zero. Note that this model assumes that the heterogeneity across the study-specific estimates is fully captured by heterogeneity in the dose–response shapes. For a model with a common dose–response coefficient we set .
Note that for studies with , is not identifiable. However, because of the exchangeability (or equality) of across studies, we are able to estimate them.
2.3 Dose–response meta-analysis model accounting for clustering in the exposure
Consider an exposure (or drug) variable that can take on different values. For example, daily intake of omega 3 fatty acids in relation to risk of cardiovascular events, possibly accounting for the different assessment of omega 3 (food supplements versus diet with fish and nuts). The differences between these two dose–response curves can be modelled by inserting type-specific regression coefficients , where
Overall, for a random of exposure clusters the relative effects are mapped to the transformed dose as
Next, the vectors of coefficients from study examining the same cluster of exposures, are synthesized using a multivariate normal distribution with a common mean and variance–covariance matrix ; that is a matrix with diagonal and on the off-diagonal the covariances between the coefficients
At the next step, the cluster-specific dose–response associations are synthesized across the clusters. Again, a multivariate normal distribution with mean vector and variance–covariance matrix is employed. has the same dimension as and in the diagonal the parameter measures the heterogeneity between the clusters
2.4 Predicting an absolute mean response to a dose
Once the data are synthesized and the dose–response parameters are estimated, we can predict the absolute mean response for any dose within the range of studied dose levels. These predictions are straightforward within a Bayesian model as the total uncertainty in the parameters is propagated in the final predictions. Assume there is a natural reference dose, such as a dose zero or no-exposure (baseline dose). The observations from the zero dose levels can be parametrized to estimate an average summary response to zero-dose
Then, the estimated common baseline mean effect (measured on the log or logit probability scale) can be combined with to obtain the predicted absolute response to any given dose level
2.5 Bayesian estimation
We will use Markov chain Monte Carlo (MCMC) techniques to estimate all parameters in a Bayesian setting. An approximate non-informative prior distribution is chosen for the coefficients and the baseline effects in the binomial model
Given that both in the simulations and in the example our outcome is dichotomous and measured on the natural log scale, we place a half-normal prior to the heterogeneity parameter
This heterogeneity prior is minimally informative in most cases as ORs rarely exceed 5 and hence the underlying values of ’s should not often exceed |0.2|.
For correlations in the off-diagonal of the variance–covariance matrices, we use a uniform prior
All Bayesian models are implemented in JAGS within R.26,27 The codes can be found in GitHub at https://github.com/htx-r/DoseResponsePMA. To obtain the spline transformations, we use the rcs function from the rms package.28 To evaluate the convergence of the models, we employed various diagnostic tools for MCMC included in the coda package.29 We explored convergence plots for the MCMC (histograms, trace plots, Geweke plot and Gelman-Rubin plot) and relevant statistics (Raftery and Lewis statistic and Heidelberger and Welch test).30
3 Simulations study
We aim to investigate the agreement between the estimations of the dose–response meta-analysis curve under our two Bayesian models, assuming random-effects for the coefficients, and the frequentist one-stage model.31 The codes are available in GitHub.
3.1 Simulation design
We investigated the performance of the three models assuming that the true dose–response function is curvilinear (setting 1), half-sigmoid or log–log (setting 2). Characteristics of the simulated settings are summarized in Table 2.
Table 2.
Characteristics of the dose–response curve | Function form | Effect size | Number of studies | Knot positions | Dose distribution | Study sample size | Main finding | Results in Figure/Table | ||
---|---|---|---|---|---|---|---|---|---|---|
1. True dose–response curve as a restricted cubic spline | OR, RR | 40 | 0, 00.04, 00.1, 0.030.2, −0.2 | 0.0010.01 | 25%, 50%, 75% | Unif | Unif | The three models give unbiased estimates | Supplementary Appendix Tables 1 to 3 and 7 to 9Supplementary Appendix Figure 2 and 15 | |
2. Other true dose-response forms | OR | 20 | – | – | 10%, 50%, 90%at dose 0, 1, 3 | Unif | Unif | The three models give biased estimates with high uncertainty for quantile knots and unbiased results for knots at 0,1,3 | Figure 1 Supplementary Appendix Figure 6 | |
3. Smaller sample size | OR | 20 | 0, 00.04, 00.1, 0.030.2, –0.2 | 0.0010.01 | 25%, 50%, 75% | Unif | Unif | The binomial model gives unbiased estimates | Table 3 Supplementary Appendix Tables 4 to 6 Figure 2 | |
4. Fewer trials | OR | 816 | 0.04, 0.03 | 0.001 | 25%, 50%, 75% | Unif | Unif | The three models give unbiased estimates | Supplementary Appendix Figure 3 | |
5. Partially overlapping doses | OR | 20 | 0.04, 0.03 | 0.001 | 25%, 50%, 75% | Half the studies from Unif Half the studies from Unif | Unif | The three models give unbiased estimates | Supplementary Appendix Figure 4 | |
6. Discrete doses | OR | 20 | 0.04, 0.03 | 0.001 | 25%, 50%, 75% | sample | Unif | The three models give unbiased estimates | Supplementary Appendix Figure 5 |
Under the assumption of a curvilinear relationship (setting 1), we used restricted cubic splines with three knots at fixed percentiles (25th, 50th, and 75th) of the dose, with shape defined by the spline coefficients. We modelled the logOR and the logRR. For 40 clinical trials, we simulated study-level aggregated data. For each study, we simulated two non-zero doses from uniform distribution 1, 10) and assumed each study reported one zero dose. The study-specific coefficients and are generated independently from a univariate normal distribution with means and , respectively, and a common heterogeneity . Values for and were chosen to represent four dose–response shapes: constant (both equal to zero), linear ( , monotonic increasing and umbrella-like shape (see supplementary Appendix Figure 1 and Figure 1). We chose non-zero coefficient values that cover a reasonable range for ORs (0.3–5). We introduced between-study heterogeneity, . The assumed mean and heterogeneity values result in eight scenarios, as shown in supplementary Appendix Tables 1 to 3.
Using , and we calculated the underlying treatment effect ( are set as explained above in Section 2.2.2). To improve computing time, we assumed that the two shape coefficients and are unrelated ( ). Per dose, the observed number of events are generated from binomial distributions with probability and sample size . The event rate in the zero-dose group is set to 0.1. The underlying event rate at dose is . The sample size per dose is generated from a uniform distribution . In this way, the number of events and the sample size per dose for each study are generated and used as input for the Bayesian binomial model. Using these counts, we then estimate and their standard errors to use as inputs for the Bayesian normal and frequentist models.31
Following the same steps as in logOR above, we simulated the dataset expressing the underlying treatment effect, instead, in terms of risk ratio . The additional concern, particularly for RR, is that we need to confine probabilities within 0 and 1. Therefore, we inserted and then we set . Along with that, we restrict the values of both and ; and , to avoid numerical problems that emerge near the boundaries.
In the second setting (Table 2), we assumed the true shape is log–log ( and half-sigmoid function ( ). We used two different dose distributions: 1, 10) and 2). We investigated two different knot positions; at the 10%, 50% and 90% quantiles (1, 5 and 9 for the uniform dose and 0.2, 1.3 and 4.3 for the distribution) and then at doses 0, 1 and 3 which is where changes in the shape are taking place. This results in eight scenarios for setting 2.
The other four settings in Table 2 result from modifying setting 1. First, we assumed smaller trials by generating the sample size per dose as . Second, we assumed fewer trials; . Third, we explored the case of having partially overlapping dose levels across trials, by generating the doses in half of studies from and the other half from . Finally, we consider the case where doses are discrete by sampling from a list of integer values.
The Bayesian models were estimated using 1× iterations with three chains, with a burn-in of 1× and a thinning of one. Given that the simulated data was produced assuming , we did not use bivariate distributions but two independent distributions for and . Each scenario was studied in 1000 simulations. We used the dosresmeta command to fit the frequentist model.32
For each method, we estimated the mean bias in the regression coefficients and and as the difference between the true coefficient and the corresponding mean estimated value. We computed the mean squared error (MSE) as the sum of the squared bias and the variance of the estimates to quantify the variation in sample estimates. As graphical output is difficult to monitor in a simulation study, the convergence of the MCMC was quantified here only by computing the Gelman statistics ; when the MCMC converges. Additionally, we report the coverage for each estimate as the proportion of credible intervals that captured the true value. We computed the power to detect when the estimated credible interval does not include zero and the mean of the coefficients’ standard error (SE2mean). Finally, we report the Monte Carlo standard error (MCse) to quantify the uncertainty of all the quantities presented above. We present the results from OR for bias and MSE in the main text, whereas the remaining results are presented in the supplementary Appendix.
3.2 Simulation results
3.2.1 Setting 1.
Supplementary Appendix Tables 1 to 3 present the results from the eight scenarios of the first setting for logORs using splines. Supplementary Appendix Figure 2 shows the average estimated curves for scenarios 2 to 4 (results from scenarios 6 to 8 provide similar conclusions to those in supplementary Appendix Figure 2; scenarios 1 and 5 refer to no dose–response association and are not presented in the figure). The three estimated dose–response lines are indistinguishable and all three models perform very well (supplementary Appendix Tables 1 to 3). The binomial Bayesian model has a slightly lower bias in the coefficients than the normal Bayesian and the frequentist approach in all scenarios. The spline coefficients exhibit more bias and are less accurate than those of . For both binomial and normal Bayesian models, larger heterogeneity resulted in less bias than when . The coverage of all estimates exceeds 90%. The power to detect a nonzero linear coefficient ranges between 85% and 93% when and 100% for . The power to detect a non-linear association ranges between 20% and 28% when and is 100% when −0.2. The MCMC converged in all simulations as . Finally, the largest MCse of bias is . The results for logRR agree with the ones based on logOR and are presented in supplementary Appendix Figure 15 and supplementary Appendix Tables 7 to 9.
3.2.2 Setting 2.
Simulating under different shapes, we found that estimations from the three models improves when knots are placed in dose ranges where the risk changes a lot compared to when we set knots at 10%, 50% and 90% percentiles as suggested by Harrell24 (see Figure 1 and supplementary Appendix Figure 6).
3.2.3 Setting 3.
Modifications of setting 1 resulted in similar very good performance of all three models, with one exception. We found that the binomial Bayesian model gives less biased dose–response curve estimates compared with the normal Bayesian and frequentist models (see Table 3, Figure 1, and supplementary Appendix Tables 4 to 6).
Table 3.
(a) Estimated | |||||||||
---|---|---|---|---|---|---|---|---|---|
True values | Binomial Bayesian | Normal Bayesian | One-stage (frequentist) | ||||||
Scenario | Bias | MSE | Bias | MSE | Bias | MSE | |||
S1 | 0.001 | 0 | 0 | 2.1 | 0 | 23.7 | 0 | 22.0 | 0 |
S2 | 0.001 | 0.04 | 0 | 1.7 | 0 | 21.4 | 0 | 20.1 | 0 |
S3 | 0.001 | 0.1 | 0.03 | 2.1 | 0 | 20.3 | 0 | 18.9 | 0 |
S4 | 0.001 | 0.2 | −0.2 | 3.9 | 0 | 13.4 | 0.01 | 17.2 | 0.01 |
S5 | 0.01 | 0 | 0 | 2.2 | 0 | 24.7 | 0 | 23.1 | 0 |
S6 | 0.01 | 0.04 | 0 | 4.3 | 0.01 | 24.6 | 0.01 | 23.3 | 0.01 |
S7 | 0.01 | 0.1 | 0.03 | 4.5 | 0.01 | 21.7 | 0.01 | 20.2 | 0.01 |
S8 | 0.01 | 0.2 | −0.2 | 4.2 | 0 | 14.5 | 0.01 | 17.6 | 0.01 |
(b) Estimated | |||||||||
True values | Binomial Bayesian | Normal Bayesian | One-stage (frequentist) | ||||||
Scenario |
|
|
|
Bias |
MSE |
Bias |
MSE |
Bias |
MSE |
S1 | 0.001 | 0 | 0 | −7.5 | 6.0 | −35.8 | 7.0 | −30.1 | 7.5 |
S2 | 0.001 | 0.04 | 0 | −2.5 | 5.2 | −31.0 | 6.3 | −27.0 | 7.2 |
S3 | 0.001 | 0.1 | 0.03 | 2.2 | 4.4 | −28.3 | 5.8 | −22.6 | 6.2 |
S4 | 0.001 | 0.2 | −0.2 | −3.2 | 3.6 | −12.2 | 4.5 | −19.6 | 5.2 |
S5 | 0.01 | 0 | 0 | −5.7 | 6.0 | −35.7 | 6.9 | −30.7 | 7.8 |
S6 | 0.01 | 0.04 | 0 | −5.9 | 6.3 | −34.5 | 7.8 | −29.8 | 9.0 |
S7 | 0.01 | 0.1 | 0.03 | −4.7 | 4.8 | −32.6 | 6.6 | −26.9 | 6.7 |
S8 | 0.01 | 0.2 | −0.2 | −2.3 | 4.3 | −14.4 | 5.5 | −20.4 | 6.3 |
(c) Estimated | |||||||||
True values | Binomial Bayesian | Normal Bayesian | |||||||
Scenario |
|
|
|
Bias |
MSE |
Bias |
MSE |
|
|
S1 | 0.001 | 0 | 0 | 32.7 | 1.2 | 37.3 | 1.7 | ||
S2 | 0.001 | 0.04 | 0 | 32.0 | 1.2 | 38.0 | 1.8 | ||
S3 | 0.001 | 0.1 | 0.03 | 28.7 | 1.0 | 36.7 | 1.7 | ||
S4 | 0.001 | 0.2 | −0.2 | 28.4 | 0.9 | 37.0 | 1.8 | ||
S5 | 0.01 | 0 | 0 | 25.5 | 0.9 | 30.5 | 1.3 | ||
S6 | 0.01 | 0.04 | 0 | 22.7 | 0.7 | 30.1 | 1.3 | ||
S7 | 0.01 | 0.1 | 0.03 | 20.1 | 0.5 | 28.8 | 1.2 | ||
S8 | 0.01 | 0.2 | −0.2 | 21.2 | 0.6 | 29.9 | 1.4 |
Note: We assume 20 trials reporting aggregated-level data with three dose-levels each where the sample size is generated from . The bias and MSE are reported for linear coefficient, spline coefficient and their common heterogeneity (a) (b) (c) , respectively. Bias and MSE are divided by .
Overall, the spline coefficients are more biased and less accurate than those of . For both binomial and normal Bayesian models, heterogeneity was estimated with considerably more bias than in setting 1 where larger studies have been synthesized (see Table 3 and supplementary Appendix Tables 4 to 6).
Additional results from setting 3 are presented in supplementary Appendix Tables 4 to 6. The coverage of all estimates exceeds 87%; coverage with the binomial Bayesian model is slightly larger than with the normal likelihood models. The power to detect a nonzero linear coefficient ranges from 21% to 40% when and is 100% for . The power to detect a non-linear association ranges from 4.5% to 7.6% when and from 85% to 88% when . The convergence of MCMC is achieved in all simulations as . Finally, the largest MCse of bias is .
3.2.4 Settings 4–6.
In settings 4, 5 and 6, the three estimated dose–response curves are indistinguishable and unbiased, see supplementary Appendix Figures 3 to 5.
4 Dose–response for antidepressants in major depression
We illustrate the methods by synthesizing the dose–response association reported in 60 randomized controlled trials (145 arms, 15,174 participants) examining the efficacy and tolerability of various doses of serotonin-specific reuptake inhibitor (SSRI) antidepressant drugs.33 Using a previously validated formula, we first transformed the dosages of the different antidepressants into fluoxetine-equivalents.33 The response to antidepressant is defined as 50% reduction in symptoms. We estimated the dose–response relationship using restricted cubic spline with three knots placed at fixed percentiles of the dose: 10, 20, and 50 mg/day.
The results are displayed in Table 4 and the dose–response curves based on the three approaches are shown in supplementary Appendix Figure 17. The estimated correlation indicates a substantial uncertainty. The two Bayesian models agree to a large extent with the frequentist approach in the estimated linear and spline coefficients and in the precision of the estimations, as shown in results in Table 4. There are immaterial differences between the frequentist and the Bayesian models in the estimation of heterogeneity and correlation ; the latter is estimated with large uncertainty in Bayesian models.
Table 4.
Binomial Bayesian |
Normal Bayesian |
One-stage (frequentist) |
Binomial Bayesian with drug clusters |
|||||
---|---|---|---|---|---|---|---|---|
Mean | SD | Mean | SD | Mean | SE | Mean | SD | |
0.0214 | 0.0024 | 0.0210 | 0.0037 | 0.0209 | 0.0025 | 0.0213 | 0.0036 | |
−0.0397 | 0.0070 | −0.0396 | 0.0085 | −0.0376 | 0.0060 | −0.0387 | 0.0079 | |
0.0087 | 0.0028 | 0.0072 | 0.0031 | – | 0.0076 | 0.0028 0.0040 | ||
−0.4782 | 0.4952 | −0.2488 | 0.5652 | −1 | – | 0.5153 |
Note: Dose is measured as fluoxetine-equivalent in mg/day. The model is fitted with restricted cubic splines and assuming random dose–response coefficients.
In Figure 3 we present the absolute response using the binomial Bayesian model. The response in the placebo arm was estimated at 37.6% (blue line in Figure 3). We conducted a meta-analysis to synthesize evidence from studies of each drug separately, and we also present the absolute response for all drugs together. The precision in the dose–response curve is high for lower doses and increases for higher doses, as less data are available.
We also fit the clustered dose–response model where studies have first been synthesized within drugs and then across drugs using the binomial likelihood. The coefficients were very similar to those estimated from the model that ignores clusters (see Table 4). The within-drug variance was estimated 0.0076, a bit smaller than the total heterogeneity from the binomial model ( . There were some differences between the eight drugs as indicated from the . However, the dose–response shape is practically identical to that of the model that ignores the drug clustering. Finally, the within and between cluster correlations are estimated with large uncertainty like in all models.
We examined the convergence of MCMC for all Bayesian models. Overall, convergence is achieved for all the estimated parameters of the three models, see supplementary Appendix Tables 11 to 16 and supplementary Appendix Figures 18 to 30.
5 Discussion
In this paper, we present a hierarchical dose–response meta-analysis model in a Bayesian framework. At the first level, the dose–response relationship is fitted within each study. Then the curves are combined to get the average dose–response. An additional pooling level can be added, if there are different clusters of exposure or drugs. The exact likelihood of the outcome (binomial or normal) can be employed if arm-level data is available.
We performed extensive simulations under different scenarios, in the majority of which the three tested models (binomial Bayesian, normal Bayesian and one-stage frequentist model) perform equally well and provide unbiased results. When the study sample size is small, we showed that the binomial Bayesian model performs better than the Bayesian or frequentist models that use normal likelihood. This is to be expected because with small sample size the observed logORs are poorly approximated by the normal distribution. One could use a logistic regression to model dose–response within a study; that would provide results equivalent to the binomial Bayesian approach. However, to the best of our knowledge, this approach has not been implemented in any frequentist software. When restricted cubic splines are used to model the association, we showed that prior knowledge about the underlying dose–response shape can direct the choice of the knot location and improve estimation.
Some articles have previously described methods for dose–response meta-analyses.7,8,11,34,35 Langford et al. proposed four methods to conduct dose–response meta-analysis in a frequentist setting.34 The first two methods focus on the synthesis of arm-level responses which is undesirable in a meta-analysis context, in particular when randomized trials are synthesized. Method 3 is a one-stage approach; the contrasts between the arms are treated as observations to fit the dose–response curve using weighted least squares. Method 4 is a two-stage approach. In the first stage, all possible dose contrasts and their variances are estimated using network meta-analysis (where doses are placed in the network nodes). In the second stage, the obtained dose-specific estimates are used to fit the dose–response curve. Although Methods 3 and 4 are proposed to conduct the analysis based on contrasts, they ignore the correlations between them.
Wu et al. proposed a Bayesian hierarchical dose–response meta-analysis approach.35 The approach aimed to examine the effects of various drugs extracted from living organism (biologics). The analysis was conducted assuming Emax and linear dose–response shapes. Such shapes are common in dose-findings trials. Our approach is, however, more general, in the sense that it can accommodate any dose–response shape. Our approach, additionally, considers both random- and common-effect models, whereas only the latter is considered by Wu et al. Allowing for heterogeneity between studies can be crucial as illustrated by Shi et al. who showed that the associations between breast cancer and alcohol consumption gets substantially weaker when the heterogeneity is incorporated in the model.36
Among the limitations common to all Bayesian approaches, two are particularly challenging for our model.12,17 First, for some scenarios, the estimation can be sensitive to the prior choice.37 In these cases, sensitivity analysis is recommended with either different prior distributions or by varying the characteristics (hyperparameters) of the specific prior distribution. This is particularly important for the heterogeneity parameter when we have few studies or dose levels.37 Second, time-consuming, intensive computation may be required until MCMC convergence is achieved. In this context, we emphasize the importance of investigating the convergence of MCMC using CODA approaches (e.g. as those presented in our supplementary Appendix). Furthermore, the usual challenges of dose–response meta-analysis apply, including ambiguity in the categorization of the exposure, and the reporting of different categories by different studies or of open-ended categories.38 These issues are discussed in detail elsewhere.18 Finally, important considerations are to be made, that are specific to the data and context, when the (often continuous) dose has been categorized.39
Simulations under a half-sigmoid and log–log dose–response shape revealed that an agnostic placement of the knots (e.g. in quantiles) might lead to biased and very imprecise estimation. Govindarajulu et al. also showed that restricted cubic splines can’t always capture the true shape.40 Restricted cubic splines can perform poorly when most of the change in response is occurring in a narrow exposure interval and knots are located far away from it. In such scenario, subject-matter knowledge can inform the generation of restricted cubic splines by placing the knots where the investigator can anticipate most of the effect to occur, while keeping constant the number of parameters to be estimated. In meta-analysis of dose–response, investigators often know or suspect the underlying dose–response shape or at least the range of doses where the outcome is changing a lot. This should be incorporated in the assumed shape of the modelled dose–response association (e.g. by choosing a piecewise constant) or by locating the knots where changes are expected. In our example of antidepressants, psychiatrists would be interested about the shape in doses lower than 50 mg fluoxetine-equivalent and changes in response are expected to occur between 20 and 40 mg (the minimum therapeutic dose). Therefore, knots have to be placed at these dosages.
A strength of our Bayesian approach is its flexibility. We were able to evaluate whether studies that examine the same drug are more similar than studies examining different drugs by using an extension of our model that adds a layer in the hierarchy according to the specific kind of antidepressant that was studied. We were also able to estimate the absolute response to each dose. Such estimates can also be obtained in a frequentist setting by using best linear unbiased prediction (BULPs) in mixed models.41,42 However, the process is easier in a Bayesian framework, which also allows the use of external data to estimate the outcome at zero dose. The approach will be particularly valuable in the context of policy- and decision-making where the absolute event rates play a more important role than the relative treatment effects.
The hierarchical structure of the model allows the borrowing of strength across studies.14 Studies that report only one dose-specific effect can thus be included and a nonlinear dose–response model fitted. This is also possible in a frequentist setting using the one-stage approach; however, our model can be extended to separate between the heterogeneity due to variability in dose–response shape and residual between-study heterogeneity. The latter can be explored by including covariates that may explain this residual variability, which could lead to a dose–response meta-regression. Our model could also be extended to multiple treatments, thus offering an alternative to published network meta-analysis models,43 or it could be used to model simultaneously several outcomes with similar dose–response shapes. Another potential extension, which we have implemented in our paper, is accounting for cluster of the exposure in estimating the dose–response shape. Finally, external knowledge can be incorporated, for example, evidence from observational studies. The use of observational data will be particularly relevant when assessing long-term outcomes, as the majority of RCTs, in psychiatry and elsewhere, are of relatively short duration.17
In conclusion, we suggest that the Bayesian model with the binomial likelihood could be the default approach as it outperforms the alternative models when the synthesized studies are small. Prior knowledge about the underlying association should be incorporated in the model by defining an appropriate dose–response shape or by locating knots at doses where the outcome is expected to change a lot.
Supplemental Material
Supplemental material, sj-pdf-1-smm-10.1177_0962280220982643 for A Bayesian dose–response meta-analysis model: A simulations study and application by Tasnim Hamza, Andrea Cipriani, Toshi A Furukawa, Matthias Egger, Nicola Orsini and Georgia Salanti in Statistical Methods in Medical Research
Acknowledgements
The authors would like to acknowledge two anonymous reviewers who made very helpful comments and suggestion to this work. The views expressed are those of the authors and not necessarily those of the UK National Health Service, the NIHR, or the UK Department of Health.
Authors’ Note: Andrea Cipriani and Matthias Egger are also affiliated with Oxford Health NHS Foundation Trust, Warnford Hospital, Oxford, UK and Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK, respectively.
Declaration of conflicting interests: The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: Andrea Cipriani has received research and consultancy fees from INCiPiT (Italian Network for Paediatric Trials), CARIPLO Foundation and Angelini Pharma. The other authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding: The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: TH and GS were funded by the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825162. ME was supported by a special project funding from the Swiss National Science Foundation (grant no. 17481). AC is supported by the National Institute for Health Research (NIHR) Oxford Cognitive Health Clinical Research Facility, by an NIHR Research Professorship (grant RP-2017-08-ST2-006), by the NIHR Oxford and Thames Valley Applied Research Collaboration and by the NIHR Oxford Health Biomedical Research Centre (grant BRC-1215-20005).
ORCID iDs: Tasnim Hamza https://orcid.org/0000-0002-4700-6990
Georgia Salanti https://orcid.org/0000-0002-3830-8508
Supplemental material: Supplemental material for this article is available online.
References
- 1.Bretz F, Pinheiro JC, Branson M. Combining multiple comparisons and modeling techniques in dose–response studies. Biometrics 2005; 61: 738–748. [DOI] [PubMed] [Google Scholar]
- 2.Bretz F, Hsu J, Pinheiro JC, et al. Dose finding – a challenge in statistics. Biomed J 2008; 50: 480–504. [DOI] [PubMed] [Google Scholar]
- 3.Borenstein M, Hedges L, Higgins J, et al. A basic introduction to fixed-effect and random-effects models for meta-analysis. Res Synth Meth 2010; 1: 97–111. [DOI] [PubMed] [Google Scholar]
- 4.DerSimonian R, Laird N. Meta-analysis in clinical trials. Control Clin Trials 1986; 7: 177–188. [DOI] [PubMed] [Google Scholar]
- 5.Higgins JPT, Thomas J, Chandler J, et al. Cochrane handbook for systematic reviews of interventions. 2nd ed. Chichester, UK: John Wiley & Sons, 2019. [Google Scholar]
- 6.Furukawa TA, Salanti G, Cowen P, et al. No benefit from flexible titration above minimum licensed dose in prescribing antidepressants for major depression: systematic review. Acta Psychiatr Scand 2019. [DOI] [PubMed] [Google Scholar]
- 7.Crippa A, Discacciati A, Bottai M, et al. One-stage dose–response meta-analysis for aggregated data. Stat Meth Med Res 2019; 28: 1579–96. [DOI] [PubMed] [Google Scholar]
- 8.Crippa A, Orsini N. Dose–response meta-analysis of differences in means. BMC Med Res Methodol 2016; 16: 91. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Berlin J, Longnecker MP, Greenland S. Meta-analysis of epidemiologic dose–response data. Chestnut Hill, MA: Epidemiol Resour Inc, 1992. [DOI] [PubMed] [Google Scholar]
- 10.Greenland S, Longnecker MP. Methods for trend estimation from summarized dose–response data, with applications to meta-analysis. Am J Epidemiol 1992; 135: 1301–1309. [DOI] [PubMed] [Google Scholar]
- 11.Orsini N, Li R, Wolk A, et al. Meta-analysis for linear and nonlinear dose–response relations: examples, an evaluation of approximations, and software. Am J Epidemiol 2012; 175: 66–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Sutton AJ, Abrams KR. Bayesian methods in meta-analysis and evidence synthesis. Stat Meth Med Res 2001; 10: 277–303. [DOI] [PubMed] [Google Scholar]
- 13.Jansen JP, Crawford B, Bergman G, et al. Bayesian meta-analysis of multiple treatment comparisons: an introduction to mixed treatment comparisons. Value Health J Int Soc Pharmacoeconom Outcome Res 2008; 11: 956–964. [DOI] [PubMed] [Google Scholar]
- 14.Gelman A, Carlin JB, Stern HS, et al. Bayesian data analysis. 2nd ed. Boca Raton, FL: Chapman & Hall/CRC, 2004. [Google Scholar]
- 15.Bernardo JM, Smith AFM. Bayesian theory. New York, NY: John Wiley & Sons, Inc, 1994. [Google Scholar]
- 16.Hong H, Carlin BP, Shamliyan TA, et al. Comparing Bayesian and frequentist approaches for multiple outcome mixed treatment comparisons. Med Decis Making 2013; 5: 702–714. [DOI] [PubMed] [Google Scholar]
- 17.Smith TC, Spiegelhalter DJ, Thomas A. Bayesian approaches to random-effects meta-analysis: a comparative study. Stat Med 1995; 14: 2685–2699. [DOI] [PubMed] [Google Scholar]
- 18.Schwartz J, Zanobetti A. Using meta-smoothing to estimate dose–response trends across multiple studies, with application to air pollution and daily death. Epidemiology 2000; 11: 666–672. [DOI] [PubMed] [Google Scholar]
- 19.Robins J, Greenland S, Breslow N. A general estimator for the variance of the Mantel-Haenszel odds ratio. Am J Epidemiol 1986; 124: 719–723. [DOI] [PubMed] [Google Scholar]
- 20.Breslow N, Day N. Statistical methods in cancer research. Vol 2 – The design and analysis of cohort studies. Lyon Fr Int Agency Res Cancer 1987;:65–72. [PubMed] [Google Scholar]
- 21.Hamling J, Lee P, Weitkunat R, et al. Facilitating meta-analyses by deriving relative effect and precision estimates for alternative comparisons from a set of estimates presented by exposure level or disease category. Stat Med 2008; 27: 954–970. [DOI] [PubMed] [Google Scholar]
- 22.Royston P, Altman DG. Regression using fractional polynomials of continuous covariates: parsimonious parametric modelling. Appl Stat 1994; 43: 429–467. [Google Scholar]
- 23.Bagnardi V, Zambon A, Quatto P, et al. Flexible meta-regression functions for modeling aggregate dose–response data, with an application to alcohol and mortality. Am J Epidemiol 2004; 159: 1077–1086. [DOI] [PubMed] [Google Scholar]
- 24.Harrell FE. Regression modelling strategies: with applications to linear models, logistic regression, and survival analysis. New York, NY: Springer, 2015. [Google Scholar]
- 25.Stone CJ. Comment: generalized additive models. Stat Sci 1986; 1: 312–314. [Google Scholar]
- 26.Plummer M. JAGS: A program for analysis of Bayesian graphical models using Gibbs sampling. 2003.
- 27.RStudio Team. RStudio: Integrated Development for R. RStudio, PBC, Boston, MA, 2020. http://www.rstudio.com/
- 28.Harrell F. rms: Regression modeling strategies. R package version 5.1-3.1, https://CRAN.R-project.org/package=rms
- 29.Plummer M, Best N, Cowles K, et al. CODA: convergence diagnosis and output analysis for MCMC. R News 2008; 6: 7–11. [Google Scholar]
- 30.Cowles M, Carlin BP. Markov Chain Monte Carlo convergence diagnostics: a comparative review. J Am Stat Assoc 1994; 91: 883–904. [Google Scholar]
- 31.Morris TP, White IR, Crowther MJ. Using simulation studies to evaluate statistical methods. Stat Med 2019; 38: 2074–2102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Crippa A, Orsini N. Multivariate dose–response meta-analysis: the dosresmeta R Package. J Stat Softw 2016; 72: 1–15. [Google Scholar]
- 33.Furukawa TA, Cipriani A, Cowen JP, et al. Optimal dose of selective serotonin reuptake inhibitors, venlafaxine, and mirtazapine in major depression: a systematic review and dose–response meta-analysis. Lancet Psychiatr 2019; 6: 601–609. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Langford O, Aronson JK, Valkenhoef G van, et al. Methods for meta-analysis of pharmacodynamic dose–response data with application to multi-arm studies of alogliptin. Stat Meth Med Res 2016; 27: 564–578. [DOI] [PubMed] [Google Scholar]
- 35.Wu J, Banerjee A, Jin B, et al. Clinical dose–response for a broad set of biological products: a model-based meta-analysis. Stat Meth Med Res 2018; 27: 2694–2721. [DOI] [PubMed] [Google Scholar]
- 36.Shi J, Copas J. Meta-analysis for trend estimation. Stat Med 2004; 23: 3–19. [DOI] [PubMed] [Google Scholar]
- 37.Gelman A. Prior distributions for variance parameters in hierarchical models. Bayesian Anal 2006; 1: 515–33. [Google Scholar]
- 38.Hartemink N, Boshuizen HC, Nagelkerke NJD, et al. Combining risk estimates from observational studies with different exposure cutpoints: a meta-analysis on body mass index and diabetes type 2. Am J Epidemiol 2006; 163: 1042–1052. DOI: 10.1093/aje/kwj141. [DOI] [PubMed] [Google Scholar]
- 39.Turner EL, Dobson JE and Pocock SJ. Categorisation of continuous risk factors in epidemiological publications: a survey of current practice. Epidemiol Perspect Innov 2010; 7: 9. DOI: 10.1186/1742-5573-7-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Govindarajulu US, Malloy EJ, Ganguli B, et al. The comparison of alternative smoothing methods for fitting non-linear exposure-response relationships with Cox models in a simulation study. Int J Biostat 2009; 5: Article 2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Stijnen T, Hamza TH, Özdemir P. Random effects meta – analysis of event outcome in the framework of the generalized linear mixed model with applications in sparse data. Stat Med 2010; 29: 3046–3067. [DOI] [PubMed] [Google Scholar]
- 42.Stram DO. Meta-analysis of published data using a linear mixed-effects model. Biometrics 1996; 52: 536–544. [PubMed] [Google Scholar]
- 43.Mawdsley D, Bennetts M, Dias S, et al. Model-based network meta-analysis: a framework for evidence synthesis of clinical trial data. CPT Pharmacomet Syst Pharmacol 2016; 5: 393–401. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supplemental material, sj-pdf-1-smm-10.1177_0962280220982643 for A Bayesian dose–response meta-analysis model: A simulations study and application by Tasnim Hamza, Andrea Cipriani, Toshi A Furukawa, Matthias Egger, Nicola Orsini and Georgia Salanti in Statistical Methods in Medical Research