Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2019 May 27;38(20):3861–3879. doi: 10.1002/sim.8207

Participants' outcomes gone missing within a network of interventions: Bayesian modeling strategies

Loukia M Spineli 1,, Chrysostomos Kalyvas 2, Konstantinos Pateras 3
PMCID: PMC7754380  PMID: 31134664

Abstract

Objectives: To investigate the implications of addressing informative missing binary outcome data (MOD) on network meta‐analysis (NMA) estimates while applying the missing at random (MAR) assumption under different prior structures of the missingness parameter.

Methods: In three motivating examples, we compared six different prior structures of the informative missingness odds ratio (IMOR) parameter in logarithmic scale under pattern‐mixture and selection models. Then, we simulated 1000 triangle networks of two‐arm trials assuming informative MOD related to interventions. We extended the Bayesian random‐effects NMA model for binary outcomes and node‐splitting approach to incorporate these 12 models in total. With interval plots, we illustrated the posterior distribution of log OR, common between‐trial variance (τ2), inconsistency factor and probability of being best per intervention under each model.

Results: All models gave similar point estimates for all NMA estimates regardless of simulation scenario. For moderate and large MOD, intervention‐specific prior structure of log IMOR led to larger posterior standard deviation of log ORs compared to trial‐specific and common‐within‐network prior structures. Hierarchical prior structure led to slightly more precise τ2 compared to identical prior structure, particularly for moderate inconsistency and large MOD. Pattern‐mixture and selection models agreed for all NMA estimates.

Conclusions: Analyzing informative MOD assuming MAR with different prior structures of log IMOR affected mainly the precision of NMA estimates. Reviewers should decide in advance on the prior structure of log IMOR that best aligns with the condition and interventions investigated.

Keywords: Bayesian methods, missing outcome data, network meta‐analysis, pattern‐mixture model, simulation study

1. INTRODUCTION

Plenty of empirical studies on reporting quality of systematic reviews with conventional meta‐analyses have revealed several shortcomings in the reporting and administration of missing binary outcome data (MOD).1, 2, 3, 4 Recommendations aiming to improve reporting of systematic reviews with regards to MOD already exist and are built upon this comprehensive empirical evidence. Contrariwise, proposed guidelines for the administration of MOD in systematic reviews have evolved in the absence of simulation studies using only intuitive argumentations5, 6; for example, in the Cochrane Handbook, it is stated that “[imputing the missing data with replacement values] fails to acknowledge uncertainty in the imputed values and results, typically, in confidence intervals that are too narrow” (see chapter 16.1.2 in the work of Higgins and Green6). Current directions to deal with MOD in systematic reviews include (i) analysis of observed outcomes as a primary analysis, (ii) imputation of MOD under plausible scenarios as a sensitivity analysis, and (iii) statistical modeling of missingness mechanisms (ie, reasons that triggered MOD).6 The first two options are the most commonly adopted in systematic reviews.1, 2, 3 Nonetheless, they have been criticized for being employed inefficiently through data elimination or augmentation before analysis, respectively, and hence for ignoring the uncertainty induced by the scenarios considered.6, 7, 8 In turn, these options may compromise the conclusions of the systematic review.9

Statistical modeling of MOD has received little attention in systematic reviews with two (for example, the works of Ejere et al,10 Mayo‐Wilson et al,11 and Virgili et al12) or more interventions (for example, the works of Watt et al13 and Veroniki et al14). As opposed to imputation or exclusion, modeling MOD comprises an elegant framework that adjusts for bias due to MOD and fully acknowledges the uncertainty about the scenarios considered for the missingness mechanism. This is achieved by modeling the joint distribution of the outcomes (observed and missing) and missingness indicator.15 This joint distribution is further factorized in two ways: a distribution of the outcome, given the missingness indicator, and a distribution of that indicator (pattern‐mixture model)16 or a distribution of the missingness indicator, given the underlying outcome, and a distribution of the underlying outcome (selection model).17 Selection model is more prevalent in the literature for clinical trials,18 while pattern‐mixture model has been most frequently described in the analysis of series of trials.19 Modeling MOD using either pattern‐mixture or selection models offers a thorough investigation of the underlying missingness mechanisms across different trials and interventions.8, 20, 21 These mechanisms can be naturally explored using Bayesian approaches, where the reviewer assigns an informative prior distribution on the missingness parameter (ie, an absolute or relative measure of the relationship between outcome and missingness indicator) to indicate a specific scenario alongside the uncertainty for that scenario.8

The existing directions on reporting and handling MOD in conventional systematic reviews are of great relevance and importance also for systematic reviews with network meta‐analysis (NMA). NMA offers an in‐depth exploration of the missingness mechanisms in the network as interventions may carry a different degree of and reasons for MOD in different comparisons and this information cannot be located in isolated conventional meta‐analyses. Moreover, due to the addition of interventions, assumptions, and model parameters that structure this framework, addressing MOD in NMA can reveal their implications on model parameters beyond the standard meta‐analytic ones. Since the statistical methodology of NMA has been refined and implemented mainly within the Bayesian framework,22, 23, 24 we view statistical modeling with the assignment of carefully selected prior distribution on the missingness parameter as a natural way to handle MOD in a network of interventions.

To our knowledge, there is currently no published empirical or simulation study on the comparative performance of models for MOD using Bayesian approaches in terms of meta‐analysis or NMA estimates. Consequently, the analyst misses the knowledge of the overall performance of models for aggregated MOD to critically decide on the proper models to apply. To shed light on this knowledge gap, we set up a comprehensive simulation study using empirical evidence from published NMAs in a wide range of health‐related fields to inform the simulation setting for a triangle network of two‐arm trials. This simulation study aims to designate the factors that may affect the performance of modeling informative MOD (ie, the missingness mechanism depends on the unobserved outcomes25) on the basis of core NMA estimates while assuming missing at random (MAR) for analysis as a starting point.7, 20, 26 Furthermore, the simulation results supplement the observations from a relevant empirical study27 in order to provide empirically‐based recommendations for a proper modeling of MOD in systematic reviews.

This article is organized as follows. In Section 2, we present the Bayesian random‐effects NMA model for binary outcomes in the absence of MOD (as described by Dias et al28), and then, we expand the model to incorporate MOD through pattern‐mixture and selection models.8, 21 Then, we present the prior structures for the missingness parameter that we considered in the simulation study. In Section 3, we illustrate these prior structures under pattern‐mixture and selection models in three published systematic reviews with NMA. In Section 4, we describe a novel simulation setup that combines already established data generation models for conventional meta‐analysis with specific algorithms to incorporate MOD in NMA, and we present the results in Section 5; in Section 6, we discuss the findings and limitations of the study and we provide recommendations, and we conclude in Section 7.

2. MISSING OUTCOME DATA IN NETWORK META‐ANALYSIS

2.1. Bayesian random‐effects NMA model

Consider a network of N trials that investigate different sets of T interventions for a specific condition. The outcome of interest is binary and the frequency of outcome in arm k = 1, 2, …, ai of trial i = 1, 2, …, N is assumed to be a realization from the binomial distribution

ri,kBinpi,k,ni,k,

with pi,k being the underlying risk of an event (the parameter of interest) and ni,k the randomized sample in arm k of trial i. Then, using a logit function, as described by Dias et al,28 the log odds of event in arm k of trial i are defined as follows:

logitpi,k=ui+θi,k1, (1)

ui = logit(pi,1) is the log odds of event in the baseline arm of trial i and θi,k1 is the log odds ratio (OR) of event in arm k relative to the baseline arm that typically follows a normal distribution with mean μti,kti,1 and variance τ2 commonly assumed to be constant across different comparisons. Index ti,k indicates the intervention studied in arm k of trial i.

2.1.1. Incorporating multi‐arm trials

In a trial i with ai > 2 arms, log ORs are correlated since they share the same comparator, and therefore, the vector θi of ai − 1 log ORs follows a multivariate normal distribution28, 29

θi=θi,21θi,ai1MVNai1μti,2ti,1μti,aiti,1,τ2τ2/2τ2/2τ2τ2/2τ2/2τ2/2τ2/2τ2,

which, under the consistency assumption, is equivalent to conditional univariate normal distributions as follows28:

θi,k1θi,21θi,ai11Nμti,kAμti,1A+1aij=2ai1θi,j1μti,jAμti,1A,ai2·ai1·τ2,

where μtA reflects the relative treatment effects of the comparisons with the reference intervention of the network (known as basic parameters30), A. Then, using the consistency equation, the relative treatments effects of all possible nonreference comparisons can be obtained as functions of the basic parameters

μtl=μtAμlA,

with t, l = {B, C, …, T} ∌ A and t ≠ l.

In the Bayesian framework, all parameters of the model are random variables that need proper prior distributions. In the present study, we used noninformative normal prior distribution with mean 0 and variance 10 000 for the location parameters (ie, ui and μtA), whereas we considered HN(0, 1) (median: 0.98, interquartile range [IQR]: 0.51‐1.96) as a weakly informative prior distribution on τ due to trial sparsity in the investigated networks that may compromise a proper estimation of τ.

2.1.2. Rank probabilities for each intervention

To facilitate decision‐making, we can estimate for each intervention the probability of being first, second, third, and so on for a specific outcome.31 These rank probabilities are estimated by ordering the basic parameters in each iteration of the Markov chain Monte Carlo (MCMC) simulation and then, for each intervention, calculating the frequency to achieve a specific rank out of the number of iterations.

2.1.3. Node‐splitting approach to assessing local inconsistency

To assess possible inconsistency locally while using the whole network to obtain an indirect effect for a comparison of a closed loop, Dias et al32 proposed the node‐splitting approach within a Bayesian framework. Specifically, a comparison from a closed loop is isolated (split) and random‐effects meta‐analysis is applied, whereas the remaining network is used to estimate an indirect effect for the split comparison. Then, the difference between direct and indirect effect for that comparison yields a posterior distribution for the inconsistency between these two effects, known as inconsistency factor (IF). A large posterior probability of IF being different from zero (eg, above 95%) provides sufficient evidence that inconsistency may be present in a specific loop. To improve the estimation of τ2, a common τ2 is assumed for both meta‐analysis and NMA model after removing the trials of the split comparison.

2.2. Modeling missing outcome data

2.2.1. Pattern‐mixture model

Suppose that mi,k participants were missing (for reasons related or not to the design and conduct of the trial) in arm k of trial i with probability qi,k, whereas among those ni,ko=ni,kmi,k participants who were observed, only ri,ko experienced the studied outcome with probability pi,ko. It follows that the number of MOD and the number of observed events in arm k of trial i are realizations from the respective binomial distributions

mi,kBinqi,k,ni,kandri,koBinpi,ko,ni,ko.

In the presence of MOD, a pattern‐mixture model can be considered, where pi,k is modeled conditional on whether the underlying event is observed or missing

pi,k=pi,ko·1qi,k+pi,km·qi,k, (2)

where pi,km is the missingness parameter and indicates the probability of event conditional on MOD in arm k of trial i. The parameters pi,ko and qi,k can be estimated directly from the data, whereas we need a proper prior distribution on pi,km to describe a plausible missingness mechanism.

Following the work of Turner et al,8 after rearranging Equation (2) to link pi,ko with the remaining parameters, we obtain the following equation:

pi,ko=pi,kpi,km·qi,k1qi,k.

Subsequently, we use Equation (1) with a random‐effects model for θi,k1 to apply the NMA model.

2.2.2. Selection model

Instead of applying separate binomial distributions, we can jointly model all observed data via the following multinomial distribution20, 21:

Li,kMp1,i,k,p2,i,k,p3,i,k,ni,k,

where Li,k is a vector of all data observed in arm k of trial i, namely, ri,ko,ni,kri,komi,k,mi,kT and

p1,i,k=1c1,i,k·pi,kp2,i,k=1c0,i,k·1pi,k
qi,k=p3,i,k=c1,i,k·pi,k+c0,i,k·1pi,k, (3)

where p1,i,k reflects the marginal probability of observing the underlying event, p2,i,k reflects the marginal probability of observing the underlying nonevent, and qi,k is the probability of MOD out of the randomized sample in arm k of trial i, respectively. The latter equation actually describes the selection model that has already been proposed in a conventional meta‐analysis20 and extended to operate in NMA.21 Then, parameters c1,i,k and c0,i,k indicate the probability of MOD conditional on those participants with the underlying event and the probability of MOD conditional on those participants without the underlying event, respectively. Apart from qi,k, all other parameters are not estimable from the data, and hence, we need to assign proper prior distributions for precise inference to be possible.

2.2.3. Informative missingness odds ratio as missingness parameter

In the present study, we focus on the informative missingness odds ratio (IMOR) parameter, which, under the pattern‐mixture model, is defined as follows7, 8, 33:

δi,kPM=pi,km/1pi,kmpi,kο/1pi,ko,

while under the selection model, it is defined as20, 21

δi,kS=c1,i,k/1c1,i,kc0,i,k/1c0,i,k.

Similar to OR, IMOR takes nonnegative values; nevertheless, due to different factorizations of the same joint distribution of outcome and missingness indicator under pattern‐mixture (PM) and selection (S) models, IMOR has different interpretation with respect to these models:

  • δi,kPM>1, the odds of underlying event among those participants being missing is more likely than the odds of underlying event among those participants being observed in arm k of trial i;

  • δi,kS>1, the odds of MOD among participants with underlying event is more likely than the odds of MOD among participants without underlying event in arm k of trial i;

  • δi,kPM<1, the odds of underlying event among those participants being observed is more likely than the odds of underlying event among those participants being missing in arm k of trial i;

  • δi,kS<1, the odds of MOD among participants without underlying event is more likely than the odds of MOD among participants with underlying event in arm k of trial i;

  • δi,kPM=1, the outcome is similarly distributed between those participants being missing and those being observed in arm k of trial i (ie, MAR assumption);

  • δi,kS=1, MOD are equally likely to occur among participants with underlying event and those without underlying event in arm k of trial i (ie, MAR assumption).

Like OR, IMOR is applied in the logarithmic scale but it is back‐transformed to facilitate in the interpretation

logδi,kPM=φi,kPM=logitpi,kmlogitpi,kologδi,kS=φi,kS=logitc1,i,klogitc0,i,k

under pattern‐mixture model and selection model, respectively.

2.2.4. Structural assumptions to model informative missingness odds ratio

To investigate the underlying missingness mechanisms while acknowledging the uncertainty regarding our prior belief, normal prior distributions are assigned on φi,kl with carefully selected values for the mean ( μi,kφ) and variance ( σi,k2) that reflect a plausible belief about the missingness mechanism on average and make φi,kl identifiable, respectively,

φi,klNμi,kφ,σi,k2forl=PM,S.

Following the work of White et al,20 we considered φi,kl's to be on average MAR (as recommended by relevant published literature to address MOD in the primary analysis7, 20, 26) and exchangeable across trials and interventions, that is, μi,kφ=0 and σi,k2=σ2. White et al20, 33 recommended choosing σ2 ∈ [0.25, 4], which covers a range of values for log IMOR reflecting liberal to conservative uncertainty about the missingness scenario considered. In the present study, we used σ2 = 1:

φi,klN0,1forl=PM,S. (4)

The prior distribution (4) can be shaped further to accommodate our prior beliefs regarding how different φi,kl's can be related within the network.8, 20 Following our empirical study,27 we considered identical and hierarchical prior structure for φi,kl. Under identical structure, φi,kl is assumed to be the same across trials that investigate the same interventions but different across interventions (intervention‐specific)

φi,kl=φtikl,φtiklN0,1

or the same across interventions compared in a trial but different across trials (trial‐specific)

φi,kl=φil,φilN0,1

or identical across all trials and interventions (common‐within‐network)

φi,kl=φl,φlN0,1.

Hierarchical structure “relaxes” the identical structure by assuming φi,kl's to be different yet related to each other. Then, intervention‐specific φi,kl under on average MAR is defined as

φi,klNμtikφ,σtik2withμtikφN0,1,σtikU0,1;

trial‐specific φi,kl on average MAR is defined as

φi,klNμiφ,σi2withμiφN0,1andσiU0,1;

and common‐within‐network φi,kl on average MAR is defined as

φi,klNμφ,σ2withμφN0,1,σU0,1.

We assigned a uniform distribution on σ, σi, and σtik; however, other appropriate prior distributions for variance components can be also considered.34, 35

3. ILLUSTRATIVE EXAMPLES

Among the NMAs we retrieved in our previous study,36 we considered three NMAs with at least one closed loop: one with low MOD in the included trials; one with moderate MOD that are balanced within the trials; and one with moderate MOD that are unbalanced within the trials. Only one NMA had large MOD in the included trials37; however, it was a star‐shaped network, and therefore, we did not consider it in the present study. This classification of networks according to the amount of MOD is based on a decision rule we developed.27 A brief description of this decision rule is available as in Supporting Information (S.1).

We analyzed all three networks under pattern‐mixture and selection models using the prior structures for log IMOR described in Section 2.2.4. These networks included essentially different interventions (placebo and active interventions of different composition), and hence, we expected log IMORs to differ across the interventions as well as to be different yet related across the corresponding trials. Therefore, we considered hierarchical, intervention‐specific prior for log IMORs as the most plausible modeling strategy for the motivating examples. We used interval plots to present the results on the posterior distribution of log ORs for the basic parameters, τ2, and IFs for all 16 models considered, whereas we used rankograms to illustrate the rank probabilities of all interventions on all possible ranks. For each model, we used HN(0, 1) on τ. Two parallel chains with different initial values were used for 100 000 updates and a burn‐in of 10 000 MCMC samples.38 Convergence assessment was based on the Gelman‐Rubin convergence diagnostic, R^,39 and inspection of trace and autocorrelation plots. Initially, we used the R package gemtc 40, 41 to identify the comparisons to split in each network, and then, we inserted these comparisons in the node‐splitting model developed by Dias et al,32 which we expanded further to incorporate the IMOR parameter. The network plots were created with the R package pcnetmeta,42 while the figures illustrating the results were created with the R package ggplot2.43 All analyses were performed in the statistical software R version 3.3.1 using JAGS via the R package R2jags.44, 45

3.1. Example 1: low missing outcome data

Bottomley et al46 investigated the effectiveness of seven interventions measured as the investigator's global assessment response at 4 weeks in patients with moderately severe scalp psoriasis. A total of 9 trials (7 two‐arm, 1 three‐arm, and 1 four‐arm trials) with 5889 patients (median per trial: 237, IQR: 136‐419) formed the network (Figure 1A). For this outcome, MOD were low (median per trial: 3%, IQR: 1%‐6%) in the included trials. Positive log OR indicated a beneficial effect of the first intervention of the comparison.

Figure 1.

sim8207-fig-0001

A series of network plots on (A) the effectiveness of topical therapies for moderately severe scalp psoriasis,46 (B) the efficacy of antidepressants in Parkinson's disease,47 and (C) the prevention of a stroke episode in patients with atrial fibrillation using oral antithrombotics.48 The thickness of the lines and the size of the nodes are proportional to the number of trials and the number of patients randomized in the respective treatments, respectively. ASA+, aspirin plus clopidogrel; Dab110, dabigatran 110 mg; Dab150, dabigatran 150 mg; BDP, betamethasone dipropionate; BMV, betamethasone valerate; SNRI, serotonin–norepinephrine reuptake inhibitor; SSRI, selective serotonin reuptake inhibitor; TCA, tricyclic antidepressant; TCF, two‐compound formulation; VKA, vitamin K antagonist [Colour figure can be viewed at wileyonlinelibrary.com]

Overall, results on log ORs were almost identical for all missingness models (pattern‐mixture or selection model) and prior structures of log IMOR (Supporting Information S.2; Figure S1). As a result, the ranking curves were indistinguishable for different prior structures of log IMOR in both missingness models (Supporting Information S.2; Figure S2). Results were also similar for τ2, although the 95% credible intervals (CrIs) were slightly narrower for hierarchical, trial‐specific prior structure of log IMORs in both missingness models (Supporting Information S.2; Figure S1). Results on node‐splitting were in line with those on basic parameters (Supporting Information S.2; Figure S3).

3.2. Example 2: moderate and balanced missing outcome data

Liu et al47 assessed the comparative effectiveness of four antidepressants and placebo in Parkinson's disease measured as the proportion of patients who had a reduction of at least 50% from the baseline score (Figure 1B). For this outcome, the authors included a total of 11 trials (8 two‐arm and 3 three‐arm trials) with 801 patients (median per trial: 19, IQR: 17‐33). MOD were moderate (median per trial: 16%, IQR: 12%‐24%) and balanced (median per trial: 4%, IQR: 2%‐11%) in the included trials. Positive log OR indicated beneficial effect of the first intervention of the comparison.

Results on log ORs were similar overall, albeit the 95% CrIs were slightly wider for (identical and hierarchical) intervention‐specific prior structure of log IMORs in both missingness models (Supporting Information S.3; Figure S4). Nevertheless, τ2 was slightly lower (and with slightly narrower 95% CrIs) for hierarchical as compared to identical prior structure of log IMOR regardless of further structural assumptions or missingness model. No profound differences were observed on rank probabilities (Supporting Information S.3; Figure S5) and the results from node‐splitting approach (Supporting Information S.3; Figure S6).

3.3. Example 3: moderate and unbalanced missing outcome data

Dogliotti et al48 assessed the comparative effectiveness of seven antithrombotic therapies and placebo in terms of preventing a stroke episode in patients with atrial fibrillation (Figure 1C). The authors included 16 trials (12 two‐arm and 4 three‐arm trials) with 79 808 patients (median per trial: 391, IQR: 211‐2940). MOD were moderate (median per trial: 19%, IQR: 13%‐23%) and slightly unbalanced (median per trial: 7%, IQR: 3%‐10%). Negative log OR indicated a beneficial effect of the first intervention in the comparison.

Different assumptions about the prior structure of log IMOR appeared to implicate mostly on the width of 95% CrIs for all NMA estimates. Overall, intervention‐specific prior of log IMOR led to wider 95% CrIs for log ORs in both missingness models, whereas common‐within‐network prior led to narrower 95% CrIs for log ORs to some extent. In fact, 95% CrI for log ORs were slightly wider under hierarchical than identical structure. Consequently, the superiority of dabigatran at 110 mg and rivaroxaban against placebo turned into inconclusive when log IMOR was assumed to have intervention‐specific prior structure (Figure 2). Furthermore, τ2 was relatively lower and slightly more precise under hierarchical structure, especially, for common‐within‐network log IMORs. Since, the common‐within‐network structure provided the narrowest 95% CrIs for log ORs, it led to relatively larger rank probabilities as opposed to intervention‐specific prior structure, especially for aspirin, aspirin plus clopidogrel, and VKA (Figure 3). Results on node‐splitting were in line with those on basic parameters (Supporting Information S.4; Figure S7).

Figure 2.

sim8207-fig-0002

Interval plots on log ORs for basic parameters (posterior mean and 95% credible interval) and between‐trial variance (τ2; posterior median and 95% credible interval) when there are moderate and unbalanced missing outcome data (MOD) in the network.48 Results are compared in terms of model for MOD (pattern‐mixture, model selection model), structure (hierarchical, identical), and assumption (intervention‐specific, trial‐specific, common‐within‐network) for prior normal distribution on log IMOR assuming missing at random. IMOR, informative missingness odds ratio; OR, odds ratio [Colour figure can be viewed at wileyonlinelibrary.com]

Figure 3.

sim8207-fig-0003

Rankograms of seven interventions when there are moderate and unbalanced missing outcome data (MOD) in the network.48 Posterior mean rank probabilities are compared in terms of model for MOD (pattern‐mixture model, selection model), structure (hierarchical, identical) and assumption (intervention‐specific, trial‐specific, common‐within‐network) for prior normal distribution on log IMOR under missing at random. IMOR, informative missingness odds ratio [Colour figure can be viewed at wileyonlinelibrary.com]

4. SIMULATION SETTING

4.1. Data generation without missing outcome data

We simulated a triangle network of two‐arm trials and three interventions: placebo, new intervention, and old intervention. The comparison of interest was new versus old intervention. We assumed a typical loop like that in the work of Veroniki et al49 with four trials for old intervention versus placebo, three trials for new intervention versus placebo, and one trial for new versus old intervention. To determine the sample size in each arm of every trial, we used information directly from the networks that we collected in our previous empirical work.27 For each trial, we considered equally sized arms with sample size generated from a uniform distribution with support in the range defined by the second and third quartile of the arm sizes (Supporting Information S.5; Figure S8(a))

ni,kE=ni,PCU102,187,k=N,Oplacebo‐controlled trials
ni,NE=ni,OCU128,241old‐controlled trials,

where N, O, and P stand for new intervention, old intervention and placebo, respectively, whereas E and C stand for experimental and control arm, respectively.

We considered a binary (beneficial) outcome measured in the log OR scale. We assumed μNP =  log(2) and μOP =  log(1.5) to be the underlying log OR for new and old intervention against placebo, respectively, whereas we obtained the underlying log OR for new versus old intervention through the consistency equation

μNO=μNPμOP+IF,

with IF being sampled from the t‐distributions t(μ = 0, σ2 = 0.442, df = 3) and t(μ = 1, σ2 = 0.442, df = 3) to reflect low and moderate inconsistency on average, respectively, according to our empirical work (Supporting Information S.5; Figure S8(b)).27

We generated the number of events in each arm of every trial using the data‐generating model (DGM) described by Hartung and Knapp for a random‐effects pairwise meta‐analysis.50, 51 The description of this DGM is available as in Supporting Information (S.6). Using information from our network collection,27 initial event risks for the control arms were generated from a uniform distribution with support in the range defined by the second and third quartile of the event risks (Supporting Information S.5; Figure S8(c))

pi,PC,0U0.27,0.40andpi,OC,0U0.63,0.76

for placebo‐controlled and old‐controlled trials, respectively.

We incorporated τ2 (assumed common‐within‐network) in the DGM assuming smaller variability in log odds for placebo (Supporting Information S.5; Figure S8(d)) but equal in log odds for active arms, respectively. In terms of scenarios for τ2, we selected the predictive log‐normal distributions L N(−3.95, 1.342) (median: 0.02; IQR: 0.01‐0.04) and L N(−2.56, 1.742) (median: 0.08; IQR: 0.03‐0.26) to reflect small and substantial τ2, respectively. These predictive distributions referred to the expected τ2 in a future meta‐analysis for all‐cause mortality and a generic healthcare setting, respectively.52

Finally, we generated the true probability of being best for each intervention by ordering the simulated true log ORs of placebo comparisons as generated from the normal distribution N(μkP, τ2) with k = N, O and then calculating the number of times each intervention ranked first out of the total simulations.

4.2. Data generation while incorporating missing outcome data

Following the motivating examples (Section 3), we focused only on moderate and large MOD as they affected the performance of the modeling strategies to some extent, contrary to low MOD. Note that, under low MOD, we found that all modeling strategies had almost the same performance for log OR, IF and probability of being best but similar performance for τ2 (results not shown). To ensure balance in MOD between the compared arms, we generated %MOD in the experimental arm, qi,kE with k = N, O, from U(0.05, 0.20) and U(0.21, 0.40) to indicate moderate and large MOD, respectively (in line with the “five‐and‐twenty rule,” as described in Supporting Information S.1), and we considered qi,PC=qi,kE with k = N, O and qi,OC=qi,NE for the control arms in placebo‐controlled and old‐controlled trials, respectively. In another scenario, to capture the imbalance in MOD between the compared arms, we assumed placebo to have more MOD than the active arms following our empirical study (Supporting Information S.5; Figure S8(e)) and old intervention to have more MOD in the old‐controlled trials.27 Details on the generation of unbalanced MOD are available as in Supporting Information (S.7).

Then, we generated the number of MOD in each arm of every trial through the following binomial distributions:

mi,kEBinqi,kE,ni,kE,k=N,Omi,kCBinqi,kC,ni,kC,k=O,P

for the experimental and control arm, respectively. We used intervention‐specific log IMORs under the pattern‐mixture model to indicate the outcome among the missing participants in each arm of every trial. Specifically, for each trial, we assumed patients randomized in the new or old intervention to be twice more likely to be missing due to the improvement of their outcome as opposed to patients receiving placebo. We considered σ2 = 1 for the variance of log IMORs. As another scenario, we assumed MAR on average (ie, μi,kφ=0) with σ2 = 1. Details on the generation of log IMORs are available as in Supporting Information (S.8).

Then, we used the linkage function as described by Turner et al8 (equation 7, there) to obtain the probability of events given observed outcomes, pi,kE,obs and pi,kC,obs in arm k of trial i for the experimental and control arm, respectively. The formula to obtain the probability of observed events in each arm is available as in Supporting Information (S.9). Finally, we generated the number of events given the observed outcomes in each arm of every trial as follows:

ri,kE,obsBinpi,kE,obs,ni,kEmi,kE,k=N,Ori,kC,obsBinpi,kC,obs,ni,kCmi,kC,k=O,P

for the experimental and control arm, respectively. Table 1 summarizes all simulation scenarios considered in the present study.

Table 1.

Scenarios for the simulation setup

Number of trials per comparison
Typical loop NO = 1, NP = 3OP = 4
Trial size (  ni,kE=ni,kC=ni in trial i)
Placebo‐controlled trials ni ∼ U(102, 187)
Old‐controlled trials ni ∼ U(128, 241)
Initial event rates of control arm in trial i
Placebo‐controlled trials
pi,PC,0U0.27,0.40
Old‐controlled trials
pi,OC,0U0.63,0.76
Balanced risk of missing outcome data ( 
qi,kE=qi,kC=qi
in triali)
Moderate qi ∼ U(0.05, 0.20)
Large qi ∼ U(0.21, 0.40)
Unbalanced risk of missing outcome data ( 
qi,kE<qi,kC
in triali)
Moderate qi,kEU0.05,0.10, qi,kCU0.11,0.20
Large qi,kEU0.21,0.30, qi,kCU0.31,0.40
Missingness mechanisms via log (IMOR)
Informative φi,P ∼ TN(μ =  −log (2), σ2 = 1, a =  log (1))
φi,k ∼ TN(μ =  log (2), σ2 = 1, a =  log (1)) k = N, O
Missing at random φi,k ∼ N(0, 1) k = N,O,P
Treatment effects
Basic parameters LORNP =  log (2), LOROP =  log (1.5)
Functional parameter LORNO = LORNP − LOROP + IF
Loop inconsistency
Inconsistency factor (IF) IF ∼ t(μ = 0, σ2 = 0.442, df = 3) (low)
IF ∼ t(μ = 1, σ2 = 0.442, df = 3) (moderate)
Common between‐trial variance
Predictive distribution τ2 ∼ (−3.95, 1.342) (small)
τ2 ∼ (−2.56, 1.742) (substantial)
Probability of being best
New intervention 93% and 76% for small and substantial τ2, respectively
Old intervention 7.3% and 24% for small and substantial τ2, respectively
Placebo 0% and 0.1% for small and substantial τ2, respectively

Note: C: control; E: experimental arm; IF: inconsistency factor; IMOR: informative missingness odds ratio; LOR: log odds ratio; N: new intervention; O: old intervention; P: placebo.

Typical loop as defined by Veroniki et al.49

Using predictive log‐normal distributions that correspond to all‐cause mortality and generic health setting for small and substantial between‐trial variance, respectively.52

4.3. Results presentation and model specification

For each scenario, we simulated 1000 triangle networks and, for each scenario, we evaluated the posterior distribution of μNO, τ2, IF and probability of being best for each intervention. For each NMA estimate, we used interval plots to present the simulation results in order to fully reflect the dispersion of the results for each scenario. We decided to present in the main text only results on prior structures of log IMOR under pattern‐mixture model as it is the most frequently reported model in systematic reviews.19 Results on prior structures of log IMOR under selection model are available in Supporting Information (S.11; Figures S10‐S13). Furthermore, we focused on informative MOD with moderate and large extent for being the most plausible scenarios in a medical setting. Results on prior structures of log IMOR when MOD are MAR can be found in Supporting Information (S.12; Figures S14‐S17). Simulations and analyses were performed in the line with the motivating examples (Section 3). For each of the 1000 simulations, thinning equal to 3 was used for 20 000 updates and a burn‐in of 2000 MCMC samples.38

5. RESULTS

5.1. Posterior distribution of log OR (μNO)

Under low inconsistency, the posterior mean of log OR almost converged with the simulated distribution for all prior structures of log IMOR regardless of extent and balance of MOD (Figure 4). Credible intervals were broadly similar for moderate MOD. Subtle differences in the CrIs were observed for large MOD: assuming intervention‐specific log IMORs led to slightly wider CrIs (similarly for identical and hierarchical structure) compared to trial‐specific and common‐within‐network prior structure. Substantial τ2 naturally led to wider CrIs compared to small τ2 without affecting the point estimate. With moderate inconsistency, the posterior distribution of log ORs deviated from the simulated distribution in all prior structures of log IMOR.

Figure 4.

sim8207-fig-0004

Posterior distribution of log OR (between new and old intervention) under informative missingness while using pattern‐mixture model and accounting for the extent of missing outcome data (moderate, large), balance of missing outcome data (balance, imbalance), extent of τ2 (small, substantial), and extent of inconsistency (low, moderate). The horizontal dotted lines reflect the 95% interval and mean of the simulated distribution of log OR under low and moderate true inconsistency. IF, inconsistency factor; MOD, missing outcome data [Colour figure can be viewed at wileyonlinelibrary.com]

5.2. Posterior distribution of τ2

Posterior median of τ2 was close to zero in all prior structures of log IMOR for low inconsistency and small τ2, whereas, as expected, it increased for moderate inconsistency and/or substantial τ2. For moderate MOD and low inconsistency, posterior median and CrI for τ2 were quite similar across all prior structures of log IMOR, whereas for large MOD, posterior median for τ2 increased slightly with wider widths of CrIs that slightly differed for different assumptions of log IMOR within the hierarchical and identical structure (Figure 5). Identical structure led systematically to slightly wider CrIs in most prior structures of log IMOR as compared to hierarchical structure. In addition, the point estimates were slightly larger for identical structure, particularly, for moderate inconsistency and large MOD.

Figure 5.

sim8207-fig-0005

Posterior distribution of τ2 under informative missingness while using pattern‐mixture model and accounting for the extent of missing outcome data (moderate, large), balance of missing outcome data (balance, imbalance), extent of τ2 (small, substantial), and extent of inconsistency (low, moderate). The horizontal dotted lines reflect the 95% interval and median of the simulated distribution of small and substantial τ2. IF, inconsistency factor; MOD, missing outcome data [Colour figure can be viewed at wileyonlinelibrary.com]

5.3. Posterior distribution of IF

Under low inconsistency, the posterior mean of IF was almost zero (ie, evidence of consistency on average) in all prior structures of log IMOR and for all scenarios (Figure 6). Overall, CrIs were similarly wider in the presence of substantial τ2. In the presence of moderate inconsistency, all prior structures of IMOR estimated the true IF, and hence, the point estimates deviated from zero irrespective of extent and balance of MOD.

Figure 6.

sim8207-fig-0006

Posterior distribution of inconsistency factor (IF) under informative missingness while using pattern‐mixture model and accounting for the extent of missing outcome data (moderate, large), balance of missing outcome data (balance, imbalance), extent of τ2 (small, substantial) and extent of inconsistency (low, moderate). The horizontal dotted lines reflect the 95% interval and mean of the simulated distribution of low and moderate IF. MOD, missing outcome data [Colour figure can be viewed at wileyonlinelibrary.com]

5.4. Posterior distribution of probability of being best

The posterior mean of the probability of being best for new intervention was consistently below the simulated distribution for all prior structures of log IMOR, especially, for large MOD and low inconsistency (Supporting Information S.10; Figure S9). Interestingly, contrary to low inconsistency, moderate inconsistency lowered the posterior mean of the probability of being best the least for all prior structures. Within each scenario, the posterior mean of the probability of being best was similar across all prior structures but slightly larger for unbalanced MOD. Nevertheless, intervention‐specific log IMORs led to slightly smaller posterior mean of the probability of being best, especially, for large MOD, moderate inconsistency and small τ2. The posterior mean of the probability of being best almost overlapped with the simulated distribution for moderate MOD, small τ2, and moderate inconsistency. Results on the posterior mean of the probability of being best for old intervention and placebo can be found in the Supporting Information (results not shown). Overall, different scenarios and prior structures of log IMOR did not impact on the hierarchy of the interventions.

6. DISCUSSION

Using three published networks with different extent of MOD as motivating examples, we compared pattern‐mixture with selection model while considering six different prior structures of log IMOR that reflected our prior beliefs about the (dis)similarity of log IMORs within the network. Then, on the basis of the results from the motivating examples, we set up a simulation study using empirical‐based scenarios to evaluate more in‐depth the performance of these prior structures of log IMOR in terms of posterior distribution of log OR, τ2, IF and probability of being best per intervention. We focused on the performance of prior structures when informative MOD (the most plausible scenario in a medical setting) were analyzed under MAR (the recommended primary analysis for MOD). To our knowledge, this is the first simulation study that evaluates statistical modeling of aggregated MOD using Bayesian approaches.

Ultimate goal of the present study was to supplement our observations from our empirical study on these modeling strategies.27 In our empirical study,27 we used Bland‐Altman plots to investigate the degree of agreement among these strategies in terms of NMA estimates. The majority of the networks considered had either low or moderate and balanced MOD. Therefore, we were not able to conclude on the agreement of the strategies when MOD were large or moderate and unbalanced. Furthermore, with an empirical study, we cannot infer on performance measures, such as bias. Consequently, the present simulation study addressed the aforementioned limitations and, additionally, allowed us investigating the performance of the strategies under different scenarios for the NMA estimates in order to understand the circumstances that may compromise the performance of the strategies.

The last two motivating examples agreed with our empirical study,27 which indicated that, for moderate and large MOD, (hierarchical and identical) intervention‐specific prior structure of log IMOR led to larger posterior standard deviation of log ORs as compared to trial‐specific and common‐within‐network prior structures—the latter two led overall to similar posterior distributions of log ORs. White et al also noticed that the uncertainty around meta‐analysis log OR was larger for intervention‐specific prior structure while similar for trial‐specific and common‐within‐network prior structures.20 Our simulation revealed this pattern for large MOD only, regardless of balance of MOD. This performance of intervention‐specific prior structure was anticipated as it assumes MOD to be differently informative in different interventions, and therefore, it substantially down‐weights trials with moderate or large MOD leading to larger posterior standard deviation of summary log OR.33

Furthermore, both the present study and our empirical study27 demonstrated that hierarchical prior structure of log IMOR led to slightly more precise τ2 compared to identical prior structure, particularly for moderate, unbalanced MOD (Section 3.3). According to our simulation study, this performance was more profound for large MOD with concurrence of inconsistent evidence and/or substantial τ2. The extent of informative missingness (as quantified via log IMOR) was simulated to vary across the included trials for the same intervention (Equation (1) in Supporting Information S.8); however, the identical structure did not capture this variability yielding spuriously narrower CrIs for the study‐specific log ORs as compared to hierarchical structure which, in turn, led to relatively larger τ2 and uncertainty thereof.

The third motivating example indicated that common‐within‐network prior structure provided slightly more precise estimation of τ2 compared to intervention‐ and trial‐specific prior structures. Nevertheless, in the simulation study, this pattern was less obvious for moderate, unbalanced MOD, and small τ2. Possible explanation may be that the motivating example had almost three times more trials than the simulated networks, and in conjunction with the common‐within‐network being the least data demanding structure of log IMOR, τ2 was estimated with relatively more precision for this prior structure in the motivating example.

We found that pattern‐mixture and selection models gave almost identical results for each prior structure of log IMOR in the motivating examples and simulation study (Supporting Information S.12). While these two models lead to fundamentally opposite factorizations of the joint distribution of the missingness indicator and outcome, the parameter of interest pi,k is not affected by this factorization, because, in both models, pi,k is function of qi,k and φi,kl (see Equations (2) and (3)) with the same informative prior distribution being assigned on φi,kl. Where these models differ is on the conditional probabilities that define φi,kl (Section 2.2.3). Nevertheless, if one is interested in investigating the interventions to subgroups of trials that are believed to have different measurement patterns, then pattern‐mixture model may be the proper option.18 For example, (as judged, for instance, by the Cochrane's risk of bias tool; chapter 8 in the work of Higgins and Green6), if poorly conducted trials have more MOD than well‐conducted trials—and the researcher believes that compared to those leaving poorly conducted trials, patients completing these trials may be more likely to have experienced the beneficial outcome—the researcher should investigate whether the pattern of outcomes in these two trial settings may affect differently the interventions compared. To our knowledge, pattern‐mixture model has not been applied yet in series of trials with the aim to provide further insights on the effectiveness of the interventions on subgroups of different patterns of outcome. Instead, if one is interested in the effectiveness of the interventions in the whole population, then pattern‐mixture and selection models may be used interchangeably in the analysis of series of trials—although, in principle, the latter is a more natural option18 as it directly reflects the taxonomy of missingness mechanisms as described by Little and Rubin.15

Deciding on the assumption for log IMOR shall be primarily tailored to empirical knowledge about the intervention and trial characteristics for the condition under investigation.8, 20 For example, contrary to active‐controlled trials in schizophrenia, placebo‐controlled trials lead to greater dropout rate among patients without improvement in their outcomes.53, 54 Then, the researcher can consider placebo‐ and active‐specific priors on log IMOR and further investigate the sensitivity of results to using identical and hierarchical structures. In another example, multi‐center trials in psychiatry tend to have higher dropout rate (and hence log IMOR in these trials is more likely to be different from 0) than single‐center trials; if log IMORs are believed not to differ among the compared interventions, and the researcher has collected for each trial information on the number of centers, then he/she should assign hierarchical, multi‐center‐specific, and single‐center‐specific priors on log IMOR so that log IMORs are different yet related in the corresponding trials. In our simulation study, the proper prior structure of log IMOR was intervention‐specific because we assumed placebo to trigger different missingness mechanisms as opposed to new and old intervention. However, by misspecifying the prior structure using trial‐specific or common‐within‐network prior structure appeared to affect the uncertainty around the log OR leading to narrower CrIs of log OR when MOD were moderate or large. While the inferences about the relative effectiveness of the interventions were not be affected in our simulations, the robustness of the inferences for dabigatran 110 mg and rivaroxaban against placebo (third motivating example) were sensible to the prior structure of log IMOR.

In the present study, we addressed aggregated MOD using two popular models for MOD and six different prior structures of log IMOR without accounting, in addition, for important effect modifiers. van Buuren et al55 developed a multiple imputation (MI) model that incorporates a delta parameter like IMOR under pattern‐mixture model to investigate the degree of departure from MAR in survival analysis in a clinical trial. Extending this model to operate in a collection of trials investigating two or more interventions is an interesting yet unexplored area (to our knowledge) for further work. Provided that we had access to individual patient data (IPD) and enough studies in the network to allow for effect‐modification adjustments, MI based on missing not at random (MNAR) assumptions would be a more elegant modeling strategy—though computationally more intensive. This is because MI is already increasingly used for offering a relatively simply and attractive way to account also for the uncertainty induced by imputations (commonly applied under MAR) while adjusting the model for important predictors. In addition, IPD has been often considered as gold standard for synthesizing series of trials as it allows a more rigorous investigation of statistical heterogeneity that—contrary to standard aggregated analysis—protects against the risk for ecological bias, particularly for subject‐level characteristics.35 Since addressing MOD is based on untestable assumptions about missing outcomes (the popular MAR assumption cannot be tested from the observed outcomes), extending “standard” MI to investigate the sensitivity to MAR via MNAR models offers more flexibility.

The limitations of our study pertain mostly to the simulation setup. Firstly, we used Bayesian approaches as we intended to compare different Bayesian modeling strategies for binary MOD in terms of NMA estimates. Consequently, we preferred not to infer on the performance of the models in terms of frequentist measures, such as type I error, efficiency, and coverage; contrariwise, our inferences stemmed from the posterior distribution of the NMA estimates for different scenarios and models. Secondly, we considered a simple network of three interventions and two‐arm trials with binary outcome data (the most prevalent outcome type in systematic reviews23). A more complex network with the addition of multi‐arm trials—a “typical” network in practice24—will shed more light on the implications of network complexity on the NMA estimates across different prior structures of log IMOR. For instance, in a complex yet sparse network (where the number of trials and observed comparisons are limited), identical prior structure may perform better to hierarchical structure as it is the least data demanding (alike the common‐within‐network prior structure). Thirdly, we did not investigate the impact of event frequency since we considered only frequent events. As noted in the work of Carpenter and Kenward,56 “if an event (eg, death or a serious side effect) is rare, missing [outcome] data on very few patients can markedly alter estimated event rates,” and therefore, affect substantially the NMA estimates. Fourthly, the degree of unbalanced MOD considered in the simulation setup was much smaller than the total extent of MOD in each trial (Supporting Information S.7). Consequently, the width of CrI for log OR under common‐within‐network and trial‐specific prior structures (they assume MOD to be equally informative in the whole network and within each trial, respectively, and hence, they down‐weight trials with unbalanced MOD in the compared arms33) remained narrower than the width of CrI for log OR under intervention‐specific prior structure when MOD were unbalanced. A much larger imbalance of MOD may have resulted in more imprecise log OR under common‐within‐network and trial‐specific prior structures. However, we did not observe such extent of imbalance in our empirical study (Supporting Information S.5; Figure S8(f)). Lastly, we dealt with convergence issues (via inspection of the trace and autocorrelation plots) after applying identical common‐within‐network in both pattern‐mixture and selection models; this issue was not tackled after we increased thinning at 6 and 10 (Figures not shown).

Recommendations for the reviewer

  • The reviewer should decide in advance on the proper prior structure of log IMOR to address aggregated MOD that best aligns with the condition investigated and the interventions forming the network; otherwise, misspecification of the prior structure may lead to spurious estimation of the uncertainty around log OR with implications for the conclusions—as shown in the motivating examples and simulation study.

  • Pattern‐mixture and selection models can be applied interchangeably to infer on the effectiveness of the compared interventions on the whole population.

  • Both identical and hierarchical structure may be considered in the context of a sensitivity analysis; though, we expect log IMORs to be different (since the extent of MOD will differ across trial‐arms, among other reasons) yet related to each other, and hence, we regard hierarchical structure to be more plausible in practice.

7. CONCLUSIONS

Assuming MAR on average as a starting point to analyze informative MOD under different prior structures of log IMOR appeared to implicate mainly the precision of the NMA estimates without affecting our conclusions about the effectiveness and the hierarchy of the interventions. Nevertheless, the inferences from the present simulation study were greatly restricted by the scenarios considered. Reviewers should decide already at the protocol of the systematic review on the prior structure of log IMOR according to the condition and interventions investigated. Our results may be also generalized to conventional meta‐analyses with binary outcome.

Supporting information

SIM_8207‐Supp‐0001‐Supplementary Information.docx

ACKNOWLEDGEMENTS

Loukia M. Spineli was supported by the German Research Foundation (Deutsche Forschungsgemeinschaft) (grant SP 1664/1‐1). Chrysostomos Kalyvas (CK) is employed by Merck Sharp & Dohme (MSD). This article reflects the views of CK (and the other authors) and should not be construed to represent MSD's views or policies. We would like to thank the Utrecht Bioinformatics Center for utilizing their high‐performance computing cluster to perform the extensive simulation study.

DATA AVAILABILITY

The data that support the findings of this study (motivating examples) are not available in the article but they are cited in the reference section.

Spineli LM, Kalyvas C, Pateras K. Participants' outcomes gone missing within a network of interventions: Bayesian modeling strategies. Statistics in Medicine. 2019;38:3861–3879. 10.1002/sim.8207

The copyright line for this article was changed on 12 November 2020 after original online publication.

REFERENCES

  • 1. Spineli LM, Pandis N, Salanti G. Reporting and handling missing outcome data in mental health: a systematic review of Cochrane systematic reviews and meta‐analyses. Res Synth Methods. 2015;6(2):175‐187. [DOI] [PubMed] [Google Scholar]
  • 2. Kahale LA, Diab B, Brignardello‐Petersen R, et al. Systematic reviews do not adequately report or address missing outcome data in their analyses: a methodological survey. J Clin Epidemiol. 2018;99:14‐23. [DOI] [PubMed] [Google Scholar]
  • 3. Akl EA, Carrasco‐Labra A, Brignardello‐Petersen R, et al. Reporting, handling and assessing the risk of bias associated with missing participant data in systematic reviews: a methodological survey. BMJ Open. 2015;5(9):e009368 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Spineli LM. Missing binary data extraction challenges from Cochrane reviews in mental health and Campbell reviews with implications for empirical research. Res Synth Methods. 2017;8(4):514‐525. [DOI] [PubMed] [Google Scholar]
  • 5. Akl EA, Johnston BC, Alonso‐Coello P, et al. Addressing dichotomous data for participants excluded from trial analysis: a guide for systematic reviewers. PLoS ONE. 2013;8(2):e57132 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Higgins JP, Green S, ed. Cochrane Handbook for Systematic Reviews of Interventions Version 5.1.0 [Updated March 2011]. The Cochrane Collaboration; 2011. http://handbook-5-1.cochrane.org/ [Google Scholar]
  • 7. Higgins JPT, White IR, Wood AM. Imputation methods for missing outcome data in meta‐analysis of clinical trials. Clinical Trials. 2008;5(3):225‐239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Turner NL, Dias S, Ades AE, Welton NJ. A Bayesian framework to account for uncertainty due to missing binary outcome data in pairwise meta‐analysis. Statist Med. 2015;34(12):2062‐2080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Spineli LM. Modeling missing binary outcome data while preserving transitivity assumption yielded more credible network meta‐analysis results. J Clin Epidemiol. 2019;105:19‐26. [DOI] [PubMed] [Google Scholar]
  • 10. Ejere H, Schwartz E, Wormald R, et al. Ivermectin for onchocercal eye disease (river blindness). Cochrane Database Syst Rev. 2012;(8):CD002219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Mayo‐Wilson E, Hutfless S, Li T, et al. Integrating multiple data sources (MUDS) for meta‐analysis to improve patient‐centered outcomes research: a protocol for a systematic review. Systematic Reviews. 2015;4:143 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Virgili G, Michelessi M, Parodi MB, et al. Laser treatment of drusen to prevent progression to advanced age‐related macular degeneration. Cochrane Database Syst Rev. 2015;(10):CD006537. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Watt J, Goodarzi Z, Tricco AC, Veroniki AA, Straus SE. Comparative safety and efficacy of pharmacological and non‐pharmacological interventions for the behavioral and psychological symptoms of dementia: protocol for a systematic review and network meta‐analysis. Systematic Reviews. 2017;6(1):182 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Veroniki AA, Straus SE, Ashoor HM, et al. Comparative safety and effectiveness of cognitive enhancers for Alzheimer's dementia: protocol for a systematic review and individual patient data network meta‐analysis. BMJ Open. 2016;6(1):e010251 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Little RJA, Rubin DB. Statistical Analysis With Missing Data. 2nd ed. Hoboken, NJ: John Wiley & Sons, Inc; 2002. [Google Scholar]
  • 16. Little RJA. Pattern‐mixture models for multivariate incomplete data. J Am Stat Assoc. 1993;88(421):125‐134. [Google Scholar]
  • 17. Rubin DB. Inference and missing data. Biometrika. 1976;63(3):581‐592. [Google Scholar]
  • 18. Michiels B, Molenberghs G, Lipsitz SR. Selection models and pattern‐mixture models for incomplete data with covariates. Biometrics. 1999;55(3):978‐983. [DOI] [PubMed] [Google Scholar]
  • 19. Akl EA, Shawwa K, Kahale LA, et al. Reporting missing participant data in randomised trials: systematic survey of the methodological literature and a proposed guide. BMJ Open. 2015;5(12):e008431 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. White IR, Welton NJ, Wood AW, et al. Allowing for uncertainty due to missing data in meta‐analysis—part 2: hierarchical models. Statist Med. 2008;27(5):728‐745. [DOI] [PubMed] [Google Scholar]
  • 21. Spineli LM, Higgins JPT, Cipriani A, Leucht S, Salanti G. Evaluating the impact of imputations for missing participant outcome data in a network meta‐analysis. Clinical Trials. 2013;10(3):378‐388. [DOI] [PubMed] [Google Scholar]
  • 22. Efthimiou O, Debray TP, van Valkenhoef G, et al. GetReal in network meta‐analysis: a review of the methodology. Res Synth Methods. 2016;7(3):236‐263. [DOI] [PubMed] [Google Scholar]
  • 23. Petropoulou M, Nikolakopoulou A, Veroniki AA, et al. Bibliographic study showed improving statistical methodology of network meta‐analyses published between 1999 and 2015. J Clin Epidemiol. 2017;82:20‐28. [DOI] [PubMed] [Google Scholar]
  • 24. Nikolakopoulou A, Chaimani A, Veroniki AA, Vasiliadis HS, Schmid CH, Salanti G. Characteristics of networks of interventions: a description of a database of 186 published networks. PLOS ONE. 2014;9(1):e86754 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Diggle P, Kenward MG. Informative dropout in longitudinal data analysis (with discussion). J Royal Stat Soc Series C. 1994;43(1):49‐93. [Google Scholar]
  • 26. White IR, Carpenter J, Horton NJ. Including all individuals is not enough: lessons for intention‐to‐treat analysis. Clinical Trials. 2012;9(4):396‐407. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Spineli LM. An empirical comparison of Bayesian modelling strategies for missing binary outcome data in network meta‐analysis. BMC Med Res Methodol. 2019;19(1):86 10.1186/s12874-019-0731-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Dias S, Sutton AJ, Ades AE, Welton NJ. Evidence synthesis for decision making 2: a generalized linear modeling framework for pairwise and network meta‐analysis of randomized controlled trials. Med Decis Making. 2013;33(5):607‐617. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Franchini AJ, Dias S, Ades AE, Jansen JP, Welton NJ. Accounting for correlation in network meta‐analysis with multi‐arm trials. Res Synth Methods. 2012;3(2):142‐160. [DOI] [PubMed] [Google Scholar]
  • 30. Lu G, Ades AE. Assessing evidence inconsistency in mixed treatment comparisons. J Am Stat Assoc. 2006;101(474):447‐459. [Google Scholar]
  • 31. Salanti G, Ades AE, Ioannidis JPA. Graphical methods and numerical summaries for presenting results from multiple‐treatment meta‐analysis: an overview and tutorial. J Clin Epidemiol. 2011;64(2):163‐171. [DOI] [PubMed] [Google Scholar]
  • 32. Dias S, Welton NJ, Caldwell DM, Ades AE. Checking consistency in mixed treatment comparison meta‐analysis. Statist Med. 2010;29(7‐8):932‐944. [DOI] [PubMed] [Google Scholar]
  • 33. White IR, Higgins JP, Wood AM. Allowing for uncertainty due to missing data in meta‐analysis—part 1: two‐stage methods. Statist Med. 2008;27(5):711‐727. [DOI] [PubMed] [Google Scholar]
  • 34. Lambert PC, Sutton AJ, Burton PR, Abrams KR, Jones DR. How vague is vague? A simulation study of the impact of the use of vague prior distributions in MCMC using WinBUGS. Statist Med. 2005;24(15):2401‐2428. [DOI] [PubMed] [Google Scholar]
  • 35. Debray TP, Moons KG, van Valkenhoef G, et al. Get real in individual participant data (IPD) meta‐analysis: a review of the methodology. Res Synth Methods. 2015;6(4):293‐309. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Spineli LM, Yepes‐Nuñez JJ, Schünemann H. A systematic survey shows that reporting and handling of missing outcome data in networks of interventions is poor. BMC Med Res Methodol. 2018;18(1):115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Gallego‐Galisteo M, Villa‐Rubio A, Alegre‐Del Rey E, Márquez‐Fernández E, Ramos‐Báez JJ. Indirect comparison of biological treatments in refractory rheumatoid arthritis. J Clin Pharm Ther. 2012;37(3):301‐307. [DOI] [PubMed] [Google Scholar]
  • 38. Lunn DJ, Thomas A, Best N, Spiegelhalter D. WinBUGS ‐ a Bayesian modelling framework: concepts, structure, and extensibility. Stat Comput. 2000;10(4):325‐337. [Google Scholar]
  • 39. Gelman A, Rubin D. Inference from iterative simulation using multiple sequences. Statistical Science. 1992;7(4):457‐472. [Google Scholar]
  • 40. van Valkenhoef G, Dias S, Ades AE, Welton NJ. Automated generation of node‐splitting models for assessment of inconsistency in network meta‐analysis. Res Synth Methods. 2016;7(1):80‐93. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. van Valkenhoef G, Kuiper J. gemtc: Network Meta‐Analysis Using Bayesian Methods. 2016. https://cran.r-project.org/web/packages/gemtc/gemtc.pdf
  • 42. Lin L, Zhang J, Hodges JS, Chu H. Performing arm‐based network meta‐analysis in R with the pcnetmeta package. J Stat Softw. 2017;80(5). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Wickham H. ggplot2: Elegant Graphics for Data Analysis. New York, NY: Springer‐Verlag; 2009. [Google Scholar]
  • 44. Plummer M. JAGS Version 3.4.0 User Manual. 2013:1‐42.
  • 45. R Core Team . A Language and Environment for Statistical Computing. 2016. https://www.r-project.org
  • 46. Bottomley JM, Taylor RS, Ryttov J. The effectiveness of two‐compound formulation calcipotriol and betamethasone dipropionate gel in the treatment of moderately severe scalp psoriasis: a systematic review of direct and indirect evidence. Curr Med Res Opin. 2011;27(1):251‐268. [DOI] [PubMed] [Google Scholar]
  • 47. Liu J, Dong J, Wang L, Yan P, Sun S. Comparative efficacy and acceptability of antidepressants in Parkinson's disease: a network meta‐analysis. PLOS ONE. 2013;8(10):e76651 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Dogliotti A, Paolasso E, Giugliano RP. Current and new oral antithrombotics in non‐valvular atrial fibrillation: a network meta‐analysis of 79808 patients. Heart. 2014;100(5):396‐405. [DOI] [PubMed] [Google Scholar]
  • 49. Veroniki AA, Mavridis D, Higgins JPT, Salanti G. Characteristics of a loop of evidence that affect detection and estimation of inconsistency: a simulation study. BMC Med Res Methodol. 2014;14:106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Hartung J, Knapp G. A refined method for the meta‐analysis of controlled clinical trials with binary outcome. Statist Med. 2001;20(24):3875‐3889. [DOI] [PubMed] [Google Scholar]
  • 51. Pateras K, Nikolakopoulos S, Roes K. Data‐generating models of dichotomous outcomes: heterogeneity in simulation studies for a random‐effects meta‐analysis. Statist Med. 2018;37(7):1115‐1124. [DOI] [PubMed] [Google Scholar]
  • 52. Turner RM, Jackson D, Wei Y, Thompson SG, Higgins JPT. Predictive distributions for between‐study heterogeneity and simple methods for their application in Bayesian meta‐analysis. Statist Med. 2015;34(6):984‐998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Kemmler G, Hummer M, Widschwendter C, Fleischhacker WW. Dropout rates in placebo‐controlled and active‐control clinical trials of antipsychotic drugs: a meta‐analysis. Arch Gen Psychiatry. 2005;62(12):1305‐1312. [DOI] [PubMed] [Google Scholar]
  • 54. Spineli LM, Leucht S, Cipriani A, Higgins JPT, Salanti G. The impact of trial characteristics on premature discontinuation of antipsychotics in schizophrenia. Eur Neuropsychopharmacol. 2013;23(9):1010‐1016. [DOI] [PubMed] [Google Scholar]
  • 55. van Buuren S, Boshuizen HC, Knook DL. Multiple imputation of missing blood pressure covariates in survival analysis. Statist Med. 1999;18(6):681‐694. [DOI] [PubMed] [Google Scholar]
  • 56. Carpenter JR, Kenward MG. Missing data in randomised controlled trials–a practical guide. Health Technology Assessment Methodology Programme. Birmingham, UK: London School of Hygiene and Tropical Medicine; 2007. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

SIM_8207‐Supp‐0001‐Supplementary Information.docx


Articles from Statistics in Medicine are provided here courtesy of Wiley

RESOURCES