Abstract
Objectives:
In the field of relapsed or refractory multiple myeloma (RRMM), between-trial or indirect comparisons are required to estimate relative treatment effects between competing interventions based on the available evidence. Two approaches are frequently used in RRMM: network meta-analysis (NMA) and unanchored matching-adjusted indirect comparison (MAIC). The objective of the current study was to evaluate the relevance and credibility of published NMA and unanchored MAIC studies aiming to estimate the comparative efficacy of treatment options for RRMM.
Methods:
Twelve relevant studies were identified in the published literature (n = 7) and from health technology assessment agencies (n = 5). Data from trials were extracted to identify between-trial differences that may have biased results. Credibility of the performed analyses and relevance of the research questions were critically appraised using the International Society for Pharmacoeconomics and Outcomes Research (ISPOR) checklist and feedback based on consultations with clinical experts.
Results:
The identified studies concerned NMAs of randomized controlled trials (RCTs; n = 7), unanchored MAICs (n = 4), or both types of analyses (n = 1). According to clinical expert consultation, the majority of the identified NMAs did not consider differences in prior therapies or treatment duration across the RCTs included in the analyses, thereby compromising the relevance.
Conclusion:
Based on the results and feedback from clinicians, the majority of NMAs did not consider prior treatment history or treatment duration, which resulted in nonrelevant comparisons. Furthermore, it may have compromised the credibility of the estimates owing to differences in effect-modifiers between the different trials. Pairwise comparisons by means of unanchored MAICs require clear justification given the reliance on non-randomized comparisons.
Introduction
Although treatments for newly diagnosed multiple myeloma (MM) patients are well established, treatment options for patients progressing beyond first-line therapy were limited before 2004, particularly for relapsed or refractory MM (RRMM), a stage of MM defined by disease that is nonresponsive while on salvage therapy or progression within 60 days of last therapy in patients who have achieved minimal response or better at some point previously.1,2 Between 2004 and 2018, however, more than eight treatments with multiple combination options became available for use in RRMM patients. Guideline recommendations emphasize that treatment goals for patients with MM vary depending on disease stage and treatment experience, and treatment choice may depend on age, performance status, comorbidities, number of prior treatment lines, remaining treatment options, interval since last therapy, and type of relapse.3–5 Nevertheless, recommendations regarding newer treatments are broad and do not specify which of the newer treatments is optimal for a typical second-line (2L), third-line (3L), or fourth- or subsequent-line (4L+) patient. Given the multitude of interventions for RRMM, it is important to evaluate comparative efficacy and safety to determine best practices, especially where no head-to-head clinical trials exist.
The National Institute for Health and Care Excellence (NICE) outlines two main approaches for estimating relative treatment effects between competing interventions depending on the context and evidence base.6,7 Network meta-analyses (NMAs) are based on randomized controlled trials (RCTs) where each study directly compares a subset of, but not necessarily all, treatments of interest. Another approach is matching-adjusted indirect comparison (MAIC), where the outcomes with an index intervention of interest evaluated in one or more RCT(s) or single-arm trial(s) for which individual patient-level data (IPD) is available are compared with the outcomes for a competing intervention in another trial with study-level data. MAICs can be either anchored, where there is a shared comparator creating a connected network, or unanchored in the absence of a shared comparator (ie, disconnected network). In both cases, propensity score matching is used to estimate the relative effect between the two interventions of interest by adjusting for the differences in treatment effect modifiers (anchored and unanchored) or prognostic factors (unanchored) of the trial for which IPD is available to match the population for which study-level data is available.
There are challenges unique to RRMM that may limit the validity of treatment effect estimates based on NMAs or MAICs. For instance, some trials do not share common comparators, meaning that a standard NMA is not feasible and alternative methodologies, such as unanchored MAIC, may be required.8 There may also be substantial heterogeneity with regard to patient characteristics and study design. The aim of this study was to identify and critically appraise NMA and MAIC studies evaluating the comparative effectiveness of alternative interventions for the treatment of RRMM based on between-study comparisons.
Methods
A systematic literature review identified studies evaluating the comparative efficacy of pharmacologic interventions for the treatment of patients with RRMM in terms of overall survival (OS), progression-free survival (PFS), or response (see Appendix A in Supplemental Materials found at https://doi.org/10.1016/j.jval.2019.11.003 for full details).
NMAs were evaluated using the Comparative Effectiveness Research-Collaborative Initiative Instrument created in conjunction with the International Society for Pharmacoeconomics and Outcomes Research (ISPOR) to assess their credibility and relevance (referred to as the ISPOR checklist).9 Questions regarding credibility were grouped together in terms of the evidence base, network of evidence (ie, whether included studies formed a connected network), treatment-effect modifiers (ie, whether there were differences in treatment effect modifiers across treatment comparisons), analysis (ie, whether the statistical methods were used to preserve within-study randomization), and reporting (ie, whether all pair-wise contrasts were reported along with measures of uncertainty). For published MAICs, the ISPOR checklist questions were used where possible, which were complemented by recommendations from the NICE Decision Support Unit (DSU) Technical Support Document 18 to determine whether best practices were followed in terms of anchoring (appropriate methods based on whether common comparators exist), selection of covariates (ensuring all prognostic and effect modifiers were adjusted for), and reporting (covariate choice and reasoning).7,9
The process involved consultation from both methods and clinical experts identified through a request for participation based on experience working in specialized centers or collaborating on major clinical trials in Europe and the United States. Conclusions regarding relevance and credibility were based on the consensus of two independent investigators and were then validated by clinicians. During a series of teleconferences, clinicians were presented with the NMA and MAIC research questions and asked about their relevance. Additionally, experts evaluated the credibility of each analysis based on the networks of evidence and key assumptions.
Results
Indirect Comparisons and Network Meta-Analyses
Twelve indirect comparison studies were identified comprising six publications,8,10–14 five health technology assessment (HTA) reports,15–19 and two conference posters20,21 (see flow chart in Appendix A in Supplemental Materials found at https://doi.org/10.1016/j.jval.2019.11.003), which included eight NMAs10–13,15,17,18,20 and five MAICs.8,14–16,19 Details of the included studies are presented in Appendix B (in Supplemental Materials found at https://doi.org/10.1016/j.jval.2019.11.003). Themes identified during the critical appraisal are described in terms of the evidence base, relevance, and credibility of each analysis (see details of each appraisal in Appendix C in Supplemental Materials found at https://doi.org/10.1016/j.jval.2019.11.003).
Evidence Base
Figure 1 illustrates all 37 studies included across the different NMAs and MAICs, comprising 32 RCTs,22–52 two single-arm studies (PANORAMA-253 and STRATUS54), two observational studies (Gooding et al55 and Tarant et al56) and one non-randomized trial (GEN50157).
Figure 1.

Clinical trials included in the studies and the reported distribution of prior treatments: two disconnected networks.
BEND indicates bendamustine; BEV, bevacizumab; BOR, bortezomib; CARF, carfilzomib; CYCLO, cyclosporine; DARA, daratumumab; DEX, dexamethasone; DOX, doxorubicin; ELO, elotuzumab; IXA, ixazomib; LEN, lenalidomide; OBLI, oblimersen; PANO, panobinostat; POM, pomalidomide; SIL, siltuximab; sub, subcutaneous; THAL, thalidomide; TX, treatment; VORI, vorinostat.
Figure 2 highlights select NMAs that used alternative methods to compare treatments across the two disconnected networks, which were centered around bortezomib in combination with dexamethasone (BOR + DEX), or lenalidomide in combination with dexamethasone (LEN + DEX).10,12,15,18
Figure 2.


Network diagrams from selected studies. (A) Ollendorf et al18 (ICER), RRMM ≥2L: ITC using observational evidence (dotted line) and subgroup analysis for 2L (square); (B) Durand et al,15 RRMM ≥2L: ITC using observational evidence (black dotted line) and MAIC (red dotted line); (C) Botta et al,10 RRMM ≥2L: ITC with treatment classes pooled according to grey bars; (D) Maiese et al,12 RRMM 2L: two ITCs for each separate network.
2L indicates second-line; BOR, bortezomib; CARF, carfilzomib; CYCLO, cyclosporine; DARA, daratumumab; DEX, dexamethasone; DOX, doxorubicin; ELO, elotuzumab; HDAC, histone deacetylase inhibitors; HR, hazard ratio; HTA, health technology assessment; ICER, Institute for Clinical and Economic Review; IMID, immunomodulatory agent; ITC, indirect treatment comparison; IXA, ixazomib; LEN, lenalidomide, mAB, monoclonal antibody; MAIC, matching-adjusted indirect comparison; OBS, observational; PANO, panobinostat; PFS, progression-free survival; PI, proteasome inhibitor; POM, pomalidomide; RRMM, relapsed and/or refractory multiple myeloma; sub, subcutaneous; THAL, thalidomide.
The distribution of prior treatments reported in the trials is illustrated, which was considered to be one of the most important treatment effect modifiers given the expected magnitude of the effect modification and the imbalance across the trials. Other important treatment effect modifiers and prognostic factors are summarized in Box 1. Remaining evidence networks are illustrated in Appendix D in Supplemental Materials (found at https://doi.org/10.1016/j.jval.2019.11.003), which were similar or evaluated a subset of those interventions assessed in Figure 2 10,11,13,17,20
BOX 1. Key potential treatment effect modifiers as identified by clinical experts.
Patient and outcome characteristics that may act as treatment effect modifiers and/or prognostic factors:
Number of prior treatment lines
Refractory to proteasome inhibitor or immunomodulatory agent
European Cooperative Oncology Group score
International Staging System stage
Cytogenetic risk
Extra-medullary disease
Progression-free survival versus time-to-progression
Study characteristics that may act as treatment effect modifiers:
Cross-over within randomized controlled trials (yes or no)
Open-label versus blinded
Relevance
Experts from both Europe and the United States suggested that the comparisons between interventions in the NMAs and MAICs were often not relevant for current decision making and several were considered irrelevant even at the time of the analysis. Generally, the relevance related to the purpose of the analysis; therefore we first discuss the relevance of the HTA evaluations followed by the non-HTA publications.
The NMA by Ollendorf et al18 was performed by the Institute for Clinical and Economic Review (ICER) in the United States. This was the most recent HTA-related publication comparing carfilzomib (CARF), elotuzumab (ELO), ixazomib (IXA), and daratumumab (DARA), which evaluated subgroups based on treatment experience where possible (2L versus 3L to 4L).19 The most recent NICE submission identified (Edwards et al16) evaluated the use of DARA versus panobinostat (PANO) + BOR + DEX and pomalidomide (POM) + DEX using an MAIC among patients previously treated with a proteasome inhibitor as well as an immunomodulatory agent.14 Nevertheless, clinicians indicated that DARA monotherapy would be rare in clinical practice. Other HTA MAICs compared PANO + BOR + DEX with LEN + DEX (Durand et al,15 2L+) and POM + DEX versus both PANO + BOR + DEX and bendamustine (Wood et al,19 3L+ for patients with LEN and BOR experience), both of which were critiqued for not being relevant to clinical practice. Beyond the interventions compared, NICE evidence review groups often critiqued the population evaluated, which did not account for relevant treatment effect modifiers in the target population, such as age and performance status (Durand et al),15 in addition to previous treatment experience (Hoyle et al).17 Therefore none of the HTA NMAs or MAICs assessed a comparison or population that was considered to be relevant for current clinical practice.
The most comprehensive NMAs in terms of the interventions were published after 2016 (Botta et al,10 van Beurden-Tan et al,13 and Maiese et al12), which were considered to include more relevant treatments for current clinical practice. However, studies by Botta et al10 and van Beurden-Tan et al13 appeared to be driven by an interest in combining all available trials rather than defining a relevant question in terms of prior treatment experience. In addition, treatment classes (Botta et al10) or specific interventions (van Beurden-Tan et al13) were pooled, further limiting the relevance of the comparisons, which did not provide insight regarding the specific treatment comparisons of interest. Maiese et al12 was the exception, which specified a more focused research question dependent on the backbone therapy and line of treatment; this approach precluded the need to connect the disconnected network. Clinicians agreed that treatments with a BOR backbone were more relevant to Europe, whereas treatments with a LEN+DEX backbone were more relevant for the United States. Consequently, there was no need to connect the two networks as comparisons between these regimens likely had no clinical relevance.
Credibility
Most studies identified RCTs through a systematic literature review. There is a risk that the two NMAs and two MAICs without a systematic review may have missed relevant trials, but this did not appear to be the case.
NMAs approached the two disconnected networks using alternative methods. Van Beurden-Tan et al13 pooled THAL with THAL + DEX, as well as BOR with BOR + DEX to connect the two networks, making the assumption that the DEX backbone in each of the treatment regimens had no effect on outcomes (see Appendix C in Supplemental Materials found at https://doi.org/10.1016/j.jval.2019.11.003). Clinicians did not agree with this assumption; therefore there is a risk that comparisons across the disconnected network may be biased. Ollendorf et al18 included an observational study to connect the network, which compared BOR + DEX versus BOR based on a matched-comparison analysis.58 PFS results (hazard ratio [HR] 0.56; 95% confidence interval [CI] 0.35–1.00) suggest it may not be appropriate to pool these treatments, whereas OS results were more uncertain (HR 0.96; 95% CI 0.54–1.70). Ignoring these differences may have biased results in favor of interventions compared with BOR + DEX. Clinicians preferred methods that incorporated observational evidence over pooling the treatments; however, as mentioned, comparisons across the disconnected network may not be relevant, which is why the credibility of Maiese et al12 was not affected by these assumptions.
Ollendorf et al18 and Maiese et al12 were the only NMAs considered relevant; they attempted to account for differences in prior treatment experience using subgroup analyses. Nevertheless, because the small number of RCTs per treatment comparison precluded a random effects models or meta-regression, the limitations to explore these differences in potential treatment effect modifiers (see Box 1) in the absence of IPD affected the credibility of the results. Also, differences in terms of backbone or active treatment were not acknowledged. For example, BOR was fixed in some trials (eight cycles in APEX45 and CASTOR43) and “treat until disease progression” in others (ENDEAVOR27), which may have biased results against regimens in combination with fixed duration BOR + DEX.23 Similarly, CARF was the only treatment that was limited to 18 cycles, whereas other treatments were administered until progression. Furthermore, OS results may have been biased due to crossover in some trials. Ollendorf et al18 predicted OS results based on the relationship between PFS and OS59; however, a model that considered PFS and OS simultaneously or integrated external evidence may have provided a more comprehensive approach.
All MAICs were unanchored, did not provide full details regarding the methodology, and compared different treatments. Appendix E in Supplemental Materials (https://doi.org/10.1016/j.jval.2019.11.003) outlines the covariates included across the MAICs, which differed across the studies. Edwards et al16 and van Sanden et al14 had the most comprehensive models and aligned most closely with recommendations to include all treatment effect modifiers and prognostic factors.7 Generally, clinicians considered the covariates relevant and comprehensive; however, cytogenetic risk was an important factor not considered, likely owing to inconsistent reporting across the trials. Importantly, two of the five MAICs did not justify their choice of covariates, whereas the remaining three described a consultation with clinical experts. Generally, studies were limited by the absence of a clear process for covariate selection founded on evidence from the literature. Ultimately, although many covariates were included in the model, it was difficult to determine whether all relevant prognostic factors and treatment effect modifiers were included, highlighting the key assumption that undermines unanchored MAICs. Therefore there is a risk that MAIC estimates were still biased given that unreported or unobserved covariates may have affected the outcomes.
All NMAs and MAICs combined PFS, assuming a constant HR over time, which may not have been appropriate given that Kaplan-Meier curves of some of the included studies diverge, as illustrated in TOURMALINE-MM140 after 10 months. Thus combining an average effect over time may not be the appropriate way to synthesize the trial results. None of the studies other than Ollendorf et al18 evaluated the proportional hazards assumption, despite the availability of more flexible synthesis methods.60,61
Durand et al15 was the only study that conducted both an NMA and MAIC. Nevertheless, differences between estimates could not be attributed to the different methods because the NMA included 2L to 4L patients, whereas the MAIC was restricted to 3L to 4L patients (PFS HRs for PANO + BOR + DEX comparing LEN + DEX for NMA was 1.87 versus 1.06 for the MAIC).
Discussion
Our aim was to critically appraise the relevance and credibility of published NMAs and unanchored MAICs in RRMM as identified through a systematic literature review. The critical appraisal of NMAs was based on an established ISPOR checklist. Because this checklist was not designed for MAICs, we also included questions from the NICE Decision Support Unit Technical Support Document 18.7 Although these provided useful tools, we felt it was important to conduct a complete feasibility assessment involving consultation with clinicians.
A majority of the NMAs and unanchored MAICs published between 2015 and 2018 were driven by the NICE HTA process in the UK and are summarized over time to illustrate the evolution of key trials and indirect comparisons (Fig. 3). Historically, the standard of care for RRMM in Europe was BOR. This was challenged in a 2008 NICE submission, based on an indirect comparison that resulted in a positive NICE recommendation of LEN + DEX for a very specific population (3L+) based on a patient access scheme.17 Between 2008 and 2015, phase III trials were published evaluating POM + DEX (vs DEX in the 3L+; MM-00339) and PANO + BOR + DEX (vs BOR + DEX in 2L to 4L; PANORAMA-146). Nevertheless, it was not until 2015 and 2017 that each of these treatments were evaluated in separate single-technology appraisals, respectively. Between these trial publications and their respective submissions to NICE, phase III trials were published for CARF + LEN + DEX,49 ELO + LEN + DEX,37 IXA + LEN + DEX,40 DARA + BOR + DEX,43 and DARA + LEN + DEX.28 Although the consultation with clinicians did not represent the UK perspective, NICE reports suggest there was a disconnect between UK HTA evaluations and clinical practice in other jurisdictions even at the time of the assessments because clinicians were interested in comparing CARF, ELO, IXA, and DARA in the 2L+ setting, which had yet to be compared in the HTA setting. The evolution of several single-technology appraisals over a period where the treatment landscape evolved rapidly was exacerbated by the fact that trials were relatively small and evaluated unique populations in terms of treatment history. RRMM highlights the need for faster comprehensive multiple technology appraisals spanning the alternative populations to better identify the place of different therapies in the treatment pathway.
Figure 3.

Timeline of key clinical trials and network meta-analyses or matching-adjusted indirect comparisons.
BOR indicates bortezomib, CARF, carfilzomib; DARA, daratumumab; DEX, dexamethasone; ELO, elotuzumab; HTA, health technology assessment; ICER, Institute for Clinical and Economic Review; ITC, indirect treatment comparison; IXA, ixazomib; LEN, lenalidomide; MAIC, matching-adjusted indirect comparison; NICE, National Institute for Health and Care Excellence; NMA, network meta-analysis; PANO, panobinostat; POM, pomalidomide.
Published NMAs were not considered to be relevant for today’s clinical decision making because they introduced strong assumptions regarding the pooling of interventions and represented a “catch-all” for the available trials rather than defining a relevant research question specific to interventions and comparators that could be used in clinical practice for a specific line of therapy. The NMA by Maiese et al12 was considered the most relevant because it evaluated networks separately based on prior treatment experience and also targeted the research question toward populations with specific treatment experience. Dimopoulos et al62 published an NMA in 2018 after the systematic literature review was completed with a similar research question and approach. Although these were the most relevant studies and yielded consistent results, it is unclear whether these results can be considered credible. Given the rapid evolution of RRMM landscape, additional synthesis studies may have been performed that were not captured in our study following the systematic literature review.
Even in cases where the research question was specific and it was not necessary to connect the two disconnected networks through strong pooling assumptions or observational evidence, the credibility of the estimates resulting from these between-study comparisons was limited by the available data. Although the RCTs were generally high quality, there was a substantial risk of bias given differences in potential treatment effect modifiers, including treatment experience, crossover, duration of therapy, and cytogenetic risk. Most NMAs did not attempt to account for differences in treatment experience; where it was assessed, it was only based on subgroup data comparing previous lines of therapy (2L versus 3L+), which did not provide sufficient information regarding the type of prior therapy. None of the studies accounted for differences in duration of treatments, which ranged from fixed treatment duration (APEX,45 CASTOR,43 PANORAMA-146) to treatment until progression (ENDEAVOR27) and may have acted as a driver of outcomes, potentially reducing BOR+DEX efficacy. Moreover, differences in cytogenetic risk were rarely reported or identified as a potential treatment effect modifier. It will be important for future clinical trials to report more clearly the biology of the disease to account for heterogeneity appropriately.
Beyond limitations due to the available study-level data, none of the studies followed a systematic or transparent feasibility assessment process for the NMA.63 Although it was not feasible for synthesis studies to account for the differences given the limited number of trials overall and per treatment comparison, this was rarely acknowledged clearly in the findings or discussion.64,65 Despite substantial between-study differences in characteristics considered to act as effect modifiers (such as treatment experience), random effects models were not feasible in most cases. Nevertheless, authors failed to highlight that NMA results should be interpreted with caution given that estimates did not account for between-study heterogeneity. Although a “best evidence” framework favors NMAs over simple naive comparisons, using this approach means it is important to clarify the underlying assumptions and limitations of the analysis.63 In general, although methods for NMA models have become more standardized and it is possible to reproduce results based on study-level data, there is a clear need for more transparent assessment of effect modifiers and reporting, as per ISPOR recommendations.
Although the unanchored MAICs tended to be more targeted in terms of their research questions, the relevance to clinical practice was criticized because they made untestable assumptions. The validity of unanchored MAICs is largely dependent on how well the analysis accounts for differences in prognostic factors and treatment effect modifiers for both observed and unobserved variables between the trials indirectly compared. When conducting an NMA, randomization is preserved and therefore study-level differences in prognostic factors do not bias results; however, in an unanchored MAIC, there is uncertainty regarding the distribution of study-level differences (measured and unmeasured) that may influence the outcome of interest due to isolation of treatment arms in the analysis.6 In cases where the manufacturer has access to IPD from their trial(s), this additional information may allow for a more selective population matched to external studies in terms of between-study differences in patient characteristics, such as treatment experience. Nevertheless, these methods cannot be considered as strong as a standard NMA of RCTs given remaining uncertainty regarding other prognostic factors or treatment effect modifiers. Moreover, it is often the case that the external studies to which the populations are matched differ from the target populations of interest, which may limit the relevance for decision makers. Finally, the MAICs were generally not described in sufficient detail to allow for replication, which is also not feasible in the absence of IPD from the manufacturer. More detailed methods for the identification and selection of the prognostic factors and treatment effect modifiers, in addition to the model specification and selection process, are required to improve the transparency in combination with broader access to IPD.66
An alternative approach to an unanchored MAIC would be to perform an outcomes-regression based indirect comparison or simulated treatment comparison (STC), based on a regression model for the index intervention for the outcome of interest as a function of relevant patient-related factors using the IPD of the index trial (ie, typically manufacturer trial[s]). This model could then be refit using the same IPD by centering the covariates at the mean values of the competing disconnected studies (for which only aggregate study-level data is available) to obtain the population-adjusted estimates with the index intervention for a target population as well as each of the competitor trials (ie, single-arm or disconnected RCTs). STCs may provide less biased estimates than an unanchored MAIC when the model is mis-specified (either outcome regression or propensity score model), although precision may be overestimated.67 STCs may also be less prone to dramatic changes to Kaplan-Meier curves based on upweighting a very small number of individuals. Finally, STCs may also align more broadly with a multiple treatment framework, although this is an ongoing area of research.
Because a connected network may have been feasible by integrating evidence from an observational analysis, an anchored MAIC or a meta-regression combining study and patient-level data may provide more credible treatment estimates.68,69 In general, access to IPD across treatment comparisons is necessary to truly evaluate the credibility of current estimates based on between-study comparisons, which would allow for a meta-regression to adjust for differences in treatment-effect modifiers and prognostic factors, as well as an assessment of the specified target population of interest in either a connected or disconnected network.70–73
One assumption used in both the NMAs and MAICs regarding survival outcomes was that of a constant HR over time, which was not supported by the evidence informing these analyses. Time-varying HRs may have better represented the Kaplan-Meier data, and incorporation of a meta-regression accounting for previous lines of therapy may have been feasible had IPD been available.60,61,69 When cost-effectiveness analyses integrate the synthesis estimates, outcomes are typically extrapolated over a lifetime horizon. This is particularly important in the context of RRMM, where many of the trials report immature OS. Small differences in the short term may have a meaningful impact when extrapolated over a lifetime horizon. Therefore HTA analyses informed by indirect comparisons assuming proportional hazard may be biased. Another limitation of the identified studies was that safety outcomes and observational evidence were not evaluated.
For between-trial or indirect comparisons in RRMM to yield clinically meaningful results, it is necessary to define a relevant research question with respect to all treatment effect modifiers, particularly in terms of the number of prior therapies and relevant comparators, which are likely to affect the type of prior therapy. In general, although NMAs of RCTs can provide credible comparative estimates, research questions are often defined too broadly. Although this may not be unique to the field of RRMM, the problem is accentuated in this disease area owing to the small number of trials published for each of the many treatment comparisons (given how rapidly the field has evolved). For more targeted research questions, it may be appropriate to consider anchored MAICs to achieve more balanced comparisons and to integrate single-arm studies earlier in the process. Nevertheless, this approach only makes sense if the external population to which the estimates are matched aligns with the target population of interest for decision makers.
Although much of the existing literature was driven by a UK HTA perspective based on single technology appraisals, timely multiple technology appraisals are needed for a rapidly expanding disease area. The use of cumulative or “living” NMAs (or cumulative evidence synthesis projects), whereby comparative analysis methods are agreed upon and continually updated for an entire disease area, may reduce the lag-time between new evidence being available and synthesis results.74 This is in contrast to current standard practice whereby multiple evidence synthesis projects are ongoing simultaneously using different methods by different project teams.75 All of these improvements will require a collaborative approach between clinicians and methodologists to ensure future evidence synthesis studies provide utility to clinicians, payers, and patients.
Supplementary Material
Acknowledgments
Source of financial support: This manuscript was funded by Amgen Inc, USA.
Conflict of interest: (Mateos): Honoraria from lectures: Janssen, Celgene, Amgen, Takeda; Scientific advisory board: Janssen, Celgene, Amgen, Takeda, Abbvie, GSK, Pharmamar. (Fonseca): Consulting: Amgen, BMS, Celgene, Takeda, Bayer, Janssen, Novartis, Pharmacyclics, Sanofi, Merck, Juno, Kite, Aduro, AbbVie, Karyopharm. Scientific Advisory Board Adaptive Biotechnologies. (Weisel): Honoraria: Amgen, BMS, Celgene, Janssen, Takeda. Advisory Board: Amgen, BMS, Celgene, Janssen, Juno, Novartis, Sanofi. Research funding: Amgen, Celgene, Sanofi, Janssen. (Landgren): Received research funding from: National Institutes of Health (NIH), US Food and Drug Administration (FDA), Multiple Myeloma Research Foundation (MMRF), International Myeloma Foundation (IMF), Leukemia and Lymphoma Society (LLS), Perelman Family Foundation, Rising Tides Foundation, Amgen, Celgene, Janssen, Takeda, Glenmark, Seattle Genetics, Karyopharm; Honoraria/ad boards: Adaptive, Amgen, Binding Site, BMS, Celgene, Cellectis, Glenmark, Janssen, Juno, Pfizer; and serves on Independent Data Monitoring Committees (IDMCs) for clinical trials lead by Takeda, Novartis, Janssen.
Footnotes
Supplemental Material
Supplementary data associated with this article can be found in the online version at https://doi.org/10.1016/j.jval.2019.11.003.
REFERENCES
- 1.Anderson KC, Kyle RA, Rajkumar SV, et al. Clinically relevant end points and new drug approvals for myeloma. Leukemia. 2008;22(2):231–239. [DOI] [PubMed] [Google Scholar]
- 2.Rajkumar SV, Harousseau JL, Durie B, et al. Consensus recommendations for the uniform reporting of clinical trials: report of the International Myeloma Workshop Consensus Panel 1. Blood. 2011;117(18):4691–4695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Moreau P, San Miguel J, Sonneveld P, et al. Multiple myeloma: ESMO Clinical Practice Guidelines for diagnosis, treatment and follow-up. Ann Oncol. 2017;28(suppl 4):iv52–iv61. [DOI] [PubMed] [Google Scholar]
- 4.Coit DG, Thompson JA, Albertini MR, et al. Cutaneous Melanoma, Version 2. 2019, NCCN Clinical Practice Guidelines in Oncology. J Natl Compr Canc Netw. 2019;17(4):367–402. [DOI] [PubMed] [Google Scholar]
- 5.Laubach J, Garderet L, Mahindra A, et al. Management of relapsed multiple myeloma: recommendations of the International Myeloma Working Group. Leukemia. 2016;30(5):1005–1017. [DOI] [PubMed] [Google Scholar]
- 6.Dias S, Sutton AJ, Ades AE, Welton NJ. Evidence synthesis for decision making 2: a generalized linear modeling framework for pairwise and network meta-analysis of randomized controlled trials. Med Decis Making. 2013;33(5):607–617. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Phillippo DM, Ades AE, Dias S, et al. NICE DSU Technical Support Document 18: Methods for population-adjusted indirect comparisons in submission to NICE. National Institute of Health and Care Excellence; 2016. [Google Scholar]
- 8.Majer I, van de Wetering G, Polanyi Z, Krishna A, Gray E, Roy A. Panobinostat plus bortezomib versus lenalidomide in patients with relapsed and/or refractory multiple myeloma: a matching-adjusted indirect treatment comparison of survival outcomes using patient-level data. Appl Health Econ Health Policy. 2017;15(1):45–55. [DOI] [PubMed] [Google Scholar]
- 9.Jansen JP, Trikalinos T, Cappelleri JC, et al. Indirect treatment comparison/ network meta-analysis study questionnaire to assess relevance and credibility to inform health care decision making: an ISPOR-AMCP-NPC Good Practice Task Force report. Value Health. 2014;17(2):157–173. [DOI] [PubMed] [Google Scholar]
- 10.Botta C, Ciliberto D, Rossi M, et al. Network meta-analysis of randomized trials in multiple myeloma: efficacy and safety in relapsed/refractory patients. Blood Adv. 2017;1(7):455–466. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Dranitsaris G, Kaura S. Lenalidomide versus bortezomib: an indirect comparison. Int J Hematol Oncol. 2014;3(2):131–136. [Google Scholar]
- 12.Maiese EM, Ainsworth C, Le Moine JG, Ahdesmaki O, Bell J, Hawe E. Comparative efficacy of treatments for previously treated multiple myeloma: a systematic literature review and network meta-analysis. Clin Ther. 2018;40(3):480–494.e423. [DOI] [PubMed] [Google Scholar]
- 13.van Beurden-Tan CHY, Franken MG, Blommestein HM, Uyl-de Groot CA, Sonneveld P. Systematic literature review and network meta-analysis of treatment outcomes in relapsed and/or refractory multiple myeloma. J Clin Oncol. 2017;35(12):1312–1319. [DOI] [PubMed] [Google Scholar]
- 14.van Sanden S, Ito T, Vogel M, Diels J. Comparative efficacy of daratumumab monotherapy and pomalidomide plus low dose dexamethasone (POMA+LoDex) in the treatment of multiple myeloma: a matching adjusted indirect comparison (MAIC). Abstract presented at International Society for Pharmacoeconomics and Outcomes Research Annual European Congress Vienna, Austria; 2016. [Google Scholar]
- 15.Durand A, Rtveladze K, Pritchard C, Cooper C, Mujica-Mota R. Panobinostat (Farydak®) for treating multiple myeloma in people who have received at least one prior therapy. Single Technology Appraisal NIHR HTA Programme. Optimity Advisors and Peninsula Technology Assessment Group; 2015. [Google Scholar]
- 16.Edwards S, Barton S, Karner C, et al. Daratumumab for treating relapsed and refractory multiple myeloma. National Institue for Health and Care Excellence; 2018. [Google Scholar]
- 17.Hoyle M, Rogers G, Gardside R, Moxham T, Stein K. The clinical- and cost-effectiveness of lenalidomide for multiple myeloma in people who have received at least one prior therapy: an evidence review of the submission from Celgene. Peninsula Technology Assessment Group; 2008. [Google Scholar]
- 18.Ollendorf DA, Chapman R, Khan S, et al. Treatment options for relapsed or refractory multiple myeloma: Effectiveness, value, and value-based price benchmarks. Institute for Clinical and Economic Review; 2016. [Google Scholar]
- 19.Wood S, Sidhu R, Yates S. Pomalidomide with dexamethasone for treating relapsed and refractory multiple myeloma after at least two regimens including lenalidomide and bortezomib; 2017. [Google Scholar]
- 20.Gala S, Mwamburi M. Indirect treatment comparison of newer treatments in relapsed/refractory multiple myeloma. Abstract presented at: International Society for Pharmacoeconomics and Outcomes Research Annual International Meeting Washington, DC; 2016. [Google Scholar]
- 21.Maguire A, Schmitz S, Kuhn I, et al. Updated results of a systematic review of the relative effectiveness of treatments in relapsed/refractory multiple myeloma. Paper presented at: European Hematology Association Congress Copenhagen, Denmark; 2016. [Google Scholar]
- 22.Arnulf B, Pylypenko H, Grosicki S, et al. Updated survival analysis of a randomized phase III study of subcutaneous versus intravenous bortezomib in patients with relapsed multiple myeloma. Haematologica. 2012;97(12):1925–1928. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Baz RC, Martin TG 3rd, Lin HY, et al. Randomized multicenter phase 2 study of pomalidomide, cyclophosphamide, and dexamethasone in relapsed refractory myeloma. Blood. 2016;127(21):2561–2568. [DOI] [PubMed] [Google Scholar]
- 24.Chanan-Khan AA, Niesvizky R, Hohl RJ, et al. Phase III randomised study of dexamethasone with or without oblimersen sodium for patients with advanced multiple myeloma. Leuk Lymphoma. 2009;50(4):559–565. [DOI] [PubMed] [Google Scholar]
- 25.Dimopoulos M, Siegel DS, Lonial S, et al. Vorinostat or placebo in combination with bortezomib in patients with multiple myeloma (VANTAGE 088): a multicentre, randomised, double-blind study. Lancet Oncol. 2013;14(11):1129–1140. [DOI] [PubMed] [Google Scholar]
- 26.Dimopoulos M, Spencer A, Attal M, et al. Lenalidomide plus dexamethasone for relapsed or refractory multiple myeloma. N Engl J Med. 2007;357(21):2123–2132. [DOI] [PubMed] [Google Scholar]
- 27.Dimopoulos MA, Moreau P, Palumbo A, et al. Carfilzomib and dexamethasone versus bortezomib and dexamethasone for patients with relapsed or refractory multiple myeloma (ENDEAVOR): a randomised, phase 3, open-label, multicentre study. Lancet Oncol. 2016;17(1):27–38. [DOI] [PubMed] [Google Scholar]
- 28.Dimopoulos MA, Oriol A, Nahi H, et al. Daratumumab, lenalidomide, and dexamethasone for multiple myeloma. N Engl J Med. 2016;375(14):1319–1331. [DOI] [PubMed] [Google Scholar]
- 29.Garderet L, Iacobelli S, Moreau P, et al. Superiority of the triple combination of bortezomib-thalidomide-dexamethasone over the dual combination of thalidomide-dexamethasone in patients with multiple myeloma progressing or relapsing after autologous transplantation: the MMVAR/IFM 2005–04 Randomized Phase III Trial from the Chronic Leukemia Working Party of the European Group for Blood and Marrow Transplantation. J Clin Oncol. 2012;30(20):2475–2482. [DOI] [PubMed] [Google Scholar]
- 30.Hazarika M, Rock E, Williams G, et al. Lenalidomide in combination with dexamethasone for the treatment of multiple myeloma after one prior therapy. Oncologist. 2008;13(10):1120–1127. [DOI] [PubMed] [Google Scholar]
- 31.Hjorth M, Hjertner O, Knudsen LM, et al. Thalidomide and dexamethasone vs. bortezomib and dexamethasone for melphalan refractory myeloma: a randomized study. Eur J Haematol. 2012;88(6):485–496. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Jagannath S, Barlogie B, Berenson J, et al. A phase 2 study of two doses of bortezomib in relapsed or refractory myeloma. Br J Haematol. 2004;127(2):165–172. [DOI] [PubMed] [Google Scholar]
- 33.Jakubowiak A, Offidani M, Pegourie B, et al. Randomized phase 2 study: elotuzumab plus bortezomib/dexamethasone vs bortezomib/dexamethasone for relapsed/refractory MM. Blood. 2016;127(23):2833–2840. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Kropff M, Baylon HG, Hillengass J, et al. Thalidomide versus dexamethasone for the treatment of relapsed and/or refractory multiple myeloma: results from OPTIMUM, a randomized trial. Haematologica. 2012;97(5):784–791. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Kropff M, Vogel M, Bisping G, et al. Bortezomib and low-dose dexamethasone with or without continuous low-dose oral cyclophosphamide for primary refractory or relapsed multiple myeloma: a randomized phase III study. Ann Hematol. 2017;96(11):1857–1866. [DOI] [PubMed] [Google Scholar]
- 36.Leleu X, Attal M, Arnulf B, et al. Pomalidomide plus low-dose dexamethasone is active and well tolerated in bortezomib and lenalidomide-refractory multiple myeloma: Intergroupe Francophone du Myelome 2009–02. Blood. 2013;121(11):1968–1975. [DOI] [PubMed] [Google Scholar]
- 37.Lonial S, Dimopoulos M, Palumbo A, et al. Elotuzumab therapy for relapsed or refractory multiple myeloma. N Engl J Med. 2015;373(7):621–631. [DOI] [PubMed] [Google Scholar]
- 38.Lonial S, Weiss BM, Usmani SZ, et al. Daratumumab monotherapy in patients with treatment-refractory multiple myeloma (SIRIUS): an open-label, randomised, phase 2 trial. Lancet. 2016;387(10027):1551–1560. [DOI] [PubMed] [Google Scholar]
- 39.Miguel JS, Weisel K, Moreau P, et al. Pomalidomide plus low-dose dexamethasone versus high-dose dexamethasone alone for patients with relapsed and refractory multiple myeloma (MM-003): a randomised, open-label, phase 3 trial. Lancet Oncol. 2013;14(11):1055–1066. [DOI] [PubMed] [Google Scholar]
- 40.Moreau P, Masszi T, Grzasko N, et al. Oral ixazomib, lenalidomide, and dexamethasone for multiple myeloma. N Engl J Med. 2016;374(17):1621–1634. [DOI] [PubMed] [Google Scholar]
- 41.Orlowski RZ, Gercheva L, Williams C, et al. A phase 2, randomized, doubleblind, placebo-controlled study of siltuximab (anti-IL-6 mAb) and bortezomib versus bortezomib alone in patients with relapsed or refractory multiple myeloma. Am J Hematol. 2015;90(1):42–49. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Orlowski RZ, Nagler A, Sonneveld P, et al. Randomized phase III study of pegylated liposomal doxorubicin plus bortezomib compared with bortezomib alone in relapsed or refractory multiple myeloma: combination therapy improves time to progression. J Clin Oncol. 2007;25(25):3892–3901. [DOI] [PubMed] [Google Scholar]
- 43.Palumbo A, Chanan-Khan A, Weisel K, et al. Daratumumab, bortezomib, and dexamethasone for multiple myeloma. N Engl J Med. 2016;375(8):754–766. [DOI] [PubMed] [Google Scholar]
- 44.Richardson PG, Siegel DS, Vij R, et al. Pomalidomide alone or in combination with low-dose dexamethasone in relapsed and refractory multiple myeloma: a randomized phase 2 study. Blood. 2014;123(12):1826–1832. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Richardson PG, Sonneveld P, Schuster MW, et al. Bortezomib or high-dose dexamethasone for relapsed multiple myeloma. N Engl J Med. 2005;352(24):2487–2498. [DOI] [PubMed] [Google Scholar]
- 46.San-Miguel JF, Hungria VT, Yoon SS, et al. Panobinostat plus bortezomib and dexamethasone versus placebo plus bortezomib and dexamethasone in patients with relapsed or relapsed and refractory multiple myeloma: a multicentre, randomised, double-blind phase 3 trial. Lancet Oncol. 2014;15(11):1195–1206. [DOI] [PubMed] [Google Scholar]
- 47.Schey S, Brown SR, Tillotson AL, et al. Bendamustine, thalidomide and dexamethasone combination therapy for relapsed/refractory myeloma patients: results of the MUKone randomized dose selection trial. Br J Haematol. 2015;170(3):336–348. [DOI] [PubMed] [Google Scholar]
- 48.Sehgal K, Das R, Zhang L, et al. Clinical and pharmacodynamic analysis of pomalidomide dosing strategies in myeloma: impact of immune activation and cereblon targets. Blood. 2015;125(26):4042–4051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Stewart AK, Rajkumar SV, Dimopoulos MA, et al. Carfilzomib, lenalidomide, and dexamethasone for relapsed multiple myeloma. N Engl J Med. 2015;372(2):142–152. [DOI] [PubMed] [Google Scholar]
- 50.Weber DM, Chen C, Niesvizky R, et al. Lenalidomide plus dexamethasone for relapsed multiple myeloma in North America. N Engl J Med. 2007;357(21):2133–2142. [DOI] [PubMed] [Google Scholar]
- 51.White D, Kassim A, Bhaskar B, et al. Results from AMBER, a randomized phase 2 study ofbevacizumab and bortezomib versus bortezomib in relapsed or refractory multiple myeloma. Cancer. 2013;119(2):339–34 . [DOI] [PubMed] [Google Scholar]
- 52.Yakoub-Agha I, Mary JY, Hulin C, et al. Low-dose vs. high-dose thalidomide for advanced multiple myeloma: a prospective trial from the Intergroupe Francophone du Myelome. Eur J Haematol. 2012;88(3):249–259. [DOI] [PubMed] [Google Scholar]
- 53.Richardson PG, Schlossman RL, Alsina M, et al. PANORAMA 2: panobinostat in combination with bortezomib and dexamethasone in patients with relapsed and bortezomib-refractory myeloma. Blood. 2013;122(14):2331–2337. [DOI] [PubMed] [Google Scholar]
- 54.Dimopoulos MA, Palumbo A, Corradini P, et al. Safety and efficacy of pomalidomide plus low-dose dexamethasone in STRATUS (MM-010): a phase 3b study in refractory multiple myeloma. Blood. 2016;128(4):497–503. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Gooding S, Lau IJ, Sheikh M, et al. Double relapsed and/or refractory multiple myeloma: clinical outcomes and real world healthcare costs. PLoS One. 2015;10(9):e0136207. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Tarant JL, Ashcroft J, Feyler S, Owen RG, Parrish C, Cook G. Treatment patterns & survival in multiple myeloma patients sequentially exposed to thalidomide, bortezomib & lenalidomide in a UK single centre. Blood. 2013;122(21). [Google Scholar]
- 57.Lokhorst HM, Plesner T, Laubach JP, et al. Targeting CD38 with daratumumab monotherapy in multiple myeloma. N Engl J Med. 2015;373(13):1207–1219. [DOI] [PubMed] [Google Scholar]
- 58.Dimopoulos MA, Orlowski RZ, Facon T, et al. Retrospective matched-pairs analysis of bortezomib plus dexamethasone versus bortezomib monotherapy in relapsed multiple myeloma. Haematologica. 2015;100(1):100–106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Felix J, Aragao F, Almeida JM, et al. Time-dependent endpoints as predictors of overall survival in multiple myeloma. BMC Cancer. 2013;13:122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Jansen JP. Network meta-analysis of survival data with fractional polynomials. BMC Med Res Methodol. 2011;11:61. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Ouwens MJ, Philips Z, Jansen JP. Network meta-analysis of parametric survival curves. Res Synth Methods. 2010;1(3–4):258–271. [DOI] [PubMed] [Google Scholar]
- 62.Dimopoulos MA, Kaufman JL, White D, et al. A comparison of the efficacy of immunomodulatory-containing regimens in relapsed/refractory multiple myeloma: a network meta-analysis. Clin Lymphoma Myeloma Leuk. 2018;18(3):163–173.e6. [DOI] [PubMed] [Google Scholar]
- 63.Cope S, Zhang JF, Saletan S, et al. A process for assessing the feasibility of a network meta-analysis: a case study of everolimus in combination with hormonal therapy versus chemotherapy for advanced breast cancer. BMC Med. 2014;12:93. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Chaimani A, Salanti G, Leucht S, et al. Common pitfalls and mistakes in the set-up, analysis and interpretation of results in network meta-analysis: what clinicians should look for in a published article. Evid Based MentHealth. 2017;20(3):88–94. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Bafeta A, Trinquart L, Seror R, Ravaud P. Reporting of results from network meta-analyses: methodological systematic review. BMJ. 2014;348:g174 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Holmes EM, Leahy J, Walsh CD, White A, Donnan PT, Lamrock F. Difficulties arising in reimbursement recommendations on new medicines due to inadequate reporting of population adjustment indirect comparison methods. Res Synth Methods. 2019;10(4):615–617. [DOI] [PubMed] [Google Scholar]
- 67.Kang JDY, Schafer JL. Demystifying double robustness: a comparison of alternative strategies for estimating a population mean from incomplete data. Stat Sci. 2007;22(4):523–539. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Saramago P, Chuang LH, Soares MO. Network meta-analysis of (individual patient) time to event data alongside (aggregate) count data. BMC Med Res Methodol. 2014;14:105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Jansen JP. Network meta-analysis of individual and aggregate level data. Res Synth Methods. 2012;3(2):177–190. [DOI] [PubMed] [Google Scholar]
- 70.Cope S, Capkun-Niggli G, Gale R, et al. Efficacy of once-daily indacaterol relative to alternative bronchodilators in COPD: a patient-level mixed treatment comparison. Value Health. 2012;15(3):524–533. [DOI] [PubMed] [Google Scholar]
- 71.Faria R, Hernandez Alava M, Manca A, Wailoo AJ. Decision Support Unit. NICE DSU Technical Support Document 17: The use of observational data to inform estimates of treatment effectivensss for technology appraisal: methods for comparative individual patient data. National Institute of Health and Care Excellence; 2015. [Google Scholar]
- 72.Cope S, Zhang JF, Williams J, Jansen JP. Efficacy of once-daily indacaterol 75 mg relative to alternative bronchodilators in COPD: a study level and a patient level network meta-analysis. BMC Pulm Med. 2012;12:29. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Saramago P, Woods B, Weatherly H, et al. Methods for network meta-analysis of continuous outcomes using individual patient data: a case study in acupuncture for chronic pain. BMC Med Res Methodol. 2016;16(1):131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Nikolakopoulou A, Mavridis D, Furukawa TA, et al. Living network meta-analysis compared with pairwise meta-analysis in comparative effectiveness research: empirical study. BMJ. 2018;360:k585. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Crequit P, Trinquart L, Ravaud P. Live cumulative network meta-analysis: protocol for second-line treatments in advanced non-small-cell lung cancer with wild-type or unknown status for epidermal growth factor receptor. BMJ Open. 2016;6(8):e011841. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
