Skip to main content
Journal of Managed Care & Specialty Pharmacy logoLink to Journal of Managed Care & Specialty Pharmacy
. 2018 Jan;24(1):10.18553/jmcp.2018.24.1.29. doi: 10.18553/jmcp.2018.24.1.29

Cost-effectiveness of Drugs to Treat Relapsed/Refractory Multiple Myeloma in the United States

Josh J Carlson 1,*, Gregory F Guzauskas 1, Richard H Chapman 2, Patricia G Synnott 2, Shanshan Liu 2, Elizabeth T Russo 2, Steven D Pearson 2, Elizabeth D Brouwer 1, Daniel A Ollendorf 2
PMCID: PMC10398323  PMID: 29290170

Abstract

This article has been corrected. See J Manag Care Spec Pharmacy. 2018;24(7):714.

BACKGROUND:

New 3-drug regimens have been developed and approved to treat multiple myeloma (MM). The absence of direct comparative data and the high cost of treatment support the need to assess the relative clinical and economic outcomes across all approved regimens.

OBJECTIVE:

To evaluate the cost-effectiveness of treatments for relapsed and/or refractory MM from a U.S. health system perspective.

METHODS:

We developed a partition survival model with 3 health states (progression-free, progression, and death) to evaluate the following regimens: carfilzomib (CFZ), elotuzumab (ELO), ixazomib (IX), daratumumab (DAR), and panobinostat (PAN) in combination with lenalidomide (LEN) or bortezomib (BOR) plus dexamethasone (DEX) in the second and/or third line of therapy. To estimate relative treatment effects, we developed a network meta-analysis and applied progression-free survival hazard ratios to baseline parametric progression-free survival functions derived from pooled data on LEN+DEX. We estimated overall survival using data on the relationship between progression-free survival and overall survival from a large meta-analysis of MM patients. Modeled costs included those related to drug treatment, administration, monitoring, adverse events, and progression. Utilities were from publicly available data and manufacturer data, if published sources were unavailable.

RESULTS:

Model results showed that regimens containing DAR yielded the highest expected life years (DAR range: 6.71-7.38 vs. non-DAR range: 3.25-5.27) and quality-adjusted life-years (QALY; DAR range: 4.38-5.44 vs. non-DAR range: 2.04-3.46), with DAR+BOR+DEX (second line) and PAN+BOR+DEX (third line) as the most cost-effective options (incremental cost-effectiveness ratio: $50,700 and cost saving, respectively). The applicability of the PAN+BOR+DEX result may be challenging, however, because of ongoing toxicity concerns. In the probabilistic sensitivity analysis, second-line DAR+BOR+DEX and third-line PAN+BOR+DEX had an 89% and 87% probability of being cost-effective at the $150,000 per QALY threshold, respectively.

CONCLUSIONS:

The introduction of newer drugs and regimens to treat second- and third-line relapsed/refractory MM appears to provide clinical benefits by lengthening progression-free and overall survival and improving quality of life. However, only the addition of DAR or PAN may be considered cost-effective options according to commonly cited thresholds, and PAN+BOR+DEX results require cautious interpretation. Achieving levels of value more closely aligned with patient benefit would require substantial discounts from the remaining agents evaluated.


What is already known about this subject

  • Multiple myeloma (MM) treatment has historically been anchored by 2 drugs, bortezomib and lenalidomide, each in combination with dexamethasone.

  • Over the past decade, 6 more drugs entered the market and have demonstrated improved outcomes compared with standard care.

What this study adds

  • This study models comparative clinical and economic outcomes of combination drug therapies for MM treatment.

  • There is considerable uncertainty about comparative long-term outcomes, such as overall survival and the comparative trade-offs between effectiveness, toxicity, and costs for these therapies and their various combinations.

  • This study provides a comprehensive analysis of the value of these drug combinations and can inform health care decision making for a variety of stakeholders.

Multiple myeloma (MM) is a hematological cancer that occurs when bone marrow produces malignant plasma cells that enter the bloodstream. MM is the second most common hematological malignancy, with 25,000 new cases diagnosed every year in the United States.1 The disease disproportionately affects older people, with the median age of onset at 66 years.1 Despite recent advances, prognosis remains relatively poor, with a 5-year survival rate of 48.5%.1 MM progression can be relatively slow in many individuals, often involving multiple rounds of remission after treatment followed by subsequent relapse. About 100,000 individuals are currently living with the disease in the United States.1

Over the past decade, treatment of MM in the United States has been anchored by 2 drugs: bortezomib (Velcade; BOR) and lenalidomide (Revlimid; LEN), often given in combination with dexamethasone (DEX). Other medications have more recently become available specifically for the treatment of relapsed or refractory disease, including pomalidomide (Pomalyst; POM), carfilzomib (Kyprolis; CFZ), ixazomib (Ninlaro; IX), daratumumab (Darzalex; DAR), elotuzumab (Empliciti; ELO), and panobinostat (Farydak; PAN).

These new agents have demonstrated improved outcomes compared with standard care approaches and are recommended as treatment options by national clinical guidelines.2 Collectively, they represent important clinical advances in a disease setting that has historically lacked a variety of treatment options. However, there remains considerable uncertainty regarding the comparative long-term outcomes (i.e., overall survival) and the comparative trade-offs between effectiveness and toxicity. In addition, there is considerable concern and uncertainty about the cost and value of these therapies, since the cost of a single course of drug therapy is estimated to range from $75,000 to $250,000 for U.S. patients with relapsed and/or refractory disease.3 These estimates may actually be conservative, given the increasing use of triple therapy and “treat to progression” labeling for the newest agents.3

The increasing cost of U.S. health care, and specifically for cancer, has generated renewed discussion about the value of medical technologies. The availability of effective treatment options for MM patients is of paramount importance. However, in an era of continuing increases in health care spending and drug prices, it is also important to understand the relationship between costs and outcomes achieved.

The objective of this study was to assess the comparative clinical and economic outcomes for drugs used to treat patients with relapsed and/or refractory MM in the second or third line of therapy from a U.S. health system perspective. This analysis can be used to inform health care decision making for a wide range of MM stakeholders.

Methods

Model Approach

The following pharmaceutical regimens were included in the analysis: LEN+DEX, BOR+DEX, CFZ+LEN+DEX, ELO+LEN+DEX, IX+LEN+DEX, PAN+BOR+DEX, DAR+LEN+DEX, and DAR+BOR+DEX (Table 1). We developed a 3-state partition survival model, which included a progression-free survival (PFS) state, progressed disease with subsequent treatments, and death. Patients in the PFS state could be either on or off treatment to account for patients who stop therapy but remain in the PFS state. We used a cycle length of 1 week to reflect the dosing schedules for included drug regimens. The recommended dosage schedules for the regimens of interest were based on indications for treatment of relapsed and/or refractory disease labeled by the U.S. Food and Drug Adminstration, as well as expert input regarding common treatment approaches for the populations of interest. We used a health sector perspective, a lifetime horizon, a 3% discount rate for costs and outcomes, and a half-cycle correction. The upper bound willingness to pay for cost-effectiveness was $150,000 per quality-adjusted life-years (QALY) gained, which approaches the upper limit of commonly cited thresholds.4

TABLE 1.

Key Model Parameters

Survival Hazard Ratios Base Case Lower Upper PSA Distribution Source
Second-line PFS hazard ratios vs. LEN-DEX
CFZ-LEN-DEX 0.69 0.53 0.91 LogNormal Network meta-analysis
ELO-LEN-DEX 0.70 0.56 1.00 LogNormal Network meta-analysis
IX-LEN-DEX 0.74 0.65 1.19 LogNormal Network meta-analysis
DAR-LEN-DEXa 0.37 0.27 0.52 LogNormal Network meta-analysis
DAR-BOR-DEXa 0.39 0.28 0.53 LogNormal Network meta-analysis
Third-line PFS hazard ratios vs. LEN-DEX
BOR-DEX 0.93 0.58 2.04 LogNormal Network meta-analysis
CFZ-LEN-DEX 0.69 0.54 0.87 LogNormal Network meta-analysis
ELO-LEN-DEX 0.70 0.49 0.87 LogNormal Network meta-analysis
IX-LEN-DEX 0.74 0.40 0.84 LogNormal Network meta-analysis
PAN-BOR-DEX 0.59 0.31 1.10 LogNormal Network meta-analysis
DAR-LEN-DEXa 0.37 0.27 0.52 LogNormal Network meta-analysis
DAR-BOR-DEXa 0.39 0.28 0.53 LogNormal Network meta-analysis
Hazard ratio for OS vs. PFS 0.41 0.31 0.58 LogNormal 24
Quality of Life Base Case Lower Upper PSA Distribution Source
Second-line health state utilities
Progression-free, on treatment 0.82 0.78 0.88 Beta Data on fileb
Progression-free, off treatment 0.84 0.82 0.97 Beta Data on fileb
Progressed disease 0.65 0.62 0.74 Beta Data on fileb
Third-line health state utilities
Progression-free, on treatment 0.65 0.52 0.78 Beta 25
Progression-free, off treatment 0.72 0.58 0.86 Beta 37
Progressed disease 0.61 0.49 0.73 Beta 25
Adverse event disutility 0.08 0.07 0.08 Beta 25
Costs Base Case Lower Upper PSA Distribution Source
Drug acquisition and administration costs,c $
Bortezomib 3.5 mg vial 1,503.00 1,202.40 1,803.60 Normal RED BOOK
Bortezomib administration 111.42 89.14 133.70 Normal CPT 96409
Carfilzomib 60 mg vial 1,971.50 1,577.20 2,365.80 Normal RED BOOK
Carfilzomib administration 209.24 167.39 251.09 Normal CPT 96360, 96361, 96413
Dexamethasone per mg 0.32 0.26 0.39 Normal RED BOOK
Elotuzumab 300 mg vial 1,776.00 1,420.80 2,131.20 Normal RED BOOK
Elotuzumab 400 mg vial 2,368.00 1,894.40 2,841.60 Normal RED BOOK
Elotuzumab administration 227.87 182.30 273.44 Normal CPT 96413, 96415, 96417
Ixazomib capsule 3,006.00 2,404.80 3,607.20 Normal RED BOOK
Lenalidomide capsule 552.98 442.38 663.58 Normal RED BOOK
Panobinostat capsule 1,222.22 977.78 1,466.67 Normal RED BOOK
Daratumumab 400 mg vial 1,850.40 1,480.32 2,220.48 Normal RED BOOK
Daratumumab 100 mg vial 462.60 370.08 555.12 Normal RED BOOK
Daratumumab administration 399.83 319.86 479.80 Normal CPT 96413, 96415, 96417
Health State Cost per Week Progression-Free AE Prophylaxis, $ Progressed Disease Treatment, $
LEN-DEX 22.00 427.45
CFZ-LEN-DEX 104.73 367.30
ELO-LEN-DEX 85.37 369.26
IX-LEN-DEX 85.26 377.05
PAN-BOR-DEX 39.16 337.41
DAR-LEN-DEX 49.45 273.54
DAR-BOR-DEX 90.71 283.94

aDAR hazard ratios were assessed for a general population and were assumed to be equivalent in second- and third-line settings.

bAmgen. Data provided in response to ICER data request. QOL/Utility Data from ASPIRE Cost Effectiveness Model. February 22, 2016.

cCosts assessed March 2016.

BOR = bortezomib; CFZ = carfilzomib; CPT = Current Procedural Terminology; DAR = daratumumab; DEX = dexamethasone; ELO = elotuzumab; IX = ixazomib; LEN = lenalidomide; OS = overall survival; PAN = panobinostat; PFS = progression-free survival; PSA = probabilistic sensitivity analysis.

Network Meta-analysis

A Bayesian network meta-analysis (NMA) was performed to combine evidence on direct and indirect comparisons across the treatment regimens of interest (Appendix A, available in online article).5 The evaluated trials specified similar inclusion criteria. Each trial included adult patients aged ≥18 years with measurable relapsed and/or refractory MM. All patients had previously received therapies and had adequate renal, hepatic, and hematologic function. The trial populations were also similar with respect to age, Eastern Cooperative Oncology Group (ECOG) performance status, International Staging System (ISS) stage, receipt of previous stem cell transplant, and number and distribution of previous regimens.

Quantitative analyses focused on PFS and were conducted using the NetMetaXL tool (http://www.netmetaxl.com/). Adjusted hazard ratios from the randomized trials were log-transformed and entered into the spreadsheet, and 95% confidence intervals (CIs) on log scale were used to specify variance estimates (i.e., standard errors).6-15 A total of 40,000 iterations each were used for “burn-in” (for model convergence) and model (for model results) simulations. Review of the deviance information criterion statistics, as well as comparison of the residual deviance with the number of unconstrained data points, was used to assess the best model fit under multiple alternative assumptions. Although a random effects approach was preferred, the available network was constructed of primarily single-study connections and necessitated a fixed-effects model to preserve statistically significant effects observed in trials.

We also conducted sensitivity analyses based on the shape and scale parameters of digitized parametric survival curves to test and address the potential violation of the proportional hazards assumption.16 We did this for the overall dataset and a subset of data from the carfilzomib, ixazomib, and elotuzumab trials to assess whether inclusion of more contemporary data for LEN+DEX had a material effect on results. In this instance, 30,000 iterations were used for burn-in and model simulations.

Survival Curve Estimation

We fit parametric survival curves to PFS Kaplan-Meier data for the baseline comparator (LEN+DEX) in the second- and third-line settings, using the approach described by Hoyle and Henley (2011).17 LEN+DEX was chosen as the baseline because clinical experts considered LEN+DEX to be the most commonly used comparator and because of the availability of LEN+DEX survival data by line of therapy. To do this, we extracted data points from digitized survival curves, then used the extracted values, the number of surviving patients at each time interval, and maximum likelihood functions to estimate the underlying individual patient data.7,8,18 We assumed that the rate of censoring was the same between the second- and third-line settings, which allowed us to estimate the number at risk at set time points for the second-and third-line curves from the pooled number at risk data.

Base case PFS curves for LEN+DEX were derived from parametric fits to pooled Kaplan-Meier data from the MM-009 and MM-010 trials of LEN+DEX.7,8 For the base case, we selected the Weibull parametric function from the candidate distributions based on (a) face validity (log-normal and log-logistic fits exhibited unrealistically elongated tails) and (b) Akaike Information Criterion, a graphical assessment of each parametric function and a knowledge of the expected extrapolation of the PFS times. We then used PFS hazard ratios acquired from our network meta-analysis results, applied to the baseline comparator curve (LEN+DEX), to derive PFS curves for the other interventions by line of treatment (Table 1).6,13,14,19-23 We assumed that the regimens’ NMA-derived treatment effects were consistent for the second- and third-line settings, since stratified analyses suggested similar effect sizes for most comparisons (data not shown).

The data on overall survival (OS) for these regimens were not uniformly available and were prone to bias because of crossover to the active comparator, as well as the availability of different drugs after progression in trials conducted at different points in time. Therefore, we applied a 2.45-month (95% CI = 1.7-3.2) increase in OS for each additional month of PFS for each regimen-specific OS curve, based on a systematic review of this relationship in studies of nearly 23,000 MM patients.24 This approach has been used previously in support of model submissions to health technology assessment agencies.25 We operationalized this estimate by deriving an OS to PFS hazard ratio (1/2.45 = 0.41), which we then applied to each regimen’s PFS curve to estimate the corresponding OS curve. We varied this parameter in a sensitivity analysis and ran a scenario analysis using a weighted average estimate of the relationship of PFS to OS from the available clinical trials in our assessment (3.27-month increase in OS for each additional month of median PFS).

Model Parameters: Health State Utilities

Health state utilities were derived from publicly available literature and/or data made available to us by manufacturers. We used consistent, sequence-specific health state utility values across treatments evaluated in the model. For the progression-free health state, different utilities were applied, depending on whether the patient was on or off treatment, to represent treatment-related decreased quality of life. We applied a regimen-weighted disutility for experiencing any grade 3/4 adverse event (AE). The total percentage of patients who experienced any grade 3/4 AEs for each regimen was multiplied by the AE disutility and then subtracted from the total QALYs gained during PFS for each regimen. We assumed that the total time that patients experienced any grade 3/4 AE was 1 month.

Model Parameters: Adverse Events

The model included grade 3/4 AEs derived from key clinical trials and/or each drug’s prescribing information. The model included any reported grade 3/4 AEs that occurred in at least 5% of patients for any of the treatment comparators.3,26 The frequency of each event was reported as the proportion of patients experiencing the event, as well as the total number of occurrences based on published sources (Appendix B, available in online article). This proportion was then multiplied by the average cost per event to derive a total AE cost for each regimen. Costs per AE were based on a previously published analysis, supplemented by data from the Centers for Medicare & Medicaid Services (CMS) list of Medicare Severity-Diagnosis Related Groups (MS-DRGs) for the fiscal year 2016.3,26

Model Parameters: Cost Inputs

All patients were assumed to initiate study medications at cycle 0 (beginning of the model). For all regimens, patients received planned doses based on individual patient characteristics. The estimation of drug use was derived from several factors, including the relative dose intensity reported in trials or directly provided by manufacturers and the dosing schedule where the dose may be fixed by weight or by body surface area (BSA), assuming average patient characteristics of 1.7 meters tall and 80 kilogram body mass (BSA = 1.92; formula: 0.007184*[mass ^0.425]*[height^0.725]; unpublished data. Amgen. Data provided in response to ICER data request. QOL/Utility Data from ASPIRE Cost Effectiveness Model. February 22, 2016).

The treatment utilization and costs of treat-to-progression regimens were applied to all patients who remained in the PFS health state over time. In finite-cycle regimens, patients could remain in the PFS state after active treatment concluded. No vial sharing was assumed in the base case. Drug unit costs were applied to the utilization estimates to calculate total estimated treatment costs.

We used the wholesale acquisition cost for each drug and noted each available formulation.25 Based on the regimen-specific dosage previously specified, the model used the lowest cost combination of tablets and/or vials for each regimen. Drug administration costs were determined from the Final 2016 Medicare Coding & Payment for Drug Administration Services under the Physician Fee Schedule.27 All drug and administration costs were varied by ±20% in sensitivity analyses.

Other Direct Costs.

We included per cycle AE prophylaxis and monitoring costs based on management guidelines and the literature.6 The cost of postprogression treatment was calculated using a treatment landscape analysis to estimate the proportion of patients who received different available treatments upon progression. The specific treatment distribution was derived from Farr et al. (2016).28 The model assumed that patients received 1 further line of treatment lasting 124 days followed by best supportive care.28 We then calculated a mean cost per week for each regimen by averaging the cost of additional treatment (weighted by the distribution) and supportive care over the mean time in the progression health state.

Model Outcomes

The model estimated the amount of time, on average, that patients spent progression-free and in progression. Unadjusted and utility-adjusted time spent in each health state were summed to provide estimates of life expectancy and quality-adjusted life expectancy. Model outcomes of interest for each intervention included cost, quality-adjusted life expectancy, life expectancy, and mean time in the progression-free and postprogression health states, as well as the relative cost-effectiveness for each intervention compared with the next best comparator. We also provided the results of pairwise comparisons by line versus a standard comparator, LEN-DEX.

Sensitivity Analyses

One-way sensitivity analyses used 95% CIs from clinical evidence when available and plausible values from the published literature when absent. We also conducted a probabilistic sensitivity analysis by jointly varying all model parameters over 4,000 simulations, then calculating 95% credible range estimates for each model outcome. The following distributions were applied to model variables: hazard ratios (log-normal distribution), utility values (beta distribution), proportions of AEs (beta distribution), costs of drugs (normal distribution), costs of AEs (normal distribution), monthly cost of progression (normal distribution), and administration costs (normal distribution).

We ran 4 scenario analyses: (1) using an unadjusted estimate of the relationship of median PFS to median OS based on a weighted average from the trials in our analysis that report both outcomes (3.27-month increase in OS for each additional month of median PFS); (2) using BOR+DEX as the baseline comparator; (3) adjusting the second- and third-line baseline curves to reflect more recent LEN+DEX regimen curves using the relationship between the ASPIRE trial LEN+DEX data and the MM-009/MM-010 pooled LEN+DEX data6,7; and (4) using different second-line utility estimates for triplet (0.83, 0.85, and 0.66 for PFS on treatment, PFS off treatment, and progression, respectively) versus doublet regimens (0.81, 0.83, 0.4, respectively) derived from the ASPIRE trial data (triplet regimens include 3 drugs [e.g., CFZ+LEN+DEX]; doublet regimens include 2 drugs [e.g., LEN+DEX].7,8,18

Results

Network Meta-analysis

Results of the NMA are included in Table 1. Trial populations were similar with respect to age, ECOG performance status, ISS stage, receipt of previouis stem cell transplant, and number and distribution of previous regimens. (Appendix A). Definitions of disease risk varied, but the percentage of patients with high-risk disease ranged from 13%-32% across studies reporting this element. All of the regimens had favorable hazard ratios compared with LEN+DEX; ELO+LEN+DEX and IX+LEN+DEX had the greatest uncertainty. Results for PAN+BOR+DEX in the third-line setting should be interpreted with great caution because of censoring issues and high rates of toxicity-related discontinuation in the overall and third-line subgroup populations of the PANORAMA-1 study.14 PAN+BOR+DEX is also only 1 of 2 regimens without direct comparative evidence versus LEN+DEX; therefore, greater reliance on the study network and its assumptions regarding minimal heterogeneity across study populations and constant hazards over time was required. While censoring was factored into our analytic approach, the relative treatment effect of PAN+BOR+DEX versus LEN+DEX had much greater uncertainty than the other comparisons.

Cost-effectiveness

The results for the aggregate discounted clinical and economic outcomes by line and regimen are provided in Tables 2 and 3. In the second line, total QALYs ranged from a low of 2.59 for LEN+DEX to a high of 5.44 for DAR+LEN+DEX. Total life years ranged from 3.53 for LEN+DEX to 7.38 for DAR+LEN+DEX. Total costs ranged from $189,357 for BOR+DEX to $845,527 for DAR+LEN+DEX. In the third line, results followed a similar pattern, with total QALYs ranging from a low of 2.04 for LEN+DEX to a high of 4.38 for DAR+LEN+DEX. Total life years ranged from 3.25 for LEN+DEX to 6.97 for DAR+LEN+DEX. Total costs ranged from $175,315 for BOR+DEX to $789,202 for DAR+LEN+DEX.

TABLE 2.

Comparative Outcomes

Regimen Second Line Third Line (All Comparators) Third Line (PAN-BOR-DEX Omitted)
Total Cost, $ QALYs ICER Total Cost, $ QALYs ICER Total Cost, $ QALYs ICER
LEN-DEX 309,997 2.59 Dominated 281,754 2.04 Dominated 281,754 2.04 Dominated
BOR-DEX 189,357 2.74 Dominant 175,315 2.16 Dominant 175,315 2.16 Dominant
IX-LEN-DEX 622,378 3.27 Dominated 566,512 2.60 Dominated 566,512 2.60 Dominated
ELO-LEN-DEX 665,728 3.41 Dominated 608,651 2.71 Dominated 608,651 2.71 Dominated
CFZ-LEN-DEX 492,872 3.45 Dominated 459,868 2.74 Dominated 459,868 2.74 Dominated
PAN-BOR-DEX 190,876 3.23 14,598
DAR-BOR-DEX 447,182 5.29 50,704 423,119 4.38 248,762 423,119 4.38 60,359
DAR-LEN-DEX 845,527 5.44 2,707,547 789,202 4.38 Equal outcomes, higher cost vs. DAR-BOR-DEX 789,202 4.38 Equal outcomes, higher cost vs. DAR-BOR-DEX

BOR = bortezomib; CFZ = carfilzomib; DAR = daratumumab; DEX = dexamethasone; ELO = elotuzumab; ICER = incremental cost-effectiveness ratio; IX = ixazomib; LEN = lenalidomide; OS = overall survival; PAN = panobinostat; PFS = progression-free survival; QALY = quality-adjusted life-year.

TABLE 3.

Results per Regimen

Second Line LEN-DEX BOR-DEX CFZ-LEN-DEX ELO-LEN-DEX IX- LEN-DEX DAR- LEN-DEX DAR- BOR-DEX
Total costs, $ 309,997 189,357 492,872 665,728 622,378 845,527 447,182
  Drug acquisition 264,898 133,774 432,799 596,124 571,390 762,407 368,096
  Supportive care 528 1,608 1,882 2,607 2,491 4,947 2,515
  Administration 8,226 8,377 14,698 23,981 22,960
  Progression 40,221 41,167 45,358 45,143 44,330 50,723 51,003
  Adverse event 4,351 4,583 4,457 7,156 4,166 3,469 2,607
Total QALYs 2.59 2.74 3.45 3.41 3.27 5.44 5.29
  PFS 1.41 1.50 1.91 1.89 1.81 3.13 3.05
  Progression 1.17 1.24 1.54 1.52 1.46 2.31 2.24
Total life-years 3.53 3.73 4.71 4.66 4.46 7.38 7.11
  PFS 1.73 1.83 2.34 2.31 2.21 3.82 3.67
  Progression 1.80 1.91 2.37 2.34 2.25 3.55 3.44
ICER vs. LEN-DEX -792,583 211,458 430,009 454,684 187,728 50,704
Third Line LEN-DEX BOR-DEX CFZ-LEN-DEX ELO-LEN-DEX IX- LEN-DEX PAN-BOR-DEX DAR-LEN-DEX DAR-BOR-DEX
Total costs, $ 281,754 175,315 459,868 608,651 566,512 190,876 789,202 423,119
  Drug acquisition 237,670 121,751 401,201 541,632 516,793 131,500 707,051 344,684
  Supportive care 473 1,441 1,779 2,364 2,255 411 4,579 2,403
  Administration 7,365 8,113 13,394 3,095 22,394 21,412
  Progression 39,261 40,175 44,318 44,105 43,298 46,744 51,708 52,014
  Adverse event 4,351 4,583 4,457 7,156 4,166 9,127 3,469 2,607
Total QALYs 2.04 2.16 2.74 2.71 2.60 3.23 4.38 4.38
  PFS 1.00 1.07 1.37 1.36 1.30 1.69 2.28 2.35
  Progression 1.03 1.09 1.37 1.36 1.30 1.54 2.10 2.03
Total life-years (OS) 3.25 3.44 4.37 4.32 4.14 4.93 6.97 6.71
  PFS 1.55 1.64 2.12 2.09 2.00 2.41 3.52 3.38
  Progression 1.70 1.79 2.25 2.23 2.14 2.52 3.44 3.33
ICER vs. LEN-DEX -853,800 252,293 484,168 508,021 Dominant 216,360 60,359

BOR = bortezomib; CFZ = carfilzomib; DAR = daratumumab; DEX = dexamethasone; ELO = elotuzumab; ICER = incremental cost-effectiveness ratio; IX = ixazomib; LEN = lenalidomide; OS = overall survival; PAN = panobinostat; PFS = progression-free survival; QALY = quality-adjusted life-year.

Table 2 shows the comparative results using a league table approach, which listed interventions from lowest to highest QALYs and then calculated an incremental cost-effectiveness ratio (ICER) for each intervention compared with the next best option. Interventions that were dominated were removed from the calculations, and a new ICER was computed versus the next best comparator. Table 2 shows that in the second line, BOR+DEX dominates LEN+DEX; DAR+BOR+DEX has an ICER of $50,704 versus BOR+DEX; and DAR+LEN+DEX has an ICER of $2,707,547 versus DAR+BOR+DEX. In the third line, BOR+DEX dominates LEN+DEX; PAN+BOR+DEX has an ICER of $14,124 versus BOR+DEX; DAR+BOR+DEX has an ICER of $248,762; and DAR+BOR+DEX is cost minimizing versus DAR+LEN+DEX. If we remove PAN+BOR+DEX because of the challenges mentioned in the Network Meta-analysis section (including high levels of treatment discontinuation in the PANORAMA-1 trial and lack of direct comparative evidence to LEN+DEX, among others), DAR+BOR+DEX has an ICER of $60,359 versus BOR+DEX.

The results of the pairwise analyses can be found in Table 3. ICERS for new second-line regimens versus LEN+DEX were estimated to be $51,000 per QALY for DAR+BOR+DEX, followed by DAR+LEN+DEX ($188,000) and CFZ+LEN+DEX ($211,000), with greater than $400,000 per QALY for ELO+LEN+DEX and IX+LEN+DEX. In the third line, ICERs for new regimens versus LEN+DEX were estimated to range from dominant for PAN+BOR+DEX to $60,000 per QALY for DAR+BOR+DEX, followed by DAR+LEN+DEX ($216,000), CFZ+LEN+DEX ($253,000), and approximately $500,000 per QALY for ELO+LEN+DEX and IX+LEN+DEX. As noted previously, results for PAN+BOR+DEX should be interpreted with caution because of population censoring and reliance on indirect treatment comparisons in the NMA. In both settings, incremental costs were driven primarily by increased drug costs rather than progression, supportive care, or AE costs.

Sensitivity Analyses

In each one-way analysis (not shown), results were by far most sensitive to the PFS hazard ratios for each intervention versus LEN+DEX, followed by the estimated link between PFS and OS (2.45 months of OS for each month of PFS, per Felix et al., 201324), drug costs, dosage intensity, and health state utilities.6

Probabilistic sensitivity analysis showed variability in model outcomes (Figure 1). In the second-line setting among the new agents, DAR+BOR+DEX had an 87% probability of being cost-effective at $150,000 per QALY, while all other agents had a 0% probability of being cost-effective at that threshold. In the third-line setting, PAN+BOR+DEX had an 87% probability of being cost-effective at $150,000 per QALY; however, this probability declined with increasing willingness to pay for the better survival outcomes of DAR+BOR+DEX. No other new agents were cost-effective at a cost-effectiveness acceptability threshold of $150,000 per QALY.

FIGURE 1.

FIGURE 1

Cost-effectiveness Acceptability Curves

We performed several scenario analyses (data not shown). The resulting ICERs varied but no regimens were found to have ICERs below $150,000 per QALY that were not already estimated to be so in the primary analysis.

Finally, we performed a threshold analysis to estimate the unit price coincident with commonly used cost-effectiveness thresholds, by holding all other parameters constant (including BOR, LEN, and DEX costs) and identifying the threshold price for each novel agent that made the overall regimen cost-effective at different willingness-to-pay thresholds. We also implemented this analysis with the probabilistic sensitivity analysis to calculate 95% credible range estimates for each threshold price. Table 4 demonstrates that most agents, in combination regimens, would require substantial discounts to meet the highest cost-effectiveness threshold of $150,000 per QALY, and some would not even be cost-effective at a price of $0, given the existing cost of the other drugs in the regimen.

TABLE 4.

Drug Cost Thresholds

Second Line, $ Third Line, $
WTP Threshold 50,000 100,000 150,000 50,000 100,000 150,000
CFZ-LEN-DEX 55 649 1,242 0 445 946
(−906-1,063) (−68-1,733) (405-2,661) (−938-622) (−633-1,518) (−386-2,417)
ELO-LEN-DEX −69 252 572 −126 162 449
(−535-619) (−141-903) (138-1,272) (−644-484) (−266-692) (34-1,032)
IX-LEN-DEX −278 127 533 -347 19 385
(−903-567) (−294-830) (84-1,329) (−1,046-593) (−502-769) (−40-1,180)
PAN-BOR-DEX 3,459 4,344 5,229
(2,389-5,552) (2,668-8,242) (2,792-10,987)
DAR-LEN-DEX −165 567 1,298 −293 351 995
(−779-486) (−51-1,239) (614-2,093) (−902-417) (−239-1,080) (338-1,800)
DAR-BOR-DEX 1,840 2,582 3,324 1,708 2,397 3,087
(1,495-2,278) (2,139-3,050) (2,674-3,976) (1,374-2,114) (1,948-2,959) (2,479-3,845)

Note: Results reflect threshold prices for the first listed drug in each triplet regimen only (all other parameter values held constant).

BOR = bortezomib; CFZ = carfilzomib; DAR = daratumumab; DEX = dexamethasone; ELO = elotuzumab; IX = ixazomib; LEN = lenalidomide; PAN = panobinostat; WTP = willingness to pay.

Discussion

We evaluated 8 drugs in 8 regimens for 2 relapsed and/or refractory MM populations, second and third lines. All the regimens evaluated were estimated to increase time in the progression-free health state and overall survival versus the standard regimens of LEN+DEX and BOR+DEX, but at substantial additional cost compared with these doublet regimens. As such, the value for most of these regimens, according to commonly used thresholds for cost-effectiveness, would be considered questionable. A few regimens were estimated to be in the cost-effective range, including DAR-BOR-DEX (second line) and PAN-BOR-DEX (third line), although the analysis for PAN+BOR+DEX was reserved to the third line and had data challenges that limited our willingness to make strong conclusions about its value. If we remove PAN+BOR+DEX from the third-line analysis, DAR-BOR-DEX becomes the most cost-effective. The LEN+DEX-based regimens with the best ICERs were DAR+LEN+DEX and CFZ+LEN+DEX, although neither regimen fell below the $150,000 per QALY threshold due, in part, to the high cost of LEN and the longer duration of therapy because of increased PFS compared with doublet regimens.

The analyses reported here reveal that important advances in the treatment of relapsed and/or refractory MM have been made over the past decade, which have expanded treatment options and improved patient outcomes. However, only a few regimens have done so in a cost-effective manner.

Our findings highlight a few key implications for stakeholders facing treatment, recommendation, or coverage and reimbursement decisions related to relapsed and/or refractory MM. First, there are cost-effective options for patients in this setting. DAR, the agent with the highest estimated life expectancy and QALY outcomes, demonstrated good value when used in combination with BOR-DEX in the second- and third-line settings. Depending on the clinical situation and patient and provider preferences, this regimen may be preferred over other regimens. Second, new agents combined with BOR uniformly had better ICERs relative to the same new agents in combination with LEN. This is primarily a function of the drug cost for BOR and LEN, respectively. The cost per month for BOR is about half as much as that for LEN. Couple this with the treat-to-progression dosing schedules, and the BOR-based regimens were uniformly more cost-effective compared with the LEN-based regimens.

Importantly, incremental drug costs included additional costs of the new drug as well as extended use of LEN+DEX or BOR-DEX because of improved PFS for the entire regimen. For example, the total treatment cost of LEN in the preprogression state when given as part of the CFZ+LEN+DEX regimen is $260,392 versus $239,745 when given as part of the LEN+DEX regimen because of the longer time in the progression-free state and therefore longer time on treatment. With treat-to-progression strategies, the additional clinical benefits of extending time in the progression-free health state come with consistent extra costs, whereas regimens that generally include 1 or more agents with a fixed dosing schedule (i.e., CFZ+LEN+DEX and PAN+BOR+DEX) do not incur the same amount of additional cost. In this context, the cost and value in real-world settings will be different if the clinical community deviates from the fixed dosing strategies suggested in the prescribing information.

The drug cost threshold analysis highlights a current challenge to providing cost-effective cancer treatment over and above that related to the cost of individual drugs, that is, the issue of adding new agents to existing regimens and creating expensive combination regimens of 2 or more drugs. This challenge has been discussed previously and remains a challenge globally.29,30 Specifically, manufacturers of new drugs that are used in combination with other, often expensive, drugs have no influence over the cost of the other drugs; therefore, manufacturers have a limited ability to establish prices in line with a given health system’s willingness to pay for health gains. As our threshold analysis revealed, for many of the drugs evaluated, the discounts that would be required to achieve commonly used cost-effectiveness thresholds were unrealistic and at times entered into the negative space. However, these are real extra costs to the health system and must be factored into economic analyses.

There are limited options available to address this issue. Indication-based pricing has gained some traction as a potential option to align the use of drugs with the specific value delivered to patients and patient populations, but the challenges with creating this system for an individual drug therapy would only be exacerbated for combination regimens. CMS has indicated a willingness to explore innovative contracts, especially in oncology, but for indication-based pricing to work with combination regimens, the payer would need to establish a comprehensive indication-based pricing process that applied to all manufacturers, and we are a long way from such a system at present.31 Some companies have indicated that they are willing to entertain decreased costs for drugs used in combination if they make all the drugs.32 Another possibility is for manufacturers to work together to provide a group discount. This option has begun to gain traction in Europe, where manufacturers can work with large public payers, although implementation in the United States would be challenged by its fragmented health system.33

We found limited data on the cost-effectiveness of drugs to treat MM in the United States. A recently published cost-effectiveness analysis by Jakubowiak et al. (2016) examined CFZ+LEN+DEX versus LEN+DEX in relapsed MM from a U.S. perspective.34 Although the total incremental cost was similar to that in our model ($179,400 vs. $183,000, respectively), the estimate of QALYs gained with CFZ+LEN+DEX was notably different (1.67 vs. 0.86).

This difference appears to be a result of 2 key differences between models. First, the independently modeled PFS and OS curves in the Jakubowiak et al. analysis yielded much more favorable estimates of treatment effect for CFZ+LEN+DEX than those reported in the ASPIRE trial versus LEN+DEX (PFS odds ratio = 0.51 [model] vs. 0.69 [published hazard ratio]; OS hazard ratio = 0.70 [model] vs. 0.79 [published hazard ratio]). These differences appear to explain a modeled increase in PFS that was over 5 months longer than the median PFS observed in the ASPIRE trial (unpublished data. Amgen. Data provided in response to ICER data request. QOL/Utility Data from ASPIRE Cost Effectiveness Model. February 22, 2016) Second, Jakubowiak et al. used a log-logistic parametric function for the PFS curve, which is prone to long tails in the distribution, whereas we used a Weibull function. Finally, we note that 1 of the findings of the Jakubowiak et al. analysis appears to be counterintuitive, in that CFZ+LEN+DEX patients spend approximately 4 years in the postprogression state in the model versus approximately 3 years for LEN+DEX; however, the postprogression treatment costs for LEN+DEX are reported to be higher.

Limitations

Our analysis had several limitations that warrant mention. We had limited data for all agents on OS. Therefore, we used a method supported by other methodologists, and used in previous analyses, to estimate OS using PFS.25,35 Although directly observed data on OS would be preferred, we felt this method allowed for an unbiased and uniformly applicable approach to estimated OS using PFS, the primary endpoint from all the pivotal trials. This approach would be expected to limit potential bias across drugs given that the agents and regimens evaluated were all approved at different times, so the standard of care, especially after progression, would be expected to vary and potentially affect OS. The observed relationship in any individual study may have been different than that applied uniformly in our model. We therefore tested the estimate in 1-way sensitivity and scenario analyses and found that, while the assumed relationship of PFS to OS was a sensitive parameter, its effect was much less than that of varying PFS hazard ratios and did not substantially affect the primary findings.

We did not have sufficient data for each regimen in the second and third lines. Therefore, we used line-specific data from LEN+DEX, the baseline comparator, and applied the same treatment effect for the new regimens to the separate baseline population estimates, under the assumption that the treatment effect (i.e., the hazard ratio) was consistent across the second and third lines. This assumption had validity in that we found no consistent evidence of a differential treatment effect by line of therapy, and the trials were powered to detect differences in the overall effect in the full intent-to-treat population.

Finally, we note that PFS results in the tables will not match those seen in clinical trials because of our anchoring of hazard ratios to the baseline survival curves for LEN+DEX rather than use of observed survival curves in each trial. However, because of the fixed-effects nature of the NMA, relative effects from each trial are essentially preserved. Our drug cost estimates also had good face validity when compared against an analysis performed by Potluri et al. (2015) using the MarketScan claims database (total LEN+DEX cost in the model: $280,000 vs. Potluri et al.: approximately $310,000).36

Conclusions

The introduction of newer drugs and regimens to treat second- and third-line relapsed and/or refractory MM appears to provide clinical benefits by lengthening PFS and OS and improving quality of life. However, only the addition of DAR or PAN may be considered cost-effective options according to commonly cited thresholds, and PAN+BOR+DEX results require cautious interpretation. Achieving levels of value more closely aligned with patient benefit would require substantial discounts for the remaining agents evaluated.

APPENDIX A. Key Trials Included in the Network Meta-analysis

Key Trials Patient Characteristics Treatment Comparator Harms (Treatment Arm)
ASPIRE
Open-label RCT
Phase 3
Carfilzomib (CFZ)
• Median age: 64
• ECOG = 2: 9.5%
• ISS Stage III: 20%
• Previous SCT: 57%
• High risk: 12.6%
• Prior regimens (median): 2
• Prior BOR: 65.8%
• Prior LEN: 19.8%
CFZ+LEN+DEX
(n = 396)
LEN+DEX
(n = 396)
• Discontinued d/t AEs: 15%
• SAEs: 60%
• Tx-related deaths: 2%
• Median f/u: 32.3 m • Median f/u: 31.5 m
• OS HR: 0.79 (95% CI: 0.63-0.99; P = 0.04)
• PFS HR: 0.69 (95% CI: 0.57-0.83)
• Median PFS: 26.3 m
• ORR: 87.1%
• Median PFS: 17.6 m
• ORR: 66.7%, P < 0.001
CASTOR
Open-label RCT
Phase 3
Daratumumab (DAR)
• Median age: 64
• ECOG = 2: NR
• ISS Stage III: 22%
• Previous SCT: 61%
• del(17p): 10%
• Prior regimens (median): 2
• Prior BOR: 66%
• Prior LEN: 76%
DAR+BOR+DEX
(n = 251)
BOR+DEX
(n = 247)
• Discontinued d/t AEs: 7%
• SAEs: 76%
• Tx-related deaths: 5%
• Median f/u: 7.4 m
• Deaths: 11.6% • Deaths: 14.6%
• PFS HR: 0.39 (95% CI: 0.28-0.53; P < 0.001)
• Median PFS: NR
• ORR: 82.9%
• Median PFS: 7.2 m
• ORR: 63.2%
POLLUX
Open-label RCT
Phase 3
Daratumumab (DAR)
• Median age: 65
• ECOG = 2: 5%
• ISS Stage III: 20%
• Previous SCT: 63%
• del(17p): 8%
• Prior regimens (median): 1
• Prior BOR+LEN: 15%
DAR+LEN+DEX
(n = 286)
LEN+DEX
(n = 283)
• Discontinued d/t AEs: 8%
• SAEs: 49%
• Tx-related deaths: 4%
• Median f/u: 13.5 m
• Deaths: 10.5% • Deaths: 15.9%
• PFS HR: 0.37 (95% CI: 0.27-0.52; P < 0.001)
• Median PFS: NR
• ORR: 92.9%
• Median PFS: 18.4 m
• ORR: 76.4%
ELOQUENT-2
Open-label RCT
Phase 3
Elotuzumab (ELO)
• Median age: 66
• ECOG = 2: 9%
• ISS Stage III: 21%
• Previous SCT: 54%
• del(17p): 32%
• Prior regimens (median): 2
• Prior BOR: 70%
• Prior LEN: 6%
ELO+LEN+DEX
(n = 321)
LEN+DEX
(n = 325)
• Discontinued d/t AEs: 13%
• SAEs: 65%
• Tx-related deaths: 2%
• Median f/u: 24.5 m
• OS HR: 0.71 (95% CI: 0.54-0.93)
• PFS HR: 0.70 (95% CI: 0.57-0.85; P < 0.001)
• Median PFS: 19.4 m
• ORR: 79%
• Median PFS: 14.9 m
• ORR: 66%, P < 0.001
TOURMALINE-MM1
Double-blind RCT
Phase 3 (unpublished)
Ixazomib (IX)
• Median age: 66
• ECOG = 2: 6%
• ISS Stage III: 13%
• Previous SCT: 57%
• High risk: 19%
• Prior regimens (median): 2
• Prior BOR: 69%
• Prior LEN: 12%
IX+LEN+DEX
(n = 360)
Placebo+LEN+DEX
(n = 362)
• Discontinued d/t AEs: 13%
• SAEs: 40%
• Tx-related deaths: NR
• Median f/u (PFS): 23 m
• Deaths: 22.5% • Deaths: 24.8%
• PFS HR: 0.74 (95% CI: 0.59-0.94; P = 0.012)
• Median PFS: 20.6 m
• ORR: 78%
• Median PFS: 14.7 m
• ORR: 72%, P < 0.001
PANORAMA-1
Double-blind RCT
Phase 3
Panobinostat (PAN)
• Median age: 63
• ECOG = 2: 5%
• ISS Stage III: 22%
• Previous SCT: 58%
• 1 prior regimen: 51%
• Prior BOR+DEX: 38%
• Prior LEN: 21%
PAN+BOR+DEX
(n = 387)
Placebo+BOR+DEX
(n = 381)
• Discontinued d/t AEs: 36%
• SAEs: 60%
• Tx-related deaths: 3%
• Median f/u: 6.4 m • Median f/u: 5.9 m
• OS HR: 0.87 (95% CI: 0.69-1.10; P = 0.26)
• PFS HR: 0.63 (95% CI: 0.52-0.76; P < 0.0001)
• Median PFS: 11.99 m
• ORR: 60.7%
• Median PFS: 8.08 m
• ORR: 54.6%, P = 0.09

AE = adverse event; BOR = bortezomib; CI = confidence interval; del(17p) = deletion in 17 p region of tumor protein 53 gene; DEX = dexamethasone; d/t = due to; ECOG=Eastern Cooperative Oncology Group; f/u = follow-up; HR = hazard ratio; ISS = International Staging System; LEN = lenalidomide; NR = not reported; ORR = objective response rate; OS = survival; PFS = progression-free survival; RCT = randomized controlled trial; SAE = serious adverse event; SCT = stem cell transplant; Tx = treatment.

APPENDIX B. Grade 3/4 Adverse Event Rates3,26

LEN-DEX % BOR-DEX % CFZ-LEN-DEX % ELO-LEN-DEX % IX-LEN-DEX % PAN-BOR-DEX % DAR-LEN-DEX % DAR-BOR-DEX % Per AE Cost $
n = 353 n = 64 n = 392 n = 318 n = 360 n = 381 n = 360 n = 381
Anemia 17.50 19.50 14.80 18.90 11.70 17.80 12.40 14.40 971
Arrythmias 3.00 2.00 NR NR 5.50 3.00 NR NR 6,998
Back pain 3.00 3.00 NR 5.00 0.80 0.80 1.40 NR 10,728
Cataract 2.80 NR NR 6.30 NR NR NR NR 3,700
Deep vein thrombosis 3.40 NR 4.10 5.70 3.00 NR 1.80 NR 31,645
Diarrhea 3.50 8.00 3.80 5.00 6.40 25.50 5.30 3.70 9,738
Fatigue 5.10 11.90 7.70 12.60 4.00 23.90 6.40 4.50 8,437
Hyperglycemia 6.30 NR 4.60 17.00 2.20 NR NR NR 166
Hypertension 1.50 NR 4.30 NR 3.00 NR NR 6.60 5,478
Hypocalcemia 3.60 2.00 2.60 11.30 4.40 5.00 NR NR 1,155
Hypokalemia 5.20 7.00 10.50 11.30 4.40 18.00 NR NR 1,707
Lymphopenia 27.40 40.20 46.40 76.70 32.50 53.60 5.30 9.50 166
Nausea 0.40 0.50 0.20 0.90 1.70 5.50 1.40 NR 11,934
Neutropenia 34.70 11.40 38.80 33.60 21.90 34.50 51.90 12.80 166
Peripheral/sensory neuropathy 1.50 14.60 1.70 3.80 2.50 17.60 NR 4.50 783
Pneumonia 8.10 10.30 8.90 14.20 6.00 12.60 7.80 8.20 14,855
Thrombocytopenia 14.30 31.40 25.80 19.20 25.30 67.30 12.70 45.30 166
Vomiting 0.70 1.30 NR 0.30 1.10 7.30 1.10 NR 11,934

AE = adverse event; BOR = bortezomib; CFZ = carfilzomib; DAR = daratumumab; DEX = dexamethasone; ELO = elotuzumab; IX = ixazomib; LEN = lenalidomide; NR = not reported; PAN = panobinostat.

REFERENCES


Articles from Journal of Managed Care & Specialty Pharmacy are provided here courtesy of Academy of Managed Care Pharmacy

RESOURCES