Skip to main content
Journal of Clinical Oncology logoLink to Journal of Clinical Oncology
. 2013 Feb 19;31(10):1285–1292. doi: 10.1200/JCO.2012.44.4489

Effect of Ruxolitinib Therapy on Myelofibrosis-Related Symptoms and Other Patient-Reported Outcomes in COMFORT-I: A Randomized, Double-Blind, Placebo-Controlled Trial

Ruben A Mesa 1,, Jason Gotlib 1, Vikas Gupta 1, John V Catalano 1, Michael W Deininger 1, Alan L Shields 1, Carole B Miller 1, Richard T Silver 1, Moshe Talpaz 1, Elliott F Winton 1, Jimmie H Harvey 1, Thomas Hare 1, Susan Erickson-Viitanen 1, William Sun 1, Victor Sandor 1, Richard S Levy 1, Hagop M Kantarjian 1, Srdan Verstovsek 1
PMCID: PMC4979167  PMID: 23423753

Abstract

Purpose

To assess the effects of ruxolitinib on symptom burden and quality of life (QoL) and to evaluate the ability of the modified Myelofibrosis Symptom Assessment Form (MFSAF) v2.0 to measure meaningful changes in myelofibrosis-related symptoms in patients with myelofibrosis.

Patients and Methods

COMFORT-I (Controlled Myelofibrosis Study With Oral JAK Inhibitor Treatment–I) is a double-blind, placebo-controlled phase III study evaluating ruxolitinib in patients with intermediate-2 or high-risk myelofibrosis. Exploratory analyses were conducted on the following patient-reported outcomes (PROs) assessments: modified MFSAF v2.0 (individual symptoms and Total Symptom Score [TSS]), European Organisation for Research and Treatment of Cancer Quality of Life Questionnaire-Core 30 (EORTC QLQ-C30), Patient Reported Outcomes Measurement Information System (PROMIS) Fatigue Scale, and Patient Global Impression of Change (PGIC).

Results

Patients receiving ruxolitinib experienced improvements in individual myelofibrosis-related symptoms, although patients receiving placebo experienced worsening (P < .001). The majority (91%) of ruxolitinib-treated patients designated as ≥ 50% TSS responders (≥ 50% TSS improvement) self-reported their condition as either “Much improved” or “Very much improved” on the PGIC. These patients achieved significant improvements in the EORTC QLQ-C30 functional domains and Global Health Status/QoL versus patients receiving placebo, who experienced worsening on these measures (P ≤ .0135). Ruxolitinib-treated patients with a lesser degree of symptom improvement (< 50% TSS responders) also achieved improvements over placebo on these measures. The degree of spleen volume reduction with ruxolitinib correlated with improvements in TSS, PGIC, PROMIS Fatigue Scale, and EORTC Global Health Status/QoL. Ruxolitinib-treated patients who achieved a ≥ 35% reduction in spleen volume experienced the greatest improvements in these PROs.

Conclusion

Ruxolitinib-treated patients achieved clinically meaningful improvements in myelofibrosis-related symptoms and QoL, but patients receiving placebo reported worsening of symptoms and other PROs.

INTRODUCTION

Myelofibrosis is characterized by splenomegaly,1 cytopenias,1 and symptoms that may be debilitating, such as fatigue, pruritus, night sweats, fever, bone pain, and weight loss.2 These symptoms are highly prevalent among patients with myelofibrosis and can adversely affect quality of life (QoL).2 The presence of myelofibrosis-related constitutional symptoms (unexplained fever, drenching night sweats, weight loss) has been identified as a risk factor for shortened survival.3 Splenomegaly and myelofibrosis symptoms are thought to be driven by dysregulation of the Janus kinase (JAK) –STAT pathway resulting from mutations that lead to constitutively active JAK24,5 or increased proinflammatory cytokines that signal through JAK1 and JAK2.6

There are many types of patient-reported outcome (PRO) tools used in oncology; until recently, none were specifically designed to evaluate the symptoms of myelofibrosis.7 The Myelofibrosis Symptom Assessment Form (MFSAF) was developed to evaluate the presence and severity of myelofibrosis-related symptoms.7 In a phase II study of patients with myelofibrosis treated with ruxolitinib, a JAK1/JAK2 inhibitor, the MFSAF proved sensitive to ruxolitinib-associated improvements in symptoms over time, and symptom improvements correlated with objective measures of efficacy.8 Subsequently, a more streamlined version of the MFSAF, the modified MFSAF v2.0, was developed and was used in a phase III, double-blind, placebo-controlled study (COMFORT-I; Controlled Myelofibrosis Study With Oral JAK Inhibitor Treatment–I).9 In this analysis, we evaluated the ability of the modified MFSAF v2.0 to measure meaningful changes in myelofibrosis-related symptoms and the effects of ruxolitinib on symptom burden and other PROs in COMFORT-I.

PATIENTS AND METHODS

Patients

Adult patients who were diagnosed with primary myelofibrosis, postpolycythemia vera myelofibrosis, or postessential thrombocythemia myelofibrosis10; were classified as International Prognostic Scoring System3 high risk or intermediate 2 risk; had palpable splenomegaly (≥ 5 cm below left costal margin); and were resistant or refractory to, intolerant of, or not candidates for available therapy (in the investigator's opinion) were enrolled. Full inclusion and exclusion criteria have been previously reported.9

Study Design

In this multicenter, double-blind phase III trial, patients were randomly assigned 1:1 to receive placebo or ruxolitinib (starting doses of 15 mg twice daily for platelet counts 100 to 200 × 109/L and 20 mg twice daily for platelet counts > 200 × 109/L). Dose modification occurred in a blinded fashion for both arms on the basis of predefined criteria. Dose increases for lack of efficacy were permitted, and dose reductions for declining platelet or absolute neutrophil counts were required. The minimum recommended dose was 5 mg twice per day and the maximum permitted dose was 25 mg twice per day.9 Before week 24, patients receiving placebo were eligible for early unblinding and crossover to ruxolitinib if they had a ≥ 25% increase from baseline in spleen volume along with worsening early satiety accompanied by weight loss or worsening splenic pain with increased narcotic requirements. After week 24, patients receiving placebo with asymptomatic spleen growth ≥ 25% could also cross over to ruxolitinib. The primary end point was the proportion of patients who achieved a ≥ 35% reduction in spleen volume by magnetic resonance imaging or computed tomography scans from baseline to week 24. The comparative secondary end point controlled for type I error was the proportion of patients who achieved ≥ 50% reduction (improvement) in the Total Symptom Score (TSS) from baseline to week 24 by using the modified MFSAF v2.0 electronic diary.9

This study was conducted in accordance with local regulatory requirements and Good Clinical Practice guidelines of the International Conference on Harmonisation. All patients provided written informed consent.

PRO Assessments

The PRO measures used in the COMFORT-I study have been previously described in detail9; a brief overview is provided here.

Modified MFSAF v2.0 and TSS

The modified MFSAF v2.0, an electronic daily symptom diary, was developed for the COMFORT-I study on the basis of prior paper versions of myelofibrosis symptom assessment forms7,8 along with feedback from the US Food and Drug Administration. Patients completed the modified MFSAF v2.0 every night from baseline through week 24 (25 weeks total) with electronic data downloaded to a central server. Patients rated the following myelofibrosis symptoms, at their worst as experienced in the 24 hours before assessment, by using a scale from 0 (absent) to 10 (worst imaginable): night sweats, pruritus/itching, abdominal discomfort, pain under the ribs (left side), early satiety, bone/muscle pain, and inactivity. The TSS reflects the sum of the scores of these symptoms except inactivity, for a maximum possible score of 60 (ie, most severe symptom experience). The baseline TSS was the average of seven daily measurements before baseline (at least four of which had to be non-missing), and the week 24 TSS was the average of 28 daily measurements before week 24 (at least 21 of which had to be non-missing).

PGIC Scale

The Patient Global Impression of Change (PGIC) scale assessed patients' perceptions of change in their myelofibrosis symptoms over time. The PGIC has been widely used to evaluate a patient's overall sense of whether a treatment has been beneficial.11 In this study, patients answered the following question at monthly study visits beginning at week 4: “Since the start of the treatment you've received in this study, your myelofibrosis symptoms are (1) Very much improved, (2) Much improved, (3) Minimally improved, (4) No change, (5) Minimally worse, (6) Much worse, (7) Very much worse.”

EORTC QLQ-C30

The European Organisation for Research and Treatment of Cancer Quality of Life Questionnaire-Core 30 (EORTC QLQ-C30) is a 30-item questionnaire used to evaluate QoL and includes five functional domains (physical, cognitive, role, emotional, and social) and a global health status scale. Each subscale is evaluated on a standardized scale of 0 (worst) to 100 (best).12 Patients completed the EORTC QLQ-C30 at the baseline visit and at each study visit.

PROMIS Fatigue Scale: Short Form

This scale measures the frequency and impact of fatigue.13 The Patient Reported Outcomes Measurement Information System (PROMIS) Fatigue Scale contains seven items with a recall period of 7 days. Each of the items uses a 5-point response option with scores of 1 (never) to 5 (always). An average score is calculated and transformed to a final score on a 100-point scale ranging from 0 (never fatigued) to 100 (always fatigued). Patients completed the instrument at the baseline visit and at each study visit.

Statistical Analysis

Patient disposition and baseline PRO scores were summarized descriptively. Percent changes from baseline in individual symptom scores (average symptom scores from the previous 28 days, as measured by the modified MFSAF v2.0), EORTC QLQ-C30 subscale scores and PROMIS Fatigue Scale scores were calculated at weeks 4, 8, 12, 16, 20 (individual symptom scores only), and 24. Treatment comparisons (ruxolitinib v placebo) in these percent changes were performed by using Wilcoxon rank sum test at each time point. Percent changes were calculated at the patient level before descriptive statistics were derived.

Previous data14 suggested that ruxolitinib doses as low as 10 mg twice per day were associated with symptom benefits similar to those of higher doses in patients with myelofibrosis. To confirm this, we also assessed the effect of dose on symptom improvements. Median percent change from baseline in TSS at week 24 was calculated across groups on the basis of final titrated ruxolitinib dose at week 24 (average ruxolitinib dose during weeks 21 to 24). Median percent change from baseline in TSS at week 24 in ruxolitinib-treated patients with and without new onset/worsening of grade 3 to 4 anemia or grade 3 to 4 thrombocytopenia were also calculated. These TSS analyses were descriptive.

The relationships between the following variables were also evaluated: (1) improvement in TSS (≥ 50% or < 50% TSS response) with the PGIC score and change from baseline in EORTC QLQ-C30 scores at week 24 and (2) spleen volume reductions (< 10%, 10% to 35%, ≥ 35%) with the PGIC, percent change from baseline in TSS, and changes from baseline in PROMIS Fatigue Scale and EORTC QLQ-C30 Global Health Status/QoL scores at week 24. For the relationship between TSS response and EORTC QLQ-C30 subscale score, an analysis of covariance was used with the baseline EORTC QLQ-C30 subscale as the covariate, the TSS response as the main effect, and the all-placebo group as the reference level for comparisons. For the relationship between spleen volume reduction and improvement on the PROs, analysis of covariance was used with the baseline value as the covariate, the spleen volume reduction group as the main effect, and with the all-placebo group as the reference level for comparisons. Because these analyses were exploratory and descriptive, no adjustments were made for multiple comparisons. These analyses were based on observed data (no missing value imputation was performed).

The interpretability of the modified MFSAF v2.0 was explored as described in the Appendix (online only). Test-retest reliability between week 7 and week 8 was measured by using intraclass correlation coefficients.

RESULTS

Patient Disposition and Baseline PRO Measures

Patient disposition (N = 309) at the time of the primary analysis, when all patients completed at least 24 weeks of treatment or discontinued and at least half the remaining patients completed 36 weeks of treatment, is shown in Figure 1. As reported previously, baseline characteristics were generally similar in the two treatment arms.9 The primary and secondary efficacy end points were also previously reported.9 Briefly, by week 24, a significantly greater proportion of ruxolitinib-treated patients had a ≥ 35% reduction in spleen volume compared with patients in the placebo group (41.9% v 0.7%; odds ratio, 134.4; 95% CI, 18.0 to 1,004.9; P < .001).9 Ruxolitinib treatment also provided improvement in symptom burden, with 45.9% of patients achieving a ≥ 50% reduction in TSS from baseline to week 24 versus 5.3% of the placebo arm (odds ratio, 15.3; 95% CI, 6.9 to 33.7; P < .001).9 Only 3.9% of ruxolitinib-treated patients showed a significant worsening of TSS from baseline to week 24 (ie, > 50% increase in TSS) compared with 33.0% of patients receiving placebo.

Fig 1.

Fig 1.

CONSORT diagram illustrating the disposition of the 309 enrolled patients at the time of primary analysis data cutoff. (*) Three patients not evaluable for safety but included in the intent-to-treat analysis of efficacy.

The majority of patients completed the EORTC QLQ-C30, PGIC, and PROMIS Fatigue Scale at each study visit (Appendix Table A1, online only), and the electronic daily data capture for the modified MFSAF v2.0 resulted in high compliance rates for completion of this PRO and almost no missing data. Completion of the electronic daily diary required 1 minute or less for 94% of patients, and test-retest reliability intraclass correlation coefficients (week 7 to week 8) were 0.97 for patients treated with placebo and 0.98 for patients treated with ruxolitinib.

Mean scores at baseline in the PRO measures were similar between treatment groups (Table 1). Moreover, all individual myelofibrosis-related symptoms assessed by the modified MFSAF v2.0 were prevalent in the majority of patients in the COMFORT-I study population at baseline. The most prevalent symptoms (reported by > 90% of patients receiving ruxolitinib and placebo) were abdominal discomfort, early satiety, and inactivity. Of note, these same symptoms ranked greatest in severity (Fig 2). EORTC QLQ-C30 subscales reflected poor QoL at baseline and were similar to scores of other populations with advanced cancer and another patient population with myeloproliferative neoplasms.15,16 Of the five functional domains, patients suffered the greatest burden in role functioning.

Table 1.

Baseline Scores on Patient-Reported Outcome Measures

Parameter Measured Ruxolitinib
Placebo
Maximum Score Possible
Mean Range Mean Range
Modified MFSAF v2.0, TSS 18.2 0–50.1 16.9 0–52.7 60 (worst possible)
EORTC QLQ-C30 Subscales
    Global Health Status/QoL 52.7 0–100 52.9 0–100 100 (best possible)
    Physical Functioning 69.7 0–100 67.2 20–100 100 (best possible)
    Role Functioning 64.5 0–100 63.2 0–100 100 (best possible)
    Emotional Functioning 73.3 0–100 75.5 0–100 100 (best possible)
    Cognitive Functioning 80.7 0–100 80.1 16.7–100 100 (best possible)
    Social Functioning 68.0 0–100 66.1 0–100 100 (best possible)
PROMIS Fatigue Scale 43.7 10.7–85.7 43.3 0–89.3 100 (worst possible)

Abbreviations: EORTC QLQ-C30, European Organisation for Research and Treatment of Cancer Quality of Life Questionnaire-Core 30; MFSAF, Myelofibrosis Symptom Assessment Form; PROMIS, Patient Reported Outcomes Measurement Information System; QoL, quality of life; TSS, Total Symptom Score.

Fig 2.

Fig 2.

Individual symptom scores at baseline (in patients with symptoms at baseline) as measured by the modified Myelofibrosis Symptom Assessment Form v2.0. The most prevalent symptoms (reported in > 90% of patients in both treatment groups) and most severe symptoms were abdominal discomfort, early satiety, and inactivity. Scale range: 0, absent to 10, worst imaginable.

Changes in Individual Myelofibrosis-Related Symptoms, QoL, and Fatigue

Overall, individual symptom scores as assessed by the modified MFSAF v2.0 at each 4-week time point in patients receiving ruxolitinib showed improvement relative to baseline (Fig 3). There was approximately linear worsening in symptom scores for patients receiving placebo over the entire 24 weeks. The differences between ruxolitinib- and placebo-treated groups were significant at all time points for all symptoms (P < .001; Fig 3). Improvements relative to baseline and placebo in PROMIS Fatigue Scale score and most EORTC QLQ-C30 subscales were also seen with ruxolitinib treatment (Appendix Table A2, online only).

Fig 3.

Fig 3.

(A-G) Assessment of individual symptom burden by using the modified Myelofibrosis Symptom Assessment Form v2.0. Individual symptom scores at each 4-week time point improved in patients receiving ruxolitinib, whereas patients who received placebo experienced a worsening of symptoms. P < .001 for all comparisons between ruxolitinib and placebo at all time points (weeks 4, 8, 12, 16, 20, and 24). Individual symptom scores at each 4-week time point were calculated by averaging the daily individual symptom scores from the previous 28 days.

Improvement in TSS by Average Total Daily Ruxolitinib Dose and Effect of Anemia or Thrombocytopenia on TSS in Ruxolitinib-Treated Patients

Although patients in the study began dosing at either 15 or 20 mg twice per day, individualized dose optimization resulted in an overall average dose exposure for ruxolitinib-treated patients of 15.5 mg twice per day. Median improvements from baseline at week 24 in TSS exceeding 50% were observed for all dose groups in which the dose was ≥ 10 mg twice per day: 71.1% (n = 30), 59.6% (n = 23), 67.7% (n = 38), and 66.2% (n = 20) for dose groups of 10 mg twice per day, 15 mg twice per day, 20 mg twice per day, and 25 mg twice per day, respectively. Ruxolitinib-treated patients who developed new-onset or worsening of grade 3 or 4 anemia achieved TSS improvements at week 24 that were similar in magnitude to improvements in those who did not experience grade 3 or 4 anemia (median, −46.4% [n = 47] and −58.3% [n = 82], respectively). Ruxolitinib-treated patients with and without new-onset or worsening of grade 3 or 4 thrombocytopenia also experienced TSS improvements (median, −26.8% [n = 13] and −62.5% [n = 116], respectively). Although there was an apparent difference in the magnitude of the improvements between these groups, the number of patients with grade 3 or 4 thrombocytopenia was too small to draw a clear conclusion. In contrast, patients receiving placebo showed worsening TSS scores (median, 14.6% [n = 103]).

Relationship Between TSS Improvement and PGIC

As detailed in Table 2, 62 (91.2%) of ruxolitinib patients who were ≥ 50% TSS responders characterized their condition as either “Much improved” or “Very much improved.” In addition, 67 (73.7%) of the placebo group who were less than 50% TSS responders characterized their condition as “unchanged” or “worsening.” These results suggest a relationship between the TSS and the PGIC in that the individual patient perceived and reported meaningful benefit from the treatment in terms of an overall improvement in myelofibrosis symptoms, which further supports the use of the modified MFSAF v2.0.

Table 2.

Relationship Between PGIC Scale and TSS

PGIC Scale Ruxolitinib* (n = 127)
Placebo* (n = 100)
Total Sample* (N = 227)
≥ 50% TSS Responder (n = 68)
< 50% TSS Responder (n = 59)
≥ 50% TSS Responder (n = 9)
< 50% TSS Responder (n = 91)
≥ 50% TSS Responder (n = 77)
< 50% TSS Responder (n = 150)
No. % No. % No. % No. % No. % No. %
Very much improved 35 51.5 8 13.6 1 11.1 0 0.0 36 46.8 8 5.3
Much improved 27 39.7 19 32.2 3 33.3 7 7.7 30 39.0 26 17.3
Minimally improved 3 4.4 22 37.3 3 33.3 17 18.7 6 7.8 39 26.0
No change 0 0.0 6 10.2 2 22.2 30 33.0 2 2.6 36 24.0
Minimally worse 2 2.9 3 5.1 0 0.0 19 20.9 2 2.6 22 14.7
Much worse 1 1.5 0 0.0 0 0.0 14 15.4 1 1.3 14 9.3
Very much worse 0 0.0 1 1.7 0 0.0 4 4.4 0 0.0 5 3.3

NOTE. ≥ 50% TSS responders are patients who achieved a ≥ 50% improvement in modified Myelofibrosis Symptom Assessment Form (MFSAF) v2.0 Total Symptom Score (TSS; baseline TSS–week-24 TSS); < 50% TSS responders are patients who achieved a < 50% improvement in modified MFSAF v2.0 TSS (baseline TSS–week-24 TSS).

Abbreviation: PGIC, Patient Global Impression of Change.

*

Number of patients with a baseline and week-24 TSS and who completed the PGIC at week 24.

Relationship Between TSS Improvement and EORTC QLQ-C30

For each subscale of the EORTC QLQ-C30, ruxolitinib- treated patients reported increases (improvements) from baseline, whereas patients receiving placebo reported decreases (worsening). Ruxolitinib-treated patients defined as ≥ 50% TSS responders achieved significantly greater improvements in the EORTC QLQ-C30 subscales versus patients receiving placebo who continued to show deterioration in their QoL (P ≤ .0135; Fig 4). Notably, ruxolitinib-treated patients with a lesser degree of symptom response (< 50% TSS responders) also achieved significant improvements over placebo for all EORTC QLQ-C30 subscales (P ≤ .0075), with the exception of the Emotional Functioning and Cognitive Functioning domains (Fig 4).

Fig 4.

Fig 4.

(A-F) Relationship between symptoms as assessed by the modified Myelofibrosis Symptom Assessment Form v2.0, with quality of life (QoL) as assessed by the European Organisation for Research and Treatment of Cancer Quality of Life Questionnaire Core 30 (EORTC QLQ-C30) at baseline. Patients receiving ruxolitinib who were categorized as ≥ 50% Total Symptom Score (TSS) responders achieved significantly greater improvements in the EORTC QLQ-C30 subscales versus patients in the placebo group.

Relationship Between Spleen Volume Reductions and PROs

In the ruxolitinib arm, improvements in TSS, abdominal symptoms, nonabdominal symptoms, and the PGIC score correlated with reductions in spleen size; patients who had a ≥ 35% reduction in spleen volume had the greatest improvement in symptoms and perceived change in condition (Figs 5A to 5D). Of note, however, ruxolitinib-treated patients who exhibited even small spleen volume reductions (< 10%) achieved meaningful improvements in these PRO measures compared with patients in the placebo group. In addition, ruxolitinib-treated patients who achieved a ≥ 10% reduction in spleen volume had significant improvements versus placebo in both the PROMIS Fatigue Scale (P < .001; Fig 5E) and EORTC QLQ-C30 Global Health Status (P < .001; Fig 5F). Indeed, the improvements in PRO parameters for ruxolitinib-treated patients who achieved ≥ 35% reduction in spleen volume versus those who achieved a 10% to less than 35% reduction in spleen volume were not significantly different (all P ≥ .07).

Fig 5.

Fig 5.

(A-F) Relationship between spleen volume reduction with ruxolitinib and patient-reported outcomes (PROs). Ruxolitinib-treated patients who achieved a ≥ 35% reduction in spleen volume experienced the greatest improvements in all PROs, whereas patients receiving placebo reported worsening of symptoms on these measures. However, patients given ruxolitinib who had ≥ 10% reduction in spleen volume also achieved significant improvements in all PROs. (D) Change from baseline in patient perception of their disease (baseline = score of 4 [no change]). PROMIS, Patient Reported Outcomes Measurement Information System; QoL, quality of life.

DISCUSSION

In this randomized, placebo-controlled study, patients showed a high prevalence and severity of individual myelofibrosis-related symptoms at baseline, and the modified MFSAF v2.0 was sensitive to differentiating responses in the placebo and ruxolitinib arms over time. Individual symptom scores with ruxolitinib showed improvement relative to baseline and to placebo early in the course of study treatment. The true magnitude of symptom response within 4 weeks of initiating ruxolitinib therapy may be underestimated in this analysis, because symptom scores calculated by using a 28-day average included scores from the initial days of treatment that may have potentially diluted the benefit seen later in the treatment period. Analysis of TSS in ruxolitinib-treated patients by using a 7-day moving average over time shows that the majority of TSS responses (≥ 50% reduction in TSS relative to baseline) occur within the first 4 weeks of treatment.9 The nightly recording of symptoms by the patient and electronic data transfers contributed to a high degree of compliance by patients in obtaining symptom data. Overall, the baseline prevalence of symptoms and rapid, clinically meaningful improvements exhibited by ruxolitinib-treated patients are consistent with those from the phase II study, in which a paper-pencil version of the MFSAF was sensitive to detecting early and sustained symptom improvements with ruxolitinib treatment.8 Meaningful reductions in symptom burden were observed for ruxolitinib doses ≥ 10 mg twice per day, while placebo-treated patients reported symptom worsening. Notably, ruxolitinib-treated patients also demonstrated improvement in fatigue (a common symptom in patients with myelofibrosis) relative to baseline and to patients in the placebo group on two PRO instruments.

The modified MFSAF v2.0 was also shown to correlate well with other established PRO assessment tools, including the PGIC and the EORTC QLQ-C30, further supporting use of the MFSAF in patients with myelofibrosis. At baseline, many of the scores for EORTC QLQ-C30 subscales, such as the Global Health Status/QoL, were much lower than those of the general population15,16 and were consistent with scores for other populations with advanced cancer15 as well as for another patient population with myelofibrosis.16 Moreover, the changes in the EORTC QLQ-C30 subscale scores with treatment mirrored trends in the MFSAF, with ruxolitinib ≥ 50% TSS responders showing the most improvement. Even ruxolitinib less than 50% TSS responders demonstrated significant improvements compared with those in the placebo group in EORTC QLQ-C30 subscale scores except for the Emotional Functioning and Cognitive Functioning domains; scores for these two domains were similar between the COMFORT-I population and general population at baseline.16

Finally, improvements in symptom burden and perceived change in condition, fatigue, and QoL in ruxolitinib-treated patients were not dependent on reaching the protocol-defined threshold of spleen response (≥ 35% reduction in spleen volume at week 24). Even patients with spleen volume reductions ≥ 10% who received ruxolitinib had significant improvements in symptom burden and perceived change in condition, fatigue, and QoL. A similar finding was observed in the phase II study.8

In conclusion, changes in the modified MFSAF v2.0 with ruxolitinib therapy were clinically meaningful. In addition, the modified MFSAF v2.0 correlated well with established PRO measures. Although there was a trend for greater improvements in PROs with greater spleen volume reductions with ruxolitinib, even patients with modest changes in spleen size or symptom scores demonstrated improvements in symptom burden and QoL, whereas patients receiving placebo continued to worsen by these same measures. These data support the use of the MFSAF in clinical studies of treatments for myelofibrosis.

Supplementary Material

Protocol

Acknowledgment

Presented as a poster at the 53rd American Society of Hematology Annual Meeting and Exposition, San Diego, CA, December 10-13, 2011.

We thank Susan Kralian, PhD, of the Curry Rockefeller Group for medical writing assistance and Hema Gowda and Kelly Reith of Incyte for additional editorial support.

Appendix

Methods

Two approaches to determine the interpretability of the modified Myelofibrosis Symptom Assessment Form (MFSAF) v2.0 electronic diary were used in this study. First, an anchor-based approach was used. Anchor-based methods use an external, patient-centric criterion to categorize patients into groups, each reflecting an a priori determined change grouping (eg, no change, large positive change, or large negative change). In this analysis, patient responses to the Patient Global Impression of Change (PGIC) Scale administered at week 24 were selected as the relevant anchors. Patients were grouped as either “≥ 50% Total Symptom Score (TSS) responders” (ie, achieved a ≥ 50% reduction in TSS from baseline to week 24) or “less than 50% TSS responders.” Next, these groups were anchored to the PGIC response they provided at week 24. In this way, both responder groups could be evaluated relative to whether the patients who were assigned to those groups felt like their condition improved (ie, did they perceive treatment benefit?). It would be expected that the proportion of patients labeled “≥ 50% TSS responders” would be only minimally represented in the “no change” or “worsening” categories and over-represented in the “improvement” categories compared with “less than 50% TSS responders.”

It was also important to demonstrate that the mean change in the TSS from baseline to week 24 in the treatment group was larger (ie, demonstrated greater symptom improvement) than observed in the placebo group. To understand the point at which an observed between-group difference in the TSS becomes meaningful and indicative of patient benefit, several distribution- and anchor-based methods can be used. These methods are not considered appropriate for determining the precise point at which a score change indicates clinical significance but are useful for evaluating the meaning of an observed score change. Minimally important difference (MID) and minimal clinically important difference (MCID) estimates can be used to inform the actual differences reported in the efficacy analysis. Conclusions in regard to treatment benefit can be supported to the extent that actual mean differences exceed these point estimates.

Results

Interpretability of changes in the modified MFSAF v2.0.

Several methods for estimating MID/MCID were used. By using a distribution-based method, an MID/MCID can be estimated from the SEM (Wyrwich KW, et al: Med Care 37:469-478, 1999) or from its formula by taking the standard deviation (SD) of the baseline TSS multiplied by the square root of 1 minus the reliability coefficient. As administered in COMFORT-I (Controlled Myelofibrosis Study With Oral JAK Inhibitor Treatment–I) the SEM = (11.4 × square root of [1 − 0.89]) or 3.78. Other distribution-based methods of estimating MID/MCID have been proposed, and one study (Norman GR, et al: Med Care 41:582-592, 2003) suggested using a one-half SD rule as appropriate for generating an MID/MCID. As administered in COMFORT-I, the one-half SD estimate = (0.5 × 11.4) or 5.7. Finally, defining MID/MCID via anchor-based methods is also possible and provides a categorical rating of change between baseline TSS and the week 24 TSS. In this way, an MID/MCID is typically defined as the mean change in the TSS among all patients, indicating that they are “minimally improved” on the PGIC scale (ie, the MID/MCID anchor). As administered in COMFORT-I, the patient-centric anchor-based MID is −0.1, suggesting that patients feel minimally improved as long as their myelofibrosis symptoms are not worsening. As indicated in Appendix Table A3, the mean change in TSS among patients reporting that they are “Much improved” or “Very much improved” exceeds estimates of MID/MCID.

Table A1.

Completion Rates for PRO Instruments in COMFORT-I by Study Visit

PRO Instrument Study Visit No. of Patients With Missing Data Value*
Completed PRO Instrument (%)
Ruxolitinib (n = 155) Placebo (n = 154) Total (N = 309)
Modified MFSAF v2.0 Baseline 6 2 8 97.4
Week 4 0 3 3 99.0
Week 12 4 9 13 95.8
Week 24 9 22 31 90.0
EORTC QLQ-C30 Global Health Status/QoL Baseline 7 7 14 95.5
Week 4 8 5 13 95.8
Week 12 4 10 14 95.5
Week 24 0 23 23 92.6
PGIC Week 4 12 12 24 92.2
Week 12 7 22 29 90.6
Week 24 6 27 33 89.3
PROMIS Fatigue Scale Baseline 2 3 5 98.4
Week 4 4 7 11 96.4
Week 12 4 11 15 95.1
Week 24 0 20 20 93.5

Abbreviations: COMFORT-I, Controlled Myelofibrosis Study With Oral JAK Inhibitor Treatment–I; EORTC QLQ-C30, European Organisation for Research and Treatment of Cancer Quality of Life Questionnaire-Core 30; MFSAF, Myelofibrosis Symptom Assessment Form; PGIC, Patient Global Impression of Change; PRO, patient-reported outcome; PROMIS, Patient Reported Outcomes Measurement Information System; QoL, quality of life.

*

For the MFSAF, missing data values are the result of patients not completing the required number of entries for the reporting period (at least 4 of 7 days for baseline, and at least 20 of 28 days for week 12 or 24). For the remaining PROs, missing data values are the result of patients not completing data entries for the study visit.

The completion data for the EORTC QLQ-C30 Global Health Status/QoL subscale is shown; other subscales of the EORTC QLQ-C30 were completed at similar rates by patients.

Table A2.

Mean Percent Change From Baseline in EORTC QLQ-C30 Subscale Scores and PROMIS Fatigue Scale by Study Visit

PRO Instrument Week 4
Week 8
Week 12
Week 16
Week 24
Ruxolitinib Placebo Ruxolitinib Placebo Ruxolitinib Placebo Ruxolitinib Placebo Ruxolitinib Placebo
EORTC QLQ-C30
    Global Health Status/QoL 41.4 1.2 31.8 2.0 34.5 15.8 38.4 9.7 44.6 8.7
    Physical functioning 14.1 −4.8 19.4 −0.4 19.7 −2.8 21.0 −2.7 23.8 −4.5
    Role functioning 15.7 −16.8 18.4 −11.3 15.1 −13.0 18.5 −15.5 19.7 −17.9
    Emotional functioning 16.6 7.2 14.4 10.2 15.9 6.0 19.2 8.8 19.4 4.0
    Cognitive functioning 4.7 6.7 7.7 3.7 6.9 6.6 9.5 7.3 8.2 4.9
    Social functioning 23.9 −3.5 26.2 −6.8 29.6 −5.7 20.6 −7.9 24.4 −14.6
    Fatigue −20.3 7.2 −15.9 14.8 −16.7 9.4 −21.0 4.5 −19.9 6.6
    Nausea and vomiting −56.2 −42.8 −73.3 −33.2 −77.0 −35.1 −71.7 −37.5 −77.2 −26.9
    Pain −14.8 40.3 −32.6 27.9 −23.8 35.2 −18.5 23.3 −33.9 33.5
    Dyspnea −43.3 −11.4 −29.3 −11.8 −30.6 −7.3 −29.5 −0.4 −34.6 0.5
    Insomnia −24.9 −6.5 −37.5 −8.1 −32.8 −3.5 −34.4 −6.2 −41.2 −9.2
    Appetite loss −63.1 3.7 −67.8 8.1 −69.7 −5.7 −67.2 −17.5 −64.0 −3.1
    Constipation −34.3 −19.6 −54.4 −33.3 −48.7 −27.8 −43.3 −23.0 −48.9 −37.9
    Diarrhea −42.0 −6.6 −53.9 −22.7 −49.0 −24.5 −49.0 −8.5 −47.2 −0.8
    Financial difficulties −30.2 −20.8 −29.5 −18.6 −33.3 −19.4 −24.0 −22.1 −33.0 0.5
PROMIS Fatigue −10.3 10.0 −14.4 12.7 −10.1 12.0 −12.8 7.5 −15.6 9.1

NOTE. For Global Health Status/QoL and functional scales, positive changes from baseline indicate improvement. For individual symptom scales, negative changes from baseline indicate improvement. P < .05 for all comparisons except the following: cognitive functioning (all time points), nausea and vomiting (week 4), constipation (all time points), and financial difficulties (all time points).

Abbreviations: EORTC QLQ-C30, European Organisation for Research and Treatment of Cancer Quality of Life Questionnaire-Core 30; PRO, patient-reported outcome; PROMIS, Patient Reported Outcomes Measurement Information System; QoL, quality of life.

Table A3.

Modified MFSAF v2.0 Diary TSS Changes by PGIC Scale (Baseline–Week-24 Total Score) in Patients Enrolled in the COMFORT-I Study

Total Population Modified MFSAF v2.0 Diary TSS Change
No. Mean* SD
Very much improved 47 13.9 10.2
Much improved 63 8.6 9.8
Minimally improved 47 −0.1 8.9
No change 38 −1.1 6.9
Minimally worse 25 −4.3 9.6
Much worse 15 −3.9 10.8
Very much worse 5 −10.8 12.0
Total 240 3.9 11.6

NOTE. As a reflection of change regardless of treatment, all enrolled patients were eligible for this analysis (including crossover patients).

Abbreviations: COMFORT-I, Controlled Myelofibrosis Study With Oral JAK Inhibitor Treatment–I; MFSAF, Myelofibrosis Symptom Assessment Form; PGIC, Patient Global Impression of Change; SD, standard deviation; TSS, Total Symptom Score.

*

In this analysis, positive mean change scores indicate improvement; negative (–) scores indicate worsening.

Two hundred forty of the 256 patients with a baseline TSS and week-24 TSS who completed the PGIC at week 24.

Footnotes

Written on behalf of all COMFORT-I investigators.

All of the authors have approved this article and accept responsibility for the integrity of this work. This article is original and has not been published elsewhere in whole or in part.

Authors' disclosures of potential conflicts of interest and author contributions are found at the end of this article.

Clinical trial information: NCT00952289.

AUTHORS' DISCLOSURES OF POTENTIAL CONFLICTS OF INTEREST

Although all authors completed the disclosure declaration, the following author(s) indicated a financial or other interest that is relevant to the subject matter under consideration in this article. Certain relationships marked with a “U” are those for which no compensation was received; those relationships marked with a “C” were compensated. For a detailed description of the disclosure categories, or for more information about ASCO's conflict of interest policy, please refer to the Author Disclosure Declaration and the Disclosures of Potential Conflicts of Interest section in Information for Contributors.

Employment or Leadership Position: Thomas Hare, Incyte (C); Susan Erickson-Viitanen, Incyte (C); William Sun, Incyte (C); Victor Sandor, Incyte (C); Richard S. Levy, Incyte (C) Consultant or Advisory Role: Jason Gotlib, Incyte (C); Vikas Gupta, Incyte (C), Novartis (C), sanofi-aventis (C), YM BioSciences (C); John V. Catalano, Incyte (C), Novartis (C); Alan L. Shields, Incyte (C); Carole B. Miller, Incyte (C), Novartis (C); Moshe Talpaz, Novartis (C), sanofi-aventis (C); Elliott F. Winton, Incyte (C) Stock Ownership: Thomas Hare, Incyte; Susan Erickson-Viitanen, Incyte; William Sun, Incyte; Victor Sandor, Incyte; Richard S. Levy, Incyte Honoraria: Jason Gotlib, Incyte; Vikas Gupta, Incyte, Novartis; John V. Catalano, Incyte, Novartis; Michael W. Deininger, Incyte; Carole B. Miller, Incyte, Novartis; Moshe Talpaz, Novartis; Elliott F. Winton, Incyte Research Funding: Ruben A. Mesa, Incyte; Jason Gotlib, Incyte; Vikas Gupta, Incyte, Novartis; Moshe Talpaz, Incyte, Novartis, sanofi-aventis; Elliott F. Winton, Incyte; Hagop M. Kantarjian, Incyte; Srdan Verstovsek, Incyte Expert Testimony: None Other Remuneration: Vikas Gupta, Incyte, Novartis

AUTHOR CONTRIBUTIONS

Conception and design: Ruben A. Mesa, Vikas Gupta, Susan Erickson-Viitanen, William Sun, Richard S. Levy, Srdan Verstovsek

Provision of study materials or patients: Ruben A. Mesa, Jason Gotlib, Vikas Gupta, John V. Catalano, Michael W. Deininger, Alan L. Shields, Carole B. Miller, Richard T. Silver, Moshe Talpaz, Elliott F. Winton, Jimmie H. Harvey, Hagop M. Kantarjian, Srdan Verstovsek

Collection and assembly of data: Ruben A. Mesa, Jason Gotlib, Vikas Gupta, John V. Catalano, Michael W. Deininger, Alan L. Shields, Carole B. Miller, Richard T. Silver, Moshe Talpaz, Elliott F. Winton, Jimmie H. Harvey, Thomas Hare, Victor Sandor, Hagop M. Kantarjian, Srdan Verstovsek

Data analysis and interpretation: Ruben A. Mesa, Vikas Gupta, John V. Catalano, Alan L. Shields, Richard T. Silver, Elliott F. Winton, Jimmie H. Harvey, Susan Erickson-Viitanen, William Sun, Victor Sandor, Hagop M. Kantarjian

Manuscript writing: All authors

Final approval of manuscript: All authors

Support

Supported by Incyte.

REFERENCES

  • 1.Tefferi A. Myelofibrosis with myeloid metaplasia. N Engl J Med. 2000;342:1255–1265. doi: 10.1056/NEJM200004273421706. [DOI] [PubMed] [Google Scholar]
  • 2.Mesa RA, Niblack J, Wadleigh M, et al. The burden of fatigue and quality of life in myeloproliferative disorders (MPDs): An international Internet-based survey of 1179 MPD patients. Cancer. 2007;109:68–76. doi: 10.1002/cncr.22365. [DOI] [PubMed] [Google Scholar]
  • 3.Cervantes F, Dupriez B, Pereira A, et al. New prognostic scoring system for primary myelofibrosis based on a study of the International Working Group for Myelofibrosis Research and Treatment. Blood. 2009;113:2895–2901. doi: 10.1182/blood-2008-07-170449. [DOI] [PubMed] [Google Scholar]
  • 4.Kralovics R, Passamonti F, Buser AS, et al. A gain-of-function mutation of JAK2 in myeloproliferative disorders. N Engl J Med. 2005;352:1779–1790. doi: 10.1056/NEJMoa051113. [DOI] [PubMed] [Google Scholar]
  • 5.Tefferi A. Novel mutations and their functional and clinical relevance in myeloproliferative neoplasms: JAK2, MPL, TET2, ASXL1, CBL, IDH and IKZF1. Leukemia. 2010;24:1128–1138. doi: 10.1038/leu.2010.69. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Vainchenker W, Dusa A, Constantinescu SN. JAKs in pathology: Role of Janus kinases in hematopoietic malignancies and immunodeficiencies. Semin Cell Dev Biol. 2008;19:385–393. doi: 10.1016/j.semcdb.2008.07.002. [DOI] [PubMed] [Google Scholar]
  • 7.Mesa RA, Schwager S, Radia D, et al. The Myelofibrosis Symptom Assessment Form (MFSAF): An evidence-based brief inventory to measure quality of life and symptomatic response to treatment in myelofibrosis. Leuk Res. 2009;33:1199–1203. doi: 10.1016/j.leukres.2009.01.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Mesa RA, Kantarjian H, Tefferi A, et al. Evaluating the serial use of the Myelofibrosis Symptom Assessment Form for measuring symptomatic improvement: Performance in 87 myelofibrosis patients on a JAK1 and JAK2 inhibitor (INCB018424) clinical trial. Cancer. 2011;117:4869–4877. doi: 10.1002/cncr.26129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Verstovsek S, Mesa RA, Gotlib J, et al. A double-blind placebo-controlled trial of ruxolitinib for myelofibrosis. N Engl J Med. 2012;366:799–807. doi: 10.1056/NEJMoa1110557. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Tefferi A, Vardiman JW. Classification and diagnosis of myeloproliferative neoplasms: The 2008 World Health Organization criteria and point-of-care diagnostic algorithms. Leukemia. 2008;22:14–22. doi: 10.1038/sj.leu.2404955. [DOI] [PubMed] [Google Scholar]
  • 11.Cella D, Hahn EA, Dineen K. Meaningful change in cancer-specific quality of life scores: Differences between improvement and worsening. Qual Life Res. 2002;11:207–221. doi: 10.1023/a:1015276414526. [DOI] [PubMed] [Google Scholar]
  • 12.Aaronson NK, Ahmedzai S, Bergman B, et al. The European Organization for Research and Treatment of Cancer QLQ-C30: A quality-of-life instrument for use in international clinical trials in oncology. J Natl Cancer Inst. 1993;85:365–376. doi: 10.1093/jnci/85.5.365. [DOI] [PubMed] [Google Scholar]
  • 13.Garcia SF, Cella D, Clauser SB, et al. Standardizing patient-reported outcomes assessment in cancer clinical trials: A patient-reported outcomes measurement information system initiative. J Clin Oncol. 2007;25:5106–5112. doi: 10.1200/JCO.2007.12.2341. [DOI] [PubMed] [Google Scholar]
  • 14.Verstovsek S, Kantarjian H, Mesa RA, et al. Safety and efficacy of INCB018424, a JAK1 and JAK2 inhibitor, in myelofibrosis. N Engl J Med. 2010;363:1117–1127. doi: 10.1056/NEJMoa1002028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Scott NW, Fayers PM, Aaronson NK, et al. EORTC QLQ-C30 Reference Values. 2008. http://groups.eortc.be/qol/sites/default/files/img/newsletter/reference_values_manual2008.pdf.
  • 16.Scherber R, Dueck AC, Johansson P, et al. The Myeloproliferative Neoplasm Symptom Assessment Form (MPN-SAF): International prospective validation and reliability trial in 402 patients. Blood. 2011;118:401–408. doi: 10.1182/blood-2011-01-328955. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Protocol

Articles from Journal of Clinical Oncology are provided here courtesy of American Society of Clinical Oncology

RESOURCES