Abstract
Introduction
Many health economic evaluations rely on the validity of the utility measurement for health-related quality of life (HRQoL). While generic utility measures perform well in HRQoL assessments of many diseases and patient populations, appropriateness for cancer-specific disease burdens needs attention and condition-specific measures could be a viable option. This study assessed the clinical validity of the cancer-specific EORTC QLU-C10D, a utility scoring algorithm for the EORTC QLQ-C30, in patients with glioblastoma. We expect the EORTC QLU-C10D to be sensitive and responsive in glioblastoma patients. Furthermore, we compared its statistical efficiency with the generic utility measure EQ-5D-3L.
Methods
We used data from a multi-center randomized controlled trial (NCT00689221) with patients from 146 study sites in 25 countries. Both, the QLQ-C30 and the EQ-5D-3L, had been administered at seven assessment points together. Utilities of both measures were calculated for four country value set (Australia, Canada, UK, USA). Ceiling effects, agreement (Bland–Altman plots (BA), intra-class correlation (ICC)), were calculated to analyze construct validity. Sensitivity to known-groups (performance status; global health) and responsiveness to changes (progressive vs. non-progressive; stable vs. improved or deteriorated HRQoL) were investigated for clinical validity. Relative Efficiency (RE) was calculated to compare statistical efficiency of both utility measures.
Results
435 patients were included at baseline and six subsequent time points (median timeframe 497 days). QLU-C10D country value set showed negligible ceiling effects (< 6.7%) and high agreement with EQ-5D-3L (ICC > 0.750). BA indicated that differences between both utility measures increased with deteriorating health states. While the QLU-C10D was more sensitive to global health groups (RE > 1.2), the EQ-5D-3L was more sensitive to performance status groups (RE < 0.7) than the other utility measure. Statistical efficiency to detect differences between change groups and within HRQoL deterioration group (RE > 1.4) favored QLU-C10D in 18 of 24 (75%) and 20 of 24 (83%) comparisons with the EQ-5D-3L respectively. Responsiveness to overall HRQoL change (RE > 3.4) also favored the QLU-C10D.
Conclusion
Our results indicate that the QLU-C10D is a valid utility measure to assess HRQoL in patients with glioblastoma. This facilitates the investigation of HRQoL profiles and utilities in this patient population by administering a single questionnaire, the EORTC QLQ-C30. Efficiency analyses point to higher statistical power of the QLU-C10D compared to the EQ-5D-3L.
Supplementary Information
The online version contains supplementary material available at 10.1007/s10198-024-01729-4.
Keywords: Glioblastoma, Health-related quality of life, QLU-C10D, EQ-5D-3L, Validity, Utility
Introduction
Patients diagnosed with glioblastoma are confronted with an incurable disease and the outlook of a markedly limited life span with considerable health deterioration [1]. Frequently, patients already show an impaired health state at diagnosis and a wide range of symptoms, some of which are not common cancer symptoms [2]. During the diagnostic phase they frequently suffer from cognitive deficits, seizures and headaches. As the disease progresses, nausea, vomiting, fatigue, drowsiness, aphasia and motor deficits often start to appear [3, 4]. To evaluate impact of disease or treatment, measures are needed which adequately capture patients’ experience [5].
There are well validated HRQoL instruments available for brain tumor patients, such as the European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Core Questionnaire QLQ-C30, along with its brain tumor-specific instrument, the QLQ-BN20 [6–9]. These are most frequently used as secondary outcome measures in cancer clinical trials in this field [10]. When it comes to health economics and cost-utility studies though, there is a lack of disease-specific instruments. Analysis in these contexts rely commonly on generic measures, to facilitate comparisons across diseases. Further, health economic evaluations are based on a different measurement approach than HRQoL profile measures, like those mentioned above. Health states for health-economic evaluations are represented by utilities, numeric values of a cardinal scale.
Utility instruments are designed to take into account the preference towards a particular health state over another to value health states [11]. They consist of a health classification system—a set of HRQoL aspects, which may take on different severity levels—and a preference-based scoring algorithm. Thereby individual patient responses are converted into one utility value, that expresses the value which has been assigned to a particular health state by the general population [12]. Utility scales are anchored at 0 (representing being dead) and 1 (full health). Negative values are possible at country-specific value sets, representing health states worse than being dead [13, 14].
A frequently used HRQoL utility measure in health economic assessments, the EQ-5D-3L, is a well validated generic measure for a broad range of diseases and populations [15, 16]. This measure captures five dimensions on generic health aspects each with three response options. So, this measure is able to describe 35 = 243 distinct health states. While in general the EQ-5D-3L performs well in a cancer setting, limitations in capturing cancer symptoms like fatigue or nausea, have been discussed [16–18]. Improvements of the updated version, the EQ-5D-5L with five response options for each domain, have been demonstrated, but a consensual English value set is not available yet and it is not clear whether the expansion of response categories is sufficient to resolve described short comings or if disease-specific domains still may add valuable information ([19–21]).
The QLU-C10D is a more recently developed scoring algorithm for the QLQ-C30, using 10 dimensions with four severity levels (i.e. theoretically describing 410 = 1,048,576 distinct health states) that allows researchers to obtain utility scores from QLQ-C30 data [22, 23]. Therefore, the QLU-C10D is expected to capture a greater range of health states relevant to cancer patients and thus to be of potential added value to the measurement of utilities in cancer patients.
The primary objective of this retrospective study was to assess the clinical validity of the QLU-C10D in patients with newly diagnosed glioblastoma. Clinical validity is a psychometric requirement of patient-reported outcome measures, proofing that the instrument is able to detect clinically relevant differences and changes [12, 18]. Our study therefore adds psychometric information to the QLU-C10D portfolio complying with COSMIN standards on quality criteria for patient-reported outcome measures [24]. Additionally, we compared the QLU-C10D with the generic EQ-5D-3L with regard to statistical measurement efficiency. Both measures were administered in the course of the original study at each assessment. By using and comparing value sets from four different countries sharing the same language, namely for Australia (AUS), Canada (CAN), United Kingdom (UK) and USA, we furthermore provide an insight into potential cultural differences of the QLU-C10D utility weights, unbiased by differences that may result from translations. These regions represent a large proportion of the nationalities in the original study sample.
Methods
Original data source
Data for this analysis was drawn from an EORTC cancer clinical trial (ClinicalTrials.gov, NCT00689221), a global, multi-center, randomized, open-label, phase 3 trial. 545 eligible newly diagnosed glioblastoma patients with a methylated MGMT promoter, were randomly assigned to treatment arms: either treated with temozolomide chemoradiotherapy alone (TMZ) or with added Cilengitide [25]. Median follow-up time was 29 months. The present analysis encompasses baseline and six assessments thereafter (T1–T6), including follow-up at six months after treatment.
Patient data with completed HRQoL measures at baseline was used for the current analysis. For responsiveness analysis, patients with missing data at subsequent timepoints were excluded at both timepoints. Further exclusion criteria for patient data were (a) missing clinical data at baseline (n = 39), and (b) missing data or invalid dates of completion of one of the HRQoL measures (n = 71).
To investigate potential selection bias we compared the selected data with original patient data at baseline and subsequent time points. Chi-squared tests were performed to check if data selection was biased in terms of sex, randomization group, and clinical subgroups (ECOG Performance Status (PS), steroid intake, extent of surgery). T-tests were performed to check bias of selected data in terms of age, time of progression-free survival and overall survival in days from randomization. Significant results (p ≤ 0.05) were considered signals for selection bias. 10% of all tested patient characteristic comparisons are expected to differ at significance level, which is here used as threshold for signaling bias [26].
HRQOL measures
EORTC QLU-C10D
The QLU-C10D utility score is calculated from 13 items of the QLQ-C30. Each item (e. g. “Have you had pain?”) has four response levels (1 = “Not at all” to 4 = “very much”). The 13 items form ten dimensions: Four covering aspects of functioning (Physical Functioning (PF), Role Functioning (RF), Social Functioning (SF), and Emotional Functioning ((EF)), and six covering symptoms (Pain (PA), Fatigue (FA), Sleep (SL), Appetite (AP), Nausea (NA), and Bowel Problems (BO)) commonly experienced by cancer patients.
EQ-5D-3L
The EQ-5D-3L comprises of five dimensions: mobility (MO), self-care (SC), usual activities (UA), pain/discomfort (PD), and anxiety/depression (AD). Each dimension is presented with three response levels of severity (e.g. “I have no pain or discomfort” (Level 1), “I have moderate pain or discomfort” (Level 2), “I have extreme pain or discomfort” (Level 3)).
Index scales were calculated according to algorithms published for scoring the AUS [27, 28]) CAN [29, 30], UK [31, 32], and USA [33, 34] country-specific value sets of each measure.
Known-groups
For cross sectional sensitivity analysis ECOG PS groups (0 vs. > 0) was available at baseline. It was previously shown to be associated with HRQoL deterioration in glioblastoma patients ([35]). Furthermore, the QLQ-C30 Global Health Status scale (GHS) was considered to reflect HRQoL groups in glioblastoma patients at baseline, with scores ≤ 50 vs. > 50 threshold, the “low mean” in cancer populations ([36]). The GHS is not part of the QLU-C10D items. The GHS is frequently reported as HRQoL score for patients with glioblastoma ([37]), and most importantly, is a common HRQoL outcome in cancer clinical trials (e. g. [38–42]). With regard to responsiveness we investigated changes over time in HRQoL of the total sample (overall change), clinical response groups (progressive vs. non-progressive) and changes measured with the GHS. Minimal important change over time of ± 6 GHS scores ([43]) was used to define GHS change groups (deterioration and improvement groups) vs. GHS stable group (< 6 scores change). Treatment arm was not used as a known-group factor, because no significant treatment effect was reported for HRQoL outcomes [25].
Statistical analyses
Descriptive statistics are presented as means, standard deviations (SD), and frequencies.
Validity
Floor and ceilings effects were calculated for index scales of the QLU-C10D and the EQ-5D-3L and for each of their domains. Frequencies of more than 15% at floor or ceiling levels were considered relevant, i.e. potentially limiting validity [44].
Construct validity
Construct validity was analyzed by assessing correlations and agreement between the cancer-specific QLU-C10D and the generic EQ-5D-3L. The EQ-5D-3L served as a comparator for construct validity, because it is a well validated measure and recommended by health-care authorities [15, 17, 21]. Both measures were developed to measure HRQoL utility, hence a certain level of agreement was expected, but due to the different designs and scopes, differences should also be detectable.
Overall we expected high agreement between both measures, because of the similar construct. At the domain level we expected both high and low correlations (convergent and divergent validity) between QLU-C10D and EQ-5D-3L, because the QLU-C10D comprises of rather generic and cancer-specific domains. We expected moderate to strong correlations between conceptually similar domains (PF and MO/UA, RF and UA, SF and UA, PA and PD, EF and AD, FA and UA), which would indicate that these domains address related HRQoL concepts. However, rather disease-specific domains of the QLU-C10D were not expected to relate with EQ-5D-3L domains to that extent. Hence low correlations between these QLU-C10D domains (SL, AP, NA, BP) and the EQ-5D-3L domains were expected, indicating that QLU-C10D also shows divergent validity from EQ-5D-3L because of the disease-specific domains [45]. Pearson correlation coefficients (r) between domains of the QLU-C10D and the EQ-5D-3L and intra-class correlations (ICC) for both index scales were calculated. Standard convention was used for interpreting r (< 0.50 signal weak, < 0.70 moderate and ≥ 0.70 strong correlations) [46]. ICC was computed based on pairs of observations of EQ-5D-3L and QLU-C10D raw scores and in country-wise index scales. ICC values for absolute agreement ≥ 0.75 indicate excellent agreement [47].
Bland–Altman plots were generated to analyze agreement between the QLU-C10D and the EQ-5D-3L. The visual presentation of differences between the measures and best estimate of the true value (mean of both measures) allowed detailed identification of (dis-)agreements of both measures. The difference between both measures is plotted on the y-axis and the average on the x-axis. Proportional bias was analyzed with linear regressions, using means to predict the difference of both measures. Significant F-values indicate the presence of proportional bias. Lacking country-specific minimal important difference (MID) for glioblastoma patients for the other value sets, we used MID reported for the EQ-5D-3L UK value set in glioma patients as pre-defined acceptable levels of agreement (LOA) of ± 0.15 [48].
Sensitivity
The ability to discriminate between patient groups was assessed by calculating independent t-tests across clinically known-groups. These were the ECOG PS (0 vs. > 0), and the GHS (≤ 50 vs. > 50) at baseline, and difference in change between clinical response groups (progressive vs non-progressive) at subsequent timepoints. A one-way analysis of variance (ANOVA) was calculated for differences in change between the three GHS change groups.
Responsiveness
Responsiveness was assessed for overall change, and within clinical response and GHS subgroups (baseline to Tx) using paired t-tests. Responsiveness for changes within clinical response subgroups were assessed between baseline and end of treatment at T5. Responsiveness for changes within GHS change subgroups was analyzed for the periods between baseline and two time-points during chemoradiotherapy (T1 & T2 = first and last timepoint of chemoradiotherapy), three timepoints during chemotherapy (T3-T5), and at follow-up assessment (T6) (see Fig. 1).
Fig. 1.
Original Study Procedures with assessment timepoints for HRQoL. Abbreviations: SD = Standard Deviation; HRQoL = Health-related Quality of Life; retr. = retrospective; Q = QLU-C10D; E = EQ-5D-3L; Country value set: AUS = Australia, CAN = Canada, UK = United Kingdom
Effect size measures
Effect size of between-group differences, Cohen’s d, was calculated by dividing the mean difference with the pooled SD. Magnitude of effect size was interpreted according to Cohen’s guidelines [49], i.e. ≥ 0.5 were considered medium and ≥ 0.8 large.
Magnitude of change over time within the total sample was estimated with Standardized Response Mean (SRM), which standardizes mean change with the SD of change. The Responsiveness Index (RI) was calculated as a quotient of SD of stable or non-progressive groups as denominator, and within-group change (progressive, decrease, increase) as numerator.
To compare value sets of both measures directly in terms of statistical efficiency, we calculated Relative Efficiency (RE) as a ratio of positive t-values or F-values with the QLU-C10D in the numerator and the EQ-5D-3L in the denominator. Results > 1 indicate a higher efficiency of the QLU-C10D, and results < 1 indicate a higher efficiency of the EQ-5D-3L. RE was only calculated if at least one of the measures found a statistically significant effect. Further, differences in effect sizes were calculated, under the same premise as RE calculations. Difference of RI (DRI) was calculated to compare HRQoL measures in terms of within-groups responsiveness. DRI was calculated by subtracting positive RI-values of the EQ-5D-3L from positive RI-values of the QLU-C10D [50]. Likewise, the difference of Cohen’s d and SRM was calculated. Results > 0 indicate a higher responsiveness of the QLU-C10D, and < 0 favor the EQ-5D-3L. A maximal probability of error 1% (i.e. a p-value ≤ 0.01) was used to keep alpha error accumulation at a minimum [51].
Results
Descriptive statistics and bias analyses
The QLQ-C30 and EQ-5D-3L were completed by 435 of 545 patients at baseline, and 215 of 262 at follow-up (= T6). An overview of sociodemographic and clinical characteristics is presented in Table 1. Mean utilities of the QLU-C10D value sets at baseline were between 0.738 and 0.792. Between 15 and 22% of the original study could not be included due to missing data at different time points. With the exception of steroid intake (baseline, T2, T4, T5) and age (T4, T5, T6), no significant differences between selected data and the original sample were found. These exceptions account for less than 10% of expected differences, and therefore are considered negligible. An overview of bias analysis is included in Appendix E.
Table 1.
Sample Characteristics at baseline (N = 435)
| Age (mean)/(SD); (median) | 56,2/11 | 57.9 |
| Sex: Male: (n); (%) | 236 | 54.3 |
| Female: (n); (%) | 199 | 45.7 |
| Steroid at baseline: No: (n); (%) | 268 | 61.6 |
| Yes: (n); (%) | 167 | 38.4 |
| Treatment: Chemoradiotherapy: (n); (%) | 218 | 50.1 |
| Added Cilengitide: (n); (%) | 217 | 49.9 |
| ECOG Performance status 0: (n); (%) | 252 | 57.9 |
| Performance status > 0: (n); (%) | 183 | 42.1 |
| Extent of surgery: Biopsy: (n); (%) | 14 | 3.2 |
| Partial resection: (n); (%) | 211 | 48.5 |
| Complete resection: (n); (%) | 208 | 47.8 |
| Overall survival: days (median); (SD) | 710 | 318.6 |
| Progression free survival: days (median); (SD) | 255 | 310.3 |
Floor & ceiling effects
Ceiling effect (proportion of patients reporting full health) on the QLU-C10D index scale was 4.14%. The EQ-5D-3L index scale showed higher ceiling effects (27.13%).
The largest ceiling effect of the QLU-C10D domains was NA, at 88.05%, and the smallest proportion was SF, at 27.59%. The EQ-5D-3L domains MO (73.56%) and UA (51.49%) exhibit higher ceiling effects than the conceptually similar QLU-C10D domains (PF: 46.90%, RF: 36.63%, and SF: 27.58%). Floor and ceiling effects of index scores and response levels per domain are presented in Tables 2 and 3.
Table 2.
Floor & Ceiling Effects Index Scores at Baseline (Raw & Country value set)
| BASELINE (N = 435) | Floor | n / % | Ceiling | n / % |
|---|---|---|---|---|
| QLU-C10D (raw) | 40 | 0 / 0 | 10 | 18 / 4.14 |
| AUS | − 0.10 | 0 / 0 | 1.00 | 29 / 6.67 |
| CAN | − 0.15 | 0 / 0 | 1.00 | 18 / 4.14 |
| UK | − 0.08 | 0 / 0 | 1.00 | 18 / 4.14 |
| USA | 0.03 | 0 / 0 | 1.00 | 18 / 4.14 |
| EQ-5D-3L (raw) | 15 | 0 / 0 | 5 | 118 / 27.13 |
| AUS | − 0.22 | 0 / 0 | 1.00 | 118 / 27.13 |
| CAN | − 0.31 | 0 / 0 | 1.00 | 118 / 27.13 |
| UK | − 0.59 | 0 / 0 | 1.00 | 118 / 27.13 |
| USA | − 0.11 | 0 / 0 | 1.00 | 118 / 27.13 |
Floor health state for QLU-C10D = 44444 44444, EQ-5D-3L = 33333; Ceiling health state for QLU-C10D = 11111 11111, EQ-5D-3L = 11111; Lowest health states observed for QLU-C10D = 34444 44444 (utility value AUS = 0.004; CAN = − 0.053; UK = 0.016; USA = 0.151), EQ-5D-3L = 33232 (utility value AUS = − 0.086; CAN = − 0.088; UK = − 0.371; USA = − 0.012)
Abbreviations Country value set: AUS = Australia, CAN = Canada, UK = United Kingdom, USA = United States of America
Table 3.
Floor & Ceiling Effects Single Domains EQ-5D-3L & QLU-C10D raw scoring (N = 435)
| EQ-5D-3L | MO — Mobility | SC — Self-Care | UA — Usual Activities | PD — Pain / Discomfort | AD — Anxiety / Depression | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| level | n | % | n | % | n | % | n | % | n | % |
| 1 | 320 | 73.56 | 368 | 84.60 | 224 | 51.49 | 295 | 67.82 | 213 | 48.96 |
| 2 | 112 | 25.75 | 60 | 13.79 | 168 | 38.62 | 135 | 31.03 | 202 | 46.44 |
| 3 | 3 | 0.69 | 7 | 1.61 | 43 | 9.89 | 5 | 1.15 | 20 | 4.60 |
| QLU-C10 Dimensions | PF | RF | SF | EF | PA | FA | SL | AP | NA | BP | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Level | n | % | n | % | n | % | n | % | N | % | n | % | n | % | n | % | n | % | n | % |
| 1 | 204 | 46.90 | 168 | 36.63 | 120 | 27.59 | 202 | 46.43 | 283 | 65.05 | 119 | 27.36 | 212 | 48.74 | 349 | 80.23 | 383 | 88.05 | 291 | 66.90 |
| 2 | 132 | 30.34 | 146 | 33.56 | 159 | 36.55 | 165 | 37.93 | 121 | 27.82 | 223 | 51.26 | 143 | 32.87 | 56 | 12.87 | 44 | 10.11 | 107 | 24.60 |
| 3 | 69 | 15.86 | 76 | 17.47 | 98 | 22.53 | 43 | 9.89 | 25 | 5.75 | 68 | 15.63 | 58 | 13.33 | 23 | 5.29 | 4 | 0.92 | 26 | 5.98 |
| 4 | 30 | 6.90 | 45 | 10.34 | 58 | 13.33 | 25 | 5.75 | 6 | 1.38 | 25 | 5.75 | 22 | 5.06 | 7 | 1.61 | 4 | 0.92 | 11 | 2.52 |
Abbreviations: PF = Physical Functioning; RF = Role Functioning; SF = Social Functioning; EF = Emotional Functioning; PA = Pain; FA = Appetite; NA = Nausea; BP = Bowel Problems
Construct validity
The four country value set of each HRQoL measure produced similar correlation coefficients, presented in Table 4. ICC coefficients of index scales of all country value set va indicate strong agreement, ranging between 0.718 and 0.750. Mostly moderate positive correlations were found between conceptually similar domains, ranging between 0.329 and 0.632, and weak correlations (r < 0.291) between the cancer-specific QLU-C10D and the generic EQ-5D-3L domains. Collectively the correlations between QLU-C10D and EQ-5D-3L domains indicate that correlation coefficients increase with content-wise proximity of the concepts.
Table 4.
Correlations between QLU-C10D and EQ-5D-3L Index Scales and Domains at baseline (N = 435)
| Domains (Pearson’s r) | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | ||
| QLU-C10D | Index Scales (ICC) |
PF | RF | SFg | EF | PA | FA | SL | AP | NA | BP |
| EQ-5D-3L | MO, UA | UA | UA | AD | PD | UA | All | All | All | All | |
| Expected Correlation | Strong ≥ 0.7 |
Strong, moderate > 0.5 |
Moderate ≥ 0.5 |
Moderate ≥ 0.5 |
Strong ≥ 0.7 |
Strong ≥ 0.7 |
Moderate < 0.7 |
Weak < 0.5 |
Weak < 0.5 |
Weak < 0.5 |
Weak < 0.5 |
| AUS | .749 | .496* | .615 | .467 | .632 | .555 | .329 | < .171 | < .146 | < .291 | < .269 |
| CAN | .734 | .483* | .601 | .466 | .614 | .565 | .396 | < .171 | < .172 | < .255 | < .270 |
| UK | .718 | .497* | .615 | .496 | .601 | .601 | .338 | < .228 | < .157 | < .278 | < .248 |
| USA | .750 | .495* | .598 | .479 | .570 | .594 | .411 | < .219 | < .165 | < .283 | < .278 |
All correlations are significant with p-values < .01
Abbreviations: ICC = Intra-Class Correlations; AD = Anxiety, Depression; AP = Appetite; BP = Bowel Problems; EF = Emotional Functioning; FA = Fatigue; MO = Mobility; NA = Nausea; PA = Pain; PD = Pain, Discomfort; PF = Physical Functioning; RF = Role Functioning; SF = Social Functioning; SL = Sleep; UA = Usual Activities; Country value sets: AUS = Australia, CAN = Canada; UK = United Kingdom; USA = United States of America
*Smallest correlation with the two EQ-5D-3L domains
Bland–Altman plots of index scores showed a negative bias, indicating that the QLU-C10D scores tend to be lower than the EQ-5D-3L scores with all value sets (Fig. 2). Mean bias ranged from -0.058 (CAN) to -0.001 (UK). Levels of agreement (LOA) exceeded the pre-defined level of ± 0.15 for all value sets. Furthermore, Bland–Altman plots indicate that the extent of agreement differs across the measurement range in all value sets, i.e. dispersion increases towards the lower end of the scale. Proportional bias was detected with two value set pairs.
Fig. 2.
Bland–Altman Plots comparing QLU-C10D and EQ-5D-3L Country value sets at baseline. Dotted horizontal line = 0 at y-axis; LOA = Level of Agreement (= mean ± 1.96 * SD); Proportional bias: Results of Linear regressions, predicting differences with means (df = 1, 433): AUSTRALIAN Utilities, r = .09, R2 = .008, F = 3.54, p = .061; CANADIAN Utilities, r = .229, R2 = .052, F = 23.96, p < .001; UK Utilities, r = .444, R2 = .197, F = 106.10, p < .001; USA Utilities, r = .001, R2 = .000, F = 0.0, p = .992; Regression lines are not displayed
Sensitivity
Results of sensitivity analysis show that both measures detected significant medium to large differences when categorizing patients by ECOG PS (0 vs. > 0), and the GHS (≤ 50 vs. > 50), with all value sets. The EQ-5D-3L showed higher sensitivity for ECOG groups (Cohen’s d > 0.75). The QLU-C10D detected larger differences between GHS groups (Cohen’s d > 1.14). These results are reflected in RE of the four value set comparisons, which favor the EQ-5D-3L for ECOG groups (RE < 0.69), and the QLU-C10D for GHS groups (RE > 1.24) (see Fig. 3 or detailed statistics in Appendix A).
Fig. 3.
Relative Efficiency (RE) of Country value sets for detecting known-groups at baseline. Abbreviations Country value sets: AUS = Australia, CAN = Canada, UK =United Kingdom, US =United States of America; Known Groups: ECOG Performance status (PS) 0 vs. > 0; QLQ-C30 Global Health Status Score (GHS) ≤ 50 vs. > 50; RE results > 1 favor QLU-C10D, < 1 favor EQ-5D-3L. Descriptive & detailed results see Appendix A
Responsiveness
Significant overall change within the total sample (paired t-tests of all patients) from baseline to T1 and to T3 (first chemoradiation and first chemotherapy) was detected only with the QLU-C10D. No significant changes with either measure were detected from baseline to later assessments. Twelve country value set pair comparisons (= 3 periods × 4 value set pairs) of both measures show a) larger SRM effects are detected with the QLU-C10D than with EQ-5D-3L value sets (SRM − 0.26 to − 0.17 vs. − 0.04 to 0.04), and b) RE results (> 3.42) also favored the QLU-C10D (see Fig. 4A; detailed results are presented in Appendix B).
Fig. 4.
Responsiveness Analysis QLU-C10D vs. EQ-5D-3L. Grey arrows signify values outside the scope of x-axis; Abbreviations: Country value sets: AUS = Australia, CAN = Canada, UK = United Kingdom, US = United States of America; RE = Relative Efficiency; GHS = Global Health Scale (QLQ-C30); A RE = quotient of t-values of paired t-tests (Tx—Baseline); B RE = quotient of f-values of one-way ANOVA, comparing QLQ-C30 GHS groups (> 6 points decrease vs. ≤ 6 points change vs. > 6 points increase); C & D RE = quotient of t-values calculated with paired t-tests (Tx—Baseline); E & F Responsiveness Index (RI) standardizes mean Decrease or Increase of GHS groups with the standard deviation of the stable group; Difference RI = RI QLU-C10D minus RI EQ-5D-3L; Details of calculations and results of A are presented in Appendix B, B-F in Appendix C
All QLU-C10D value sets detected significant changes within progressive patients (RI < − 0.56) as well as differences in change between progressive and non-progressive patients (Cohen’s d > − 0.55). These effects were not detected by the EQ-5D-3L. All eight value set comparisons (4 within progressive subgroup, 4 between response groups) favored the QLU-C10D (DRIs ≥ 0.28, REs ≥ 1.53; see detailed results in Appendix D).
Significant results were observed within both decreasing and increasing GHS change groups, with the exception of the increase group at T1 when measured with the QLU-C10D. These findings consistently aligned with the expected direction when compared to the baseline, across all time points and using both measures including no significant changes in the stable group (see Appendix C for details). Twenty-four comparisons (= 6 periods × 4 value set pairs) for each group were performed. In total, 20 of 24 RE calculations (> 1.42) of the decreasing group favored the QLU-C10D (see Fig. 4C). Also, 16 of 24 DRI calculations of decreasing GHS change groups favored the QLU-C10D (see Fig. 4E). For calculations of increasing GHS change groups, 16 of 24 RE and 20 of 24 DRI results favored the EQ-5D-3L (seeFig. 4D and F). Difference between mean change of GHS change groups (decreasing, increasing, stable) was significant at all time points with both HRQoL measures (see Appendix C). Twenty-four comparisons (= 6 periods × 4 value set pairs) show that 18 RE calculations (> 1.02) favor QLU-C10D (see Fig. 4B).
In summary, with regard to responsiveness the QLU-C10D showed favorable characteristics and outperformed the EQ-5D-3L in most comparisons of the four value sets (71.7%). Especially in the detection of differences between GHS change subgroups (75.0%), clinical response groups (100.0%), of changes within progressive group (100.0%) and deteriorating GHS subgroup (83.3%) favorable results have been shown for the QLU-C10D.
Discussion
The aim of this analysis was to assess the validity of the QLU-C10D in patients with glioblastoma with four value sets – AUS, CAN, UK, and USA. To our knowledge, this is the first validation of the QLU-C10D in this patient population [52, 53].
Overall, results of our analysis indicate that the QLU-C10D can be considered a clinically valid utility measure in newly diagnosed glioblastoma patients. A range of validity indicators point to this conclusion.
Firstly, we investigated construct validity on multiple levels. Floor or ceiling effects are considered signals for limited validity. The QLU-C10D index scale showed negligible ceiling effects. The highest ceiling effects in QLU-C10D single domains showed AP, NA, BP and PA, which are not common symptoms of patients with glioblastoma in early stages of disease [2, 3, 5]. Conceptually similar domains with EQ-5D-3L, Mobility and Physical Functioning, showed divergence in floor and ceiling levels. Similar results of this domain pair were previously reported [54, 55]. This indicates that the underlying concepts of the conceptually similar domains are not identical.
Indeed, this assumption was supported by the results of correlation analyses. This analysis overall showed expected results: QLU-C10D and EQ-5D-3L index scales showed strong agreement, conceptually similar domains indicated convergent validity, while conceptually distant domains (EQ-5D-3L domains and QLU-C10D cancer-specific symptom domains) showed divergent validity. Yet, the Emotional Functioning scale of the QLU-C10D and the anxiety/depression domain of the EQ-5D-3L, as well as the pain scales of both instruments, showed lower correlations than expected. Unlike the QLU-C10D domains, AD and PD are EQ-5D-3L composite domains, where previous research has shown that splitting these domains up in separate items, e. g. one item for anxiety and one separate item for depression, reduced the number of “no problem” reports on this domain [56]. Another source of divergence are different scale levels, which contribute to limited correlations of conceptually similar domains. For example the most severe level at Physical Function is labelled “severe problems” at the QLU-C10D, and “confined to bed” at the EQ-5D-3L Mobility domain. Given that an updated version of the EQ-5D with five response options instead of three is being used increasingly, comparisons with this novel version might provide more insight into this discrepancy [19].
Secondly, we assessed QLU-C10D performance in detecting known groups and changes over time as core psychometric properties of a HRQoL measure. The QLU-C10D performed well in the majority of tests. It was able to detect ECOGS PS groups, HRQoL groups measured with the QLQ-C30 GHS, changes within progressive patients, and correctly measured direction of GHS change (e. g. deteriorated, improved). The latter is of special interest, as the QLQ-C30 GHS is frequently used as an HRQoL outcome in cancer clinical trials (e. g. [37–41]), and it might be considered a minimum requirement of the QLU-C10D to detect HRQoL changes measured with that scale. Our consistent finding on the stable HRQoL group (i.e. no change measured by GHS and no change detected by the QLU-C10D) is in contrast with Shaw et al. [57], who reported significant changes in QLU-C10D scores in stable HRQoL groups in melanoma and head & neck cancer patients.
In direct comparison with the generic EQ-5D-3L our analyses showed that the QLU-C10D was more sensitive in detecting differences in GHS groups while the EQ-5D-3L showed higher sensitivity for ECOG groups. The latter is in contrast to previous research, reporting higher sensitivity of the QLU-C10D than the EQ-5D-3L for ECOG PS in myelodysplastic syndrome patients [45], mixed cancer samples [57, 58], and lung cancer patients [59], all using the same cutoffs as the here presented analysis. This discrepancy with the previous research was surprising. A potential reason could be that in this patient population the EQ-5D-3L self-care domain covers aspects of categorization which is reflected in sensitivity for ECOG PS. With regard to responsiveness the QLU-C10D outperformed the EQ-5D-3L in most comparisons (71.7%). Previous research likewise has indicated higher responsiveness of QLU-C10D than EQ-5D-3L [57–59].
Finally, on the topic of country differences without a potential language bias, our secondary aim, we found that the four QLU-C10D value sets largely perform the same way, meaning that although utility weights of single domains vary, effect sizes and directions calculated with these value sets are mostly highly similar.
Limitations and critical appraisal
The main limitation of our analyses is their retrospective nature. Group comparisons therefore were guided by the availability of clinical information, while other potentially relevant information could not be retrieved. It was therefore also only possible to use the three level version of the EQ-5D as comparator, which still is the preferred generic utility instrument in many jurisdictions [21, 60]. To learn more about the differences between cancer-specific and generic utilities future investigations of the QLU-C10D require not only comparison with the novel EQ-5D version but also with other, generic utility instruments.
We also had to exclude patients from analyses due to missing HRQoL data. Although we performed a bias analysis with regard to available sociodemographic and clinical information, we cannot fully rule out that more severely HRQoL-impaired patients dropped out of assessment. This risk might be indicated by the bias analysis, where at some timepoints we found a significantly higher proportion of patients with steroid intake and higher mean age in the original sample than in the data for this analysis. However, we found no significant HRQoL differences with factors age or steroid intake. Considering that significant differences in sample composition were found only in 7 of 77 (11 factors × 7 assessments) tests, we suggest that potential bias effects on this analysis are negligible.
Finally, lacking a better estimate we used the EQ-5D-3L MID for glioma patients with the UK value set as a measure of relevant difference in Bland–Altman plots (48). This MID might not necessarily be transferrable to the value sets of other countries as used here.
Conclusion
In conclusion, our results suggest that the QLU-C10D—as a preference-based scoring algorithm for data collected with the QLQ-C30—is a valid measure for use in newly diagnosed glioblastoma patients. Hence, the QLQ-C30 can provide both, cancer-specific HRQoL health profiles and a utility score (QLU-C10D), and serve multiple purposes in cancer clinical trials and health economic studies in glioblastoma. The tendency of the QLU-C10D to be more sensitive to health changes, especially when it comes to detecting deteriorating HRQoL, translates into higher statistical power and consequently smaller required sample sizes.
Supplementary Information
Below is the link to the electronic supplementary material.
Acknowledgements
This research was supported by the European Organisation for the Research and Treatment of cancer (EORTC), which provided the data for this study. We thank Emma Lidington and Thierry Gorlia from the EORTC for assistance, and the anonymous reviewers of the EORTC Quality of Life Group for comments that greatly improved the manuscript. The original study was financially supported by Merck (CrossRef Funder ID: 10.13039/100009945). Merck reviewed this manuscript for medical accuracy only before journal submission. The authors are fully responsible for the content of this manuscript, and the views and opinions described in the publication reflect solely those of the authors.
Funding
Open access funding provided by University of Innsbruck and Medical University of Innsbruck. This research was funded by the European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Group, grant number (#12/2016).
Data and materials availability
The data used for this analyses is owned by Merck and the EORTC. It cannot be shared by the authors of this publication. An official request though can be made with the EORTC. See EORTC Policy on Data Sharing: https://www.eortc.org/app/uploads/2023/06/L-01-POL-01.pdf
Declarations
Conflict of interest
The authors have no conflict of interest to declare. The EORTC Quality of Life Group business model involves charges for commercial companies using EORTC instruments. Academic use of EORTC instruments is free of charge.
Ethical approval
All patients provided written informed consent when participating in the original trials which this retrospective analysis relies on. The original study protocols were developed in accordance with the Declaration of Helsinki and ethical approval was sought at participating centres. The data sharing agreement complies with GDPR 2016.
Consent for publication
All authors have approved the submission of the manuscript.
Footnotes
The research in this article was done during Dr. Andrew Bottomley’s tenure at EORTC. Dr. Andrew Bottomley is no longer affiliated with EORTC.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Allemani, C., Matsuda, T., Di Carlo, V., Harewood, R., Matz, M., Nikšić, M., Hood, M.: Global surveillance of trends in cancer survival 2000–14 (CONCORD-3): analysis of individual records for 37 513 025 patients diagnosed with one of 18 cancers from 322 population-based registries in 71 countries. The Lancet 391(10125), 1023–1075 (2018) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Rimmer, B., Bolnykh, I., Dutton, L., Lewis, J., Burns, R., Gallagher, P., Sharp, L.: Health-related quality of life in adults with low-grade gliomas: a systematic review. Quality Life Res 32, 1–27 (2022) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Ijzerman-Korevaar, M., Snijders, T.J., de Graeff, A., Teunissen, S.C., de Vos, F.Y.: Prevalence of symptoms in glioma patients throughout the disease trajectory: a systematic review. J Neuro-oncol 140(3), 485–496 (2018) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Walbert, T., & Chasteen, K. (2015). Palliative and supportive care for glioma patients. Curr Understanding Treatment Gliomas 163 171–184 [DOI] [PubMed]
- 5.Fountain, D.M., Allen, D., Joannides, A.J., Nandi, D., Santarius, T., Chari, A.: Reporting of patient-reported health-related quality of life in adults with diffuse low-grade glioma: a systematic review. Neuro Oncol. 18(11), 1475–1486 (2016) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Aaronson, N.K., Ahmedzai, S., Bergman, B., Bullinger, M., Cull, A., Duez, N.J., Takeda, F.: The European organization for research and treatment of cancer QLQ-C30: a quality-of-life instrument for use in international clinical trials in oncology. JNCI: J the National Cancer Inst 85(5), 365–376 (1993) [DOI] [PubMed] [Google Scholar]
- 7.Osoba, D., Aaronson, N., Zee, B., Sprangers, M., Te Velde, A. (1997). Modification of the EORTC QLQ-C30 (version 2.0) based on content validity and reliability testing in large samples of patients with cancer. Qual life res, 6(2), 103–108. 10.1023/a:1026429831234 [DOI] [PubMed]
- 8.Taphoorn, M.J., Claassens, L., Aaronson, N.K., Coens, C., Mauer, M., Osoba, D., Bottomley, A.: The EORTC QLQ-BN20 questionnaire for assessing the health-related quality of life (HRQoL) in brain cancer patients: a phase IV validation study on behalf of the EORTC QLG, BCG, ROG NCIC-CTG. J Clin Oncol 26(15_suppl), 2041–2041 (2008) [Google Scholar]
- 9.Taphoorn, M.J., Claassens, L., Aaronson, N.K., Coens, C., Mauer, M., Osoba, D., EORTC Quality of Life Group: An international validation study of the EORTC brain cancer module (EORTC QLQ-BN20) for assessing health-related quality of life and symptoms in brain cancer patients. Euro J Cancer 46(6), 1033–1040 (2010) [DOI] [PubMed] [Google Scholar]
- 10.Bottomley, A., Pe, M., Sloan, J., Basch, E., Bonnetain, F., Calvert, M., Campbell, A., Cleeland, C., Cocks, K., Collette, L., Dueck, A., Devlin, N., Flechtner, H., Gotay, C., Greimel, E., Griebsch, I., Groenvold, M., Hamel, J., King, M., Kluetz, P., Koller, M., Malone, D., Martinelli, F., Mitchell, S., Moinpour, C., Musoro, J., O’Connor, D., Oliver, K., Piault-Louis, E., Piccart, M., Pimentel, F., Quinten, C., Reijneveld, J., Schürmann, C., Smith, A., Soltys, K., Taphoorn, M., Velikova, G., Coens, C.: Analysing data from patient-reported outcome and quality of life endpoints for cancer clinical trials: a start in setting international standards. The Lancet. Oncology 17(11), e510-4 (2016) [DOI] [PubMed] [Google Scholar]
- 11.Drummond, M.: Methods for the economic evaluation of health care programmes. Oxford University Press, Oxford (2005) [Google Scholar]
- 12.Devlin, N.J., Lorgelly, P.K.: QALYs as a measure of value in cancer. J. Cancer Policy 11, 19–25 (2017) [Google Scholar]
- 13.Rowen, D., Brazier, J., Van Hout, B.: A comparison of methods for converting DCE values onto the full health-dead QALY scale. Med. Decis. Making 35(3), 328–340 (2015) [DOI] [PubMed] [Google Scholar]
- 14.Bahrampour, M., Byrnes, J., Norman, R., Scuffham, P.A., Downes, M.: Discrete choice experiments to generate utility values for multi-attribute utility instruments: a systematic review of methods. Eur. J. Health Econ. 21, 983–992 (2020) [DOI] [PubMed] [Google Scholar]
- 15.Rabin, R., de Charro, F.: EQ-5D: a measure of health status from the EuroQol Group. Ann. Med. 33(5), 337–343 (2001). 10.3109/07853890109002087 [DOI] [PubMed] [Google Scholar]
- 16.Langworth, L., Yang, Y., Young, T., Mulhern, B., Hernández Alava, M., Mukuria, C., Rowen, D., Tosh, J., Tsuchiya, A., Evans, P., Devianee Keetharuth, A., Brazier, J.: Use of generic and condition-specific measures of health-related quality of life in NICE decision-making: a systematic review, statistical modelling and survey. Health Technol. Assess. 9(18), 1–224 (2014) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Shaw, J.W., Bennett, B., Trigg, A., DeRosa, M., Taylor, F., Kiff, C., Cocks, K.: A comparison of generic and condition-specific preference-based measures using data from nivolumab trials: EQ-5D-3L, mapping to the EQ-5D-5L, and European organisation for research and treatment of cancer quality of life utility measure-core 10 dimensions. Value in Health 24(11), 1651–1659 (2021) [DOI] [PubMed] [Google Scholar]
- 18.Tordrup, D., Mossman, J., Kanavos, P.: Responsiveness of the EQ-5D to clinical change: is the patient experience adequately represented? Int. J. Technol. Assess. Health Care 30(1), 10–19 (2014) [DOI] [PubMed] [Google Scholar]
- 19.Herdman, M., Gudex, C., Lloyd, A., Janssen, M.F., Kind, P., Parkin, D., Badia, X.: Development and preliminary testing of the new five-level version of EQ-5D (EQ-5D-5L). Qual. Life Res. 20, 1727–1736 (2011) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Feng, Y.S., Kohlmann, T., Janssen, M.F., Buchholz, I.: Psychometric properties of the EQ-5D-5L: a systematic review of the literature. Qual. Life Res. 30, 647–673 (2021) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.National institute for health and care excellence: position statement on use of of the EQ-5D-5L value set for England. https://www.nice.org.uk/about/what-we-do/our-programmes/nice-guidance/technology-appraisal-guidance/eq-5d-5l. Accessed July 2024
- 22.King, M.T., Costa, D.S.J., Aaronson, N.K., Brazier, J.E., Cella, D.F., Fayers, P.M., Viney, R.: QLU-C10D: a health state classification system for a multi-attribute utility measure based on the EORTC QLQ-C30. Qual. Life Res. 25, 625–636 (2016) [DOI] [PubMed] [Google Scholar]
- 23.Norman, R., Viney, R., Aaronson, N.K., Brazier, J.E., Cella, D., Costa, D.S.J., King, M.T.: Using a discrete choice experiment to value the QLU-C10D: feasibility and sensitivity to presentation format. Qual. Life Res. 25, 637–649 (2016) [DOI] [PubMed] [Google Scholar]
- 24.Mokkink, L.B., Terwee, C.B., Patrick, D.L., Alonso, J., Stratford, P.W., Knol, D.L., De Vet, H.C.: The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study. Qual. Life Res. 19, 539–549 (2010) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Stupp, R., Hegi, M.E., Gorlia, T., Erridge, S.C., Perry, J., Hong, Y.K., CENTRIC Study Team: Cilengitide combined with standard treatment for patients with newly diagnosed glioblastoma with methylated MGMT promoter (CENTRIC EORTC 26071–22072 study): a multicentre, randomised, open-label, phase 3 trial. Lancet Oncol 15(10), 1100–1108 (2014) [DOI] [PubMed] [Google Scholar]
- 26.Lundh, A., Gøtzsche, P.C.: Recommendations by cochrane review groups for assessment of the risk of bias in studies. BMC Med. Res. Methodol. 8, 1–9 (2008) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Viney, R., Norman, R., King, M.T., Cronin, P., Street, D.J., Knox, S., Ratcliffe, J.: Time trade-off derived EQ-5D weights for Australia. Value in Health 14(6), 928–936 (2011) [DOI] [PubMed] [Google Scholar]
- 28.King, M.T., Viney, R., Pickard, A.S., Rowen, D., Aaronson, N.K., Brazier, J.E., Cella, D., Costa, D.S.J., Fayers, P.M., Kemmler, G., McTaggart-Cowan, H., Mercieca-Bebber, R., Peacock, S., Street, D.J., Young, T.A., Norman, R.: Australian utility weights for the EORTC QLU-C10D, a multi- attribute utility instrument derived from the cancer-specific quality of life questionnaire, EORTC QLQ-C30. Pharmacoeconomics 36(2), 225–238 (2018) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Bansback, N., Tsuchiya, A., Brazier, J., Anis, A.: Canadian valuation of EQ-5D health states: preliminary value set and considerations for future valuation studies. PLoS ONE 7(2), e31115 (2012) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.McTaggart-Cowan, H., King, M.T., Norman, R., Costa, D., Pickard, A.S., Regier, D., Viney, R., Peacock, S.: The EORTC QLU-C10D: the Canadian valuation study and algorithm to derive cancer-specific utilities from the EORTC QLQ-C30. MDM Policy Pract 4(1), 2381468319842532 (2019) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Dolan, P.: Modeling valuations for EuroQol health states. Medical care 35, 1095–1108 (1997) [DOI] [PubMed] [Google Scholar]
- 32.Norman, R., Mercieca-Bebber, R., Brazier, J.E., Cella, D., Pickard, A.S., Rowen, D., Street, D.J., Viney, R., King, M.T.: UK utility weights for the EORTC QLU-C10D. Health Econ. 28(12), 1385–1401 (2019) [DOI] [PubMed] [Google Scholar]
- 33.Shaw, J.W., Johnson, J.A., Coons, S.J.: US valuation of the EQ-5D health states: development and testing of the D1 valuation model. Medical care 43, 203–220 (2005) [DOI] [PubMed] [Google Scholar]
- 34.Revicki, D., King, M.T., Viney, R., Pickard, A.S., Mercieca-Bebber, R., Shaw, J.W., Norman, R.: United States utility algorithm for the EORTC QLU-C10D, a multi-attribute utility instrument based on a cancer- specific quality of life instrument. Med. Decis. Making 41(4), 485–501 (2021) [DOI] [PubMed] [Google Scholar]
- 35.Coomans, M.B., Dirven, L., Aaronson, N., Baumert, B.G., van den Bent, M., Bottomley, A., Taphoorn, M.J.: Factors associated with health-related quality of life (HRQoL) deterioration in glioma patients during the progression-free survival period. Neuro Oncol. 24(12), 2159–2169 (2022) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.King, M.T.: The interpretation of scores from the EORTC quality of life questionnaire QLQ-C30. Qual. Life Res. 5, 555–567 (1996) [DOI] [PubMed] [Google Scholar]
- 37.Mauer, M., Stupp, R., Taphoorn, M.J.B., Coens, C., Osoba, D., Marosi, C., Bottomley, A.: The prognostic value of health-related quality-of-life data in predicting survival in glioblastoma cancer patients: results from an international randomised phase III EORTC brain tumour and radiation oncology groups, and NCIC clinical trials group study. Br. J. Cancer 97(3), 302–307 (2007) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Xiao, Q., Yang, S., Ding, G., Luo, M.: Anti-vascular endothelial growth factor in glioblastoma: a systematic review and meta-analysis. Neurol. Sci. 39(12), 2021–2031 (2018) [DOI] [PubMed] [Google Scholar]
- 39.Taphoorn, M.J., Dirven, L., Kanner, A.A., Lavy-Shahaf, G., Weinberg, U., Taillibert, S., Stupp, R.: Influence of treatment with tumor-treating fields on health-related quality of life of patients with newly diagnosed glioblastoma: a secondary analysis of a randomized clinical trial. JAMA Oncol. 4(4), 495–504 (2018) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Incekara, F., Smits, M., Dirven, L., Bos, E.M., Balvers, R.K., Haitsma, I.K., Vincent, A.J.: Intraoperative B-mode ultrasound guided surgery and the extent of glioblastoma resection: a randomized controlled trial. Front. Oncol. 11, 649797 (2021) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Strosberg, J., Wolin, E., Chasen, B., Kulke, M., Bushnell, D., Caplin, M., NETTER-1 Study Group: Health-related quality of life in patients with progressive midgut neuroendocrine tumors treated with 177Lu-dotatate in the phase III NETTER-1 trial. J Clin Oncol 36(25), 2578 (2018) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Grønberg, B.H., Bremnes, R.M., Fløtten, O., Amundsen, T., Brunsvig, P.F., Hjelde, H.H., Sundstrøm, S.: Phase III study by the Norwegian lung cancer study group: pemetrexed plus carboplatin compared with gemcitabine plus carboplatin as first-line chemotherapy in advanced non-small-cell lung cancer. Clin. Oncol. 27(19), 3217–3224 (2009) [DOI] [PubMed] [Google Scholar]
- 43.Dirven, L., Musoro, J.Z., Coens, C., Reijneveld, J.C., Taphoorn, M.J., Boele, F.W., Bottomley, A.: Establishing anchor-based minimally important differences for the EORTC QLQ-C30 in glioma patients. Neuro Oncol. 23(8), 1327–1336 (2021) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Terwee, C.B., Bot, S.D., de Boer, M.R., van der Windt, D.A., Knol, D.L., Dekker, J., Bouter, L.M., de Vet, H.C.: Quality criteria were proposed for measurement properties of health status questionnaires. J. Clin. Epidemiol. 60(1), 34–42 (2007) [DOI] [PubMed] [Google Scholar]
- 45.Gamper, E.M., Cottone, F., Sommer, K., Norman, R., King, M., Breccia, M., Efficace, F.: The EORTC QLU-C10D was more efficient in detecting clinical known group differences in myelodysplastic syndromes than the EQ-5D-3L. J. Clin. Epidemiol. 137, 31–44 (2021) [DOI] [PubMed] [Google Scholar]
- 46.Ferguson, C.J.: An effect size primer: a guide for clinicians and researchers. Prof. Psychol. Res. Pract. 40(5), 532–538 (2016) [Google Scholar]
- 47.Cichetti, D.V.: Guidelines, criteria, and rules of thumb for evaluat- ing normed andstandardized assessment instruments in psychology. Psychol. Assess. 6, 284–290 (1994) [Google Scholar]
- 48.Sagberg, L.M., Jakola, A.S., Solheim, O.: Quality of life assessed with EQ-5D in patients undergoing glioma surgery: what is the responsiveness and minimal clinically important difference? Qual. Life Res. 23, 1427–1434 (2014) [DOI] [PubMed] [Google Scholar]
- 49.Cohen, J.: Statistical power analysis for the behavioral sciences. Academic press (2013) [Google Scholar]
- 50.Tuley, M.R., Mulrow, C.D., McMahan, C.A.: Estimating and testing an index of responsiveness and the relationship of the index to power. J. Clin. Epidemiol. 44(4–5), 417–421 (1991) [DOI] [PubMed] [Google Scholar]
- 51.VanderWeele, T.J., Mathur, M.B.: Some desirable properties of the Bonferroni correction: is the Bonferroni correction really so bad? Am. J. Epidemiol. 188(3), 617–618 (2019) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Boone, M., Roussel, M., Chauffert, B., Le Gars, D., Godefroy, O.: Prevalence and profile of cognitive impairment in adult glioma: a sensitivity analysis. J Neuro-oncol 129(1), 123 (2016) [DOI] [PubMed] [Google Scholar]
- 53.Koekkoek, J.A.F., Kerkhof, M., Dirven, L., Heimans, J.J., Reijneveld, J.C., Taphoorn, M.J.B.: Seizure outcome after radiotherapy and chemotherapy in low-grade glioma patients: a systematic review. Neuro Oncol. 17(11), 924–934 (2015) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Bulamu, N.B., Vissapragada, R., Chen, G., Ratcliffe, J., Mudge, L.A., Smithers, B.M., Watson, D.I.: Responsiveness and convergent validity of QLU-C10D and EQ-5D-3L in assessing short-term quality of life following esophagectomy. Health Qual. Life Outcomes 19(1), 1–11 (2021) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Pilz, M. J., Seyringer, S., Al-Naesan, I., King, M. T., Bottomley, A., Norman, R., & Gamper, E. M. (2024). Cancer-specific health utilities: evaluation of core measurement properties of the EORTC QLU-C10D in lung cancer patients—data from four multicentre LUX-lung trials, applying six country value set. PharmacoEconomics-Open 8, 627–640. 10.1007/s41669-024-00484-9 [DOI] [PMC free article] [PubMed]
- 56.Tsuchiya, A., Bansback, N., Hole, A.R., Mulhern, B.: Manipulating the 5 dimensions of the EuroQol instrument: the effects on self-reporting actual health and valuing hypothetical health states. Med. Decis. Making 39(4), 380–392 (2019) [DOI] [PubMed] [Google Scholar]
- 57.Shaw, J.W., Bennett, B., Trigg, A., DeRosa, M., Taylor, F., Kiff, C., Cocks, K.: A comparison of generic and condition-specific preference-based measures using data from nivolumab trials: EQ-5D-3L, mapping to the EQ-5D-5L, and European organisation for research and treatment of cancer quality of life utility measure-core 10 dimensions. Value in Health 24(11), 1651–1659 (2021) [DOI] [PubMed] [Google Scholar]
- 58.Pilz, M. J., Seyringer, S., Hallsson, L. R., Bottomley, A., Jansen, F., King, M. T., & Gamper, E. M. (2024). The EORTC QLU-C10D is a valid cancer-specific preference-based measure for cost-utility and health technology assessment in the Netherlands. European J Health Econ, 1–17. [DOI] [PMC free article] [PubMed]
- 59.Seyringer, S., Pilz, M. J., Al-Naesan, I., King, M. T., Bottomley, A., Norman, R., Schlosser, L., Hell, T., Gamper, E. M. (2023). Clinical validity of the EORTC QLU-C10D in lung cancer: retrospective analysis of four lung cancer trials using six different country value set. On behalf of the EORTC quality of life group. submitted
- 60.Kennedy-Martin, M., Slaap, B., Herdman, M., van Reenen, M., Kennedy-Martin, T., Greiner, W., Boye, K.S.: Which multi-attribute utility instruments are recommended for use in cost-utility analysis? A review of national health technology assessment (HTA) guidelines. Eur. J. Health Econ. 21, 1245–1257 (2020) [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data used for this analyses is owned by Merck and the EORTC. It cannot be shared by the authors of this publication. An official request though can be made with the EORTC. See EORTC Policy on Data Sharing: https://www.eortc.org/app/uploads/2023/06/L-01-POL-01.pdf




